KPI Tree

Metric Definition

Reply speed at the percentile level

Average response time = Sum of (Response timestamp - Request timestamp) / Number of requests
Response timestampThe moment the first meaningful reply is sent
Request timestampThe moment the request arrived
Number of requestsTotal requests responded to in the period

Track from

Metric GlossaryOperations Metrics

Response time analysis

Response time analysis is the practice of measuring how long it takes to respond to requests, then studying the full distribution rather than a single average. It shows not just the typical wait but how bad the slowest cases get. Read well, it turns a blunt average into a clear picture of where service breaks down.

7 min read

Generate AI summary

What is response time analysis?

Response time analysis is the practice of measuring how long it takes to respond to incoming requests, then examining the full distribution of those times rather than a single average. A request that arrives at 09:00 and gets a first reply at 09:30 has a response time of 30 minutes. Across hundreds of requests you can compute an average, but the average alone hides the cases that hurt most.

The analysis part matters because response times are rarely evenly spread. A team can show a comfortable 20-minute average while one in twenty requests waits over two hours. Looking at percentiles, such as the median and the 90th and 95th, tells you what the typical customer experiences and what the unlucky tail experiences. That tail is usually where complaints, escalations and churn come from.

Average hides the tail

A single average response time can look healthy while a long tail of slow responses goes unseen. Always pair the average with the median and a high percentile such as the 95th. If the 95th percentile is far above the median, a meaningful minority of requests is waiting far too long even when the headline number looks fine.

How to calculate response time analysis

Start with the response time of each request, which is the response timestamp minus the request timestamp. The average is the sum of those individual times divided by the number of requests, but the analysis only becomes useful once you also compute the median and the upper percentiles. The median is the middle value when all response times are sorted, and the 95th percentile is the value below which 95 percent of responses fall.

Work an example. Ten requests have response times in minutes of 5, 6, 7, 8, 9, 10, 12, 15, 40 and 120. The average is about 23 minutes, dragged up by the two slow cases. The median is 9.5 minutes, which is what most requests actually felt like. The 95th percentile sits around 120 minutes. Three numbers tell a far truer story than the average on its own.

  1. 1

    Capture both timestamps

    Record when each request arrived and when the first meaningful response was sent.

  2. 2

    Compute per-request response time

    Subtract the request time from the response time for every request in the period.

  3. 3

    Calculate the average

    Sum all response times and divide by the number of requests for the headline figure.

  4. 4

    Add median and percentiles

    Sort the times and read off the median, 90th and 95th percentiles to see the distribution.

Response time analysis in a metric tree

Knowing the 95th percentile crept up tells you there is a problem but not where it sits. A metric tree decomposes response time into the drivers that move it, so a slow tail traces back to a staffing gap, a routing delay or a spike in volume rather than a generic service issue. Response time is shaped by how fast requests arrive, how much capacity is on hand to absorb them, and how cleanly each request reaches the right person.

The tree below splits response time into incoming demand, available capacity and routing. Reading it, a slow tail at 17:00 might be a volume peak meeting a thin late shift, while slow mornings might be misrouted tickets sitting in the wrong queue. Each branch points to a different owner and a different lever.

Metric tree insight

When the slow tail grows, the tree shows whether the cause is a volume spike, thin coverage or bad routing. In KPI Tree each branch carries a RACI owner, so the person accountable for shift coverage and the person accountable for queue routing each see their node move. When the 95th percentile breaches its threshold, the alert goes to the owner of the branch driving it, not to a shared inbox.

Response time analysis benchmarks

Expectations vary sharply by channel. Live chat and phone are judged in seconds and minutes, email in hours, and social channels somewhere between. Set targets per channel and track the median and 95th percentile against them, because a single blended target will always flatter your fastest channel and punish your slowest. The ranges below are common first-response expectations rather than guarantees.

ChannelGood medianWatch the tail above
Live chatUnder 1 minute5 minutes
PhoneUnder 30 seconds3 minutes
EmailUnder 4 hours24 hours
Social mediaUnder 1 hour6 hours

How to improve response time analysis

Improving response time is mostly about matching capacity to demand and clearing the friction that lets requests sit. Focus first on the tail, because shaving the slowest cases improves customer experience far more than nudging an already-fast median.

Staff to demand peaks

Align shift patterns to the hourly volume curve so coverage is thickest when requests surge.

Fix routing

Route requests to the right queue first time to cut the reassignments that inflate the tail.

Automate first replies

Use templates and triage automation to acknowledge requests quickly while a person picks up the detail.

Watch percentiles, not averages

Track the 95th percentile so a growing tail is caught before it shows up in complaints.

Common mistakes when tracking response time analysis

  1. 1

    Reporting only the average

    The average masks a slow tail, so a healthy headline can sit on top of a poor experience for many.

  2. 2

    One target across all channels

    A blended target flatters chat and unfairly penalises email, hiding real problems in both.

  3. 3

    Measuring an auto-reply as a response

    Counting an automatic acknowledgement as the response makes times look fast while customers still wait for help.

  4. 4

    Ignoring business hours

    Including overnight gaps in response time distorts the figure unless you measure within agreed service hours.

Related metrics

First response time

Customer Support Metrics
IntercomPylon

Metric Definition

FRT = Total First Response Times / Total Tickets With a First Response

First response time measures the elapsed time between a customer creating a support ticket and receiving the first substantive response from a human agent. It is the metric that shapes the customer's initial impression of the support experience and sets the tone for the entire interaction.

View metric

Average resolution time

Customer Support Metrics
SalesforceIntercomPylon

Metric Definition

Average Resolution Time = Total Resolution Time Across All Tickets / Total Tickets Resolved

Average resolution time measures the mean elapsed time from when a support ticket is created to when it is fully resolved and closed. It captures the end-to-end customer experience of getting an issue fixed, encompassing wait times, agent work time, escalations, and any back-and-forth exchanges required to reach a solution.

View metric

Escalation rate

Customer Support Metrics
Pylon

Metric Definition

Escalation Rate = (Escalated Tickets / Total Tickets Handled) x 100

Escalation rate measures the percentage of support tickets that are transferred from one tier or team to a higher tier or specialist group for resolution. It reflects the gap between the issues customers raise and the ability of frontline agents to resolve them, making it a key indicator of agent readiness, process maturity, and product complexity.

View metric

Ticket volume

Customer Support Metrics

Metric Definition

Ticket Volume = Total New Tickets Created in Period

Ticket volume is the total number of new support tickets created within a defined period. It is the fundamental demand metric for support operations, determining staffing requirements, budget allocation, and the urgency of self-service and product quality investments.

View metric

Metric decomposition

Metric Definition

Decomposing response time into its percentile and stage components shows you exactly where reply speed is slipping and which lever to pull.

View metric

Metric trees for operations teams

Metric Definition

Response time analysis sits alongside the other throughput and service-level measures operations teams track, so this guide shows how it fits into a wider operations metric tree.

View metric

Build response time analysis as a metric tree

Model response time as a tree of demand, capacity and routing drivers, with the median and 95th percentile in view. Put a RACI owner on every branch and let KPI Tree push the alert to the accountable owner the moment the tail breaches its threshold.

Experience That Matters

Built by a team that's been in your shoes

Our team brings deep experience from leading Data, Growth and People teams at some of the fastest growing scaleups in Europe through to IPO and beyond. We've faced the same challenges you're facing now.

Checkout.com
Planet
UK Government
Travelex
BT
Sainsbury's
Goldman Sachs
Dojo
Redpin
Farfetch
Just Eat for Business