Metric Definition
Support capacity
Track from
Agent utilisation rate
Agent utilisation rate measures the percentage of an agent's available working time spent on active customer conversations. It captures how effectively support capacity is being used, balancing the need for productivity against the risk of burnout and declining service quality. The metric helps support leaders right-size their teams, plan shifts, and understand whether staffing levels match conversation demand.
8 min read
What is agent utilisation rate?
Agent utilisation rate is the proportion of an agent's working time spent directly handling customer conversations. If an agent is available for eight hours and spends six hours actively working on conversations (including research, composing responses, and post-conversation wrap-up), their utilisation rate is 75%.
The metric excludes scheduled breaks, training, team meetings, and other non-conversation activities. It specifically measures how much of the time designated for customer interaction is actually spent on customer interaction. The remaining time is idle time: periods where the agent is available but no conversations are assigned.
Agent utilisation rate is a double-edged metric. Too low and the team has excess capacity that could be redeployed or reduced. Too high and agents are overloaded, which leads to longer response times, lower quality interactions, higher error rates, and eventually burnout and turnover. The relationship between utilisation and service quality is non-linear: performance degrades gradually at moderate utilisation levels but collapses rapidly as utilisation approaches 100%.
This non-linear relationship is explained by queueing theory. When utilisation is high, any small spike in demand creates disproportionately long queues because there is no buffer capacity to absorb the spike. At 70% utilisation, a 10% demand increase is easily absorbed. At 95% utilisation, the same increase causes queue times to skyrocket. Support leaders must manage utilisation within a range that provides sufficient buffer for demand variability.
Agent utilisation is not a metric to maximise. Pushing utilisation above 85% to 90% consistently will degrade service quality, increase agent turnover, and ultimately cost more than the capacity it saves. The optimal range depends on channel type and conversation complexity, but most teams target 70% to 85%.
Decomposing agent utilisation rate with a metric tree
A metric tree breaks agent utilisation into its components, revealing what agents spend their time on and where capacity is consumed or wasted.
The tree reveals that utilisation is not simply about handling more conversations. If agents spend 25% of their active time on research because knowledge base articles are poor, fixing the knowledge base would reduce per-conversation handle time and allow the same utilisation rate to cover more conversations.
Connecting utilisation to average handle time, conversations per teammate, and first response time creates a complete capacity picture. When utilisation rises, handle time tends to stay constant but first response time degrades because the queue grows. The tree makes this relationship visible and helps leaders identify the utilisation threshold where service quality begins to deteriorate.
Benchmarks by channel and model
| Support model | Target utilisation | Key considerations |
|---|---|---|
| Phone support | 70% to 80% | Agents handle one conversation at a time. Utilisation above 80% creates long hold times and caller frustration. |
| Live chat (1:1) | 65% to 80% | Similar to phone but with some async buffering. Agents can handle brief pauses between messages. |
| Live chat (concurrent) | 75% to 85% | Agents handling 2 to 4 simultaneous chats can achieve higher utilisation because idle moments in one chat are filled by another. |
| Async messaging | 80% to 90% | Asynchronous nature allows agents to manage many open conversations. Higher utilisation is sustainable because response time expectations are longer. |
| Email and ticket | 80% to 90% | Fully asynchronous. Agents work through a queue at their own pace. Higher utilisation is achievable without quality degradation. |
The acceptable range increases as the channel becomes more asynchronous. Synchronous channels (phone, live chat) require buffer capacity because customers expect immediate responses. Asynchronous channels (messaging, email) tolerate higher utilisation because response time expectations are measured in hours rather than seconds.
Strategies to optimise agent utilisation
- 1
Align staffing to demand patterns
Support demand is rarely constant throughout the day or week. Analyse historical conversation volume by hour and day to identify peaks and troughs. Schedule shifts to match demand patterns, using part-time staff or flexible scheduling to cover peaks without overstaffing during quiet periods.
- 2
Enable concurrent conversation handling
For chat and messaging channels, allow agents to handle multiple conversations simultaneously. This fills the idle time within individual conversations (waiting for customer responses, loading account data) with productive work on other conversations. Increase concurrency gradually and monitor quality metrics to find the right balance.
- 3
Reduce handle time through better tooling
Every minute saved per conversation frees capacity that can be applied to additional conversations or used as buffer. Invest in macros and templates for common responses, customer context panels that eliminate research time, and diagnostic tools that speed up troubleshooting.
- 4
Deflect simple queries to self-service
Simple, repetitive queries consume agent time that could be spent on complex issues. Expand self-service coverage through better knowledge base content, chatbots for common questions, and automated workflows for routine requests like password resets or order status checks.
- 5
Monitor the utilisation-quality trade-off
Track utilisation alongside service quality metrics (CSAT, resolution rate, first response time) to identify the threshold where quality begins to degrade. Set a utilisation ceiling slightly below that threshold and treat it as a hiring trigger rather than a target to exceed.
Tracking agent utilisation rate with KPI Tree
KPI Tree lets you model agent utilisation alongside the service quality metrics it affects, creating a balanced view of support team performance. The tree can segment utilisation by agent, team, shift, and channel to identify where capacity is tight and where there is room to absorb more volume.
Linking utilisation to first response time, conversation resolution rate, and customer satisfaction score ensures that capacity optimisation does not come at the cost of customer experience. When utilisation crosses the quality threshold, the tree makes the trade-off visible and provides the data needed to justify hiring decisions.
Support operations leaders can set utilisation targets for each channel and team, with automatic alerts when the metric moves outside the optimal range. Over time, the tree builds a historical record that connects staffing decisions to their effects on both efficiency and quality.
Related metrics
Average handle time
Customer Support MetricsMetric Definition
AHT = (Total Talk Time + Total Hold Time + Total After-Call Work) / Total Interactions Handled
Average handle time measures the average total duration of a single customer support interaction, including talk time, hold time, and after-call work. It is one of the most widely tracked efficiency metrics in contact centres and support operations, directly influencing staffing models, cost forecasts, and service level planning.
Conversations per teammate
Customer Support MetricsMetric Definition
Conversations per Teammate = Total Active Conversations / Number of Active Agents
Conversations per teammate measures the average number of active support conversations each agent handles during a given period. It is a core workload and capacity metric that influences response times, resolution quality, agent wellbeing, and the overall cost efficiency of a support operation.
First response time
Customer Support MetricsMetric Definition
FRT = Total First Response Times / Total Tickets With a First Response
First response time measures the elapsed time between a customer creating a support ticket and receiving the first substantive response from a human agent. It is the metric that shapes the customer's initial impression of the support experience and sets the tone for the entire interaction.
Conversation resolution rate
Support effectiveness
Customer Support MetricsMetric Definition
Conversation Resolution Rate = (Resolved Conversations / Total Conversations) x 100
Conversation resolution rate measures the percentage of customer support conversations that are resolved, meaning the customer's issue is fully addressed and the conversation is closed. It captures the effectiveness of the support team at actually solving problems rather than simply responding to them. A high resolution rate indicates that the team is closing the loop on customer issues, while a low rate suggests that conversations are going unanswered, being abandoned, or left in limbo.
Balance utilisation and quality with KPI Tree
Build a support capacity metric tree that connects agent utilisation to service quality, staffing levels, and conversation demand. Find the optimal utilisation range and track the impact of staffing and process changes.