Metric Definition
Performance across devices
Track from
Device performance analysis
Device performance analysis is the practice of measuring how well a product loads, responds and stays stable across the range of devices, browsers and networks that real users bring. It exposes where the experience is fast for some users and broken for others, so teams stop optimising for the high-end devices they happen to test on.
8 min read
What is device performance analysis?
Device performance analysis is the practice of measuring product performance separately across the devices, browsers, operating systems and network conditions that real users have, rather than reporting one blended average. It breaks the user base into device classes, then compares load speed, responsiveness and stability across them to find where the experience degrades. The headline insight is almost always the same, which is that the average hides a wide gap between the fastest and slowest segments.
The analysis matters because a single average is reassuring and wrong. A team that tests on recent flagship phones over office wifi sees a fast product and ships with confidence, while a meaningful share of users on mid-range Android devices over mobile data experience slow loads, janky scrolling and occasional crashes. Those users churn quietly and rarely file a ticket. Device performance analysis is how that silent gap becomes visible.
The goal is not to make every device identical, which is impossible across years of hardware. The goal is to set a floor, so the worst-served realistic segment still gets a usable experience, and to catch regressions that hit one platform before they spread. Done as an ongoing analysis rather than a one-off audit, it keeps performance honest as the product grows heavier over time.
Always segment performance by real device class, not a blended average. A median load time can look healthy while the slowest quarter of users, often on older Android hardware and mobile networks, sit well past the point where they abandon. The average is the metric that lets a broken experience hide in plain sight.
How to calculate device performance analysis
Device performance analysis combines a few core experience signals into a per-device-class score, then compares those scores across segments. The reliable pattern is to capture the inputs below from real users in the field, normalise each to a common scale, weight them, and report the score per device class rather than as one number. Field data from real sessions matters more than lab tests, because lab conditions rarely match the worst device on the worst network.
- 1
Load speed
How long the product takes to load and become useful, measured with signals such as largest contentful paint and time to interactive. This is the first impression and the strongest predictor of whether a user on a slow device stays at all.
- 2
Responsiveness
How quickly the interface reacts to taps, clicks and scrolls once loaded, measured through interaction latency. Responsiveness degrades sharply on weaker processors even when the initial load looked acceptable.
- 3
Stability
The inverse of crash rate, error rate and layout shift on each device class. Stability is where low-end devices fail hardest, running out of memory or breaking on browser features the team never tested.
- 4
Device class segmentation
The dimension that makes the analysis meaningful, grouping sessions by hardware tier, operating system, browser and network type. Without segmentation the other three signals collapse back into a misleading average.
Combine the first three into a weighted score, then report it once per device class so the spread is visible. The value is not the composite number alone, it is the comparison: seeing that flagship users score in the strong range while a real segment of mid-range users sits in the failing range tells you exactly where to invest. Tracking the gap between best and worst segment over time is often more actionable than the average itself.
Device performance analysis in a metric tree
A device performance score tells you a segment is slow, but not what to change. A metric tree decomposes the score into its experience components, then breaks each component into the technical causes a team can fix. This is what turns a red bar on a chart into a prioritised list of work.
The first level splits the score into load speed, responsiveness and stability. Each then decomposes further. Load speed traces back to payload size, server response time, and how much rendering blocks on the main thread. Responsiveness traces to JavaScript execution cost and long tasks that block input. Stability traces to memory pressure, unhandled errors, and browser compatibility gaps. Every leaf is something an engineer can profile and improve.
This structure lets you target the worst device class precisely. If mid-range Android scores poorly on load speed because of payload size, the fix is bundle and image optimisation, not chasing a responsiveness problem that does not exist for that segment. The tree keeps the work aimed at the actual bottleneck for the actual users who are suffering.
Metric tree insight
Payload size is the lever that disproportionately punishes low-end devices and the one most under a teams control. A heavy JavaScript bundle a flagship parses in a moment can take several seconds on a budget phone over mobile data. Cutting payload often lifts the worst device class far more than the average, which is exactly the segment the score should protect.
Device performance analysis benchmarks
Because performance is judged per device class, the useful benchmarks sit on the underlying experience signals, where widely used field thresholds give concrete ranges. Apply these to each segment, not to a blended average, and pay closest attention to whether the slowest realistic segment clears the acceptable bar.
| Performance signal | Poor | Needs work | Good |
|---|---|---|---|
| Largest contentful paint | Over 4 seconds | 2.5 to 4 seconds | Under 2.5 seconds |
| Interaction latency | Over 500 milliseconds | 200 to 500 milliseconds | Under 200 milliseconds |
| Crash-free session rate | Under 98 percent | 98 to 99.5 percent | Over 99.5 percent |
| Gap between best and worst device class | Very wide | Moderate | Narrow |
Read the segments, not the mean. A product can clear every threshold on average while a quarter of users on older hardware sit firmly in the poor column. A healthy device performance profile keeps even the slowest realistic segment in the needs-work range or better and shows the gap between best and worst narrowing over time rather than widening as the product grows.
How to improve device performance
Improving device performance means fixing the worst segment, not polishing the experience for devices that were already fast. The metric tree shows which component drags the slowest class down, and that is where the next change belongs. The actions below map to the components of the score.
Shrink the payload
Reduce JavaScript and image weight through code splitting, lazy loading and modern formats. Smaller payloads help every device but transform the experience on low-end hardware and slow networks, which is where load speed scores collapse.
Cut main thread work
Break up long tasks, defer non-essential scripts, and move heavy computation off the main thread. Responsiveness on weaker processors depends almost entirely on keeping the main thread free to react to input.
Harden stability on weak devices
Profile memory use, handle errors gracefully, and test against the browsers and operating systems your slowest segment actually uses. Most crashes hit a specific device class that the team simply never ran.
Test on real low-end devices
Add representative budget hardware and throttled networks to your testing and field monitoring. You cannot improve a segment you never observe, and the fast machine on a developer desk hides every problem worth fixing.
The recurring failure is that performance is everyones concern and no ones job, so the slow-device segment degrades release after release. KPI Tree models device performance as a tree with RACI ownership on each branch, so load speed has an accountable owner, stability has another, and the worst-served device class has someone watching it specifically. When a segment regresses, the change is pushed to the person accountable for that branch rather than surfacing weeks later in a support trend. The verified impact loop then checks whether a payload cut or a main-thread fix actually moved the slow segments score, so the team learns which optimisations hold up in the field.
Common mistakes when tracking device performance
- 1
Reporting a single blended average
One number across all devices hides the segments that are actually broken. The average is dragged up by fast devices and lets a poor low-end experience go unnoticed until users quietly leave.
- 2
Testing only on high-end hardware
A product profiled on a recent flagship over office wifi always looks fast. The performance problems live on mid-range and older devices over mobile networks, which are exactly the conditions developer machines never reproduce.
- 3
Relying on lab data alone
Synthetic lab tests run under controlled conditions and miss the variance of real networks, background apps and aged hardware. Field data from real sessions is the only honest measure of what users experience.
- 4
Ignoring stability in favour of speed
Load speed gets the attention while crashes and errors on specific device classes go untracked. A fast page that crashes on a common low-end browser is worse than a slightly slower one that works.
- 5
Measuring without owning the worst segment
A dashboard that shows the slow segment but assigns it to no one changes nothing. Without an accountable owner watching the worst device class, the gap simply widens as the product grows heavier.
Related metrics
Daily Active Users
DAU
Product MetricsMetric Definition
DAU = Unique Users Who Performed a Qualifying Action in a Single Day
Daily active users measures the number of unique users who engage with your product on a given day. It is the primary engagement metric for consumer and SaaS products, indicating whether your product has become a daily habit for its users.
Retention Rate
Product MetricsMetric Definition
Retention Rate = (Users Active at End of Period / Users Active at Start of Period) × 100
Retention rate measures the percentage of users or customers who continue to use your product over a given period. It is the most important growth metric because sustainable growth is impossible when users leave faster than they arrive.
Feature Adoption Rate
Product MetricsMetric Definition
Feature Adoption Rate = (Users Who Used the Feature / Total Active Users) × 100
Feature adoption rate measures the percentage of users who use a specific feature within a given period. It tells product teams whether new features are resonating with users and which existing features are underutilised, guiding investment decisions and roadmap priorities.
Conversion Rate
CVR
Marketing MetricsMetric Definition
Conversion Rate = (Number of Conversions / Total Visitors or Leads) × 100
Conversion rate measures the percentage of visitors, users, or leads who take a desired action, such as making a purchase, signing up for a trial, or submitting a form. It is the fundamental metric for evaluating the effectiveness of any acquisition funnel, landing page, or marketing campaign.
How to build a metric tree
Metric Definition
Once you are tracking device performance analysis, this guide shows you how to slot it into a metric tree so you can see what drives differences across devices.
Metric trees for product teams
Metric Definition
Device performance analysis is a product concern, and this guide shows product teams how to connect it to the wider metrics they own.
Turn device performance into an owned tree
Decompose performance into load speed, responsiveness and stability across real device classes, give each branch an accountable owner, and let KPI Tree flag a regression before the slow segment churns.