How we score apps

Our testing rubric, our accuracy protocol, and the statistics behind every number we publish.

The principle: measure before you recommend

Most app reviews are written from a few hours of clicking around. We don't publish a verdict until we've used an app for at least four weeks of regular daily use, and — where an app makes a measurable claim — until we've tested that claim against a reference. Every quantitative figure on this site links back to a re-runnable dataset, and no point estimate is published without a confidence interval.

The scoring rubric

We score on a continuous 0–10 scale with one decimal of precision, weighting six dimensions:

The scale is anchored: 9.0+ exceptional and category-defining; 8.0–8.9 strong recommendation (Editor's Picks live here); 7.0–7.9 good for the right person; 6.0–6.9 okay, better options usually exist; 5.0–5.9 mediocre; below 5.0 avoid. We do not award a 10 — no app we have tested is perfect.

The accuracy protocol (calorie & nutrition apps)

For calorie-tracking apps, "accuracy" is the single most consequential and most-misreported number in the category. Our protocol is built to be reproducible:

Where an app has no published, independently validated accuracy figure, we say so explicitly and decline to rank it above apps that do. An unvalidated vendor claim is not evidence.

Sustained-use testing

Beyond accuracy, each app is used for a minimum of four weeks of daily logging by at least one reviewer, with attention to adherence drop-off, onboarding friction, and how the app behaves once the novelty fades. For apps whose behaviour depends on an adaptive model (TDEE estimation, AI coaching), we note the calibration window before the model is meaningfully personalised.

What we don't claim to do

We are not a metabolic ward. Our reference meals are weighed to the gram and anchored to USDA data, but we do not measure individual metabolic response, and we don't run doubly-labelled-water energy-expenditure studies. Where our protocol is the limiting factor, we say so in the review. The strength of our work is reproducibility and disclosure, not access to a clinical laboratory.

Questions about the protocol or the dataset? Write to editorial@independent-app-reviews.org.