The Proving Ground: How Movement Rebels Benchmarks Your Fitness
Most fitness apps tell you what you did. The Proving Ground tells you where you stand. It is a set of physical benchmarks, from a plank hold to a broad jump to a 2k row, that turn "am I getting fitter?" from a feeling into a number you can watch move.
The whole thing runs on one principle: you compete against your past self first, and the population norms second. A beginner and a national-level lifter can both use it and both get an honest read, because the question is never "are you elite," it is "are you better than you were, and how do you rank for your age and sex." The coach reads these marks too, so a stale grip test or a jump that dropped shows up in your plan without you saying a word.
This guide walks the whole system: the tests, the scores, and every way to run them.
The 26 tests, across five pillars
The Proving Ground holds 26 benchmark tests grouped into five movement pillars: Core, Upper Body, Lower Body, Posterior Chain, and Power. Some are holds measured in seconds (Plank Hold, Wall Sit, Dead Hang). Some are max-rep tests (Max Push-Ups, Max Sit-Ups, Air Squats). Some are power marks in distance (Standing Broad Jump, Vertical Jump). A separate set of Engine Tests covers your aerobic side: row, bike, run, beep test, Cooper, VO2 max estimates.
You do not have to do all of them. You map the ones that matter to you, and the board fills in as you go. A card reads "12 of 27 mapped" until you close the gaps. Every test carries a tier for your age and sex, from Untrained up to Elite, pulled from published norms rather than a number we invented.
Standing: your one honest number
Standing is the single figure at the top of the board, scored out of 100. It is your staleness-decayed average percentile across every tier-ranked test you have logged. Test well and it rises. Let marks go stale and it drops, even if you have not gotten weaker.
That last part is deliberate. A Standing that only ever went up would be a vanity metric. This one can fall when your data ages, which is the honesty working: it is telling you the picture is out of date, not that you have declined. Retest the stale mark and it climbs back. The board even names your weakest pillar and the single retest that would move the number most, so the highest-impact move is never a mystery.
Fitness Age
Fitness Age answers a blunt question: what age does the average athlete with your marks have? It is computed from the same published, age-banded norms as the tiers. If your marks match a typical 34-year-old, your Fitness Age reads 34, whatever your birthday says.
It needs at least five tier-ranked tests across three pillars before it shows anything. Below that it stays locked rather than guessing at a number it cannot support. Once it is live, it moves as your marks move, and it is one of the cleaner ways to see a training block pay off.
Freshness: why a mark goes stale
Every mark ages at the speed the underlying quality detrains. Grip and engine fade fast, so a dead hang or a row goes FADING around 10 days and STALE around 28. Core holds last longer, roughly 21 and 56 days. Strength and power hold longest, about 35 and 84. A stale mark still counts. It only pulls your Standing down until you refresh it, because a three-month-old sprint time is not a claim about today.
This is what keeps the board honest over months, not only the week you tested hard.
Combine Day: the adaptive testing session
Combine Day is the guided way to test several things in one sitting. It does not run a fixed list. It queues the tests worth doing right now: never-tested first, then stale ones, in a warm-up-safe order that puts balance and core work before max jumps so you are not testing power cold. If everything is fresh, it runs the full battery. Mini is the first four tests, Medium the first eight, Full is everything due.
Each station has its own rest timer and its own scoring, and the debrief at the end tallies your PRs and updates the board in one pass. If you get interrupted, it resumes where you left off.
Shuffle, Today's Rep, and Routines: the low-friction ways in
Not every session is a full Combine. Three lighter doors exist.
Shuffle deals you one random test worth doing, for when you have five spare minutes and want a quick benchmark. Today's Rep picks a single move scaled to your own baseline and sticks with it for the day, so you always have one honest thing to try. Routines are your own named mini-batteries: pick one to eight tests, name it "Morning routine," and run it through the same Combine engine whenever you want, with the same rest timers and PR gates.
Duels: one against one
A Duel is a private head-to-head on a single test. Pick the test, send the invite link to one training partner, and you both post one mark inside a window. Neither number is visible until both are in, so nobody sandbags off the other's score. Losing writes nothing anywhere. Your mark still lands on your own board as a normal attempt, so even a loss moves your own numbers.
Challenges: groups, and more than one exercise
A Challenge is a Duel with more people, or more tests, or both. You pick a window and race a group, and there are three shapes.
Improve the most scores each person against their own best. That is what makes it fair across levels: a beginner who adds nine seconds to a plank can out-improve a veteran who adds two, so the newcomer can win. Pool it together drops the competition and adds everyone's totals toward one shared goal, so a training group can chase "1,000 push-ups this week" as a team. Solo goal is you against a number you set.
Challenges can cover up to eight exercises at once. On Improve the most you can mix units freely, because it counts how many of the chosen tests you bettered rather than summing them. A plank measured in seconds and push-ups measured in reps each count as one improvement, so no big-number test drowns out the rest. Pool it together and Solo goal add the numbers up instead, so those two keep every test in the same unit. A live scoreboard updates as people log, so you watch it move without a stream of notifications.
Seasons: a before and an after
A Season measures how much you improve over a block of training. It never touches your PRs. It only marks two points in time and shows you the gap.
The flow is four steps. You arm a season. Your next Combine Day becomes your starting line, your before. You train for a few weeks. Then you close the season with a final Combine, and the debrief shows exactly what moved: your Standing, your Fitness Age, and your biggest per-test gains, open value to close value. Every Combine you run in between is a checkpoint. The delta is always measured from your starting line to now, derived from your real logged attempts rather than a stored guess.
The Coach Slot: your plan reads the board
The Proving Ground does not sit off to the side. When a test has gone stale and today's planned session matches its movement pattern, the coach stages at most one retest on that session's card. Accept it and log the result, or pass. Passing is free, carries no penalty, and puts that test to sleep for two weeks.
This is the cross-over that makes the benchmarks worth keeping fresh. The same coach that reads your Garmin recovery and your Rebel Fuel intake also reads your Standing, so a dropping jump or a stale grip mark can nudge the next week of programming. One retest, staged only when it fits what you were already going to train.
Why it stays honest
Two gates keep the board from lying to you. A strictly better mark is always banked as your new best, quietly, so you never lose a real result. But the PR banner only fires when you clear the noise floor for that test, so a one-second wobble on a plank never gets celebrated as progress you did not make. Beat a mark clearly and it counts. Nudge it inside the margin of error and it banks without the fanfare.
That, plus the staleness decay, is the point of the whole system. The number goes up when you improve for real and it tells you the truth when your data is old. Map a few tests, run a Combine, and start watching where you stand.
One app instead of five.
Strength, endurance, recovery, fueling, planning, and your AI coach. All under a 7-day free trial. No card.
Start 7-day trialMasters Athlete Training After 40: What Holds You Back
The research is clear: most VO2max decline after 40 comes from training volume reduction, not aging. Here's what actually changes,
Adaptive Training Plans: How the Plan Adjusts to You
Adaptive training is real science with honest limitations. What peer-reviewed research says about autoregulation and HRV-guided pl
Hybrid Athlete Training: Concurrent Done Right
The interference effect is real but smaller than you think, and the rules for avoiding it are simpler than the hype suggests. Here
Returning to Training After Injury: Slower Than You Think, Faster Than You Fear
The 10% rule is not science. Tendons recover three times slower than muscle. Here is what the evidence actually says about coming