Scientists still can’t agree on how to measure body wear
The idea is straightforward: a body that has endured years of stress is more worn down than one that hasn’t.
The concept is called allostatic load, a term for the cumulative physiological wear and tear that builds up when the body is repeatedly or chronically stressed. The premise is biologically sound. Sustained activation of the stress response affects the cardiovascular system, immune function, hormonal balance, and more. But the researchers point out a persistent problem: the field has produced dozens of different ways to measure allostatic load, and they do not agree with each other.
Some scores rely on blood pressure, cortisol, and inflammatory markers. Others incorporate sleep quality and heart rate variability. Still others focus on hormonal profiles or metabolic indicators. Each captures something real. But the scores correlate poorly across studies, making direct comparisons almost impossible and pooled analyses unreliable.
The measurement is the definition
This debate might seem like a technicality, but it reflects something fundamental. The same problem runs through biological aging research more broadly. Epigenetic clocks, protein profiles, metabolites, and physiological measurements each produce different estimates of biological age for the same individual. Allostatic load is a microcosm of that challenge. Without a standardized definition, it is unclear whether different studies are even measuring the same underlying phenomenon.
Why the concept still matters
Despite the lack of standardization, the concept is not without value. Studies consistently find that higher allostatic load scores, regardless of which version is used, correlate with worse health outcomes in later life. That suggests something real is being captured, even if imperfectly. For longevity research, the stakes are high. Evaluating whether an intervention actually reduces physiological wear requires reproducible, well-defined measures. Without them, it remains difficult to know whether a treatment is truly effective or merely moving numbers on an inconsistent scale.