Smart Ring Health Metrics Explained (HRV, Sleep, SpO2)
What HRV, RMSSD, sleep stages, skin temperature and SpO2 actually measure on a smart ring — the peer-reviewed accuracy evidence and what to ignore.
A smart ring's marketing leans heavily on health metrics — HRV, sleep stages, readiness scores, blood-oxygen, skin temperature — without explaining what any of them actually mean or how reliable the numbers are. This guide walks through the six categories of data a modern smart ring produces in 2026, cites the peer-reviewed accuracy evidence for each, and ends with what you should and shouldn't act on. The point is to make the daily dashboard intelligible without medical hand-waving in either direction.
If you are still choosing between rings, our smart ring buying guide is the better starting point. If you are deciding between a ring and a watch entirely, the smart ring vs smartwatch comparison covers the form-factor choice.
1. HRV — what the number actually measures
Heart rate variability — HRV — is the variation in time between successive heartbeats, measured in milliseconds. Your heart does not beat at a fixed metronomic interval; the gap between beats fluctuates moment to moment, driven by the autonomic nervous system. A 2017 PubMed Central review describes HRV as indexing "neurocardiac function...generated by heart-brain interactions and dynamic non-linear autonomic nervous system (ANS) processes." In plain English: when your parasympathetic ("rest and digest") system is dominant — you are well-rested, hydrated, calm — your HRV is high. When sympathetic ("fight or flight") tone is dominant — you are stressed, ill, sleep-deprived, or training hard — HRV drops.
This is the metric that drives almost every "readiness" or "recovery" score on a smart ring. The interpretation is intuitive: HRV down sharply versus your trailing average means the body has not recovered; take the workout intensity down a notch. HRV up versus average usually means you are well-recovered. But the number itself is calculated in two distinct ways, and they are not interchangeable.
RMSSD vs SDNN — the two HRV metrics
RMSSD — root mean square of successive differences — measures the variability between adjacent heartbeats. The 2017 review notes that "the RMSSD is identical to the non-linear metric SD1, which reflects short-term HRV." RMSSD is dominated by parasympathetic activity and is what Oura, Ultrahuman, RingConn, Whoop, Fitbit, Samsung and Google Pixel all report. It is well-suited to brief overnight measurement windows, which is what a ring actually collects.
SDNN — standard deviation of normal-to-normal intervals — measures variability over a longer recording window. The same review describes it as "the gold standard for medical stratification of cardiac risk when recorded over a 24 h period." SDNN is influenced by both sympathetic and parasympathetic activity. The Apple Watch reports SDNN, not RMSSD, which means an Apple Watch HRV of 35 ms and an Oura HRV of 35 ms are not directly comparable.
What HRV numbers actually mean
The same review gives broad medical reference ranges for 24-hour SDNN: below 50 ms is classified as unhealthy, 50-100 ms compromised, and above 100 ms healthy. These are clinical thresholds derived from long-window recordings — they are not what your ring is reporting overnight, and you should not pattern-match a ring's 28 ms RMSSD reading against them. The right way to read HRV from a ring is to track your own rolling average across weeks, watch for sharp drops against that personal baseline, and ignore the absolute number entirely.
HRV varies enormously between individuals — a healthy 25-year-old endurance athlete may report RMSSD in the 60-80 ms range overnight, a healthy 50-year-old in the 25-40 ms range, and both are perfectly fine. The same ring will give you wildly different numbers if you swap fingers, change the fit, or wear it during a workout. Trust the trend, ignore comparisons with anyone else.
2. Resting heart rate and ambient heart rate
Resting heart rate is the simplest health signal a ring measures and arguably the most reliable. The figure to look at is the lowest sustained nightly heart rate — typically reported as "lowest resting HR" or "RHR" depending on the brand. Adults healthy at rest sit somewhere between 50-80 beats per minute; trained endurance athletes can dip into the 40s.
Where this matters as a wellness signal is in trend deviation. A 5-7 bpm sustained increase in resting heart rate over a few nights is a robust early indicator of one of three things: an oncoming illness, the start of a menstrual cycle's luteal phase, or accumulated training fatigue. None of these are diagnostic — but the prompt to dig deeper is real, and the signal is independent of what the wearer is consciously feeling.
Ring measurement of resting heart rate is unusually accurate because the finger is a stable PPG location with minimal motion at night. The signal is essentially identical to what a chest strap would record in the same window, and considerably more reliable than wrist-based wearables that drift around during sleep.
3. Sleep stages — what the rings are doing and how accurate they are
The sleep-tracking pitch is the closest thing the smart-ring category has to a clinically-validated headline feature. The published evidence is unusually robust here.
A 2024 head-to-head study published in PubMed Central compared the Oura Ring, Apple Watch and Fitbit against polysomnography — the EEG-and-electrodes clinical gold standard for sleep staging. The Oura Ring "exhibited substantial agreement in the determination of specific sleep stages (Kappa > 0.61)" and was "not significantly different from PSG for seven of the eight measures" the study examined (total sleep time, sleep onset, wake after sleep onset, and the per-stage minutes of light, deep and REM sleep). The Apple Watch, in the same study, underestimated deep sleep by 43 minutes per night and overestimated light sleep by 45 minutes.
A separate Sensors study conducted at Brigham and Women's Hospital across 35 participants reported four-stage sleep-classification agreement of 79% for Oura Gen 3, 74% for Fitbit Sense 2 and 69% for Apple Watch Series 8 against PSG. Independent validation work by The Quantified Scientist — Rob ter Horst's side-by-side PSG and ECG-chest-strap testing, including a 15-device study with the University of Salzburg — has consistently put Oura among the most accurate consumer wearables for sleep staging, with Whoop's newer algorithm performing nearly as well on REM and light sleep.
What the sleep-stage numbers actually tell you
For a healthy adult, a normal sleep architecture is roughly 50-60% light sleep, 15-25% REM, and 13-23% deep (slow-wave) sleep, with brief wake intervals totalling around 10% of the night. The ring's per-stage estimates are accurate enough in aggregate to track your own trends across weeks, and accurate enough versus PSG to make population-scale conclusions ("Oura users average 1h 30m REM") meaningful. They are not accurate enough on any single night to be the basis of a medical decision.
The practical use cases for ring sleep data are:
- Spotting consistent sleep-debt patterns — chronically under 7 hours, or REM persistently under 15% of total — that warrant a behaviour change.
- Identifying alcohol-impact patterns — a glass of wine within three hours of bed typically shows up as elevated resting HR and reduced HRV the next morning, with REM compressed into the late hours.
- Detecting potential sleep-disordered breathing — sustained low SpO2 plus elevated resting HR plus fragmented sleep is a pattern that should send you to a GP for a formal sleep study, not a self-diagnosis.
What ring sleep data does not do is diagnose sleep disorders, replace polysomnography, or distinguish between primary and secondary insomnia. The honest framing is: it is a high-quality consumer-grade signal that surfaces patterns you would not otherwise see.
4. Skin temperature variance — cycle tracking, illness, recovery
Every premium smart ring measures continuous skin temperature overnight and reports the deviation from your personal baseline rather than an absolute Celsius value. This matters because the absolute reading from a finger-worn sensor is meaningfully cooler than core body temperature (typically 3-5°C cooler at rest) and varies with ambient conditions. The trend deviation, on the other hand, is robust enough to be useful for three distinct signals.
Menstrual cycle phase tracking
This is the strongest validated use case. A 2024 PubMed Central real-world study found that body temperature "increases 0.3°C to 0.7°C after ovulation, in the presence of progesterone" — the standard luteal-phase signature. Crucially, the same study found that wearable-detected temperature oscillation agreed with LH-kit ovulation confirmation in 82% of cycles. Even more importantly for the algorithm design, the study showed that menstrual temperature variation is better modelled as a smooth oscillation than as the classic biphasic basal-body-temperature square wave — meaning the shift is more gradual than the textbook BBT curve implies.
The honest caveat: the study also found that "cases of ovulation without a subsequent temperature surge have been reported in the literature." The temperature signal is a useful proxy, not a definitive ovulation marker. Wearables provide cycle-phase insight that is broadly more accurate than the classic morning-thermometer BBT method, but should not be used as a contraceptive on its own.
Illness early warning
Sustained nightly skin temperature elevation of 0.5-1.0°C above personal baseline, combined with elevated resting heart rate and depressed HRV, is a documented illness early-warning pattern that often surfaces 24-48 hours before symptoms. The peer-reviewed literature is more equivocal here than the marketing — skin temperature is not the most accurate way to measure fever, and the wearable signal is best treated as a multi-signal trigger to take a real thermometer reading rather than a fever proxy on its own. The pragmatic read: if a ring tells you your overnight temperature has spiked and your HRV has tanked and you are about to be ill, take a thermometer reading. Do not assume the inverse — a normal-looking ring reading does not rule out illness.
Training recovery
A persistent positive temperature deviation across multiple nights is also a reasonable proxy for accumulated training load not yet recovered — the body has not returned to thermoneutral baseline. This is more often picked up by HRV first, but the temperature signal is independent enough to corroborate.
Reading the data sensibly
The right way to read temperature data is to look at the trend across 5-7 nights, not any single night. Single-night temperature is heavily influenced by bedroom ambient temperature, bedding, alcohol, late meals, and finger circulation. Multi-night deviations are robust; single-night deviations are noise. Most rings restrict analysis to nocturnal sleep-classified periods specifically because that is when motion and ambient artifacts are lowest.
5. SpO2 — spot reading vs continuous overnight
Pulse oximetry — SpO2 — measures the percentage of haemoglobin in arterial blood saturated with oxygen, derived non-invasively from the wavelength absorption of light passed through the finger. A healthy adult at rest sits between 95-100%; sustained readings below 90% are a clinical concern.
The peer-reviewed accuracy evidence for smart-ring SpO2 is strong, particularly under the controlled conditions in which the validation was conducted. A 2024 PubMed Central controlled-hypoxia study reported smart-ring SpO2 RMSE of 2.1% across the 70-100% saturation range — meeting both ISO (4%) and FDA (3.5%) accuracy standards. The same ring performed comparably to the clinical-grade Masimo Radical-7 reference (RMSE 2.8%). Notably, on dark-skinned participants, the smart ring outperformed the medical reference device (RMSE 1.6-1.8% vs 2.9%), which is meaningful given the documented PPG accuracy gap on darker skin tones.
The important caveats: validation was on only 11 healthy young adults aged 22-34, under controlled nonmotion conditions, against invasive arterial-blood-gas SaO2 as the gold standard. That is a small sample, and the findings may not generalise to older or unwell populations. The marketing tendency to claim "medical-grade SpO2" on the basis of this kind of evidence overreaches.
Spot vs continuous SpO2
An important practical distinction is whether the SpO2 measurement is spot (a single triggered reading) or continuous overnight. The Apple Watch and most older Garmin devices use spot measurement; Oura, Ultrahuman, RingConn and several Wellue/O2Ring pulse-oximetry-specific rings run continuous overnight tracking. Continuous SpO2 is what surfaces sleep-disordered-breathing patterns — repeated desaturation dips across the night that suggest obstructive sleep apnea and warrant clinical investigation.
The honest framing on SpO2: a healthy non-smoking adult under 50 will almost certainly see normal SpO2 readings, in which case the metric is data confirming you are fine. The genuine value of overnight continuous SpO2 is the small percentage of users who discover repeated overnight desaturation they were unaware of — at which point the ring has effectively earned its purchase price by sending them to a sleep clinic.
6. Readiness, recovery and wellness scoring — the composite metrics
Every major brand layers a single 0-100 daily "readiness" or "recovery" or "wellness" score on top of the underlying metrics:
- Oura Readiness — combines HRV trend, resting HR trend, sleep duration and quality, skin temperature variance, and recent activity. Subscription-gated.
- Whoop Recovery — heavily HRV-weighted, with sleep and resting HR contributions; produces a percentage rather than 0-100 raw score.
- Ultrahuman Recovery / Readiness — similar inputs, with explicit "Movement Index" and "Sleep Index" sub-scores feeding the top-line.
- RingConn Wellness — composite of HRV, resting HR, sleep, stress proxy, and activity.
- Samsung Galaxy Ring Energy Score — Galaxy Health platform composite using HRV, sleep, activity.
These scores are proprietary, opaque, and somewhat over-claimed. The brand-marketing implication is that the score tells you "how to train today" — push hard if it is high, back off if it is low. The published evidence does not support that level of prescriptive specificity. The score is a useful daily summary of trends already visible in the underlying metrics, and it is reasonable to use as a tiebreaker between "hard session" and "easy session" days, but the underlying numbers — RHR, HRV trend, sleep — give you the same information with less black-box framing.
One thing to know about all of these scores: they have a baseline calibration period (typically 14-30 days) during which the algorithm is learning your personal averages. Scores during this period are essentially meaningless. The brand interfaces sometimes don't make this clear; the underlying technical documentation usually does.
7. What you should act on — and what is just noise
The compressed practical guide, after all of the above:
Act on:
- Sustained resting heart rate elevation of 5+ bpm above your personal trailing average across 3+ nights — investigate.
- Sustained HRV depression of more than ~20% versus your personal trailing average across 3+ nights — back off training intensity.
- Repeated overnight SpO2 dips below 90% — see a GP for a formal sleep-study referral.
- Chronically poor sleep architecture — total sleep under 7 hours across weeks, or REM persistently under 15% — change something behavioural (alcohol timing, bedroom temperature, schedule consistency, screen exposure).
- Skin temperature deviation plus HRV drop plus elevated RHR — early illness signal; check temperature with a real thermometer.
Ignore:
- Single-night readiness scores in either direction. Three-to-seven-night trends only.
- Comparisons with anyone else's HRV or RHR. The metrics are personal baselines, not population rankings.
- Absolute HRV numbers as a measure of fitness or health. Trends versus your own baseline only.
- Cross-platform HRV comparisons (Apple Watch SDNN vs Oura RMSSD). Different metrics, not comparable.
- Sleep-stage estimates on any single night as a basis for medical decisions. Use them for patterns across weeks.
The general principle behind all of this is that ring data is a population-grade signal applied to an individual context — accurate enough in aggregate to be useful, not accurate enough in isolation to be clinical. The right mental model is "trend dashboard" rather than "diagnostic device." That framing turns out to be more useful than the marketing alternative anyway.
Frequently asked questions
Is my smart ring's HRV reading accurate?
Why is my morning HRV so different from one day to the next?
How accurate is smart ring sleep tracking versus an Apple Watch?
Can a smart ring detect ovulation?
Is smart ring SpO2 medical grade?
What should I do if my readiness score is consistently low?
Do I need a subscription to get the health metrics?
Ready to pick a ring? Read the buying guide next
Subscription, sizing, sensor accuracy, app ecosystem, durability and US patent status — the seven factors that decide whether a £349 ring earns its place on your hand.