Background and purpose
Renal stone disease treated with percutaneous nephrolithotomy (PCNL) spans a wide spectrum of anatomic and compositional complexity. Outcomes such as single-procedure stone-free status, operative time, estimated blood loss, and length of stay depend not only on surgeon experience and technology but also on measurable stone and collecting-system characteristics visible on imaging. The S.T.O.N.E. nephrolithometry score, introduced as a standardized reporting framework, distills five reproducible variables from preoperative non-contrast computed tomography (NCCT) into a compact numeric total. The acronym encodes the domains: Stone size, Tract length, Obstruction (hydronephrosis), Number of calyces involved, and Essence (stone attenuation in Hounsfield units).
The intent of the score is not to replace clinical judgment, but to support preoperative counseling, risk communication, operative planning (including access strategy), and fair comparison of outcomes across patients, operators, and institutions when reporting PCNL series.
Why a CT-based composite score?
NCCT is the standard modality for stone characterization before invasive therapy. It provides precise stone dimensions and location, calyceal anatomy, hydronephrosis grade, and attenuation values that correlate with mineral composition. Prior descriptive systems and nomograms addressed pieces of this puzzle; the S.T.O.N.E. system was designed so that each component could be extracted from a routine preoperative scan with defined measurement rules, reducing dependence on idiosyncratic narrative descriptions such as "large branched stone" without quantitative backing.
In validation studies, higher totals have been associated with lower stone-free rates and, in several cohorts, longer operative times and greater resource use. Effect sizes for individual components vary by population: stone size and calyceal involvement frequently emerge as the strongest independent predictors of residual fragments, while tract length, hydronephrosis, and Hounsfield-based "essence" contribute to the composite and may show weaker standalone associations depending on case mix, stent status, and measurement conventions.
Component S: Stone size (axial area)
Stone "size" in this system refers to the largest cross-sectional area of the calculus on a single axial image, expressed in square millimeters (mm²), rather than a single linear diameter alone. This better reflects stone burden when stones are ovoid or irregular and aligns morphometric practice in endourology research.
Typical scoring bands (as widely applied in secondary sources) are:
- 1 point: area < 400 mm²
- 2 points: 400–799 mm²
- 3 points: 800–1599 mm²
- 4 points: ≥ 1600 mm²
Accurate segmentation matters: include the full stone contour on the axial slice where the lesion is largest, and ensure window/level settings allow clear discrimination of the stone from the papilla or adjacent soft tissue.
Component T: Tract length
Tract length estimates the distance from the skin surface to the center of the stone along the anticipated percutaneous access path. The original description emphasizes measurement on NCCT, often along a line approximating a 45° orientation relative to the horizontal, to mimic the geometric path of nephroscope access.
Commonly used thresholds are:
- 1 point: tract length < 100 mm
- 2 points: tract length ≥ 100 mm
Longer tracts can imply more challenging torque on instruments, greater parenchymal traversal, and sometimes different ergonomics for fragment extraction, although the independent contribution of this variable to stone-free status is inconsistent across studies.
Component O: Obstruction (hydronephrosis)
The O domain captures the degree of upper urinary tract obstruction related to the stone, usually graded from NCCT using a hydronephrosis schema such as the Society for Fetal Urology (SFU) scale or an equivalent institutional standard.
A practical binary mapping used in many calculators and secondary validations is:
- 1 point: none or mild hydronephrosis (often SFU grades 0–1)
- 2 points: moderate or severe hydronephrosis (often SFU grades 2–4)
Interpretation requires context: indwelling ureteral stents, prior manipulation, acute infection, and duplicated systems can alter calyceal appearance and may shift perceived obstruction independent of stone clearance difficulty during PCNL.
Component N: Number of involved calyces
This component describes how much of the collecting system contains stone material. It moves beyond simple stone volume to capture spatial distribution, which strongly influences how many angles must be cleared and whether a staghorn configuration is present.
Typical tiers are:
- 1 point: stone confined to one or two calyces
- 2 points: involvement of three calyces
- 3 points: staghorn morphology or extensive multi-calyceal branched burden
Exact assignment should follow your center's radiologic criteria for staghorn (complete vs partial) and whether multiple non-contiguous calculi are counted as multi-calyceal disease versus separate stones in the same kidney.
Component E: Essence (stone attenuation)
Essence reflects mean stone attenuation in Hounsfield units (HU) on NCCT, obtained by placing a region of interest over the stone body. Attenuation correlates imperfectly with composition but helps separate very dense calcium-dominant material from relatively lucent uric acid–predominant stones at the extremes.
A frequently applied cutoff is:
- 1 point: mean HU < 950
- 2 points: mean HU ≥ 950
Some groups prefer peak HU or multiple ROI averages; inter-vendor calibration, beam hardening, and slice thickness can shift measured values slightly. Consistency within a practice matters more than small numeric disagreements at the boundary.
Total score and complexity bands
The total S.T.O.N.E. score is the sum of the five components. In commonly cited secondary reports, observed totals often fall in a band of roughly 5 through 13, with 5 representing the lowest aggregate complexity in that framework and 13 the highest. Original single-center series may show a narrower empirical range; totals below 5 usually suggest incomplete data entry or nonstandard measurement rather than a distinct clinical class.
For orientation (labels vary slightly between publications), many users group totals as:
- Low complexity: approximately 5–7
- Moderate complexity: approximately 8–10
- High complexity: approximately 11–13
These bands summarize population-level associations with stone-free probability and operative metrics; they should not be read as rigid rules for triage to PCNL versus alternative modalities.
Integration with surgical planning and quality reporting
Clinically, the score helps structure multidisciplinary discussion: higher totals may prompt advance planning for multi-tract access, flexible nephroscopy adjuncts, longer anesthesia time, staged procedures, or clearer informed consent around residual fragment risk. For quality improvement, aggregating S.T.O.N.E. scores alongside Clavien complications and single-procedure stone-free rates allows more meaningful benchmarking than raw stone-free percentages alone, because case complexity differs markedly between centers.
Limitations and practical caveats
The S.T.O.N.E. score does not encode every determinant of PCNL success. Infundibular width, pole location (especially lower pole), body habitus, renal rotation, anomalous anatomy, active infection, and anticoagulation status are examples of factors that may outweigh a modest numeric total. External validations have reported moderate discriminative performance for stone-free prediction; calibration can differ when stents are present, when definitions of residual fragments differ (>4 mm vs clinically insignificant fragment thresholds), or when flexible ureteroscopy is used as an adjunct.
Users should also recognize competing systems (for example Guy's stone score, CROES nomogram, S-ReSC) that emphasize overlapping but non-identical features. Choice of system may depend on institutional audit habits, research comparability, or surgeon familiarity.
How to use this calculator responsibly
Enter the axial stone area and tract length from your NCCT, then select the best-matching categories for hydronephrosis, calyceal involvement, and stone HU. Verify that radiology and urology agree on staghorn classification and on whether mean HU is the appropriate metric for your protocol. Use the output as structured documentation and counseling support alongside individualized assessment—not as an isolated decision rule.