Overview
Solitary pulmonary nodules are common findings on chest imaging. Clinicians are often asked to estimate how likely a nodule is to represent primary lung cancer or other malignancy before committing to invasive testing, intensive follow-up, or surgical evaluation. The Mayo Clinic model for solitary pulmonary nodule malignancy risk—often called the Swensen model after the group that published the logistic regression equation—is a pretest probability tool. It combines a small set of patient-level and imaging features into a single number: the estimated probability that the nodule is malignant.
The model was derived in a defined referral population using logistic regression. Its output is not a diagnosis and does not replace multidisciplinary review, comparison with prior imaging, structured reporting systems used in screening (such as Lung-RADS), or guideline-based management algorithms for incidental nodules (for example, Fleischner Society recommendations where they apply). It remains useful for teaching, shared decision-making discussions, and understanding how classic clinical and radiologic features were quantified in one influential early framework.
Clinical role of pretest probability
Pretest probability is the estimated likelihood of disease before a definitive test. For nodules, that probability influences whether observation, further imaging, functional imaging, biopsy, or surgical excision is reasonable. Models like the Mayo equation attempt to standardize how much weight to give to age, tobacco exposure, cancer history, nodule size, lobe location, and margin morphology—features that clinicians already integrate qualitatively.
A logistic model outputs a probability between 0 and 1 (often expressed as a percent). Very low values align with scenarios where surveillance may be discussed when appropriate; higher values align with scenarios where expedited diagnosis is often considered. Actual thresholds for action depend on patient preferences, comorbidity, nodule stability over time, institutional pathways, and current specialty society guidance—not on any single historic probability cut point in isolation.
What the calculator estimates
This implementation estimates probability of malignancy for a solitary pulmonary nodule using the published logistic coefficients associated with the Mayo Clinic derivation. The underlying equation has the form:
P(malignant) = eX / (1 + eX), where X is a weighted sum (linear predictor, or logit) of the inputs listed below.
The calculator displays both the final probability and a term-by-term breakdown of the logit so learners can see how each factor raises or lowers the estimated risk.
Model inputs in clinical terms
Age (years)
Lung cancer incidence rises with age in the general population. In the logistic formulation, each additional year contributes linearly to the logit through its coefficient, reflecting higher baseline odds of malignancy in older patients in the derivation cohort. The model does not incorporate sex, race, family history, occupational exposures, or molecular risk—all of which can matter in real practice.
Current or recent cigarette smoking
Tobacco exposure is one of the strongest epidemiologic risk factors for lung cancer. The model uses a binary indicator for whether the patient is a current or recent cigarette smoker (coded as present or absent). It does not capture pack-years, time since quitting, passive smoke, or other inhaled exposures. Patients who never smoked can still have malignant nodules; the model simply assigns no smoking-related increment to the logit when the smoking item is negative.
Extrathoracic cancer diagnosed more than five years ago
A history of cancer outside the chest can increase concern for metastatic disease to the lung or for second primary malignancies, depending on tumor biology and staging history. The Mayo model uses a specific framing aligned with the original study: a binary variable for extrathoracic malignancy diagnosed more than five years before the nodule evaluation. It does not distinguish organ of origin, histology, stage, or treatment. Recent or thoracic primaries are not captured by this single item; clinicians must contextualize any prior malignancy separately.
Nodule diameter (millimeters)
Nodule size on thin-section CT is a central driver of malignancy risk in many cohorts. The implementation uses the largest dimension in millimeters, consistent with common reporting and with the coefficient convention used for this equation in widely reproduced references. Very small measurements and very large masses both require clinical judgment: the model was developed in a particular size spectrum and may be miscalibrated outside the range of patients used to fit the coefficients.
Upper lobe location
Primary lung cancers show anatomic predilections; upper lobe predominance has been described in multiple epidemiologic and imaging series. The model adds a fixed increment to the logit when the nodule is described as residing in an upper lobe versus other locations. Lingula, middle lobe, and lower lobe sites are grouped as “not upper” for this binary term. Multifocal disease, lymph node enlargement, and pleural findings are not part of the score.
Spiculated margin
Spiculation refers to fine linear strands radiating from the nodule margin on CT, a morphologic pattern associated with invasive growth and higher malignancy probability in many radiologic analyses. The model treats spiculation as a binary radiologic feature. Smooth, well-circumscribed appearances generally lower concern in isolation, though benign entities and some malignancies can still lack spiculation. Reader variability and slice thickness can affect whether spiculation is perceived; correlation with radiology report language and, when needed, subspecialty read is important.
How results should be read
The output is a single probability estimate conditioned only on the variables in the model. It does not incorporate growth rate between studies, solid versus subsolid composition, cavitation, calcification pattern, emphysema, infection workup, or laboratory data. Two patients with the same score can have very different true risks once those additional facts are known.
When probabilities are low, many clinicians still consider structured follow-up if guideline criteria are met. When probabilities are high, tissue diagnosis is not always immediate or safe; comorbidity, anticoagulation, lesion accessibility, and patient goals alter next steps. The Mayo probability is best viewed as one structured input into a broader assessment rather than a trigger rule.
Modern imaging pathways and calibration
Incidental nodule care has evolved with widespread CT, lung cancer screening programs, and standardized follow-up recommendations. Contemporary pathways often emphasize nodule type (solid, part-solid, ground-glass), volume doubling time, and reporting frameworks designed for screening populations. The Mayo model predates much of this infrastructure; its coefficients reflect the patients, scanners, and referral patterns of its era. In current practice, discrepancies between model output and Lung-RADS or guideline-based intervals are expected and do not necessarily indicate that either approach is “wrong”—they answer slightly different questions in different populations.
Patient communication and documentation
When discussing results, it can help to explain that the tool is a statistical estimate from a published formula, not a biopsy result. Patients may benefit from hearing which factors increased their estimate (for example, size or spiculation) and which factors were not considered (for example, prior scans showing stability). Documenting shared decisions should reference the full clinical picture, not the numeric output alone.
Educational and safety notes
- Intended for professional education and contextualization, not as a sole determinant of therapy.
- Does not establish indication for biopsy, surgery, PET, or surveillance interval.
- Should not be used for pediatric populations or for findings that are not reasonably classified as solitary pulmonary nodules without additional clinical reasoning.
- Always reconcile inputs with the official radiology report and, when applicable, prior imaging.