ecancermedicalscience

Review

A comparative analysis of cancer stage classification systems for registries

Abhinav Ramraja and Hariharasudhan Saravananb

Department of Community Medicine, Stanley Medical College, Chennai 600001, India

ahttps://orcid.org/0009-0002-9621-3248

bhttps://orcid.org/0009-0008-6029-9610


Abstract

Cancer stage at diagnosis is a critical determinant of survival outcomes and a key metric for population-based cancer surveillance. Despite the existence of several cancer staging classifications implemented in registries worldwide, their relative utility remains poorly understood. This review provides a comprehensive and comparative evaluation of the principles, data requirements and practical utility of the traditional tumor-node-metastasis (TNM), Surveillance, Epidemiology and End Result Summary, Condensed TNM, Essential TNM, registry-derived and extent-of-disease staging systems. It also introduces a conceptual framework for evaluating these systems, in order to aid registries in selecting context-appropriate staging methods. Our appraisal, focusing primarily on aspects pertaining to data collection and consolidation, recognises that while the traditional TNM system offers the highest clinical and prognostic value, its complexity leads to poor completeness in population-based registries, particularly in low- and middle-income countries. Simplified alternatives can achieve higher completion rates but offer limited clinical utility. A balanced approach jointly incorporating clinical value and practical feasibility is essential, highlighting the need for hybrid solutions to support cancer registration. Electronic aids such as staging applications and natural language processing or AI-driven tools can streamline staging by automating data extraction, minimising errors and inferring missing components. Future efforts must prioritise accessible, multilingual platforms to standardise surveillance and improve accuracy in resource-limited settings.

Keywords: neoplasm staging, registries, epidemiological monitoring, data collection, records

Correspondence to: Abhinav Ramraj
Email: rx.abhinavramraj@outlook.com

Published: 03/06/2025
Received: 19/01/2025

Publication costs for this article were supported by ecancer (UK Charity number 1176307).

Copyright: © the authors; licensee ecancermedicalscience. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Background

Cancer incidence continues to rise, predominantly affecting low- and middle-income countries which have higher incidence and mortality rates. Devising, implementing and monitoring cancer control strategies necessitate reliable real-world data on cancer incidence and survival at the population level. Cancer registries, especially population-based (PBCRs), are the only source of such population-level burden and outcome indicators and continue to provide timely, real-world data across diverse settings [1].

One of the most defining characteristics of cancer, proven to influence survival consistently across types, is its stage at presentation. Cancer stage classifies the extent or ‘spread’ of the disease and has both clinical and epidemiological significance; the stage at diagnosis determines the most suitable treatment approach and is also an established and strong correlate of survival. Recording the stage of cancer cases, ideally at the population level, is not only necessary to estimate stage-specific survival and evaluate the efficacy of healthcare systems, but is also crucial to monitor longitudinal trends in stage distributions [26]. This helps draw meaningful comparisons between regions or populations and assess the impact of cancer control programs. Cancers are staged using criteria defined by classification systems which categorise patients with similar anatomic extents of disease into discrete and ordinal categories or ‘stage-groups’. Cases within a stage category are expected to have similar prognosis and management, hence necessitating such classifications to undergo periodic and evidence-based revision based on real-world clinical data to better convey prognostic value through their categories.

Although lacking granular treatment-related details, PBCRs cover well-defined geographic and demographic regions and aggregate data from a wide range of sources (field, hospitals, diagnostic centers and so on) in their designated jurisdictions [79]. By offering a comprehensive overview of incidence, prevalence and survival trends at the population level, they continue to remain essential for understanding cancer burden and outcomes in regional public health planning. On the other hand, other registry types such as hospital-based (HBCR) and tumour specific or special purpose registries offer critical insights by leveraging detailed clinical records and treatment data. These systems are often integrated with electronic health records, thus enabling more accurate stage assignments and robust outcome evaluations [10]. HBCRs operate within healthcare facilities, compiling detailed treatment information on patients diagnosed or treated at their individual facilities, and support internal patient care and quality improvement [1113]. Special-purpose or tumour-specific registries, meanwhile, target specific aspects related to cancer care and often cater to specific tumour sites, rare cancers or niche population groups. They combine clinical data with more extensive molecular, genetic and treatment variables such as biomarkers and family histories, enabling in-depth analyses into aetiologies and therapeutic efficacies in order to enable and support focused research and personalised approaches [1417]. Collectively these registry types complement each other by addressing different dimensions of cancer surveillance – while HBCRs offer detailed and granular, but institution-specific, insights focusing on care-related outcomes, PBCRs provide a broad population-level perspective of burden and special purpose/tumour-specific registries deliver focused data to drive specialised research and improvements in targeted cancer care.

However, collecting stage-related information at scale is a challenging endeavour for both high-income countries and low- and middle-income countries (LMICs) [18]. While PBCRs across the globe have generally lacked stage-related information, with several not even attempting to capture stage data [19, 20], the data that does exist is often incomplete and of insufficient quality for analysis [6, 18, 21]. The extraction of TNM data poses significant challenges in LMICs, where infrastructural constraints and fragmented healthcare systems amplify existing complexities. Unlike high-income settings, LMICs often lack integrated health information systems, forcing registries to rely on disparate data sources such as clinical records, pathology reports and imaging findings, which are frequently inconsistent or incomplete [22]. These sources are often spread across multiple facilities, creating a fragmented data landscape that spans long periods of the clinical care continuum. In many instances, clinicians in resource-constrained settings may fail to explicitly document the TNM components in medical records, either due to time constraints, lack of awareness or the absence of standardised reporting protocols [2325]. This leaves critical details to be inferred, which in turn forces registrars—who may lack clinical backgrounds—to interpret ambiguous narrative descriptions, leading to errors and misclassification. Furthermore, the lack of centralised electronic health records (EHRs) and reliance on paper-based systems without structured reporting formats, exacerbates these issues as free-text narratives can omit critical details like nodal involvement or metastasis status, hinder timely data retrieval and increase the risk of data loss. In many LMICs, limited access to advanced diagnostic tools, such as PET-CT scans or molecular testing, further restricts the ability to accurately determine metastatic or nodal status. Addressing these challenges requires targeted investments in health infrastructure, standardised data collection and reporting protocols [25, 26] and capacity-building initiatives to ensure accuracy in point-of-care settings [27, 28].

Moreover, even if data collection is effective, the lack of a standardised approach to assigning cancer stage leads to registries within and across countries reporting stage assigned on the basis of different criteria. Several staging systems exist – the TNM system, the Surveillance, Epidemiology and End Result (SEER) Summary stage, Essential TNM (ETNM), Condensed TNM (CTNM) and the degree of spread systems. Although currently in use in registries worldwide, they are poorly understood, with little published literature detailing the nature of the categorisation offered. They differ based on their intended application, clinical relevance and categorise in a manner that often cannot be compared with each other. This hinders comparability and harmonisation of data for epidemiological or outcomes analyses and benchmarking studies within health systems, regions and between countries. Moreover, it is unlikely that we achieve international consensus in the near future [29] and registries continue to record and report stages using varying systems, with high-income countries like the United States, Canada, United Kingdom and in Europe, reporting in more than one[21, 22, 2931].

The lack of a conceptual framework comparing these systems prevents registries, especially those in LMICs, from critically appraising the various classification schemes and determining the system(s) and approaches best suited to their epidemiological needs and data collection methods. This review begins with a brief overview of the classification systems and later offers a comparison from the perspective of data collection and consolidation. The detailed tumour-specific criteria can be found in the latest versions of the corresponding manuals [18, 3234] but are not described here.


Cancer stage classification systems

Staging classification systems have been designed for diverse settings at different periods in time and with different intended applications. They primarily vary in the nature and depth of information required to assign a particular case of cancer to a stage category. The oldest and most widely implemented system, in clinical practice and in registries across the globe, the Union for International Cancer Control (UICC)/ American Joint Committee on Cancer (AJCC) TNM system (hereafter referred to as just the TNM system) has been associated with high rates of incompleteness [20, 29, 31]. Recognising the challenges in collecting TNM stage information, several simplified classifications and approaches to reconciling missing data have since been developed, which are summarised in Figure 1.


UICC/AJCC TNM

The tumour, node and metastasis (TNM) staging system developed in the mid-20th century and currently maintained by the UICC in collaboration with the AJCC has remained the global standard for assessing cancer extent for over 75 years. Although these are separate organisations that publish their own manuals, their goal is to maintain identical classifications and any variations that may exist are reconciled in subsequent editions, the latest of which was published in 2017 [18, 32]. However, despite shortcomings, and in view of its intuitive underlying principle and the consistent and strong association between the TNM stage and patient outcomes, it remains the most widely followed by clinicians and registries worldwide.

Figure 1. Cancer stage classification systems and approaches to data consolidation.


Condensed TNM

CTNM was developed by the European Network of Cancer Registries (ENCRs) in 2002 as a simplified alternative to the traditional TNM system [34, 35]. It is based on the same T, N and M components but uses general criteria applicable to all tumour types, utilising both clinical and pathological TNM and/or descriptive information. Since its inception in 2002, however, the guidelines have not been revised and it has not been widely adopted by European registries which seem to prefer traditional TNM instead [21].


Essential TNM

The ETNM staging system, a collaborative effort by the UICC, the International Agency for Research on Cancer (IARC), and the International Association of Cancer Registries, was designed for use when complete TNM data is unavailable, especially for registries in LMICs. It aims to enable stage assignment with lesser data but in a manner comparable with the TNM stage categories [20, 36]. However, it has been deemed to require further field-testing and dissemination programs [20] and has not yet been officially implemented in registries.


Registry-derived stage

The registry-derived (RD) stage emerged as a response to the lack of standardised stage data in Australian registries through the national stage, treatment and recurrence project, initiated in 2011 by Cancer Australia in collaboration with the Australian Institute of Health and Welfare and state and territory cancer registries [37, 38]. This approach represents an important step forward in standardising cancer stage data consolidation and has proven to be a valuable tool in addressing notable gaps in Australia’s national cancer data; it has significantly improved completeness of staging data across various cancer types enabling the country’s first national-level reporting of cancer stage at diagnosis for the top five incident cancers [38].


SEER Summary 2018

The SEER Summary Stage (SSS) was initially developed by the National Cancer Institute’s End Results Group in the 1950s for the SEER Program, an authoritative source of cancer stage data in the United States [30]. It continues to remain an integral part of reporting in the SEER Program and is currently (going forward from January 2018) mandated in central registries in the United States [39, 40].


Miscellaneous

Extent of disease (EoD) systems are locally developed by and tailored to meet the needs of specific PBCRs. For example, the New South Wales Cancer Registry (NSWCR) uses the ‘Degree of Spread (DoS)’ system for all non-hematopoietic cancers on the basis of criteria defined by the IARC [41]. Tumour-specific classification systems, such as Ann–Arbor for lymphomas and Duke’s for colorectal cancer, exist but these are largely not relevant in the context of data consolidation in registries.

The Toronto Guidelines address the unique biological and clinical features of childhood malignancies and provide a pragmatic, consensus-based framework for staging [42, 43]. Emphasising clinical, imaging and histological criteria for risk stratification and clinical decision-making, these guidelines harmonise data collection across institutions to improve comparability in survival analyses and facilitate collaborative research. The Guidelines advocate for type-specific staging criteria for 16 of the most prevalent paediatric malignancies and has been deemed optimal for PBCRs by the IACR and ENCR [44, 45]. Moreover, they delineate a two-tiered framework whereby registries constrained by limited resources or data access are provided with less granular criteria (Tier 1), whereas those with greater resources are afforded more detailed criteria (Tier 2) [42, 44, 46]. To ensure comparability across registries, the stage categories defined under Tier 2 may be consolidated to correspond with those of Tier 1.


Comparative analysis of cancer stage classifications

A comparative analysis of their underlying principles, classification schemes and depth of information required to stage cases, with a focus on data collection and consolidation within registries, is presented under the following headings and summarised in Figure 2.


Categorisation scheme

The TNM system has specific criteria for each cancer type, primarily based on the anatomic extent, with the core components being tumour size and/or extent (T), lymph node involvement (N) and metastasis (M). After assigning values for T, N and M using the defined criteria, cases are grouped into one of several disparate but tiered categories; allowing for ‘telescoping’ [18] between either broader stage groups 0, I, II, III and IV or narrower distinctive subcategories within these groups (Ia1, IIb and so on), based on the depth of available information. It also recognises the source of, and point during care the relevant information is obtained, by distinguishing between clinical, pathological, post-therapy and recurrence staging.

CTNM and ETNM, however, while retaining the core T, N and M components, employ a largely dichotomous categorisation; localised and advanced for tumour size/extent (L versus A in CTNM and L/L1/L2 versus A/A1/A2 in ETNM), nodes (N0 versus N+ in CTNM and R- versus R1/R2 in ETNM) and metastasis (M0 versus M+ in CTNM and M- versus M+ in ETNM). The resultant stage categories are fewer but broader – tumour localised (TLN0M0), tumour with local spread (TAN0M0), tumour with regional spread (any T N+ M0) and advanced cancer (any T any N M+) in CTNM. In ETNM, the categories – localised limited (L R- M-), localised advanced (A R- M-), regional limited or extensive (L/A R+ M-), to distant metastasis (L/A any R M+), are also designed to correlate with the traditional TNM stage groups (I-IV) [33], but with the relative position of localised advanced with regional interchanging based on cancer type.

RD staging, on the other hand, is not a classification system, but rather a country-specific approach to deriving stage for population-based analyses from routine sources where TNM data is incomplete [22]. It uses simplified business rules developed by the Victoria Cancer Registry [38, 41, 47] which articulate the decision-making process for defining the RD-stage categories in alignment with the TNM stage groups defined in the AJCC 7th Edition. The stage is reported I-IV, but the detailed business rules used for assignment are currently unavailable on their website or elsewhere in published literature.

Figure 2. Comparing the cancer stage classifications from the perspective of data collection and consolidation.

SSS represents a fundamental approach to categorisation quite distinct from the TNM-based systems. It employs a hierarchical one-digit coding system with six categories, but can be further condensed into three broad levels: localised, regional and distant [39]. Unlike TNM, where tumour size and number and pattern of nodal involvement are crucial criteria, SSS disregards either and only concerns with the organ(s) involved and follows the usual dichotomy for nodal involvement and metastasis; defaulting to code 3 when nodes are involved and code 7 when metastasis is present, regardless of primary tumour extent.

EoD/DoS systems classify tumours broadly as localised, regional or metastatic, (but sometimes as hybrid categories like L/R or R/M when uncertain) as well, but use unclear criteria that varies among registries [21, 22] and which are not published.


Data complexity and ease of data consolidation

The TNM system is the most comprehensive, offering granular and detailed criteria while also explicitly providing the range of ICD-O3 codes the classifications are applicable to, preventing ambiguity. However, it necessitates considerable detail on specific and ineludible aspects on disease extent to assign a stage. Assigning a stage group often requires all three (T, N, M) values and the source of information (clinical vs. pathological) and missing any crucial information may render assignment impossible. To deal with partial information and yield data sufficient for surveillance purposes, it does allow for combining clinical and pathological information to determine the ‘harmonized’ [48] or ‘working stage’ [18]. While a ‘restrictive’ approach to data consolidation requires all three values, a ‘non-restrictive’ or ‘relaxed’ approach assumes missing N or M values as zero to derive the final stage variable [6, 22, 29]. Attempts to include information from other systems (like Condensed TNM and degree of spread) as a substitute for missing TNM information to derive a ‘reconstructed stage’ have also been made [21].

ETNM prevents the need for exhaustive searches by advocating a logical decision process in the order from M >N >T (and stage groups IV> I) enabling cancer registrars to record the furthest extent of disease first, allowing them to stop searching for additional information [20, 36], but this can be applied for CTNM as well. While N and M are assigned purely on the basis of presence in both CTNM and ETNM (although N is differentiated as R1, regional limited and R2, regional extensive in ETNM, they do not contribute to a difference in the final stage category, which codes both as regional), T is determined by convention (usually T1-T2 for L and T3-T4 for A) in CTNM, whereas specific criteria that exist in traditional TNM are used for T in ETNM. However, as ETNM includes certain criteria found in the TNM classification deemed important, specific information clarifying them are required without which assignment is not possible, ensuring greater intrinsic clinical value, although making ETNM harder to achieve completion in as compared to CTNM.

RD staging uses individual TNM values or the TNM stage group if available, but a key feature is its ability to generate stage by leveraging data supplementation from secondary sources, (hospital morbidity data collections, pathology reports or clinical documentation) when assumptions are required due to data limitations [41]. Hence, data consolidation requires manual corroboration and can be time- and resource-intensive, and as it relies on assumptions and generalisations, may potentially lead to errors in staging.

SSS utilises clinical, pathological and radiological information primarily from narrative descriptions in medical records, pathology and imaging reports. SSS also provides explicit provision for less detailed information by including the ‘not-otherwise specified, NOS’ suffix within its categories. Furthermore, it employs a relaxed approach to unknown nodal or metastasis status by treating missing information as negative, which may lead to potential under-staging compared to TNM. Even isolated or unclarified and pointed information, as is often present in manual medical records or diagnostic reports, is sufficient for registrars to assign a stage, thus offering a determinate categorisation without any scope of ambiguity or need for in-depth knowledge of anatomy. The single-page coding scheme for each cancer type further supports its ease of use, especially for non-health professionals. SSS is not only less complex and easier to learn and use than the TNM system but is also updated much less frequently, not changing with revisions to the TNM system. However, it is not directly comparable with the TNM systems even when all T, N and M values (let alone the stage groups) are known. For example, as seen in the case of lip and oral cavity cancers, T information cannot be used to differentiate between codes 1 and 2 without additional information due to incomparable criteria used [39]. While it can use TNM values when it is the only data available, SSS prioritises narrative data, especially when such discrepancies arise, thus necessitating intensive manual corroboration.

EoD/DoS systems employ the same localised-regional-distant classification scheme as SSS [41], sourcing information from multiple sources, but rely on a subjective assignment to their categories, unlike SSS which explicitly lays down all possible terms clarifying disease extent to definitive stage categories. Since the criteria used by these registries are often unclear, comparisons or mapping with the TNM and other staging systems are not possible.


Utility and limitations

The UICC/AJCC TNM criteria undergo periodic evidence-based revisions based on newly acquired clinical data and advances in the understanding of cancer biology and prognosis by a dedicated international multi-disciplinary team, ensuring its consistency across global healthcare settings [18]. Hence the TNM system remains the most clinically useful classification offering the greatest prognostic value. Nevertheless, due to the complexity and specificity of information required, completeness of T, N, M or stage group data remain low in registries across the world, especially in LMICs [20, 29, 49]. Moreover, it has increasingly begun including non-anatomic prognostic factors (such as PSA for prostate, receptor status for breast and p16 positivity for oropharyngeal cancers) to better convey clinical and prognostic value across categories [18, 32]. These additional diagnostic data points are harder to perform and collect at scale, especially in LMICs with resource and infrastructure constraints, and can be expected to make complete staging information even harder to achieve for their registries [48]. There is also inconsistent and even incorrect use of terminology related to staging with this system, different editions or approaches being used without being recorded and pooling of stage at diagnosis with stage assessed at other points of care [29, 48]. While some registries only had the stage group and not the individual TNM variables, others combined and/or did not record the source (clinical vs. pathological) of information used to assign the final stage variable [29]. Registries often did not possess the edition of TNM classification that was used to assign variables as this was often not recorded in the medical record or pathology reports [29]. Hence, this system may require adaptation for use in registries and simpler systems are rightly considered.

In CTNM, due to the crude binary nature of categorisation of all three components, the resultant stage categories are extremely broad, often grouping cases with grossly differing prognoses together, thus holding minimal prognostic or clinical value. Since ETNM categories are comparable with the TNM stage groups, they offer more clinical value than CTNM. However, ETNM is currently available for only eight cancer types [33].

Although data consolidation using RD-staging is resource-intensive, with regards to prognostic value, it has demonstrated a higher agreement with the AJCC TNM stage groups compared to Degree of Spread [41, 47]. However, as it is derived solely from limited datasets excluding radiology reports and clinical correlation, may lack the nuanced details critical for individual patient management and clinical decision-making [41]. Regardless of its limitations, devising country- or registry-specific business rules suited to existing datasets and data collection pipelines presents the most likely and feasible solution for missing and/or hard-to-attain TNM information.

Although the SSS classification scheme offers more granularity and specificity than other simplified systems, since it is not directly comparable to the TNM categories, it is less suitable for clinical practice or research. The relative simplicity and broad applicability has allowed high rates of completeness of up to 90% for breast and cervical cancer in the US [20, 30], whereas stability over time and TNM editions has allowed long-term trend analyses [50]. However, since SSS has been designed exclusively for use in registries [31], it remains relatively unknown outside the registry community and has been used only in some countries such as the United States, Canada, Australia and New Zealand [22, 31, 50]. Nevertheless, it is a comprehensive staging classification system that offers a reliable and validated practical approach to stage data collection and consolidation that is broad enough to monitor longitudinal trends and evaluate the success of cancer control programs and other epidemiological efforts. The authors recognise it as the only reliable alternative to the TNM system for use in cancer registries in LMICs, which have consistently faced low rates of TNM information, in the near future.

EoD/DoS systems have been adopted by PBCRs in several countries in Europe, Australia and New Zealand, either solely or in combination with other staging systems [22, 31]. The relative simplicity of these systems has facilitated high levels of completeness in the NSWCR and in some countries like Norway and has been extensively used for reporting, survival analyses and international comparisons [29, 41]. It is important to note that while the broad categories of these systems allow for easier collection and staging, yielding data for epidemiological analyses, the lack of detail limits their application for prognostication.


Conclusion

This review offers a comprehensive and comparative appraisal of cancer stage classifications, providing registrars and researchers with a framework for independently evaluating these systems across key data consolidation characteristics. As a consequence of the complexities of cancer classification, it is evident that cancer registries are presented with two primary strategic options. They could either implement targeted strategies to improve existing TNM information abstraction or explore alternative classification systems that might better suit contextual constraints. With the continued addition of sophisticated data points reflecting non-anatomic prognostic factors, sufficient TNM information for stage categorisation using the newer versions of the criteria are expected to be even harder to attain, especially at scale and in developing nations. Considering the current global landscape which demonstrates significant variability in availability and quality, and with the lowest data contribution from LMICs, alternative strategies for deriving data for population-based analyses merit serious consideration.

By establishing a foundational subjective ontology for cancer staging, the review not only clarifies existing knowledge gaps, but also intends to encourage open discourse on methodological alternatives to established TNM-based systems for the purposes of epidemiological analysis. This review particularly directs the scope of further research by introducing new avenues for methodological refinement, wherein mapping disease extents of multiple cancer types across all available classification systems could generate more nuanced analytical insights and potentially develop sophisticated stage conversion algorithms capable of seamlessly interchanging or translating between different stage categories. The authors believe that cancer-type-specific strategies that effectively address the aforementioned challenges associated with data incompatibility can yield solutions to facilitating data harmonisation for benchmarking studies.

Nevertheless, the integration of electronic aids like staging applications and automated data retrieval tools offer transformative potential to address such challenges by consolidating and interpreting clinical data. Digital staging applications like CanStaging+ [51], AJCC Staging Online [52] and TNM Cancer Staging (Android) [53] among others, can streamline workflows by automating repetitive tasks. Such tools can not only minimise human error, but also ensure adherence to the latest classification guidelines. By enabling clinicians or even clerical assistants to quickly compute stage based on well-defined preset parameters, these tools can greatly reduce the need for laborious corroboration of retrieved clinical information to assign the T, N and M components and final stage group at the level of the registrar/data entry operator. Furthermore, these tools have the potential to be integrated with natural language processing frameworks to enable structured extraction of stage-related data from unstructured electronic medical record narratives [5456]. Predictive tools can further enhance completeness by leveraging machine learning to infer missing staging components from existing partial clinical inputs, particularly beneficial in low-resource settings with fragmented data. Future global efforts should prioritise the development and refinement of open-access, multilingual platforms tailored to diverse healthcare infrastructures and standardise their application to maximise impact on patient outcomes and cancer control strategies. While such tools require rigorous validation [57] and adaptation to local contexts, their adoption could significantly improve staging accuracy and efficiency, especially in LMICs who are more likely to face operational constraints.

However, irrespective of the chosen system(s) and/or approaches, it is imperative to ensure that an alternate method of stage categorisation does possess demonstrable clinical value, ideally encompassing both prognostic and management dimensions. The review unequivocally reiterates that staging classifications designed solely for data collection and reporting purposes are devoid of actionable value both at the individual and population levels. Ultimately, registrars must ensure that data consolidation and harmonisation remain not merely a bureaucratic exercise, but a meaningful tool for clinical and outcomes research capable of supporting population-level survival analyses and guiding tangible cancer surveillance and control programs.

To address these challenges a nuanced and balanced approach is called for, which involves implementing periodic evidence-based revisions to incrementally enhance prognostic value while also maintaining sufficient simplicity to facilitate large-scale data collection. Moreover, any such system must provide sufficient stability to enable meaningful longitudinal stage distribution monitoring. Only a pragmatic approach can reconcile the diverse needs of multiple stakeholders – clinicians, cancer registrars, public health professionals and policy makers – ultimately contributing to more effective resource allocation and ensuring compliance with evolving cancer care standards.


Acknowledgments

None.


Conflicts of interest

The authors report no conflicts of interest.


Funding

This research did not receive any financial support or funding.


Author contributions

AR conceived and designed the study, reviewed the literature and wrote the final manuscript. HS reviewed the literature and wrote the final manuscript. Both authors approved the final manuscript.


References

1. Allemani C, e Silva GA, and Mathur P, et al (2024) The CONCORD–lancet global commission on cancer The Lancet 404(10450) 320–322 https://doi.org/10.1016/S0140-6736(24)01345-X

2. Sant M, Allemani C, and Capocaccia R, et al (2003) Stage at diagnosis is a key explanation of differences in breast cancer survival across Europe Int J Cancer 106(3) 416–422 https://doi.org/10.1002/ijc.11226 PMID: 12845683

3. Coleman M, Forman D, and Bryant H, et al (2011) Cancer survival in Australia, Canada, Denmark, Norway, Sweden, and the UK, 1995–2007 (the International Cancer Benchmarking Partnership): an analysis of population-based cancer registry data The Lancet 377(9760) 127–138 https://doi.org/10.1016/S0140-6736(10)62231-3

4. Cabasag CJ, Arnold M, and Rutherford M, et al (2022) Pancreatic cancer survival by stage and age in seven high-income countries (ICBP SURVMARK-2): a population-based study Br J Cancer 126(12) 1774–1782 https://doi.org/10.1038/s41416-022-01752-3 PMID: 35236937 PMCID: 9174285

5. Cabasag CJ, Butler J, and Arnold M, et al (2020) Exploring variations in ovarian cancer survival by age and stage (ICBP SurvMark-2): a population-based study Gynecol Oncol 157(1) 234–244 https://doi.org/10.1016/j.ygyno.2019.12.047 PMID: 32005583

6. Benitez-Majano S, Fowler H, and Maringe C, et al (2016) Deriving stage at diagnosis from multiple population-based sources: colorectal and lung cancer in England Br J Cancer 115(3) 391–400 https://doi.org/10.1038/bjc.2016.177 PMID: 27328310 PMCID: 4973150

7. Tucker TC, Durbin EB, and McDowell JK, et al (2019) Unlocking the potential of population‐based cancer registries Cancer 125(21) 3729–3737 https://doi.org/10.1002/cncr.32355 PMID: 31381143 PMCID: 6851856

8. Giusti F, Martos C, and Trama A, et al (2023) Cancer treatment data available in European cancer registries: where are we and where are we going? Front Oncol 13 1109978 https://doi.org/10.3389/fonc.2023.1109978 PMID: 36845700 PMCID: 9944949

9. Behera P and Patro BK (2018) Population based cancer registry of India–the challenges and opportunities Asian Pac J Cancer Prev 19(10) 2885 PMID: 30362317 PMCID: 6291027

10. Mohammadzadeh Z, Ghazisaeedi M, and Nahvijou A, et al (2017) Systematic review of hospital based cancer registries (HBCRs): necessary tool to improve quality of care in cancer patients Asian Pac J Cancer Prev 18(8) 2027–2033 PMID: 28843217 PMCID: 5697455

11. Sørensen HT and Lash TL (2011) Use of administrative hospital registry data and a civil registry to measure survival and other outcomes after cancer Clin Epidemiol 3(Suppl 1) 1–2 https://doi.org/10.2147/CLEP.S22509 PMID: 21912489 PMCID: 3168248

12. Jedy-Agba EE, Curado MP, and Oga E, et al (2012) The role of hospital-based cancer registries in low and middle income countries—the Nigerian case study Cancer Epidemiol 36(5) 430–435 https://doi.org/10.1016/j.canep.2012.05.010 PMID: 22704971 PMCID: 3438360

13. Chen JG, Chen HZ, and Zhu J, et al (2023) Cancer survival: left truncation and comparison of results from hospital-based cancer registry and population-based cancer registry Front Oncol 13 1173828 https://doi.org/10.3389/fonc.2023.1173828 PMID: 37350938 PMCID: 10284078

14. Gandaglia G, Bray F, and Cooperberg MR, et al (2016) Prostate cancer registries: current status and future directions Eur Urol 69(6) 998–1012 https://doi.org/10.1016/j.eururo.2015.05.046

15. Moore EC, Ioannou L, and Ruseckaite R, et al (2022) Hereditary endocrine tumor registries J Endocr Soc 7(3) bvac194 https://doi.org/10.1210/jendso/bvac194

16. Anazawa T, Miyata H, and Gotoh M Cancer registries in Japan: national clinical database and site-specific cancer registries Int J Clin Oncol 20(1) 5–10 PMID: 25376769

17. Ostrom QT, Price M, and Neff C, et al (2023) CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2016-2020 Neuro-Oncol 25(12 Suppl 2) iv1–99 https://doi.org/10.1093/neuonc/noad149

18. Amin MB, Edge SB, and Greene FL, et al (2018) AJCC Cancer Staging Manual (Springer International Publishing) p. 1032

19. Bray F, Colombet M, and Mery L, et al (2024) Cancer Incidence in Five Continents Volume XI [Internet] (Lyon: IARC Scientific Publications) [https://publications.iarc.fr/Book-And-Report-Series/Iarc-Scientific-Publications/Cancer-Incidence-In-Five-Continents%C2%A0Volume-XI-2021]

20. Piñeros M, Parkin DM, and Ward K, et al (2019) Essential TNM: a registry tool to reduce gaps in cancer staging information Lancet Oncol 20(2) e103–11 https://doi.org/10.1016/S1470-2045(18)30897-0 PMID: 30712797

21. Minicozzi P, Innos K, and Sánchez MJ, et al (2017) Quality analysis of population-based information on cancer stage at diagnosis across Europe, with presentation of stage-specific cancer survival estimates: a EUROCARE-5 study Eur J Cancer 84 335–353 https://doi.org/10.1016/j.ejca.2017.07.015 PMID: 28865261

22. Pung L, Moorin R, and Trevithick R, et al (2023) Determining cancer stage at diagnosis in population-based cancer registries: a rapid scoping review Front Health Serv [Internet] 3 1039266 [https://www.frontiersin.org/journals/health-services/articles/10.3389/frhs.2023.1039266/full] https://doi.org/10.3389/frhs.2023.1039266

23. Noone AM, Negoita S, and Schussler N, et al (2017) Medical record-documented TNM categories and stage group: feasibility of use for cancer surveillance J Regist Manag 44(2) 46–53

24. Badia RR, Wong D, and Ghandour R, et al (2021) Validation of testicular germ cell tumor staging in nationwide cancer registries Urol Oncol 39(12) 838.e1–838.e6 https://doi.org/10.1016/j.urolonc.2021.09.011 PMID: 34711464

25. Cecchini M, Framski K, and Lazette P, et al (2016) Electronic intervention to improve structured cancer stage data capture J Oncol Pract 12(10) e949–56 https://doi.org/10.1200/JOP.2016.013540 PMID: 27650837

26. Ellis DW and Srigley J (2016) Does standardised structured reporting contribute to quality in diagnostic pathology? The importance of evidence-based datasets Virchows Arch Int J Pathol 468(1) 51–59 https://doi.org/10.1007/s00428-015-1834-4

27. Cronin KA, Ries LAG, and Edwards BK (2014) The surveillance, epidemiology, and end results (SEER) program of the national cancer institute Cancer 120(Suppl 23) 3755–3757 https://doi.org/10.1002/cncr.29049 PMID: 25412387

28. Pearson ML, Ganz PA, and McGuigan K, et al (2002) The case identification challenge in measuring quality of cancer care J Clin Oncol 20(21) 4353–4360 https://doi.org/10.1200/JCO.2002.05.527 PMID: 12409335

29. Walters S, Maringe C, and Butler J, et al (2013) Comparability of stage data in cancer registries in six countries: lessons from the international cancer benchmarking partnership Int J Cancer 132(3) 676–685 https://doi.org/10.1002/ijc.27651

30. SEER [Internet] (2024) About the SEER Program (Bethesda: National Cancer Institute) [https://seer.cancer.gov/about/overview.html]

31. Cabasag CJ, Arnold M, and Piñeros M, et al (2021) Population-based cancer staging for oesophageal, gastric, and pancreatic cancer 2012-2014: international cancer benchmarking partnership survMark-2 Int J Cancer 149(6) 1239–1246 https://doi.org/10.1002/ijc.33679 PMID: 33990959

32. Brierley JD, Gospodarowicz MK, and Wittekind C (2017) TNM Classification of Malignant Tumours (Hoboken: John Wiley & Sons, Ltd) p. 272

33. IARC (2022) User’s Guide to Essential TNM (E TNM) Version 3, 8 Cancer Sites (Lyon: IARC) [https://gicr.iarc.fr/static/public/docs/Essential%20TNM%20Users%20Guide_Version%203_8%20sites%201%20Apr%202022_ENG.pdf] Date accessed: 04/10/2024

34. European Network of Cancer Registries 2002 The ENCR recommendations; condensed TNM for coding the extent of disease (Isra: ENCR)

35. Tyczynski JE, Démaret E, and Parkin DM (2003) Standards and Guidelines for Cancer Registration in Europe (Lyon: IARC Technical Publications) p. 97

36. Brierley J, Piñeros M, and Bray F, et al (2018) Essential TNM: a means to collect stage data in population-based registries in low- and middle-income countries J Glob Oncol 4(Supplement 2) 154s–154s https://doi.org/10.1200/jgo.18.46100

37. Smith SJ, Moorin R, and Taylor K, et al (2024) Collecting routine and timely cancer stage at diagnosis by implementing a cancer staging tiered framework: the Western Australian Cancer Registry experience BMC Health Serv Res 24(1) 770 https://doi.org/10.1186/s12913-024-11224-4 PMID: 38943091 PMCID: 11214229

38. Cancer Australia (2024) National Cancer Control Indicators; National cancer stage at diagnosis data (Strawberry Hills: Cancer Australia) [https://ncci.canceraustralia.gov.au/features/national-cancer-stage-diagnosis-data] Date accessed: 18/09/2024

39. Ruhl JL, Callaghan C and Schussler N (2023) Summary Stage 2018: Codes and Coding Instructions (Bethesda: National Cancer Institute)

40. National Cancer Institute (2024) SEER Registrar Staging Assistant [Internet] (Bethesda: National Cancer Institute) [https://staging.seer.cancer.gov/] Date accessed 04/10/2024

41. Lawrance S, Bui C, and Mahindra V, et al (2019) Assessing a modified-AJCC TNM staging system in the New South Wales Cancer Registry, Australia BMC Cancer 19(1) 850 https://doi.org/10.1186/s12885-019-6062-x PMID: 31462255 PMCID: 6714314

42. Gupta S, Aitken JF, and Bartels U, et al (2016) Paediatric cancer stage in population-based cancer registries: the Toronto consensus principles and guidelines Lancet Oncol 17(4) e163–e172 https://doi.org/10.1016/S1470-2045(15)00539-2 PMID: 27300676

43. Gupta S, Aitken J, and Bartels U, et al (2020) Development of paediatric non-stage prognosticator guidelines for population-based cancer registries and updates to the 2014 Toronto paediatric cancer stage guidelines Lancet Oncol 21(9) e444–e451 https://doi.org/10.1016/S1470-2045(20)30320-X PMID: 32888473

44. IACR (2025) Pediatric Cancer Stage Guidelines [Internet] (Lyon: IACR) [http://www.iacr.com.fr/index.php?option=com_content&view=article&id=153&Itemid=657]

45. European Network of Cancer Registries (2025) ENCR endorsement of the Toronto Childhood Cancer Stage Guidelines [Internet] (Isra: ENCR) | [https://encr.eu/news/encr-endorsement-toronto-childhood-cancer-stage-guidelines]

46. Mallon B, Kaboré R, and Couitchere L, et al (2023) The feasibility of implementing Toronto childhood cancer stage guidelines and estimating the impact on outcome for childhood cancers in seven pediatric oncology units in sub-Saharan Africa. A study from the Franco-African Pediatric Oncology Group Pediatr Blood Cancer 70(12) e30664 https://doi.org/10.1002/pbc.30664 PMID: 37732944

47. Evans SM, Ivanova K, and Rome R, et al (2023) Registry-derived stage (RD-Stage) for capturing cancer stage at diagnosis for endometrial cancer BMC Cancer [Internet] 23 1222 [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10714535/] https://doi.org/10.1186/s12885-023-11615-6

48. Brierley J, O’Sullivan B, and Asamura H, et al (2019) Global consultation on cancer staging: promoting consistent understanding and use Nat Rev Clin Oncol 16(12) 763–771 https://doi.org/10.1038/s41571-019-0253-x PMID: 31388125 PMCID: 7136160

49. Noone AM, Schussler N, and Negoita S, et al (2015) Availability of TNM staging data elements in the medical record and training needs assessment: results from the 2014 SEER training needs assessment for TNM study J Regist Manag 42(2) 40–47

50. Cunningham R, Sarfati D, and Hill S, et al (2008) An audit of colon cancer data on the New Zealand cancer registry N Z Med J 121(1279) 46–56 PMID: 18709047

51. Soerjomataram I, Ervik M, and Fox C, et al (2021) CanStaging+: an electronic staging tool for population-based cancer registries Lancet Oncol 22(8) 1069 https://doi.org/10.1016/S1470-2045(21)00188-1 PMID: 34339643

52. American College of Surgeons 2024 AJCC Staging Online [Internet] (Chicago: American College of Surgeons) [https://www.facs.org/quality-programs/cancer-programs/american-joint-committee-on-cancer/ajcc-staging-online/]

53. ICCP Portal (2024) TNM Cancer Staging (Android) [Internet] (Geneva: International Cancer Control Partnership, UICC) [https://www.iccp-portal.org/resources/tnm-cancer-staging-android]

54. Houston A, Williams S, and Ricketts W, et al (2024) Automated derivation of diagnostic criteria for lung cancer using natural language processing on electronic health records: a pilot study BMC Med Inform Decis Mak 24(1) 371 https://doi.org/10.1186/s12911-024-02790-y PMID: 39633397 PMCID: 11616170

55. Shah MP, Prichard L, and Glaspy JA, et al (2024) Novel approach to implementing natural language processing for clinical staging of non-small-cell lung cancer J Clin Oncol 42(16_suppl) e13624–e13624 https://doi.org/10.1200/JCO.2024.42.16_suppl.e13624

56. Warner JL, Levy MA, and Neuss MN, et al (2016) ReCAP: feasibility and accuracy of extracting cancer stage information from narrative electronic health record data J Oncol Pract 12(2) 157–8; e169-e167 https://doi.org/10.1200/JOP.2015.004622

57. Smith S, Trevithick RW, and Smith J, et al (2023) “Currently flying blind” Stakeholders’ perceptions of implementing statewide population-based cancer staging at diagnosis into the Western Australian cancer registry: a rapid qualitative process evaluation of the WA cancer staging project BMC Health Serv Res 23(1) 758 https://doi.org/10.1186/s12913-023-09662-7 PMID: 37454053 PMCID: 10349432

Related Articles

Afusat O Oduwoye, Elisha O Olabisi, Elizabeth F Ojo, Taiwo O Dosumu, Michael O Owoeye, Adelani Tijani, Deborah T Esan
Yanet Isabel Carrasco Rojas, Albert Gabriel Turpo Peqqueña, Natalia Camila Zenteno Tejada, Victoria Elena Quispe Pastor, Juan Augusto Martínez San Martín, Evelyn Maria Chambilla Huellcacure
Jeanine Justiniano, Ally H Mwanga, Daniel W Kitua, Nashivai E Kivuyo, Seif Wibonela, Cameron E Gaskill
Table of Contents
Table of Contents