A Sequential Proteomic Relay Defines a Decade-long Pre-diagnostic Window for Pancreatic Cancer
1Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, U.S.A.
2Severn Health Solutions, Severna Park, MD, U.S.A.
Abstract
Introduction
Pancreatic ductal adenocarcinoma (PDAC) remains one of the deadliest human malignancies, with a 5-year survival rate below 12%, a figure that has shown only modest improvement over the past several decades. This dismal prognosis is driven primarily by late-stage diagnosis: more than 80% of patients present with locally advanced or metastatic disease, at which point curative surgical resection is no longer feasible. In contrast, patients diagnosed with stage I disease can achieve 5-year survival rates exceeding 40-50%, underscoring that early detection represents the single most powerful lever for improving outcomes in PDAC.
Despite this imperative, effective population-level screening for pancreatic cancer does not exist. Current clinical practice relies heavily on imaging-based surveillance of narrowly defined high-risk groups (
These limitations reflect a broader conceptual challenge in pancreatic cancer early detection: most biomarkers in clinical use are designed to detect the tumor itself, rather than the biological process of tumorigenesis. Emerging genomic and evolutionary models of PDAC suggest that malignant transformation is a prolonged, multistep process unfolding over a decade or more, beginning with early oncogenic mutations and progressive remodeling of the pancreatic microenvironment long before invasive cancer becomes radiographically apparent (5). This extended latent phase represents a largely unexplored opportunity for early interception if biomarkers exist which can sensitively track these preclinical biological changes.
Recent advances in high-throughput plasma proteomics offer a promising avenue to address this gap. Proximity Extension Assay (PEA)–based platforms, such as Olink, enable simultaneous quantification of hundreds to thousands of circulating proteins with high sensitivity and reproducibility (6). Unlike genomics, which captures static inherited or somatic alterations, proteomics reflects dynamic, system-level physiology, integrating signals from stromal remodeling, immune activation, and tissue injury. This makes plasma proteomics uniquely suited to detect the earliest systemic consequences of a developing pancreatic neoplasm.
However, most prior proteomic studies in PDAC have been cross-sectional or limited to samples collected shortly before diagnosis, constraining their ability to resolve temporal trajectories. As a result, it remains unclear whether pancreatic cancer is preceded by a gradual, continuous biomarker drift or by discrete, staged biological transitions – and critically, how early such signals might be detectable in asymptomatic individuals.
The scope of the work was threefold. First, we aimed to characterize the long-term temporal behavior of circulating proteins across the extended preclinical phase of PDAC, rather than focusing on markers proximate to diagnosis. Second, we sought to determine whether distinct phases of disease evolution – such as early stromal remodeling and later immune dysregulation – can be inferred from proteomic trajectories, thereby providing mechanistic insight into PDAC pathophysiology before overt tumor burden develops. Third, we evaluated whether this temporally resolved proteomic framework can inform a risk-stratification paradigm that addresses key limitations of current diagnostic approaches, including late detection, reliance on tumor burden–dependent markers, and reduced sensitivity in genetically defined subpopulations. By reframing PDAC detection as a dynamic, staged biological process, this study aimed to establish a conceptual foundation for earlier surveillance strategies that align biomarker interpretation with the underlying biology of disease progression.
Materials and Methods
Candidate biomarkers were selected using a dual-screening approach: Linear mixed models were used to identify early continuous markers (
To reconstruct the pre-diagnostic timeline of biomarker expression, we performed a longitudinal analysis spanning 0 to 15 years prior to diagnosis. Non-parametric Locally Estimated Scatterplot Smoothing (LOESS) regression was used to visualize the continuous trajectories of CTHRC1 and RELT. 95% confidence intervals were generated to assess the stability of the signal, particularly at early timepoints (≥9 years) where sample density was lower.
To evaluate the clinical utility of the panel in distinguishing imminent from distant disease, participants were stratified into two risk windows. Near Diagnosis (High Risk): 0-5 years between blood draw and diagnosis (n=24). Far from Diagnosis (Baseline/Low Risk): 5-10 years between blood draw and diagnosis (n=20). Participants diagnosed >10 years post-recruitment were excluded from the binary classification analysis to ensure distinct separation of clinical phenotypes. Logistic regression models were constructed to predict the “Near Diagnosis” status. We evaluated single-protein models (CTHRC1 or RELT alone) and a combined multi-modal panel. The final combined model incorporated CTHRC1, RELT, biological sex, and Age at Recruitment. Age at Recruitment was explicitly calculated (age at diagnosis minus years to diagnosis) to act as a proper time-independent covariate, preventing confounding bias associated with the natural aging process during the lag period. Receiver operating characteristic (ROC) curves were generated using the pROC package in R. Model performance was assessed using the area under the curve (AUC). The optimal probability cutoff was determined using Youden’s Index to maximize the sum of sensitivity and specificity. A confusion matrix was generated at this optimal threshold to calculate final accuracy, sensitivity, and specificity (
Results
Our analysis revealed a staggered rise in circulating proteins. CTHRC1, a marker associated with extracellular matrix remodeling, exhibited the earliest shift, rising 8.93 years before clinical diagnosis. This was followed by RELT, which showed a sharper upward inflection at 5.60 years. These findings suggest a staged biological progression, where early tissue remodeling precedes more acute pro-inflammatory and tumor-driven signals (
The combined proteomic panel (CTHRC1, RELT, Age, and Sex) demonstrated predictive power (R2=0.434). To evaluate clinical utility, we tested the model’s ability to distinguish between the early “Far” window (5-10 years) and the acute “Near” window (0-5 years). The panel achieved a Combined AUC=0.814, significantly outperforming individual markers (CTHRC1 AUC=0.69; RELT AUC=0.639;
Discussion
In this study, we identified and validated a novel “Proteomic Relay” signature for the early detection of PDAC. Our longitudinal analysis of 62 pre-diagnostic samples reveals that PDAC development is characterized by a sequential handover between stromal and tumoral signals. Specifically, CTHRC1 acts as a “Stromal Sentinel,” elevating approximately 9 years prior to diagnosis, while RELT serves as a late-stage “Confirmatory Marker,” surging in the final 2 years. When combined with age and sex, this multi-marker panel achieved an AUC=0.814 in distinguishing imminent (0-5 years) from distant (5-10 years) disease, confirming its potential clinical utility as a risk-stratification tool.
The temporal divergence of these two proteins offers insight into the latent biology of PDAC. The early rise of CTHRC1 suggests that stromal remodeling (the “soil”) may precede detectable epithelial transformation (the “seed”) by nearly a decade. This aligns with recent genomic models suggesting that driver mutations (
The early pre-diagnostic elevation of CTHRC1 observed in our cohort is mechanistically supported by recent evidence identifying it as a master regulator of the pancreatic tumor microenvironment (TME). While our data positions CTHRC1 as a temporal “sentinel” rising years before diagnosis, Yin
The sharp “confirmatory” spike of RELT observed in the final two years prior to diagnosis aligns with the establishment of a profoundly immunosuppressive tumor microenvironment, a hallmark of advanced PDAC. While CTHRC1 drives the structural remodeling of the stroma, our data suggests RELT likely facilitates immune escape. A recent comprehensive review by Cusick
Current screening relies heavily on carbohydrate antigen 19-9 (CA19-9), which has significant limitations that our panel theoretically addresses (3). First, CA19-9 is a marker of tumor burden and ductal obstruction, typically elevating only when the tumor is large enough to be incurable. Our data indicates that CTHRC1 provides a lead time of up to 7 years during the “Diagnostic Blind Spot” where CA19-9 levels would remain baseline.
Our panel overcomes the genetic limitations of CA19-9. Approximately 10% of the Caucasian population are Lewis-negative non-secretors (genotype le/le), lacking the functional FUT3 enzyme required to synthesize the CA19-9 epitope (4). In these individuals, CA19-9 remains undetectable regardless of tumor size, leading to dangerous false negatives. Because CTHRC1 and RELT are structural and immune-regulatory proteins independent of fucosyltransferase activity, this panel remains valid in the Lewis-negative sub-population, potentially eliminating this critical disparity in screening equity.
We propose that this panel is best utilized not as a standalone diagnostic, but as a “Triage Filter” for high-risk surveillance. Given the cost and invasiveness of definitive imaging, a blood-based test with 81% accuracy could effectively enrich the screening pool. Patients showing the “Early Stromal” profile (High CTHRC1/Low RELT) could be prioritized for annual monitoring, while those showing the “Acute Relay” profile (High CTHRC1/High RELT) would warrant immediate imaging intervention.
Conclusion
The CTHRC1/RELT panel represents a paradigm shift from detecting the tumor to detecting the process of tumorigenesis. By targeting the sequential biology of stromal activation and immune escape, this panel offers a viable strategy to identify patients during the curative window, years before standard markers trigger an alarm.
Conflicts of Interest
The Authors declare that they have no competing interests.
Authors’ Contributions
Steven Lehrer conceived and designed the study, obtained access to the UK Biobank resource, performed all statistical and computational analyses, generated the figures and tables, interpreted the results, and drafted the manuscript. Peter H. Rheinstein contributed to study design and conceptual development, assisted with interpretation of proteomic and biological findings, provided critical intellectual input, and revised the manuscript for important scientific content. Both Authors reviewed and approved the final version of the manuscript and agree to be accountable for all aspects of the work.
Artificial Intelligence (AI) Disclosure
Artificial intelligence–assisted tools were used solely to support the generation and refinement of computer code for data processing, statistical analysis, and figure preparation. All analyses were designed, executed, validated, and interpreted by the Authors. No AI tools were used for data generation, data interpretation, or to write substantive portions of the scientific manuscript.