Bridging gaps across the CLL care pathway – Part 1: From Signal to Stratification

November 4, 2025

By Léon Van Wouwe, Clinical Innovation Director, Volv Global

Early detection and personalised treatment in chronic lymphocytic leukaemia (CLL) remain a major challenge across healthcare systems. This article by Volv Global explores how AI-driven algorithms can identify disease signals earlier, guide smarter diagnostic pathways, and reduce inefficiencies in patient care.

This piece is Part 1 of the “Bridging Gaps Across the CLL Care Pathway” series. It focuses on the shift from signal detection to risk stratification, setting the stage for how algorithmic insight can reimagine early CLL care.

From Signal to Stratification – Reimagining Early CLL Care through Algorithmic Insight

Introduction: the silent burden of early CLL

Chronic lymphocytic leukaemia (CLL) is heterogeneous and often clinically silent until late. Roughly 70–80% of patients are asymptomatic at diagnosis; one-third may never require therapy. [1, 2] Yet in the U.S., over 200,000 people live with CLL, and the disease causes ~4,410 deaths annually. [1] The median age at diagnosis is ~70 years. Because of its indolent nature and low prevalence, U.S. guidelines do not endorse population screening, leaving diagnosis largely “accidental” or delayed. [3]

Delays or missed opportunities in detection mean that many patients present with an established disease burden, leaving little opportunity to act on subtle early signals. For pharmaceutical developers, this latency represents both a challenge and an opportunity: how might we shift detection earlier, stratify risk more precisely, and better align first-line therapeutic options with our ability to find most patients while they still have early-stage disease?

The status quo: diagnostic inertia and inefficiencies

Today, most CLL diagnoses arise from incidental lymphocytosis on a CBC or differential, followed by haematology referral and confirmatory flow cytometry (≥ 5 ×10⁹/L clonal B cells, sustained) with immunophenotyping. [1] Patients who present with nonspecific symptoms (fatigue, night sweats, low-grade fevers, recurring infections, lymphadenopathy) may traverse multiple outpatient encounters before evaluation. Because lymphocytosis has many benign causes, clinicians may not act until trends are evident.

This reactive workflow causes delays (weeks to months) and uneven referral patterns, especially when absolute lymphocyte count (ALC) is only modestly elevated or fluctuating. In non-academic settings, molecular and cytogenetic testing may not be readily available or may have long turnaround times, adding friction to early decision-making.

Algorithmic triage: making screening viable in a low-prevalence disease setting

The classic objection to screening in CLL is the low base rate: even a small false-positive rate can overwhelm downstream resources needed to confirm suspicion. But what if we reframed screening as smart triage using an AI-assisted flagging mechanism?
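
The base-rate objection can be made concrete with Bayes' rule. The sketch below is illustrative only: the prevalence, sensitivity, and specificity values are hypothetical assumptions chosen for the arithmetic, not figures from the cited studies, and the function names are our own.

```python
def positive_predictive_value(prevalence, sensitivity, specificity):
    """Bayes' rule: share of flagged patients who truly have the condition."""
    true_pos = prevalence * sensitivity
    false_pos = (1.0 - prevalence) * (1.0 - specificity)
    return true_pos / (true_pos + false_pos)


def number_needed_to_test(ppv):
    """Confirmatory tests performed per true case found."""
    return 1.0 / ppv


# Hypothetical inputs for illustration only; not taken from any cited study.
PREVALENCE = 0.001   # assumed: 1 in 1,000 of the screened population
SENSITIVITY = 0.90   # assumed
for specificity in (0.95, 0.99, 0.999):
    ppv = positive_predictive_value(PREVALENCE, SENSITIVITY, specificity)
    print(f"specificity={specificity:.3f}  PPV={ppv:.3f}  "
          f"tests per case={number_needed_to_test(ppv):.1f}")
```

Under these assumed inputs, the number of confirmatory tests per true case falls from roughly 57 at 95% specificity to about 2 at 99.9%, which is why a flagging step has to be highly specific for triage to stay manageable.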

Recent work demonstrates promise. A 12-variable random forest model, built from routine demographic and lab data (age, sex, ALC, WBC, platelet metrics), predicted development of abnormal lymphocytosis associated with CLL/MBL up to five years ahead, achieving AUC ≈ 0.92 (cross-validated AUC ≈ 0.935) and good sensitivity and specificity. [4] While not diagnostic, the model illustrates that latent signals may exist in standard laboratory test series.
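
For readers less familiar with the metric, AUC can be read as the probability that a randomly chosen patient who later develops the condition receives a higher model score than a randomly chosen patient who does not. The minimal sketch below computes it by pairwise comparison; the labels and scores are toy values invented for illustration and have no connection to the cited model or its data.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy data, invented for illustration: 1 = later developed lymphocytosis.
labels = [0, 0, 0, 0, 1, 0, 1, 1]
scores = [0.05, 0.10, 0.20, 0.35, 0.30, 0.15, 0.80, 0.90]
print(f"AUC = {roc_auc(labels, scores):.3f}")  # 14 of 15 pairs ranked correctly
```

An AUC near 0.92 therefore means the model ranks roughly nine in ten such pairs correctly; it does not by itself say how many flagged patients are true cases, which also depends on prevalence and the chosen threshold.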

Multiple reviews of machine learning (ML) in CLL (20 studies between 2014 and 2023) show applications in diagnosis, classification, and treatment guidance, with reported accuracies from ~83% to near 100%. Still, most remain proof-of-concept, centre-specific, and not broadly integrated. [5, 6]

If deployed at scale (e.g., embedded in laboratory pipelines or electronic health record (EHR) decision support), such models could flag a limited subset of patients for confirmatory flow cytometry, keeping the number needed to test manageable. For pharma, this unlocks earlier patient capture, better natural history studies, and enriched trial recruitment.

Risk stratification at diagnosis: the current burden

Once a patient is confirmed to have CLL, clinicians order a battery of molecular, cytogenetic, and immunogenetic assays: FISH (del13q, del11q, del17p, trisomy 12), TP53 sequencing, IGHV mutation status, β2-microglobulin, possibly broader NGS panels. These guide prognosis, time-to-first-treatment (TTFT), therapy selection, and in some cases trial eligibility. [1]

But this testing is expensive, logistically burdensome, and not uniform across U.S. practice settings, nor in many other major international healthcare settings. In community centres, access to high-quality molecular diagnostics or fast turnaround may be limited, delaying therapeutic decisions or forcing empirical choices. The redundancy and cost add nontrivial friction to real-world precision care.

How algorithmic risk scoring can lighten the load

Algorithmic risk models (built on routinely collected data) can help in two complementary ways:

Selective escalation – models can triage which patients merit full molecular workup, sparing low-risk individuals from expensive blanket testing.
Augmented prognostic scoring – models can provide a probabilistic estimate of TTFT or need for therapy (e.g., within two years) even before full biomarker data are available. For example, one explainable ML model used only demographics and standard laboratory test results to predict treatment requirement within two years. [7]

In CLL, unsupervised ML clustering of immunophenotype/genetic profiles has also been used to refine risk groups beyond classical staging. One study clustering 2,243 Rai 0–II patients generated novel continuous prognostic relationships missed by standard hierarchical models. [8]

As new first-line targeted therapies have differential efficacy and toxicity profiles, better upfront stratification becomes increasingly valuable. For example, BTK- and BCL2-inhibitor-based regimens have now supplanted chemoimmunotherapy as first-line therapy, and the presence of unmutated IGHV or TP53 aberrations influences the choice between them. [9]

Vision: a prediction-first paradigm in early CLL

Imagine a diagnostic continuum:

Routine lab data + patient metadata → AI flagging
Confirmatory phenotyping only for flagged patients
Algorithmic risk score (even pre-biomarkers) to guide further testing
Integrated decision paths: which biomarkers to order, intensity of clinical surveillance, first-line therapy recommendation
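
The four steps above can be sketched as a simple staged decision flow. This is a rough illustration under stated assumptions, not a description of any deployed system: the `Patient` fields, the `run_pathway` function, and all three cut-offs are hypothetical placeholders, the scores stand in for outputs of models that would need clinical validation, and the "sustained" part of the diagnostic criterion is omitted for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class Patient:
    patient_id: str
    flag_score: float            # output of a hypothetical lab-data flagging model
    clonal_b_cells: float = 0.0  # x10^9/L from flow cytometry, if performed
    risk_score: float = 0.0      # output of a hypothetical pre-biomarker risk model
    actions: list = field(default_factory=list)


# Placeholder cut-offs; real values would need clinical validation.
FLAG_THRESHOLD = 0.80
CLONAL_B_CELL_THRESHOLD = 5.0   # >= 5 x10^9/L clonal B cells
HIGH_RISK_THRESHOLD = 0.60


def run_pathway(patient):
    """Walk one patient through the steps and record each decision."""
    # Step 1: AI flagging on routine lab data and patient metadata.
    if patient.flag_score < FLAG_THRESHOLD:
        patient.actions.append("routine care")
        return patient
    # Step 2: confirmatory phenotyping only for flagged patients.
    patient.actions.append("flow cytometry")
    if patient.clonal_b_cells < CLONAL_B_CELL_THRESHOLD:
        patient.actions.append("monitor")
        return patient
    # Steps 3 and 4: the risk score guides further testing and surveillance.
    if patient.risk_score >= HIGH_RISK_THRESHOLD:
        patient.actions.append("full molecular workup")
        patient.actions.append("close surveillance")
    else:
        patient.actions.append("standard surveillance")
    return patient


# Invented example patients.
cohort = [
    Patient("A", flag_score=0.20),
    Patient("B", flag_score=0.91, clonal_b_cells=2.1),
    Patient("C", flag_score=0.95, clonal_b_cells=8.4, risk_score=0.72),
]
for p in cohort:
    print(p.patient_id, run_pathway(p).actions)
```

Keeping each stage as an explicit, recorded decision is one way to make such a flow auditable, which matters if explainability is a design goal.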

Such a paradigm could reduce diagnostic delay, rationalise molecular testing, and enrich pharma pipelines with earlier-stage, better-stratified patients. Volv Global (or similar methodology platforms) could host modular risk and triage engines, embed explainability and uncertainty quantification, and integrate into EHR and lab systems as decision support.

In concluding this first part, the imperative is clear: closing the diagnostic gap and intelligently stratifying risk at the outset is not only clinically logical – it is foundational to next-generation precision haematology.

About the author

Léon van Wouwe has 20+ years’ global experience in clinical development and operations, uniting data science with pharma and research. He drives cross-functional collaboration to advance innovative treatments.

References

Key Takeaways

Early CLL is often asymptomatic; detection delays remain common.
Machine learning models using routine data can detect early CLL signals years before diagnosis.
AI-based triage may enable low-burden screening in real-world settings.
Algorithmic risk scoring helps target testing and personalisation of treatment.
Volv Global's data-science expertise positions it at the forefront of precision haematology innovation.

Frequently Asked Questions

1. What is CLL?
CLL is a type of blood cancer affecting B cells. It progresses slowly and often remains undetected until advanced stages.
2. How can AI contribute to early CLL detection?
AI algorithms can analyse demographic and lab data to identify subtle risk patterns, supporting earlier intervention.
3. Why is risk stratification important?
It helps clinicians decide when and how aggressively to treat, improving outcomes and resource efficiency.
4. What challenges exist in early CLL diagnosis?
Limited screening and variable access to molecular testing delay diagnosis and therapy initiation.
5. What is Volv Global's role?
Volv Global integrates AI into healthcare workflows, improving precision diagnosis and accelerating therapeutic development.

Photo by skynesher on iStock.
