Patients and study design
Patient-level data from REALYSA (25) and LNH09-7B (5) databases were retrieved for SCA patients, and data from SENIOR trial (8) were used for validation. REALYSA is a French real-world multicentric observational cohort recruiting newly diagnosed lymphoma patients since November 2018. LNH09-7B was a phase II clinical trial assessing efficacy of Ofatumumab and miniCHOP preceded by a pre-phase treatment (vincristine and oral prednisone) as first line treatment of DLBCL patients ≥ 80 y.o, which resulted in a 2-years OS of 64.7% [95% CI: 55.3–72.7]. SENIOR trial was a phase III RCT assessing RminiCHOP vs. R-Lenalidomide-miniCHOP, with a pre-phase treatment, as first line treatment of DLBCL patients ≥ 80 y.o, with similar inclusion criteria as LNH09-7B. This study failed to show an improvement in OS with the addition of lenalidomide (2-years OS of 66% for RminiCHOP versus 65.7% for R²miniCHOP arm). Patients from LNH09-7B had their treatment started from June 2010 to November 2011, and those from SENIOR from October 2014 to September 2017.
From REALYSA cohort, we included patients aged ≥ 80 y.o., treated with RminiCHOP combination as first line treatment, included in the cohort from December 2018 to 31 December 2021. Patients with performans status (ECOG) of 3 or 4 and Ann Arbor stage I were not included to match SENIOR inclusion criteria. Data were exported from the registry the 25th of June 2024.
Clinical trials patients with non-compliant diagnoses (other than DLBCL or high-grade B cell lymphoma (HGBL)) according to anatomopathological centralized reviews realized in each trial or local diagnosis for REALYSA were excluded.
The study was designed as follows: first step was to build a mixed SCA (“Mixed SCA 1”) from real-world data (REALYSA) and clinical trial data (LNH09-7B) adjusted on SENIOR control arm and to evaluate if it could mimic the internal control arm from SENIOR. Second step was to build a mixed SCA (“Mixed SCA 2”) adjusted on experimental arm of SENIOR, to evaluate if we could replicate the efficacy results from SENIOR study by switching the internal control arm by the Mixed SCA 2. For each step, populations were balanced as described below.
Primary and only endpoint for arms comparison after weighting was OS, measured from date of inclusion (or date of diagnosis in REALYSA) to date of death. As a high number of REALYSA patients had a limited follow-up period, we chose to censor all patients at 24 months for all cohorts.
Matching procedures:
Propensity scores (PS) were estimated for each patient included using logistic regression with following covariates : sex, age (spline), Ann Arbor stage (I-II versus III-IV), Performans Status (ECOG) (0 vs. 1 vs. 2), number of extranodal sites involved (< 2 versus ≥ 2), international prognostic index (IPI) score (0–2 versus 3–5), B symptoms, lactate dehydrogenase (LDH) level (normal versus over the upper limit of normal (ULN)), bulky mass > 10cm, albumin level in gram per litter (g/L) (spline).
For comparison between the Mixed SCA 1 and SENIOR internal control arm, PS allows to estimate the probability for a patient to receive “RminiCHOP via SENIOR”. For comparison between the Mixed SCA 2 versus SENIOR experimental arm, PS allows to estimate the probability for a patient to receive R²miniCHOP.
Then, patients’ covariates between arms were weighted using the stabilized inverse probability of treatment weighting (sIPTW) statistical approach, the probability of treatment being reflected by PS (26).
To avoid positivity violations, patients with extreme PS (below 0.1 or over 0.9) were excluded (27). After matching procedures, standardized mean differences (SMD) were estimated between arms for each covariate included for PS estimation to check balance in covariates’ distributions. With the statistical approach used, sIPTW allowed to estimate the average treatment effect (ATE).
Missing data management:
Due to the high number of missing data for patients from REALYSA for some covariates, multiple imputation method “across” was performed (15 imputations by patient) (28, 29). That generated 15 complete datasets. PS were calculated for each of the datasets. Then, for each patient, the median of the 15 PS was used for weighting. The SMD presented in this manuscript were calculated on real data, i.e. before imputation. With the “across” missing management method, outcome analyses are performed on one dataset.
Statistics for outcome analyses:
As first objective was to assess if the Mixed SCA 1 could mimic the internal control arm of SENIOR, Hazard Ratios (HR) 95% confidence intervals were expected to include 1. The second objective was to reproduce SENIOR results by replacing the internal control arm by the Mixed SCA 2, HR were expected to overlap with the results obtained in SENIOR trial (HR [95%CI] of 0.996 [0.66–1.51]), including 1 in the HR 95%CI.
OS curves were generated using Kaplan-Meier method, and OS curves were compared with log-rank test. HR and 95% confidence intervals (CI) were calculated using Cox proportional hazards model. One-sided p-value below 0.05 was considered significant. Analyses were performed using SAS software 9.4.
Sensitivity analyses:
Sensitivity analyses were performed using data coming only from REALYSA cohort to evaluate the possibility to build a well-balanced SCA using only real-world data (REALYSA-SCA), with the same methodology used for the mixed SCA building.
Additionally, we performed sensitivity analysis using different missing data management methods on Mixed SCAs analyses or REALYSA-SCA analyses. “Complete cases” method excludes patients with missing data. For “within” method, analyses are performed on each imputed dataset (15 analyses) and results were combined using Rubin’s rules (28, 29).