2.1 Patients
This retrospective study was conducted at two institutions: Hubei Cancer Hospital (Center 1) and Union Hospital, Wuhan (Center 2). Eligible patients were those with colorectal cancer (CRC) who received targeted therapy between January 2018 and December 2022 at Center 1, and between January 2020 and December 2022 at Center 2.
Inclusion criteria(Figure 1) were defined as: (1) pathologically confirmed colorectal adenocarcinoma; (2) complete clinical, pathological, and follow-up records available; (3) presence of liver metastases at initial diagnosis (confirmed by contrast-enhanced imaging or pathology); (4) availability of baseline liver MRI (performed within 1 month before initiating targeted therapy) and follow-up liver MRI (performed within 3 months after starting targeted therapy), with sequences adequate for radiomic analysis (including T1-weighted contrast-enhanced and T2-weighted sequences).
Exclusion criteria included: (1) incomplete or unevaluable MRI datasets (e.g., missing key sequences, insufficient image quality for lesion segmentation); (2) history of prior immunotherapy or local treatments targeting liver metastases (e.g., radiofrequency ablation, transarterial chemoembolization, or radiotherapy) before the initiation of targeted therapy; (3) presence of extrahepatic metastases (except for regional lymph nodes, which were not considered exclusionary); (4) target liver lesions <1 cm in maximum diameter (per RECIST 1.1 criteria, as smaller lesions are prone to measurement bias and unreliable feature extraction); (5) significant image artifacts (e.g., motion, breathing, or susceptibility artifacts) precluding accurate lesion segmentation or feature calculation.
For patients with multiple liver metastases, the three largest target lesions (per RECIST 1.1 guidelines) were selected for analysis to align with standard clinical response assessment practices. Treatment regimens are detailed in Supplementary Table 1.
2.2 MRI Imaging Protocol
Liver MRI was performed using standardized protocols on multiple 3.0T and 1.5T scanners across both centers. Sequences included T2-weighted imaging (T2WI), T1-weighted imaging (T1WI), and contrast-enhanced T1WI. Gadopentetate dimeglumine was used as the contrast agent. Detailed scanning parameters are provided in Supplementary Table 2.
2.3 Clinical Data Collection and Imaging Analysis
Clinical characteristics potentially associated with patient prognosis were retrospectively collected, including gender, age, and baseline carcinoembryonic antigen (CEA) and glycocalyx antigen 19-9 (CA19-9). Tumor markers were categorized into two groups based on serum levels: within normal range or above normal range (normal range: CEA ≤ 5 ng/ml, CA19-9 ≤ 40 ng/ml).
Multi-sequence MRI images were jointly evaluated by two physicians (RZ.X and W.X, with 5 and 3 years of abdominal imaging diagnosis experience, respectively). In case of discrepancies, a third physician (Physician 3, with 20 years of abdominal imaging experience) made the final decision. Up to three largest lesions per patient were evaluated for the following imaging features:(a) Number of liver metastases (≤ 3 or > 3); (b) Diameter (cm); (c) Maximum cross-sectional area (cm²); (d) Volume (cm³); (e) Growth pattern[17]; (f) Metastasis homogeneity (homogeneous/heterogeneous); (g) Hepatic capsular retraction status; (h) Tumor related to adjacent vein (TRTAV); (i) Prominence of rim enhancement; (j) Clarity of boundary with surrounding liver parenchyma; (k) Regularity of lesion morphology.
Imaging Feature Assessment Principles(Figure2):
- Diameter, maximum cross-sectional area, and volume were obtained by manual 3D ROI segmentation of the entire tumor using 3D Slicer software (Version 5.3.0, open-source software, https://www.slicer.org/) by two radiologists (RZ.X and W.X), with calculations performed by the software.
- Growth patterns were classified into four types referring to Cai[17]:Smooth Type (ST): Round or round-like tumor with a definite boundary and without protrusion at the edge of the lesion; Rough Type (RT): Tumor with multiple sharp-angled protrusions at the edge of the lesion, without a clear boundary with the surrounding liver parenchyma; Focal Extranodular Protuberant (FEP): Non-nodular tumor with one or more local protrusions at the edge of the lesion; Nodular Confluent (NC): Multiple nodules fused with each other, and each nodule had a clear outline.
- Homogeneous: Signal intensity variation across the tumor (excluding necrotic cores) ≤10% of the mean intensity, as measured by 3D Slicer’s built-in intensity histogram tool. Visually, no distinct areas of hypo- or hyper-enhancement (relative to the tumor’s mean signal) were observed.
Heterogeneous: Signal intensity variation >10% of the mean intensity, with visually distinct regions of hypo-enhancement (e.g., necrosis) or hyper-enhancement (e.g., viable tumor).
- Hepatic capsular retraction: Focal indentation of the liver capsule caused by the tumor, identified on PVP or T2-weighted images.
- Positive TRTAV: Defined as (1) direct contact between the tumor and portal/hepatic vein trunks or their ≥2nd-order branches (distance between tumor margin and vessel wall <1 mm); or (2) visible intraluminal tumor thrombus or tumor vessels within the vein lumen, confirmed by lack of contrast enhancement in the affected segment.
Negative TRTAV: No contact with major veins, or distance between tumor and vessel ≥1 mm without intraluminal involvement.Prominent rim enhancement was defined as >75% of the tumor edge showing signal intensity significantly higher than surrounding liver parenchyma on axial images.
- Clear boundary: ≥75% of the tumor circumference shows a distinct demarcation between the tumor and adjacent liver parenchyma, with no blurring or intermingling of signal intensities.
- Regular morphology: Tumor contour approximates an ellipse, with no protrusions >5 mm in height or depressions >3 mm in depth relative to the best-fit ellipse .
2.4 Delineation of ROI and Extraction of Radiomic Features
The workflow of radiomic feature extraction is shown in Figure 3. ROIs were delineated on portal venous phase images of contrast-enhanced MRI at baseline and the first follow-up after targeted therapy. For patients with multiple lesions, up to three largest lesions (by diameter) were selected for ROI delineation. Two radiologists (RZ.X and W.X) manually segmented 3D ROIs of the entire tumor; peritumoral ROIsweregenerated by expanding the original ROI by 3 mm and manually correctingfor extrahepatic structures.
2.5 Image Preprocessing
All original MRI images underwent preprocessing to minimize central effects from different institutions and scanners. N4 bias field correction was performed to reduce image intensity inhomogeneity caused by magnetic field non-uniformity. To standardize voxel spacing and enhance texture resolution:
1.All ROIs were normalized using μ±3σ (μ = mean intensity within ROI; σ = standard deviation)
2.Gray-scale quantization was applied to reduce computation time and improve signal-to-noise ratio of texture results
3.Trilinear interpolation was used to isotropically resample images to 1×1×1 mm (x, y, z) voxel dimensions, preserving the proportion and orientation of 3D features
2.6 Feature Extraction
Features were extracted from intra-tumoral ROI (ROIIntra), peri-tumoral ROI (ROIPeri), and combined intra-peritumoral ROI (ROICombined) before and after targeted therapy using the Python Radiomics package. Delta radiomic features were calculated as: Δfti=fti-ft0, where fti denotes the feature value at time point ti, and ft0 denotes the baseline feature value.
2.7 Reliability Assessment
MRI data of 30 patients were randomly selected to calculate intra-observer and inter-observer correlation coefficients (ICC) for evaluating feature reliability. Inter-observer consistency was assessed by independent ROI segmentation of two radiologists at the same period. Intra-observer reproducibility was evaluated by repeated segmentation of one radiologist (RZ.X) two weeks later. Features with ICC > 0.75 were selected for further analysis (Supplementary Table 3).
2.8 Model Building
Before feature selection, the patient features were normalized using the Z-score formula: (x-μ)/σ, where x denotes the feature value, μ is the mean of all patient features, and σ is the corresponding standard deviation. The training cohort adopted a two-step feature selection approach.
First, Levene's test was performed to assess the homogeneity of variance for featuresand features with p-values > 0.05 were excluded. Second, the minimum redundancy maximum relevance (mRMR) method was used to select the top 5 features with high relevance and low redundancy.Finally, Six radiomics models (baseline and delta, intra/peri/combined) were constructed using logistic regression and 5-fold cross-validation. The best-performing model was selected. Univariate and multivariate Cox regression identified independent predictors of OS, which were used to build the final hybrid model.
2.8 Patient Follow-up
The overall survival (OS) of patients is defined as the time from initiation of targeted therapy to death or last follow-up.
2.9 Statistical Analysis
Statistical analyses were performed using Python (v3.9.6) and IBM SPSS Statistics 26.0. Continuous variables are presented as median (IQR); categorical variables as counts and percentages. Normality was tested using Kolmogorov-Smirnov test. Baseline characteristics were compared using Mann-Whitney U test/t-test or Chi-square test. Feature reliability was assessed using intraclass correlation coefficients (ICC > 0.75). Feature selection employed minimum redundancy maximum relevance (mRMR) algorithm. Model performance was evaluated using ROC/AUC with 95% confidence intervals, calibration curves, and decision curve analysis. Survival analysis used Kaplan-Meier method and Cox proportional hazards regression with proportional hazards assumption verification. Internal validation employed 5-fold cross-validation; external validation used independent cohort. SHAP analysis provided model interpretability. Statistical significance was set at p < 0.05.