Patient cohort and TMA-contruction
We have previously described a nested case-control cohort of women with DCIS with clinicopathological data available (21). Briefly, the cancer registries of three health care regions in Sweden were used to identify women treated for DCIS between 1992 and 2012 (n = 6 964). Cross-linking with the National Cause of Death Registry identified women who later died from breast cancer (cases, n = 95). Controls were selected using incidence density sampling, from the whole population of women with DCIS. Controls had to be alive and without distant metastasis of breast cancer at the time of death of the corresponding case (n = 318). Clinical data was gathered from medical records. In this cohort, tumour size larger than 25 mm and positive margin status were associated with increased risk of breast cancer death, while surgical treatment and radiotherapy were not. From the surgical specimens of the women in the cohort, tumour tissue was obtained for whole slide sectioning and construction of tissue microarrays (TMA) for 65 of the 95 cases (68%) and 195 of 318 controls (61%). Whole slides and TMA-slides were centrally evaluated by a specialized breast pathologist (G.R) for DCIS grade, presence of comedo necrosis, amount of tumour infiltrating lymphocytes (TILs), as well as expression of ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and Ki67 (22). For the present study there was sufficient tumour tissue in the TMAs for analysis of collagen IV in 43 cases and 119 controls. (Fig. 1)
Immunohistochemistry for collagen IV
Tissue sections were stained with rabbit anti-collagen IV (AB748, Millipore, Billerica, United States) diluted 1:50 in blocking buffer for 2 hours in room temperature. Sections were then washed in phosphate buffered saline (PBS) and incubated with the biotinylated secondary antibody, diluted 1:200 in blocking buffer, for 30 minutes in room temperature. Finally, slides were washed in PBS followed by addition of diaminobenzidine tetrahydrochloride (DAB) as chromogen.
Expression of collagen IV was scored separately in the compartment closest to the myoepithelial/stromal interface, defined as within 10 µm of the interface and dubbed periductal collagen IV, and in the rest of the stroma, henceforth called stromal collagen IV. Each compartment was scored on a scale 0–3 (0 = no, 1 = minimal, 2 = moderate and 3 = strong expression). Scoring was done by two independent researchers (G.R and M.J), blinded to outcome at the time of scoring. Discrepant cases were reviewed jointly, and a consensus score was set. There were one to six biopsies for each patient in the TMA, and in cases were the expression of collagen IV within a compartment differed between biopsies the maximum score was used for the final analyses.
Cell cultures and migration experiments
Three different breast cancer cell lines were used. An ER+/HER2-cell line (MCF-7; ATCC HTB-22), an ER-/HER2- cell line (MDA-MB-231; ATCC HTB-26) and a ER-/HER2 + cell line (JIMT-1; RRID:CVCL_2077). Cells were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM) 4,5% glucose with 10% foetal bovine serum (FBS) and 1% penicillin/streptomycin. For migration experiments 96-well plates were coated with either DMEM or collagen IV (ab7536 Abcam; 1 mg/mL) diluted in DMEM to a concentration of 29 ug/mL. Cells were seeded at a density of 5000 cells/well (MCF-7), or 2500 cells/well (MDA-MB-231 and JIMT-1). After seeding the cells were left to adhere for 4–6 hours, then washed with PBS. New DMEM was added, and the plates were inserted to the Holomonitor M4, which uses quantitative phase imaging to automatically detect and measure live cells without labelling.(23) We used the adaptive Gaussian algorithm of detection to minimize background noise from the coating. Images were captured every 15 minutes for 24 hours. In each experiment, six wells per breast cancer cell line were used, three with and three without collagen IV, respectively. Three fields of view were analysed for each well, each field of view containing approximately 50–200 cells. The speed of individual cells between each consecutive captured image frame (distance travelled/time between frames) was calculated by the built-in software of the HoloMonitor, as well as the average speed of all cells of each type between consecutive frames. The experiment was repeated thrice and the values of the resulting averages for all experiments were used for the final analyses. After all experiments were finished, all three cell lines tested negative for potential contamination of various Mycoplasma species.
Statistical analysis
Odds ratios (OR) for dying of breast cancer in the nested case control cohort were calculated using conditional logistic regression and all analyses adjusted for time at risk. The following clinical variables were used: Age, tumour size (< 25 mm or ≥ 25 mm, as in previous studies on this cohort (21)), margin status (negative vs positive, uncertain or missing), mode of detection (screening vs non-screening) and surgical treatment (breast conserving therapy only (BCS), BCS with radiotherapy (BCS + RT) or mastectomy). Moreover the following biomarkers were used: Nuclear grade according to Holland (24), comedo necrosis (absent vs. present) and ER and PR expression (positive if ≥ 10% of nuclei stained, in keeping with national guidelines for invasive carcinoma) (25). Periductal TILs were categorized as 0–5%, 10% or ≥ 20% according to international guidelines (26). The expression of both stromal and periductal collagen IV as described above was simplified to a two-grade scale for the final analyses with 0–1 termed “low” and 2–3 termed “high”. Pearson Chi-square was used to test for independence between variables. The test for interaction between ER-status and outcome was conducted by adding an interaction term in the conditional logistic regression model.
Loss of material in sectioning resulted in missing values of the case response variable, making some strata entirely unused. To maximise data use, strata with cases treated at similar timepoints were fused. This gave a total of 32 strata, compared to the original 66. For the adjusted analyses, missing values for variables other than either collagen IV variable were imputed using multiple imputation in 20 datasets estimated with chained equations. For the imputation model, the variables in the conditional logistic regression models were used, including the responses periductal collagen IV and stromal collagen IV as well as time at risk and year of diagnosis. Proportion of imputed values were 16% for tumour size, 6% for ER-status and less than 5% for all other variables.
For comparison of means of average migration speed in the in vitro experiments, independent samples t-test was used.
Conditional logistic regression analyses, imputation and tests for interaction were performed in StataIC15.1 and all other analyses in SPSS® version 23 (IBM, Armonk, NY, USA).