We investigated the performance of the Infinium MethylationEPIC v2.0 microarray (EPIC v2.0) using a combination of fragmented (average sizes of 350, 230, 165, and 95 bp) and low-input (100, 50, 20, 10 ng) DNA material. Supplementary Figure 1 shows the obtained fragment size distributions of the four DNA sample sizes. These distributions are represented by Gaussian-like curves with elongated tails extending towards longer DNA fragments (right skewed), with shorter DNA fragment samples exhibiting a narrower distribution. The obtained DIs for the DNA samples with average fragment sizes of 350, 230, 165, and 95 bp were 2.09, 3.44, 8.88, and 8008.45, respectively. Supplementary Figure 2 illustrates the relationship between the average DNA fragment size and DI.
Quality control of degraded DNA methylation data
It was key to carry out a thorough QC procedure of the Illumina EPIC v2.0 microarray data to ensure reliable results given that samples of suboptimal DNA quality and quantity were employed.
Illumina EPIC v2.0 microarray utilizes beads with multiple copies of 50 bp oligonucleotide probes targeting specific loci in the genome. Once a DNA fragment hybridizes with its complementary probe, single-base extension (SBE) incorporates labeled nucleotides (green or red), which emits a signal. The intensities of the signals from probes targeting a given CpG site provide information about its DNAm level. Non-specific binding of negative control probes and fluorescence intrinsically associated with the microarray surface generate background signal intensities, which could interfere with the analysis if not properly removed. Figure 2A shows the relationship between the two mean background signal intensities (green and red) across all samples. Degraded samples with average DNA fragment sizes of 350 bp, 230 bp, and 165 bp (except for the 10 ng input) clustered near the high-quality DNA control sample (grey dot), with background signal intensities above 218. Degraded samples with an average DNA fragment size of 95 bp and 165 bp with 10 ng DNA input had at least one background signal intensity close to zero, indicating potential issues with QC. Figure 2B shows the mean signal intensities of all probes across all samples. The control sample exhibited a mean signal intensity of approximately 5,000, which was also observed in samples with DNA fragment sizes of 350 bp and 230 bp. For samples with a DNA fragment size of 165 bp, we observed a progressive decrease in mean signal intensity with decreasing DNA amounts. The low mean background signal intensity observed for degraded samples with a DNA fragment size of 95 bp and 165 bp with 10 ng DNA input was desirable, however, the mean signal intensity was low, which indicates poor DNA hybridization. Another important QC metric is the ratio of median red to green signal intensities, that represents the relative balance between the two fluorescent dye channels (green and red) used in the microarray. A balanced ratio (close to 1) indicates good microarray performance, while an imbalanced ratio suggests potential technical issues. When we compared the ratio of median red to green signal intensities, we observed a deviation from a ratio of 1 in samples with DNA fragment sizes of 95 bp and 165 bp with a DNA input of 10 ng (Figure 2C). Additionally, we assessed the genotyping performance of the 65 SNP probes included in the EPIC v2.0 microarray for sample matching (Supplementary Table 1). When comparing the SNP genotypes of the degraded samples with the control sample, we observed an increase in discordant genotype calls with decreasing length of DNA fragment size and lower DNA input. Particularly, more than half of the SNP genotypes were discordant in samples with an average fragment size of 165 bp at 10 ng input, as well as samples with DNA fragment size of 95 bp.
We performed PCA using beta values of the 6,160 CpGs that were shared among all samples (including the samples that failed QC) (Figure 2D) to check for outliers, batch effect, and degradation patterns (DNA fragment size and input amount) in the data. We observed that the control sample and the samples with average fragment sizes higher than 95 bp (except for the combination of 10 ng input and average fragment size of 165 bp DNA) clustered together, while the other samples were scattered across the PCA plane. PC1 and PC2 represented 44.41% and 19.12% of the total variance in the data, respectively, indicating that average DNA fragment size and input amount were the primary drivers of the variance. We also examined the β-value distribution in all samples. Degraded samples that clustered together with the control sample showed the expected bimodal β-value distribution shape (Supplementary Figure 3). In contrast, degraded samples with average fragment sizes of 95 bp and 165 bp (with a DNA input of 10 ng) presented a deviation from the bimodal β-value distribution shape in both replicates. Based on these results, we excluded all DNA samples with an average DNA fragment size of 95 bp, and DNA samples with an average DNA fragment size of 165 bp and a DNA input of 10 ng. Thus, 23 out of 33 samples, including both control and degraded samples, passed QC and were included in subsequent analyses. Looking at the corresponding PCA plot considering the 202,439 common CpGs (Figure 2E), degraded samples with an average DNA fragment size of 350 bp clustered together with the control sample, whereas samples with DNA fragment size of 230 bp and 165 bp were more spread out, indicating a different degradation pattern. PC1 and PC2 represented 20.25% and 8.94% of the total variance, respectively. Finally, considering the probe detection rate, 929,306 probes (99.1%) passed filtering for the control sample, whereas a decrease in probe detection rate was observed with both shorter DNA fragment size and lower DNA input (Figure 2F). Degraded samples with an average DNA fragment size of 350 bp, whereas the lowest probe detection rate (42.63%) and the smallest number of probes passing filtering (399,760) were found for the degraded samples with DNA fragment size of 165 bp and 20 ng input amount. We also investigated the failing probes across the different DNA fragment sizes and DNA input amounts. The SeSAMe pipeline masked 32,896 poorly designed probes, which were therefore excluded from this analysis. A total of 6,066 failing probes were common between all the samples passing QC (including the control sample), however, when considering only the degraded samples, we obtained 57,910 commonly failing probes. We calculated the number of common failing probes for each DNA fragment size and input amount. A total of 71,820, 132,622, and 173,396 common failing probes were detected in samples with average DNA fragment sizes of 350, 230, and 165 bp, respectively. Regarding DNA input amounts, we obtained 67,563, 93,651, 130,661, and 166,795 shared failing probes for input amounts of 100, 50, 20, and 10 ng, respectively (Supplementary Figure 4A-B). Lists with the names of the commonly failing probes across all samples, all degraded samples, DNA fragment sizes, and DNA input amounts are shown in Supplementary File 1.
Degraded DNA methylation data correlations and absolute differences
To investigate the impact of low quality and quantity DNA samples on EPIC v2.0 microarray reproducibility, we calculated the within-replicate correlation (precision) for each combination of average DNA fragment size and input amount (Supplementary Table 2). Degraded samples with an average DNA fragment size of 350 bp and DNA input of 100 ng had within-replicate r value of 0.990 (804,732 shared CpGs). We observed a progressive decrease in r values and number of shared CpGs with shorter DNA fragment sizes and lower DNA inputs. The lowest within-replicate r value (0.851) and number of shared CpGs (259,031) was observed in degraded samples with average DNA fragment size of 165 bp and DNA input of 20 ng. All correlation tests had an adjusted p-value below 0.05. We also assessed the correlation between degraded samples and control sample (reproducibility) for each combination of average DNA fragment size and input amount. Figure 3 shows the r values obtained from 202,439 common CpGs among all samples passing QC and the control sample. First, the correlation between the control sample and sample with average DNA fragment size of 350 bp and DNA input of 100 ng was r = 0.995. For the rest of the degraded samples, we observed a progressive decrease in r values with shorter DNA fragment sizes and lower DNA inputs. The lowest correlation value, r = 0.946, was observed in sample with an average DNA fragment size of 165 bp and DNA input of 20 ng. A drop in r values (from 0.995 to 0.989) was observed in degraded samples with DNA fragment size of 350 bp when reducing the DNA input to 20 ng. Degraded samples with DNA fragment size of 165 bp showed a larger r value drop (from 0.971 to 0.946) (Supplementary Figure 5).
Furthermore, we aimed to estimate the deviation of β-values from the control sample in all degraded samples that passed QC. Quantifying the impact of sample degradation on DNAm measurement could provide a criterion for determining whether to proceed with DNAm analysis. Therefore, we calculated the absolute difference of β-values (|∆β|) between the control and degraded samples, using mean β-values between sample replicates that passed QC. A distribution of |∆β| is generated for each pair of degraded samples (Figure 4). All distributions showed a peak close to 0 and elongated tails towards higher values of |∆β|. We observed a lower median |∆β| and percentage of probes with a |∆β| ≥ 0.05 and 0.10 for samples with larger average DNA fragment sizes and higher DNA inputs (Supplementary Table 3). Samples with average DNA fragment size of 350 bp and DNA input amount of 100 ng had a median |∆β| of 0.016 and 1.95% of probes with a |∆β| ≥ 0.10. While samples with average DNA fragment size of 165 bp and DNA input amount of 20 ng had a median |∆β| of 0.051 and 31.70% of probes with a |∆β| ≥ 0.10. We further investigated |∆β| in β-value intervals of 0.1 to comprehend whether EPIC v2.0 measurements of CpG sites with intermediate β-values were more affected by low quality and quantity DNA samples. We observed higher |∆β| values for intermediate β-values intervals (0.1-0.9) compared to the extreme β-values intervals (0.0-0.1 and 0.9-1.0) in all DNA fragment sizes investigated (Supplementary Figure 6). Samples with shorter average DNA fragment sizes and lower DNA inputs exhibited higher |∆β| in intermediate β-value intervals, while extreme β-values intervals were less affected by the quality and quantity of the DNA sample. We also calculated |∆β| between sample replicates and estimated median |∆β| and percentage of probes with a |∆β| ≥ 0.05 and 0.10. We observed an increase of these parameters with shorter average DNA fragment sizes and lower DNA inputs (Supplementary Table 4). Degraded samples with average DNA fragment size of 350 bp and DNA input of 100 ng had median |∆β| of 0.018 and percentage of probes with |∆β| ≥ 0.10 of 0.068 (804,732 shared CpGs). While for the sample with average DNA fragment size of 165 bp and DNA input of 20 ng, the median |∆β| was 0.03 and 0.23% of probes had |∆β| ≥ 0.10 (259,031 shared CpGs).
Then, we investigated the type of probes and target regions that were more robust in low quality and quantity DNA samples. Illumina EPIC v2.0 microarray uses two types of oligonucleotide probes, Type I probes target methylated and unmethylated epialleles of a CpG site using two different probes, while Type II probes use a single probe able to target both methylated and unmethylated epialleles.
We calculated the proportion of Type I and II probes in probes passing filtering for each DNA sample that passed QC. The relative proportion of Type I probes that passed filtering increased in samples with shorter average DNA fragment sizes and lower DNA input (Supplementary Figure 7A), which indicated that these probes were more efficient than the Type II probes. We also assessed the genomic (CpG island, shore, shelf, and open sea regions) and gene (promoters, gene body, and intergenic regions) distribution of probes passing filtering (Supplementary Figure 7B-C). We observed an increase in the proportion of successfully typed probes targeting CpG islands and gene promoters in samples with shorter average DNA fragment sizes and lower DNA input when compared to the control sample.
Age prediction of degraded samples
Finally, we wanted to test the impact of DNA fragment sizes and DNA inputs on a DNAm application. We chose to estimate DNAm age because it is a potential analysis of interest in the medical and forensic fields. We calculated MAE for the control and degraded samples using four epigenetic clocks: BLUP (319,607 CpGs), EN (514 CpGs), Horvath (353 CpGs), and skinHorvath (391 CpGs) clocks. The control sample had an absolute error (AE) of 0.19, 1.83, 8.77, and 0.21 years for BLUP, EN, Horvath, and skinHorvath clocks, respectively. For the degraded samples, we obtained higher MAE in all samples, especially for the BLUP, EN, and skinHorvath clocks (Figure 5). We observed a general increase in MAE with shorter DNA fragment size and lower DNA input amounts in all epigenetic clocks tested, indicating an impact of sample degradation on DNAm age prediction analysis. Samples with average DNA fragment size of 350 bp had a MAE <10 years for all DNA inputs and epigenetic clocks, except for the age predicted with the Horvath clock in 20 ng DNA input samples. In contrast, degraded samples with average DNA fragment size of 165 had MAE > 10 years, except for age predicted from 20 ng DNA input samples with the BLUP clock and the 50 ng DNA input sample with the skinHorvath clock. Supplementary Table 5 shows the number of missing CpGs in each epigenetic clock across all samples that passed QC. As expected, the number of missing CpGs in the four epigenetic clocks increased with shorter DNA fragment size and lower DNA input amounts.