In this study, a total of 205 BALF samples were collected from patients diagnosed with lower respiratory tract infections (Supplementary Fig. 1). Each sample underwent testing using mNGS, amplification-based tNGS, capture-based tNGS, and conventional microbiological tests (CMTs). Furthermore, more than two clinicians independently conducted a comprehensive diagnosis for each case. Notably, among these samples, only 40 were tested using both DNA mNGS and RNA mNGS due to considerations such as clinical needs, cost, and sample size, leaving the remaining 165 samples untested for RNA mNGS.
The performance of NGS, benchmarked against CMTs and the comprehensive clinical diagnosis as standards, is comprehensively detailed in Table 1. Capture-based tNGS demonstrated the greatest consistency with the comprehensive clinical diagnosis, exhibiting the highest rates of positivity, sensitivity, negative predictive value (NPV), and F1 score. On the other hand, mNGS excels in specificity and positive predictive value (PPV), while the performance of amplification-based tNGS occupies a middle ground. Nonetheless, a comparative analysis, summarized in Table 2, indicates a significant alignment in diagnostic efficacy across the three methods, with no substantial differences observed. This suggests that under certain conditions, the methods could be used interchangeably. Table 3 elaborates on the performance of NGS in CMT-negative cases, evaluated against the comprehensive clinical diagnosis. Here, capture-based tNGS continues to lead in positivity rates, sensitivity, NPV, and F1 score, albeit with the lowest specificity and PPV. Notably, amplification-based tNGS and mNGS exhibit similar performance, except for a notably higher specificity in the former.
Table 1
Performance of NGS in clinical pathogen detecting.
| Standard | Approach | Consistency | Sensitivity | Specificity | PPV | NPV | Positivity | F1 Score |
| | mNGS | 58.05% | 66.15% | 44.00% | 67.19% | 42.86% | 62.44% | 0.6667 |
| CMT | A_tNGS | 60.98% | 73.85% | 38.67% | 67.61% | 46.03% | 69.27% | 0.7059 |
| | C_tNGS | 51.22% | 71.54% | 16.00% | 59.62% | 24.49% | 76.10% | 0.6504 |
| | mNGS | 90.73% | 91.95% | 83.87% | 96.97% | 65.00% | 80.49% | 0.9439 |
| Diagnosis | A_tNGS | 90.24% | 92.53% | 77.42% | 95.83% | 64.86% | 81.95% | 0.9415 |
| | C_tNGS | 93.17% | 99.43% | 58.06% | 93.01% | 94.74% | 90.73% | 0.9611 |
PPV, positive predictive value; NPV, negative predictive value.
Table 2
Consistency and differences comparison of NGS performance.
| Test | Approach | Value | P-value |
| | mNGS vs. A_tNGS | 0.504 | < 0.001 |
| Kappa | mNGS vs. C_tNGS | 0.477 | < 0.001 |
| | A_tNGS vs. C_tNGS | 0.471 | < 0.001 |
| Cochran's Q | mNGS vs. A_tNGS vs. C_tNGS | 1.476 | 0.478 |
Table 3
Performance of NGS against diagnosis in CMT-negative cases.
| Approach | Consistency | Sensitivity | Specificity | PPV | NPV | Positivity | F1 Score |
| mNGS | 84.00% | 84.91% | 81.82% | 91.84% | 69.23% | 65.33% | 0.8824 |
| A_tNGS | 85.33% | 84.91% | 86.36% | 93.75% | 70.37% | 64.00% | 0.8911 |
| C_tNGS | 84.00% | 100.00% | 45.45% | 81.54% | 100.00% | 86.67% | 0.8983 |
Overall, when compared to the comprehensive clinical diagnosis, capture-based tNGS demonstrated the highest sensitivity rate but the lowest PPV. In contrast, amplification-based tNGS exhibited the highest miss rate. Regarding pathogen types, capture-based tNGS showed a high sensitivity rate across all categories, with its lowest PPV primarily observed in DNA viruses, followed by Gram-positive and Gram-negative bacteria. Amplification-based tNGS displayed its highest miss rates with Gram-positive bacteria, followed by Gram-negative bacteria and DNA viruses. Meanwhile, mNGS significantly underperformed in detecting RNA viruses (Fig. 1). Among the 505 pathogens identified through the comprehensive clinical diagnosis, 264 (52.3%) were detected by all three NGS techniques. These primarily included Acinetobacter baumannii, Klebsiella pneumoniae, Candida albicans, Pneumocystis jirovecii, Aspergillus fumigatus, Human betaherpes virus 5 (HHV-5), Human gammaherpes virus 4 (HHV-4), Human alphaherpes virus 1 (HHV-1), among others. The remaining 239 (47.3%) pathogens were detected by only two of the NGS techniques (Fig. 2A). In these cases, mNGS demonstrated high sensitivity for Enterococcus faecium, Haemophilus parainfluenzae, Corynebacterium striatum, Streptococcus mitis, C. albicans, Stenotrophomonas maltophilia, and P. jirovecii, but exhibited a high miss rate for SARS-CoV-2, HHV-1, Haemophilus influenzae, and A. fumigatus. Amplification-based tNGS showed high sensitivity in SARS-CoV-2, C. albicans, H. influenzae, and A. fumigatus, but had a high miss rate for E. faecium, HHV-4, H. parainfluenzae, C. striatum, HHV-5, S. mitis, HHV-1, S. maltophilia, and P. jirovecii. Capture-based tNGS demonstrated high sensitivity across all pathogens except C. albicans. However, in terms of PPV, capture-based tNGS showed relatively low PPV in HHV-4, C. striatum, HHV-5, and HHV-1 (Fig. 2B).
In this study, a comparative analysis was conducted between mNGS and tNGS at the nucleic acid level, due to the observed low sensitivity of mNGS. Overall, the RPM for pathogens reported by mNGS was significantly lower compared to those reported by capture-based tNGS, across all pathogen types with the exception of fungi (Fig. 3A). This disparity not only highlights the reduced sensitivity of mNGS but also underscores the enhanced nucleic acid enrichment capability of capture-based tNGS. Moreover, while amplification-based tNGS demonstrated the highest RPM for pathogen nucleic acids, this method's reliance on PCR amplification complicates direct comparisons (Supplementary Fig. 2). Specifically, PCR amplification can obfuscate the distinction between original and amplified DNA strands. Moreover, the observed shortfall of mNGS in detecting RNA viruses was initially thought to result from the absence of RNA sequencing in the bulk of our samples. However, even after these samples were excluded and a subsequent reevaluation was conducted, the shortfall remained. This suggests that the limitations are inherent to the mNGS technique itself (Supplementary Fig. 3A).
In-depth analysis was conducted on the pathogens missed by amplification-based tNGS. Firstly, a subset of pathogens fell outside the scope of the amplification-based tNGS panel utilized in this study, constituting 16.34% of the pathogens identified in the comprehensive clinical diagnosis (Supplementary Fig. 3B) (Supplementary Fig. 3C showed the most commonly missed pathogens within the amplification-based tNGS panel). After excluding the pathogens not covered by the panel, significant detection shortfalls persisted in amplification-based tNGS for both Gram-negative bacteria and DNA viruses (Supplementary Fig. 3A). This suggests a potential diagnostic gap in real-world clinical settings. Secondly, the detection of low-abundance pathogens poses a challenge for amplification-based tNGS. Among the pathogens identified by the comprehensive clinical diagnosis, those reported by both amplification-based tNGS and mNGS exhibited significantly higher RPM than those reported by mNGS alone. This discrepancy was particularly pronounced for Gram-negative bacteria and DNA viruses (Fig. 3B).
The lower PPV observed with capture-based tNGS can be attributed to its reports of redundant pathogens not identified in the comprehensive clinical diagnosis. Analysis of common redundant pathogen species was conducted at the nucleic acid level. Pathogens reported exclusively by capture-based tNGS generally displayed significantly lower RPM compared to those identified by both capture-based tNGS and the comprehensive clinical diagnosis (Fig. 3C). Consequently, receiver operating characteristic (ROC) curves were employed to establish improved reporting thresholds, aiming to mitigate the issue of low PPV (Fig. 3D). Subsequent optimization of reporting thresholds for capture-based tNGS markedly enhanced the PPV with the comprehensive clinical diagnosis (Fig. 3E).
In addition, this study assessed the capability to identify pathogen genotypes. For SARS-CoV-2, mNGS was limited to species-level identification, prompting a comparison between two tNGS approaches. Interestingly, among all the SARS-CoV-2 genotypes consistently identified by both tNGS methods, it was always the capture-based tNGS that provided a more refined genotype distinction (Fig. 4A).
Remarkably, examination of the outputs from these two tNGS approaches unveiled the presence of numerous antimicrobial resistance (AMR) genes and virulence factors (VFs). Both tNGS methodologies consistently identified several genes, including KPC, mecA, iucA, peg344, among others (Fig. 4B) (Fig. 4C). However, the capture-based tNGS approach demonstrated superior detection capabilities, identifying a broader spectrum of AMR genes and VFs.