Study design
This was a prospective observational case-control pilot study conducted at a 1,048-bed university-affiliated hospital in Korea between January 2022 and February 2024. The study included adult patients (≥ 19 years) with blood culture-confirmed candidemia, defined as at least one positive peripheral blood culture for Candida species with compatible clinical features. Non-survivors were defined as patients who died during hospitalization, whereas survivors were those that remained alive to be discharged.
Blood culture collection within 2 h after fever onset (≥ 38°C) and before antifungal therapy was prioritized to enhance diagnostic accuracy. Fecal specimens were collected within 5 days after candidemia diagnosis for the evaluation of gut microbiota and metabolite profiles. The fecal samples were aliquoted and stored frozen within 24 h of collection. Patients were excluded from this study if they refused to complete the consent form or if their stool sample was collected > 5 days after the date of candidemia diagnosis. Only the first episode of candidemia was analyzed for each patient.
The study protocol was approved by the Institutional Review Board of Korea University Anam Hospital (approval number: 2022AN0232), and written informed consent was provided by all participants or their surrogates prior to participation. This study was conducted in accordance with the ethical principles outlined in the Declaration of Helsinki and the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines for observational cohort studies.
Data collection
The following demographic and clinical data were prospectively obtained from the patients’ electronic medical records: age, sex, underlying diseases, Charlson’s Comorbidity Index [11], laboratory results, and risk factors identified within 1 month prior to candidemia diagnosis. The evaluated risk factors included neutropenia (absolute neutrophil count < 500 cells/µL), recent surgical history, and the use of immunosuppressive agents. The source of candidemia was determined based on clinical evidence of infection, regardless of whether causative organisms were isolated from the origin. [12–14] Clinical severity at the most severe stage of the disease was assessed using the following factors: presence of septic shock [15], the Pitt bacteremia scoring system [16], use of a central venous catheter, need for mechanical ventilation, admission to the intensive care unit, and in-hospital mortality.
Clinical microbiologic analysis
Identification of Candida spp. in blood culture and assessment of antifungal susceptibility were conducted using the BacT/ALERT 3D Microbial Detection System (bioMe´rieux, Inc., Durham, NC, USA) and the automated Vitek 2 Yeast Biochemical Card (bioMe´rieux, Inc., Durham, NC, USA), following a routine laboratory diagnostic procedure. The Candida strains were confirmed using matrix-assisted laser desorption/ionization-time of fight mass spectrometry (Bruker Daltonics, Bremen, Germany).
Stool specimen collection and sequencing
The V3–V4 region of the 16S rRNA gene was targeted for amplification to analyze the composition of the intestinal microbiota. The amplified products were purified using a magnetic bead-based purification process, and the appropriate concentration of the purified product was pooled together. Short fragments were removed using the ProNex® Size-Selective Purification System (Promega, Southampton, UK). The quality of the purified products was assessed using the PicoGreen assay (Molecular Probes, Invitrogen, USA). The pooled amplicons were sequenced using an Illumina MiSeq Sequencing System (Illumina, USA).
Low-quality sequence reads (Q < 25) were filtered out using Trimmomatic v0.32. Paired-end reads were merged using VSEARCH v2.13.4 with default parameters and trimmed at a similarity cutoff of 0.8 based on the Myers–Miller alignment algorithm. [17] Non-specific amplicons that did not encode the V3–V4 region of 16S rRNA were identified using HMMER v3.2.1. [18] Unique sequence reads were extracted, and duplicate reads were clustered using VSEARCH. [17] Taxonomic assignments were performed using the EzBioCloud 16S rRNA database, and chimeric sequences were removed using the UCHIME algorithm. [19, 20]
Metabolomic profiling
Fecal samples were homogenized using bead-beating and extracted with methanol for metabolomic analysis. After extraction, the samples were centrifuged, and the supernatant was filtered through a 0.45-µm syringe filter. All metabolites, including short chain fatty acids (SCFAs) such as acetic acid, butyric acid, valeric acid, and propionic acid, were analyzed using a gas chromatograph-mass spectrometer (gas chromatograph: Agilent 7890, mass spectrometer: LECO Pegasus HT TOFMS). All metabolites were analyzed using only significant peaks with a signal-to-noise ratio > 9. All procedures were performed in triplicate to ensure accuracy.
Statistical Analyses
Categorical variables were compared using Fisher’s exact test or Pearson’s chi-square test as appropriate and are expressed as numbers (proportions). Continuous variables were compared using either a two-sample Student’s t-test for normally distributed data or the Mann–Whitney U test for non-normally distributed data, and are summarized as median values (interquartile range [IQR]).
Potential prognostic factors associated with mortality were identified using univariate analyses. Variables with a P value < 0.2 in univariate analysis were included in the multivariate analysis. Backward stepwise selection was applied to refine the model and select the most relevant predictors. The goodness-of-fit of the final model was assessed using the Hosmer–Lemeshow test. Model discrimination was assessed by generating receiver operating characteristic curves, and predictive performance was validated using stratified 10-fold cross-validation.
Alpha diversity was evaluated using the Abundance-based Coverage Estimator and Shannon’s and Simpson’s diversity indices. Beta diversity was evaluated using the Bray–Curtis distance and visualized through principal-coordinate analysis. Taxonomic biomarkers were identified using linear discriminant analysis effect size (LEfSe) and the Kruskal–Wallis H Test. These analyses were conducted using the EzBioCloud 16s-based Microbiome Taxonomic Profiling platform.
Metabolomic data were normalized using the log10 fold-change for each metabolite. Differences between survivors and non-survivors were assessed using two-sample Student’s t-test. Correlations between stool metabolites and bacterial genera were analyzed using Spearman correlation. Partial least-squares discriminant analysis (PLS-DA) was performed to identify the stool metabolite signature of mortality, with a variable importance in projection (VIP) score threshold of 1.3.
Functional profiles were predicted from normalized taxonomic data using PICRUSt and MinPath algorithms. Differentially abundant functional pathways were identified using the Kruskal–Wallis H test and LEfSe, with statistical significance set at P < 0.05.
IBM SPSS Statistics version 20.0 (IBM Corporation, Armonk, NY, USA), SAS 9.4 (SAS Institute Inc., Cary, NC, USA), and R 4.4.1 with RStudio (v2024.04.2 + 764) (Te R Foundation for Statistical Computing, Vienna, Austria) were used for all statistical analyses. Two-sided P values < 0.05 were considered significant.