This study investigated potential early diagnostic biomarkers for EC. The results suggest that acetaldehyde and formaldehyde concentrations in exhaled breath may serve as useful noninvasive biomarkers for the early detection of EC. To the best of our knowledge, only three studies have reported on aldehyde-based biomarkers for EC using gas analysis of exhaled breath [8,9,29]. One study reported a diagnostic model for esophageal and gastric cancer that combined four volatile organic compounds—hexanoic acid, phenol, methylphenol, and ethylphenol—with an ROC curve AUC of 0.91 [8]. Another study proposed a diagnostic model for esophageal adenocarcinoma using volatile organic compounds, such as carboxylic acids, phenol, and aldehydes—including butanal, pentanal, hexanal, heptanal, octanal, nonanal, and decanal—which achieved an AUC of 0.90 [9]. The third study demonstrated elevated aldehydes, including acetaldehyde, in exhaled breath in patients with EC, but did not report ROC curve [29]. In contrast, the distinguishing feature of the present study is the use of EBC, which is a sample that is noninvasive and easy to collect and store. This is the first report of aldehyde measurement in EBC for EC diagnosis. Volatile organic compounds in exhaled breath in the gaseous phase are commonly measured using gas chromatography mass spectrometry. However, this approach has limitations in quantitative accuracy due to factors such as incomplete sample recovery during breath collection, adsorption within the column, and solvent elution. Analysis of EBC offers advantages, as it enables accurate collection of steady-state exhaled breath, reflects the status of the lower airways, allows absolute quantification by LC-ESI-MS/MS. Notably, in this study, formaldehyde in EBC alone achieved an AUC of 0.85, which is comparable to previous diagnostic models that relied on the combined analysis of multiple metabolites. In addition, both SCC and AC were studied as histological subtypes of EC, and the concentrations of acetaldehyde and formaldehyde in EBC were significantly elevated in each subtype compared to HC group.
Furthermore, the plasma acetaldehyde and formaldehyde concentrations were higher in patients with EC than in HC, which is consistent with the results of EBC measurements. This was attributed to a correlation between blood aldehyde concentrations and EBC to some extent, but no clear correlation was found between blood aldehyde concentrations and EBC in individual patient groups where both were measured (Supplementary Fig. 3). This may be attributable to the limited sample size, variability in specimen collection times and potential influence of medications on detoxification metabolic pathways. Furthermore, with respect to acetaldehyde in EBC, acetaldehyde production by oral bacteria may have been involved. However, in our analysis of oral microbiota, the relative abundance of acetaldehyde-producing bacteria did not differ significantly between HC and patients with EC. Nevertheless, variations in particular bacterial species, notably an elevated presence of Fusobacterium nucleatum, may indicate that differences in acetaldehyde-producing capacity among various taxa may influence overall acetaldehyde concentrations. Therefore, the origin of aldehydes in EBC warrants further investigation. Moreover, ALDH2 is a well-known enzyme that is involved in acetaldehyde metabolism. Although ALDH2 gene polymorphisms were not examined in this study, they may play a role in the elevated acetaldehyde concentrations observed in the EBC of patients with EC.
Formaldehyde is produced in vivo by various reactions, including demethylation of histones and nucleic acids [30,31]. Formaldehyde is produced during the synthesis of glycine from serine, and serine has been reported to be necessary for cancer cell growth [32]. Cancer cells overexpress serine hydroxymethyltransferase —an enzyme that is involved in the de novo serine synthesis pathway [33,34]. During the conversion of serine to glycine, formaldehyde is produced, and formaldehyde levels in cancer cells possibly increase owing to the upregulation of the one-carbon metabolism cycle [30,35]. This increase in intracellular formaldehyde possibly leads to elevated concentrations of formaldehyde in both blood and EBC. Alcohol dehydrogenase 5 (ADH5) is a well-known enzyme that is involved in formaldehyde metabolism [36,37], and, similar to acetaldehyde, genetic polymorphisms of ADH5 may influence formaldehyde levels. We plan to investigate these gene polymorphisms using next-generation sequencing analysis in future studies.
The main limitations of this study are as follows. First, the age, sex, smoking history, and drinking history of the participants were not matched between the EC and HC groups. In particular, age and sex represent significant confounding factors. Second, this study did not include patients with other cancer types; therefore, the specificity of the biomarkers for EC remains uncertain. Third, the effects of diet, alcohol consumption, and smoking could not be fully controlled, as fasting, alcohol abstinence, and smoking cessation prior to specimen collection were not strictly enforced in this study to reflect real-world clinical conditions. Especially, the influence of recent alcohol intake on acetaldehyde concentration could not be ruled out. Fourth, this study did not evaluate the presence of genetic polymorphisms in ALDH2 and ADH5—the enzymes responsible for metabolizing acetaldehyde and formaldehyde, respectively. Instead, the presence or absence of alcohol flushing response, as determined by a questionnaire survey, was employed as an indirect indicator of ALDH2 status with respect to acetaldehyde concentration. Fifth, blood aldehyde concentrations were measured using plasma samples. Although acetaldehyde is more abundant in erythrocytes than in plasma [38], its concentration in erythrocytes was not measured in this study. Sixth, the mechanism underlying the elevation of aldehydes in patients with cancer remains unclear. Although quantitative analysis demonstrated its potential usefulness as a diagnostic marker, the elucidation of the underlying mechanism requires further investigation.
In conclusion, acetaldehyde and formaldehyde concentrations in the EBC of patients with EC were significantly higher than those in HC, which indicates their potential utility as diagnostic biomarkers for EC. Furthermore, the noninvasive nature and ease of sample preservation make EBC a promising candidate for primary screening prior to upper gastrointestinal endoscopy. In future studies, we plan to examine ALDH2 and ADH5 gene polymorphisms using next-generation sequencing analysis to validate the origin of acetaldehyde and formaldehyde in EBC.