Introduction: Metabolomics technologies enable the identification of putative biomarkers for numerous diseases; however, the influence of confounding factors on metabolite levels poses a major challenge in moving forward with such metabolites for pre-clinical or clinical applications.
Objectives: To address this challenge, we analyzed metabolomics data from a colorectal cancer (CRC) study, and used seemingly unrelated regression (SUR) to account for the effects of confounding factors including gender, BMI, age, alcohol use, and smoking.
Methods: A SUR model based on 113 serum metabolites quantified using targeted mass spectrometry, identified 20 metabolites that differentiated CRC patients (n = 36), patients with polyp (n = 39), and healthy subjects (n = 83). Models built using different groups of biologically related metabolites achieved improved differentiation and were significant for 26 out of 29 groups. Furthermore, the networks of correlated metabolites constructed for all groups of metabolites using the ParCorA algorithm, before or after application of the SUR model, showed significant alterations for CRC and polyp patients relative to healthy controls.
Results: The results showed that demographic covariates, such as gender, BMI, BMI2, and smoking status, exhibit significant confounding effects on metabolite levels, which can be modeled effectively.
Conclusion: These results not only provide new insights into addressing the major issue of confounding effects in metabolomics analysis, but also shed light on issues related to establishing reliable biomarkers and the biological connections between them in a complex disease.
Keywords: Clinical factors; Colorectal cancer; Colorectal polyp; Metabolic profiling; Metabolomics; Seemingly unrelated regression; Targeted mass spectrometry.