Resumen:
We conducted a genome-wide interaction analysis between long-term exposure to trihalomethanes in drinking water and colorectal cancer (CRC) risk in a multicenter case-control study in Spain, including 1037 CRC cases and 2100 controls. Exposure categories were estimated based on sex-specific median and quartile values of total trihalomethanes (TTHM), chloroform (CHCl3), and brominated trihalomethanes (Br-THMs) among controls. In addition, TTHM exposure was assessed relative to the WHO guideline thresholds. Gene-environment interaction models were computed with the GxEScanR package. To explore biological plausibility, relevant results were inspected in search of expression quantitative trait loci (eQTLs) in two independent resources: BarcUVaSeq and the Genome Tissue Expression (GTEx) v8. Finally, we searched the Comparative Toxicogenomics Database to identify candidate genes previously linked to trihalomethane exposure, retrieved their eQTLs, and evaluated gene-environment interactions with TTHM levels. We found three variants that modulated CRC risk in relation to CHCl3 and TTHM exposure: rs77985109 near LRRC8B, chr15:28997737 near WHAMMP2, and rs7890183 near MAGEB2. Two additional variants were specifically found for women and one for rectal cancer. Functional assessment suggested a regulatory role of rs77985109 in LRRC8B expression. Moreover, eQTL analysis of candidate genes revealed an additional variant associated with CCL2 which could modulate CRC risk under different TTHM exposure levels. The present study identified novel loci potentially influencing CRC susceptibility under THM exposure, highlighting the importance of integrating environmental and genetic data to better understand environmental driven cancer risks. Further research is needed to confirm these results and clarify underlying mechanisms.