Colon cancer microbiome reconstruction from human RNA sequencing data

Google+ Pinterest LinkedIn Tumblr +


The association between microbes and cancer has been reported repeatedly; however, it is not clear if molecular tumour properties are connected to specific microbial colonisation patterns. This is due mainly to the current technical and analytical strategy limitations to characterise tumour-associated bacteria.

Researchers at the European Institute of Oncology have developed an approach to detect bacterial signals in human RNA sequencing data and associate them with the clinical and molecular properties of the tumours. The method was tested on public datasets from The Cancer Genome Atlas, and its accuracy was assessed on a new cohort of colorectal cancer patients. The analysis shows that intratumoural microbiome composition is correlated with survival, anatomic location, microsatellite instability, consensus molecular subtype and immune cell infiltration in colon tumours. In particular, the researchers found Faecalibacterium prausnitzii, Coprococcus comes, Bacteroides spp., Fusobacterium spp. and Clostridium spp. to be strongly associated with tumour properties.

General overview on bacteria reconstructed microbiomes

Fig. 1

a Summary of the microbial reconstruction workflow to detect clinical and molecular associations with bacteria. Each RNA sequencing BAM file was analysed by this workflow: after removing the human sequences, the rest of the reads were aligned to a set of microbial genomes from the National Center for Biotechnology Information (NCBI) and assigned to one or more species. The last step to reconstruct the microbiome of the samples involves a batch effect detection that identifies the influence of available technical properties on the reconstructed microbiome composition. Microbiome profiles are then corrected accordingly for the most relevant batch effects. Finally, the corrected microbiome profiles were tested for associations with clinical and molecular properties of the tumour, survival analysis and property classification. For the properties associated with microbiome composition, the bacterial composition underwent a property classification approach, while specific bacterial species were detected as linked to the property levels and the prognosis. Colon adenocarcinoma (COAD) bacterial reads were pooled into left and right-sided, CMS1 and CMSs pooled and mutation burden high and low and underwent pathway profiling to compare metabolic differences between the sides of the colon, CMSs and mutational burden levels, respectively. b Principal component analysis on all the reconstructed bacterial microbiomes of the cancer types analysed. GBM, glioblastoma multiforme; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; HNSC, head and neck squamous cell carcinoma; OV, ovarian serous cystadenocarcinoma; READ, rectum adenocarcinoma; SKCM, skin cutaneous melanoma; BRCA, breast invasive carcinoma. Number of samples analysed in brackets

These researchers implemented an approach to concurrently analyse clinical and molecular properties of the tumour as well as the composition of the associated microbiome. Their results may improve patient stratification and pave the path for mechanistic studies on microbiota-tumour crosstalk.


Sambruni G, Macandog AD, Wirbel J, Cagnina D, Catozzi C, Dallavilla T, Borgo F, Fazio N, Fumagalli-Romario U, Petz WL, Manzo T, Ravenda SP, Zeller G, Nezi L, Schaefer MH. (2023) Location and condition based reconstruction of colon cancer microbiome from human RNA sequencing data. Genome Med 15(1):32. [article]
Share.