Tokyo Tech News

Personalized Gut Microbiome Analysis for Colorectal Cancer Classification with Explainable AI


Published: April 14, 2023

Explainable AI offers a promising solution for finding links between diseases and certain species of gut bacteria, finds a research team at Tokyo Tech. Using a concept borrowed from game theory, the researchers developed a framework that reveals which bacterial species are closely associated with colorectal cancer in individual subjects, providing a more reliable way to find and characterize disease subgroups and identify biomarkers in the gut microbiome.

Personalized Gut Microbiome Analysis for Colorectal Cancer Classification with Explainable AI

The gut microbiome comprises a complex population of different bacterial species that are essential to human health. In recent years, scientists across several fields have found that changes in the gut microbiome can be linked to a wide variety of diseases, notably colorectal cancer (CRC). Multiple studies have revealed that a higher abundance of certain bacteria, such as Fusobacterium nucleatum and Parvimonas micra, is typically associated with CRC progression.

Based on these findings, researchers have developed various artificial intelligence (AI) models to help them analyze which bacterial species are useful as CRC biomarkers. However, most of these models rely on what is known as "global explanations," meaning that they can only consider the entirety of the input data to make predictions. As a result, such models are unable to identify bacterial species that could be relevant CRC biomarkers for smaller, less-representative groups of patients.

Against this backdrop, a research team from Tokyo Institute of Technology (Tokyo Tech), Japan, decided to adopt a different approach capable of addressing this limitation. As outlined in their paper, which was recently published in Genome Biology, the team employed an explainable AI framework that provides local, rather than global, explanations for its CRC predictions. "Local explanation techniques make it possible to discover the most contributing bacteria for each individual CRC patient, enabling us to examine inter-individual differences between subjects within a disease group," explains Associate Professor Takuji Yamada, the main author of the study.

The team used a framework called "Shapley additive explanations" (SHAP), which originated from a concept in game theory called the Shapley value. Put simply, the Shapley value tells us how a payout should be distributed among the players of a coalition or group. Similarly, in their study, the team used SHAP to calculate the contribution of each bacterial species to each individual CRC prediction.

Using this approach along with data from five CRC datasets, the researchers discovered that projecting the SHAP values into a two-dimensional (2D) space allowed them to see a clear separation between healthy and CRC subjects. Clustering this 2D information resulted in four distinct subgroups of CRC subjects, each differing in the CRC probability and the associated bacteria. In addition, the team found that subjects in the CRC subgroups with the highest CRC probability always had an enriched population of bacteria typically associated with CRC. Most remarkably, the results were consistent across the five datasets, showcasing the wide applicability of this method.

With these promising results, the team anticipates their approach to make solid contributions in the gut microbiome research community. "Considering the increasing use of machine learning in microbiome—disease association studies, our novel method could be beneficial for a more personalized microbiome data exploration as well as help uncover potential disease subgroups along with their potential associated biomarkers," speculates Dr. Yamada. Further, the technique is also applicable to other diseases with known links to the gut microbiome, such as ulcerative colitis, Chron's disease, and diabetes.

Hopefully, explainable AI will reveal more such secrets of the gut microbiome in the near future, so stay tuned!


Authors :
Ryza Rynazal1, Kota Fujisawa1, Hirotsugu Shiroma1, Felix Salim1, Sayaka Mizutani1, Satoshi Shiba5, Shinichi Yachida5,6, and Takuji Yamada1,2,3,4*
Title :
Leveraging explainable AI for gut microbiome-based colorectal cancer classification
Journal :
Genome Biology
Affiliations :
1School of Life Science and Technology, Tokyo Institute of Technology
2Metagen, Inc.
3Metagen Theurapeutics, Inc.
4Digzyme, Inc.
5Division of Genomic Medicine, National Cancer Center Research Institute
6Department of Cancer Genome Informatics, Graduate School of Medicine, Osaka University

* Corresponding author's email:

School of Life Science and Technology

School of Life Science and Technology
—Unravel the Complex and Diverse Phenomena of Life—

Information on School of Life Science and Technology inaugurated in April 2016

School of Life Science and Technology

Schools, Departments, and Institute for Liberal Artsouter

Further Information

Associate Professor Takuji Yamada

School of Life Science and Technology, Tokyo Institute of Technology



Public Relations Division, Tokyo Institute of Technology

Tel +81-3-5734-2975