Updates

Using the power of AI to make genetic diagnosis easier

News Release

HOUSTON (May 22, 2026) – A new computational tool called MARRVEL-MCP helps researchers move toward genetic diagnoses more efficiently by analyzing and interpreting vast amounts of genetic and biological information using everyday language. The study, conducted by researchers at Texas Children’s and Baylor College of Medicine, appeared in the American Journal of Human Genetics.

“Rare genetic diseases are often caused by small changes in a person’s DNA. However, not all genetic changes linked to a condition may play a role in the disease,” said co-corresponding author Dr. Hyun-Hwan Jeong, investigator at the Texas Children’s Duncan Neurological Research Institute and assistant professor of pediatrics – neurology at Baylor. “Some changes may contribute to disease, while others may not. Identifying whether a particular genetic change or variant is harmful or an innocent bystander is crucial for diagnosing these conditions, but the process requires sifting through large amounts of data, a complex and time-consuming task.”

“To reach a genetic diagnosis, doctors and researchers must gather information from many different biological databases, each with its own format and rules, and then carefully piece together the evidence. Even for experts, this can take hours for a single case,” said co-corresponding author Dr. Zhandong Liu, chief of computational sciences at Texas Children’s and associate professor of pediatrics – neurology at Baylor.

The current study introduces MARRVEL-MCP, a new computational tool that is designed to make this process faster and more accessible, especially for non-experts. It combines artificial intelligence, specifically large language models (LLMs) like ChatGPT and Gemini, with a structured set of biological databases to help interpret genetic variants using layman’s terms.

From MARRVEL to MARRVEL-MCP

The team previously had developed MARRVEL (Model organism Aggregated Resources for Rare Variant ExpLoration), a computational approach that allows researchers to comb in a matter of minutes through large genetic and biological databases all at once to search for information regarding gene variants. MARRVEL has been well received by the scientific community, recording more than 43,000 users worldwide in 2025 alone.

MARRVEL brings together genomic, functional and model-organism databases into a unified platform. These sources contain different types of information that need to be considered to determine whether a genetic variant causes a disease. For instance, how common a variant is in the population, whether it has been linked to disease before, predictions about whether it damages a gene, information from lab experiments and model organisms and scientific articles discussing similar cases.

“However, MARRVEL requires precisely formatted inputs and produces comprehensive but complex outputs that demand substantial manual interpretation,” Jeong said. “This poses barriers that limit its accessibility and efficiency for many users as it assumes they can interpret heterogeneous outputs and synthesize evidence across sections, which requires substantial expertise.”

MARRVEL-Model Context Protocol (MCP) changes how this process works. Instead of requiring users to learn technical formats and manually navigate databases, it allows them to ask questions in plain language, such as, “Is this BRCA1 mutation linked to cancer?”

In a matter of seconds, MARRVEL-MCP automatically identifies key pieces of information (like gene names or mutations), converts them into the formats required by databases, queries multiple data sources in the correct order and combines the results into a clear, evidence-based answer. MARRVEL-MCP covers areas like disease associations, genetic variation, gene expression and scientific literature and enables LLMs to autonomously compose and execute multi-step analytical workflows from simple language queries.

“What excites me most is that MARRVEL-MCP shows we do not always need the largest frontier AI models to make meaningful progress in biomedical research,” Jeong said. “By giving smaller models access to the right curated tools and structured context, we can make them smarter for specialized tasks. For example, gpt-oss-20b, a model that can be installed locally, improved to 94% with MARRVEL-MCP from 41% accuracy without MARRVEL-MCP. This suggests a path toward more accessible and cost-effective AI for rare disease research.”

“We have released MARRVEL-MCP as an open resource that allows for the integration of LLM agents with curated biomedical databases,” Liu said. “To facilitate independent exploration and reproducibility, we provide access to MARRVEL-MCP through a publicly available hosted interface at https://chat.marrvel.org, allowing users to interactively test the system without local installation. We also plan to revamp the main MARRVEL platform by adding agentic AI features – which would allow it to take independent actions rather than just generating text or responding to prompts – so users can move from plain-language questions to structured genetic analysis more easily.”

First author Zachary Everton, Jorge Botas, Seon Young Kim and Lin Yao, all Texas Children’s and Baylor College of Medicine, also contributed to this work.

This work was supported by the Cancer Prevention and Research Institute of Texas (CPRIT, RP240131), the Chan Zuckerberg Initiative (grant 2023-332162), the National Institutes of Health (NIH, U54NS093793), the Eunice Kennedy Shriver National Institute of Child Health and Human Development of the NIH (P50HD103555), the Chao Endowment, the Huffington Foundation and the Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital.

About Texas Children’s 
Texas Children's, a nonprofit health care organization, is committed to creating a healthier future for children and women throughout the global community by leading in patient care, education and research. Consistently ranked as the best children's hospital in Texas and among the top in the nation, Texas Children's has garnered widespread recognition for its expertise and breakthroughs in pediatric and women's health. The system includes the Texas Children's Duncan NRI; the Feigin Tower for pediatric research; Texas Children's Pavilion for Women, a comprehensive obstetrics/gynecology facility focusing on high-risk births; Texas Children's Hospital West Campus, a community hospital in suburban West Houston; Texas Children's Hospital The Woodlands, the first hospital devoted to children's care for communities north of Houston and Texas Children's Hospital North Austin, the new state-of-the-art facility providing world-class pediatric and maternal care to Austin families. The organization also created Texas Children's Health Plan, the nation's first HMO focused on children; Texas Children's Pediatrics, the largest pediatric primary care network in the country; Texas Children's Urgent Care clinics that specialize in after-hours care tailored specifically for children; and a global health program that is channeling care to children and women all over the world. Texas Children's Hospital is affiliated with Baylor College of Medicine. For more information, visit www.texaschildrens.org