3D illustration of a metabolite

New Machine Learning Model Accurately Identifies Metabolites

The open-source software is the first metabolite identification tool that can distinguish between stereochemical variants

Written byHolden Galusha
| 2 min read
Register for free to listen to this article
Listen with Speechify
0:00
2:00

A new, open-source machine learning model called LC-MS2Struct can help identify metabolites with more accuracy than traditional liquid chromatography-mass spectrometry (LC-MS). The model may benefit the fields of drug discovery, diagnostics, and fundamental research by increasing identification reliability. This research was recently published in Nature Machine Intelligence.

Metabolites are tiny molecules that transfer energy and relay cellular information around the human body. Due to their size, they are difficult to distinguish in traditional blood sample analysis processes, such as LC-MS. In the LC-MS technique, metabolites are separated by running the sample through an LC column, which returns different flow rates, or retention times, for each metabolite. A mass spectrometer then sorts the metabolites by mass. This two-part process allows researchers to reliably identify a decent portion of metabolites, but the method is far from perfect. “Even the best methods can’t identify more than 40 percent of the molecules in samples without making some assumptions about the candidate molecules,” says study author Juho Rousu, professor at Aalto University.

Lab manager academy logo

Advanced Lab Management Certificate

The Advanced Lab Management certificate is more than training—it’s a professional advantage.

Gain critical skills and IACET-approved CEUs that make a measurable difference.

This new metabolite identification model, developed by researchers from Aalto University and the University of Luxembourg, sidesteps the issues associated with LC-MS identification because it was trained on information harvested from dozens of laboratories worldwide. Until now, metabolite data could not be compared between laboratories because the retention times were all different. Eric Bach, study co-author and doctoral student at Aalto, devised a solution that addresses this problem. “Our research shows that while absolute retention times may vary, the retention order is stable across measurements by different labs,” Bach explained in a news release. “This allowed us to merge all publicly available data on metabolites for the first time ever and feed it into our machine learning model.”

Because it was built on such a wide breadth of data from so many labs, LC-MS2Struct can accurately distinguish between stereochemical variants, or mirror image molecules—a capability that no other metabolite identification programs have. This ability is “expected to open up new avenues in drug design and other fields.”

“This new open-source model offers the whole research community an enriched view of small molecules. It will help research into methods to identify metabolic disorders, such as diabetes, or even cancer,” Rousu added.

The LC-MS2Struct source code is available on GitHub.

Interested in lab tools and techniques?

Subscribe to our free Lab Tools & Techniques Newsletter.

Is the form not loading? If you use an ad blocker or browser privacy features, try turning them off and refresh the page.

By subscribing, you agree to receive email related to Lab Manager content and products. You may unsubscribe at any time.

About the Author

  • Holden Galusha headshot

    Holden Galusha is the associate editor for Lab Manager. He was a freelance contributing writer for Lab Manager before being invited to join the team full-time. Previously, he was the content manager for lab equipment vendor New Life Scientific, Inc., where he wrote articles covering lab instrumentation and processes. Additionally, Holden has an associate of science degree in web/computer programming from Rhodes State College, which informs his content regarding laboratory software, cybersecurity, and other related topics. In 2024, he was one of just three journalists awarded the Young Leaders Scholarship by the American Society of Business Publication Editors. You can reach Holden at hgalusha@labmanager.com.

    View Full Profile

Related Topics

Loading Next Article...
Loading Next Article...

CURRENT ISSUE - May/June 2025

The Benefits, Business Case, And Planning Strategies Behind Lab Digitalization

Joining Processes And Software For a Streamlined, Quality-First Laboratory

Lab Manager May/June 2025 Cover Image