TY - JOUR
T1 - Multilaboratory Untargeted Mass Spectrometry Metabolomics Collaboration to Identify Bottlenecks and Comprehensively Annotate A Single Dataset
AU - Houriet, Joelle
AU - Manwill, Preston K.
AU - Magaña, Armando Alcázar
AU - Anderson, Victoria M.
AU - Beniddir, Mehdi A.
AU - Bertrand, Samuel
AU - Choi, Jaewoo
AU - Clark, Trevor N.
AU - Foster, Leonard J.
AU - Halabalaki, Maria
AU - Jarmusch, Alan K.
AU - de Jonge, Niek F.
AU - Khadilkar, Aswad
AU - MacMillan, John B.
AU - Maier, Claudia S.
AU - Marney, Luke C.
AU - Marti, Guillaume
AU - Mikropoulou, Eleni V.
AU - Olivier-Jimenez, Damien
AU - Perez, Amélie
AU - van der Hooft, Justin J.J.
AU - Zdouc, Mitja M.
AU - Linington, Roger G.
AU - Cech, Nadja B.
N1 - Publisher Copyright:
© 2025 The Authors. Published by American Chemical Society
PY - 2025/8/5
Y1 - 2025/8/5
N2 - Annotation is the process of assigning features in mass spectrometry metabolomics data sets to putative chemical structures or “analytes.” The purpose of this study was to identify challenges in the annotation of untargeted mass spectrometry metabolomics datasets and suggest strategies to overcome them. Toward this goal, we analyzed an extract of the plant ashwagandha (Withania somnifera) using liquid chromatography–mass spectrometry on two different platforms (an Orbitrap and Q-ToF) with various acquisition modes. The resulting 12 datasets were shared with ten teams that had established expertise in metabolomics data interpretation. Each team annotated at least one positive ion dataset using their own approaches. Eight teams selected the positive ion mode data-dependent acquisition (DDA) data collected on the Orbitrap platform, so the results reported for that dataset were chosen for an in-depth comparison. We compiled and cross-checked the annotations of this dataset from each laboratory to arrive at a “consensus annotation,” which included 142 putative analytes, of which 13 were confirmed by comparison with standards. Each team only reported a subset (24 to 57%) of the analytes in the consensus list. Correct assignment of ion species (clusters and fragments) in MS spectra was a major bottleneck. In many cases, in-source redundant features were mistakenly considered to be independent analytes, causing annotation errors and resulting in overestimation of sample complexity. Our results suggest that better tools/approaches are needed to effectively assign feature identity, group related mass features, and query published spectral and taxonomic data when assigning putative analyte structures.
AB - Annotation is the process of assigning features in mass spectrometry metabolomics data sets to putative chemical structures or “analytes.” The purpose of this study was to identify challenges in the annotation of untargeted mass spectrometry metabolomics datasets and suggest strategies to overcome them. Toward this goal, we analyzed an extract of the plant ashwagandha (Withania somnifera) using liquid chromatography–mass spectrometry on two different platforms (an Orbitrap and Q-ToF) with various acquisition modes. The resulting 12 datasets were shared with ten teams that had established expertise in metabolomics data interpretation. Each team annotated at least one positive ion dataset using their own approaches. Eight teams selected the positive ion mode data-dependent acquisition (DDA) data collected on the Orbitrap platform, so the results reported for that dataset were chosen for an in-depth comparison. We compiled and cross-checked the annotations of this dataset from each laboratory to arrive at a “consensus annotation,” which included 142 putative analytes, of which 13 were confirmed by comparison with standards. Each team only reported a subset (24 to 57%) of the analytes in the consensus list. Correct assignment of ion species (clusters and fragments) in MS spectra was a major bottleneck. In many cases, in-source redundant features were mistakenly considered to be independent analytes, causing annotation errors and resulting in overestimation of sample complexity. Our results suggest that better tools/approaches are needed to effectively assign feature identity, group related mass features, and query published spectral and taxonomic data when assigning putative analyte structures.
UR - https://www.scopus.com/pages/publications/105012938647
UR - https://www.scopus.com/pages/publications/105012938647#tab=citedBy
U2 - 10.1021/acs.analchem.4c05577
DO - 10.1021/acs.analchem.4c05577
M3 - Article
C2 - 40693864
AN - SCOPUS:105012938647
SN - 0003-2700
VL - 97
SP - 16110
EP - 16122
JO - Analytical Chemistry
JF - Analytical Chemistry
IS - 30
ER -