TY - JOUR
T1 - Strategies for metagenomic-guided whole-community proteomics of complex microbial environments
AU - Cantarel, Brandi L.
AU - Erickson, Alison R.
AU - VerBerkmoes, Nathan C.
AU - Erickson, Brian K.
AU - Carey, Patricia A.
AU - Pan, Chongle
AU - Shah, Manesh
AU - Mongodin, Emmanuel F.
AU - Jansson, Janet K.
AU - Fraser-Liggett, Claire M.
AU - Hettich, Robert L.
PY - 2011/11/23
Y1 - 2011/11/23
N2 - Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.
AB - Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.
UR - http://www.scopus.com/inward/record.url?scp=81755176064&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=81755176064&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0027173
DO - 10.1371/journal.pone.0027173
M3 - Article
C2 - 22132090
AN - SCOPUS:81755176064
SN - 1932-6203
VL - 6
JO - PloS one
JF - PloS one
IS - 11
M1 - e27173
ER -