The DBSAV Database: Predicting Deleteriousness of Single Amino Acid Variations in the Human Proteome

Jimin Pei, Nick V. Grishin

Research output: Contribution to journal › Article › peer-review

9 Scopus citations


Deleterious single amino acid variation (SAV) is one of the leading causes of human diseases. Evaluating the functional impact of SAVs is crucial for diagnosis of genetic disorders. We previously developed a deep convolutional neural network predictor, DeepSAV, to evaluate the deleterious effects of SAVs on protein function based on various sequence, structural, and functional properties. DeepSAV scores of rare SAVs observed in the human population are aggregated into a gene-level score called GTS (Gene Tolerance of rare SAVs) that reflects a gene's tolerance to deleterious missense mutations and serves as a useful tool to study gene-disease associations. In this study, we aim to enhance the performance of DeepSAV by using expanded datasets of pathogenic and benign variants, more features, and neural network optimization. We found that multiple sequence alignments built from vertebrate-level orthologs yield better prediction results than those built from mammalian-level orthologs. For multiple sequence alignments built from BLAST searches, optimal performance was achieved with a sequence identity cutoff of 50% to remove distant homologs. The new version of DeepSAV exhibits the best performance among standalone predictors of deleterious effects of SAVs. We developed the DBSAV database, which reports GTS scores of human genes and DeepSAV scores of SAVs in the human proteome, including pathogenic and benign SAVs, population-level SAVs, and all possible SAVs arising from single nucleotide variations. This database serves as a useful resource for research on human SAVs and their relationships with protein functions and human diseases.
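The identity-cutoff step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `percent_identity` and `filter_alignment`, the toy sequences, and the definition of identity (identical residues divided by columns where both sequences have a residue) are assumptions made for illustration. Whether the published method keeps hits at exactly 50% identity is also not stated, so the `>=` comparison is an assumption.

```python
# Illustrative only: filter aligned homologs by percent identity to the query.
# The identity definition and the >= comparison are assumptions, not taken
# from the DeepSAV publication.

def percent_identity(query: str, hit: str) -> float:
    """Percent identity over columns where both aligned sequences have a residue."""
    if len(query) != len(hit):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where either sequence has a gap character.
    aligned = [(q, h) for q, h in zip(query, hit) if q != "-" and h != "-"]
    if not aligned:
        return 0.0
    matches = sum(q == h for q, h in aligned)
    return 100.0 * matches / len(aligned)


def filter_alignment(query: str, hits: dict[str, str], cutoff: float = 50.0) -> dict[str, str]:
    """Keep only hits whose identity to the query is at or above the cutoff."""
    return {
        name: seq
        for name, seq in hits.items()
        if percent_identity(query, seq) >= cutoff
    }


# Toy alignment (hypothetical sequences, already aligned to the query).
query = "MKTAYIAKQR"
hits = {
    "close_homolog": "MKTAYLAKQR",
    "distant_homolog": "MRSGFLVEQK",
    "gapped_homolog": "MKT--IAKQR",
}

kept = filter_alignment(query, hits, cutoff=50.0)
print(sorted(kept))
```

With the default 50% cutoff, the distant homolog is dropped from the toy alignment and the other two sequences are retained for downstream feature calculation.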

Original language: English (US)
Article number: 166915
Journal: Journal of Molecular Biology
Issue number: 11
State: Published - May 28 2021


Keywords

  • benign variants
  • genetic variations
  • neural network predictor
  • pathogenic variants
  • variant deleteriousness prediction

ASJC Scopus subject areas

  • Biophysics
  • Structural Biology
  • Molecular Biology

