A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes

Research output: Chapter in Book/Report/Conference proceedingChapterResearchpeer-review

Abstract

In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology resulting in generation and conglomeration of vast amounts of data. Though this has led to tremendous advancements in biology, the interpretation of the data poses serious challenges for many practitioners due to the immense size and complexity of the data. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hiding within this data. We present here a simple and intuitive workflow for the research community to investigate and mine this data, not only to extract relevant data but also to segregate usable, quality data to develop hypotheses for investigation and validation. We apply an MS evidence workflow for verifying peptides of proteins from one’s own data as well as publicly available databases. We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.

LanguageEnglish
Title of host publicationProteome bionformatics
EditorsShivakumar Keerthikumar, Suresh Mathivanan
Place of PublicationNew York
PublisherHumana Press Inc.
Chapter13
Pages163-176
Number of pages14
ISBN (Electronic)9781493967407
ISBN (Print)9781493967384
DOIs
Publication statusPublished - 2017

Publication series

NameMethods in Molecular Biology
PublisherHumana Press Inc.
Volume1549
ISSN (Print)1064-3745

Fingerprint

Workflow
Proteome
Computational Biology
Proteomics
Mass Spectrometry
Databases
Software Design
Molecular Sequence Annotation
Gene Ontology
Chromosomes, Human, Pair 7
Biological Science Disciplines
Human Chromosomes
Gold
Proteins
Software
Technology
Peptides
Antibodies
Research
Data Accuracy

Keywords

  • Functional annotation
  • Missing proteins
  • MS evidence
  • MS validation

Cite this

Islam, M., Mohamedali, A., Ahn, S. B., Nawar, I., Baker, M. S., & Ranganathan, S. (2017). A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes. In S. Keerthikumar, & S. Mathivanan (Eds.), Proteome bionformatics (pp. 163-176). (Methods in Molecular Biology; Vol. 1549). New York: Humana Press Inc.. https://doi.org/10.1007/978-1-4939-6740-7_13
Islam, Mohammad ; Mohamedali, Abidali ; Ahn, Seong Beom ; Nawar, Ishmam ; Baker, Mark S ; Ranganathan, Shoba. / A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes. Proteome bionformatics. editor / Shivakumar Keerthikumar ; Suresh Mathivanan. New York : Humana Press Inc., 2017. pp. 163-176 (Methods in Molecular Biology).
@inbook{e2a9effede26482cbe8c5b354417b421,
title = "A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes",
abstract = "In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology resulting in generation and conglomeration of vast amounts of data. Though this has led to tremendous advancements in biology, the interpretation of the data poses serious challenges for many practitioners due to the immense size and complexity of the data. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hiding within this data. We present here a simple and intuitive workflow for the research community to investigate and mine this data, not only to extract relevant data but also to segregate usable, quality data to develop hypotheses for investigation and validation. We apply an MS evidence workflow for verifying peptides of proteins from one’s own data as well as publicly available databases. We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.",
keywords = "Functional annotation, Missing proteins, MS evidence, MS validation",
author = "Mohammad Islam and Abidali Mohamedali and Ahn, {Seong Beom} and Ishmam Nawar and Baker, {Mark S} and Shoba Ranganathan",
year = "2017",
doi = "10.1007/978-1-4939-6740-7_13",
language = "English",
isbn = "9781493967384",
series = "Methods in Molecular Biology",
publisher = "Humana Press Inc.",
pages = "163--176",
editor = "Shivakumar Keerthikumar and Suresh Mathivanan",
booktitle = "Proteome bionformatics",

}

Islam, M, Mohamedali, A, Ahn, SB, Nawar, I, Baker, MS & Ranganathan, S 2017, A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes. in S Keerthikumar & S Mathivanan (eds), Proteome bionformatics. Methods in Molecular Biology, vol. 1549, Humana Press Inc., New York, pp. 163-176. https://doi.org/10.1007/978-1-4939-6740-7_13

A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes. / Islam, Mohammad; Mohamedali, Abidali; Ahn, Seong Beom; Nawar, Ishmam; Baker, Mark S; Ranganathan, Shoba.

Proteome bionformatics. ed. / Shivakumar Keerthikumar; Suresh Mathivanan. New York : Humana Press Inc., 2017. p. 163-176 (Methods in Molecular Biology; Vol. 1549).

Research output: Chapter in Book/Report/Conference proceedingChapterResearchpeer-review

TY - CHAP

T1 - A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes

AU - Islam, Mohammad

AU - Mohamedali, Abidali

AU - Ahn, Seong Beom

AU - Nawar, Ishmam

AU - Baker, Mark S

AU - Ranganathan, Shoba

PY - 2017

Y1 - 2017

N2 - In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology resulting in generation and conglomeration of vast amounts of data. Though this has led to tremendous advancements in biology, the interpretation of the data poses serious challenges for many practitioners due to the immense size and complexity of the data. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hiding within this data. We present here a simple and intuitive workflow for the research community to investigate and mine this data, not only to extract relevant data but also to segregate usable, quality data to develop hypotheses for investigation and validation. We apply an MS evidence workflow for verifying peptides of proteins from one’s own data as well as publicly available databases. We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.

AB - In the past decade, proteomics and mass spectrometry have taken tremendous strides forward, particularly in the life sciences, spurred on by rapid advances in technology resulting in generation and conglomeration of vast amounts of data. Though this has led to tremendous advancements in biology, the interpretation of the data poses serious challenges for many practitioners due to the immense size and complexity of the data. Furthermore, the lack of annotation means that a potential gold mine of relevant biological information may be hiding within this data. We present here a simple and intuitive workflow for the research community to investigate and mine this data, not only to extract relevant data but also to segregate usable, quality data to develop hypotheses for investigation and validation. We apply an MS evidence workflow for verifying peptides of proteins from one’s own data as well as publicly available databases. We then integrate a suite of freely available bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology and biochemical pathways. We also provide an example of the functional annotation of missing proteins in human chromosome 7 data from the NeXtProt database, where no evidence is available at the proteomic, antibody, or structural levels. We give examples of protocols, tools and detailed flowcharts that can be extended or tailored to interpret and annotate the proteome of any novel organism.

KW - Functional annotation

KW - Missing proteins

KW - MS evidence

KW - MS validation

UR - http://www.scopus.com/inward/record.url?scp=85006341241&partnerID=8YFLogxK

U2 - 10.1007/978-1-4939-6740-7_13

DO - 10.1007/978-1-4939-6740-7_13

M3 - Chapter

SN - 9781493967384

T3 - Methods in Molecular Biology

SP - 163

EP - 176

BT - Proteome bionformatics

A2 - Keerthikumar, Shivakumar

A2 - Mathivanan, Suresh

PB - Humana Press Inc.

CY - New York

ER -

Islam M, Mohamedali A, Ahn SB, Nawar I, Baker MS, Ranganathan S. A systematic bioinformatics approach to identify high quality mass spectrometry data and functionally annotate proteins and proteomes. In Keerthikumar S, Mathivanan S, editors, Proteome bionformatics. New York: Humana Press Inc. 2017. p. 163-176. (Methods in Molecular Biology). https://doi.org/10.1007/978-1-4939-6740-7_13