Protein Ontology Project: 2006 Updates
Free (open access)
A. S. Sidhu, T. S. Dillon, B. S. Sidhu & E. Chang
Protein Ontology (PO) is a means of formalizing protein data and knowledge; protein ontology includes concepts or terms relevant to the domain, definitions of concepts, and defined relationships between the concepts. PO integrates protein data formats and provides a structured and unified vocabulary to represent protein synthesis concepts. PO provides integration of heterogeneous protein and biological data sources. This paper discusses the updates that happened to Protein Ontology Project since it was last presented at Data Mining 2005 Conference. Keywords: Protein Ontology, proteomics, bioinformatics, protein informatics, computational proteomics, protein structure, biomedical ontologies, data integration, data semantics. 1 Introduction Traditional approaches to integrate protein data generally involved keyword searches, which immediately excludes unannotated or poorly annotated data. It also excludes proteins annotated with synonyms unknown to the user. Of the protein data that is retrieved in this manner, some biological resources do not record information about the data source, so there is no evidence of the annotation. An alternative protein annotation approach is to rely on sequence identity, or structural similarity, or functional identification. The success of this method is dependent on the family the protein belongs to. Some proteins have
Protein Ontology, proteomics, bioinformatics, protein informatics, computational proteomics, protein structure, biomedical ontologies, data integration, data semantics.