Abstract
Understanding protein families requires the bringing together of many different kinds of data. These families are typically derived from multiple sequence alignments. Directed mutagenesis is one of the most common means of inferring which specific amino acid or set of amino acids are important in the function of a protein. Although there are a large number publicly available, protein specific repositories, e.g., PROSITE, UniProt, and Pfam, no tools exist for experimental biologists that provide a means for managing and visualizing the curation data of the protein families they study at the individual residue level. We present the development of a novel system designed for experimental biologists, called the Curation Alignment Tool for Protein Analysis (CATPA), that allows for the efficient and effective creation, storage, management, and querying of experimentally curated protein families.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., et al.: Genbank: update. Nucleic Acids Research 26, D23–D26 (2004)
Sonnhammer, E., Eddy, S., Birney, E., Bateman, A., Durbin, R.: Pfam - multiple sequence alignments and hmm-profiles of protein domains. Nucleic Acids Research 26, 320–322 (1998)
Bateman, A., Coin, L., Durbin, R., Finn, R., Hollich, V., et al.: The pfam protein families database. Nucleic Acids Research 32, D138–D141 (2004)
Apweiler, R., Bairoch, A., Wu, C., Barker, W., Boeckmann, B., Ferro, S., et al.: Uniprot: the universal protein knowledgebase. Nucleic Acids Research 32, D115–119 (2004)
Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M., Estreicher, A., Gasteiger, E., Martin, E., Michoud, K., O’Donovan, C., Phan, I., Pilbout, S., Schneider, M.: The swiss-prot protein knowledgebase and its supplement trembl in 2003. Nucleic Acids Research 31, 365–370 (2003)
Sigrist, C., Cerutti, L., Hulo, N., et al.: Prosite: a documented databases using patterns and profiles as motif descriptors. Brief Bioinformatics 3, 265–274 (2002)
Falquet, L., Pagni, M., Bucher, P., Hulo, N., Sigrist, C., et al.: The prosite database, its status in 2002. Nucleic Acids Research 30, 235–238 (2002)
Thompson, J., Gibson, T., Plewniak, F., Jeanmougin, F., Higgins, D.: The clustal_x windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research 25, 4876–4882 (1997)
Norman, D.: The Design of Everyday things. Doubleday Currency (1990)
Gene Ontology Consortium: Creating the gene ontology resource: design and implementation. Genome Research 11, 1425–1433 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dalkilic, M.M. (2005). An Architecture and Application for Integrating Curation Data at the Residue Level for Proteins. In: Ludäscher, B., Raschid, L. (eds) Data Integration in the Life Sciences. DILS 2005. Lecture Notes in Computer Science(), vol 3615. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11530084_34
Download citation
DOI: https://doi.org/10.1007/11530084_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27967-9
Online ISBN: 978-3-540-31879-8
eBook Packages: Computer ScienceComputer Science (R0)