BioPP: a tool for web-publication of biological networks
Cellular processes depend on the function of intracellular molecular networks. The curation of the literature relevant to specific biological pathways is important for many theoretical and experimental research teams and communities. No current tool supports web publication or hosting of user-developed, large-scale annotated pathway diagrams. Sharing via web publication is needed to allow real-time access to the current literature pathway knowledgebase, either privately within a research team or publicly among the outside research community. Web publication also facilitates team and/or community input into the curation process while allowing centralized control of the curation and validation process. We have developed a new tool to address these needs. Biological Pathway Publisher (BioPP) is a software suite for converting CellDesigner Systems Biology Markup Language (CD-SBML) formatted pathways into a web-viewable format. The BioPP suite is available for private use and for depositing knowledgebases into a newly created public repository.
The BioPP suite is a web-based application that allows pathway knowledgebases stored in CD-SBML to be web published with an easily navigated user interface. The BioPP suite consists of four interrelated elements: a pathway publisher, an upload web-interface, a pathway repository for user-deposited knowledgebases and a pathway navigator. Users have the option to convert their CD-SBML files to HTML for restricted use or to allow their knowledgebase to be web-accessible to the scientific community. All entities in all knowledgebases in the repository are linked to public database entries as well as to a newly created public wiki, which provides a discussion forum.
BioPP tools and the public repository facilitate sharing of pathway knowledgebases and interactive curation for research teams and scientific communities. The BioPP suite is accessible at http://tsb.mssm.edu/pathwayPublisher/broadcast/
Keywords: Java Virtual Machine; System Biology Markup Language; System Biology Markup; Annotate Pathway; Converted File
Understanding cellular function requires detailed cell-type specific insight into the structure and operation of the molecular networks formed by a cell's genes and proteins. The communication among cellular components is governed by logical and signal transfer processes. These interactions are typically represented as a directed wiring diagram showing aspects of signaling, metabolic and gene pathway connections [3, 4]. Cellular pathway maps capture the relationships among the nodes and provide insight into network topology and the underlying biological responses. Due to the complexity of these networks, there is increasing impetus to create utility-based software tools to construct, visualize, and analyze pathways.
Most pathway maps are created using standalone software tools and stored in modifications of the Systems Biology Markup Language (SBML) [7, 8, 9]. The complexity of biological pathways can lead to the construction of breathtakingly detailed wiring diagrams that are very difficult to use and verify. Network maps published in the literature are fixed in time, do not provide flexible access to the detailed underlying knowledgebase, and are difficult to read [11, 12].
There are several software tools that provide good drawing platforms to build, annotate and visualize biological pathways. Tools that provide automated layout of SBML files have been developed as part of the Systems Biology Workbench [13, 14]. Cytoscape is another excellent open source software environment for the construction of biological networks. We have found that limitations in the symbolic representation of nodes, in drawing and layout tools, and in visual representation in the current releases of the Systems Biology Workbench SBML layout extension and of Cytoscape make it difficult to construct an easily understood diagram for high granularity cell-specific signaling maps. For detailed networks, we find that CellDesigner provides functional graphic tools, pathway visualization and navigation. However, CellDesigner is a standalone desktop application. Because network maps evolve rapidly, this type of implementation makes real-time sharing of the knowledgebase difficult.
In order to facilitate the incorporation of new information and the correction of errors in a detailed pathway map, web-based dissemination of the map and knowledgebase is required. Current web-based pathway resources, such as PANTHER and Reactome [18, 19], do not provide publicly accessible tools for the conversion of user-created pathways into a web-accessible form. We were motivated to develop a pathway publishing tool for CD-SBML files in order to meet the present needs of our curation team. Our experience with the desktop application BioPathwise (BioAnalytics Group LLC), which integrates pathway drawing and web publication of small scale networks, indicated the importance of such a tool for the curation and dissemination of large scale maps constructed in CellDesigner. We believe this will be useful for many research teams that are constructing specialized pathway maps. In developing this tool, we also incorporated several helpful features not present in CellDesigner that assist access to the annotations. In order to fill the need for a mechanism to share specialized pathway maps developed by different groups, we have also established a pathway repository that automatically web-publishes user-created CellDesigner-based maps and knowledgebases.
The BioPP suite consists of four interrelated elements: a pathway publisher, an upload web-interface, a pathway repository and a pathway navigator, each of which is described separately below. The pathway publisher uses information provided through the pathway upload web-interface to web-publish the pathway and/or to populate the pathway repository. The networks in the pathway repository are publicly viewable via the pathway navigator. The BioPP suite is community accessible (see Availability and requirements).
The publisher application, implemented in Perl, runs on a dedicated server that hosts the BioPP suite and is launched through the pathway upload web-interface. The pathway publisher receives the user-uploaded information about the pathway, the associated CD-SBML file and the PNG image exported from CellDesigner 4.0 (alpha). The publisher parses the uploaded files to create a flat HTML file library describing the underlying pathway content and mapping the entity coordinates in the pathway. The hyperlinks on the entities point to the corresponding annotation information. The library contains several types of information: sorted lists of the different types of entities (Protein/Gene/RNA), a sorted list of interactions, the annotation for each interaction and each entity, and the coordinates of the entities.
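The parsing and coordinate-mapping step can be illustrated with a minimal Python sketch (BioPP itself is written in Perl, and its code is not shown in this article). The XML here is a simplified stand-in for CD-SBML: the element and attribute names (`alias`, `bounds`, `x`, `y`, `w`, `h`), the function names and the annotation file-naming scheme are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from html import escape

# Simplified stand-in for a CD-SBML file (hypothetical element names).
SAMPLE_XML = """
<pathway>
  <alias id="a1" name="EGFR" type="PROTEIN"><bounds x="10" y="20" w="80" h="40"/></alias>
  <alias id="a2" name="MYC" type="GENE"><bounds x="200" y="50" w="60" h="30"/></alias>
</pathway>
"""

def build_image_map(xml_text, image_file, map_name="pathway"):
    """Return an HTML fragment: the pathway image plus a clickable <map>."""
    root = ET.fromstring(xml_text)
    areas = []
    for alias in root.iter("alias"):
        b = alias.find("bounds")
        x, y = float(b.get("x")), float(b.get("y"))
        w, h = float(b.get("w")), float(b.get("h"))
        # HTML rect areas take left,top,right,bottom in image pixels.
        coords = ",".join(str(int(v)) for v in (x, y, x + w, y + h))
        href = f"annotations/{alias.get('id')}.html"  # hypothetical naming scheme
        areas.append(
            f'<area shape="rect" coords="{coords}" href="{escape(href)}" '
            f'alt="{escape(alias.get("name"))}">'
        )
    lines = [f'<img src="{escape(image_file)}" usemap="#{map_name}">',
             f'<map name="{map_name}">', *areas, "</map>"]
    return "\n".join(lines)

def sorted_entities(xml_text, entity_type):
    """Sorted entity names of one type, as used for the entity list pages."""
    root = ET.fromstring(xml_text)
    return sorted(a.get("name") for a in root.iter("alias")
                  if a.get("type") == entity_type)

html_fragment = build_image_map(SAMPLE_XML, "pathway.png")
```

Each entity's bounding box becomes one clickable region over the exported image, which is one plausible way to attach hyperlinks to entity coordinates in a flat HTML library.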
The user can also choose to have the network published, using an automated procedure, into the pathway repository. After publication, the converted files are returned to the user (via the browser) in standard compressed zip format. Should the user prefer not to publish the converted pathway in the public pathway repository, the converted files are still returned to the user, and all user-uploaded information and converted files are then deleted from the server. Thus, if desired, users can utilize the publisher to convert pathways for their own use without compromising confidential information.
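The return-and-delete behaviour described above might look like the following Python sketch. It is not BioPP's server code: the function name `package_and_clean`, its parameters and the directory layout are hypothetical, and only standard-library calls are used.

```python
import io
import shutil
import tempfile
import zipfile
from pathlib import Path

def package_and_clean(work_dir, publish_to_repository, repository_dir=None):
    """Zip the converted files; keep a server copy only if the user opted to publish."""
    work_dir = Path(work_dir)
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(work_dir.rglob("*")):
            if path.is_file():
                archive.write(path, path.relative_to(work_dir).as_posix())
    if publish_to_repository:
        # Hypothetical repository layout: one sub-directory per pathway.
        shutil.copytree(work_dir, Path(repository_dir) / work_dir.name)
    # The temporary working copy is removed in both cases.
    shutil.rmtree(work_dir)
    return buffer.getvalue()

# Usage: a private conversion leaves nothing behind on the server.
tmp = Path(tempfile.mkdtemp())
(tmp / "index.html").write_text("<html></html>")
zip_bytes = package_and_clean(tmp, publish_to_repository=False)
```

Building the archive in memory before deleting the working directory means the bytes sent to the browser never depend on files that persist after the request.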
Results and discussion
The BioPP suite provides a unique presentation of biological pathways that facilitates sharing relevant information with the research team or the scientific community. BioPP converts the annotated pathways stored in CellDesigner's SBML format (CD-SBML) into an HTML library used for web-publication of the pathways. As CellDesigner permits import and conversion of the standard SBML format into CD-SBML, BioPP can be used to convert any pathway stored in the standard SBML format. BioPP therefore permits easy, flexible and quick access to large amounts of biological information. The pathways are presented in several HTML frames (Fig. 4), each with a different type of useful information. We define the primary frame as the network frame and the other frames as the introduction, legend, interactions, proteins, genes, and RNAs frames (Fig. 4).
The network frame contains a click-enabled image of the biological network. The entities and interactions with supporting annotation, the pathway graphics legend and other pertinent information assist exploration of the network. The network is displayed through a Java applet, which enables panning of the network via an index window that contains a smaller image of the map. Dragging a zoom rectangle inside the index window displays the underlying map location in the main frame.
All entities in the main frame are hyperlinked to the corresponding annotations, if available. All annotations are presented in the annotation frame. For a chosen entity, the annotation frame displays a list of the interactions in which the entity is involved and the annotation corresponding to that entity, if any. Each interaction in this list is hyperlinked to the corresponding interaction annotation, if available. In the introduction and legend frames, we present a brief user-specified description of the network and the CellDesigner legend of the representations in the network.
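The relationship between the zoom rectangle in the index window and the region shown in the main frame is a simple scaling between two images of the same map. The Python sketch below illustrates that arithmetic only. The function name and the pixel sizes are hypothetical, and the actual applet is not described in enough detail here to say how it performs this step.

```python
def index_to_map_rect(rect, index_size, map_size):
    """Scale a rectangle (x, y, w, h) from index-window pixels to full-map pixels."""
    sx = map_size[0] / index_size[0]  # horizontal scale factor
    sy = map_size[1] / index_size[1]  # vertical scale factor
    x, y, w, h = rect
    return (x * sx, y * sy, w * sx, h * sy)

# A 50x25 rectangle dragged on a 200x100 thumbnail of a 4000x2000 map.
view = index_to_map_rect((20, 10, 50, 25),
                         index_size=(200, 100), map_size=(4000, 2000))
```

With these example sizes both scale factors are 20, so the selected region of the full map starts at (400, 200) and spans 1000 by 500 pixels.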
Experience suggests that lists of the proteins, genes, and interactions involved in an existing pathway are valuable to curators and accelerate construction of a pathway. The interactions, proteins, genes, and RNAs frames are populated with sorted lists of interactions, proteins, genes and RNAs, respectively. Every interaction that has relevant annotation is hyperlinked; clicking it displays the corresponding annotation in the annotation frame. Note that several interactions may be annotated with the same information, e.g. citations. In order to avoid redundancy, the tool ensures that all the annotation HTML files are unique. As a result, where necessary, several interactions are hyperlinked to the same annotation information. In addition, the list of interactions annotated by the currently displayed information is presented in the annotation frame. Similarly, all the proteins, genes, and RNAs in the respective frames are hyperlinked.
An entity-specific link to NCBI's Entrez Gene page, which opens in a new window, is provided for every protein, gene, and RNA wherever it appears. In addition, a link to a master, public entity-based wiki page is provided as a forum for community input and discussion.
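One way to keep annotation files unique while still linking every interaction is to key each file on its content. The Python sketch below assumes a content hash is used for the file name. The article does not say how BioPP detects duplicates, so the hashing scheme, the function name and the sample data are illustrative only.

```python
import hashlib
from collections import defaultdict

def deduplicate_annotations(annotations):
    """annotations: dict of interaction id -> annotation text.

    Returns (files, links, reverse):
      files   - unique annotation file name -> text
      links   - interaction id -> annotation file name
      reverse - annotation file name -> interactions sharing that annotation
    """
    files, links = {}, {}
    reverse = defaultdict(list)
    for interaction_id, text in sorted(annotations.items()):
        digest = hashlib.sha1(text.encode("utf-8")).hexdigest()[:12]
        filename = f"annot_{digest}.html"  # hypothetical naming scheme
        files.setdefault(filename, text)   # identical text is stored once
        links[interaction_id] = filename
        reverse[filename].append(interaction_id)
    return files, links, dict(reverse)

files, links, reverse = deduplicate_annotations({
    "r1": "Shared citation text",
    "r2": "Shared citation text",
    "r3": "A different citation",
})
```

The `reverse` mapping corresponds to the list of interactions shown alongside the currently displayed annotation: here `r1` and `r2` point to the same file, and that file lists both of them.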
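Generating the per-entity links could be as simple as the following Python sketch. The URL patterns are assumptions: the NCBI address is the present-day Gene search form rather than whatever BioPP used at the time, and `WIKI_BASE` is a placeholder, since the wiki address is not given in this article.

```python
from urllib.parse import quote, urlencode

NCBI_GENE_SEARCH = "https://www.ncbi.nlm.nih.gov/gene/"  # assumed URL pattern
WIKI_BASE = "https://wiki.example.org/"                  # hypothetical placeholder

def entity_links(symbol):
    """Return HTML with an external database link and a wiki link for one entity."""
    gene_url = NCBI_GENE_SEARCH + "?" + urlencode({"term": symbol})
    wiki_url = WIKI_BASE + quote(symbol, safe="")
    # target="_blank" opens the database page in a new window.
    return (f'<a href="{gene_url}" target="_blank">Entrez Gene</a> | '
            f'<a href="{wiki_url}">Wiki</a>')

links_html = entity_links("TP53")
```

Percent-encoding the symbol keeps entity names that contain spaces or slashes from breaking the generated links.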
A completely automated database of biological pathways (with hyperlinks to pertinent annotations) is publicly available (see Availability and requirements). The current interface supports (a) conversion of the original pathway into a web-publishable form and (b) web publication of the pathway in our database. In both cases, the software requires the user to upload the source CD-SBML file and the associated image. Unless the user specifically directs BioPP to publish the pathway to the repository, the server retains no information from the user about the pathway or the HTML pages that are delivered.
The BioPP suite provides easy-to-use tools for web-publication and viewing of pathways and establishes a new public annotated pathway repository. These tools and the repository should be useful to pathway curators, to research teams constructing specialized and cell-type specific pathways, and to the general research community.
Availability and requirements
Project Name: Biological Pathway Publisher
Project home page: http://tsb.mssm.edu/pathwayPublisher/broadcast/
Operating systems: platform independent
Other requirements: Browser supporting Java Virtual Machine (JVM). For viewing large pathways, additional memory allocation to JVM may be required (for instructions on memory allocation see http://tsb.mssm.edu/pathwayPublisher/broadcast/FAQs.html)
This work was supported by NIH NIAID Contract No. HHSN2662000500021C and Grant U19 AI62623. We thank Boris Hartmann and Jeremy Seto for helpful discussions and J.S. for establishing the Wiki. We thank G. Scott Lett and Ned Haubein of The BioAnalytics Group for helpful advice, discussion and critique of the manuscript, and acknowledge the seminal contributions of Nick Aschenbach leading to the development of pathway web export tools.
- 6. Hoffmann R, Krallinger M, Andres E, Tamames J, Blaschke C, Valencia A: Text mining for metabolic pathways, signaling cascades, and protein networks. Sci STKE 2005, 2005: e21.
- 8. Hucka M, Finney A: Escalating model sizes and complexities call for standardized forms of representation. Volume 1. Mol Sys Biol; 2005.
- 9. Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, Arkin AP, Bornstein BJ, Bray D, Cornish-Bowden A, Cuellar AA, Dronov S, Gilles ED, Ginkel M, Gor V, Goryanin II, Hedley WJ, Hodgman TC, Hofmeyr JH, Hunter PJ, Juty NS, Kasberger JL, Kremling A, Kummer U, Le NN, Loew LM, Lucio D, Mendes P, Minch E, Mjolsness ED, Nakayama Y, Nelson MR, Nielsen PF, Sakurada T, Schaff JC, Shapiro BE, Shimizu TS, Spence HD, Stelling J, Takahashi K, Tomita M, Wagner J, Wang J: The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 2003, 19: 524–531. 10.1093/bioinformatics/btg015
- 10. Hucka M, Finney A, Bornstein BJ, Keating SM, Shapiro BE, Matthews J, Kovitz BL, Schilstra MJ, Funahashi A, Doyle JC, Kitano H: Evolving a lingua franca and associated software infrastructure for computational systems biology: the Systems Biology Markup Language (SBML) project. Syst Biol (Stevenage) 2004, 1: 41–53. 10.1049/sb:20045008
- 16. Funahashi A, Tanimura N, Morohashi M, Kitano H: CellDesigner: a process diagram editor for gene-regulatory and biochemical networks. BIOSILICO; 2003.
- 18. Joshi-Tope G, Vastrik I, Gopinath GR, Matthews L, Schmidt E, Gillespie M, D'Eustachio P, Jassal B, Lewis S, Wu G, Birney E, Stein L: The Genome Knowledgebase: a resource for biologists and bioinformaticists. Cold Spring Harb Symp Quant Biol 2003, 68: 237–243. 10.1101/sqb.2003.68.237
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.