STSE: Spatio-Temporal Simulation Environment Dedicated to Biology
- 4.2k Downloads
Recently, the availability of high-resolution microscopy together with the advancements in the development of biomarkers as reporters of biomolecular interactions increased the importance of imaging methods in molecular cell biology. These techniques enable the investigation of cellular characteristics like volume, size and geometry as well as volume and geometry of intracellular compartments, and the amount of existing proteins in a spatially resolved manner. Such detailed investigations opened up many new areas of research in the study of spatial, complex and dynamic cellular systems. One of the crucial challenges for the study of such systems is the design of a well stuctured and optimized workflow to provide a systematic and efficient hypothesis verification. Computer Science can efficiently address this task by providing software that facilitates handling, analysis, and evaluation of biological data to the benefit of experimenters and modelers.
The Spatio-Temporal Simulation Environment (STSE) is a set of open-source tools provided to conduct spatio-temporal simulations in discrete structures based on microscopy images. The framework contains modules to digitize, represent, analyze, and mathematically model spatial distributions of biochemical species. Graphical user interface (GUI) tools provided with the software enable meshing of the simulation space based on the Voronoi concept. In addition, it supports to automatically acquire spatial information to the mesh from the images based on pixel luminosity (e.g. corresponding to molecular levels from microscopy images). STSE is freely available either as a stand-alone version or included in the linux live distribution Systems Biology Operational Software (SB.OS) and can be downloaded from http://www.stse-software.org/. The Python source code as well as a comprehensive user manual and video tutorials are also offered to the research community. We discuss main concepts of the STSE design and workflow. We demonstrate it's usefulness using the example of a signaling cascade leading to formation of a morphological gradient of Fus3 within the cytoplasm of the mating yeast cell Saccharomyces cerevisiae.
STSE is an efficient and powerful novel platform, designed for computational handling and evaluation of microscopic images. It allows for an uninterrupted workflow including digitization, representation, analysis, and mathematical modeling. By providing the means to relate the simulation to the image data it allows for systematic, image driven model validation or rejection. STSE can be scripted and extended using the Python language. STSE should be considered rather as an API together with workflow guidelines and a collection of GUI tools than a stand alone application. The priority of the project is to provide an easy and intuitive way of extending and customizing software using the Python language.
KeywordsGraphical User Interface Python Script Binary Mask Simulation Engine Polygonal Mesh
With the availability of high-resolution microscopy and high-throughput technologies in molecular biology the amount of cellular images in very good resolution quality increases significantly. Such amount of available data consecutively demands for image analysis software adapted to utilize the full capacity of these imaging advancements. The state-of-the-art way of presenting, assessing and evaluating experimental images qualitatively is being increasingly replaced by computational data evaluation. Quantification of e.g. light intensities arising from fluorescent protein (FP) expression in different cellular compartments can be ascertained in a spatially resolved manner and enables us to mathematically verify the current understanding of biological systems.
Unambiguous and reproducible computational extraction increases the quality and exchangeability of information for subsequent automatic processing steps such as digitization, representation, analysis, and modeling. A variety of image processing-, analysis- or modeling-packages addressing these tasks exist already, either on a commercial basis or as open source software.
Recently, several eminent reviews have been published which outline the most common methods and tools addressing biological image processing, analysis and modeling (see [1, 2, 3]). One of the key conclusions is that these tasks are usually separately addressed. Cell segmentation and property extraction, for example, are well established and can be realized by dedicated software such as CellProfiler , Cell-ID  or generic image processing platforms like Labview (National Instruments, Austin, USA) or Imaris (Bitplane, Zurich, Switzerland). A widely used and freely available tool is ImageJ , which comprises standard segmentation algorithms as well as surface or profile plots. Also freely available are additional packages for R like EBImage , which can be used for the segmentation and analysis steps. When it comes to spatial modeling and simulation in microbiology one can distinguish the following classes of dedicated simulators i) spatially partitioned ODE systems (e.g. Virtual Cell) ii) spatially partitioned Gillespie systems (e.g. MesoRD , SmartCell ) iii) particle-based simulators (e.g. Smoldyn , MCell , Meredys ). These techniques differ mostly with respect to the mathematical framework which changes the level of detail represented in the system (e.g. spatially partitioned ODEs are giving the overview of the system and can be used at tissue scale and large time scale, whereas particle based simulators are able to represent the molecule-scale details, however time scale needs to be importantly shortened).
The STSE platform intends to close the gaps between various tools or software-packages that are in majority specifically designed for these separate steps (i.e image processing/analysis or modeling/simulation). By providing the workflow guidelines and the access to Python language, it offers the advantage of stratifying the interaction with different data-structures and thus minimizes the loss of time and information during the manual export and conversion processes. Therefore, it should be seen as a set of tools facilitating the intuitive workflow between the image analysis tools and simulators.
Additionally, in its current implementation, it provides examples of how to perform such a transition from segmenting tools to simulation engines implemented internally in Python (spatially partitioned ODEs). For these purposes STSE comprises modules for digitizing and representing microscopy data, enables data analysis as well as manipulation, and can be used for mathematical modeling and simulation of spatial distributions of chemical species. It is a powerful, multifaceted tool for interdisciplinary work.
Preprocessing of microscopic images for the studied object.
Definition of a discrete representation of the images.
Automatic integration of the information from images into the discrete representation.
Analyzing the digitized data.
Formulating a model: defining interactions between regions of interest and molecules of interest.
Running a model: previously digitized images are used as initial conditions for the evaluation of simulation results.
A detailed use case as well as comparative studies with some of the above mentioned state-of-the-art tools is provided in the additional file 1. Additionally, the webpage of the project contains examples, video tutorials, access to a discussion group and other helpful information sources. In the following, we give a concise overview of the fundamental methods used in STSE:
Spatial Segmentation and Digitization
The process of digitization generates a data structure, allowing for efficient analysis, representation and modeling. The classical approach is to decompose the microscopy image into physiologically distinguishable compartments (e.g. nucleus, cytoplasm, etc.) which is called image segmentation [13, 14]. Usually, image segmentation results in a data structure linking the compartments with pixels. STSE differs from this approach by introducing an abstract, intermediate layer composed of so-called subcompartments. To generate this layer, each compartment is divided into subcompartments which have the geometry of polygons and are organized in such a way that they fill the entire compartment and do not overlap with each other. The default geometry is automatically composed of equilateral hexagons. The purpose of introducing this abstract layer is to allow for adjusting the digitization precision separately for different compartments, which is useful when it comes to analysis and modeling. To edit the geometry of subcompartments a Voronoi 2D tessellation is used . With the help of the graphical user interface (GUI) editor, the user may move subcompartment centers (corresponding to the vertices in the Voronoi graph) for fine-tuning. This information implicitly specifies the geometry of each subcompartment. Since these subcompartments share edges, the representation resembles a polygonal mesh (PM).
Each subcompartment has an individual geometry as well as other user-customizable properties such as cellular compartment affiliation, concentrations of specific substances, etc. The GUI allows for user-friendly inspecting and editing of these properties. Additionally, due to the software implementation design, it is possible to extend the GUI editor by adding custom actions as well as to script the GUI with Python. One of the main goals of STSE is to provide the possibility of framework extension and customization to the users.
With STSE it is possible to acquire spatial luminosity information from microscopic images, which can correspond e.g. to the inhomogeneous distribution of tagged molecules within the cell. This process is performed on indexed color images (e.g. FP microscopic images). This is an important feature, since it allows for the comparison of simulation results with experimental data.
Representation and Analysis
Image representation is performed implicitly by the conversion of the Voronoi-based PM to an internal STSE data structure. This design involves less constraints and thus allows for more latitude in defining polygonal geometries (e.g. including non-convex ones) as well as physiological information. It is realized by storing the polygon corner coordinates explicitly in the data structure instead of computing them using the Voronoi algorithm. The datastructure may be easily modified or inspected via Python. This allows for simulating structures changing in time, which has been, for instance, successfully used in the dynamic modeling of meristem growth . The analysis is effected via the STSE-GUI as well as with Python scripts and enables a comprehensive and differentiated overview of topological, geometrical and physiological information. The routines provided by STSE allow for visualizing and inspecting compartment properties and can be used for computing different properties and for further, computational analysis of data from images. All structural information can be exported and saved for persistence and dissemination.
The digitized data can be used directly to perform spatial modeling (e.g. as initial conditions or evaluation). STSE does not restrict the user with the simulation framework. Instead, we suggest a workflow based on the so-called "cell-centered" finite volume method . According to this scheme, a mechanistic model of a studied process needs to be formalized using a set of ordinary differential equations (ODEs) describing the interplay of different actors (e.g. chemical molecules) and different cellular compartments with specified kinetic rules on diffusion, chemical reactions, transport, etc. In this case a SciPy library  may be used to solve the system within the STSE framework.
Due to its design STSE is fully extendable via Python. The simulation engine can be freely connected with multitude of solutions limited only by the accessibility of these engines via Python.
In the following, we demonstrate how to use STSE to analyze and simulate biological systems. A typical STSE workflow includes the modules for digitization, representation, analysis and modeling is presented using the running example of a mitogen-activated protein kinase gradient formation (the double-phosphorylated Fus3 (Fus3 PP ) in a mating yeast cell .
Fus3 signaling is part of the yeast mating pheromone pathways: upon stimulation with the pheromone α-factor, an intracellular signaling cascade becomes activated, which leads to the double phosphorylation of Fus3. The Fus3 PP gets released at the shmoo tip and can diffuse within the cell, which results in an observable Fus3 PP gradient. When reaching the nucleus, Fus3 PP is actively transported across the nuclear membrane and regulates transcription factors that modulate mating-specific gene expression. We would like to stress that the focus is set rather on the software specifications and the application scenario than on the biological results. To simplify the analysis and to facilitate the usage of examples in a confirmatory way, we work on test data, inspired by the experiments and explanations presented by Maeder et al. .
An extended workflow comprising amongst others the following examples is given and discussed in the additional file 1. Additionally, video tutorials covering this subject and all Python scripts we use to produce the here presented images and results are provided on the project homepage.
In a first step we demonstrate the analysis and characterization of the Fus3 PP gradient. For this purpose we:
Quantify the ratio of the average cytoplasm/nucleus expression of Fus3 PP based on fluorescence signal intensity acquired from microscopy images,
Show gradient curves for Fus3 PP along the x-axis of the cell data image and around the nucleus,
Simulate the process of Fus3 PP diffusion in the cytoplasm to determine the underlying conditions that lead to the qualitative values captured in the image.
We evaluate the results of the simulations and discuss i) whether the appearance of a Fus3 PP gradient throughout the cell can be explained by simple diffusion and ii) how to define plausible conditions and model parameters allowing to reproduce the experimental observations.
Representation and analysis
The distribution of Fus3 PP in the cytoplasm along the x-axis in a central part of the analyzed cell is exponential (Figure 6c),
The distribution of Fus3 PP around the nucleus reaches its maximum in the point closest to the shmoo tip (Figure 6d),
The average Fus3 PP signal in the nucleus is 64.0 (a.u.), which is ≈ 25% of the maximal signal measured in the image,
The average Fus3 PP signal in the cytoplasm is 52.07 (a.u.) (which is ≈ 20% of the maximal signal measured in the image),
The ratio of averaged Fus3 PP signal in the cytoplasm to nucleus is equal to 0.81.
Comparative values computed with ImageJ showed an average Fus3 PP signal in the nucleus of 64.0 (a.u.) and an average Fus3 PP signal in the cytoplasm of 51.27 (a.u.), resulting in a ratio of averaged Fus3 PP signal in the cytoplasm to nucleus of 0.801. Which means the relative error due to approximation (i.e. downsampling because of using averaged values from PM instead of actual pixel values) in our example is (0.81 - 0.801)/0.801 ≈ 1%.
The previously acquired, quantified and structured data can be used to create a dynamic model of the Fus3 PP diffusion. According to the STSE dataflow paradigm (see Figure 1), the mechanistic model of the studied process needs to be formalized. By focusing on the properties of the Fus3 PP gradient we can exclude processes such as i) mechanisms of the stimulation of Fus3 and ii) different mechanisms allowing Fus3 PP to enter the nucleus. The kinetic model of Fus3 PP is now defined as follows:
Fus3 PP appears in the shmoo tip compartment,
Fus3 PP diffuses freely in the cytoplasm compartment,
Fus3 PP gets dephosphorylated during the diffusion in the cytoplasm,
Fus3 PP is unable to cross the cellular/nuclear membrane compartments.
FUS3 PP i is the concentration of Fus3PP in the subcompartment i,
γ FUS3 PP is the diffusion constant for Fus3PP,
α FUS3 PP is the rate of Fus3PP release in the shmoo tip,
β FUS3 PP is the rate constant of Fus3PP dephosphorylation,
Si→nis the area of contact surface between subcompartments i and n,
V i is the volume of subcompartment i,
i ∈ A /i ∈ B if i belongs to cytoplasm/shmoo tip compartment,
N i is a set of neighbour subcompartments for subcompartment i,
To complete the model it is required to define the rate of Fus3PP release in the shmoo tip (α FUS3 PP ), the rate constant of Fus3PP dephosphorylation βFUS 3 PP , the diffusion constant γ FUS3 PP and the initial conditions. All values can be estimated from the literature or chosen arbitrarily. Additionally, initial conditions can be acquired from the digitization step of the image data. An exemplary implementation of the Fus 3 PP model can be downloaded from the project homepage. An animation showing the kinetics of Fus 3 PP distribution obtained with the implemented model is available as additional file 2.
We present STSE, a platform that facilitates execution of spatial simulations based on microscopy images. The application of the STSE is demonstrated on the example of the yeast pheromone MAP kinase cascade, focusing in particular on the distribution of the double-phosphorylated Fus3.
We demonstrate how to quantify the ratio of the average cytoplasm/nucleus expression of Fus 3 PP based on fluorescence signal intensity acquired from microscopy images, create gradient curves for Fus 3 PP along the x-axis of the cell and around the nucleus, and simulate the process of Fus 3 PP diffusion in the cytoplasm to determine the underlying mechanism.
The result of the simulations allow us to confirm that a set of hypothesis used in the model allows us to reproduce the experimental observations. We demonstrate also how to use the STSE to discriminate between model parameter sets.
The running example model yields the highest error in the nuclear compartment. This is consistent with our expectations: the test data set images suggest that Fus 3 PP is present in the nuclear compartment (Figure 4), but in the specification of the model we skipped the mechanism of Fus 3 PP transport via membranes, which results in the Fus 3 PP concentration in the nuclear compartment being equal to 0. To correct this property, the model should be extended by an assumption of Fus 3 PP transport via the nuclear membrane.
In the additional file 1 STSE modules are also compared with a selection of other available software tools which allow to perform each of the workflow substeps separately i.e. digitization, representation, analysis or modeling.
Results from STSE are confronted with results achieved by the use of the software ImageJ. Both tools allow for computing the ratio between the Fus 3 PP signal in the cytoplasm and the nucleus. Although STSE uses the approximation with subcompartments (which are very usefull when it comes to the simulation task) the approximation error in this example is below 1%. Likewise, the exponential decrease of Fus3PP along the cell center as well as the increase in the nucleus can be captured reasonably well with both tools. Nevertheless, if one is interested in the distribution of Fus3PP over the whole cell, there is no constitutive way to do so in ImageJ (however plugins allow to perform similar actions). Furthermore, in STSE it is possible to plot various profiles for any selected subcompartments.
Binary masks can be used for automatization in STSE. The choice of the segmentation algorithm to generate those binary masks depends highly on the particular problem and there already exist a multitude of advanced software packages dedicated to this task. Therefore in the current version of the STSE we decided to use already segmented images as the starting point of the proposed workflow and leave the choice of the optimal segmentation method to the user.
STSE conceptually differs from selection-based tools by operating on abstract subcompartments rather than on pixels directly. This approach results in the simplification of further processing and it allows for inhomogenous precision in different cellular compartments.
The precision is regulated in STSE by decreasing the subcompartment size. However, the increase in precision induces slower execution of the modeling routines. It is the modeler's choice to prepare the grid in such a way that both, the precision and speed of modeling routines is optimal. Also it should be kept in mind that, by using larger subcompartments, the assumption of homogeneously distributed molecules within one subcompartment might be disregarded.
In the additional file 1 we also compare STSE with a selected simulation engine, MesoRD , and discuss the main differences, pros and cons. It is important to understand that the scopes of these softwares are different: MesoRD is a mesoscopic, stochastic simulation engine, whereas STSE covers much broader area, but it does not provide a sophisticated simulation engine as such. Our future goal is to provide integration of STSE with a selection of existing simulation software. This would allow users to use state-of-the-art solutions for simulation, keeping the ease of validating the models with microscopy data, as demonstrated in our example.
STSE is a software platform, designed for constructing microscopy image-based simulations. It allows for an uninterrupted workflow including digitization, representation, analysis, and mathematical modeling. The main benefit of STSE is that it acts as a "glue" between different steps of the workflow, allowing the user to tailor the platform to their specific needs. Due to its open, modular architecture and integration of Python language, the software allows for full automatization (it applies also to GUI) via scripts, which is usually not possible or very limited with other stand alone applications.
Further versions of the STSE should provide integration of selected 3rd party simulators and simulation paradigms (e.g. stochastic, agent-based). It would be also crucial to support import and export of SBML (Systems Biology Markup Language) files. For the latter, a prior establishment of a standard for spatial modeling would be required. We strongly encourage the community to provide examples of various microscopy based simulation workflows, which we would be glad to integrate into STSE framework.
Availability and requirements
Project name: STSE
Project home page: http://stse-software.org
Operating system(s): Linux (availability on other systems depends on 3rd party libraries)
Programming language: Python
Other requirements: Openalea http://openalea.gforge.inria.fr/, Mayavi2 http://code.enthought.com/projects/mayavi/, Qhull http://www.qhull.org/, NetworkX http://networkx.lanl.gov/. It is also possible to use the software directly from a live DVD Linux distribution, SB.OS http://www.sbos.eu/, which comes with a comprehensive list of other systems biology software.
License: GNU GPL
The authors thank Jan Traas and Christophe Godin for early support of STSE development (known at this time point as mersim). We would like to thank Marta Hoffman-Sommer for the proof reading of the manuscript. This work was supported by the Marie Curie Research Training Network Aquaglyceroporins (LSHG-CT-2006-035995-2), the German Ministry for Eduction and Research (BMBF grant for the SysMO project Translucent), the German Research Foundation (DFG grant SFB 740) and by the Swiss HP2C-initiative "Swiss Platform for High-Performance and High-Productivity Computing".
- 6.Abramoff M, Magalhaes P, Ram S: Image processing with ImageJ. Biophotonics International 2004, 11: 36–42.Google Scholar
- 9.Ander M, Beltrao P, Di Ventura B, Ferkinghoff-Borg J, Foglierini M, Kaplan A, Lemerle C, Tomás-Oliveira I, Serrano L: SmartCell, a framework to simulate cellular processes that combines stochastic approximation with diffusion and localisation: analysis of simple networks. Systems biology 2004, 1: 129–138. 10.1049/sb:20045017CrossRefPubMedGoogle Scholar
- 11.Stiles JR, Bartol TM, Salpeter MM, Salpeter EE, Sejnowski TJ: Synaptic Variability: New Insights from Reconstructions and Monte Carlo Simulations with MCell. Synapses 2001, 1: 681–731.Google Scholar
- 12.Tolle DP, Le Novère N: Meredys, a multi-compartment reaction-diffusion simulator using multistate realistic molecular complexes. BMC systems biology 2010, 4.Google Scholar
- 20.Iverson KE:A Programming Language. John Wiley & Sons; [http://www.worldcat.org/isbn/0471430145]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.