BioUML

BioUML is an open source integrated Java platform for analysis of data generated by omics technologies using advanced tools of computational biology. The BioUML vision is to provide a computational platform to build virtual cell and virtual physiological human. BioUML spans a comprehensive range of capabilities, including access to databases with experimental data, tools for formalized description of biological systems structure and functioning, as well as tools for their visualization, simulation, parameters fitting and analyses. Due to scripts (R, JavaScript) and workflow support it provides powerful possibilities for analyses of high-throughput data. The plug-in based architecture (Eclipse run time from IBM is used) allows to add new functionality using plug-ins.

The whole system aims at covering, with time, all areas of computational applications in bioinformatics and systems biology. The architecture is open, so that users' own scripts can be easily loaded into the system, and new modules can be programmed and added by any skilled person. The community is invited to contribute to the development, either as public-domain or as a commercial part of the platform. The developers are confident that this way a most powerful and needed system of tools can grow.

The team also hopes that BioUML will contribute to creation of virtual cell, virtual physiological human and virtual patient, which would be extremely useful in medicine as a means of computational identification of the most effective therapeutic interventions and for prediction of potential outcomes of intended treatments for a given patient.

Principles

Visual modeling

Visual modeling within the data workflow in BioUML

Reconstruction of complex biological systems from a huge amount of experimental data requires a formal language that can be easily understood both by the human and the computer. It is known that graphic depiction of complex systems is the most suitable way of understanding their structure by humans. Graphic notation allows humans to completely and formally specify model so computer programs can analyze the model and simulate its behavior.

This approach is widely used in engineering and computer science. Some examples are:

MATLAB/Simulink (http://www.mathworks.com)
AnyLogic (http://www.xjtek.com) - muli-method simulation software
UML (http://www.omg.org/uml/) - the most known graphical language for computer science.

BioUML adopts the visual modeling approach for formal description and simulation of complex biological systems.

Another distinctive feature of BioUML is a tight integration with databases on biological pathways, query engine allows user to find interacting components of the system and show results as an editable graph.

BioUML also fully exploits principles of modular modeling.

Meta-model

System of two consecutive chemical reactions (a), its formal description using three meta model levels (b), and corresponding mathematical model (c), that can be generated automatically for system simulations

The core of BioUML is a meta-model. It provides an abstract layer (compartmentalized attributed graph) for comprehensive formal description of wide range of biological and other complex systems. The content of databases on biological pathways, SBML (Hucka M. et al., 2003) and CellML(Lloyd C.M. et al., 2004) models, as well as biological pathways in BioPAX format can be expressed in terms of the meta model and used by the BioUML workbench.

This formal description can be used both for visual depiction and editing of biological system structure and for automated code generation to simulate a model behavior. Meta-model is problem domain neutral and splits the system description into 3 interconnected levels:

graph structure - the system structure is described as compartmentalized graph;
database level - each graph element can contain reference to some database object;
mathematical model - any graph element can be element of mathematical model.

BioUML supports the following mathematical elements: variable, formula, equation, event, state and transition.

The figure demonstrates how this approach is applied to modeling a system consisting of two consecutive chemical reactions. Here the graph nodes representing chemical substances are considered as variables and the corresponding graph edges contain right parts of corresponding differential equations. Using this information the BioUML workbench can generate MATLAB or Java code for model simulation.

Special BioUML diagrams markup language (DML) is developed to store BioUML meta model instance in XML format. Diagram description is divided into two parts:

graph structure - it describes location of diagram elements and contains references to associated with them database objects;
executable model - stores mathematical model associated with graph.

Detailed description of DML format is available at http://www.biouml.org/dml.shtml

Reproducible research

Reproducible research is the common name for functionality for saving, representing and reproducing a set of data manipulation. BioUML provides special Project collections, workflows and research diagrams for this purpose.

Collaborative research

Collaborative research is any research project, especially a large one involving various areas of expertise, that is carried out by at least two people. To enable collaborative research BioUML sustains the following functionality:

sharing projects with other users,
collaborative diagram editing,
access to diagram editing history (both session and version),
in-built group chat.

Architecture overview

BioUML platform consists of 3 parts:

BioUML server - provides access to data and analyses methods installed on the server side for BioUML clients (workbench and web edition) via the Internet.
BioUML workbench - Java application that can work standalone or as "thick" client for BioUML server.
BioUML web edition - "thin" client for BioUML server (you just need to start web browser) that provides most of functionality of BioUML workbench. It uses AJAX and HTML5 <canvas> technology for visual modeling and interactive data editing.

Plug-in based architecture

Plug-in based architecture scheme with extension points shown as sockets and plug-ins as plugs

The plug-in based architecture determines extensibility of the BioUML platform. This functionality is provided by Eclipse platform runtime kernel.

The basic components of the plug-in based architecture are:

Plugin, the smallest unit of BioUML workbench function that can be developed and delivered separately into the BioUML workbench. Plug-ins are coded in Java. A typical plug-in consists of Java code in a JAR library, some read-only files, and other resources such as images, message catalogs, native code libraries, etc. A plug-in is described in an XML manifest file, called plugin.xml. The parsed contents of plug-in manifest files are made available programmatically through a plug-in registry API provided by Eclipse runtime.
Extension points, well-defined function points in the system where other plug-ins can contribute functionality.
Extension, a specific contribution to an extension point. Plug-ins can define their own extension points, so that other plug-ins can integrate tightly with them.

BioUML

Contents

Principles

Visual modeling

Meta-model

Reproducible research

Collaborative research

Architecture overview

Plug-in based architecture

See also

Personal tools

Namespaces

Variants

Views

Actions

Search

BioUML platform

Community

Modelling

Analysis & Workflows

Collaborative research

Development

Virtual biology

Wiki

Toolbox