Difference between revisions of "Mapping to ontologies for multiple gene sets (TRANSPATH(R)) (workflow)"

From BioUML platform
Jump to: navigation, search
(Automatic synchronization with BioUML)
(Automatic synchronization with BioUML)
 
Line 6: Line 6:
 
[[File:Mapping-to-ontologies-for-multiple-gene-sets-TRANSPATH-R-workflow-overview.png|400px]]
 
[[File:Mapping-to-ontologies-for-multiple-gene-sets-TRANSPATH-R-workflow-overview.png|400px]]
 
== Description ==
 
== Description ==
This workflow classifies multiple input gene sets using several ontologies and identifies categories that are over-represented in each of the input sets. The input is a folder containing several gene or protein tables.  
+
This workflow is designed to classify an input gene set to several ontologies and to identify terms, hits for which are overrepresented in the input set. The input is a folder containing several gene or protein tables, and these tables are taken automatically by the workflow, one input table after another, in a cycle.
  
In the first step, the first table from the input folder is converted into a gene table with Ensembl IDs.
+
At the first step, one of the input tables from the input folder is converted into a table with Ensembl Gene IDs.
  
In the second step, the table with Ensembl IDs is submitted to the ''Functional classification'' analysis, which is done in parallel using the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, TRANSPATH® pathways and TF classification.
+
The table with Ensembl Gene IDs is subjected to functional classification, which is done in parallel by the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, Transpath® pathways, TF classification.
  
The first and second steps are repeated for the second table from the input folder, and it is repeatedly performed for each table from the input folder.
+
For each ontological term several parameters are calculated, including expected number of hits, actual number of hits, p-value, as well as hit names and the link to the corresponding ontological term.
  
As a result, a new folder is formed with several subfolders corresponding to each input table. Each subfolder contains the results of ''Functional classification.'' For each ontological category several parameters are calculated, including expected number of hits, actual number of hits, p-value as well as the names of genes falling into this category and the link to the corresponding ontological term. 
+
The same steps are repeated for the next input table, and several cycles are performed automatically corresponding to the number of tables in the input folder.
  
 
This workflow is available together with a valid TRANSPATH® license.
 
This workflow is available together with a valid TRANSPATH® license.

Latest revision as of 16:19, 11 December 2014

Workflow title
Mapping to ontologies for multiple gene sets (TRANSPATH(R))
Provider
geneXplain GmbH

[edit] Workflow overview

Mapping-to-ontologies-for-multiple-gene-sets-TRANSPATH-R-workflow-overview.png

[edit] Description

This workflow is designed to classify an input gene set to several ontologies and to identify terms, hits for which are overrepresented in the input set. The input is a folder containing several gene or protein tables, and these tables are taken automatically by the workflow, one input table after another, in a cycle.

At the first step, one of the input tables from the input folder is converted into a table with Ensembl Gene IDs.

The table with Ensembl Gene IDs is subjected to functional classification, which is done in parallel by the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, Transpath® pathways, TF classification.

For each ontological term several parameters are calculated, including expected number of hits, actual number of hits, p-value, as well as hit names and the link to the corresponding ontological term.

The same steps are repeated for the next input table, and several cycles are performed automatically corresponding to the number of tables in the input folder.

This workflow is available together with a valid TRANSPATH® license.

[edit] Parameters

Input folder
Folder to get input tables from
Species
Results folder
Folder to store results (will be created if not exists yet)
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox