Difference between revisions of "Mapping to ontologies for multiple gene sets (workflow)"

From BioUML platform
Jump to: navigation, search
(Automatic synchronization with BioUML)
(Automatic synchronization with BioUML)
 
(One intermediate revision by one user not shown)
Line 6: Line 6:
 
[[File:Mapping-to-ontologies-for-multiple-gene-sets-workflow-overview.png|400px]]
 
[[File:Mapping-to-ontologies-for-multiple-gene-sets-workflow-overview.png|400px]]
 
== Description ==
 
== Description ==
This workflow classifies multiple input gene sets using several ontologies and identifies categories that are over-represented in each of the input sets. The input is a folder containing several gene or protein tables.
+
This workflow is designed to classify a multiple set of genes by enrichment analysis using GO, Reactome, HumanCyc and TF classification databases. Gene sets should be located in one folder At the first step, the input table is converted into a table with Ensembl Gene IDs.
  
In the first step, the first table from the input folder is converted into a gene table with Ensembl IDs.
+
This table with Ensembl Gene Ids is subjected to functional classification, which is done in parallel by the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, TF classification.
  
In the second step, the table with Ensembl IDs is submitted to the ''Functional classification'' analysis, which is done in parallel using the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, and TF classification.
+
For each ontological term several parameters are calculated, including expected number of hits, actual number of hits, p-value, as well as hit names and the link to the corresponding ontological term. 
  
The first and second steps are repeated for the second table from the input folder, and it is repeatedly performed for each table from the input folder.
+
The same steps are repeated for the next input table, and several cycles are performed automatically corresponding to the number of tables in the input folder.
  
As a result, a new folder is formed with several subfolders corresponding to each input table. Each subfolder contains the results of ''Functional classification.'' For each ontological term several parameters are calculated, including expected number of hits, actual number of hits, p-value as well as the names of genes falling into this category and the link to the corresponding ontological term.
+
 
  
 
 
 
 

Latest revision as of 16:34, 12 March 2019

Workflow title
Mapping to ontologies for multiple gene sets
Provider
geneXplain GmbH

[edit] Workflow overview

Mapping-to-ontologies-for-multiple-gene-sets-workflow-overview.png

[edit] Description

This workflow is designed to classify a multiple set of genes by enrichment analysis using GO, Reactome, HumanCyc and TF classification databases. Gene sets should be located in one folder At the first step, the input table is converted into a table with Ensembl Gene IDs.

This table with Ensembl Gene Ids is subjected to functional classification, which is done in parallel by the following ontologies: GO biological processes, GO cellular components, GO molecular functions, Reactome pathways, HumanCyc pathways, TF classification.

For each ontological term several parameters are calculated, including expected number of hits, actual number of hits, p-value, as well as hit names and the link to the corresponding ontological term. 

The same steps are repeated for the next input table, and several cycles are performed automatically corresponding to the number of tables in the input folder.

 

 

[edit] Parameters

Input folder
Folder to get input tables from
Species
Results folder
Folder to store results (will be created if not exists yet)
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox