Difference between revisions of "ChIP-Seq - Identify and classify target genes (workflow)"

From BioUML platform
Jump to: navigation, search
(Automatic synchronization with BioUML)
(Automatic synchronization with BioUML)
 
Line 6: Line 6:
 
[[File:ChIP-Seq-Identify-and-classify-target-genes-workflow-overview.png|400px]]
 
[[File:ChIP-Seq-Identify-and-classify-target-genes-workflow-overview.png|400px]]
 
== Description ==
 
== Description ==
This workflow identifies and classifies target genes using positional information of peaks found by ChiP-Seq approach. As input, the peak regions in BED format, can be submitted. Actually, any dataset in BED format can be submitted as input track for this workflow.
+
This workflow identifies and classifies target genes using positional information of peaks found by a ChiP-Seq experiment. The peak regions in BED format can be submitted as input.  
  
First, the track file is converted into a gene set using ''Track to gene set'' analysis. The Ensembl gene list is then annotated with additional information, gene descriptions and chromosomal location.
+
Actually, any dataset in BED format can be submitted as input track for this workflow.
  
The annotated Ensembl genes are submitted to ''Functional  classification'' using the following ontologies: GO biological processes, GO cellular components, GO molecular function, TF classification, Reactome and HumanCyc pathways.
+
First, the track file is converted into a gene set using ''Track to gene set'' analysis. The Ensembl gene list is then annotated with additional information, gene descriptions and chromosomal location.
  
In parallel with the ''Functional  classification,'' the Ensembl gene list is converted to a list of Entrez genes, which are used in the next analysis ''Cluster by shortest path'' to find clusters at 2 steps in both directions, upstream and downstream, from the input genes.  The clustering is performed using GeneWays network.
+
The annotated Ensembl genes are submitted to ''Functional classification'' method, using the following ontologies: GO biological processes, GO cellular components, GO molecular function, Transcription Factor classification, Reactome and HumanCyc pathways.
  
 
+
In parallel with the ''Functional classification,'' the Ensembl gene list is used for a clustering analysis (''Cluster by shortest path'') to find gen clusters with a maximum search radius of 2. The searching steps were done in both directions, upstream and downstream, from the input gene list. The clustering is performed using the GeneWays database.
  
As output, a new folder is formed, which contains ''Functional  classification'' results, a list of annotated Ensemble genes and a diagram of clustered GeneWays genes. 
+
The output folder contains the mapped classification results, a list of annotated Ensembl genes and a diagram of the clustered GeneWays genes. 
  
 
 
 
 

Latest revision as of 16:34, 12 March 2019

Workflow title
ChIP-Seq - Identify and classify target genes
Provider
geneXplain GmbH

[edit] Workflow overview

ChIP-Seq-Identify-and-classify-target-genes-workflow-overview.png

[edit] Description

This workflow identifies and classifies target genes using positional information of peaks found by a ChiP-Seq experiment. The peak regions in BED format can be submitted as input.

Actually, any dataset in BED format can be submitted as input track for this workflow.

First, the track file is converted into a gene set using Track to gene set analysis. The Ensembl gene list is then annotated with additional information, gene descriptions and chromosomal location.

The annotated Ensembl genes are submitted to Functional classification method, using the following ontologies: GO biological processes, GO cellular components, GO molecular function, Transcription Factor classification, Reactome and HumanCyc pathways.

In parallel with the Functional classification, the Ensembl gene list is used for a clustering analysis (Cluster by shortest path) to find gen clusters with a maximum search radius of 2. The searching steps were done in both directions, upstream and downstream, from the input gene list. The clustering is performed using the GeneWays database.

The output folder contains the mapped classification results, a list of annotated Ensembl genes and a diagram of the clustered GeneWays genes. 

 

[edit] Parameters

Input track
Species
Results folder
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox