Difference between revisions of "Gene set enrichment analysis (Affymetrix probes) (workflow)"

From BioUML platform
Jump to: navigation, search
m (Cosmetic changes in category formatting)
(Automatic synchronization with BioUML)
Line 6: Line 6:
 
[[File:Gene-set-enrichment-analysis-Affymetrix-probes-workflow-overview.png|400px]]
 
[[File:Gene-set-enrichment-analysis-Affymetrix-probes-workflow-overview.png|400px]]
 
== Description ==
 
== Description ==
This workflow is designed to perform Gene Set Enrichment Analysis, GSEA, as it is described at [http://www.broadinstitute.org/gsea/index.jsp http://www.broadinstitute.org/gsea/index.jsp]. As input, the normalized data with Affymetrix probeset IDs can be submitted.
+
This workflow is designed to perform Gene Set Enrichment Analysis, GSEA, as it is described at [http://www.broadinstitute.org/gsea/index.jsp http://www.broadinstitute.org/gsea/index.jsp].  As input, the normalized data with Affymetrix probeset IDs can be submitted.
  
Such normalized files are resulting from the “Normalize data” procedure under “analyses/Methods/Data normalization/Normalize Affymetrix experiment and control”.
+
Such normalized files result from the “Normalize data” procedure under “Analyses/Methods/Data normalization/Normalize Affymetrix experiment and control”.
  
First, the input files are subjected to fold-change calculation.
+
First, the input files are subjected to fold-change calculation. The table with probeset IDs and calculated fold change values is converted into a table with Ensembl Gene IDs.
  
The table with probeset Ids and calculated fold change values is converted into a table with Ensembl Gene Ids.
+
In the next step, the Ensembl genes are annotated with additional information, gene descriptions and gene symbols.
  
At the next step, the Ensembl genes are annotated with additional information, gene description and gene symbols.
+
Finally the annotated Ensembl genes are subjected to GSEA using the PROTEOME<sup>TM</sup> database. Enrichment analysis is performed using the following ontologies: PROTEOME GO biological processes, PROTEOME GO cellular components, PROTEOME GO molecular function, PROTEOME disease, and TRANSPATH pathways.
  
Finally the annotated Ensembl genes are subjected to GSEA. Enrichment analysis is done in parallel by the following ontologies: GO biological processes, GO cellular components, GO molecular functions and by the Reactome pathways.
+
Output files include enrichment analysis results, a list of annotated Ensembl genes and a histogram of LogFoldChange distribution. For each ontological term several parameters are calculated, including nominal p-value, ES, NES, FDR, rank at max, hit names, the link to the corresponding ontological term, and the link to open a visualization plot.
 
+
For each ontological term several parameters are calculated, including nominal p-value, ES, NES, as well as hit names, the link to the corresponding ontological term, and the link to open a visualization plot.
+
  
 
== Parameters ==
 
== Parameters ==

Revision as of 13:34, 30 May 2013

Workflow title
Gene set enrichment analysis (Affymetrix probes)
Provider
geneXplain GmbH

Workflow overview

Gene-set-enrichment-analysis-Affymetrix-probes-workflow-overview.png

Description

This workflow is designed to perform Gene Set Enrichment Analysis, GSEA, as it is described at http://www.broadinstitute.org/gsea/index.jsp.  As input, the normalized data with Affymetrix probeset IDs can be submitted.

Such normalized files result from the “Normalize data” procedure under “Analyses/Methods/Data normalization/Normalize Affymetrix experiment and control”.

First, the input files are subjected to fold-change calculation. The table with probeset IDs and calculated fold change values is converted into a table with Ensembl Gene IDs.

In the next step, the Ensembl genes are annotated with additional information, gene descriptions and gene symbols.

Finally the annotated Ensembl genes are subjected to GSEA using the PROTEOMETM database. Enrichment analysis is performed using the following ontologies: PROTEOME GO biological processes, PROTEOME GO cellular components, PROTEOME GO molecular function, PROTEOME disease, and TRANSPATH pathways.

Output files include enrichment analysis results, a list of annotated Ensembl genes and a histogram of LogFoldChange distribution. For each ontological term several parameters are calculated, including nominal p-value, ES, NES, FDR, rank at max, hit names, the link to the corresponding ontological term, and the link to open a visualization plot.

Parameters

Experiment normalized
Control normalized
Species
Results folder
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox