Hypergeometric analysis for multiple inputs (workflow)

From BioUML platform
Workflow title
Hypergeometric analysis for multiple inputs
Provider
geneXplain GmbH

Workflow overview

Hypergeometric-analysis-for-multiple-inputs-workflow-overview.png

Description

This workflow identifies upregulated and downregulated genes in experimental data with any number of data points per experiment and control. It can be used even when there is only one data point for each experiment and control.

The input is a folder containing multiple normalized tables, which are processed one after another. Such normalized tables are the output of the “Normalize data” procedure under “analyses/Methods/Data normalization/Normalize Affymetrix experiment and control”.

As the first step, a p-value is calculated for the up- and downregulated probeset IDs in each normalized file, using hypergeometric analysis. At the same time, the log fold change is calculated for each probeset ID. The result of this step is a table in which both LogFoldChange and a p-value are assigned to each probeset ID.
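The page does not spell out how the hypergeometric p-value is derived from the data points, so the following is only a minimal sketch of the two generic quantities named above, not the workflow's own implementation. The function names, the base-2 logarithm, and the use of mean linear-scale intensities are all assumptions made for illustration.

```python
from math import comb, log2


def hypergeom_upper_tail(k, population, successes, draws):
    """P(X >= k) for a hypergeometric variable X.

    population: total number of items
    successes:  number of items counted as "successes" in the population
    draws:      number of items drawn without replacement
    """
    total = comb(population, draws)
    upper = min(successes, draws)
    favourable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return favourable / total


def log_fold_change(experiment, control):
    """log2 of the ratio of mean intensities.

    Assumption: values are positive and on a linear scale. If the
    normalized values are already log-transformed, a difference of
    means would be used instead.
    """
    mean_exp = sum(experiment) / len(experiment)
    mean_ctl = sum(control) / len(control)
    return log2(mean_exp / mean_ctl)
```

Because both functions accept sequences of any length, the same sketch covers the case of a single data point in each experiment and control.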

Next, this table is filtered by several conditions in parallel to produce tables of upregulated, downregulated, and jointly up- & downregulated Affymetrix probeset IDs. The filtering criteria are as follows:

For upregulated probes: LogFoldChange>0.5 and -log_P_value_>3.

For downregulated probes: LogFoldChange<-0.5 and -log_P_value_<-3.

For up- & downregulated probes: (LogFoldChange>0.5 and -log_P_value_>3) or (LogFoldChange<-0.5 and -log_P_value_<-3)
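The three filters can be sketched as follows. This is an illustrative sketch only: it assumes each row is a tuple of (probeset ID, LogFoldChange, -log_P_value_), that the -log_P_value_ column is signed (negative for downregulation, as the criteria above imply), and that the joint table is the union of the upregulated and downregulated probes. The function names are hypothetical; the cutoffs 0.5 and 3 are the ones stated above.

```python
LFC_CUTOFF = 0.5    # threshold on LogFoldChange, from the criteria above
SCORE_CUTOFF = 3.0  # threshold on the signed -log_P_value_ column


def classify_probe(log_fold_change, signed_score):
    """Return "up", "down", or None for a single probeset."""
    if log_fold_change > LFC_CUTOFF and signed_score > SCORE_CUTOFF:
        return "up"
    if log_fold_change < -LFC_CUTOFF and signed_score < -SCORE_CUTOFF:
        return "down"
    return None


def split_probes(rows):
    """Split rows into upregulated, downregulated, and joint ID lists.

    rows: iterable of (probeset_id, log_fold_change, signed_score) tuples.
    """
    up, down, both = [], [], []
    for probeset_id, lfc, score in rows:
        label = classify_probe(lfc, score)
        if label == "up":
            up.append(probeset_id)
        elif label == "down":
            down.append(probeset_id)
        if label is not None:
            both.append(probeset_id)  # joint table: passes either filter
    return up, down, both
```

A probe whose fold change and signed score point in opposite directions, or that misses either cutoff, ends up in none of the three tables.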

The resulting tables of upregulated, downregulated, and up- & downregulated Affymetrix probeset IDs are annotated with additional information: gene descriptions, gene symbols, and species. Finally, these tables are converted into tables of genes. Two tables are produced, one with Ensembl Gene IDs and one with Entrez IDs.

The same steps are repeated for each subsequent input table, so the number of cycles performed automatically equals the number of tables in the input folder.

 

Parameters

Input experiments
Input folder with all normalized CEL files (experiments)
Input controls
Input table with all normalized CEL files (controls)
Probe type
Species
Results folder