Gene expression prediction

From BioUML platform
Revision as of 23:19, 31 March 2018 by Fedor Kolpakov (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
Method, code, references Input data Algorithm Comment
INVOKE (R script)[1]

https://github.com/SchulzLab/TEPIC/tree/master/MachineLearningPipelines/INVOKE

Input:

  • TF-genes scores (calculated by TEPIC)
    • open chromatin data (DNaseI-seq, NOMe-seq)
    • PWM (Jaspar, HOCOMOCO, Uniprobe)
  • expression data (RNA-seq)

Output:

  • regression coefficients for TF
  • model performance: Pearson correlation, Spearman correlation, and MSE
    • boxplot showing model performance
    • heatmap (top 10 positive and negative coefficients)
    • scatter plots for predicted versus the measured gene expression data

INVOKE offers linear regression with various regularisation techniques (Lasso, Ridge, Elastic net) to infer potentially important transcriptional regulators by predicting gene expression from TEPIC TF-gene scores.


References

Error fetching PMID 27899623:
  1. Error fetching PMID 27899623: [Schmidt217]
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox