Difference between revisions of "GTRD"

From BioUML platform
Jump to: navigation, search
Line 55: Line 55:
  
 
Each ChIP-seq experiment has a row in 'chip_experiments' table, which assigns id and stores basic information about experiment.
 
Each ChIP-seq experiment has a row in 'chip_experiments' table, which assigns id and stores basic information about experiment.
'chip_experiments' table structure:
+
'chip_experiments' table has following structure:
  
 
{| class="wikitable"
 
{| class="wikitable"
Line 76: Line 76:
 
|}
 
|}
  
 +
The links to external databases stored in 'external_refs' table:
  
 +
{| class="wikitable"
 +
|-
 +
! Column !! Description || Example values
 +
|-
 +
| id || Experiment identifier || EXP000489
 +
|-
 +
| external_db || External database name || GEO or PUBMED or ENCODE or SRA
 +
|-
 +
| external_db_id || Identifier in external database || GSM320736
 +
|}
  
 
==Web interface to database==
 
==Web interface to database==

Revision as of 18:48, 1 July 2013

GTRD (Gene Transcription Regulation Database) is a database of transcription factor binding sites identified from ChIP-seq experiments. GTRD analyze freely avalable ChIP-seq experiments from literature, GEO, SRA and ENCODE databases.

The web interface to GTRD is available at http://192.168.199.241/bioumlweb/#anonymous=true&perspective=GTRD.

Database statistics

GTRD uses 2417 ChIP-seq experiments for 470 distinct sequence specific transcription factors.
ChIP-seq experiments by species
Most of ChIP-seq experiments (1638) have corresponding control experiment.
Control experiments

General statistics:

Object type Total count Per ChIP-seq experiment
ChIP-seq reads 80.808E9 34.937E6
Read alignments 58.848E9 25.675E6
ChIP-seq peaks 59.515E6 32899

In average each transcription factor is measured in 4.07 ChIP-seq experiments, but 284 (60%) transcription factors measured only in one experiment.

The ten most studied transcription factors listed bellow:

Transcription Factor Number of ChIP-seq experiments
CTCF 195
c-Myc 45
ERα 44
NRSF 37
C/EBPβ 37
GATA-1 33
NF-κB p65 30
Max 30
PU.1 29
GR 24


Database structure

The metadata concerning GTRD is stored in MySQL tables.

Each ChIP-seq experiment has a row in 'chip_experiments' table, which assigns id and stores basic information about experiment. 'chip_experiments' table has following structure:

Column Description Example value
id Unique experiment identifier EXP000489
antibody Antibody used in chromatin immunoprecipitation sc-345
tfClassId Id in TFClass[1] database of target transcription factor, NULL for control experiments 6.2.1.0.1
cell_line Studied cell line HeLa S3
specie Specie latin name Homo sapiens
treatment Cell treatment or conditions IFN gamma
control_id Id of control experiment, NULL for control experiments or experiments without control EXP000490

The links to external databases stored in 'external_refs' table:

Column Description Example values
id Experiment identifier EXP000489
external_db External database name GEO or PUBMED or ENCODE or SRA
external_db_id Identifier in external database GSM320736

Web interface to database

Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox