Analyze any DNA sequence (GTRD) (workflow)

From BioUML platform
Jump to: navigation, search
Workflow title
Analyze any DNA sequence (GTRD)
Provider
geneXplain GmbH

Workflow overview

Analyze-any-DNA-sequence-GTRD-workflow-overview.png

Description

This workflow is designed to search for putative transcription factor binding sites, TFBS, in any input DNA sequence in EMBL, Fasta or Genbank formats. Using this workflow you can analyze DNA sequences of any species and of any genomic regions.

 

The input sequence is subjected to ‘Site search on track’ method using the profile from the GTRD database called moderate threshold. The output sites are then subjected to the method ‘Site search summary’, which generates summary on the site search result. 

The results folder consists of a summary table and a track with sites. The track shows TFBSs that are found in the input sequences. 

 

Each row in the track file corresponds to one resulting TFBS and includes sequence names, site positions (the columns From and To), site Length and Strand, score calculated by the algorithm and a site model (here, GTRD matrix). This table can be exported as a track in several different formats including intervals, bed, wig and more. DNA sequences can be exported in multi-FASTA format.

 

The table Summary gives the site density per thousand bp for each matrix in the input sequence. For each row, the column Site density per 1000bp shows the number of matches normalized per 1000 bp length for the sequences in the input set.  TFBSs can be visualized in the genome browser.

 

 

Parameters

Input sequence
Select Yes sequence set
Profile
Select Profile
Results folder
Select Results folder
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox