Difference between revisions of "Analyze SNP list (GTRD) (workflow)"
(Automatic synchronization with BioUML) |
(Automatic synchronization with BioUML) |
||
Line 8: | Line 8: | ||
This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs. Any list with SNP IDs (in the format rs…) can be used as an input. | This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs. Any list with SNP IDs (in the format rs…) can be used as an input. | ||
− | In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs. | + | In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs. |
The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions. | The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions. | ||
Line 14: | Line 14: | ||
The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs. | The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs. | ||
− | The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile | + | The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile moderate threshold from GTRD library, but other GTRD profiles can be chosen by the user. The region around SNPs analyzed for TFBSs is 50 bp on each flank; it is available in the input form and can be modified. |
The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes. | The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes. |
Latest revision as of 16:34, 12 March 2019
- Workflow title
- Analyze SNP list (GTRD)
- Provider
- geneXplain GmbH
[edit] Workflow overview
[edit] Description
This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs. Any list with SNP IDs (in the format rs…) can be used as an input.
In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs.
The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions.
The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs.
The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile moderate threshold from GTRD library, but other GTRD profiles can be chosen by the user. The region around SNPs analyzed for TFBSs is 50 bp on each flank; it is available in the input form and can be modified.
The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes.
[edit] Parameters
- Input SNP Table
- 5' and 3' gene bound extension
- Profile
- SNP surrounding region, bp
- Set SNP surrounding region, bp
- Species
- Results Folder