Hadoop

From BioUML platform
Revision as of 00:24, 17 November 2013 by Fedor Kolpakov (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
This page or section is a stub. Please add more information here!


List of Hadoop applications for NGS

Hadoop MapReduce-based approaches have become increasingly popular due to their scalability in processing large sequencing data sets[1].

Tool, Ref Description URL
SeqPig [1] A library and a collection of tools to manipulate, analyze and query sequencing data sets in a scalable and simple manner.

SeqPig scripts use the Hadoop-based distributed scripting engine Apache Pig, which automatically parallelizes and distributes data processing tasks.

http://sourceforge.net/projects/seqpig/

http://seqpig.sourceforge.net/ (manual)


References

Error fetching PMID 24149054:
  1. Error fetching PMID 24149054: [Schumacher2013]
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox