SGA is a de novo assembler designed to assemble large genomes from high coverage short read data. The major goal of SGA is to be very memory efficient, which is achieved by using a compressed representation of DNA sequence reads.
- google sparse hash library
- bamtools library
:: MORE INFORMATION
Genome Res. 2012 Mar;22(3):549-56. doi: 10.1101/gr.126953.111.
Efficient de novo assembly of large genomes using compressed data structures.
Simpson JT, Durbin R.