CLC Assembly Cell is a high-performance computing solution for mapping reads to a reference and de novo assembly of next-generation sequencing data.
With the command-line interface of CLC Assembly Cell, you can easily include these functionalities in scripts and other next generation sequencing workflows. It is easy to install on your desktop computer or a larger compute cluster. CLC Assembly Cell is accelerated through advanced algorithm implementations: they use the SIMD instruction set to parallelize and accelerate compute intensive parts of the algorithms, and make the software one of the fastest and most accurate packages for NGS data analysis on the market.
Multiple CLC Assembly Cells can be run in parallel on a multi-node cluster.
In practice, almost every cluster is set up differently, and we therefore don’t provide an off-the-shelf solution that is guaranteed to work on your computer cluster. Instead we provide a free to download, free to use, and free to modify Perl script, as an example.
The latest version of CLC Assembly Cell introduces tools for error-correction and de novo assembly of raw PacBio reads. High quality assemblies can be generated in a fraction of the time that is needed by leading alternatives. CLC Assembly Cell consumes less than 10 percent of the memory used by alternative solutions, while completing the assembly faster.
We compared the performance of the industry standard HGAP1 when run on a high performance computer to the performance of a De Novo Assembly workflow in CLC Assembly Cell. Please note that our De Novo Assembly Pipeline was run on a standard laptop for this comparison.
Read the detailed benchmarks