Biotechnology firms, pharmaceutical companies, and research institutions actively involved in genomics, proteomics, and molecular modeling and simulation are producing massive datasets that are growing exponentially. This explosion in genomic and proteomic information, especially from Next-Gen Sequencing (NGS), and the emergence of data-intensive molecular diagnostic techniques demand a scalable, high performance storage infrastructure.
Panasas ActiveStor systems provide scalable bandwidth and random I/O to accelerate throughput for the most demanding life sciences applications. Scalable capacity ensures that multiple versions of genomic databases can be maintained and quickly made available across a workflow for computational chemistry, bioinformatics, and other data-intensive applications. Large datasets, such as micro array or mass spectrometry images, are effectively managed within a single scalable namespace and shared among researchers to streamline collaboration.
Proven at the world's leading life sciences organizations:
- Boehringer Ingelheim
- The Beijing Genomics Institute
- The European Molecular Biology Laboratory (EMBL) - European Bioinformatics Institute (EBI)
- The National Center for Biotechnology Information (NCBI) A division of the National Library of Medicine (NLM) at the National Institutes of Health (NIH)
- The Rockefeller University - Gaasterland Laboratory of Computational Genomics
- University of California: Berkeley - Center for Integrative Genomics
This workflow diagram illustrates how a Panasas ActiveStor solution can integrate into a mixed Linux and Windows NGS environment. Data sets from Next-Gen Sequencers are ingested into the ActiveStor system. Linux clients performing the data analysis have direct parallel access via the DirectFlow protocol to the sequencing data on the ActiveStor storage blades, speeding time to results.