Dr. Bonnie Hurwitz is an Assistant Professor of Biosystems Engineering at the University of Arizona and a Bio5 Research Institute Fellow. She has worked as a computational biologist for nearly two decades on interdisciplinary projects in both industry and academia. Her research on the human/earth microbiome applies large-scale –omics datasets, high-throughput computing, and big data analytics to research questions in “One Health”. In particular, Dr. Hurwitz is interested in the relationship between the environment, microbial communities, and their hosts. Her work in computational biology, spanning areas from plant genomics to viral metagenomics, has received over 1200 citations.
Ocean Sciences meets Big Data Analytics
4:00 p.m. Mon., March 7, 2016
Forum Hall, Palmer Commons (100 Washtenaw Ave.)
Hundreds of researchers worldwide have joined forces in the Tara Oceans Expedition to create an unprecedented planetary-scale dataset comprising state-of-the-art next-generation sequencing, microscopy, and physical/chemical metadata to explore ocean biodiversity. This summer the complete collection of data from the 2009-2013 Tara voyage was released. Yet, despite herculean efforts by the Tara Oceans Consortium to make raw data and computationally derived assemblies and gene catalogs available, most researchers are stymied by the sheer volume of the data. Specifically, the most tantalizing research questions lie in understanding the unifying principles that guide the distribution of organisms across the sea and affect climate and ecosystem function. To use the data in this capacity, researchers must download, integrate, and analyze more than 7.2 trillion bases of metagenomic data and associated metadata from viruses, bacteria, archaea, and small eukaryotes at their own data centers (~9 TB of raw data). Accessing large-scale datasets in this way impedes scientists from replicating and building on prior work. To this end, we are developing a data platform called the Ocean Cloud Commons (OCC) as part of the iMicrobe project. The OCC is built using an algorithm we developed to pre-compute massive comparative metagenomic analyses in a Hadoop big data framework. By maintaining data in a cloud commons, researchers have access to scalable computation and real-time analytics that promote the integrated and broad use of planetary-scale datasets, such as Tara.
This seminar is co-sponsored by the Department of Earth and Environmental Sciences.