Broad Institute

Marine Phage, Virus and Virome Sequencing Pipeline

To support the Broad Institute in sequencing and initially characterizing marine phage and virus genomes, as well as the genomic content of environmental marine virus assemblages.

Title: Marine Phage, Virus and Virome Sequencing Pipeline
Date Awarded: Nov 2008
Amount: $1,848,572
Term: 28 months
Grant ID: GBMF1799
Funding Area: Science, Marine Microbiology Initiative
Organization Name: Broad Institute

Viruses constitute a critical component of the marine ecosystem, shaping the diversity, ecology and evolution of the microorganisms they interact with and infect. A liter of seawater is estimated to contain more than 1 billion virus particles, yet we are only beginning to understand the nuances of how these tiny players impact marine microbial ecosystems. DNA sequencing technologies are enabling new insights into the vast diversity of laboratory-isolated viruses and of natural populations collected from seawater, stimulating new hypotheses regarding how viruses interact with microbes to influence marine elemental cycling.

To better understand the genomic diversity of marine phage (which infect bacteria and archaea) and viruses (which infect microeukaryotes), the Broad Institute, the Marine Microbiology Initiative (MMI) at the Gordon and Betty Moore Foundation, and the international virus ecology research community initiated the Marine Phage, Virus and Virome Sequencing Project. The effort resulted in 127 annotated phage and virus genomes and 137 viral metagenomes (sequences of whole virus communities). All data are publicly available through NCBI, along with accompanying accession information. The project covered a phylogenetic breadth of bacterial hosts, encompassing a diverse range of cyanophage and vibriophage, and also included unique algal-infecting viruses. Each metagenome received approximately one quarter of a 454 FLX sequencing plate. This community resource project enabled major participation by the international marine virus ecology community and greatly increased the number of environmental viral genomes and metagenomes.

The results of this effort encompass a wealth of new knowledge gained from a significant increase in the amount of viral DNA sequence available to researchers, new bioinformatics training opportunities, and new ways to quantify and describe a hitherto ‘black box’ of viral ecology. Highlights from the project include delineation of new viral families and a more robust biogeography of viral lineages in the global oceans (for example, Holmfeldt et al., 2013; Labonté and Suttle, 2013; Kelly et al., 2013), and new insights into host-viral interactions (for example, Ankrah et al., 2014).

The project forms a portfolio with the Microbial Genome Sequencing Project to sequence diverse marine bacterial and archaeal isolates; the Environmental Metagenomics Sequencing Portfolio to sequence whole microbial communities from locations around the world; and the Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP) to sequence the gene content of hundreds of single-celled marine eukaryote cultures.


The following workshops have highlighted the Marine Phage, Virus and Virome Sequencing Project, and have helped researchers exchange ideas and forge new collaborations.

Environmental Virology: A workshop on experimental methods, informatics tools, and theory, January 6-12, 2013, Tucson, AZ, USA 

A Viromics Workshop: Tools and Tricks to See the ‘Virus’ in Diverse Sequence Datasets, May 17, 2014, Boston, MA, USA

Aquatic Virus Workshop 7, November 3-7, 2013, St. Petersburg, FL, USA

Aquatic Virus Workshop 6, October 30-November 3, 2011, Texel, The Netherlands

Bioinformatics Resource

A bioinformatics pipeline, the Viral Informatics Resource for Metagenome Exploration (VIROME), has been supported (Grant #2732) to enable classification of viral metagenome sequences based on homology search results against a curated reference database of known and environmental sequences.

Data Availability

All data are publicly available through NCBI, with accompanying accession information available for download.

Publications as of July 2014  

