Participate in ethics and data sharing community  | ​  Learn More 

Preprint: SARS-CoV-2 Data Specification


Well-structured, rich contextual data adds value, promotes reuse, and enables aggregation and integration of disparate data sets. We are delighted to announce that the PHA4GE Data Structures working group has released its first preprint on SARS-CoV-2 contextual data specification for open genomic epidemiology.


The preprint identifies a clear data standard which extends the INSDC pathogen package, to provide a contextual specification which is both harmonisable and publicly available.


Development of the specification was led by Dr. Emma Griffths and the PHA4GE Data Structures Working Group members spanning five continents and multiple time zones.


The specification can be implemented using a collection template, as well as an array of protocols and tools which support the harmonisation and submission of sequence data and contextual information to public repositories.


The adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19.


Link to preprint: here

Subscribe to the PHA4GE Newsletter

We're committed to your privacy. PHA4GE uses the information you provide to us to contact you about our relevant content. You may unsubscribe from these communications at any time.

Follow PHA4GE

Related Articles

Wastewater Contextual Data Specification

The PHA4GE Wastewater Contextual Data Specification Package is scoped for data collection and sharing (within organizations, within networks and if desired, with public repositories) of both pathogen-agnostic genomics contextual data and genotypic attributes (such as antimicrobial resistance genes) derived from amplicon-based, WGS, and metagenomic sequencing approaches.

Wastewater Surveillance Guidance and Resources

This repository hosts guidance documents and resources developed by the PHA4GE Wastewater Surveillance Working Group. These documents address core challenges involved in designing effective wastewater surveillance strategies, analyzing wastewater pathogen sequencing and quantification data, and sharing this data with the global public health community.