About

This repository analyzes viral genomes using Nextstrain to understand how SARS-CoV-2, the virus responsible for the COVID-19 pandemic, evolves and spreads.

We maintain a number of publicly available builds, visible at nextstrain.org/ncov.

In July 2020, we introduced some major changes to this repository. If you were running customized builds before this, these changes may cause some merge conflicts. Read here to find out about all the changes, and which ones may impact custom runs.

Resources

Use Nextstrain to analyze your SARS-CoV-2 data

We've written a comprehensive guide to get you up and running in under an hour. Follow the links below to work through it. It covers:

Introduction (Start here)
Setup and installation
Preparing your data
Orientation: analysis workflow
Orientation: which files should I touch?
Running & troubleshooting
Customizing your analysis
Customizing your visualization
Options for visualizing and sharing results (including working with sensitive metadata)
Interpreting your results
Writing a narrative to highlight key findings

Download formatted datasets

The hCoV-19 / SARS-CoV-2 genomes were generously shared via GISAID. We gratefully acknowledge the Authors, Originating and Submitting laboratories of the genetic sequence and metadata made available through GISAID on which this research is based.

To download the GISAID data and run the analysis yourself, please see this guide.

Please note that data/metadata.tsv is no longer included as part of this repo. However, we provide continually updated, pre-formatted metadata and FASTA files for download through GISAID.

Read previous Situation Reports

We issued weekly Situation Reports for the first ~5 months of the pandemic. You can find the Reports and their translations here.

FAQs

Can't find your sequences in Nextstrain? Check here for common reasons why your sequences may not be appearing. You can also use clades.nextstrain.org to perform some basic quality control on your sequences. If they are flagged by this tool, they will likely be excluded by our pipeline.
For information about how clades are defined, and the currently named clades, please see here. To assign clades to your own sequences, you can use our clade assignment tool at clades.nextstrain.org.

Bioinformatics notes

Site numbering and genome structure use Wuhan-Hu-1/2019 as the reference. The phylogeny is rooted relative to early samples from Wuhan. Temporal resolution assumes a nucleotide substitution rate of 8 × 10^-4 substitutions per site per year. SNPs present in the first and last few bases of the alignment were masked as likely sequencing artifacts.
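
To make the notes above concrete, here is a minimal Python sketch. It is an illustration only, not the code used by this workflow. It converts the assumed substitution rate into an expected number of substitutions across the whole genome, and shows one simple way to mask the ends of an aligned sequence. The genome length of 29,903 nucleotides for Wuhan-Hu-1 is not stated in this README and is supplied here as an assumption. The function names and the number of masked bases in the example are hypothetical, since the notes only say "the first and last few bases".

```python
# Illustrative sketch only; not the implementation used by this repository.

SUBSTITUTION_RATE = 8e-4  # substitutions per site per year (from the notes above)
GENOME_LENGTH = 29_903    # Wuhan-Hu-1 length in nucleotides (assumption, not from this README)


def expected_substitutions(years, rate=SUBSTITUTION_RATE, length=GENOME_LENGTH):
    """Expected substitutions accumulated across the whole genome over `years`."""
    return rate * length * years


def mask_ends(aligned_seq, n_start, n_end, mask_char="N"):
    """Replace the first `n_start` and last `n_end` characters with `mask_char`.

    The counts are caller-supplied; the README does not say how many bases
    are masked in the real pipeline.
    """
    if n_start + n_end >= len(aligned_seq):
        # Everything falls inside a masked region.
        return mask_char * len(aligned_seq)
    middle = aligned_seq[n_start:len(aligned_seq) - n_end]
    return mask_char * n_start + middle + mask_char * n_end


if __name__ == "__main__":
    print(f"{expected_substitutions(1):.1f} substitutions per year")        # 23.9
    print(f"{expected_substitutions(1 / 12):.1f} substitutions per month")  # 2.0
    print(mask_ends("ACGTACGTAC", n_start=2, n_end=3))                      # NNGTACGNNN
```

Under these assumptions the rate works out to roughly 24 substitutions per genome per year, or about 2 per month, which gives a sense of the temporal resolution a time-scaled tree can achieve.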

Contributing

We welcome contributions from the community! Please note that we strictly adhere to the Contributor Covenant Code of Conduct.

Contributing to software or documentation

Please see our Contributor Guide to get started!

Contributing data

Please note that we automatically pick up any SARS-CoV-2 data that is submitted to GISAID.

If you're a lab and you'd like to get started sequencing, please see:

Protocols from the ARTIC network
Funding opportunities for sequencing efforts
Or, if these don't meet your needs, get in touch

Get in touch

To report a bug, error, or feature request, please open an issue.

For questions, head over to the discussion board; we're happy to help!

