In this section you can find information on obtaining raw sequencing data, data products, and processing scripts. Of course, all the code is also embedded in the workflows on the website as well.
All sequence data is deposited at the European Nucleotide Archive (ENA). See below for instructions on submitting data to the ENA and the files we used to deposit the data.
The trimmed 16S rRNA data (primers removed) are deposited under Project Accession number PRJEB36632 (ERP119845), sample accession numbers ERS4291994-ERS4292031.
The metagenomic data from the four water column samples are also deposited under Project Accession number PRJEB36632 (ERP119845). The individual sample accession numbers for these data are ERS4578390-ERS4578393.
Data for each individual pipeline are available through the Smithsonian figshare under a single collection at doi:10.25573/data.c.5025362. In addition, data from each pipeline are available for download from figshare using the links at the bottom of each page (where applicable).
We submitted out data to the European Nucleotide Archive (ENA). The ENA does not like RAW data and prefers to have primers removed. So we submitted the trimmed Fastq files to the ENA. You can find these data under the study accession number PRJEB36632 (ERP119845). The RAW files on our figshare site (see above).
To submit to the ENA you need two data tables (plus your sequence data). One file describes the samples and the other file describes the sequencing data.
You can download these data tables here:
md5sum
on all the tar.gz
files.ftp webin.ebi.ac.uk
. I entered my username and password. Then typed mput *.gz
. The problem was that I had to say yes for each file. Should be a way around this. Documentation can be found here. Probably need to type prompt
first.The source code for this page can be accessed on GitHub by clicking this link.
If you see mistakes or want to suggest changes, please create an issue on the source repository.
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at https://github.com/hypocolypse/web/, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".