Biowulf High Performance Computing at the NIH
Scientific Databases

BAM

Binary SAM files
Back to main database page
Database Location on HPC systemsLast Updated
1000 Genomes
RNA-seq BAM files from Geuvadis experiment
/fdb/1000genomes/ftp/E-GEUV-1 =
(Updated occasionally)
Source: www.geuvadis.org
1000 Genomes
20100804 release containing analysis results sets (vcfs) and README files.
/fdb/1000genomes/ftp/data/ 05 Dec 2017
(Updated occasionally)
Source: ftp.ncbi.nlm.nih.gov
TCGA DREAM SMC synthetic data
The ICGC-TCGA DREAM Genomic Mutation Calling Challenge is an international effort to improve standard methods for identifying cancer-associated mutations and rearrangements in whole-genome sequencing data. The data set of this challenge is a collection of DNA sequence reads. For the analysis of somatic mutations, the DNA sequence reads derived from a tumor are compared to those of a normal tissue, typically from the same patient. The simulated data is anonymous (derived from cancer cell-lines and/or further anonymized). see https://www.synapse.org/#!Synapse:syn312572/wiki/62018 for more details.
/fdb/DREAM/SMC 06 Dec 2018
(Updated one-time)
Source: see /fdb/DREAM/SMC/Makefile

Back to main database page