Scientific Reference Data

We provide a set of centrally-maintained scientific reference databases for Biowulf users. You can search through this data here. To request a new database or an update, please contact us at staff@hpc.nih.gov.


OR

Search by keywordSearches through metadata using keywords
Search by filenameSearches through filenames where available


Browse Common Databases

Recently Updated:

2024-11-19 Betacoronavirus Blast database of Betacoronavirus nucleotide sequences. (Blast database full path and name - /fdb/blastdb/Betacoronavirus)
2024-11-19 taxonomy The Taxonomy Database is a curated classification and nomenclature for all of the organisms in the public sequence databases.
2024-11-18 biogans BioGANs is a novel application of Generative Adversarial Networks (GAN) to the synthesis of cells imaged by fluorescence microscopy.
2024-11-18 metawrap MetaWRAP is a modular pipeline for shotgun metagenomic data analysis.
2024-11-18 NCBI nr Blast database NCBI nonredundant comprehensive protein database, compiled from GenBank CDS translations, PDB, Swiss-Prot, PIR, and PRF (Blast database full path and name - /fdb/blastdb/nr )
2024-11-18 PMT PB-metagenomics-tools is a suite of tools for performing metagenomic analyses using HiFi sequencing data.
2024-11-18 Reference data for the cellranger pipeline References for the 10x Genomics cellranger pipeline
2024-11-18 velocyto Velocyto is a library for the analysis of RNA velocity. It includes a command line tool and an analysis pipeline.
2024-11-18 xTea xTea (x-Transposable element analyzer), is a tool for identifying TE insertions in whole-genome sequencing data.
2024-11-16 NCBI nt Blast database NCBI nonredundant comprehensive nucleotide database, compiled from Genbank, Refseq, TPA and PDB. (Blast database full path and name - /fdb/blastdb/nt )
2024-11-16 Patent nucleotide sequences Blast db Patent nucleotide sequences (Blast database full path and name - /fdb/blastdb/patnt )
2024-11-16 PDB nucleotide sequences Blast db Protein Data Bank nucleotide sequences. (Blast database full path and name - /fdb/blastdb/pdbnt )
2024-11-16 PDB protein sequences Blast db Protein Data Bank sequences. (Blast database full path and name - /fdb/blastdb/pdbaa )
2024-11-16 Swissprot Blast database Curated, highly-annotated protein sequence database (Blast database full path and name - /fdb/blastdb/swissprot )
2024-11-15 ampliconsuite AA_REPO copy [Not available]
2024-11-15 bakta [Not available]
2024-11-15 DRAM data [Not available]
2024-11-15 genomad db [Not available]
2024-11-15 Gimme Motifs reference db [Not available]
2024-11-15 MToolBox [Not available]