eggNOG-mapper: fast genome-wide functional annotation through orthology assignment.
eggNOG-mapper is a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from the eggNOG database. Orthology assignment is ideally suited for functional inference. However, predicting orthology is computationally intensive at large scale, and most other pipelines are relatively inaccessible (e.g., new assignments only available through database updates), so less precise homology-based functional transfer was previously the default for (meta-)genome annotation.
References:
- Carlos P. Cantalapiedra, Ana Hernandez-Plaza, Ivica Letunic, Peer Bork and Jaime Huerta-Cepas.
eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale.
Molecular Biology and Evolution 38i(12):5825–5829. - J.Huerta-Cepas, D.Szklarczyk, D.Heller, A.Hernández-Plaza, S.K.Forslund, H.Cook,
D.R.Mende, I.Letunic, T.Rattei, L.J.Jensen, C. von Mering, P.Bork
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses.
Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D309–D314.
Documentation
Important Notes
- Module Name: eggNOG-mapper (see the modules page for more information)
- Unusual environment variables set
- EGGNOG_HOME installation directory
- EGGNOG_BIN executable directory
- EGGNOG_SRC source code directory
- EGGNOG_TEST sample data directory
- EGGNOG_DATA_DIR database directory
Interactive job
Interactive jobs should be used for debugging, graphics, or applications that cannot be run as batch jobs.
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --mem=16g --gres=lscratch:10 [user@cn3200 ~]$ module load eggnog-mapper [+] Loading python 3.7 ... [+] Loading eggnog-mapper 2.1.2 ... [user@cn3200 ~]$ls $EGGNOG_BIN create_dbs.py download_eggnog_data.py hmm_mapper.py hmm_worker.py shell diamond emapper.py hmm_server.py pythonDownload test data files:
[user@cn3200 ~]$ mkdir /data/$USER/eggnog-mapper && cd /data/$USER/eggnog-mapper [user@cn3200 ~]$ cp $EGGNOG_TEST/* .Run emapper.py on the test data using bacteria.dmnd diamond database:
[user@cn3200 ~]$emapper.py --dmnd_db $EGGNOG_DATA_DIR/bacteria.dmnd -i test_queries.fa -o test /opt/conda/envs/eggNOGmapper/lib/python3.7/site-packages/eggnog_mapper-2.1.6-py3.7.egg/eggnogmapper/bin/diamond blastp -d /usr/local/apps/eggnog-mapper/2.1.6/data/bacteria.dmnd -q /gs7/users/user/eggnog-mapper/test_queries.fa --threads 1 -o /gs7/users/user/eggnog-mapper/test.emapper.hits --sensitive --iterate -e 0.001 --top 3 --outfmt 6 qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore qcovhsp scovhsp Functional annotation of hits... 0 3.5762786865234375e-06 0.00 q/s (% mem usage: 3.90, % mem avail: 96.09) 2 2.5320394039154053 0.79 q/s (% mem usage: 3.90, % mem avail: 96.08) Done /data/user/eggnog-mapper/test.emapper.hits /data/user/eggnog-mapper/test.emapper.seed_orthologs /data/user/eggnog-mapper/test.emapper.annotations ...Alternatively, you can create a diamond database on your own:
[user@cn3200 ~]$ mkdir data [user@cn3200 ~]$ export EGGNOG_DATA_DIR=./data [user@cn3200 ~]$ create_dbs.py -m diamond --dbname bacteria --taxa BacteriaThis will create a bacteria.dmnd diamond database in the directory specified in EGGNOG_DATA_DIR environment variable.