A comprehensive analysis framework for T-cell and B-cell repertoire sequencing data.
VDJtools is no longer maintained. See MiXCR for the software which succeeded VDJtools.
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --mem=4g --gres=lscratch:20
salloc.exe: Pending job allocation 46116226
salloc.exe: job 46116226 queued and waiting for resources
salloc.exe: job 46116226 has been allocated resources
salloc.exe: Granted job allocation 46116226
salloc.exe: Waiting for resource configuration
salloc.exe: Nodes cn3144 are ready for job
[user@cn3144 ~]$ module load vdjtools
[user@cn3144 ~]$ java -Xmx4g -jar $VDJ_JAR
VDJtools V1.1.10
Run as $java -jar vdjtools-1.1.10.jar ROUTINE_NAME arguments
[Basic]
CalcBasicStats
CalcSpectratype
CalcSegmentUsage
PlotFancySpectratype
PlotSpectratypeV
PlotFancyVJUsage
[Diversity]
CalcDiversityStats
RarefactionPlot
PlotQuantileStats
[Overlap]
OverlapPair
CalcPairwiseDistances
ClusterSamples
TestClusters
TrackClonotypes
[Preprocessing]
ApplySampleAsFilter
FilterNonFunctional
FilterByFrequency
DownSample
Decontaminate
FilterBySegment
SelectTop
[Operation]
PoolSamples
JoinSamples
(Enrichment) -> Deprecated
[Annotation]
(ScanDatabase) -> moved to VDJdb since 1.0.5, please visit vdjdb.cdr3.net
CalcCdrAaStats
CalcDegreeStats
Annotate
SegmentsToFamilies
[Util]
FilterMetadata
SplitMetadata
Convert
RInstall
* Run with 'discard_scripts' option prior to ROUTINE_NAME to clean up R scripts upon execution
[user@cn3144 ~]$ java -Xmx4g -jar $VDJ_JAR CalcBasicStats -h
usage: CalcBasicStats [options] [sample1 sample2 sample3 ... if -m is not
                      specified] output_prefix
 -h                         display help message
 -m,--metadata    Metadata file. First and second columns should
                            contain file name and sample id. Header is
                            mandatory and will be used to assign column
                            names for metadata.
 -u,--unweighted            Will count each clonotype only once, apart
                            from conventional frequency-weighted
                            histogram.
[user@cn3144 ~]$ java -Xmx4g -jar $VDJ_JAR CalcBasicStats sample1 sample2 sample3
[user@cn3144 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$
 
Create a batch input file (e.g. vdjtools.sh). For example:
#!/bin/bash module load vdjtools java -Xmx4g -jar $VDJ_JAR CalcBasicStats sample1 sample2 sample3
Submit this job using the Slurm sbatch command.
sbatch --mem=4g vdjtools.sh
Create a swarmfile (e.g. vdjtools.swarm). For example:
java -Xmx4g -jar $VDJ_JAR CalcBasicStats sample1 java -Xmx4g -jar $VDJ_JAR CalcBasicStats sample2 java -Xmx4g -jar $VDJ_JAR CalcBasicStats sample3
Submit this job using the swarm command.
swarm -f vdjtools.swarm -g 4 --module vdjtoolswhere
| -g # | Number of Gigabytes of memory required for each process (1 line in the swarm command file) | 
| -t # | Number of threads/CPUs required for each process (1 line in the swarm command file). | 
| --module vdjtools | Loads the vdjtools module for each subjob in the swarm |