iDiffIR is a tool for identifying differential IR from RNA-seq data. It accepts any sorted, indexed BAM file for single- or paired-end reads.
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive [user@cn3316 ~]$ module load iDiffIR [+] Loading singularity 3.8.5-1 on cn4193 [+] Loading iDiffIR 20220121The iDiffIR application involves a number of executables: [user@biowulf]$ ls $IDIFFIR_BIN build_classifiers.py getDepths.py realignment_pipeline.py classify_sites.py get_gene_expression.py run_miso_ir.py convert_models.py get_good_pairs.py sam_collate.py convertSam.py gtf2gff.py sam_filter.py ests_to_splicegraph.py idiffir_plotter.py sam_split.py f2py idiffir.py sam_to_depths.py find_splice_forms.py isolasso_pipeline.py select_model_parameters.py fix_unresolved.py isolasso_update_graphs.py shell gene_model_to_splicegraph.py make_MISO_AS_GFF.py simulate_IR.py generate_known_junctions.py make_MISO_IR_GFF.py smtpd.py generate_predicted_junctions.py plotter.py splicegraph_statistics.py generate_putative_sequences.py predict_graphs.py splice_junction_pipeline.py generate_roc.py predict_splicegraph.py view_splicegraph_multiplot.py generate_splice_site_data.py psginfer_pipeline.py genewise_statistics.py psginfer_update_graphs.py Their basic usage is as follows:
[user@cn3123 user]$ idiffir.py -h
usage: idiffir.py [-h] [-v] [-n] [-l FACTORLABEL FACTORLABEL] [-o OUTDIR] [-s]
[-k KRANGE KRANGE] [-c COVERAGE] [-d DEXPTHRESH] [-p PROCS]
[-f FDRLEVEL] [-g GRAPHDIRS] [-G GRAPHDIRSONLY]
[-m {BF,BH,QV}] [-e {IR,SE}]
genemodel factor1bamfiles factor2bamfiles
Identify differentially expressed introns.
positional arguments:
genemodel gene model file: NAME.gtf[.gz] | NAME.gff[.gz]
factor1bamfiles colon-separated list of bamfiles: PATH-TO-REPLICATE_1
[:PATH-TO-REPLICATE_2,...]
factor2bamfiles colon-separated list of bamfiles: PATH-TO-REPLICATE_1
[:PATH-TO-REPLICATE_2,...]
optional arguments:
-h, --help show this help message and exit
-v, --verbose verbose output [default is quiet running]
-n, --noplot Do not plot figures [default is to make figures]
-l FACTORLABEL FACTORLABEL, --factorlabel FACTORLABEL FACTORLABEL
factor labels, example: -f Mutant Wildtype
-o OUTDIR, --output-dir OUTDIR
output file directory name
-s, --shrink_introns shrink introns for depth plots [default is no
shrinking]
-k KRANGE KRANGE, --krange KRANGE KRANGE
kmin kmax; [default is to search for kmax]
-c COVERAGE, --coverage COVERAGE
coverage cutoff, default = 0.99
-d DEXPTHRESH, --dexpThresh DEXPTHRESH
differential gene expression threshold, [default = 10]
-p PROCS, --procs PROCS
Number of processing cores to use, [default = 1]
-f FDRLEVEL, --fdrlevel FDRLEVEL
FDR test level, [default = 0.05]
-g GRAPHDIRS, --graph-dirs GRAPHDIRS
colon-separated list of directories to recursively
search for SpliceGrapher predictions
-G GRAPHDIRSONLY, --graph-dirs-only GRAPHDIRSONLY
colon-separated list of directories to recursively
search for SpliceGrapher predictions. In this case
only the predicted graphs are used. i.e. the gene
models are only used in plots and not for building
reduced gene models. Useful for poorly annotated
genomes.
-m {BF,BH,QV}, --multTest {BF,BH,QV}
Multiple testing adjustment method BF: Bonferroni, BH:
Benjamini-Hochberg, QV: q-values [default = QV]
-e {IR,SE}, --event {IR,SE}
AS event to test, IR: Intron Retention, SE: Exon
Skipping [default = IR] [default is IR]
[user@cn3123 user]$ idiffir_plotter.py -h
usage: idiffir_plotter.py [-h] [-v] [-l FACTORLABEL FACTORLABEL] [-o OUTDIR]
[-s] [-g GRAPHDIRS] [-p PROCS]
genemodel genelist factor1bamfiles factor2bamfiles
Plot highlighted regions between two factors.
positional arguments:
genemodel gene model file: NAME.gtf[.gz] | NAME.gff[.gz]
genelist File containing gene records to plot. Format for lines
is geneID start_1,end_1;...start_n,end_n.
factor1bamfiles colon-separated list of bamfiles: PATH-TO-REPLICATE_1
[:PATH-TO-REPLICATE_2,...]
factor2bamfiles colon-separated list of bamfiles: PATH-TO-REPLICATE_1
[:PATH-TO-REPLICATE_2,...]
optional arguments:
-h, --help show this help message and exit
-v, --verbose verbose output [default is quiet running]
-l FACTORLABEL FACTORLABEL, --factorlabel FACTORLABEL FACTORLABEL
factor labels, example: -f Mutant Wildtype
-o OUTDIR, --output-dir OUTDIR
output file directory name
-s, --shrink_introns shrink introns for depth plots [default is no
shrinking]
-g GRAPHDIRS, --graph-dirs GRAPHDIRS
colon-separated list of directories to recursively
search for SpliceGrapher predictions
-p PROCS, --procs PROCS
Number of processing cores to use, [default = 1]
[user@cn3123 user]$ getDepths.py -h
usage: getDepths.py [-h] [-o OUTDIR] [-v] bamfile_in
Get chromosomal read depths and junctions
positional arguments:
bamfile_in Name of sorted, indexed BAM file
optional arguments:
-h, --help show this help message and exit
-o OUTDIR, --output-dir OUTDIR
output file directory name
-v, --verbose verbose output
[user@cn3123 user]$ convertSam.py -h
usage: convertSam.py [-h] [-o BAMFILE] [-p PROCS] [-m MEMORY] [-v] samfile
Generate sorted BAM and index files for given SAM file
positional arguments:
samfile Samfile to convert
optional arguments:
-h, --help show this help message and exit
-o BAMFILE, --outfile BAMFILE
Name of converted BAM file [default=.bam]
-p PROCS, --procs PROCS
Number of processors to use for BAM sorting (default
1)
-m MEMORY, --memory MEMORY
Max memory (in GBs) for each processor used for BAM
sorting (default 2)
-v, --verbose Print verbose output
etc.
[user@cn3316 ~]$ exit salloc.exe: Relinquishing job allocation 46116226 [user@biowulf ~]$