Birdsuite is a four-stage analytical framework instantiated in software for deriving integrated and mutually consistent copy number and SNP genotypes. The method sequentially assigns copy number across regions of common copy number polymorphisms (CNPs), calls genotypes of SNPs, identifies rare CNVs via a hidden Markov model (HMM), and generates an integrated sequence and copy number genotype at every locus.
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --mem=16g --cpus-per-task=16
[user@cn3335 ~]$ module load birdsuite
[+] Loading singularity 4.0.1 on cn4172
[+] Loading birdsuite 20231107
[user@cn3335 ~]$ Birdsuite
Version File not found. Version checking disabled.
Argument is required: genderFile
Argument is required: basename
Error: No cel files specified.
usage: Birdsuite [options] cel-files...
Run the whole birdsuite pipeline.
Steps are:
1: apt_probeset_summarize
2: annotate_allele_summaries
3: cn_locus_summarize
4: cn_probeset_summarize
5: canary
6: cn_create_exclude_list
7: birdseed
8: pre_birdseye
9: birdseye
10: post_birdseye
11: larry_bird
12: birdsuite_report
defaultBlockSize              Default block size Default: 0
firstStep                     What step to start with in birdsuite process.
                              Default: 1
platform                      Which platform the birdsuite data comes from.
                              Options:[AffyMetrix,Illumina] Default: AffyMetrix
genomeBuild                   What version of locus metadata to use. Must be
                              one of: ['hg17', 'hg18']. Default: hg18
configFile                    Birdsuite config file. Stores sets of command
                              line arguments. Any argument provided in the
                              argument file is overridden by a command line
                              argument.
canary.priors                 CNV priors file, can be population specific.
noLsf                         Do not use LSF to run birdseye jobs. Run them
                              synchronously instead. Default: false
metadataDir                   Location of metadata files. Default:
                              /humgen/affy_info/GAPProduction/dev
noOp                          Outputs all the commands birdsuite would use to
                              run all sub programs, but programs are not run.
                              Useful to debug. If this command is used, also
                              use --noLsf. Default: false
defaultMemUsage               Default memory used in Megs Default: 4000
genderFile                    File containing a line for each sample in the
                              data file. 0=female, 1=male, 2=unknown. This file
                              must have a header line 'gender'.
canary.allele_freq_weight     Allele frequency model weight. Change with
                              caution!
apt_probeset_summarize.force  Passed through to apt-probeset-summarize.
                              Default: false
lsfQueue                      Use given LSF queue for parallelizing birdseye.
                              Default: birdseed
chipType                      Which chip type the cel files are. This is used
                              to select the appropriate metadata files.
                              Default: GenomeWideSNP_6
noversionCheck                Do not check for latest version Default: false
outputDir                     Where to write the large volume of output data.
                              When running on LSF, this should probably be in
                              /local/scratch, so long as there is a way to
                              retrieve the files when done. Default: .
lastStep                      What step to end with in birdsuite process.
                              Default: 12
exeDir                        Location of executable files. Default:
                              /fg/software/Affymetrix/GAPProduction/dev
basename                      Used to name all the output files.
celFiles                      Text file specifying cel files to process, one
                              per line the first line being 'cel_files'.
                              Default: get cel files from command line.
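The usage text describes two small input files: the genderFile (header line 'gender', then one code per sample) and the optional celFiles list (header line 'cel_files', then one CEL file per line). The following is a minimal sketch of how they might be prepared. The sample names, the file names cel_list.txt and gender.txt, and the scratch directory are hypothetical, and the assumption that gender rows follow the same order as the CEL list is not stated in the help text.

```shell
#!/bin/sh
set -eu

# Hypothetical scratch directory; substitute a real working directory.
workdir=$(mktemp -d)

# celFiles list: the first line is the literal header 'cel_files',
# followed by one CEL file per line (sample names here are placeholders).
printf '%s\n' cel_files sampleA.CEL sampleB.CEL sampleC.CEL > "$workdir/cel_list.txt"

# genderFile: header line 'gender', then one code per sample
# (0=female, 1=male, 2=unknown). Matching the row order of the
# CEL list is an assumption, not something the help text confirms.
printf '%s\n' gender 0 1 2 > "$workdir/gender.txt"

# Sanity check: both files should carry the same number of data lines.
n_cel=$(( $(wc -l < "$workdir/cel_list.txt") - 1 ))
n_gender=$(( $(wc -l < "$workdir/gender.txt") - 1 ))
if [ "$n_cel" -ne "$n_gender" ]; then
    echo "mismatch: $n_cel CEL files vs $n_gender gender entries" >&2
    exit 1
fi
echo "wrote $n_cel samples to $workdir"
```

Checking the line counts before launching the pipeline is a cheap way to catch a missing or extra gender entry.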
Other available executables:
[user@cn3335 ~]$ ls $BS_BIN
annotate_allele_summaries  birdseedqc   birdsuite_report        cn_locus_summarize  pre_birdseye
apt_probeset_summarize     birdseye_64  canary                  larry_bird          shell
birdseed                   Birdsuite    cn_create_exclude_list  post_birdseye
[user@cn3335 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$
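The help text notes that noOp prints the commands Birdsuite would run without executing them, and that it should be paired with --noLsf. A dry run along those lines could be assembled as sketched below. Only the spelling --noLsf appears in the usage output; the --option=value form used for the other arguments is an assumption, and the run name, file names, and output directory are hypothetical. The script only prints the command; it does not invoke Birdsuite.

```shell
#!/bin/sh
set -eu

# Hypothetical inputs; none of these names come from the Birdsuite help text.
run_name="mystudy"
gender_file="gender.txt"
out_dir="birdsuite_out"

# Assumed syntax: GNU-style --option=value. Only '--noLsf' is spelled
# out in the usage output; the other spellings are inferred from it.
set -- Birdsuite \
    --basename="$run_name" \
    --genderFile="$gender_file" \
    --outputDir="$out_dir" \
    --firstStep=1 \
    --lastStep=12 \
    --noLsf \
    --noOp \
    sampleA.CEL sampleB.CEL

# Show the assembled command for review; nothing is executed here.
dry_run_cmd="$*"
echo "$dry_run_cmd"

# To launch it inside the interactive session, uncomment the next line:
# "$@"
```

Once the printed commands look right, dropping --noOp from the argument list would turn the dry run into a real one. Narrowing --firstStep and --lastStep is one way to rerun part of the pipeline.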