BWA on XSEDE0.36Burrows-Wheeler Alignment Tool.Assembly:Assemble_readshttp://bio-bwa.sourceforge.net/bwa_xsededbinDatabase sequences (FASTA format)ref_genome.fastacommand_line_singleperl!defined $queryinpairperl"&& bwa_expanse mem -M -t 16"50command_line_single2perl!defined $queryinpairperl"ref_genome.fasta query1.fq.gz > $SAM_NAME"70command_line_pairedperldefined $queryinpairperl"&& bwa_expanse mem -M -t 16 "50command_line_paired2perldefined $queryinpairperl" ref_genome.fasta query1.fq.gz query2.fq.gz > $SAM_NAME"70bwa_schedulerscheduler.confperl
"threads_per_process=12\\n" .
"node_exclusive=0\\n" .
"mem=23G\\n" .
"nodes=1\\n"
0runtime1scheduler.confMaximum Hours to Run (click here for help setting this correctly)0.25Estimate the maximum time your job will need to run. We recommend testimg initially with a < 0.5hr test run because Jobs set for 0.5 h or less depedendably run immediately in the "debug" queue.
Once you are sure the configuration is correct, you then increase the time. The reason is that jobs > 0.5 h are submitted to the "normal" queue, where jobs configured for 1 or a few hours times may
run sooner than jobs configured for the full 168 hours.
Maximum Hours to Run must be less than 168perl$runtime > 168.0Maximum Hours to Run must be greater than 0.1 perl$runtime < 0.1The job will run on 12 processors as configured. If it runs for the entire configured time, it will consume 12 x $runtime cpu hoursperl$runtime ne 0 perl"runhours=$value\\n"specify_pairedendreadsInput is paired end reads (uncheck for single reads)1specify_interleavedInput is one interleaved, paired end file (-p)perl($value) ? "-p":""051Assume the first input query file is interleaved paired-end FASTA/Q. See the command description for details.queryinQuery File 1(forward reads or interleaved reads)query1.fq.gzPlease select a read fileperl!defined $queryinqueryinpairperl$specify_interleaved || $specify_pairedendreadsQuery File 2 (reverse reads)query2.fq.gzPlease select a reverse reads fileperl!defined $queryinpair && !$specify_interleavedSAM_NAMEOutput File Namealn.samindexBWA indexing settingsbuildindexperl"bwa_expanse"10specify_indexalgIndexing algorithmisbwtswisperl"index -a $value"is - linear-time algorithm for constructing suffix array. It requires 5.37N memory where N is the size of the database. IS is moderately fast, but does not work with database larger than 2GB.bwtsw - Algorithm implemented in BWT-SW. This method works with the whole human genome, but it does not work with database smaller than 10MB and it is usually slower than IS.12colorspaceBuild color-space index?perl($value ? "-c":"")013buildindex2perl"ref_genome.fasta"14inoutoptsInput/Output optionsspecify_headerlineRead group header line (-R)perl(defined $value) ? "-R $value":""Complete read group header line. â\tâ can be used in STR and will be converted to a TAB in the output SAM.
The read group ID will be attached to every read in the output. An example is @RG\tID:foo\tSM:bar . [null] 51insert_headerstringInsert String to header (-H)perl($value) ? "-H $value":""51specify_verbosityVerbosity level: (-v)1234perl($value) ? "-v $value":""1521=error, 2=warning, 3=message, 4+=debugging [3]specify_minscoreMinimum score to output (-T)perl($value ne $vdef) ? "-T $value":""3052specify_minscoreminIf there are less than this number of hits with score >80% (-h INT)553 You can set a value using such that if there are -h hits with score >80% of the max score (200 by default), output will all be in XA. The box below this one allows you to set the maximum score for -hspecify_maxscoreOf this max score, output all in XA (-h INT,INT)20053 You can also set the maximum score for -h using this space. specify_minscoreonlyperl defined $specify_minscoremin && !defined $specify_maxscoreperl"-h $specify_minscoremin"54specify_maxminscoreperl defined $specify_minscoremin && defined $specify_maxscoreperl"-h $specify_minscoremin,$specify_maxscore"54output_allOutput all alignments for SE or unpaired PEperl$value ? "-a":""55 Output all found alignments for single-end or unpaired paired-end reads. These alignments will be flagged as secondary alignmentsappend_commentAppend FASTA/FASTQ comment to SAM output (-C)perl$value ? "-C":""56Append FASTA/Q comment to SAM output. This option can be used to transfer read meta information
(e.g. barcode) to the SAM output. Note that the FASTA/Q comment (the string after a space in the header line) must
conform the SAM spec (e.g. BC:Z:CGTAC). Malformed comments lead to incorrect SAM output.output_refheaderOutput the reference FASTA header in the XR tag (-V)perl$value ? "-V":""56use_softclippingUse soft clipping for supplementary alignments (-Y)perl$value ? "-Y":""57mark_shorthitsMark shorter split hits as secondary (-M)perl$value ? "-M":""58For Picard compatibilityspecify_meanSpecify the mean of insert size distribution (-I)For Picard compatibilityspecify_sdSpecify the standard deviation of insert size distribution (-I)perldefined $specify_meanspecify_maxSpecify the max of insert size distribution (-I)perldefined $specify_sdspecify_minSpecify the min of insert size distribution (-I)perldefined $specify_maxspecify_meansdetc158perldefined $specify_mean && !defined $specify_sdperl"-I $specify_mean"specify_meansdetc258perldefined $specify_sd && !defined $specify_maxperl"-I $specify_mean,$specify_sd"specify_meansdetc358perldefined $specify_max && !defined $specify_minperl"-I $specify_mean,$specify_sd,$specify_max"specify_meansdetc458perldefined $specify_minperl"-I $specify_mean,$specify_sd,$specify_max,$specify_min"scoreoptsScoring optionsspecify_matchscoreScore for a sequence match (-A)perl($specify_matchscore ne $vdef) ? "-A $specify_matchscore":""165specify_mispenaltyScore for a sequence match (-B)perl($specify_mispenalty ne $vdef) ? "-B $specify_mispenalty":""465specify_gapdpenaltyGap open penalties for deletions -O)6specify_gapipenaltyGap open penalties for insertions (-O)6specify_gappenaltystringperl$specify_gapdpenalty ne 6 || $specify_gapipenalty ne 6perl "-O $specify_gapdpenalty,$specify_gapipenalty"65specify_gapextpenaltyGap extension penalty for deletion (-E)1specify_gapextpenalty2Gap extension penalty for insertion (-E)1specify_gappenaltystringperl$specify_gapextpenalty ne 1 || $specify_gapextpenalty2 ne 1perl "-E $specify_gapextpenalty,$specify_gapextpenalty2"65specify_endclippenaltyPenalty for 5 prime- end clipping (-L)5specify_endclippenalty2Penalty for 3 prime- end clipping (-L)5specify_endclippenaltyhiddenperl$specify_endclippenalty ne 5 || $specify_endclippenalty2 ne 5perl "-L $specify_endclippenalty,$specify_endclippenalty2"65specify_unpairedpenaltyPenalty for an unpaired read pair (-U)perl($value ne $vdef) ? "-U $value":""1765specify_readtypeRead type (-x)pacbioont2dintractgperl(defined $value) ? "-x $value":""Setting -x changes multiple parameters unless overriden [null].pacbio: -k17 -W40 -r10 -A1 -B1 -O1 -E1 -L0 (PacBio reads to ref)ont2d: -k14 -W20 -r10 -A1 -B1 -O1 -E1 -L0 (Oxford Nanopore 2D-reads to ref)intractg: -B9 -O16 -L5 (intra-species contigs to ref)65algoptsAlgorithmic optionsspecify_minseedlengthMinimum seed length (-k)perl($value ne $vdef) ? "-k $value":""1965specify_babandwidthBand width for banded alignment (-w)perl($value ne $vdef) ? "-w $value":""10066specify_oddropoffOff-diagonal X-dropoff (-d)perl($value ne $vdef) ? "-d $value":""10068specify_internalseedsLook for internal seeds inside a seed longer than {-k} (-d)perl($value ne $vdef) ? "-d $value":""1.569specify_seedoccurrenceSeed occurrence for the 3rd round seeding (-y)perl($value ne $vdef) ? "-y $value":""2070specify_skipseedsSkip seeds with more than INT occurrence (-c)perl($value ne $vdef) ? "-c $value":""50071specify_dropchainsDrop chains shorter than this fraction of the longest overlapping chain (-D)perl($value ne $vdef) ? "-D $value":""0.572specify_discardchainsDiscard a chain if seeded bases shorter than this (-W)perl($value ne $vdef) ? "-D $value":""072specify_mostroundsPerform at most this many rounds of mate r-kes for each read (-m)perl($value ne $vdef) ? "-m $value":""5073stdoutfilestd.outall_results*