GBLOCKS on XSEDE0.91bSelection of conserved blocks from multiple alignments for their use in phylogenetic analysis - run on XSEDEJose Castresana
Talavera, G., and Castresana, J. (2007). Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic Biology 56, 564-577.
Castresana, J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular Biology and Evolution 17, 540-552.
Phylogeny / Alignmentgblocks_xsedenoisy_cometperl""0number_nodesperl!$more_memory2scheduler.confperl
"nodes=1\\n" .
"node_exclusive=0\n" .
"threads_per_process=1\\n"
infileInput File (AFA format)perl"input.fasta"1input.fastaall_results*runtime1Maximum Hours to Run (up to 168 hours)scheduler.conf0.5The maximum hours to run must be less than 168perl$runtime > 168.0The maximum hours to run must be greater than 0.05perl$runtime < 0.05perl"runhours=$value\\n"The job will run on 1 processors as configured. If it runs for the entire configured time, it will consume $runtime cpu hoursperl$runtime ne 0 Estimate the maximum time your job will need to run. We recommend testimg initially with a < 0.5hr test run because Jobs set for 0.5 h or less depedendably run immediately in the "debug" queue.
Once you are sure the configuration is correct, you then increase the time. The reason is that jobs > 0.5 h are submitted to the "normal" queue, where jobs configured for 1 or a few hours times may
run sooner than jobs configured for the full 168 hours.
specify_datatypeType of sequence (-t)dcpperl(defined $value) ? "-t=$value":""Please choose a sequence typeperl!defined $specify_datatype2specify_fileextensSpecify a file extension (-e)perl($value ne $vdef) ? "-e=$value":""-gb30Any integer equal or bigger than Minimum Number Of Sequences For A Conserved Positionnum_seqconservedMinimum Number Of Sequences For A Conserved Position (-b1)perl(defined $value) ? "-b1=$value":"" 4This can be any integer bigger than half the number of sequences and smaller or equal than the total number of sequencesspecify_cutoffMinimum Number Of Sequences For A Flank Position (-b2)perl(defined $value) ? "-b2=$value":""6The number of sequences for a flank position must be greater than or equal to the minimum number of sequences for a conserved positionperldefined $value && $specify_cutoff < $num_seqconservedAny integer equal or bigger than Minimum Number Of Sequences For A Conserved Positionmax_contignonconvservedMaximum Number Of Contiguous Nonconserved Positions (-b3)perl($max_contignonconvserved != $vdef) ? " -b3=$value ":""88min_blocklengthMinimum Length Of A Block (-b4)perl($min_blocklength != $vdef) ? "-b4=$value":""1010The minimum length of a block must be 2 or greaterperldefined $min_blocklength && $min_blocklength < 2 specify_allowedgapsAllowed Gap Positions (-b5)perl$value ne $vdef ? "-b5=$value":""nhan12use_similarityUse Similarity Matrices (Protein only; -b6)perl$specify_datatype eq "p"perl($value) ? "-b6=y":""114min_initialblockMinimum Length Of An Initial Block (-b0)perl(defined $value) ? "-b0=$value":""16The minimum length of the initial block must be 2 or greaterperldefined $min_initialblock && $min_initialblock < 2 This will be the same as the minimum length of a block by defaultspecify_selectedblocksSelected Blocks (-s)perl($value) ? "-s=y":""118specify_nonconservedblocksNonconserved Blocks (-n)perl$specify_nonconservedblocks ? "-n=y":""020specify_ungappedUngapped Alignment (-u)perl($specify_ungapped) ? "-u=y":""022specify_maskfileSelected Blocks (-k)perl($specify_maskfile) ? "-k=y":""024specify_resultsfileResults And Parameters File (-p)ytsnperl"-p=$value "y26configure_resultsfileCharacters Per Line In Results And Parameters File (-v)perl($configure_resultsfile != $vdef) ? " -v=$value ":""6028The minimum characters per Line in results and parameters file is 50perldefined $configure_resultsfile && $configure_resultsfile < 50 specify_postscriptfilePostscript File With The Selected Blocks (-d)perl$specify_postscriptfile ? "-d=y":""030