ASTRAL on XSEDE5.15.4Coalescent-based species tree estimationSiavash Mirab and Tandy WarnowMirarab, S., and Warnow, T. (2015) ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes. Bioinformatics 31, i44-i52. 10.1093/bioinformatics/btv234Phylogeny / Alignmentastral_xsedeastral_invokeperl$num_gtress < "4000"perl"astral_5.15.4_expanse"0astral_invoke_gpuperl$num_gtress > "3999"perl"astral_5.15.4_expanse.gpu"0number_cores12scheduler.confperl$num_gtrees < 1000perl
"ChargeFactor=1.0\\n" .
"nodes=1\\n" .
"mem=15G\\n" .
"node_exclusive=0\\n" .
"cpus-per-task=8\\n" .
"threads_per_process=8\\n"
number_cores22scheduler.confperl$num_gtrees > 999 && $num_gtrees < 4000 perl
"ChargeFactor=1.0\\n" .
"nodes=1\\n" .
"mem=30G\\n" .
"node_exclusive=0\\n" .
"cpus-per-task=16\\n" .
"threads_per_process=16\\n"
number_cores32scheduler.confperl$num_gtrees > 3999 && $num_gtrees < 8000 perl
"ChargeFactor=1.0\\n" .
"gpu=1\\n" .
"nodes=1\\n" .
"mem=92G\\n" .
"node_exclusive=0\\n" .
"threads_per_process=10\\n"
number_cores42scheduler.confperl$num_gtrees > 7999 && $num_gtrees < 16000 perl
"ChargeFactor=1.0\\n" .
"gpu=2\\n" .
"nodes=1\\n" .
"mem=184G\\n" .
"node_exclusive=0\\n" .
"threads_per_process=20\\n"
number_cores52scheduler.confperl$num_gtrees > 15999 perl
"ChargeFactor=1.0\\n" .
"gpu=4\\n" .
"nodes=1\\n" .
"mem=368G\\n" .
"node_exclusive=1\\n" .
"threads_per_process=40\\n"
infileInputinfile.tre1infile_invokeInputperl"-i infile.tre"1number_threads14perl$num_gtrees < 1000 perl"-T 8"number_threads24perl$num_gtrees > 999 && $num_gtrees < 4000 perl"-T 16"number_threads34perl$num_gtrees > 3999 && $num_gtrees < 8000 perl"-T 10"number_gpus14perl$num_gtrees > 3999 && $num_gtrees < 8000 perl"-G 1"number_threads44perl$num_gtrees > 7999 && $num_gtrees < 16000 perl"-T 20"number_gpus24perl$num_gtrees > 7999 && $num_gtrees < 16000 perl"-G 1,2"number_threads54perl$num_gtrees > 15999perl"-T 40"number_gpus44perl$num_gtrees > 15999perl"-G 1,2,3,4"write_log99perl"2> output.log"runtime1scheduler.confMaximum Hours to Run (up to 168 hours)0.5The maximum hours to run must be less than 168perl$runtime > 168.0 && $num_gtrees < 4000The maximum hours to run must be greater than 0.05perl$runtime < 0.05The maximum hours to run must be less than or equal to 120perl$runtime > 120.0 && $num_gtrees > 3999perl"runhours=$value\\n"The job will run on 8 processors as configured. If it runs for the entire configured time, it will consume 8 x $runtime cpu hoursperl$num_gtrees < 1000 The job will run on 16 processors as configured. If it runs for the entire configured time, it will consume 16 x $runtime cpu hoursperl$num_gtrees > 999 && $num_gtrees < 4000 The job will run on 10 processors and 1 GPU as configured. If it runs for the entire configured time, it will consume 21 x $runtime cpu hoursperl$num_gtrees > 3999 && $num_gtrees < 8000 The job will run on 20 processors and 2 GPU as configured. If it runs for the entire configured time, it will consume 42 x $runtime cpu hoursperl$num_gtrees > 7999 && $num_gtrees < 16000 The job will run on 40 processors and 4 GPU as configured. If it runs for the entire configured time, it will consume 84 x $runtime cpu hoursperl$num_gtrees > 15999Estimate the maximum time your job will need to run. We recommend testing initially with a < 0.5hr test run because Jobs set for 0.5 h or less depedendably run immediately in the "debug" queue.
Once you are sure the configuration is correct, you then increase the time. The reason is that jobs > 0.5 h are submitted to the "normal" queue, where jobs configured for 1 or a few hours times may
run sooner than jobs configured for the full 168 hours.
num_gtreesNumber of Gene trees in your input fileoutfile_nameOutput file name perl(defined $value) ? "-o $value":"" Specify a filename for storing the output species tree.score_onlyScore the provided species tree and exit (-q)species_tree.treperl($value) ? "-q species_tree.tre":"" 8use_internodedistUSe NJst-like internode distancesperl($value) ? "-A":"" 6USe NJst-like internode distances instead of quartet distance for
building the search space (X). Unpublished work.branch_annotateHow much annotations should be added to each branch (-t)perl($value) ? "-t $value":"" 0123481038specify_lambdaSet the lambda parameter for the Yule prior. (-c)perl(defined $value) ? "-c $value":"" 0.512Set the lambda parameter for the Yule prior used in the calculations of
branch lengths and posterior probabilities. Set to zero to get ML branch
lengths instead of MAP. Higher values tend to shorten estimated branch
lengths and very high values can give inaccurate results (or even result
in underflow). (default: 0.5)upload_namemapfileSelect a name mapping file30namemapfile.txtperl(defined $value) ? "-a namemapfile.txt":"" Select a file containing the mapping between names in gene tree and names in
the species tree. The mapping file has one line per species, with one of
two formats:
species: gene1,gene2,gene3,gene4
species 4 gene1 gene2 gene3 gene4specify_minleavesRemove genes with less than specified number of leaves (-m)perl(defined $value) ? "-m $value":"" 26specify_samplingroundsPerform this many rounds of individual sampling for building the set X (--samplingrounds)perl(defined $value) ? "--samplingrounds $value":"" 26For multi-individual datasets, perform these many rounds of individual
sampling for building the set X. The program automatically picks this
parameter if not provided or if below onespecify_samplenumtreesSample this number of trees for each locus (-w)perl(defined $value) ? "-w $value":"" 126specify_exactsolnFind the exact solution by looking at all clusters (-x)perl($value) ? "-x ":"" 032Please do not use the exact (-x) option when there are more than 18 taxaperl$specify_exactsolnextra_bipartsHow much extra bipartitions should be added? (-p)perl(defined $value) ? "-p $value":"" 0121Quadratic distance-based method can be quite slowperl$extra_biparts = 3380: adds nothing extra. 1: (default): adds to X but not excessively (greedy resolutions).
2: adds a potentially large number and therefore can be slow (quadratic
distance-based).provide_extratreesProvide extra trees (with gene labels) (-e)perl(defined $value) ? "-e extratrees.tre ":"" extratrees.tre32Provide extra trees (with gene labels) used to enrich the set of
clusters searched extra_speciestreesProvide extra trees (with species labels) (-f)perl(defined $value) ? "-f extraspeciestrees.tre":"" extraspeciestrees.tre32Provide extra trees (with species labels) used to enrich the set of
clusters searchedremove_bipartsRemove bipartitions of the provided extra trees (--remove-bipartition)perl($value) ? "--remove-bipartition ":"" extraspeciestrees.tre32Removes bipartitions of the provided extra trees (with species labels)specify_trimthresholdTrimming threshold (-d)perl(defined $value) ? "-d $value":"" 026The trimming threshold is the user's estimate on normalized score; the closer
user's estimate is, the faster Astral runs. (default: 0)boot_onlygtrPerform bootstrapping but only with gene tree resampling. (--gene-only)perl!$use_bootstrapping perl($value) ? "--gene-only":"" 14results*