HALIGN4 on ACCESS
1
A new strategy for rapidly aligning millions of sequences
Tong Zhou, Pinglu Zhang, Quan Zou, Wu Han
HAlign 4: a new strategy for rapidly aligning millions of sequences, Tong Zhou, Pinglu Zhang, Quan Zou, Wu Han, Bioinformatics, Volume 40, Issue 12, December 2024, btae718, https://doi.org/10.1093/
Phylogeny/Sequence Alignment
halign4_access
halign4_1
perl
"halign4_expanse "
1
mpi_processes_conf1
3
scheduler.conf
perl
!$scriptfile_specifies
perl
"jobtype=mpi\\n" .
"cpus-per-task=16\\n" .
"mem=32G\\n" .
"node_exclusive=0\\n" .
"mpi_processes=1\\n"
infile
Input File (must be in Nexus format) (-execute)
infile.fasta
perl
"infile.fasta output.fasta -t 16"
5
ALL_FILES
*
runtime
1
scheduler.conf
Maximum Hours to Run (click here for help setting this correctly)
168
The maximum hours to run must be less than 168
perl
$runtime > 168.0
Please enter a positive number for the maximum runtime
perl
$runtime < 0
Please specify a maximum runtime
perl
!defined $runtime
perl
"runhours=$value\\n"
The job will run on 32 processors as configured. If it runs for the entire configured time, it will consume 32 x $runtime cpu hours
perl
$runtime > 0
Estimate the maximum time your job will need to run. We recommend testing initially with a < 0.5hr test run because Jobs set for 0.5 h or less depedendably run immediately in the "debug" queue.
Once you are sure the configuration is correct, you then increase the time. The reason is that jobs > 0.5 h are submitted to the "normal" queue, where jobs configured for 1 or a few hours times may
run sooner than jobs configured for the full 168 hours.
specify_reference
Specify the reference sequence name (-r, no whitespace allowed)
perl
"-r $value"
10
The longest sequence will used as reference. White spaces are not allowed in sequence names
perl
!defined $specify_reference
specify_globalsa
Specify the Global SA threshold (-sa default=15)
perl
(defined $specify_globalsa) ? "-sa $value":""
15
15