]> BlastN 2.2.1 Search DBs for Nucleotide Sequence similarity Altschul, Madden, Schaeffer, Zhang, Miller, Lipman Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaeffer,Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25:3389-3402. Nucleic Acid Sequence blastn blast_init perl " " -10 blast2 Blast program perl "blastall -p $value" 0 blastn blastn blastn compares a nucleotide query sequence against a nucleotide sequence database nb_proc perl " -a 2" 6 query Sequence File perl " -i query.fasta" query.fasta 3 8 seqfile perl 1 start_region Start of required region in query sequence (-L) 5 end_region End of required region in query sequence (-L) perl (defined $value) ? " -L \"$start_region $value\"": " -L $start_region" perl $start_region 5 nucleotid_db nucleotide db perl &blastDBpath; NCBI_NT 3 &nucdbs; choose a nucleotide db for blastn, tblastn or tblastx perl $blast2 =~ /^blastn|tblast[nx]$/ filter_opt Filtering and masking options 4 BLAST 2.0 and 2.1 uses the DUST low-complexity filter for blastn and seg for the other programs. Both 'DUST' and 'seg' are integral parts of the NCBI toolkit and are accessed automatically. If one uses '-F T' then normal filtering by seg or DUST (for blastn) occurs (likewise '-F F' means no filtering whatsoever). The -F option also takes a string as an argument. One may use such a string to change the specific parameters of seg or invoke other filters. Please see the 'Filtering Strings' section (below) for details. filter Filter query sequence (DUST with blastn, SEG with others) (-F) perl ($value) ? "" : " -F F" 1 4 other_filters Filtering options (-F must be true) The -F argument can take a string as input to specify that seg should be run with certain values, or that other non-standard filters should be used. For example, a coiled-coil filter may be invoked by specifying: -F 'C' . The filter is based on the work of Lupas et al. [Science, vol 252, pp. 1162-4 (1991)] written by John Kuzio [Wilson et al., J Gen Virol, vol. 76, pp. 2923-32 (1995)] One may also run both seg and coiled-coil together by using a semi colon: -F 'C;S' Filtering by DUST may also be specified by capital D: -F 'D' To specify that masking should only be done during the process of building the initial words, prepend the filtering command with the letter 'm', e.g.: -F 'm S' This specifies that seg (with default arguments) should be used for masking, but that the masking should only be done when the words are being built. If the -U option is used to mask any lower-case sequence in the input FASTA file; and one wishes to mask ONLY when building the lookup tables, specify: -F 'm' 4 v1 v2 v3 v4 v5 v1 " -F \"m S\"" v2 " -F C" v3 " -F \"C;S\"" v4 " -F D" v5 " -F m" lower_case Use lower case filtering (-U) perl ($value)? " -U T" : "" 0 4 This option specifies that any lower-case letters in the input FASTA file should be masked. selectivity_opt Selectivity options 5 The programs blastn and blastp offer fully gapped alignments. blastx and tblastn offer 'in-frame' gapped alignments and use sum statistics to link alignments from different frames. tblastx provides only ungapped alignments. Expect Expect: upper bound on the expected frequency of chance occurrence of a set of HSPs (-e) perl (defined $value && $value != $vdef)? " -e $value":"" 10 5 The statistical significance threshold for reporting matches against database sequences; the default value is 10, such that 10 matches are expected to be found merely by chance, according to the stochastic model of Karlin and Altschul (1990). If the statistical significance ascribed to a match is greater than the EXPECT threshold, the match will not be reported. Lower EXPECT thresholds are more stringent, leading to fewer chance matches being reported. Fractional values are acceptable. word_size Word Size (-W) (zero invokes default behavior) perl (defined $value) ? " -W $value" : "" 5 dist_hits Multiple Hits window size (zero for single hit algorithm) (-A) perl (defined $value) ? " -A $value" : "" 5 extend_hit Threshold for extending hits (-f) perl ($value)? " -f $value":"" Blast first searhcxes for short word pairs whose aligned score reaches at least this Threshold value (default for blastp is 11) (T in the NAR paper and in Blast 1.4) 5 dropoff_extent X dropoff for blast extention in bits (0.0 invokes default behavior) (-y) perl (defined $value) ? " -y $value" : "" 5 keep_hits Number of best hits from region to keep (-K) perl (defined $value) ? " -K $value" : "" If this option is used a value of 100 is recommended. 5 gapped_alig Perform gapped alignment (not available with tblastx) (-g) perl ($value)? "": " -g F" 1 perl $blast2 ne "tblastx" 5 dropoff X dropoff value for gapped alignment (in bits) (-X) perl (defined $value)? " -X $value":"" This value controls the path graph region explored by Blast during a gapped extension (Xg in the NAR paper) (default for blastp is 15). 5 dropoff_final X dropoff value for final alignment (in bits) (-Z) perl (defined $value) ? " -Z $value" : "" 5 scoring_opt Scoring options 4 mismatch Penalty for a nucleotide mismatch (blastn) (-q) perl (defined $value && $value != $vdef)? " -q $value":"" -3 perl $blast2 eq "blastn" 4 match Reward for a nucleotide match (blastn) (-r) perl (defined $value && $value != $vdef)? " -r $value":"" 1 perl $blast2 eq "blastn" 4 matrix Matrix (-M) perl (defined $value && $value ne $vdef)? " -M $value" : "" BLOSUM62 BLOSUM62 BLOSUM45 BLOSUM80 PAM30 PAM70 perl $blast2 ne "blastn" 4 open_a_gap Cost to open a gap (-G) perl (defined $value)? " -G $value":"" for blastn default=5; for blastp, blastx and tblastn default=10 4 extend_a_gap Cost to extend a gap (-E) perl (defined $value)? " -E $value":"" 2 5 for blastn default=2; for blastp, blastx and tblastn default=1 Limited values for gap existence and extension are supported for these three programs. Some supported and suggested values are: Existence Extension 10 1 10 2 11 1 8 2 9 2 (source: NCBI Blast page) translation_opt Translation options gc_query Query Genetic code to use (blastx) (-Q) perl (defined $value && $value != $vdef)? " -Q $value":"" 1 4 1 2 3 4 5 6 9 10 11 12 13 14 15 perl $blast2 =~ /blastx$/ gc_db DB Genetic code (for tblast[nx] only) (-D) perl ($value != $vdef) ? " -D $value":"" 1 4 1 2 3 4 5 6 9 10 11 12 13 14 15 perl $blast2 =~ /^tblast/ strand Query strand to search against database (for blastx and tblastx) (-S) perl ($value && $value != $vdef) ? " -S $value" : "" 3 1 2 3 perl $blast2 =~ /blastx$/ 4 affichage Report options Descriptions How many short descriptions? (-v) perl (defined $value && $value != $vdef)? " -v $value":"" 500 5 Maximum number of database sequences for which one-line descriptions will be reported (-v). Alignments How many alignments? (-b) perl (defined $value && $value != $vdef)? " -b $value":"" 250 5 Maximum number of database sequences for which high-scoring segment pairs will be reported (-b). view_alignments Alignment view options (not with blastx/tblastx) (-m) perl ($value)? " -m $value" : "" 0 4 0 1 2 3 4 5 6 7 8 perl $blast2 !~ /blastx$/ show_gi perl $protein_db eq "nrprot" Show GI's in deflines (only available for NCBI db such as nrprot) (-I) perl ($value)? " -I" : "" 0 4 Causes NCBI gi identifiers to be shown in the output along with the accession and/or locus name. Warning: only available for NCBI db's such as nrprot. seqalign_file SeqAlign file (-J option must be true) (-O) perl " -O seqalign_file.asn1" 4 SeqAlign is in ASN.1 format so it can be read with NCBI tools (such as sequin). This allows one to view the results in different formats. perl $believe seqalign_file.asn1 believe Believe the query defline (-J) perl ($value)? " -J":"" 0 4 htmloutput Html output 1 perl ($view_alignments !~ /^[78]$/) perl ($value)? " && html4blast -o blast2.html -g" : "" 20 htmlopt HTML output options (html4blast) perl $htmloutput && ($_html) && ($view_alignments !~ /^[78]$/) html4blast_input perl " blast2.txt" perl $htmloutput && ($view_alignments !~ /^[78]$/) 30 external_links Use external web sites for databases entries retrieval links (-e instead of -s) 0 perl $htmloutput && ($view_alignments !~ /^[78]$/) perl ($value)? " -e" : " -s" 25 -s option uses SRS for database entries retrieval links, whereas -e will use the original database site links. one_HSP_per_line Draw one HSP per line in image instead of putting all HSP in one line (-l) 0 25 perl $htmloutput && ($view_alignments !~ /^[78]$/) perl ($value)? " -l" : "" 25 Useful for genome searches where there is only one sequence in the database. image_query Generate images names based on corresponding query (-q) 0 25 perl $htmloutput && ($view_alignments !~ /^[78]$/) perl ($value)? " -q" : "" 25 Useful when you only want to keep the image. htmlfile perl $htmloutput && ($view_alignments !~ /^[78]$/) blast2.html png_file perl $htmloutput && ($view_alignments !~ /^[78]$/) blast2_1.png othersopt Other Options 5 restrict_db perl $protein_db eq "nrprot" Restrict search of database to GI's in file (-l) perl " -l restrict_db.param" 7 restrict_db.param psi_checkpoint PSI-TBLASTN checkpoint file (-R) perl "-R psi_checkpoint.param" perl $blast2 eq "psitblastn" psiblast_matrix perl 1 psiblast_matrix query 5 psi_checkpoint.param txtoutput perl " > blast2.txt" 7 tmp_outfile blast2.txt blast_output perl $view_alignments !~ [78]