PIMA

PIMA 1.40
Pattern-Induced Multi-sequence Alignment program
alignment:multiple
R. D. Smith and T. F. Smith
R. D. Smith and T. F. Smith. Pattern-induced multi-sequence alignment (PIMA) algorithm employing secondary structure-dependent gap penalties for use in comparative modelling. Protein Engineering, vol. 5, no. 1, pp. 35-41, 1992
alignment:multiple
pima pima seqlab

pima
perl "pima" 0

sequence Sequences
perl " $value" 3
Name of the input file containing the sequences to be clustered and multi-aligned. Sequences can be in any of the following formats: IG/Stanford, GenBank/GB, NBRF, EMBL, Pearson/Fasta, PIR/CODATA, Table (LOCUS_NAME SEQUENCE [one seq/line]). LOCUS_NAMES cannot contain left or right parentheses. The format of the output sequence files will match the format of this input file.
1 8

cluster_name cluster_name
An arbitrary name used to label the cluster.
perl " $value" 2

pima_params Parameters

ref_seq_name ref_seq_name
[optional; if specified, then sec_struc_seq_filename must also be specified] Locus name of one of the primary sequences for which the secondary structure is in the file sec_struc_seq_filename.
perl ($value)? " $value" : ""
4

sec_struc_seq_filename sec_struc_seq_filename
[optional; if specified, then ref_seq_name must also be specified] Name of a file containing secondary structure sequences for one or more of the primary sequences in the set. The secondary structure sequences in this file must be in one of the formats listed above (see sequence, above). The locus name of each sequence must be the locus name of its corresponding primary sequence with the suffix '.ss' (e.g. 1ldm.ss). An alpha-helix, 3-10 helix and beta-strand must be designated 'h', 'g', and 'e', respectively. All other characters in the secondary structure sequences will be ignored with respect to the structure-dependent gap penalty. To allow gaps to be placed between the first and the second and the last elements of these structures, the first and last 2 elements of each should be changed to another character designation. In the secondary structure sequence file pdb-dssp.ss provided with this package, these end cap elements are designated 'i', 'f', and 'd' for alpha-helices, 3-10 helices and beta-strands, respectively.
perl ($value)? " $value" : "" 5

pima_options options 1

score_cutoff cluster score cutoff (-c)
Use a cluster score cutoff of number. This is the lowest match score to be used to incorporate a sequence into a cluster. The default value of 0.0 will force all input sequences into 1 cluster, but the final pattern may be completely degenerate.
perl ($value)? " -c $value " : ""
0.0

ext_gap_cost gap extension penalty (-d)
Use a length dependent gap penalty of number. This is the cost of extending a gap. The default value is dependent on the matrix file used.
perl ($value)? " -d $value" : ""

gap_open_cost gap opening penalty (-i)
Use a length independent gap penalty of number. This is the cost of opening a gap. The default value is dependent on the matrix file used.
perl ($value)? " -i $value" : ""

min_score minimum local score (-l)
Use minimum local score of number. This is the lowest score a quadrant can have before an attempt is made to join this local alignment with the local alignment at the previous step. The default value is dependent on the matrix file used.
perl ($value)? " -l $value" : ""

mat_file matrix file (-m)
Use matrix file with the name file. The default matrix file is patgen.mat and is provided with this package. The matrix file class1.mat uses the original pima alphabet. The matrix file class2.mat is also provided, which is similar to the matrix file class1.mat but uses the new alphabet.
perl ($value)? " -m $value" : ""

not_num_ext Do not use numerical extensions on each step of the alignment. (-n)
perl ($value)? " -n" : ""
0

sec_struc_gap_cost secondary structure gap penalty (-t)
Use a secondary structure gap penalty of number. This is the cost of a gap at a position matching a secondary structure character. The default value is dependent on the matrix file used and is always 10 times the value of the length independent gap penalty of the matrix file.
perl ($value)? " -t $value " : ""

results *.cluster *.pattern *.pima
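
The end-cap convention described for sec_struc_seq_filename can be illustrated with a short Python sketch. It assumes that "the first and last 2 elements" means two positions at each end of every run of 'h', 'g' or 'e'; the source wording is ambiguous on this point, so the width is left as a parameter. The function name cap_structure_ends and the cap_width argument are hypothetical and are not part of the PIMA package.

```python
import re

# End-cap characters named in the description:
# alpha-helix 'h' -> 'i', 3-10 helix 'g' -> 'f', beta-strand 'e' -> 'd'.
END_CAPS = {"h": "i", "g": "f", "e": "d"}


def cap_structure_ends(ss, cap_width=2):
    """Replace the ends of each h/g/e run with its end-cap character.

    cap_width is an assumption: the number of positions recoded at each
    end of a run. Runs too short to keep an interior are capped entirely.
    """
    def recap(match):
        run = match.group(0)
        cap = END_CAPS[run[0]]
        if len(run) <= 2 * cap_width:
            return cap * len(run)
        interior = run[cap_width:len(run) - cap_width]
        return cap * cap_width + interior + cap * cap_width

    # Each alternative matches one maximal run of a single structure type.
    return re.sub(r"h+|g+|e+", recap, ss)
```

For example, cap_structure_ends("--hhhhhh--eeeee-") gives "--iihhii--ddedd-": only the interior positions keep the characters that attract the structure-dependent gap penalty.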
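
Each Perl format fragment above contributes one piece of the pima command line. The following Python sketch shows how those pieces might be assembled. It assumes that the bare numbers following the fragments (0 to 5) give the argument order: program name, options, cluster name, sequence file, then the optional reference name and secondary structure file. The function name build_pima_command and the example file names are hypothetical; only the flags and parameter names are taken from the descriptions above.

```python
def build_pima_command(cluster_name, sequence,
                       ref_seq_name=None, sec_struc_seq_filename=None,
                       score_cutoff=None, ext_gap_cost=None,
                       gap_open_cost=None, min_score=None, mat_file=None,
                       not_num_ext=False, sec_struc_gap_cost=None):
    """Return the pima argument list for the given parameter values."""
    # The two secondary structure arguments are only valid as a pair.
    if (ref_seq_name is None) != (sec_struc_seq_filename is None):
        raise ValueError("ref_seq_name and sec_struc_seq_filename "
                         "must be specified together")

    args = ["pima"]                      # assumed position 0
    valued_options = [                   # assumed position 1
        ("-c", score_cutoff),
        ("-d", ext_gap_cost),
        ("-i", gap_open_cost),
        ("-l", min_score),
        ("-m", mat_file),
    ]
    for flag, value in valued_options:
        if value is not None:            # unset options are omitted
            args += [flag, str(value)]
    if not_num_ext:
        args.append("-n")                # switch without a value
    if sec_struc_gap_cost is not None:
        args += ["-t", str(sec_struc_gap_cost)]

    args += [cluster_name, sequence]     # assumed positions 2 and 3
    if ref_seq_name is not None:
        args += [ref_seq_name, sec_struc_seq_filename]  # positions 4 and 5
    return args
```

With the hypothetical inputs build_pima_command("globins", "globins.fasta", gap_open_cost=5, mat_file="class2.mat"), the result is ["pima", "-i", "5", "-m", "class2.mat", "globins", "globins.fasta"]. Returning a list rather than a single string avoids shell quoting problems if the list is later passed to a process launcher.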