Download @unique abbreviator
@unique abbreviator: unique abbreviator is a program that renames each
sequence in a FASTA file with a unique name. The name is built from the
first letter of the first part of the species name and the first two
letters of the second part. For example, Escherichia coli is renamed
Eco. If more than one sequence is present for a species, a number is
appended to the name, for example Eco1, Eco2, Eco3, etc. Unique sequence
names are needed so that a FASTA file can be aligned with the Clustal
program. The program takes a FASTA file as input and writes an output
file containing the FASTA sequences with unique species names.
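The naming rule described above can be sketched in Java as follows. This is only an illustration, not the source of the Names_c program: the class and method names are hypothetical, and it assumes that a name occurring once stays unnumbered (Eco) while a name occurring several times is numbered in input order (Eco1, Eco2, ...).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the naming rule; not the actual Names_c code.
class UniqueNameSketch {

    // First letter of the first word plus the first two letters of the
    // second word, e.g. "Escherichia coli" -> "Eco".
    static String abbreviate(String speciesName) {
        String[] parts = speciesName.trim().split("\\s+");
        if (parts.length < 2 || parts[0].isEmpty() || parts[1].length() < 2) {
            throw new IllegalArgumentException("Need a two-part name: " + speciesName);
        }
        String first = parts[0].substring(0, 1).toUpperCase();
        String second = parts[1].substring(0, 2).toLowerCase();
        return first + second;
    }

    // Assumption: a code seen once stays bare ("Eco"); a code seen several
    // times gets a running number as a postfix ("Eco1", "Eco2", ...).
    static List<String> uniqueNames(List<String> speciesNames) {
        // First pass: abbreviate every name and count each code.
        Map<String, Integer> totals = new HashMap<>();
        List<String> codes = new ArrayList<>();
        for (String name : speciesNames) {
            String code = abbreviate(name);
            codes.add(code);
            totals.merge(code, 1, Integer::sum);
        }
        // Second pass: number only the codes that occur more than once.
        Map<String, Integer> seen = new HashMap<>();
        List<String> result = new ArrayList<>();
        for (String code : codes) {
            if (totals.get(code) == 1) {
                result.add(code);
            } else {
                int n = seen.merge(code, 1, Integer::sum);
                result.add(code + n);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> names = Arrays.asList(
                "Escherichia coli", "Bacillus subtilis", "Escherichia coli");
        System.out.println(uniqueNames(names)); // prints [Eco1, Bsu, Eco2]
    }
}
```

Numbering is done per code rather than per species, so two different species that happen to share a code would still receive distinct names in this sketch.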
An example FASTA file can be obtained here: ycgt.fasta
A FASTA file has to be in the following format:
> -xx- | -xx- | -xx- [ -xx- ] -xx-
-xx-
If a FASTA sequence is not in this format (which is often the case), the
program leaves that sequence unprocessed and marks it with a
"---ERROR SEQUENCE---" message.
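A format check of this kind could look like the Java sketch below. It is an assumption-laden illustration, not the program's actual logic: it assumes the species name is the text inside the square brackets, that at least two "|" separators precede the brackets, and the class and method names are hypothetical. Only the "---ERROR SEQUENCE---" marker is taken from the description above.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of a header format check; not the actual Names_c code.
class HeaderFormatSketch {

    static final String ERROR_MARKER = "---ERROR SEQUENCE---";

    // Assumed layout: ">field | field | field [ species ] rest".
    // Group 1 captures the text between the square brackets.
    private static final Pattern HEADER = Pattern.compile(
            "^>[^\\[\\]]*\\|[^\\[\\]]*\\|[^\\[\\]]*\\[\\s*([^\\[\\]]+?)\\s*\\].*$");

    // Returns the bracketed species name, or empty if the header
    // does not follow the assumed layout.
    static Optional<String> extractSpecies(String headerLine) {
        Matcher m = HEADER.matcher(headerLine);
        if (!m.matches()) {
            return Optional.empty();
        }
        String species = m.group(1).trim();
        return species.isEmpty() ? Optional.empty() : Optional.of(species);
    }

    // Returns the species name for a well-formed header, else the error marker.
    static String speciesOrError(String headerLine) {
        return extractSpecies(headerLine).orElse(ERROR_MARKER);
    }

    public static void main(String[] args) {
        System.out.println(speciesOrError(
                ">gi | 123 | hypothetical protein [ Escherichia coli ] extra"));
        System.out.println(speciesOrError(">sequence without separators"));
    }
}
```

With the assumptions above, the first call in `main` yields the species name and the second yields the error marker.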
Command to run the program:
> java Names_c