Convert amino acid sequence from letter to integer representation
SeqInt = aa2int(SeqChar)
SeqChar | Character vector or string containing single-letter codes specifying an amino acid sequence. |
SeqInt | Amino acid sequence specified by a row vector of integers. |
SeqInt = aa2int(SeqChar) converts SeqChar, a character vector or string containing single-letter codes specifying an amino acid sequence, to SeqInt, a row vector of integers specifying the same amino acid sequence. For valid letter codes, see the table Mapping Amino Acid Letter Codes to Integers.
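As a minimal usage sketch of the syntax above (it assumes Bioinformatics Toolbox is installed so that aa2int is on the path; the input 'MATLAB' is only an illustrative sequence), each letter is converted to its integer from the mapping table:

```matlab
% Requires Bioinformatics Toolbox (assumption: aa2int is available on the path).
SeqChar = 'MATLAB';          % illustrative sequence: M, A, T, L, A, B

% Convert each single-letter code to its integer code.
% Per the mapping table: M = 13, A = 1, T = 17, L = 11, B = 21.
SeqInt = aa2int(SeqChar)     % expected values: 13 1 17 11 1 21
```

The result has one integer per input letter, in the same order as the input sequence.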
Mapping Amino Acid Letter Codes to Integers
Amino Acid | Code | Integer |
---|---|---|
Alanine | A | 1 |
Arginine | R | 2 |
Asparagine | N | 3 |
Aspartic acid (Aspartate) | D | 4 |
Cysteine | C | 5 |
Glutamine | Q | 6 |
Glutamic acid (Glutamate) | E | 7 |
Glycine | G | 8 |
Histidine | H | 9 |
Isoleucine | I | 10 |
Leucine | L | 11 |
Lysine | K | 12 |
Methionine | M | 13 |
Phenylalanine | F | 14 |
Proline | P | 15 |
Serine | S | 16 |
Threonine | T | 17 |
Tryptophan | W | 18 |
Tyrosine | Y | 19 |
Valine | V | 20 |
Asparagine or Aspartic acid (Aspartate) | B | 21 |
Glutamine or Glutamic acid (Glutamate) | Z | 22 |
Unknown amino acid (any amino acid) | X | 23 |
Translation stop | * | 24 |
Gap of indeterminate length | - | 25 |
Unknown character (any character or symbol not in table) | ? | 0 |
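The table can also be read as a simple position lookup. The following sketch reproduces the mapping in base MATLAB and is not the toolbox implementation. The variable names codes, seq, and seqInt are hypothetical, and unlike aa2int this sketch only recognizes the exact uppercase codes listed in the table.

```matlab
% Hypothetical illustration of the mapping table; not how aa2int is implemented.
% The character at position k in codes is the letter code for integer k (1..25).
codes = 'ARNDCQEGHILKMFPSTWYVBZX*-';

seq = 'MKV*-?';              % illustrative input; '?' is not in the table

% ismember returns, for each character of seq, its position in codes,
% or 0 when the character is absent, matching the "unknown character" row.
[~, seqInt] = ismember(seq, codes);

disp(seqInt)                 % 13 12 20 24 25 0
```

Because the integers in the table are consecutive, a single character vector ordered by integer code is enough to express the whole mapping, with 0 reserved for anything outside it.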
aminolookup | int2aa | int2nt | nt2int