Convert amino acid sequence to nucleotide sequence
SeqNT
= aa2nt(SeqAA
)
SeqNT
=
aa2nt(SeqAA
, ...'GeneticCode', GeneticCodeValue
,
...)
SeqNT
= aa2nt(SeqAA
,
...'Alphabet' AlphabetValue
, ...)
SeqAA | One of the following:
Examples: |
GeneticCodeValue | Integer, character vector, or string specifying a genetic code number or code name from
the table Genetic Code. Default is
Tip If you use a code name, you can truncate the name to the first two letters of the name. |
AlphabetValue | Character vector or string specifying a nucleotide alphabet. Choices are:
|
SeqNT | Nucleotide sequence specified by a character vector of letter codes. |
converts
an amino acid sequence, specified by SeqNT
= aa2nt(SeqAA
)
,
to a nucleotide sequence, returned in SeqAA
SeqNT
,
using the standard genetic code.
In general, the mapping from an amino acid to a nucleotide codon
is not a one-to-one mapping. For amino acids with multiple possible
nucleotide codons, this function randomly selects a codon corresponding
to that particular amino acid. For the ambiguous characters B
and Z
,
one of the amino acids corresponding to the letter is selected randomly,
and then a codon sequence is selected randomly. For the ambiguous
character X
, a codon sequence is selected randomly
from all possibilities.
calls SeqNT
= aa2nt(SeqAA
,
...'PropertyName
', PropertyValue
,
...)aa2nt
with optional properties
that use property name/property value pairs. You can specify one or
more properties in any order. Each PropertyName
must
be enclosed in single quotation marks and is case insensitive. These
property name/property value pairs are as follows:
specifies a genetic code to use when converting an amino acid sequence to a nucleotide
sequence. SeqNT
=
aa2nt(SeqAA
, ...'GeneticCode', GeneticCodeValue
,
...)GeneticCodeValue
can be an integer, character
vector, or string specifying a code number or code name from the table Genetic Code. Default is 1
or
'Standard'
. The amino acid to nucleotide codon mapping for the
Standard genetic code is shown in the table Standard Genetic Code.
Tip
If you use a code name, you can truncate the name to the first two letters of the name.
specifies
a nucleotide alphabet. SeqNT
= aa2nt(SeqAA
,
...'Alphabet' AlphabetValue
, ...)AlphabetValue
can
be 'DNA'
, which uses the symbols A
, C
, G
,
and T
, or 'RNA'
, which uses
the symbols A
, C
, G
,
and U
. Default is 'DNA'
.
Genetic Code
Code Number | Code Name |
---|---|
1 | Standard |
2 | Vertebrate Mitochondrial |
3 | Yeast Mitochondrial |
4 | Mold , Protozoan , Coelenterate
Mitochondrial , and Mycoplasma/Spiroplasma |
5 | Invertebrate Mitochondrial |
6 | Ciliate , Dasycladacean ,
and Hexamita Nuclear |
9 | Echinoderm Mitochondrial |
10 | Euplotid Nuclear |
11 | Bacterial and Plant Plastid |
12 | Alternative Yeast Nuclear |
13 | Ascidian Mitochondrial |
14 | Flatworm Mitochondrial |
15 | Blepharisma Nuclear |
16 | Chlorophycean Mitochondrial |
21 | Trematode Mitochondrial |
22 | Scenedesmus Obliquus Mitochondrial |
23 | Thraustochytrium Mitochondrial |
Standard Genetic Code
Amino Acid Name | Amino Acid Code | Nucleotide Codon |
---|---|---|
Alanine | A | GCT GCC GCA GCG |
Arginine | R | CGT CGC CGA CGG AGA AGG |
Asparagine | N | AAT AAC |
Aspartic acid (Aspartate) | D | GAT GAC |
Cysteine | C | TGT TGC |
Glutamine | Q | CAA CAG |
Glutamic acid (Glutamate) | E | GAA GAG |
Glycine | G | GGT GGC GGA GGG |
Histidine | H | CAT CAC |
Isoleucine | I | ATT ATC ATA |
Leucine | L | TTA TTG CTT CTC CTA CTG |
Lysine | K | AAA AAG |
Methionine | M | ATG |
Phenylalanine | F | TTT TTC |
Proline | P | CCT CCC CCA CCG |
Serine | S | TCT TCC TCA TCG AGT AGC |
Threonine | T | ACT ACC ACA ACG |
Tryptophan | W | TGG |
Tyrosine | Y | TAT, TAC |
Valine | V | GTT GTC GTA GTG |
Asparagine or Aspartic acid (Aspartate) | B | Random codon from D and N |
Glutamine or Glutamic acid (Glutamate) | Z | Random codon from E and Q |
Unknown amino acid (any amino acid) | X | Random codon |
Translation stop | * | TAA TAG TGA |
Gap of indeterminate length | - | --- |
Unknown character (any character or symbol not in table) | ? | ??? |
aminolookup
| baselookup
| geneticcode
| nt2aa
| rand
| revgeneticcode
| seqviewer