Nomenclature and Symbolism for Amino Acids and Peptides

3AA-22

Continued from 3AA-20 to 3AA-21

Contents of 3AA-22

3AA-22 Names and Symbols for Derivatives of Named Peptides

References for 3AA-22

Continued in 3AA appendix


Part 3. Modification of Named Peptides (a revision and updating of [12])

3AA-22. NAMES AND SYMBOLS FOR DERIVATIVES OF NAMED PEPTIDES

It is often convenient to specify the structure of a peptide by reference to a named sequence of which it is a variant. The recommendations that follow allow this, but they apply only to modifications of the sequence involving normal amide links between residues.

Note. To exemplify any named peptide, the imaginary peptide 'iupaciubin' (to symbolize the harmonious co-operation of IUPAC and IUB), Ala-Lys-Glu-Tyr-Leu, is used in formulating the recommendations below. See also the Addendum for the use of seco.

3AA-22.1. Replacement of Residues

In a peptide of trivial name iupaciubin, if the qth amino-acid residue, starting from the N-terminal end of the chain, is replaced by the amino acid Xaa, the semitrivial name of the modifed peptide is [q-amino acid]iupaciubin, and the abbreviated form is [Xaaq]iupaciubin. A designation of the chain may be placed before the residue number, e.g. [AlaB12]insulin(cattle) (see Comment f). Examples:

[8-Citrulline]vasopressin, [Cit8]vasopressin

[5-Isoleucine,7-alanine]angiotensin II, [Ile5,Ala7]angiotensin II

Notes

(a) In the full name, the replacement amino acid is designated by its residue name, not the name of its acyl group (e. g. glycine, not glycyl). This name, and the position of replacement, are given in square brackets.

(b) In the abbreviated form, the amino-acid residues are designated by standard 3-letter symbols (Table 1), the first letter only being a capital (3AA-14), in square brackets.

(c) In the abbreviated form the position of substitution is indicated in a special fashion, i.e. by a superior numeral, to indicate that it is a residue, not an individual atom, that is being replaced.

(d) The residue replaced is not designated in these semitrivial names in order to keep the names short, and because this form of nomenclature of 3AA-22.1 clearly differs from ordinary substitution nomenclature.

(e) The replacement of an amino-acid residue by its enantiomer may be shown by application of this rule as follows: the replacement in iupaciubin of L-tyrosine at position 4 by D-tyrosine results in [4-D-tyrosine]iupaciubin with the abbreviations [D-Tyr4]iupaciubin. A mixture of this with iupaciubin gives [4-ambo-tyrosine]iupaciubin or [ambo-Tyr4]iupaciubin (3AA-13.2 and 3AA-19.2). Examples: [D-Ser1]Corticotropin; [D-Asp1]angiotensin II.

(f) Specification of a sequence may require the species as well as the peptide to be named. If so, the name of the species should be attached, in parenthesis, to the name of the peptide whenever a modifying prefix is present. Thus a substitution in cattle insulin could give [AlaB12]insulin(cattle). (We prefer 'cattle' as the adjective for this species, since 'bovine' is not used in common speech to designate the species, but to compare human attributes with those of the cow, and 'ox' can be misleading.)

(g) It may be convenient to represent replacement of Gln by Glu or of Asn by Asp with the prefix 'desamido'. Thus [Glu30]corticotropin(pig) could be called desamido30-corticotropin(pig). Similarly replacement of Gla by Glu can be designated with the prefix 'decarboxy'.

3AA-22.2. Extension of the Peptide Chain

The compounds obtained by extension of a peptide at either the N-terminus on the C-terminus are designated by the kinds of names and abbreviations shown below; these are in accordance with general principles of peptide nomenclature (3AA-13.1). Examples:

a) Extension at the N-terminus

Aminoacyliupaciubin Xaa-iupaciubin

Valyliupaciubin Val-iupaciubin

Valylglycyliupaciubin Val-Gly-iupaciubin
(for extension by two residues)

b) Extension at the C-terminus

Iupaciubinylamino acid Iupaciubinyl-Xaa

Iupaciubinylleucine Iupaciubinyl-Leu

This rule is not directly applicable to extension at the C-terminus of natural peptides that possess a terminal amide group, such as oxytocin and α-melanotropin. For these, a new name should be given to the corresponding peptide with a free carboxyl group by adding 'oic acid' to the trivial name, e. g. oxytocinoic acid from oxytocin, so that extension can then be denoted as above, e. g. oxytocinoyl-Xaa.

Note

The enkephalins are the two peptides Tyr-Gly-Gly-Phe-Leu and Tyr-Gly-Gly-Phe-Met. Designations such as Leu-, leucyl- and leucine-enkephalin have been given to the former, with corresponding terms for the latter. These all wrongly imply N-terminal extension, and could not be used together with any indication of such extension. Morley [25] has advocated [Leu]enkephalin, if necessary [Leu5]enkephalin, in accordance with 3AA-22.1, implying that enkephalin means Tyr-Gly-Gly-Phe-Xaa. We believe that [Leu5]enkephalin is the best designation.

3AA-22.3. Insertion of Residues

The compound obtained by insertion of an additional amino-acid residue Xaa in the position between the qth and the (q+1)th residue of the peptide iupaciubin is named endo-qa-amino acid-iupaciubin (abbreviated form: endo-Xaaqa-iupaciubin). Example:

endo-4a-tyrosine-angiotensin II, endo-Tyr4a-angiotensin II

Notes

(a) This form has analogies in other fields where endo implies the insertion of something into a structure (e.g. endo-methylene). The prefix or index qa is based on analogies with the steroids where the atoms inserted into a ring after atom no. q are designated qa, qb, etc.

(b) The prefix 'homo' is not suitable for designating the insertion of a whole residue, especially since it is commonly used to modify the names of individual amino acids, e.g. homoserine (3AA-2.3).

(c) Multiple insertions, and insertion of two or more residues together in the same place in the chain, are shown by a logical extension of this recommendation. Thus the insertion into the peptide iupaciubin of threonine between residues 1 and 2, and of valine and glycine (in that order) between residues 4 and 5, is shown by the name 'endo-1a-threonine,4a-valine,4b-glycine-iupaciubin' and the abbreviation 'endo-Thr1a,Val4a,Gly 4b-iupaciubin'.

3AA-22.4. Removal of Residues

The compound obtained by the formal removal of an amino-acid residue from the peptide iupaciubin in position q is designated by the name des-q-amino acid-iupaciubin, abbreviated des-Xaaq-iupaciubin. Example:

des-7-proline-oxytocin, des-Pro7-oxytocin
Notes

(a) Removal of a whole residue is indicated in a way similar to that for removal of a ring in steroids, e.g. des-A-androstane.

(b) The form 'de' is not suitable as a prefix because it is easily confused, in speaking, with D (for configuration).

(c) Multiple deletions are designated similarly, e.g. des-Ile3,Asn5-oxytocin. If a complete sequence is to be removed, the first and last loci of this sequence are all that need be specified, and they should be put in parentheses with a hyphen between them, e.g. des-(B24-B28)-insulin(mouse).

See also the Addendum for the use of des with seco.

3AA-22.5. Substitution of Side Chains of Residues

The compound formed by introducing an additional amino-acid residue as a substituent of the side chain of a residue in a peptide is named by applying the rules of peptide nomenclature (3AA-7, -9 and -13) to the trivial name, as follows.

22.5.1. Acylation of a Side-Chain Amino Group

If the additional residue acylates an amino group of a peptide, the name of the additional amino acid is written, in its acyl group form (3AA-9.3), and prefixed by symbols indicating the position of substitution (atom and residue number, see 3AA-13.4).

Example: the imaginary compound

derived by acylation of the ε-amino group of the lysine residue at position 2 of iupaciubin (Ala-Lys-Glu-Tyr-Leu) with a valyl group is named Nε2-valyl-iupaciubin (abbreviated Nε2-Val-iupaciubin).

22.5.2. Other Substituents Named as Prefixcs

Other substituents that can be named as prefixes are treated similarly to amino acyl groups. Examples:

N6.2-Methyliupaciubin for

C4.3-Carboxyiupaciubin for
(alternative to replacement with Gla under 3AA-22.1)

O4.4-Sulfoiupaciubin or iupaciubin O4.4-sulfate for

O4.12-Sulfogastrin or gastrin O4.12-sulfate for gastrin II.

NαB1,NεB29-bis(Boc)insulin or N2.B1,N6.B29-bis(Boc)insulin

22.5.3. Acylation by a Side-Chain Carboxyl Group

If an additional amino acid is joined by amide formation between its 2-amino group and the side-chain carboxyl of the peptide iupaciubin, so that the additional amino acid has a free 1-carboxyl group, the derivative is named by specifying the position of substitution (atom and residue number, 3AA-13.4) and it is given the designation 'iupaciubinyl-amino acid'.

Example: the imaginary compound

in which valine is acylated by the [gamma]-carboxyl group of a glutamic residue in position 3 (Glu-3) of iupaciubin (Ala-Lys-Glu-Tyr-Leu), is named N-(iupaciubin-C δ3-yl)valine, or N-(iupaciubin-C5.3-yl)valine, abbreviated to iupaciubin-C5.3-yl-Val.

Prefixes may also need to be formed from peptide names for use as substituents in other types of compound, as described in 3AA-13.5.

3AA-22.6. Partial Sequences (fragments)

A peptide derived from a named peptide iupaciubin by removal of all residues before the pth and all after the qth is named as iupaciubin-(p-q)-peptide. Examples:

From α-melanotropin

we may have

or α-melanotropin-(4-10)-peptide

or, to illustrate the naming of a peptide that contains two fragments, and also a C-terminal amide group:

His-Phe-Arg-Lys-Pro-Val-NH2 may be called α-melanotropin-(6-8)-(11-13)-peptide amide.

For oligopeptides it may be convenient to state the length of peptide, e. g. iupaciubin-(2-4)-tripeptide, but this is not normally useful for peptides of over about twelve residues, because the larger multiplying affixes are not widely known. Such a check on the number of residues is particularly useful when two or more sequences are joined, e.g. α-melanotropin-(6-8)-(11-13)-hexapeptide amide.

3AA-22.7. Peptides with Reversed Sequence and Enantiomers

The peptide whose sequence is the reverse of a named peptide may itself be named with the prefix 'retro-', giving retro-iupaciubin from iupaciubin. The enantiomer of a named peptide may be specified with the prefix 'ent-' (a contracted form of enantio-, F-6.4 of [15]), giving ent-iupaciubin from iupaciubin.

3AA-22.8. Peptide Analogues

Analogues of peptides in which the -CO-NH- group that connects two residues is replaced by another group (3AA-19.7) may be indicated by placing ψ in square brackets before the name [25]. The ψ is placed between superscripts indicating the residues modified, and is followed by a comma and the replacing group, e.g. [3ψ4,CH2-S]iupaciubin.

3AA-22.9. Summary of Modification Nomenclature

Click here for "table free" view if the following is faulty.

The application of the principles of 3AA-22 is illustrated in Table 6.

Table 6. Application of the methods of modifying named peptides

SectionOperationShort nameStructure
--Iupaciubin
22.1Replacement[Phe4]iupaciubin
22.2aExtension (N-terminal) Arg-iupaciubin
22.2bExtension (C-terminal) Iupaciubinyl-MetAla-Lys-Glu-Tyr-Leu-Met
22.3InsertionEndo-Thr2a-iupaciubin
22.4RemovalDes-Glu3-iupaciubin
22.5.1Side-chain substitution on amino groupNε2-Val-iupaciubin
22.5.2Side-chain substitution on carboxyl groupCδ3-Iupaciubinyl-Val
22.6Partial sequence Iupaciubin-(2-4)-peptideLys-Glu-Tyr
22.7Reversal of sequence retro-IupaciubinLeu-Tyr-Glu-Lys-Ala
22.7Enantiomer ent-IupaciubinD-Ala-D-Lys-D-Glu-D-Tyr-D-Leu
22.8Replacement of CO-NH of peptide bond[3ψ4,CH2-S]iupaciubin Ala-Lys-Glu-ψ(CH2-S)-Tyr-Leu
Addendum 3Partial hydrolysis of a peptide [seco-3/4]oxytocin

* Square brackets are required to indicate replacement, but are not used for most other modifications.


References

7. International Union of Biochemistry (1978) Biochemical Nomenclature and Related Documents, The Biochemical Society, London.

12. IUPAC-IUB Commission on Biochemical Nomenclature (CBN), Rules for Naming Synthetic Modifications of Natural Peptides, 1966, Arch. Biochem. Biophys. 121, 6-8 (1967); Biochem. J. 104, 17-19 (1967), corrected 135, 9 (1973); Biochemistry, 6, 362-364 (1967); Biochim. Biophys. Acta, 133, 1-5 (1967); Bull. Soc. Chim. Biol. 49, 325-330 (1967) (in French); Eur. J. Biochem. 1, 379-381 (1967), corrected 45, 3 (1974); Hoppe-Seyler's Z. Physiol. Chem. 348, 262-265 (1967) (in German); J. Biol. Chem. 242, 555-557 (1967); Mol. Biol. 2, 466-469 (1968) (in Russian); Pure Appl. Chem. 31, 647-653 (1972); also pp. 85-87 in [7].

14. International Union of Pure and Applied Chemistry (1979) Nomenclature of Organic Chemistry, Sections A, B, C, D, E, F and H, Pergamon Press, Oxford.

15. IUPAC Commission on the Nomenclature of Organic Chemistry (CNOC). Nomenclature of Organic Chemistry, Section F: Natural Products and Related Compounds, Recommendations 1976, Eur. J. Biochem. 86, 1-8 (1978); also pp. 151-26 in [7] and pp. 491-511 in [14]. [See also Biochemical Nomenclature and Related Documents, 2nd edition, Portland Press, 1992, pages 19-26.]

25. Morley, J. S. (1981) Neuropeptides, 1, 231-235.


Continue to the next section with Appendix to Amino Acids and Peptides.

Return to Amino Acids and Peptides home page.