Peptide Basics

The 20 Amino Acids That Build Every Peptide

12 min read|March 25, 2026

Peptide Basics

20 Standard Amino Acids

Every peptide, protein, and enzyme in the human body is assembled from the same 20 amino acids, linked together by peptide bonds in sequences specified by DNA.

NCBI StatPearls, Essential Amino Acids, 2025

NCBI StatPearls, Essential Amino Acids, 2025

Grid showing the chemical structures of all 20 standard amino acids organized by side chain propertiesView as image

The human body contains tens of thousands of distinct peptides and proteins. Insulin is 51 amino acids. Semaglutide is 31. Oxytocin is 9. Thymosin beta-4 is 43. Despite their enormous functional diversity, every one of them is built from the same 20 amino acids, arranged in different sequences and lengths. The identity and order of amino acids in a peptide chain determines its shape, its receptor binding properties, its stability, and ultimately its biological function. Understanding these 20 building blocks is the foundation for understanding how every peptide therapeutic works. For how these amino acids link together, see The Peptide Bond: How Amino Acids Link Together.

Key Takeaways

  • All 20 standard amino acids share the same core structure: an amino group (NH2), a carboxyl group (COOH), a hydrogen atom, and a variable side chain (R-group) attached to a central alpha carbon
  • Nine amino acids are essential (must be obtained from diet): histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine; the remaining 11 are synthesized endogenously
  • Side chain properties (charged, polar, nonpolar, aromatic) determine how a peptide folds, which receptors it binds, and how long it survives in the bloodstream
  • Non-natural amino acids (D-amino acids, alpha-methylated amino acids) are now routinely incorporated into therapeutic peptides to improve stability; Mendes et al. (2026) showed D-amino acid substitution and cyclization enhance antimicrobial peptide stability
  • Zhou et al. (2026) reviewed non-natural amino acid modifications in antibacterial peptides, cataloguing how specific substitutions at specific positions alter activity, resistance, and pharmacokinetics
  • The 20 standard amino acids produce enough sequence diversity for approximately 20^50 possible 50-amino-acid peptides, a number vastly exceeding the number of atoms in the observable universe, explaining why peptide drug discovery can find molecules for nearly any biological target

The shared structure

Every amino acid has four groups attached to a central carbon atom (the alpha carbon):

  • An amino group (-NH2): provides the "amino" in amino acid
  • A carboxyl group (-COOH): provides the "acid"
  • A hydrogen atom
  • A side chain (R-group): unique to each amino acid

When two amino acids are linked by a peptide bond (a condensation reaction releasing water), the amino group of one amino acid reacts with the carboxyl group of the next, forming the peptide backbone. The side chains project outward from this backbone and are responsible for essentially all of a peptide's chemical properties, receptor interactions, and biological behavior.

For the distinction between peptides and proteins, see Peptide vs Protein: Where the Line Is Drawn. For how peptide shape emerges from amino acid sequence, see Primary, Secondary, and Tertiary Structure: How Peptide Shape Determines Function.

The four groups of amino acids

Nonpolar (hydrophobic) amino acids

These amino acids have side chains made of carbon and hydrogen that repel water. In folded proteins and larger peptides, nonpolar residues cluster in the interior, away from the aqueous environment. In short peptides, they contribute to membrane interaction and receptor binding.

Glycine (Gly, G): The simplest amino acid, with only a hydrogen atom as its side chain. Provides maximum backbone flexibility. Found at positions requiring tight turns in peptide structure.

Alanine (Ala, A): A single methyl group (-CH3). Small and relatively inert. Commonly used as a reference amino acid in substitution studies.

Valine (Val, V): Branched-chain amino acid (BCAA). Essential. Important for muscle protein synthesis and energy metabolism.

Leucine (Leu, L): BCAA and the most potent amino acid activator of the mTOR pathway, which drives muscle protein synthesis. Essential.

Isoleucine (Ile, I): BCAA. Essential. Contributes to hydrophobic packing in protein cores.

Proline (Pro, P): Unique among the 20 because its side chain forms a ring with the backbone nitrogen, creating a rigid kink. Proline residues disrupt alpha helices and are found at turns and bends.

Phenylalanine (Phe, F): Contains a benzene ring. Essential. Precursor to tyrosine. Contributes to aromatic stacking interactions important for peptide-receptor binding.

Tryptophan (Trp, W): The largest amino acid, with an indole ring system. Essential. Precursor to serotonin and melatonin. Its aromatic side chain is critical for anchoring peptides in lipid membranes.

Methionine (Met, M): Contains a sulfur atom in its side chain. Essential. Often the first amino acid in newly synthesized proteins (start codon AUG).

Polar (uncharged) amino acids

These have side chains that can form hydrogen bonds with water but carry no formal charge at physiological pH.

Serine (Ser, S): Contains a hydroxyl group (-OH). Frequently a site of phosphorylation (post-translational modification). Nonessential.

Threonine (Thr, T): Similar to serine but with an additional methyl group. Essential. Also a phosphorylation site.

Asparagine (Asn, N): Contains an amide group. Nonessential. Common site for N-linked glycosylation.

Glutamine (Gln, Q): Longer version of asparagine. Conditionally essential during illness or stress. The most abundant free amino acid in plasma.

Tyrosine (Tyr, Y): Contains a hydroxyl-bearing phenol ring. Conditionally essential (synthesized from phenylalanine). Phosphorylation of tyrosine residues is central to receptor signaling (e.g., insulin receptor, growth factor receptors).

Cysteine (Cys, C): Contains a thiol group (-SH) that can form disulfide bonds with another cysteine. These covalent cross-links stabilize peptide structure. Disulfide bonds are critical in insulin (3 disulfide bonds), oxytocin (1 disulfide bond), and many other peptide hormones. Conditionally essential.

Positively charged (basic) amino acids

At physiological pH (7.4), these side chains carry a positive charge, enabling electrostatic interactions with negatively charged molecules (DNA, phospholipid head groups, acidic residues on receptor surfaces).

Lysine (Lys, K): Long, flexible side chain with a terminal amino group. Essential. Site of acetylation, methylation, and ubiquitination.

Arginine (Arg, R): Contains a guanidinium group that is almost always protonated. Conditionally essential. Found at cell-penetrating peptide surfaces. The arginine-rich motifs in HIV TAT peptide and other cell-penetrating sequences exploit the positive charge for membrane interaction.

Histidine (His, H): Contains an imidazole ring with a pKa near physiological pH (~6.0), meaning it can switch between protonated (charged) and deprotonated (uncharged) states in biological conditions. Essential. This pH sensitivity makes histidine critical for enzyme active sites and for peptides whose activity varies with pH.

Negatively charged (acidic) amino acids

At physiological pH, these carry a negative charge on their side chains.

Aspartic acid (Asp, D): Short side chain with a carboxyl group. Nonessential. Common in enzyme active sites and metal-binding domains.

Glutamic acid (Glu, E): Longer version of aspartate. Nonessential. The most common amino acid in proteins by frequency. Also functions as the brain's primary excitatory neurotransmitter (glutamate).

Essential vs nonessential: what the distinction means

Nine amino acids are classified as essential because humans lack the enzymatic pathways to synthesize them: histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine. These must come from dietary protein.

Six are conditionally essential: arginine, cysteine, glutamine, glycine, proline, and tyrosine. The body can synthesize them under normal conditions but cannot produce enough during illness, injury, or growth phases. The remaining five (alanine, aspartic acid, asparagine, glutamic acid, serine) are always synthesized in sufficient quantities.

For peptide therapeutics, the essential/nonessential distinction is irrelevant to drug design (the peptide is manufactured, not biosynthesized by the patient). But it is relevant to collagen peptide supplements and branched-chain amino acid products, where the goal is to provide specific amino acids that are limiting in the diet.

Beyond the 20: non-natural amino acids in peptide drugs

Modern peptide drug design goes beyond the standard 20 amino acids. Non-natural amino acid substitutions improve peptide stability, receptor selectivity, and pharmacokinetics.

D-amino acids: All standard amino acids are L-form (left-handed chirality). Substituting D-amino acids (right-handed) at protease-sensitive positions makes the peptide resistant to enzymatic degradation because proteases evolved to recognize L-amino acid substrates. Mendes et al. (2026) demonstrated that D-amino acid substitution combined with cyclization enhances antimicrobial peptide stability against serum proteases while maintaining biological activity.[1] Zhao et al. (2026) used AI to predict optimal D-amino acid substitution positions, automating what was previously a labor-intensive empirical process.[2]

For dedicated coverage, see D-Amino Acid Peptides: Mirror-Image Molecules That Resist Degradation.

Alpha-methylated amino acids: Adding a methyl group to the alpha carbon increases resistance to proteolysis and can improve helical stability. Semaglutide contains an alpha-aminoisobutyric acid (Aib) substitution at position 8, which is a key modification extending its half-life.

Non-natural side chains: Amino acids with side chains not found in nature can be synthesized to achieve specific binding properties. Zhou et al. (2026) comprehensively reviewed non-natural amino acid modifications in antibacterial peptides, documenting how different substitution strategies affect activity against resistant bacteria.[3]

Unnatural amino acids in linkers: Gorzen et al. (2025) engineered unnatural amino acids into peptide linkers for peptide-drug conjugates, enabling cathepsin-selective cleavage that releases the drug payload only inside target cells.[4]

Why amino acid sequence determines peptide function

A peptide's biological activity is determined by its amino acid sequence (primary structure), which dictates its three-dimensional shape, which dictates its receptor binding. Change one amino acid in a bioactive peptide and you can abolish activity, enhance it, or redirect it to a different receptor entirely.

Oxytocin vs vasopressin: These two peptide hormones are each 9 amino acids long and differ at only 2 positions (positions 3 and 8). Oxytocin has isoleucine at position 3 and leucine at position 8; vasopressin has phenylalanine and arginine. This two-amino-acid difference is enough to direct oxytocin to the oxytocin receptor (controlling uterine contractions and social bonding) and vasopressin to the V1/V2 receptors (controlling blood pressure and kidney water retention). Two amino acids, entirely different biology.

Semaglutide vs native GLP-1: Native GLP-1 (7-36 amide) has a half-life of 2 minutes because DPP-4 cleaves between positions 8 and 9. Semaglutide modifies position 8 (Ala to Aib) to block DPP-4, attaches a C-18 fatty acid to lysine at position 26 for albumin binding, and substitutes position 34 (Lys to Arg) to prevent fatty acid attachment at the wrong site. Three amino acid-level changes transformed a 2-minute peptide into a once-weekly injectable.

These examples illustrate why peptide drug design is fundamentally amino acid engineering. Every modification is a decision about which of the 20 (or non-natural) building blocks to place at each position, and what that choice does to stability, binding, and function.

The Bottom Line

The 20 standard amino acids are the alphabet of peptide biology. Their side chains (nonpolar, polar, positive, negative) determine how peptides fold, bind receptors, and survive in the body. Nine are essential and must come from food; the rest are synthesized internally. Modern peptide drug design extends beyond the standard 20 by incorporating D-amino acids, alpha-methylated residues, and entirely synthetic side chains to overcome the natural limitations of L-amino acid peptides. Understanding these building blocks is the first step in understanding how every peptide therapeutic, from insulin to semaglutide to BPC-157, achieves its biological effect.

Frequently Asked Questions