What is an amino acid sequence called?

The term amino acid has entered everyday language, often in the marketing of food products. You may have seen lists of foods with “high amounts of amino acids,” such as eggs or quinoa. Those labels refer to a fundamental component of your proteins, one that is not static but functional on a daily basis. All of the information for encoding the amino acids in your proteins is contained within your genome. There are 20 standard amino acids, and they link together in molecular chains called polypeptides, which make up proteins. Imagine each amino acid as a pearl strung together with other pearls in a long necklace.

Each protein or peptide consists of a linear sequence of amino acids. By convention, the primary structure of a protein is written from the amino-terminal (N) end to the carboxyl-terminal (C) end. The primary structure may be determined by sequencing the protein directly or inferred from the DNA sequence that encodes it.
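As a rough illustration of inferring a protein sequence from DNA, the Python sketch below translates a coding-strand sequence with the standard genetic code. The function name `translate` and the example sequence are made up for illustration, and the sketch assumes the input starts in frame and contains only the bases A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed for codons in T, C, A, G order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame coding sequence, N-terminus first, until a stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break  # stop codon: the polypeptide ends here
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCCAAATAA"))  # MAK
```

The output is written in one-letter code from the N-terminus to the C-terminus, matching the convention described above.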

The amino acid sequence of a protein or peptide is useful information to understand the protein or peptide, identify it in a sample and categorize its post-translational modifications. The process of determining the amino acid sequence is known as protein sequencing.

Notation

The sequence of a protein is usually written as a string of letters, following the order of the amino acids from the amino-terminal to the carboxyl-terminal end of the protein. Either a one-letter or a three-letter code may be used to represent each amino acid in the sequence.

There are 20 standard amino acids that occur naturally in proteins. Each can be represented by a three-letter or one-letter code as follows:

  • Alanine (Ala, A)
  • Arginine (Arg, R)
  • Asparagine (Asn, N)
  • Aspartic acid (Asp, D)
  • Cysteine (Cys, C)
  • Glutamic acid (Glu, E)
  • Glutamine (Gln, Q)
  • Glycine (Gly, G)
  • Histidine (His, H)
  • Isoleucine (Ile, I)
  • Leucine (Leu, L)
  • Lysine (Lys, K)
  • Methionine (Met, M)
  • Phenylalanine (Phe, F)
  • Proline (Pro, P)
  • Serine (Ser, S)
  • Threonine (Thr, T)
  • Tryptophan (Trp, W)
  • Tyrosine (Tyr, Y)
  • Valine (Val, V)
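The two notations in the list above can be converted into each other with a simple lookup table. The following Python sketch shows one way to do this; the function names and the hyphen separator for three-letter codes are illustrative choices, not a standard.

```python
# Three-letter to one-letter codes for the 20 standard amino acids listed above.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Glu": "E", "Gln": "Q", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}
ONE_TO_THREE = {one: three for three, one in THREE_TO_ONE.items()}


def to_one_letter(three_letter_sequence, sep="-"):
    """Convert e.g. 'Met-Lys-Thr' to 'MKT' (N-terminus first)."""
    return "".join(THREE_TO_ONE[code] for code in three_letter_sequence.split(sep))


def to_three_letter(one_letter_sequence, sep="-"):
    """Convert e.g. 'MKT' to 'Met-Lys-Thr' (N-terminus first)."""
    return sep.join(ONE_TO_THREE[code] for code in one_letter_sequence)


print(to_one_letter("Met-Lys-Thr"))  # MKT
print(to_three_letter("MKT"))        # Met-Lys-Thr
```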

Methods of Protein Sequencing

There are two main methods used to find the amino acid sequences of proteins. Mass spectrometry is the most common method today because it is easy to use. Edman degradation using a protein sequenator is the second method, and it is most useful when the N-terminus of a protein needs to be characterized.

It is helpful to know which amino acid is at the N-terminus of the protein, both for ordering the peptide fragments into the whole chain and for reducing the impact of impurities that commonly occur in the first round of Edman degradation. The N-terminus can be identified by:

  1. Using a reagent to label the amino acid at the N-terminal end of the protein.
  2. Hydrolyzing the protein.
  3. Using chromatography and other methods of comparison to identify the labeled amino acid.

There are fewer methods that can practically be used to identify the C-terminus of the protein. However, one method that may be used involves adding carboxypeptidases to a solution of the protein and taking regular samples. Plotting the concentration of amino acids against time can help to identify the amino acid at the C-terminus.
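The time-course reasoning above can be sketched in Python. Because carboxypeptidases remove residues from the C-terminal end one at a time, the amino acid that accumulates fastest in the earliest sample is the most likely C-terminal residue. The function name and the concentration values below are invented purely for illustration; real data would also need replicate samples and attention to enzyme specificity.

```python
def rank_by_early_release(time_course):
    """Rank amino acids by their concentration in the earliest sample.

    time_course maps a one-letter code to a list of concentrations,
    one per sampling time, earliest first. The first entry of the
    returned list is the most likely C-terminal residue.
    """
    return sorted(time_course, key=lambda aa: time_course[aa][0], reverse=True)


# Hypothetical relative concentrations at three sampling times (not real data).
samples = {
    "F": [0.05, 0.30, 0.60],
    "K": [0.80, 1.00, 1.00],
    "V": [0.30, 0.70, 1.00],
}

print(rank_by_early_release(samples))  # ['K', 'V', 'F']
```

In this made-up example lysine appears first, followed by valine and phenylalanine, which would be consistent with a chain ending in ...F-V-K.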

Edman degradation allows the sequence of amino acids in a protein to be determined with Edman sequencers, which can currently sequence peptides up to about 50 amino acids in length. Sequencing a whole protein involves several steps:

  1. Use a reducing agent to break any disulfide bridges in the protein.
  2. Separate the chain(s) of the protein complex and purify them.
  3. Determine the composition and terminal amino acids of each chain.
  4. Break each chain into small fragments (fewer than 50 amino acids each).
  5. Separate the fragments and purify them.
  6. Sequence each fragment to determine its amino acid sequence.
  7. Repeat the preceding steps with a different fragmentation pattern so that the overall protein sequence can be reconstructed with minimal errors.
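The final step relies on the fact that fragments from two different cleavage patterns overlap, so the overlaps reveal the order of the fragments. The Python sketch below illustrates the idea with a simple greedy merge. It is a heuristic under idealized assumptions (error-free fragments, unambiguous overlaps), and the function names and the example peptide are hypothetical.

```python
def overlap(a, b):
    """Length of the longest suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def assemble(fragments):
    """Greedily merge sequenced fragments by their largest overlaps."""
    unique = sorted(set(fragments))
    # Fragments fully contained in a longer fragment add no ordering information.
    frags = [f for f in unique if not any(f != g and f in g for g in unique)]
    while len(frags) > 1:
        best_k, best_a, best_b = 0, None, None
        for a in frags:
            for b in frags:
                if a is not b:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_a, best_b = k, a, b
        if best_k == 0:
            break  # no overlaps left; the remaining pieces cannot be ordered
        frags.remove(best_a)
        frags.remove(best_b)
        frags.append(best_a + best_b[best_k:])
    return frags


# Hypothetical fragments of one chain from two different cleavage patterns.
digest_1 = ["MK", "TAYIAK", "QR", "QISFVK"]
digest_2 = ["MKTAY", "IAKQRQISF", "VK"]

print(assemble(digest_1 + digest_2))  # ['MKTAYIAKQRQISFVK']
```

If the fragments cannot be joined into a single chain, the function returns several pieces, which signals that another fragmentation pattern is needed.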

Amino Acid Composition and Analysis

The unordered amino acid composition of a protein is often useful information when attempting to determine its ordered sequence, because it can help to identify errors and interpret ambiguous results. The frequency of each amino acid can also help in choosing the protease that is most appropriate for digesting the protein.
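As a small illustration of how composition data can be used, the Python sketch below counts the residues in a candidate sequence and counts potential trypsin cleavage sites. The function names and the example sequence are hypothetical. The trypsin rule used here (cleavage after lysine or arginine, except before proline) is a common simplification.

```python
from collections import Counter


def composition(sequence):
    """Unordered amino acid composition: residue -> number of occurrences."""
    return Counter(sequence.upper())


def count_trypsin_sites(sequence):
    """Count K or R residues that are not last and not followed by P."""
    seq = sequence.upper()
    return sum(
        1
        for i, residue in enumerate(seq[:-1])
        if residue in "KR" and seq[i + 1] != "P"
    )


candidate = "MKTAYIAKQRQISFVK"  # hypothetical reconstructed sequence
print(composition(candidate)["K"])     # 3
print(count_trypsin_sites(candidate))  # 3
```

Comparing `composition(candidate)` with the measured composition of the protein gives a quick consistency check: a mismatch in any count suggests an error in the reconstructed sequence.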

The frequency of amino acids is determined in a two-step process known as amino acid analysis. First, a known quantity of the protein is hydrolyzed to break it up into its amino acid monomers. Second, the amino acids are separated and quantified using various methods.

The hydrolysis is typically carried out by heating a sample of the protein to over 100°C in hydrochloric acid for an extended period of time (at least 24 hours), allowing more time for proteins with bulky hydrophobic groups. Some amino acids are at risk of degradation under these conditions, particularly cysteine, glutamine, serine, threonine, tryptophan, and tyrosine, so it is recommended to use several samples and to heat them for different times. Once the protein is hydrolyzed, the amino acids can be separated and identified with techniques such as ion-exchange chromatography or reverse-phase HPLC.

Last Updated: Feb 26, 2019

Written by

Yolanda Smith

Yolanda graduated with a Bachelor of Pharmacy at the University of South Australia and has experience working in both Australia and Italy. She is passionate about how medicine, diet and lifestyle affect our health and enjoys helping people understand this. In her spare time she loves to explore the world and learn about new cultures and languages.

Suggested Reading

What is the sequence of amino acids in a protein called?

The sequence of amino acids in a protein is its primary structure. Four levels of structure determine a protein's final form: primary, secondary, tertiary, and quaternary.

What is an amino acid also called?

The term amino acid is short for α-amino [alpha-amino] carboxylic acid. Each molecule contains a central carbon (C) atom, called the α-carbon, to which both an amino and a carboxyl group are attached.
