🔬Biological Chemistry I Unit 13 – RNA Transcription and Processing

RNA transcription and processing are fundamental processes in gene expression. They involve the synthesis of RNA molecules from DNA templates and their subsequent modifications. These steps are crucial for converting genetic information into functional RNA molecules that play diverse roles in cellular functions. Understanding RNA transcription and processing is essential for grasping how genes are expressed and regulated. This knowledge forms the basis for comprehending various biological processes, genetic disorders, and the development of RNA-based therapeutics. It also provides insights into the complexity and versatility of RNA in cellular functions.

RNA Basics

  • RNA (ribonucleic acid) consists of a single-stranded molecule composed of nucleotides
    • Each nucleotide contains a ribose sugar, a phosphate group, and one of four nitrogenous bases (adenine, guanine, cytosine, or uracil)
  • RNA plays a crucial role in the central dogma of molecular biology, acting as a messenger between DNA and proteins
  • RNA is synthesized from a DNA template through the process of transcription, which is carried out by RNA polymerases
  • Unlike DNA, RNA contains the sugar ribose instead of deoxyribose and the base uracil (U) instead of thymine (T)
  • RNA molecules are typically shorter than DNA molecules and are more prone to degradation due to the presence of the 2'-OH group on the ribose sugar
  • RNA can form secondary structures, such as hairpins and loops, through base pairing interactions (A-U and G-C)
  • The three main types of RNA are messenger RNA (mRNA), transfer RNA (tRNA), and ribosomal RNA (rRNA), each with distinct functions in gene expression

Transcription Initiation

  • Transcription initiation is the first step in the synthesis of RNA from a DNA template, which occurs in the nucleus of eukaryotic cells
  • RNA polymerase recognizes and binds to specific DNA sequences called promoters, which are located upstream of the gene to be transcribed
    • Promoters contain consensus sequences, such as the TATA box and the initiator element, that facilitate the binding of transcription factors and RNA polymerase
  • Transcription factors, such as TFIID and TFIIB, assemble at the promoter region to form the pre-initiation complex (PIC)
  • The PIC helps to recruit and position RNA polymerase at the transcription start site (TSS), which is typically located a short distance downstream of the promoter
  • RNA polymerase unwinds the double-stranded DNA template, exposing the template strand for RNA synthesis
  • The first nucleotide to be incorporated into the growing RNA chain is usually a purine (adenine or guanine) and is determined by the sequence of the DNA template
  • Once the first few nucleotides are synthesized, RNA polymerase clears the promoter region and enters the elongation phase of transcription

Elongation and Termination

  • During elongation, RNA polymerase moves along the DNA template in the 3' to 5' direction, synthesizing the complementary RNA strand in the 5' to 3' direction
  • RNA polymerase catalyzes the formation of a phosphodiester bond between the 3'-OH group of the growing RNA chain and the 5'-phosphate group of the incoming nucleotide triphosphate (NTP)
  • The elongation process continues until RNA polymerase encounters a termination signal, which can be either a sequence-dependent or a factor-dependent signal
    • Sequence-dependent termination involves the formation of a stable hairpin structure followed by a series of uracil residues, causing RNA polymerase to stall and dissociate from the DNA template
    • Factor-dependent termination requires the action of termination factors, such as Rho in prokaryotes, which bind to the nascent RNA and cause RNA polymerase to dissociate
  • Once transcription is terminated, the newly synthesized RNA molecule is released from the DNA template and undergoes further processing steps
  • The elongation rate of RNA polymerase varies depending on the organism and the specific gene being transcribed, but it typically ranges from 20 to 80 nucleotides per second in eukaryotes

Post-Transcriptional Modifications

  • After transcription, the newly synthesized RNA molecule undergoes various post-transcriptional modifications that alter its structure, stability, and function
  • One of the most common modifications is the addition of a 5' cap structure, which consists of a modified guanosine nucleotide attached to the 5' end of the RNA via a 5'-5' triphosphate linkage
    • The 5' cap protects the RNA from degradation by 5' exonucleases and facilitates its translation by eukaryotic ribosomes
  • Another important modification is the addition of a poly(A) tail to the 3' end of the RNA, which involves the enzymatic addition of a series of adenine nucleotides
    • The poly(A) tail enhances RNA stability, facilitates nuclear export, and promotes translation efficiency
  • RNA can also undergo base modifications, such as the conversion of adenosine to inosine (A-to-I editing) or cytidine to uridine (C-to-U editing), which can alter the coding potential or the stability of the RNA
  • Some RNA molecules, particularly tRNAs and rRNAs, undergo extensive post-transcriptional modifications, including methylation, pseudouridylation, and the addition of various chemical groups
  • These modifications can influence the structure, function, and interactions of the RNA molecules with other cellular components
  • Post-transcriptional modifications are carried out by a variety of enzymes, such as methyltransferases, pseudouridine synthases, and RNA editing enzymes, which recognize specific sequences or structural motifs in the RNA

RNA Processing and Splicing

  • In eukaryotes, most protein-coding genes are initially transcribed as precursor mRNAs (pre-mRNAs) that contain both exons (coding sequences) and introns (non-coding sequences)
  • To generate mature mRNAs, the introns must be removed, and the exons must be joined together through a process called RNA splicing
  • Splicing is carried out by the spliceosome, a large ribonucleoprotein complex composed of five small nuclear RNAs (snRNAs) and numerous protein factors
    • The spliceosome assembles on the pre-mRNA and catalyzes two transesterification reactions that excise the intron and ligate the flanking exons
  • Splicing requires the recognition of specific sequences at the intron-exon boundaries, known as the 5' splice site (GU) and the 3' splice site (AG), as well as a branch point sequence located within the intron
  • Alternative splicing is a mechanism by which different combinations of exons can be included or excluded from the final mRNA product, allowing a single gene to encode multiple protein isoforms
    • Alternative splicing is regulated by cis-acting regulatory elements (exonic splicing enhancers and silencers) and trans-acting factors (splicing regulators) that influence spliceosome assembly and splice site selection
  • RNA editing, such as A-to-I editing, can also occur in conjunction with splicing, further increasing the diversity of the transcriptome
  • Defects in RNA splicing can lead to various genetic disorders, such as spinal muscular atrophy and retinitis pigmentosa, highlighting the importance of accurate and efficient splicing for normal cellular function

Types of RNA and Their Functions

  • Messenger RNA (mRNA) serves as the template for protein synthesis, carrying the genetic information from DNA to the ribosomes
    • mRNAs contain a 5' cap, a coding sequence (CDS) flanked by untranslated regions (UTRs), and a 3' poly(A) tail
  • Transfer RNA (tRNA) acts as an adapter molecule that links the genetic code in mRNA to the amino acid sequence of proteins
    • tRNAs have a cloverleaf secondary structure and a 3' CCA end that is aminoacylated with a specific amino acid
    • The anticodon loop of tRNA base-pairs with the corresponding codon in mRNA during translation
  • Ribosomal RNA (rRNA) is a structural and catalytic component of ribosomes, the cellular machinery responsible for protein synthesis
    • Eukaryotic ribosomes contain four rRNA molecules (28S, 18S, 5.8S, and 5S) that, together with numerous ribosomal proteins, form the 60S and 40S ribosomal subunits
  • Small nuclear RNAs (snRNAs) are involved in the splicing of pre-mRNAs, forming the core components of the spliceosome (U1, U2, U4, U5, and U6 snRNAs)
  • Small nucleolar RNAs (snoRNAs) guide the post-transcriptional modification of rRNAs and other non-coding RNAs, such as methylation and pseudouridylation
  • MicroRNAs (miRNAs) are small non-coding RNAs that regulate gene expression post-transcriptionally by base-pairing with complementary sequences in target mRNAs, leading to translational repression or mRNA degradation
  • Long non-coding RNAs (lncRNAs) are a diverse class of RNA molecules (>200 nucleotides) that participate in various cellular processes, such as transcriptional regulation, chromatin remodeling, and RNA processing

Regulation of RNA Transcription

  • The regulation of RNA transcription is crucial for controlling gene expression and ensuring the proper development and function of cells and organisms
  • Transcriptional regulation is mediated by the interplay between cis-acting regulatory elements (promoters and enhancers) and trans-acting factors (transcription factors and chromatin modifiers)
  • Promoters contain specific DNA sequences, such as the TATA box and the initiator element, that recruit the basal transcription machinery and determine the transcription start site
  • Enhancers are distal regulatory elements that can activate transcription from promoters through long-range interactions mediated by chromatin looping
    • Enhancers contain binding sites for tissue-specific transcription factors that modulate gene expression in a cell type-dependent manner
  • Transcription factors are proteins that bind to specific DNA sequences and regulate the activity of RNA polymerase II
    • Activators promote transcription by recruiting coactivators and chromatin remodeling complexes, while repressors inhibit transcription by blocking the binding of activators or recruiting corepressors
  • Chromatin structure plays a critical role in transcriptional regulation, with open chromatin (euchromatin) being associated with active transcription and closed chromatin (heterochromatin) being associated with gene silencing
    • Histone modifications, such as acetylation and methylation, and DNA methylation are epigenetic marks that influence chromatin accessibility and transcription factor binding
  • Transcriptional regulation can also be modulated by external signals, such as hormones, growth factors, and environmental stresses, which activate signal transduction pathways that converge on transcription factors
  • Non-coding RNAs, such as miRNAs and lncRNAs, can also regulate transcription by targeting transcription factors or modifying chromatin structure
  • Dysregulation of transcriptional control is associated with various diseases, including cancer, developmental disorders, and metabolic syndromes, emphasizing the importance of precise gene regulation for maintaining cellular homeostasis

RNA in Disease and Therapeutics

  • Alterations in RNA metabolism, such as splicing defects, RNA editing abnormalities, and changes in RNA stability, can contribute to the development and progression of various diseases
  • Mutations in splicing regulatory elements or splicing factors can lead to aberrant splicing patterns, resulting in the production of non-functional or deleterious protein isoforms
    • Spinal muscular atrophy is caused by mutations in the SMN1 gene that impair the splicing of SMN2, leading to reduced levels of functional SMN protein
    • Retinitis pigmentosa can be caused by mutations in splicing factors, such as PRPF31 and PRPF8, leading to defects in the splicing of retinal genes
  • Alterations in RNA editing, particularly A-to-I editing, have been implicated in various neurological disorders, such as amyotrophic lateral sclerosis (ALS) and Alzheimer's disease
    • Reduced A-to-I editing of the glutamate receptor subunit GluA2 has been observed in ALS patients, potentially contributing to neuronal excitotoxicity
  • Changes in RNA stability, mediated by mutations in RNA-binding proteins or alterations in miRNA expression, can lead to the accumulation of toxic RNA species or the depletion of essential proteins
    • Myotonic dystrophy is caused by the expansion of CUG or CCUG repeats in non-coding regions of the DMPK or CNBP genes, respectively, leading to the sequestration of RNA-binding proteins and widespread RNA processing defects
  • RNA-based therapeutics have emerged as a promising approach for treating genetic diseases and cancer by modulating RNA metabolism or gene expression
    • Antisense oligonucleotides (ASOs) are short, synthetic nucleic acids that can bind to complementary sequences in target RNAs, promoting their degradation or modulating their splicing
      • Nusinersen is an FDA-approved ASO for the treatment of spinal muscular atrophy, which promotes the inclusion of exon 7 in SMN2 mRNA, increasing the production of functional SMN protein
    • Small interfering RNAs (siRNAs) are double-stranded RNA molecules that can silence gene expression by triggering the degradation of complementary mRNAs through the RNA interference (RNAi) pathway
      • Patisiran is an FDA-approved siRNA-based drug for the treatment of hereditary transthyretin-mediated amyloidosis, which reduces the production of mutant transthyretin protein by targeting its mRNA
  • RNA-based vaccines, such as mRNA vaccines, have gained attention as a rapid and flexible platform for developing vaccines against infectious diseases, as demonstrated by the COVID-19 mRNA vaccines developed by Pfizer-BioNTech and Moderna
  • The growing understanding of RNA biology and the development of novel RNA-targeting technologies hold promise for the discovery of new therapeutic strategies for a wide range of diseases


© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.

© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.