Researchers discover one million new components of the human genome
MedicalXpress Breaking News-and-Events Feb 13, 2024
Researchers at the University of Toronto's Donnelly Center for Cellular and Biomolecular Research have found nearly one million new exons—stretches of DNA that are expressed in mature RNA—in the human genome.
The findings were published in the journal Genome Research.
There are around 20,000 protein-coding genes in humans that contain approximately 180,000 known internal exons. These protein-coding regions account for only one percent of the entire human genome. The vast majority of what remains is a mystery—aptly referred to as the "dark genome."
"We've started to chip away at the dark genome by finding nearly one million previously unknown exons through a method called exon trapping," said Timothy Hughes, principal investigator on the study and professor and chair of the department of molecular genetics in U of T's Temerty Faculty of Medicine.
"The technique involves an assay with plasmids to find exons in DNA fragments of unknown composition," said Hughes, who holds the Canada Research Chair in decoding gene regulation and the John W. Billes Chair of Medical Research at U of T. "While exon trapping is not widely used anymore, it proved to be effective when used in combination with high-throughput sequencing to scan the entire human genome."
Exons are segments of the genome that can encode proteins to direct tissue development and biological processes within the body. They are considered to be autonomous if they don't require external assistance to splice into a mature RNA transcript, which is then translated into a protein.
The team behind the study was driven to test the exon definition model that guides research in molecular genetics after questioning one of its assumptions—that the accurate removal of non-protein-coding intron regions of the genome is aided by clear and consistent indicators of where exons begin and end. This assumption does not seem to hold in all cases as the splicing of exons does not always go smoothly, sometimes resulting in mature RNA transcripts that contain non-functional components.
"Almost none of the newly discovered exons are found consistently across genomes of different species," said Hughes. "They seem to appear in the human genome mainly due to random mutation and are unlikely to play a significant role in our biology. This is evidence that evolution in humans involves a lot of trial and error—most likely enabled by the vast size of our genome."
It is helpful to document randomly mutated exons within the human genome as their translation could potentially be harmful. Long non-coding RNA exons, which are autonomous but often have no known function, have been connected to the development of cancer. Of the roughly 1.25 million known and unknown exons the team found through exon trapping, almost four percent were long non-coding RNA exons.
In addition, the exons residing within non-coding introns, called pseudoexons, can mutate to make a weak splice site stronger. This results in the exon being included in a mature RNA transcript, potentially leading to disease.
-
Exclusive Write-ups & Webinars by KOLs
-
Daily Quiz by specialty
-
Paid Market Research Surveys
-
Case discussions, News & Journals' summaries