This study, published in the journal Nature Genetics, examines the probability that any given nucleotide in the human genome -- one of the four letters (A, C, G or T, for adenine, cytosine, guanine or thymine) of the DNA alphabet -- is changed. The researchers focused on the simplest type of mutation, a "point" mutation, in which a single letter is changed in a given sequence. Most of these changes -- often called single nucleotide polymorphisms (SNPs), or "snips" -- are not harmful to the functioning of the human body. Nevertheless, the researchers examined why some sequences are more prone to mutate than others.
"The crux of the paper examines the dependency of mutation rate on which nucleotides are one, two, or three bases away from either side of a SNP," the author said. "We already know about one situation in which this placement matters: DNA sequences in the genome where methyl groups are attached to the cytosine nucleotide, also known as CpG sites, are hotspots for mutation. But are there other types of local sequences that matter beyond these?"
To address this question, they devised a mathematical model applicable to SNP data found in humans. Their approach took advantage of publicly available data from thousands of human subjects sampled from across the globe, namely from the 1000 Genomes Project. These individuals were sequenced as part of an international initiative to characterize the genetic variation that naturally occurs in human populations.
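As a rough illustration of what it means to relate SNPs to the nucleotides up to three bases away on either side, the sketch below tabulates, for each seven-letter window, the fraction of its occurrences whose central site carries a SNP. It is a simplified stand-in and not the authors' model: the function names, the `FLANK` constant and the toy sequence and SNP positions are all invented for illustration, and a real analysis would also distinguish which letter the site changes to.

```python
from collections import Counter

FLANK = 3  # three bases on each side of the focal site, seven in total


def site_contexts(sequence, flank=FLANK):
    """Yield (position, context) for each site that has a full flank on both sides."""
    seq = sequence.upper()
    for pos in range(flank, len(seq) - flank):
        context = seq[pos - flank:pos + flank + 1]
        # Skip windows containing ambiguous bases such as N.
        if set(context) <= set("ACGT"):
            yield pos, context


def snp_fraction_by_context(sequence, snp_positions, flank=FLANK):
    """Return, for each context, the fraction of its occurrences that carry a SNP."""
    snps = set(snp_positions)
    seen = Counter()
    polymorphic = Counter()
    for pos, context in site_contexts(sequence, flank):
        seen[context] += 1
        if pos in snps:
            polymorphic[context] += 1
    return {context: polymorphic[context] / seen[context] for context in seen}


# Toy input, invented for illustration only (0-based SNP positions).
toy_sequence = "ACGTACGTACGT"
toy_snps = [3, 5]
fractions = snp_fraction_by_context(toy_sequence, toy_snps)
```

Contexts whose fraction is well above the genome-wide average would be candidate mutation-prone sequences under this simplified view.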
What they found was surprising: knowing the three nucleotides flanking a given SNP on each side, for a total of seven nucleotides including the SNP itself, predicted up to 93 percent of the variability in the chance of finding a SNP in a given sequence among the individuals whose genome sequences are in the 1000 Genomes Project database. In addition, their model uncovered several distinctive sequences of local nucleotides that were not previously known to be prone to mutation.
"It turns out there are indeed DNA sequences beyond CpG sites that are also prone to mutation," the author said. "What is not immediately obvious is why. The initial rates and our model need to be investigated more deeply to decipher the basic mechanisms that induce mutation in human genomes."
Another finding called into question the assumption that methylated CpG sites always have the same rate of mutation. "I think it is commonly assumed that all CpG sequences mutate at the same rate, though our results indicate far more variability than we expected," the author said. Using another publicly available database that measured the methylation states at CpG sites across several individuals, the authors found that the frequency with which different sequence contexts were methylated could not fully explain differences in mutation rates at these sites. "This certainly indicates the possibility of additional genetic mutation phenomena at CpG hotspots that change how prone these sites are to mutate, for example how well DNA-repair machinery can correct new mutations that might arise," the author said.
Beyond gleaning clues about the different ways mutations occur, the authors also examined applications of their model to human disease, providing a way to rank which newly discovered mutations identified in clinical genetics studies are most likely to result in disease. Computational predictive measures such as these help prioritize rare or new gene variants discovered in these studies for follow-up investigation.
They focused on a set of autism sequencing studies, looking for genes with an excess of new mutations in children with autism that were not found in their parents. When they applied their model to these data, they found an improvement over existing methods for predicting which rare or new mutations were associated with human disease.
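One way to picture "an excess of new mutations" in a gene is to compare the number observed with the number a mutation-rate model would expect, and rank genes by how surprising the observed count is. The sketch below does this with a Poisson upper-tail probability. The Poisson choice is an assumption made here for illustration and is not described in the article; the function names, gene labels and counts are likewise hypothetical.

```python
import math


def poisson_upper_tail(observed, expected):
    """Return P(X >= observed) for X ~ Poisson(expected)."""
    if observed <= 0:
        return 1.0
    below = sum(
        math.exp(-expected) * expected ** k / math.factorial(k)
        for k in range(observed)
    )
    return max(0.0, 1.0 - below)


def rank_genes_by_excess(observed_counts, expected_counts):
    """Sort genes from most to least surprising excess of new mutations."""
    scored = [
        (gene, poisson_upper_tail(observed_counts[gene], expected_counts[gene]))
        for gene in observed_counts
    ]
    return sorted(scored, key=lambda item: item[1])


# Hypothetical genes and counts, invented for illustration only.
toy_observed = {"GENE_A": 3, "GENE_B": 1}
toy_expected = {"GENE_A": 0.1, "GENE_B": 1.0}
ranking = rank_genes_by_excess(toy_observed, toy_expected)
```

In this toy input, `GENE_A` ranks first because three new mutations are far more than the 0.1 expected, whereas one mutation in `GENE_B` is unremarkable against an expectation of 1.0.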