Analysis of largest set of genomes from pregnant women reveals genetic links to disease, birth outcomes

Analysis of largest set of genomes from pregnant women reveals genetic links to disease, birth outcomes

Analysis of the world's largest set of genome data from pregnant women, totaling 141,431 expectant mothers from across China, has uncovered unsuspected associations between genes and birth outcomes, including the birth of twins and a woman's age at first pregnancy.

The analysis also allowed researchers to reconstruct the recent movement and intermarriage of different ethnic groups in China, and promises to help identify genes that make people susceptible to infectious diseases.

"It's amazing that this is even possible -- that you can take these massive samples and do association mapping to see what the genetic variants are that explain human traits," said co-author.

It's even more amazing because the researchers sequenced, on average, only 10 percent of each mother's genome, relying on large numbers of poor-quality genomes so as to leverage cheaper tests to discover new genetic links.

The mothers-to-be had provided blood samples to be tested for fetal chromosomal abnormalities, primarily Down syndrome. This technique, called cell-free fetal DNA testing, a form of non-invasive prenatal testing, is possible because mothers have DNA from their unborn child floating in their bloodstream. With rapid shotgun sequencing, labs can break up all the free-floating DNA in the blood and sequence just enough of the bits to diagnose Down syndrome.

Though not yet widespread in the United States, non-invasive prenatal testing is common in China: 70 percent of such tests worldwide have been performed in China. Sampling the mother's blood can be done early and risk-free, whereas standard prenatal testing in the U.S. involves amniocentesis or chorionic villus sampling, both of which require obtaining fetal cells from inside the uterus and risk harming the unborn child.

BGI was paid by maternity hospitals to conduct these tests, but obtained informed consent from each mother to also analyze the partially sequenced genomes for research purposes, maintaining anonymity. All the analyses were performed in China and the data is hosted in the China National GeneBank.

The data analysis revealed, for example, that variation in a gene called NRG1 is linked to a greater or lesser incidence of twins. One variant of the gene is more common in mothers with twins and is associated with hyperthyroidism, tightening a link between thyroid function and twinning that had previously been seen in mice.

A variant of another gene, EMB, was associated with older first-time mothers.

The analysis also pulled out several genes that had not previously been associated with height and body mass index.

Perhaps most interesting is what sequencing of all the DNA in maternal blood tells us about viruses circulating through the body, and thus the link between viruses and genes that determine susceptibility to disease.

A variation in one gene, for example, was associated with a higher concentration of herpesvirus 6 in a mother's blood. Herpesvirus 6 is the most common cause of the relatively benign baby rash called roseola, but a high "viral load" correlates with more severe symptoms. People with Alzheimer's disease also have higher levels of herpesvirus 6 in their brains.

"Most people are infected by herpesvirus 6 at some point in their life, but some people seem to be less affected than others. We have now found a human genetic variant that helps control the severity of the infection," author said. "This is quite interesting because we don't know much about the genetic variants that control why some people seem more susceptible to viral infection and not others."

More correlations remain to be discovered. The BGI team to date has sequenced genomes from more than 3 million pregnant women, much of it accompanied by information on the mothers' and babies' health that can be used to find genetic associations.

"If you have these genotypes and compare them to phenotypes, that is, something you can measure, you can find genetic variants that explain human traits," said the study's lead author.

The initial findings from the analysis are published in the journal Cell.