The Human Genome Project was an ambitious initiative to sequence every piece of human DNA. The project drew together collaborators from research institutions around the world, and was finally completed in 2003. Now, over two decades later, the researchers have gone beyond the sequence to present the first comprehensive functional map of genes that are expressed in human cells. The data from this project, published in Cell, ties each gene to its job in the cell, and is the culmination of years of collaboration on the single-cell sequencing method Perturb-seq.
The data is available on the Weissman Lab website for other scientists to use. “It’s a big resource in the way the human genome is a big resource, in that you can go in and do discovery-based research,” said the senior author. “Rather than defining ahead of time what biology you're going to be looking at, you have this map of the genotype-phenotype relationships and you can go in and screen the database without having to do any experiments.”
The screen allowed the researchers to delve into diverse biological questions. They used it to explore the cellular effects of genes with unknown functions, to investigate the response of mitochondria to stress, and to screen for genes that cause chromosomes to be lost or gained, a phenotype that has proved difficult to study in the past.
“I think this dataset is going to enable all sorts of analyses that we haven't even thought up yet by people who come from other parts of biology and suddenly they just have this available to draw on,” said a co-senior author of the paper.
The project takes advantage of the Perturb-seq approach which makes it possible to follow the impact of turning on or off genes with unprecedented depth. The method uses CRISPR/Cas9 genome editing to introduce genetic changes into cells, and then uses single-cell RNA sequencing to capture information about the RNAs that are expressed resulting from a given genetic change. Because RNAs control all aspects of how cells behave, this method can help decode the many cellular effects of genetic changes.
In the new study, the researchers scaled up the method to the entire genome. Using human blood cancer cell lines as well noncancerous cells derived from the retina, they performed Perturb-seq across more than 2.5 million cells, and used the data to build a comprehensive map tying genotypes to phenotypes.
Upon completing the screen, the researchers decided to put their new dataset to use and examine a few biological questions. The first, most obvious application was to look into genes with unknown functions. Because the screen also read out phenotypes of many known genes, the researchers could use the data to compare unknown genes to known ones and look for similar transcriptional outcomes, which could suggest the gene products worked together as part of a larger complex.
The mutation of one gene called C7orf26 in particular stood out. Researchers noticed that genes whose removal led to a similar phenotype were part of a protein complex called Integrator that played a role in creating small nuclear RNAs. The Integrator complex is made up of many smaller subunits – previous studies had suggested 14 individual proteins — and the researchers were able to confirm that C7orf26 made up a fifteenth component of the complex.
They also discovered that the 15 subunits worked together in smaller modules to perform specific functions within the Integrator complex. “Absent this thousand-foot-high view of the situation, it was not so clear that these different modules were so functionally distinct,” said the author.
Another perk of Perturb-seq is that because the assay focuses on single cells, the researchers could use the data to look at more complex phenotypes that become muddied when they are studied together with data from other cells. “We often take all the cells where ‘gene X’ is knocked down and average them together to look at how they changed,” the senior author said. “But sometimes when you knock down a gene, different cells that are losing that same gene behave differently, and that behavior may be missed by the average.”
The researchers found that a subset of genes whose removal led to different outcomes from cell to cell were responsible for chromosome segregation. Their removal was causing cells to lose a chromosome or pick up an extra one, a condition known as aneuploidy. “You couldn't predict what the transcriptional response to losing this gene was because it depended on the secondary effect of what chromosome you gained or lost,” the author said. “We realized we could then turn this around and create this composite phenotype looking for signatures of chromosomes being gained and lost. In this way, we've done the first genome-wide screen for factors that are required for the correct segregation of DNA.”
The researchers also used their dataset to study how mitochondria responded to stress. Mitochondria, which evolved from free-living bacteria, carry 13 genes in their genomes. Within the nuclear DNA, around 1000 genes are somehow related to mitochondrial function. “People have been interested for a long time in how nuclear and mitochondrial DNA are coordinated and regulated in different cellular conditions, especially when a cell is stressed,” another author said.
The researchers found that when they perturbed different mitochondria-related genes, the nuclear genome responded similarly to many different genetic changes. However, the mitochondrial genome responses were much more variable.
“There’s still an open question of why mitochondria still have their own DNA,” said the author. “A big-picture takeaway from our work is that one benefit of having a separate mitochondrial genome might be having localized or very specific genetic regulation in response to different stressors.
“If you have one mitochondria that’s broken, and another one that is broken in a different way, those mitochondria could be responding differentially,” the senior author said.
In the future, the researchers hope to use Perturb-seq on different types of cells besides the cancer cell line they started in. They also hope to continue to explore their map of gene functions, and hope others will do the same. “This really is the culmination of many years of work by the authors and other collaborators, and I’m really pleased to see it continue to succeed and expand,” said the co-senior author.
Tying every human gene to its function using CRISPR technology
- 1,161 views