Press Archive

Decoding the Human Immune System

SDSC, Vanderbilt University Collaborate in Human Vaccines Project Study

Published February 13, 2019

For the first time ever, researchers are comprehensively sequencing the human immune system, which is billions of times larger than the human genome. In a new study published in the February 13 online issue of Nature from the Human Vaccines Project, scientists have sequenced a key part of this vast and mysterious system: the genes encoding the circulating B cell receptor repertoire.

In sequencing these receptors in both adults and infants, the scientists found surprising overlaps that could provide potential new antibody targets for vaccines and therapeutics that work across populations. As part of a large multi-year initiative, this work seeks to define the genetic underpinnings of people’s ability to respond and adapt to an immense range of disease.

Led by scientists at Vanderbilt University Medical Center and the San Diego Supercomputer Center (SDSC) at UC San Diego, this advancement is possible due to the merging of biological research with high-powered frontier supercomputing. While the Human Genome Project sequenced the human genome and led to the development of novel genomics tools, it did not tackle the size and complexity of the human immune system.

“A continuing challenge in the human immunology and vaccine development fields has been that we do not have comprehensive reference data for what the normal healthy human immune system looks like,” said James E. Crowe, Jr. Director of the Vanderbilt Vaccine Center of Vanderbilt University Medical Center, and senior author on the new paper. “Prior to the current era, people assumed it would be impossible to do such a project because the immune system is theoretically so large, but this new paper shows it is possible to define a large portion, because the size of each person’s B cell receptor repertoire is unexpectedly small.”

The new study specifically looks at one part of the adaptive immune system, the circulating B cell receptors that are responsible for the production of antibodies that are considered the main determinant of immunity in people. The receptors randomly select and join gene segments, forming unique sequences of nucleotides known as receptor ‘clonotypes.’ In this way, a small number of genes can lead to an incredible diversity of receptors, allowing the immune system to recognize almost any new pathogen.

Conducting leukapheresis on three individual adults, the researchers cloned and sequenced up to 40 billion cells to sequence the combinations of gene segments that comprise the circulating B cell receptors, achieving a depth of sequencing never before done. They also sequenced umbilical cord blood from three infants. The idea was to collect a vast amount of data on a few individuals, rather than the traditional model of collecting only a few points of data on many.

“The overlap in antibody sequences between individuals was unexpectedly high,” Crowe explained, “even showing some identical antibody sequences between adults and babies at the time of birth.” Understanding this commonality is key to identifying antibodies that can be targets for vaccines and treatments that work more universally across populations.

A central question was whether the shared sequences across individuals were the result of chance, rather than the result of some shared common biological or environmental factor. To address this issue, the researchers developed a synthetic B cell receptor repertoire and found that “the overlap observed experimentally was significantly greater than what would be expected by chance,” said Robert Sinkovits, director of scientific computing applications at SDSC.

As part of a unique consortium created by the Human Vaccines Project, SDSC applied its considerable computing power to working with the multiple terabytes of data. A central tenet of the Project is the merger of biomedicine and advanced computing.

“The Human Vaccines Project allows us to study problems at a larger scale than would be normally possible in a single lab and it also brings together groups that might not normally collaborate,” according to Sinkovits.

Continued collaborative work is now under way to expand this study, including sequencing other areas of the adaptive immune system, the T cell repertoire; adding additional demographics such as supercentenarians and international populations; and applying AI-driven algorithms to further mine the datasets for insights. The goal is to continue to interrogate the shared components of the immune system to develop safer and highly targeted vaccines and immunotherapies that work across populations.

“Due to recent technological advances, we now have an unprecedented opportunity to harness the power of the human immune system to fundamentally transform human health,” said Wayne Koff, CEO of the Human Vaccines Project. “Decoding the human immune system is central to tackling the global challenges of infectious and non-communicable diseases, from cancer to Alzheimer’s to pandemic influenza. This study marks a key step toward understanding how the human immune system works, setting the stage for developing next-generation health products through the convergence of genomics and immune monitoring technologies with machine learning and artificial intelligence.”

The paper, called ‘High Frequency of Shared Clonotypes in Human B Cell Receptor Repertoires’, will also appear in the Feb. 21, 2019, print issue of Nature. The work was supported by a grant from the Human Vaccines Project, and institutional funding from Vanderbilt University Medical Center.

About the San Diego Supercomputer Center (SDSC)

As an Organized Research Unit of UC San Diego, SDSC is considered a leader in data-intensive computing and cyberinfrastructure, providing resources, services, and expertise to the national research community, including industry and academia. SDSC supports hundreds of multidisciplinary programs spanning a wide variety of domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. SDSC’s petascale Comet supercomputer is a key resource within the National Science Foundation’s XSEDE (eXtreme Science and Engineering Discovery Environment) program.