[ back ]

 

Information Overload

Modern medicine is generating new information at a dizzying pace, but Children’s researchers are developing the tools and techniques to make sense of the flood of data.

By Cyril Manning

Picture this: a vast checkerboard, stretching in every direction to the horizons of your imagination. There are roughly 30,000 spaces in this grid – one for every snippet of chemical information that makes up the human genome. Moment by moment, a scattered, apparently random sample of these spaces begins to glow, each fluorescent beacon burning at a specific intensity. It’s a secret code of sorts, genetic marching orders that tell living cells how to behave.

For years, researchers knew this code existed, but finally it has been cracked, at least in a sense. Today, a scientist can buy a small manufactured chip, called a microarray, containing this grid of genetic pieces. By physically extracting RNA (the chemical messenger that transcribes genetic information) from the nucleus of a cell, dyeing it with a fluorescent compound, and dropping it onto the chip, the scientist can see exactly what signals are being sent. That’s because the fluorescent RNA binds with certain pieces in the grid, lighting them up like microscopic beacons.

Since the mapping of the human genome in the 1990s, the problem for scientists studying genetic disease is no longer seeing these genetic messages, but making sense of them. Recall that immense checkerboard: 30,000 spaces, some of them glowing, each at its own intensity. This is only a snapshot of which genes are being expressed (that is, which ones deliver their encoded messages) at one moment in time in one individual. Now picture thousands of these grids stacked one upon another, snapshots of gene expression over time; then overlay them one on top of the next, one for each individual in a population. There are patterns here; the challenge is deciphering them.

“When we look at how these genes might be interacting with each other to cause disease, there are billions of possible combinations,” says Isaac Kohane, MD, PhD, director of the Children’s Hospital Informatics Program (CHIP). “It’s like the ultimate Rubik’s Cube.” But it’s a puzzle Kohane and others within the program, which uses a wide range of tools, computer power and analytic methods to address various issues in medicine, plan to solve. The CHIP researchers are sifting through vast amounts of health-related data, identifying previously indiscernible patterns in complex systems, and turning those patterns into valuable insights – not just in genetics, but also in basic biology, diagnosis and treatment of disease, and management of public health issues.

This is no small ambition. It requires investigators with expertise in two or more very different fields. “Math and computer science do not come naturally to most biologists,” says Kohane, himself an endocrinologist and computer scientist. “Many CHIP researchers are dually or triply trained in medicine and mathematics or computer science, and that puts us at the center of some very exciting work.”

Mathematics, not microscopes
One example is the research of Alvin Kho, PhD, which focuses on using computer analysis to understand the flawed genetic instructions that can lead to pediatric cancer – specifically, the brain tumor known as medulloblastoma. A mathematician by training, Kho analyzes these genetic signals in ways that cannot be matched by a traditional biologist looking through a microscope. As Kohane puts it, “There’s a whole generation of biologists who can’t do state-of-the-art work because the com- putational aspect of it is out of their reach.”

For over 100 years, biologists have speculated that there is a close correlation between human development and the process of cancer growth, known as tumorigenesis. Although lacking proof, the idea is that tumorigenesis is an instance when normal development goes awry, and cells keep on multiplying. That connection is difficult to test, however, because gene expression can’t be measured in the brains of living humans. Kho is working to show that the two processes are indeed parallel by comparing genes that appear to be related to human brain cancer to genes – in mice – that appear to be related to brain development.

Plugging the human and mouse data into his computer, Kho can tap a few keys and generate a three-dimensional cube that shows each point where a human and mouse gene turns on – each location corresponding to the signal the gene sends and when it is sent.

The analysis is complex, but the basic trend is easy to see: the genetic “beacons” seen at the earliest stage of brain development among mice cluster in roughly the same space as the “beacons” seen in tumorigenesis – suggesting that similar genetic instructions are involved in both processes. Just as important, the gene expression seen in late stages of brain development are largely missing in tumor development, giving more credibility to the idea that normal development includes an “off” switch that is missing in cancer. There is much more investigation to be done, but eventually Kho’s findings could bring other cancer researchers closer to treatment and prevention strategies.

Definitive diagnoses
Cancer is only one of many diseases with at least some genetic component; there are countless fields in which identifying genetic links could help clinicians diagnose disease sooner and more accurately, and even come up with new therapies targeting the specific genes responsible.

But it’s not as simple as it sounds. Single genetic anomalies rarely cause disease; instead, most diseases result from multiple genetic mutations and interactions. And before CHIP scientists can succeed, they need something they can’t derive from an equation: an enormous amount of patient data.

“You can’t study the genetics of a disease without patient data, and getting that data requires a lot of collaboration,” says Ingrid Holm, MD, an endocrinologist and geneticist at Children’s Genomics Center, which works closely with CHIP. “More clinicians are starting to get interested in the genetic side of disease now,” she says. “Children’s has done a lot of research into genetic diseases like muscular dystrophy, and is now starting to get into the study of the genetic factors responsible for diseases like congenital heart disease, asthma and allergy, autistic spectrum disorders, and diabetes.”

Holm helps those clinical researchers set up their studies and figure out what information they need to collect to facilitate useful genetic analysis. In addition, CHIP has developed numerous downloadable and Web-based tools to make it easier for researchers to integrate and interpret genetic information.

One of several clinicians currently working on a large-scale genomics study in collaboration with CHIP is Leonard Rappaport, MD, director of Children’s Developmental Medicine Center, and an expert in autistic spectrum disorders. Rappaport and Kohane hope to develop a genetic model for diagnosing autistic children, because, as Kohane puts it, “Even the best behavioral therapists are increasingly uncertain about their ability to characterize all the particular subclasses of the disorder.”

Autism is no longer regarded as a single developmental disorder, but a spectrum of disorders involving problems with social interaction and communication. Some autistic children function almost normally, while others lack a basic understanding of the world external to themselves.

“There’s strong empirical evidence that different types of autism may have specific prognoses and require specific interventions,” explains Rappaport. By associating specific genetic patterns (genotype) found in patients’ RNA with different autistic behavior and characteristics (phenotype), the collaborators aim to gain a new understanding of the disorder. “This study will help us identify the underlying biology of autism,” says Rappaport. “And will give us a more definitive way to diagnose individual children and tailor treatments specifically to their needs.”

Mapping public health patterns
It will take one to two years for Rappaport to collect enough patient information for a genomic analysis of autistic spectrum disorders. But another CHIP project is showing a real-world payoff right now. Kenneth Mandl, MD, MPH, an emergency medicine physician, CHIP investigator and research director of Children’s Biopreparedness Center, is using computer modeling to instantly detect unusual public health patterns.

In 2001 Mandl developed software that could detect unusual patterns in emergency room visits at Children’s. The system instantly compares the symptoms of new patients to a database of more than 500,000 emergency room visits over the past 11 years, and raises a red flag if it detects unusual activity. For example, if the system detects more respiratory symptoms than it predicts for a particular season and day of the week, physicians would be alerted to a possible virus outbreak.

Now, with funding from state and federal agencies, Mandl is developing a large-scale project called AEGIS (Automated Epidemiologic Geotemporal Integrated Surveillance) that will use data from multiple institutions across geographic regions. The system could allow public health officials to anticipate the trajectory of an infectious disease outbreak, identify environmental health problems such as contaminated groundwater, or catch the earliest signs of a biological weapons attack. The system can already predict an emergency room’s next-day volume within seven percent, but Mandl is continuing to expand the software’s capabilities by teaching it to interpret new types of information (such as lab results) to make even more sophisticated predictions.

“While the type of data this system sifts through is entirely different from the human genome, we’re using similar tools to what the geneticists are using,” says Mandl. “That’s because we’re both looking at how multiple systems interact and behave across time and space, and we’re both extracting patterns and clusters from this overwhelming amount of data,” he says.

At first glance, Mandl’s system may appear to have little in common with Rappaport’s autism study or Kho’s tumorigenesis investigations. But ultimately, they all wrestle with the same problem: modern medicine is generating new information at a pace far too fast for traditional analysis techniques to keep up with. Children’s Hospital’s Informatics Program is building the tools to help biologists and physicians clearly see the beacons that glow and recede in a sea of information, stretching to the horizons of their imagination. As the computing power of informatics grows stronger, deciphering the signals those beacons are sending out should become as simple as identifying blood cells through a microscope.

 


To support research in the
Children’s Hospital Informatics Program,
contact Donna Richardson in the Children’s Hospital Trust
at (617) 355-2061 or donna.richardson@chtrust.org.

Dream is published by Children's Hospital Boston. © 2003 Children's Hospital Boston. All rights reserved.