Three lessons from weather forecasting that will improve disease forecasting and outbreak prediction


The popularity and wide use of weather forecasts has been largely attributable to the dramatic improvement in forecast accuracy. Such improvements have been quantified in recent research showing that modern 5-day weather forecasts are as accurate as 1-day forecasts in 1980. Disease forecasts are not nearly as accurate as modern weather forecasts, as documented in ongoing evaluations of COVID-19 forecast models. So, what can we learn from weather forecasting that might help us develop more robust disease forecasting and outbreak predictions?

Dr. Dylan George, head of CDCs Center for Forecasting and Outbreak Analytics (CFA) describes how disease forecasting can follow the lead of weather forecasting:

“We use weather forecasts to pre-position resources for hurricanes and to determine if we need an umbrella on a rainy day. We can use disease forecasts to determine how much vaccine we need to manufacture or if we should wear a mask that day to go out. Better data and better analytics will definitely generate better responses to health emergencies.”

As the leading provider of weather data and analytics, we at IBM believe Dr. George offers a compelling vision. 

More data sources lead to greater accuracy

An explosion in the volume and variety of weather data has enabled dramatic improvements in forecast accuracy. Whereas fifty years ago, weather data was mostly confined to temperature, barometric and other readings taken at scattered weather stations, weather station data today is augmented with data from a growing network of satellites, remote sensors, radar stations, weather balloons and other sources. 

Today, disease surveillance data is still largely confined to case reports from health clinics and hospitals, although the variety and volume of data has been growing. Syndromic and wastewater surveillance data are adding to traditional case reporting as a means to monitor community infection. And non-traditional data sources (like internet search trends and social media user surveys) offer the potential to obtain more real-time and hyperlocal information. 

To make progress toward better disease forecasting, the volume and variety of disease surveillance data will need to continue growing. Public health investments need to focus on seeding and growing these new data sources for disease surveillance. And following the experience in weather forecasting, additional investment will be needed to harmonize these disparate data sources into a unified spacio-temporal view of community infection.

Learn more about how data strategies deliver insights to the public

Innovative modeling enables advanced disease surveillance

Advances in weather modeling and simulation—enabled by breakthroughs in machine learning and exponential growth in computing power—have been a key factor enabling improved weather forecasting.  In the 1970s, weather forecasts mostly relied on numerical weather prediction methods. These days, methods are augmented with machine learning algorithms that enable accurate prediction of storm events and paths. For example, the Weather Company generates the most accurate publicly available weather forecasts, leveraging the IBM GRAF machine learning algorithms for weather prediction.

Today, disease forecasting largely relies on long-standing SIR-based—Susceptible, Infectious, Recovered—epidemiological models, although recent COVID-19 modeling has begun to incorporate more advanced machine learning algorithms, with improvements in forecast accuracy. Recent developments like the CDC’s Epidemic Prediction Initiative show promise, and the CDC CFA is investing in continued innovation to improve disease forecasting in the United States.

Continued progress in developing innovative modeling techniques will be important for achieving the vision of robust disease forecasting and outbreak predictions. Public health authorities, university researchers and private corporations can productively partner to help advance the application of advanced analytics to disease surveillance. IBM’s engagement with the Rhode Island Department of Health is a good example of what can be accomplished through public-private collaboration. IBM collaborated with RIDOH and Brown University epidemiologists to develop smart ensembles of multiple COVID-19 models for more accurate pandemic forecasts, providing 95% accuracy in forecasting the large omicron outbreak in January 2022. Our collaboration continues today with the application of machine learning to infer community infection from syndromic surveillance and wastewater surveillance data.

Modern platforms will deliver data and insights to the public

As more data and better modeling dramatically improved the accuracy of weather forecasting, a robust technology infrastructure emerged to enable high speed data processing, modeling updates and easy access to actionable insights. While weather forecasts used to be largely distributed daily through newspapers, radio and television, they’re now available on demand through the internet and mobile applications, and updated multiple times per day as conditions evolve. The ubiquity of this information enables people throughout the world to adjust plans and behaviors to minimize weather-related property damage and fatalities.

Disease forecasts, however, are not readily available to the public, as COVID-19 forecasts are only accessible on the internet to those who know where to find them. We can see the beginnings of a modern data and analytics platform to support disease surveillance, enabling automated data processing and modeling. But much progress is still needed in the public dissemination of actionable insights. One can imagine a future where infectious disease warnings are as readily available as hazardous weather warnings, enabling people to adjust plans and behaviors to minimize morbidity and mortality related to infectious disease.

To achieve that future, public health authorities need to invest in modern platforms to process data, generate actionable insights and disseminate those insights to the public. The CDC’s Data Modernization Initiative and associated grant funding to states and localities is a good start. Such funding enables public-private collaboration to jumpstart public health data modernization. A good example of a successful public-private partnership is IBM’s collaboration with Canadian and other public health authorities to develop and deploy a modern public health data platform.   

Research shows that more accurate weather forecasting has saved lives and generated economic benefits exceeding required investments. Similar investments to improve the accuracy and availability of disease forecasts would also save lives and significantly reduce the economic burden of unmitigated infectious disease outbreaks.

Connect with IBM experts to unlock your data’s potential Learn more about driving data democratization with modern architecture

The post Three lessons from weather forecasting that will improve disease forecasting and outbreak prediction appeared first on IBM Blog.