https://pixabay.com/photos/good-bad-opposite-choice-choose-1123013/

Why amateur COVID-19 predictions are worthless — featuring ARIMA

Check out the latest COVID-19 predictions, and why I call them worthless. If you are not a public health expert, this post is for you. And if you happen to be an expert, you’ll find some really interesting results here. All with the ease of BigQuery and SQL.

tl;dr: ARIMA is widely used by domain experts to predict timeseries in different domains. With BigQuery you can easily find public datasets and ready to use ML models (including ARIMA, in alpha today). Amateurs beware, especially with topics this critical: #vizresponsibly.

ARIMA predictions vs reality, Japan

ARIMA predictions vs reported numbers, Japan

ARIMA predictions vs reality, USA

ARIMA predictions vs reported numbers, USA

Why your predictions are dangerous

What to do instead

What’s good, awesome, and useful about ARIMA in BigQuery

Easy ARIMA and easy access to data

Domain experts like ARIMA

Accelerating experts

Stupidly good results, stupidly large confidence intervals

The predictions from the start of this post, visualized.
Stupidly good results over 14 days of predictions
These predictions look dangerously attractive. Don’t trust this chart.
The forecast and confidence interval, visualized.

Additional note on the confidence intervals

Want more?

A thread full of examples on what disclaimers to include on your dashboards and charts.

Data Cloud Advocate at Snowflake ❄️. Originally from Chile, now in San Francisco and around the world. Previously at Google. Let’s talk data.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store