In the early days of the pandemic, there wasn’t great data available, and it wasn’t easy to do better than trusting the standard epidemiological estimate that around 2% of people who got COVID-19 would die. My back of the envelope estimate at the time was way higher, but no one else I knew seemed to think that number made sense, so I let the matter drop. But now we have enough data to check.
Recently, my sister reached out to me to check her own thinking on the matter. She used the same method I initially tried and abandoned—simply dividing the number of deaths by the number of resolved cases (deaths + recoveries) - to estimate that in the US, COVID-19 kills around 1 in 6 people who get it.
The problem with using only resolved cases, in a country with an ongoing pandemic, is that if people die faster than they’re marked recovered, death rates can be inflated—and if they recover faster, deflated. Ideally, you’d want to wait until all cases have been resolved one way or the other. Fortunately, there are now countries where that situation nearly holds.
I looked at countries with the 25 lowest active case counts, using the 91-DIVOC visualization tool, to see which ones seemed to be mostly done with the pandemic, at least for now:
I copied the current numbers (as of 7 Jun 2020) into a spreadsheet, to see what the effective death rate is in countries where the vast majority of cases are resolved. For the People’s Republic of China, I only looked at numbers from Hubei.
In the countries and regions I looked at, active cases are around 1% of all confirmed cases, and around 5.8% of resolved COVID-19 cases end in death. But two thirds of cases came from Hubei. Excluding China, around 4% of confirmed cases are active, and around 3.5% of resolved cases ended in death. If I only look at other countries where fewer than 1% of confirmed cases are active, the average COVID-19 death rate is 2.6%, though in individual countries it ranges from 0.55% for Iceland to 4.67% for Croatia.
Infection fatality rate (IFR) is clearly the superior metric if you’re trying to do something like forecast the spread of the virus, and total death counts, because it corresponds more directly to a statement about underlying reality than the case fatality rate (CFR) does; the denominator of CFR is determined in part by who gets tested.
But if you’re trying to figure out what rough-and-ready multiplier to apply to the daily numbers reported in your area, then to use IFR estimates, you need to remember that reported cases are not the same as actual infections, and adjust accordingly.
Estimating COVID-19 Mortality Rates
Link post
In the early days of the pandemic, there wasn’t great data available, and it wasn’t easy to do better than trusting the standard epidemiological estimate that around 2% of people who got COVID-19 would die. My back of the envelope estimate at the time was way higher, but no one else I knew seemed to think that number made sense, so I let the matter drop. But now we have enough data to check.
Recently, my sister reached out to me to check her own thinking on the matter. She used the same method I initially tried and abandoned—simply dividing the number of deaths by the number of resolved cases (deaths + recoveries) - to estimate that in the US, COVID-19 kills around 1 in 6 people who get it.
The problem with using only resolved cases, in a country with an ongoing pandemic, is that if people die faster than they’re marked recovered, death rates can be inflated—and if they recover faster, deflated. Ideally, you’d want to wait until all cases have been resolved one way or the other. Fortunately, there are now countries where that situation nearly holds.
ETA: The other problem is that cases aren’t infections. But if, as I do, you want to use publicly reported case data to make informed personal decisions, you might also be interested in the easier to calculate expected deaths per reported case.
I looked at countries with the 25 lowest active case counts, using the 91-DIVOC visualization tool, to see which ones seemed to be mostly done with the pandemic, at least for now:
I copied the current numbers (as of 7 Jun 2020) into a spreadsheet, to see what the effective death rate is in countries where the vast majority of cases are resolved. For the People’s Republic of China, I only looked at numbers from Hubei.
In the countries and regions I looked at, active cases are around 1% of all confirmed cases, and around 5.8% of resolved COVID-19 cases end in death. But two thirds of cases came from Hubei. Excluding China, around 4% of confirmed cases are active, and around 3.5% of resolved cases ended in death. If I only look at other countries where fewer than 1% of confirmed cases are active, the average COVID-19 death rate is 2.6%, though in individual countries it ranges from 0.55% for Iceland to 4.67% for Croatia.
I’m adjusting my freak-out-and-hole-up threshold accordingly.
UPDATE: In the comments on my blog, Anna Salamon shared this estimate of the actual infection fatality rate for the state of New York.
Infection fatality rate (IFR) is clearly the superior metric if you’re trying to do something like forecast the spread of the virus, and total death counts, because it corresponds more directly to a statement about underlying reality than the case fatality rate (CFR) does; the denominator of CFR is determined in part by who gets tested.
But if you’re trying to figure out what rough-and-ready multiplier to apply to the daily numbers reported in your area, then to use IFR estimates, you need to remember that reported cases are not the same as actual infections, and adjust accordingly.