Dataset Information

Modeling zero inflation and overdispersion in provisional Covid-19 death counts

ABSTRACT:

SUBMITTER: Effiong A

PROVIDER: S-EPMC9444171 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature


Similar Datasets

Project description:
Purpose: To facilitate use of timely, granular, and publicly available data on COVID-19 mortality, we provide a method for imputing suppressed COVID-19 death counts in the National Center for Health Statistics' 2020 provisional mortality data by quarter, county, and age.
Methods: We used a Bayesian approach to impute suppressed COVID-19 death counts by quarter, county, and age in provisional data for 3,138 US counties. Our model accounts for multilevel data structures; numerous zero death counts among persons aged <50 years, in rural counties, and in early quarters of 2020; highly right-skewed distributions; and different levels of data granularity (county, state or locality, and national levels). To impute the suppressed death counts, we compared three models with different prior assumptions about suppressed COVID-19 deaths: noninformative priors (M1), the same weakly informative priors for all age groups (M2), and weakly informative priors that differ by age (M3). After the imputed suppressed counts were available, we assessed the three prior assumptions at the national, state/locality, and county levels. Finally, we compared US counties by two types of COVID-19 death rates, crude (CDR) and age-standardized (ASDR), which can be estimated only by imputing suppressed death counts.
Results: Without imputation, the total COVID-19 death count estimated from the raw data underestimated the reported national COVID-19 deaths by 18.60%. Using imputed data, we overestimated the national COVID-19 deaths by 3.57% (95% CI: 3.37%-3.80%) in model M1, 2.23% (95% CI: 2.04%-2.43%) in model M2, and 2.96% (95% CI: 2.76%-3.16%) in model M3 compared with the national report. The top 20 counties most affected by COVID-19 mortality differed between CDR and ASDR.
Conclusions: Bayesian imputation of suppressed county-level, age-specific COVID-19 deaths in US provisional data can improve county ASDR estimates and aid public health officials in identifying disparities in deaths from COVID-19.
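The difference between crude and age-standardized death rates mentioned above can be illustrated with a minimal sketch of direct standardization. This is not the authors' code: the function names, the three age bands, the standard-population weights, and the two counties' counts are all hypothetical, chosen so that both counties have identical age-specific rates but different age structures.

```python
def crude_rate(deaths, population, per=100_000):
    """Crude death rate: total deaths over total population."""
    return per * sum(deaths) / sum(population)


def age_standardized_rate(deaths, population, std_weights, per=100_000):
    """Directly standardized rate: age-specific rates weighted by a
    standard population's age distribution (weights are normalized)."""
    total_weight = sum(std_weights)
    return per * sum(
        (w / total_weight) * (d / p)
        for d, p, w in zip(deaths, population, std_weights)
    )


# Hypothetical standard-population weights for three age bands.
std_weights = [0.60, 0.25, 0.15]

# Hypothetical counties with the same age-specific death rates
# (2, 50, and 500 per 100,000) but different age structures.
county_a = {"deaths": [1, 15, 100], "population": [50_000, 30_000, 20_000]}
county_b = {"deaths": [2, 15, 50], "population": [100_000, 30_000, 10_000]}

for name, c in (("A", county_a), ("B", county_b)):
    cdr = crude_rate(c["deaths"], c["population"])
    asdr = age_standardized_rate(c["deaths"], c["population"], std_weights)
    print(f"County {name}: CDR={cdr:.1f}, ASDR={asdr:.1f}")
```

In this toy setup the older county has a much higher crude rate, while the age-standardized rates coincide, which is one way county rankings by CDR and ASDR can diverge. The calculation needs a death count for every age band, which is why suppressed counts have to be imputed first.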

Project description: In 2013, Thailand was ranked second in the world in road accident fatalities (RAFs), with 36.2 per 100,000 people. During the Songkran festival, which takes place during the traditional Thai New Year in April, the numbers of road traffic accidents (RTAs) and RAFs are markedly higher than on regular days, but few studies have investigated this issue as an effect of festivity. This study investigated the factors that contribute to RAFs using various count regression models. Data on 20,229 accidents in 2015 were collected from the Department of Disaster Prevention and Mitigation in Thailand. The Poisson and Conway-Maxwell-Poisson (CMP) distributions and their zero-inflated (ZI) versions were fitted to the data. The results showed that RAFs in Thailand follow a count distribution with underdispersion and excessive zeros, which is rare. The ZICMP model marginally outperformed the CMP model, suggesting that having many zeros does not necessarily mean that the ZI model is required. The model choice depends on the question of interest, and a separate set of predictors highlights the distinct aspects of the data. Using ZICMP, road, weather, and environmental factors affected the differences in RAFs among all accidents, whereas month distinguished actual non-fatal accidents and crashes with or without deaths. As expected, actual non-fatal accidents were 2.37 times higher in April than in January. Using CMP, these variables were significant predictors of zeros and frequent deaths in each accident. The RAF average was surprisingly higher in other months than in January, except for April, which was unexpectedly lower. Thai authorities have invested considerable effort and resources to improve road safety during festival weeks to no avail. However, our study results indicate that people's risk perceptions and public awareness of RAFs are misleading. Therefore, nationwide road safety should instead be advocated by the authorities to raise society's awareness of everyday personal safety and the safety of others.
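As a rough illustration of what "zero-inflated" means for a count model, the sketch below implements the probability mass function of a zero-inflated Poisson (ZIP) distribution. It is not the study's model: the ZICMP described above replaces the Poisson component with a Conway-Maxwell-Poisson one, which is omitted here for brevity, and the function names and parameter values (`lam`, `pi`) are hypothetical.

```python
import math


def poisson_pmf(k, lam):
    """Poisson probability of observing count k with mean lam."""
    return math.exp(-lam) * lam**k / math.factorial(k)


def zip_pmf(k, lam, pi):
    """Zero-inflated Poisson: with probability pi the count is a
    structural zero; otherwise it is drawn from Poisson(lam)."""
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


# Hypothetical parameters: 30% structural zeros, Poisson mean 0.8.
lam, pi = 0.8, 0.3
print("P(0) plain Poisson:", round(poisson_pmf(0, lam), 4))
print("P(0) zero-inflated:", round(zip_pmf(0, lam, pi), 4))
```

The mixture adds probability mass at zero beyond what the count component alone produces. A ZIP model is always overdispersed relative to a Poisson with the same mean, so it could not represent the underdispersion reported above; that is the motivation for a more flexible count component such as CMP.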

