Unknown

Dataset Information

0

Efficient estimation of smooth distributions from coarsely grouped data.


ABSTRACT: Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group-specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the underlying distribution is smooth. Because of this modest assumption, the approach is suitable for most applications. The method is based on the composite link model, with a penalty added to ensure the smoothness of the target distribution. Estimates are obtained by maximizing a penalized likelihood. This maximization is performed efficiently by a version of the iteratively reweighted least-squares algorithm. Optimal values of the smoothing parameter are chosen by minimizing Akaike's Information Criterion. We demonstrate the performance of this method in a simulation study and provide several examples that illustrate the approach. Wide, open-ended intervals can be handled properly. The method can be extended to the estimation of rates when both the event counts and the exposures to risk are grouped.

SUBMITTER: Rizzi S 

PROVIDER: S-EPMC4493979 | biostudies-literature | 2015 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient estimation of smooth distributions from coarsely grouped data.

Rizzi Silvia S   Gampe Jutta J   Eilers Paul H C PH  

American journal of epidemiology 20150616 2


Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group-specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the un  ...[more]

Similar Datasets

| S-EPMC6540566 | biostudies-literature
| S-EPMC10653189 | biostudies-literature
| S-EPMC9841728 | biostudies-literature
| S-EPMC4232941 | biostudies-literature
| S-EPMC4431962 | biostudies-literature
| S-EPMC4476533 | biostudies-literature
| S-EPMC4581539 | biostudies-literature
| S-EPMC6075718 | biostudies-literature
| S-EPMC11650374 | biostudies-literature
| S-EPMC8652033 | biostudies-literature