Dataset Information

Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).

ABSTRACT:

Background

The intraclass correlation coefficient (ICC) is widely used in biomedical research to assess the reproducibility of measurements between raters, labs, technicians, or devices. For example, in an inter-rater reliability study, a high ICC value means that noise variability (between-raters and within-raters) is small relative to variability from patient to patient. A confidence interval or Bayesian credible interval for the ICC is a commonly reported summary. Such intervals can be constructed employing either frequentist or Bayesian methodologies.

Methods

This study examines the performance of three different methods for constructing an interval in a two-way, crossed, random effects model without interaction: the Generalized Confidence Interval method (GCI), the Modified Large Sample method (MLS), and a Bayesian method based on a noninformative prior distribution (NIB). Guidance is provided on interval construction method selection based on study design, sample size, and normality of the data. We compare the coverage probabilities and widths of the different interval methods.

Results

We show that, for the two-way, crossed, random effects model without interaction, care is needed in interval method selection because the interval estimates do not always have properties that the user expects. While different methods generally perform well when there are a large number of levels of each factor, large differences between the methods emerge when the number of one or more factors is limited. In addition, all methods are shown to lack robustness to certain hard-to-detect violations of normality when the sample size is limited.

Conclusions

Decision rules and software programs for interval construction are provided for practical implementation in the two-way, crossed, random effects model without interaction. All interval methods perform similarly when the data are normal and there are sufficient numbers of levels of each factor. The MLS and GCI methods outperform the NIB when one of the factors has a limited number of levels and the data are normally distributed or nearly normally distributed. None of the methods work well if the number of levels of a factor are limited and data are markedly non-normal. The software programs are implemented in the popular R language.

SUBMITTER: Ionan AC

PROVIDER: S-EPMC4258044 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).

Ionan Alexei C AC Polley Mei-Yin C MY McShane Lisa M LM Dobbin Kevin K KK

BMC medical research methodology 20141122

<h4>Background</h4>The intraclass correlation coefficient (ICC) is widely used in biomedical research to assess the reproducibility of measurements between raters, labs, technicians, or devices. For example, in an inter-rater reliability study, a high ICC value means that noise variability (between-raters and within-raters) is small relative to variability from patient to patient. A confidence interval or Bayesian credible interval for the ICC is a commonly reported summary. Such intervals can b ...[more]

PMID: 25417040

Dataset Information

Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).

Background

Methods

Results

Conclusions

Publications

Comparison of confidence interval methods for an intra-class correlation coefficient (ICC).

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Similar Datasets

Simulation data for an estimation of the maximum theoretical value and confidence interval for the correlation coefficient.
| S-EPMC5540710 | biostudies-literature

The coefficient of determination R2 and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded.
| S-EPMC5636267 | biostudies-literature

Comparison of Bootstrap Confidence Interval Methods for GSCA Using a Monte Carlo Simulation.
| S-EPMC6797821 | biostudies-literature

Confidence interval methods for antimicrobial resistance surveillance data.
| S-EPMC8191092 | biostudies-literature

A self-normalized confidence interval for the mean of a class of nonstationary processes.
| S-EPMC3852676 | biostudies-literature

Indirect Effects in Sequential Mediation Models: Evaluating Methods for Hypothesis Testing and Confidence Interval Formation.
| S-EPMC6901816 | biostudies-literature

Confidence interval estimation in R-DAS.
| S-EPMC6003776 | biostudies-literature

Confidence interval for quantiles and percentiles.
| S-EPMC6294150 | biostudies-literature

A Comparison of Methods to Measure the Coupling Coefficient of Electromagnetic Vibration Energy Harvesters.
| S-EPMC6952930 | biostudies-literature

Confidence intervals for the common coefficient of variation of rainfall in Thailand.
| S-EPMC7513754 | biostudies-literature