<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Millard LAC</submitter><funding>Integrative Epidemiology Unit</funding><funding>The UK Medical Research Council and Wellcome</funding><funding>NIHR</funding><funding>British Heart Foundation</funding><funding>Wellcome Trust’s</funding><funding>Elizabeth Blackwell Institute for Research</funding><funding>University of Bristol</funding><funding>Strategic Support Science</funding><funding>the Royal Society</funding><funding>University of Bristol for the Questionnaire 'COVID1'</funding><funding>Wellcome Trust's</funding><funding>University of Bristol and Medical Research Council</funding><funding>Longitudinal Population Study Covid-19 Steering Group and Secretariat</funding><funding>University of Bristol Faculty Director's Discretionary Fund for the Questionnaire 'COVID2'</funding><funding>University of Bristol for the Questionnaire ‘COVID1’</funding><funding>Medical Research Council</funding><funding>European Union’s Horizon 2020</funding><funding>Bristol BHF Accelerator Award</funding><funding>National Institute for Health Research (NIHR)</funding><funding>University of Bristol Faculty Director’s Discretionary Fund for the Questionnaire ‘COVID2’</funding><funding>Wellcome Trust</funding><funding>European Union's Horizon 2020</funding><funding>MRC</funding><pagination>44-57</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC9908043</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>52(1)</volume><pubmed_abstract>&lt;h4>Background&lt;/h4>Non-random selection of analytic subsamples could introduce selection bias in observational studies. We explored the potential presence and impact of selection in studies of SARS-CoV-2 infection and COVID-19 prognosis.&lt;h4>Methods&lt;/h4>We tested the association of a broad range of characteristics with selection into COVID-19 analytic subsamples in the Avon Longitudinal Study of Parents and Children (ALSPAC) and UK Biobank (UKB). We then conducted empirical analyses and simulations to explore the potential presence, direction and magnitude of bias due to this selection (relative to our defined UK-based adult target populations) when estimating the association of body mass index (BMI) with SARS-CoV-2 infection and death-with-COVID-19.&lt;h4>Results&lt;/h4>In both cohorts, a broad range of characteristics was related to selection, sometimes in opposite directions (e.g. more-educated people were more likely to have data on SARS-CoV-2 infection in ALSPAC, but less likely in UKB). Higher BMI was associated with higher odds of SARS-CoV-2 infection and death-with-COVID-19. We found non-negligible bias in many simulated scenarios.&lt;h4>Conclusions&lt;/h4>Analyses using COVID-19 self-reported or national registry data may be biased due to selection. The magnitude and direction of this bias depend on the outcome definition, the true effect of the risk factor and the assumed selection mechanism; these are likely to differ between studies with different target populations. Bias due to sample selection is a key concern in COVID-19 research based on national registry data, especially as countries end free mass testing. The framework we have used can be applied by other researchers assessing the extent to which their results may be biased for their research question of interest.</pubmed_abstract><journal>International journal of epidemiology</journal><pubmed_title>Exploring the impact of selection bias in observational studies of COVID-19: a simulation study.</pubmed_title><pmcid>PMC9908043</pmcid><funding_grant_id>733206</funding_grant_id><funding_grant_id>MC_UU_00011/3</funding_grant_id><funding_grant_id>MC_UU_12023/21</funding_grant_id><funding_grant_id>NF-0616-10102</funding_grant_id><funding_grant_id>MC_UU_00011/1</funding_grant_id><funding_grant_id>AA/18/1/34219</funding_grant_id><funding_grant_id>ES/T009101/1</funding_grant_id><funding_grant_id>215408/Z/19/Z</funding_grant_id><funding_grant_id>CH/F/20/90003</funding_grant_id><funding_grant_id>MC_UU_12023/29</funding_grant_id><funding_grant_id>217065/Z/19/Z</funding_grant_id><funding_grant_id>MC_UU_00004/07</funding_grant_id><funding_grant_id>221574/Z/20/Z</funding_grant_id><funding_grant_id>NF-0616–10102</funding_grant_id><funding_grant_id>MC_UU_00011/6</funding_grant_id><pubmed_authors>Fernandez-Sanles A</pubmed_authors><pubmed_authors>Griffith GJ</pubmed_authors><pubmed_authors>Major-Smith D</pubmed_authors><pubmed_authors>Tilling K</pubmed_authors><pubmed_authors>Clayton GL</pubmed_authors><pubmed_authors>Borges MC</pubmed_authors><pubmed_authors>Carter AR</pubmed_authors><pubmed_authors>Morris TP</pubmed_authors><pubmed_authors>Kawabata E</pubmed_authors><pubmed_authors>Lawlor DA</pubmed_authors><pubmed_authors>Hughes RA</pubmed_authors><pubmed_authors>Davey Smith G</pubmed_authors><pubmed_authors>Millard LAC</pubmed_authors></additional><is_claimable>false</is_claimable><name>Exploring the impact of selection bias in observational studies of COVID-19: a simulation study.</name><description>&lt;h4>Background&lt;/h4>Non-random selection of analytic subsamples could introduce selection bias in observational studies. We explored the potential presence and impact of selection in studies of SARS-CoV-2 infection and COVID-19 prognosis.&lt;h4>Methods&lt;/h4>We tested the association of a broad range of characteristics with selection into COVID-19 analytic subsamples in the Avon Longitudinal Study of Parents and Children (ALSPAC) and UK Biobank (UKB). We then conducted empirical analyses and simulations to explore the potential presence, direction and magnitude of bias due to this selection (relative to our defined UK-based adult target populations) when estimating the association of body mass index (BMI) with SARS-CoV-2 infection and death-with-COVID-19.&lt;h4>Results&lt;/h4>In both cohorts, a broad range of characteristics was related to selection, sometimes in opposite directions (e.g. more-educated people were more likely to have data on SARS-CoV-2 infection in ALSPAC, but less likely in UKB). Higher BMI was associated with higher odds of SARS-CoV-2 infection and death-with-COVID-19. We found non-negligible bias in many simulated scenarios.&lt;h4>Conclusions&lt;/h4>Analyses using COVID-19 self-reported or national registry data may be biased due to selection. The magnitude and direction of this bias depend on the outcome definition, the true effect of the risk factor and the assumed selection mechanism; these are likely to differ between studies with different target populations. Bias due to sample selection is a key concern in COVID-19 research based on national registry data, especially as countries end free mass testing. The framework we have used can be applied by other researchers assessing the extent to which their results may be biased for their research question of interest.</description><dates><release>2023-01-01T00:00:00Z</release><publication>2023 Feb</publication><modification>2025-04-04T12:55:55.529Z</modification><creation>2025-04-04T12:55:55.529Z</creation></dates><accession>S-EPMC9908043</accession><cross_references><pubmed>36474414</pubmed><doi>10.1093/ije/dyac221</doi></cross_references></HashMap>