Rasch analysis of the patient-rated wrist evaluation questionnaire.
ABSTRACT: The Patient-Rated Wrist Evaluation (PRWE) was developed as a wrist joint specific measure of pain and disability and evidence of sound validity has been accumulated through classical psychometric methods. Rasch analysis (RA) has been endorsed as a newer method for analyzing the clinical measurement properties of self-report outcome measures. The purpose of this study was to evaluate the PRWE using Rasch modeling.We employed the Rasch model to assess overall fit, response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 382 patients with distal radius fracture was recruited from the hand and upper limb clinic at large academic healthcare organization, London, Ontario, Canada, 6-month post-injury scores of the PRWE was used. RA was conducted on the 3 subscales (pain, specific activities, and usual activities) of the PRWE separately.The pain subscale adequately fit the Rasch model when item 4 "Pain - When it is at its worst" was deleted to eliminate non-uniform DIF by age group, and item 5 "How often do you have pain" was rescored by collapsing into 8 intervals to eliminate disordered thresholds. Uniform DIF for "Use my affected hand to push up from the chair" (by work status) and "Use bathroom tissue with my affected hand" (by injured hand) was addressed by splitting the items for analysis. After background rescoring of 2 items in pain subscale, 2 items in specific activities and 3 items in usual activities, all three subscales of the PRWE were well targeted and had high reliability (PSI?=?0.86). These changes provided a unidimensional, interval-level scaled measure.Like a previous analysis of the Patient-Rated Wrist and Hand Evaluation, this study found the PRWE could be fit to the Rasch model with rescoring of multiple items. However, the modifications required to achieve fit were not the same across studies, our fit statistics also suggested one of the pain items should be deleted. This study adds to the pool of evidence supporting the PRWE, but cannot confidently provide a Rasch-based scoring algorithm.
Project description:To analyze the validity of the Hospital Anxiety and Depression Scale (HADS) among Chinese cataract population.A total of 275 participants with unilateral or bilateral cataract were recruited to complete the Chinese version of HADS. The patients' demographic and ophthalmic characteristics were documented. Rasch analysis was conducted to examine the model fit statistics, the thresholds ordering of the polytomous items, targeting, person separation index and reliability, local dependency, unidimentionality, differential item functioning (DIF) and construct validity of the HADS individual and summary measures.Rasch analysis was performed on anxiety and depression subscales as well as HADS-Total score respectively. The items of original HADS-Anxiety, HADS-Depression and HADS-Total demonstrated evidence of misfit of the Rasch model. Removing items A7 for anxiety subscale and rescoring items D14 for depression subscale significantly improved Rasch model fit. A 12-item higher order total scale with further removal of D12 was found to fit the Rasch model. The modified items had ordered response thresholds. No uniform DIF was detected, whereas notable non-uniform DIF in high-ability group was found. The revised cut-off points were given for the modified anxiety and depression subscales.The modified version of HADS with HADS-A and HADS-D as subscale and HADS-T as a higher-order measure is a reliable and valid instrument that may be useful for assessing anxiety and depression states in Chinese cataract population.
Project description:There is no disease-specific instrument to describe health-related quality of life (HRQoL) in Indian patients with asthma. However, an abbreviated World Health Organization Quality of Life questionnaire (WHOQOL-Bref), a generic Hindi HRQoL measure, has been developed and validated in India.To evaluate the WHOQOL-Bref in adult patients with asthma and to test possible modifications to the instrument to improve its psychometric adequacy.Sixty-seven patients with asthma completed the WHOQOL-Bref. Rasch analysis was used to explore the psychometric performance of the four domains (physical, psychological, social relationships and environment) of the scale. Overall fit of data to model expectations, appropriate category ordering, presence of differential item functioning, individual item fit and targeting of item difficulty to patient ability were explored for each domain. Item deletion and rescoring were applied to misfitting items to improve overall performance.The overall fit of the WHOQOL-Bref data was adequate. Item 3 (pain prevents doing work) displayed a large positive fit residual value (indicating violation of unidimensionality), resulting in poor construct validity for the physical domain. No item exhibited differential item functioning. Ten items had disordered thresholds. The WHOQOL-Bref was modified by dropping item 3 and rescoring category structures of 16 items. The modified scale had good construct validity for all domains, ordered thresholds for all items and good targeting of items to persons.The WHOQOL-Bref performed inadequately in describing HRQoL in the asthma patients studied. However, when modified by Rasch analysis, the scale proved better than the original scale.
Project description:BACKGROUND:Evidence-based practice (EBP) is a complex process. To quantify it, one has to also consider individual and contextual factors using multiple measures. Modern measurement approaches are available to optimize the measurement of complex constructs. This study aimed to develop a robust measurement approach for constructs around EBP including practice, individual (e.g. knowledge, attitudes, confidence, behaviours), and contextual factors (e.g. resources). METHODS:One hundred eighty-one items arising from 5 validated EBP measures were subjected to an item analysis. Nominal group technique was used to arrive at a consensus about the content relevance of each item. Baseline questionnaire responses from a longitudinal study of the evolution of EBP in 128 new graduates of Canadian physical and occupational therapy programmes were analysed. Principles of Rasch Measurement Theory were applied to identify challenges with threshold ordering, item and person fit to the Rasch model, unidimensionality, local independence, and differential item functioning (DIF). RESULTS:The nominal group technique identified 70/181 items, and modified Delphi approach identified 68 items that fit a formative model (2 related EBP domains: self-use of EBP (9 items) and EBP activities (7 items)) or a reflective model (4 related EBP domains: attitudes towards EBP (17 items), self-efficacy (9 items), knowledge (11 items) and resources (15 items)). Rasch analysis provided a single score for reflective construct. Among attitudes items, 65% (11/17) fit the Rasch model, item difficulties ranged from - 7.51 to logits (least difficult) to + 5.04 logits (most difficult), and person separation index (PSI) = 0.63. Among self-efficacy items, 89% (8/9) fit the Rasch model, item difficulties ranged from - 3.70 to + 4.91, and PSI = 0.80. Among knowledge items, 82% (9/11) fit the Rasch model, item difficulties ranged from - 7.85 to 4.50, and PSI = 0.81. Among resources items, 87% (13/15) fit the Rasch model, item difficulties ranged from - 3.38 to 2.86, and PSI = 0.86. DIF occurred in 2 constructs: attitudes (1 by profession and 2 by language) and knowledge (1 by language and 2 by profession) arising from poor wording in the original version leading to poor translation. CONCLUSIONS:Rasch Measurement Theory was applied to develop a valid and reliable measure of EBP. Further modifications to the items can be done for subsequent waves of the survey.
Project description:The Swallowing Quality-of-Life Questionnaire (SWAL-QoL) is considered the gold standard for assessing health-related QoL in oropharyngeal dysphagia. The Dutch translation (DSWAL-QoL) and its adjusted version (aDSWAL-QoL) have been validated using classical test theory (CTT). However, these scales have not been tested against the Rasch measurement model, which is required to establish the structural validity and objectivity of the total scale and subscale scores. Thus, the purpose of this study was to examine the psychometric properties of these scales using item analysis according to the Rasch model.Item analysis with the Rasch model was performed using RUMM2030 software with previously collected data from a validation study of 108 patients. The assessment included evaluations of overall model fit, reliability, unidimensionality, threshold ordering, individual item and person fits, differential item functioning (DIF), local item dependency (LID) and targeting.The analysis could not establish the psychometric properties of either of the scales or their subscales because they did not fit the Rasch model, and multidimensionality, disordered thresholds, DIF, and/or LID were found. The reliability and power of fit were high for the total scales (PSI?=?0.93) but low for most of the subscales (PSI?<?0.70). The targeting of persons and items was suboptimal. The main source of misfit was disordered thresholds for both the total scales and subscales. Based on the results of the analysis, adjustments to improve the scales were implemented as follows: disordered thresholds were rescaled, misfit items were removed and items were split for DIF. However, the multidimensionality and LID could not be resolved. The reliability and power of fit remained low for most of the subscales.This study represents the first analyses of the DSWAL-QoL and aDSWAL-QoL with the Rasch model. Relying on the DSWAL-QoL and aDSWAL-QoL total and subscale scores to make conclusions regarding dysphagia-related HRQoL should be treated with caution before the structural validity and objectivity of both scales have been established. A larger and well-targeted sample is recommended to derive definitive conclusions about the items and scales. Solutions for the psychometric weaknesses suggested by the model and practical implications are discussed.
Project description:We sought to evaluate the validity of the Spanish language version of the patient health questionnaire-9 (PHQ-9) depression scale in a large sample of pregnant Peruvian women using Rasch item response theory (IRT) approaches. We further sought to examine the appropriateness of the response formats, reliability and potential differential item functioning (DIF) by maternal age, educational attainment and employment status.This cross-sectional study was conducted among 1520 pregnant women in Lima, Peru. A structured interview was used to collect information on demographic characteristics and PHQ-9 items. Data from the PHQ-9 were fitted to the Rasch IRT model and tested for appropriate category ordering, the assumptions of unidimensionality and local independence, item fit, reliability and presence of DIF.The Spanish language version of PHQ-9 demonstrated unidimensionality, local independence, and acceptable fit for the Rasch IRT model. However, we detected disordered response categories for the original four response categories. After collapsing "more than half the days" and "nearly every day", the response categories ordered properly and the PHQ-9 fit the Rasch IRT model. The PHQ-9 had moderate internal consistency (person separation index, PSI=0.72). Additionally, the items of PHQ-9 were free of DIF with regard to age, educational attainment, and employment status.The Spanish language version of the PHQ-9 was shown to have item properties of an effective screening instrument. Collapsing rating scale categories and reconstructing three-point Likert scale for all items improved the fit of the instrument. Future studies are warranted to establish new cutoff scores and criterion validity of the three-point Likert scale response options for the Spanish language version of the PHQ-9.
Project description:Objectives:To evaluate application of the Barthel Index (BI) in assessing basic activities of daily living (ADL) of patients with dementia using Rasch analysis. Design:A multi-country cross-sectional study. Setting and Participants:Nineteen long-term care facilities located in China, Japan, South Korea, and Thailand. A total of 644 patients with dementia were included. Methods:Unidimensionality, global and item fit, local dependence, person-item targeting, threshold disordering, and differential item functioning (DIF) were examined. Negative correlations between scores for DIF items and Neuropsychiatric Inventory Nursing Home version (NPI-NH) were evaluated. Results:Item reliability (1.0) and person reliability (.88) were acceptable. The Rasch dimension explained 72.9% of the variance (Eigenvalue = 27), while the first contrast explained 6.6% (Eigenvalue = 2.4). The "mobility" was misfitting to the Rasch model (infit mean square = 1.86). The overall difficulty of the BI exceeded patients' ability (person location = -2.27 logits). The "stairs climbing" and "mobility" showed narrow category thresholds (< 1.4 logits). The location of "controlling bladder" and "toilet use" overlapped. Removing "stairs climbing", collapsing categories with narrow threshold widths in "mobility", and combining "controlling bowel" and "controlling bladder" into one item, improved unidimensionality, and item fit of the scale. Only three items ("grooming", "dressing", and "toilet use") were free from DIF across countries. The scores for "feeding" were negatively related to scores for "disinhibition" (r = -0.46, P < 0.01), and scores for "controlling bowel" were negatively related to scores for "disinhibition" (r = -0.44, P < 0.01), "agitation" (r = -0.32, P < 0.05), and "aggression" (r = -0.27, P < 0.01) in Japanese samples. Conclusions and Implications:The performance of the BI for assessing patients with dementia might be compromised by misfit items, person-item mistargeting, measurement gaps, redundant items, narrow threshold width, and item bias. Mobility ability might not be helpful for determining capability of basic ADL in the patients. Comparisons of BI scores between countries should be undertaken with caution due to item bias. Neuropsychiatric symptoms might interact with basic ADL abilities of the patients. We will not suggest using the instrument in patients with dementia, without future refining to improve its performance.
Project description:BACKGROUND:Accurate measurement of health literacy is essential to improve accessibility and effectiveness of health care and prevention. One measure frequently applied in international research is the Short Assessment of Health Literacy (SAHL). While the Dutch SAHL (SAHL-D) has proven to be valid and reliable, its administration is time consuming and burdensome for participants. Our aim was to further validate, strengthen and shorten the SAHL-D using Rasch analysis. METHODS:Available cross-sectional SAHL-D data was used from adult samples (N = 1231) to assess unidimensionality, local independence, item fit, person fit, item hierarchy, scale targeting, precision (person reliability and person separation), and presence of differential item functioning (DIF) depending on age, gender, education and study sample. RESULTS:Thirteen items for a short form were selected based on item fit and DIF, and scale properties were compared between the two forms. The long form had several items with DIF for age, gender, educational level and study sample. Both forms showed lower measurement precision at higher health literacy levels. CONCLUSIONS:The findings support the validity and reliability of the SAHL-D for the long form and the short form, which can be used for a rapid assessment of health literacy in research and clinical practice.
Project description:The Interpersonal Needs Questionnaire (INQ) assesses Thwarted Belongingness (TB) and Perceived Burdensomeness (PB), two predictors of suicidal thoughts. Up to now, the use of item response theory (IRT) for the evaluation of the INQ has been restricted to a single study with clinically depressed and suicidal youth. Therefore, the psychometric properties of the two INQ-15-subscales TB and PB were now evaluated in a general population sample (N = 2508) and a clinical adult population sample (N = 185) using IRT, specifically the Rasch model (RM) and the graphical log-linear Rasch model (GLLRM). Of special interest was whether the INQ-subscales displayed differential item functioning (DIF) across the two different samples and how well the subscales were targeted to the two sample populations. For the clinical sample, fit to a GLLRM could be established for the PB-subscale and fit to a RM was established for a five-item version of the TB-subscale. In contrast, for the general population sample fit to a GLLRM could only be achieved for the PB-subscale. Overall, there was strong evidence of local dependence (LD) across items and of some age- and gender-related DIF. Both subscales exhibited massive DIF related to the sample, indicating that they don't work the same across the general population and clinical sample. As expected, targeting of both INQ-subscales was much better for the clinical population. Further investigations of the INQ-15 under the Rasch approach in a large clinical population are recommended to determine and optimize the scale performance.
Project description:BACKGROUND: To overcome the drawback of individual item-by-item box plots of disclosure for patient views on healthcare service quality, we propose to inspect interrelationships among items that measure a common entity. A visual diagram on the Internet is developed to provide thorough information for hospitals. METHODS: We used the Rasch rating scale model to analyze the 2003 English inpatient questionnaire data regarding patient satisfactory perception, which were collected from 169 hospitals, examined model-data fit, and developed a KIDMAP diagram on the Internet depicting the satisfaction level of each hospital and investigating aberrant responses with Z-scores and MNSQ statistics for individual hospitals. Differential item functioning (DIF) analysis was conducted to verify construct equivalence across types of hospitals. RESULTS: 18 of the 45 items fit to the model's expectations, indicating they jointly defined a common construct and an equal-interval logit scale was achieved. The most difficult aspect for hospitals to earn inpatients' satisfaction were item 29 (staff told you about any medication side effects to watch when going home). No DIF in the 18-item questionnaire was found between types of hospitals, indicating the questionnaire measured the same construct across hospitals. Different types of hospitals obtained different levels of satisfaction. The KIDMAP on the Internet provided more interpretable and visualized message than traditional item-by-item box plots of disclosure. CONCLUSION: After removing misfit items, we find that the 18-item questionnaire measures the same construct across types of hospitals. The KIDMAP on the Internet provides an exemplary comparison in quality of healthcare. Rasch analysis allows intra- and inter-hospital performances to be compared easily and reliably with each other on the Internet.
Project description:To evaluate the psychometric properties of the Adult Strabismus-20 (AS-20)- a health-related quality of life (HRQoL) questionnaire in adults with strabismus, and if flawed, to revise the AS-20 and its subscales creating valid measurement scales.584 adults (meanage, 27.5 years) with strabismus were recruited from an outpatient clinic at a South Indian tertiary eye care centre and were administered the AS-20 questionnaire.The AS-20 was translated and back translated into two Indian languages. The AS-20 and its two 10-item subscales - 'psychosocial' and 'function'were assessed separately for fit to the Rasch model, including an assessment of the rating scale, unidimensionality (by principal components analysis), measurement precision by person separation reliability, PSR, targeting, and differential item functioning (DIF; notable > 1.0 logits).Response categories were not used as intended, thereby, required re-organization and reducing their number from 5 to 3. The AS-20 had adequate measurement precision (PSR = 0.87) but lacked unidimensionality; however, deletion of the six multi-dimensionality causing items and an additional three misfitting items resulted in 11-item unidimensional questionnaire (AS-11). Two items failed to satisfy the model expectations in the 'psychosocial' subscale and were deleted - resulting in an 8-item unidimensional scale with adequate PSR (0.81) and targeting (0.23 logits). One item misfit in the 'function' subscale and was deleted-resulting in a 9 item Rasch-revised unidimensional subscale with acceptable PSR (0.80) and targeting (0.97 logits).None of the items displayed notable DIF by age, gender and level of education.The AS-11 and its two Rasch-revised subscales - 8-item psychosocial and 9-item function subscale may be more appropriate than the original AS-20 and its two 10-item subscales for use as unidimensional measures of HRQoL in adults with strabismus in India. Further work is required to establish the validity of the revised rating scale.