Project description:IntroductionSince the advent of medical education systems, managing high-stakes exams has been a top priority and challenge for all policymakers. However, considering machine learning (ML) techniques as a replacement for medical licensing examinations, particularly during crises such as the COVID-19 outbreak, could be an effective solution. This study uses ML models to develop a framework for predicting medical students' performance on high-stakes exams, such as the Comprehensive Medical Basic Sciences Examination (CMBSE).Material and methodsPrediction of students' status and score on high-stakes examinations faces several challenges, including an imbalanced number of failing and passing students, a large number of heterogeneous and complex features, and the need to identify at-risk and top-performing students. In this study, two major categories of ML approaches are compared: first, classic models (logistic regression (LR), support vector machine (SVM), and k-nearest neighbors (KNN)), and second, ensemble models (voting, bagging (BG), random forests (RF), adaptive boosting (ADA), extreme gradient boosting (XGB), and stacking).ResultsTo evaluate the models' discrimination ability, they are assessed using a real dataset containing information on medical students over a five-year period (n = 1005). The findings indicate that ensemble ML models demonstrate optimal performance in predicting CMBSE status (RF and stacking). Similarly, among the classic regressors, LR exhibited the highest root-mean-square deviation (RMSD) (0.134) and coefficient of determination (R2) (0.62), whereas the RF model had the highest RMSD (0.077) and R2 (0.80) overall. Furthermore, Anatomical Sciences, Biochemistry, Parasitology, and Entomology grade point average (GPA) and grades demonstrated the strongest positive correlation with the outcomes.ConclusionComparing classic and ensemble ML models revealed that ensemble models are superior to classic models. Therefore, the presented framework could be considered a suitable alternative for the CMBSE and other comparable medical licensing examinations.
Project description:High stakes examinations can have profound implications for how science is taught and learned. Limitations of school science such as the 'cookbook problem' can potentially be addressed if high stakes assessments target learning outcomes that are innovative. For example, less mindless procedural engagement and more thoughtful consideration of practical science can potentially improve science learning. In this paper, we investigate how practical work is represented in the assessment frameworks of several countries that demonstrate above average performance in the latest PISA science assessments. The main motivation is the need to understand if there are aspects of high stakes summative assessments in these countries that can provide insight into how best to structure national examinations. Assessment documents from a set of selected countries have been analysed qualitatively guided by questions such as 'what is the construct of practical science' and 'what is the format of assessment?' The examined jurisdictions used different approaches from traditional external pen-and-paper tests to internal teacher assessments that included different formats (e.g. laboratory report). Innovative approaches to the assessment of practical skills (e.g. PISA computer-based tasks) do not seem to be represented in these high-stakes assessments. Implications for innovative assessments for high-stakes purposes are discussed.
Project description:This research was conducted to determine trends in the career interests of high school students in Surabaya, East Java, Indonesia. The sample size was 981 consisting of 488 men and 493 women. The instrument used was a career interest scale that was compiled based on Holland's theory with six RIASEC domains (Realistic, Investigative, Artistic, Social, Enterprising, and Conventional). The study design uses non-experimental, data collection through questionnaires given directly. Data were analyzed descriptively without using an explicit theoretical model. The career fields that are in high demand by high school students are the conventional fields that reach 42.30%, while the less desirable areas are the investigative fields which are only 3.98%. There are differences in career interests between men and women. Men prefer more realistic, artistic and enterprising fields, while women prefer social and conventional fields.
Project description:IntroductionPredicting medical science students' performance on high-stakes examinations has received considerable attention. Machine learning (ML) models are well-known approaches to enhance the accuracy of determining the students' performance. Accordingly, we aim to provide a comprehensive framework and systematic review protocol for applying ML in predicting medical science students' performance on high-stakes examinations. Improving the current understanding of the input and output features, preprocessing methods, setting of ML models and required evaluation metrics seems essential.Methods and analysisA systematic review will be conducted by searching the electronic bibliographic databases of MEDLINE/PubMed, EMBASE, SCOPUS and Web of Science. The search will be limited to studies published from January 2013 to June 2023. Studies explicitly predicting student performance in high-stakes examinations and referencing their learning outcomes and use of ML models will be included. Two team members will first screen literature meeting the inclusion criteria at the title, abstract and full-text levels. Second, the Best Evidence Medical Education quality framework rates the included literature. Later, two team members will extract data, including the studies' general data and the ML approach's details. Finally, the information consensus will be reached and submitted for analysis. The synthesised evidence from this review provides helpful information for medical education policy-makers, stakeholders and other researchers in adopting the ML models to evaluate medical science students' performance in high-stakes exams.Ethics and disseminationThis systematic review protocol summarises findings of existing publications rather than primary data and does not require an ethics review. The results will be disseminated in publications of peer-reviewed journals.
Project description:BackgroundFailure rates in postgraduate examinations are often high and many candidates therefore retake examinations on several or even many times. Little, however, is known about how candidates perform across those multiple attempts. A key theoretical question to be resolved is whether candidates pass at a resit because they have got better, having acquired more knowledge or skills, or whether they have got lucky, chance helping them to get over the pass mark. In the UK, the issue of resits has become of particular interest since the General Medical Council issued a consultation and is considering limiting the number of attempts candidates may make at examinations.MethodsSince 1999 the examination for Membership of the Royal Colleges of Physicians of the United Kingdom (MRCP(UK)) has imposed no limit on the number of attempts candidates can make at its Part 1, Part 2 or PACES (Clinical) examination. The present study examined the performance of candidates on the examinations from 2002/2003 to 2010, during which time the examination structure has been stable. Data were available for 70,856 attempts at Part 1 by 39,335 candidates, 37,654 attempts at Part 2 by 23,637 candidates and 40,303 attempts at PACES by 21,270 candidates, with the maximum number of attempts being 26, 21 and 14, respectively. The results were analyzed using multilevel modelling, fitting negative exponential growth curves to individual candidate performance.ResultsThe number of candidates taking the assessment falls exponentially at each attempt. Performance improves across attempts, with evidence in the Part 1 examination that candidates are still improving up to the tenth attempt, with a similar improvement up to the fourth attempt in Part 2 and the sixth attempt at PACES. Random effects modelling shows that candidates begin at a starting level, with performance increasing by a smaller amount at each attempt, with evidence of a maximum, asymptotic level for candidates, and candidates showing variation in starting level, rate of improvement and maximum level. Modelling longitudinal performance across the three diets (sittings) shows that the starting level at Part 1 predicts starting level at both Part 2 and PACES, and the rate of improvement at Part 1 also predicts the starting level at Part 2 and PACES.ConclusionCandidates continue to show evidence of true improvement in performance up to at least the tenth attempt at MRCP(UK) Part 1, although there are individual differences in the starting level, the rate of improvement and the maximum level that can be achieved. Such findings provide little support for arguments that candidates should only be allowed a fixed number of attempts at an examination. However, unlimited numbers of attempts are also difficult to justify because of the inevitable and ever increasing role that luck must play with increasing numbers of resits, so that the issue of multiple attempts might be better addressed by tackling the difficult question of how a pass mark should increase with each attempt at an exam.
Project description:BackgroundIncorporating emerging knowledge into Emergency Medical Service (EMS) competency assessments is critical to reflect current evidence-based out-of-hospital care. However, a standardized approach is needed to incorporate new evidence into EMS competency assessments because of the rapid pace of knowledge generation.ObjectiveThe objective was to develop a framework to evaluate and integrate new source material into EMS competency assessments.MethodsThe National Registry of Emergency Medical Technicians (National Registry) and the Prehospital Guidelines Consortium (PGC) convened a panel of experts. A Delphi method, consisting of virtual meetings and electronic surveys, was used to develop a Table of Evidence matrix that defines sources of EMS evidence. In Round One, participants listed all potential sources of evidence available to inform EMS education. In Round Two, participants categorized these sources into: (a) levels of evidence quality; and (b) type of source material. In Round Three, the panel revised a proposed Table of Evidence. Finally, in Round Four, participants provided recommendations on how each source should be incorporated into competency assessments depending on type and quality. Descriptive statistics were calculated with qualitative analyses conducted by two independent reviewers and a third arbitrator.ResultsIn Round One, 24 sources of evidence were identified. In Round Two, these were classified into high- (n = 4), medium- (n = 15), and low-quality (n = 5) of evidence, followed by categorization by purpose into providing recommendations (n = 10), primary research (n = 7), and educational content (n = 7). In Round Three, the Table of Evidence was revised based on participant feedback. In Round Four, the panel developed a tiered system of evidence integration from immediate incorporation of high-quality sources to more stringent requirements for lower-quality sources.ConclusionThe Table of Evidence provides a framework for the rapid and standardized incorporation of new source material into EMS competency assessments. Future goals are to evaluate the application of the Table of Evidence framework in initial and continued competency assessments.
Project description:IntroductionSARS-CoV-2 may transmit across vaccinated cohorts during practical clinical examinations. We sought to assess the feasibility of facemask sampling (FMS) to identify individuals emitting SARS-CoV-2 during a mock PACES exam.MethodsIn May 2022 we recruited participants from a mock PACES examination in Leicester, UK. Following a negative lateral flow test assay, all participants wore modified facemasks able to capture exhaled virus during the assessment (FMS). A concomitant upper respiratory tract sample (URTS) was provided prior to FMS. Exposed facemasks were processed by removal and dissolution of sampling matrices fixed within the mask and cycle thresholds values quantified by RT-qPCR. Participants were asked to grade statements regarding the comfort, effort, ethics and communication when providing FMS; laboratory technicians were asked to grade key statements surrounding suitability of samples for processing.Results34 participants provided concomitant URTS and FMS during the examination. One participant was positive for SARS-CoV-2, with a cycle threshold value of 22.5 on URTS, but negative (no viral RNA detected) on FMS; no transmission to others was identified from this individual. Participants responded positively to statements regarding FMS describing all four domains; however, 69% of participants felt that a positive result from FMS alone was insufficient for diagnosis and that further tests were required. All but one FMS sample was suitable for processing.DiscussionFMS during PACES exams are acceptable among participants and samples provided are suitable for processing. Our results demonstrate feasibility of FMS within practical examination settings and support the further assessment of FMS as a scalable tool that can be compared with URTS to identify those who are infectious.
Project description:BackgroundAll sectors are affected due to COVID-19 pandemic occurring worldwide, including the education industry. School closure had been taking place for more than a year in Indonesia. Despite the controversies, Indonesian government had decided to begin school reopening.ObjectivesThis study aims to assess parental readiness for school reopening, and factors affecting parental attitude toward school reopening.MethodsA cross-sectional study using online questionnaire distributed via official Indonesian Pediatric Society (IPS) official social media account collected between March and April 2021. The questionnaire contained the general characteristics of study participants, parents' knowledge, and perspectives on COVID-19, and health protocols for school reopening.ResultsA total of 17,562 responses were collected, of which 55.7% parents were ready to send their children to school should school reopens. Factors significantly contribute to parental decision to keep their child at home were: presence of vulnerable population at home [OR = 1.18 (1.10-1.27), p < 0.001], children with comorbidities [OR = 2.56 (2.29-2.87), p < 0.001], perception of COVID-19 as a dangerous disease [OR = 28.87 (14.29-58.33), p < 0.001], experience with COVID-19 positive cases in the community [OR = 1.75 (1.61-1.90), p < 0.001], COVID-19 related death in the community [OR = 2.05 (1.90-2.21), P < 0.001], approval for adult COVID-19 vaccination [OR = 1.69 (1.53-1.87), p < 0.001], and ownership of private transportation [OR = 1.46 (1.30-1.66), p <0.001].ConclusionWe identified several factors affecting parental perception on school reopening during COVID-19 pandemic that should be addressed. This study can be used for policy-maker to make further recommendations and health educations prior to school reopening in Indonesia.
Project description:Using an incentivized experiment with statistical power, this paper explores the role of stakes in charitable giving of lottery prizes, where subjects commit to donate a fraction of the prize before they learn the outcome of the lottery. We study three stake levels: 5€ (n = 177), 100€ (n = 168), and 1,000€ (n = 171). Although the donations increase in absolute terms as the stakes increase, subjects decrease the donated fraction of the pie. However, people still share roughly 20% of 1,000€, an amount as high as the average monthly salary of people at the age of our subjects. The number of people sharing 50% of the pie is remarkably stable across stakes, but donating the the whole pie-the modal behavior in charity-donation experiments-disappears with stakes. Such hyper-altruistic behavior thus seems to be an artifact of the stakes typically employed in economic and psychological experiments. Our findings point out that sharing with others is a prevalent human feature, but stakes are an important determinant of sharing. Policies promoted via prosocial frames (e.g., stressing the effects of mask-wearing or social distancing on others during the Covid-19 pandemic or environmentally-friendly behaviors on future generations) may thus be miscalibrated if they disregard the stakes at play.