Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Kramer, Robin S. S.; Jones, Alex L.; Gous, Georgina – Applied Cognitive Psychology, 2021
Deciding whether two different face photographs or voice samples are from the same person represent fundamental challenges within applied settings. To date, most research has focussed on average performance in these tests, failing to consider individual differences and within-person consistency in responses. Here, participants completed the same…
Descriptors: Individual Differences, Accuracy, Reliability, Correlation
Hartstein, Bonnie; Yackel, Edward – Learning Organization, 2021
Purpose: This study aims to describe how the Army and the Army Medical Department matured as a learning organization (LO) during the period after the 2014 Military Health System Review through the incorporation of changes aimed at improving patient safety, data transparency and becoming a high-reliability organization (HRO). This study explores…
Descriptors: Armed Forces, Medical Services, Organizational Learning, Organizational Change
Lewis, Carly A.; Myers, Carl L. – Contemporary School Psychology, 2021
Behavior rating scales are frequently used to assess social-emotional behaviors of children. While broadband behavior rating scales often measure similarly named constructs, it is unclear how consistently different instruments measure those constructs. Head Start teachers completed the preschool versions of the Behavior Assessment System for…
Descriptors: Preschool Teachers, Interrater Reliability, Child Behavior, Behavior Rating Scales
Schüler, Anne; Merkt, Martin – Journal of Computer Assisted Learning, 2021
In two experiments, the multimedia contradiction paradigm was used to investigate whether learners map information conveyed through the audio and the picture track of a video. In Experiment 1 (N = 85), the information conveyed through the audio track and the picture track was always consistent (control group) or was made inconsistent by changing…
Descriptors: Video Technology, Cognitive Processes, Multimedia Materials, Eye Movements
McNulty, Richard J.; Floyd, Randy G. – Psychology in the Schools, 2021
This study examined the factor structure of the Detroit Tests of Learning Abilities, Fifth Edition (DTLA-5) using principal axis factoring, multiple factor extraction criteria, and the Schmid-Leiman orthogonalization procedures not utilized by test publishers. Results suggest that the publisher's six-factor structure model was over factored.…
Descriptors: Aptitude Tests, Cognitive Ability, Factor Structure, Factor Analysis
Watts, Field M.; Finkenstaedt-Quinn, Solaire A. – Chemistry Education Research and Practice, 2021
The tradition of qualitative research drives much of chemistry education research activity. When performing qualitative studies, researchers must demonstrate the trustworthiness of their analysis so researchers and practitioners consuming their work can understand if and how the presented research claims and conclusions might be transferable to…
Descriptors: Qualitative Research, Educational Research, Research Methodology, Chemistry
Belur, Jyoti; Tompson, Lisa; Thornton, Amy; Simon, Miranda – Sociological Methods & Research, 2021
A methodologically sound systematic review is characterized by transparency, replicability, and a clear inclusion criterion. However, little attention has been paid to reporting the details of interrater reliability (IRR) when multiple coders are used to make decisions at various points in the screening and data extraction stages of a study. Prior…
Descriptors: Interrater Reliability, Decision Making, Accuracy, Coding
Ryan, Joseph J.; Glass Umfleet, Laura; Gontkovsky, Samuel T. – Journal of Psychoeducational Assessment, 2021
This investigation provides internal consistency reliabilities for the Wechsler Memory Scale--Fourth Edition (WMS-IV) subtest and index discrepancy scores using the standardization samples of the Adult and Older Adult batteries. Subtest reliabilities ranged from 0.00 to 0.93 for Adults and 0.25 to 0.94 for Older Adults. Three of 91 Adult…
Descriptors: Cognitive Tests, Memory, Adults, Intelligence Tests
Burkhardt, Amy; Lottridge, Susan; Woolf, Sherri – Educational Measurement: Issues and Practice, 2021
For some students, standardized tests serve as a conduit to disclose sensitive issues of harm or distress that may otherwise go unreported. By detecting this writing, known as "crisis papers," testing programs have a unique opportunity to assist in mitigating the risk of harm to these students. The use of machine learning to…
Descriptors: Scoring Rubrics, Identification, At Risk Students, Standardized Tests
Solano-Flores, Guillermo – Educational Measurement: Issues and Practice, 2021
This article proposes a Boolean approach to representing and analyzing interobserver agreement in dichotomous coding. Building on the notion that observations are samples of a universe of observations, it submits that coding can be viewed as a process in which observers sample pieces of evidence on constructs. It distinguishes between formal and…
Descriptors: Online Searching, Coding, Interrater Reliability, Evidence
Kapsner-Smith, Mara R.; Opuszynski, Amanda; Stepp, Cara E.; Eadie, Tanya L. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The reliability of auditory-perceptual judgments between listeners is a long-standing problem in the assessment of voice disorders. The purpose of this study was to determine whether a relatively novel experimental scaling method, called visual sort and rate (VSR), yielded stronger reliability than the more frequently used method of…
Descriptors: Voice Disorders, Interrater Reliability, Rating Scales, Severity (of Disability)
Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
Limon, Ibrahim; Dilekçi, Ümit – Participatory Educational Research, 2021
The aim of this study is to develop a valid and reliable measurement tool that can be used to measure the level of school principals' micromanagement behavior. After a comprehensive literature review, a candidate item pool with 52 items was created. While writing the items, micro-manager behaviors defined in the literature were adapted to school…
Descriptors: Test Construction, Test Validity, Leadership Styles, Principals
Ge Zhang; Pengfei Chen; Si Xu – International Journal of Sustainability in Higher Education, 2025
Purpose: Given that the current sustainability assessment in higher education institutions primarily relies on qualitative methods with relatively limited quantitative tools, the purpose of this study is to design a tool that could be used to comprehensively assess the overall state of higher education institutions' sustainability.…
Descriptors: Test Construction, Test Validity, Colleges, Measures (Individuals)