Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
De Almeida Maia, Denise; Pohl, Steffi; Okuda, Paola Matiko Martins; Liu, Ting; Puglisi, Marina Leite; Ploubidis, George; Eid, Michael; Cogo-Moreira, Hugo – Educational Assessment, Evaluation and Accountability, 2022
The Bracken School Readiness Assessment (BSRA) has been used in large studies such as the Millennium Cohort Study (MCS). Important conclusions might be drawn regarding its reliability for the prediction of children's school readiness by taking advantage of such large-scale evaluation. Although the BSRA has been widely used, few are the studies at…
Descriptors: School Readiness, Screening Tests, Psychometrics, Preschool Children
Kumar, Bimal Aklesh; Sharma, Bibhya; Nakagawa, Elisa Yumi – Education and Information Technologies, 2022
Context-aware mobile learning applications provide learning materials to suit the needs of individual learners. Although several applications have been developed, there is a lack of architectural support for developing these applications. This has resulted in a number of challenges: lack of standardization, poor quality of developed applications, and…
Descriptors: Computer Software, Telecommunications, Handheld Devices, Standards
Chen, Qiongqiong – International Education Studies, 2022
Predictive research on the enrollment proportion of general education and vocational education is crucial to optimizing the regional talent structure and industrial structure adjustment. The reasonable enrollment proportion of general education and vocational education also plays an important role in the adjustment of the overall employment…
Descriptors: Prediction, Enrollment, General Education, Vocational Education
Krieglstein, Felix; Beege, Maik; Rey, Günter Daniel; Ginns, Paul; Krell, Moritz; Schneider, Sascha – Educational Psychology Review, 2022
For more than three decades, cognitive load theory has been addressing learning from a cognitive perspective. Based on this instructional theory, design recommendations and principles have been derived to manage the load on working memory while learning. The increasing attention paid to cognitive load theory in educational science quickly…
Descriptors: Cognitive Processes, Difficulty Level, Learning Theories, Test Reliability
Byers-Heinlein, Krista; Bergmann, Christina; Savalei, Victoria – Infant and Child Development, 2022
Infant research is often underpowered, undermining the robustness and replicability of our findings. Improving the reliability of infant studies offers a solution for increasing statistical power independent of sample size. Here, we discuss two senses of the term reliability in the context of infant research: reliable (large) effects and reliable…
Descriptors: Infants, Research, Reliability, Effect Size
Holcomb, T. Scott; Lambert, Richard; Bottoms, Bryndle L. – Journal of Educational Supervision, 2022
In this study, various statistical indexes of agreement were calculated using empirical data from a group of evaluators (n = 45) of early childhood teachers. The group of evaluators rated ten fictitious teacher profiles using the North Carolina Teacher Evaluation Process (NCTEP) rubric. The exact and adjacent agreement percentages were calculated…
Descriptors: Interrater Reliability, Teacher Evaluation, Statistical Analysis, Early Childhood Teachers
Karakus, Sena; Akbay, Sinem Evin; Uzun, Nezaket Bilge – Journal on Educational Psychology, 2022
The aim of the present research is to develop a psychometrically qualified measurement tool to find out the emotional authenticity levels of individuals. Taking roots from the rational approach, 53 items were written by the researchers in line with the relevant literature review and the opinions of the experts, and the expert opinion form prepared…
Descriptors: Test Construction, Psychological Patterns, Test Validity, Test Reliability
Steven D. Caminiti – ProQuest LLC, 2022
Research has focused on the various reasons why high school principals leave their positions, yet minimal research has been done on the reasons why they stay. The problem of inconsistency of high school principals' tenure within the first 4 years of service was addressed in this study. Deci and Ryan's self-determination theory was used in this…
Descriptors: High Schools, Principals, Administrator Attitudes, Tenure
Jaburek, Michal; Tápal, Adam; Portešová, Šárka; Pfeiffer, Steven I. – Journal of Psychoeducational Assessment, 2021
The factor structure, the concurrent validity, and test-retest reliability of the Czech translation of the Gifted Rating Scales-School Form [GRS-S; Pfeiffer, S. I., & Jarosewich, T. (2003). "GRS (gifted rating scales) - manual." Pearson] were evaluated. Ten alternative models were tested. Four models were found to exhibit acceptable…
Descriptors: Test Validity, Test Reliability, Gifted, Foreign Countries
VanDerHeyden, Amanda M.; Broussard, Carmen – Assessment for Effective Intervention, 2021
This study details the construction of parameters for generating subskill mastery math measures to be used for screening, intervention planning, progress monitoring, and proximal program evaluation. Parameters for generating assessment measures were built and tested to verify initial equivalence of generated measures using potential digits correct…
Descriptors: Mastery Tests, Mathematics Tests, Test Construction, Kindergarten
Pilditch, Toby D.; Lagator, Sandra; Lagnado, David – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2021
How do we deal with unlikely witness testimonies? Whether in legal or everyday reasoning, corroborative evidence is generally considered a strong marker of support for the reported hypothesis. However, questions remain regarding how the prior probability, or base rate, of that hypothesis interacts with corroboration. Using a Bayesian network…
Descriptors: Evidence, Reliability, Logical Thinking, Probability
Pérez-Castilla, Alejandro; García-Ramos, Amador – Measurement in Physical Education and Exercise Science, 2021
An a-posteriori multicentre reliability study was conducted to compare the reliability and magnitude of the maximum power (P_max) and optimal velocity (V_opt) between the force-power-velocity relationships during the leg cycle-ergometer and bench press throw exercises. The force-power-velocity relationships were determined in…
Descriptors: Motion, Exercise, Measurement Techniques, Reliability
Brittany N. Zakszeski; Heather E. Ormiston; Malena A. Nygaard; Kane Carlock – School Psychology Review, 2025
Despite the widespread use of school-based universal screening systems for social, emotional, and behavioral risk, limited research has examined discrepancies in ratings provided by teachers and their secondary students. Using the Social, Academic, and Emotional Behavior Risk Screener (SAEBRS; teacher report) and mySAEBRS (student report) scores…
Descriptors: Middle School Students, Middle School Teachers, Screening Tests, Affective Behavior
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
Anirudhan Badrinath; Zachary Pardos – Journal of Educational Data Mining, 2025
Bayesian Knowledge Tracing (BKT) is a well-established model for formative assessment, with optimization typically using expectation maximization, conjugate gradient descent, or brute force search. However, one of the flaws of existing optimization techniques for BKT models is convergence to undesirable local minima that negatively impact…
Descriptors: Bayesian Statistics, Intelligent Tutoring Systems, Problem Solving, Audience Response Systems