Publication Date
In 2025 | 6 |
Since 2024 | 59 |
Since 2021 (last 5 years) | 268 |
Since 2016 (last 10 years) | 781 |
Since 2006 (last 20 years) | 1698 |
Descriptor
Scores | 2324 |
Test Reliability | 1083 |
Reliability | 1051 |
Test Validity | 596 |
Foreign Countries | 572 |
Correlation | 529 |
Validity | 456 |
Psychometrics | 436 |
Measures (Individuals) | 411 |
Factor Analysis | 392 |
Statistical Analysis | 329 |
More ▼ |
Source
Author
Thompson, Bruce | 21 |
Erford, Bradley T. | 13 |
Henson, Robin K. | 11 |
Zimmerman, Donald W. | 11 |
Haberman, Shelby J. | 10 |
Worrell, Frank C. | 10 |
Lee, Yong-Won | 9 |
Sinharay, Sandip | 9 |
Gill, Brian | 8 |
Petscher, Yaacov | 8 |
Wainer, Howard | 8 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 33 |
Practitioners | 21 |
Teachers | 9 |
Administrators | 4 |
Counselors | 2 |
Parents | 2 |
Policymakers | 2 |
Community | 1 |
Students | 1 |
Location
Turkey | 88 |
Canada | 42 |
China | 37 |
United States | 35 |
Australia | 31 |
Florida | 24 |
Netherlands | 24 |
California | 21 |
Spain | 21 |
United Kingdom | 21 |
United Kingdom (England) | 21 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 1 |
Adaptation of Scientific Reasoning Scale into Turkish and Examination of Its Psychometric Properties
Muslu Kaygisiz, Gülfem; Gürkan, Burcu; Akbas, Ufuk – Educational Sciences: Theory and Practice, 2018
In this study, it is aimed to adapt the Scientific Reasoning Scale (SRS) into Turkish. The translated form has been provided to the students enrolled at different levels together with a form in which they were requested to present what they have understood and the reason of their responses. It was seen that the explanations of students to one item…
Descriptors: Foreign Countries, Science Process Skills, Science Tests, Psychometrics
Beauvais, Lucie; Bouchafa, Houria; Beauvais, Caroline; Kleinsz, Nina; Magnan, Annie; Ecalle, Jean – Canadian Journal of School Psychology, 2018
The goal of the experiment was to examine the relevance of a new French web-based assessment, Tinfolec (Test INFOrmatisé d'évaluation de la LECture), the aim of which is to evaluate the reading abilities of children in primary grades. The participants were 1,016 children from Grades 2 to 5. They completed the five tasks of Tinfolec designed to…
Descriptors: Foreign Countries, Elementary School Students, Student Evaluation, Reading Tests
Matney, Gabriel; Jackson, Jack L., II.; Panarach, Yupadee – School Science and Mathematics, 2016
This article presents our work in translating the Mathematics Teaching Efficacy Beliefs Instrument (MTEBI) from English to Thai and our resulting investigation of validity with Thai preservice teachers. The translation process occurred over several meetings between two U.S. mathematics educators and one Thai mathematics educator. To check for…
Descriptors: Test Validity, Student Attitudes, Preservice Teachers, Teacher Effectiveness
Yang, Yan; Cox, Cody; Cho, YoonJung – Journal of Psychoeducational Assessment, 2020
Despite the critical role of emotions in multicultural teacher education, no attempt has been made to develop an instrument including affect as a dimension in measuring cultural competence for preservice teachers. To bridge this gap, the present three-study research used three distinct samples of 456 preservice teachers to develop and estimate the…
Descriptors: Cultural Awareness, Measures (Individuals), Student Attitudes, Preservice Teachers
Tosun, Cemal; Öztürk, Sakine – Teachers and Teaching: Theory and Practice, 2020
The present study aimed to develop science teaching competence belief scale (STCBS). Secondly, the study also investigated whether science teachers (STs) and pre-service science teachers' (PSTs) competence beliefs regarding science teaching in the resource room differed with respect to certain variables. Another purpose was to determine the…
Descriptors: Foreign Countries, Rating Scales, Measures (Individuals), Test Validity
Bunning, Karen; Alder, Ruth; Proudman, Lydia; Wyborn, Harriet – British Journal of Learning Disabilities, 2017
Background: Capturing the views of people with learning disabilities is not straightforward. Talking Mats® has been used successfully to solicit the views of such individuals. The aim was to co-produce an interview schedule using Talking Mats® on the subject of television-viewing habits and preferences of adults and young people with learning…
Descriptors: Learning Disabilities, Television Viewing, Adults, Youth
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Gallagher, Chris W. – Composition Studies, 2014
This article offers a survey of three reliability theories in writing assessment: positivist, hermeneutic, and rhetorical. Drawing on an interdisciplinary investigation of the notion of "witnessing," this survey emphasizes the kinds of readers and readings each theory of reliability produces and the epistemological grounds on which it…
Descriptors: Writing Evaluation, Reliability, Rhetoric, Reader Text Relationship
van der Palm, Daniël W.; van der Ark, L. Andries; Sijtsma, Klaas – Journal of Educational Measurement, 2014
The latent class reliability coefficient (LCRC) is improved by using the divisive latent class model instead of the unrestricted latent class model. This results in the divisive latent class reliability coefficient (DLCRC), which unlike LCRC avoids making subjective decisions about the best solution and thus avoids judgment error. A computational…
Descriptors: Test Reliability, Scores, Computation, Simulation
Tracy, Allison; Charmaraman, Linda; Ceder, Ineke; Richer, Amanda; Surr, Wendy – Afterschool Matters, 2016
Out-of-school time (OST) youth programs are inherently difficult to assess. They are often very dynamic: Many youth interact with one another and with staff members in various physical environments. Despite the challenge, measuring quality is critical to help program directors and policy makers identify where to improve and how to support those…
Descriptors: After School Programs, Program Evaluation, Educational Quality, Youth Programs
Allen, Jeff M.; Mattern, Krista – ACT, Inc., 2019
States and districts have expressed interest in administering the ACT® to 10th-grade students. Given that the ACT was designed to be administered in the spring of 11th grade or fall of 12th grade, the appropriateness of this use should be evaluated. As such, the focus of this paper is to summarize empirical evidence evaluating the use of the ACT…
Descriptors: Test Validity, College Entrance Examinations, High School Students, Grade 10
Russo-Ponsaran, Nicole M.; Lerner, Matthew D.; McKown, Clark; Weber, Rebecca J.; Karls, Ashley; Kang, Erin; Sommer, Samantha L. – Grantee Submission, 2019
Few tools are available to comprehensively describe the unique social-emotional skill profiles of youth with autism spectrum disorder (ASD). The present study describes the usability, reliability, and validity of SELweb, a normed, web-based assessment designed to measure four core social-emotional domains, when used to measure these skills in a…
Descriptors: Social Development, Emotional Development, Skill Development, Autism
Starling, A. Leyf Peirce; Lo, Ya-Yu; Rivera, Christopher J. – Journal of the American Academy of Special Education Professionals, 2015
This study evaluated the differential effects of three different science teaching methods, namely engineering teaching kit (ETK), explicit instruction (EI), and a combination of the two methods (ETK+EI), in two sixth-grade science classrooms. Twelve students with learning disabilities (LD) and/or attention deficit hyperactivity disorder (ADHD)…
Descriptors: Science Education, Middle School Students, Problem Solving, Tests
Monroe, Scott; Cai, Li – Educational Measurement: Issues and Practice, 2015
Student growth percentiles (SGPs, Betebenner, 2009) are used to locate a student's current score in a conditional distribution based on the student's past scores. Currently, following Betebenner (2009), quantile regression (QR) is most often used operationally to estimate the SGPs. Alternatively, multidimensional item response theory (MIRT) may…
Descriptors: Item Response Theory, Reliability, Growth Models, Computation
Livingston, Samuel A.; Chen, Haiwen H. – ETS Research Report Series, 2015
Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
Descriptors: Scores, Statistical Distributions, Research Reports, Equated Scores