Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles published between 2000 and 2020 that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Meechan, David; Williams-Brown, Zeta; Whatmore, Tracy; Halfhead, Simon – Education 3-13, 2024
The paper focuses on findings from research that investigated teachers' and key stakeholders' perspectives on the use of Reception Baseline Assessment. Data collection was carried out in 2021-2022, which was the year this assessment was introduced into Reception classes in England. In total, 70 teachers and key stakeholders from 47 Local…
Descriptors: Foreign Countries, Preschool Education, Preschool Teachers, Achievement Tests
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021
The evaluation of sign language proficiency needs to be based on measures with well-established psychometric properties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Język Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…
Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency
Rehfeld, David M.; Padgett, R. Noah – Journal of Psychoeducational Assessment, 2019
This article presents a review of the Comprehensive Assessment of Spoken Language--Second Edition (CASL-2), in which reliability, utility, and validity are analyzed and discussed. Some limited recommendations for practice are made based on a review of the information provided by the publisher for clinicians.
Descriptors: Oral Language, Language Tests, Receptive Language, Expressive Language
Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019
Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…
Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders
Olsen, Jacob; Preston, Angela I.; Algozzine, Bob; Algozzine, Kate; Cusumano, Dale – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2018
Although it is widely agreed that there is no universally accepted definition for school climate, most professionals ground it in shared beliefs, values, and attitudes reflecting the quality and character of life in schools. In this article, we review and analyze measures accessible to school personnel charged with documenting and monitoring…
Descriptors: Educational Environment, Measures (Individuals), School Personnel, Test Format
Al-Owidha, Amjed A. – Language Testing in Asia, 2018
Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…
Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension
Haider, Muhammad Qadeer – ProQuest LLC, 2019
Inquiry-oriented teaching is a specific form of active learning that is gaining popularity in teaching communities. The goal of inquiry-oriented classes is to help students gain a conceptual understanding of the material. My research focus is to gauge students' performance and conceptual understanding in inquiry-oriented linear algebra classes. This…
Descriptors: Mathematics Tests, Test Construction, Test Validity, Test Reliability
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
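As a rough illustration of the relationship described in the entry above, the classical test theory attenuation formula expresses an observed score correlation as the true score correlation scaled by the square root of the product of the two reliabilities. This is a sketch of that standard formula, not the authors' derivation: the function name, parameter names, and example values are hypothetical, and the formula includes the total-test reliability alongside the anchor reliability named in the abstract.

```python
import math


def observed_correlation(true_corr, rel_anchor, rel_total):
    """Classical attenuation formula (an illustration, not the paper's result).

    true_corr  -- correlation between the true scores of anchor and total test
    rel_anchor -- reliability of the anchor test, in (0, 1]
    rel_total  -- reliability of the total test, in (0, 1]
    """
    if not -1.0 <= true_corr <= 1.0:
        raise ValueError("true_corr must lie in [-1, 1]")
    for name, value in (("rel_anchor", rel_anchor), ("rel_total", rel_total)):
        if not 0.0 < value <= 1.0:
            raise ValueError(f"{name} must lie in (0, 1]")
    # Observed correlation shrinks as either reliability drops below 1.
    return true_corr * math.sqrt(rel_anchor * rel_total)


if __name__ == "__main__":
    # Hypothetical values chosen only to show the effect of anchor reliability.
    for rel_anchor in (0.64, 0.81, 1.0):
        r = observed_correlation(0.9, rel_anchor, 0.81)
        print(f"anchor reliability {rel_anchor:.2f} -> observed r = {r:.3f}")
```

With the true score correlation held fixed, lowering the anchor reliability lowers the observed anchor-to-total correlation, which is the dependence the abstract points to.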
Galeoto, Giovanni; D'Elpidio, Giuliana; Alvaro, Rosaria; Zicari, Anna Maria; Valente, Donatella; Riccio, Marianna – International Association for Development of the Information Society, 2021
The Italian Disciplinary section of Test of Competences (TECO-D) project is an important longitudinal study used to analyze learning outcomes of ungraded students and to measure quality of the educational process. The aim of the present study was to evaluate the psychometric properties of the TECO-D in students enrolled in the Bachelor's Degree in…
Descriptors: Case Studies, Nursing Education, Psychometrics, Longitudinal Studies
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Otoyo, Lucia; Bush, Martin – Practical Assessment, Research & Evaluation, 2018
This article presents the results of an empirical study of "subset selection" tests, which are a generalisation of traditional multiple-choice tests in which test takers are able to express partial knowledge. Similar previous studies have mostly been supportive of subset selection, but the deduction of marks for incorrect responses has…
Descriptors: Multiple Choice Tests, Grading, Test Reliability, Test Format
Flett, Gordon L.; Nepon, Taryn; Hewitt, Paul L.; Zaki-Azat, Justeena; Rose, Alison L.; Swiderski, Kristina – Journal of Psychoeducational Assessment, 2020
In the current article, we describe the development and validation of the Mistake Rumination Scale as a supplement to existing trait and cognitive measures of perfectionism. The Mistake Rumination Scale is a seven-item inventory that taps the tendency to ruminate about a past personal mistake. Psychometric analyses confirmed that the Mistake…
Descriptors: Personality Traits, Cognitive Processes, Test Construction, Cognitive Tests