Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 30 |
Since 2006 (last 20 years) | 54 |
Descriptor
Test Use | 54 |
Test Reliability | 42 |
Test Validity | 32 |
Test Construction | 16 |
Language Tests | 13 |
Scores | 13 |
Psychometrics | 12 |
Scoring | 10 |
Reliability | 9 |
Student Evaluation | 9 |
Evaluation Methods | 7 |
Author
Al-Owidha, Amjed A. | 1 |
Alatli, Betül | 1 |
Algozzine, Bob | 1 |
Algozzine, Kate | 1 |
Allen, Jeff M. | 1 |
Alvaro, Rosaria | 1 |
Attali, Yigal | 1 |
Avloniti, A. | 1 |
Barrow, Lloyd | 1 |
Basset, Katherine | 1 |
Bennett, Jessica G. | 1 |
Education Level
Higher Education | 10 |
Postsecondary Education | 10 |
Elementary Education | 9 |
Early Childhood Education | 7 |
Elementary Secondary Education | 7 |
Secondary Education | 5 |
Grade 3 | 4 |
Grade 4 | 4 |
Grade 5 | 4 |
Grade 6 | 4 |
Grade 7 | 4 |
Location
New York | 5 |
Oregon | 2 |
Colorado | 1 |
District of Columbia | 1 |
Georgia | 1 |
Germany | 1 |
Greece | 1 |
Idaho | 1 |
Illinois | 1 |
Italy (Rome) | 1 |
Kansas | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study reviewed the use of tests. To that end, it examined 45 articles published between 2000 and 2020 that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Meechan, David; Williams-Brown, Zeta; Whatmore, Tracy; Halfhead, Simon – Education 3-13, 2024
The paper focuses on findings from research that investigated teachers' and key stakeholders' perspectives on the use of Reception Baseline Assessment. Data collection was carried out in 2021-2022, which was the year this assessment was introduced into Reception classes in England. In total, 70 teachers and key stakeholders from 47 Local…
Descriptors: Foreign Countries, Preschool Education, Preschool Teachers, Achievement Tests
National Center on Improving Literacy, 2022
There are many available screeners for reading and other education or social-emotional outcomes. This brief outlines important things to consider when choosing and using a screener.
Descriptors: Screening Tests, Literacy, Social Emotional Learning, Decision Making
Cardwell, Ramsey Lee – ProQuest LLC, 2022
The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…
Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022
Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…
Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning
Grisham, Jennifer; Waddell, Misti; Crawford, Rebecca; Toland, Michael – Journal of Early Intervention, 2021
The purpose of this article is to provide evidence of the technical adequacy of the Assessment, Evaluation, and Programming System--Third Edition (AEPS-3). The AEPS has long been identified as one of the most psychometrically sound early childhood curriculum-based assessments. In this article, results of three studies of technical adequacy are…
Descriptors: Infants, Young Children, Curriculum Based Assessment, Psychometrics
Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021
The evaluation of sign language proficiency needs to be based on measures with well-established psychometric properties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Język Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…
Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency
Rehfeld, David M.; Padgett, R. Noah – Journal of Psychoeducational Assessment, 2019
This article presents a review of the Comprehensive Assessment of Spoken Language--Second Edition (CASL-2), in which reliability, utility, and validity are analyzed and discussed. Some limited recommendations for practice are made based on a review of the information provided by the publisher for clinicians.
Descriptors: Oral Language, Language Tests, Receptive Language, Expressive Language
Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019
Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…
Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders
Olsen, Jacob; Preston, Angela I.; Algozzine, Bob; Algozzine, Kate; Cusumano, Dale – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2018
Although it is widely agreed that there is no universally accepted definition for school climate, most professionals ground it in shared beliefs, values, and attitudes reflecting the quality and character of life in schools. In this article, we review and analyze measures accessible to school personnel charged with documenting and monitoring…
Descriptors: Educational Environment, Measures (Individuals), School Personnel, Test Format
Al-Owidha, Amjed A. – Language Testing in Asia, 2018
Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…
Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension
Haider, Muhammad Qadeer – ProQuest LLC, 2019
Inquiry-oriented teaching is a specific form of active learning that is gaining popularity in teaching communities. The goal of inquiry-oriented classes is to help students gain a conceptual understanding of the material. My research focus is to gauge students' performance and conceptual understanding in inquiry-oriented linear algebra classes. This…
Descriptors: Mathematics Tests, Test Construction, Test Validity, Test Reliability
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability