Showing 1 to 15 of 252 results
Peer reviewed
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles published between 2000 and 2020 that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Peer reviewed
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Peer reviewed
David Meechan; Zeta Williams-Brown; Tracy Whatmore; Simon Halfhead – Education 3-13, 2024
The paper focuses on findings from research that investigated teachers' and key stakeholders' perspectives on the use of Reception Baseline Assessment. Data collection was carried out in 2021-2022, which was the year this assessment was introduced into Reception classes in England. In total, 70 teachers and key stakeholders from 47 Local…
Descriptors: Foreign Countries, Preschool Education, Preschool Teachers, Achievement Tests
Peer reviewed
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Peer reviewed
Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022
Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…
Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning
Peer reviewed
Grisham, Jennifer; Waddell, Misti; Crawford, Rebecca; Toland, Michael – Journal of Early Intervention, 2021
The purpose of this article is to provide evidence of the technical adequacy of the Assessment, Evaluation, and Programming System--Third Edition (AEPS-3). The AEPS has long been identified as one of the most psychometrically sound early childhood curriculum-based assessments. In this article, results of three studies of technical adequacy are…
Descriptors: Infants, Young Children, Curriculum Based Assessment, Psychometrics
Peer reviewed
Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021
The evaluation of sign language proficiency needs to be based on measures with well-established psychometric properties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Język Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…
Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency
Peer reviewed
Rehfeld, David M.; Padgett, R. Noah – Journal of Psychoeducational Assessment, 2019
This article presents a review of the Comprehensive Assessment of Spoken Language--Second Edition (CASL-2), in which reliability, utility, and validity are analyzed and discussed. Some limited recommendations for practice are made based on a review of the information provided by the publisher for clinicians.
Descriptors: Oral Language, Language Tests, Receptive Language, Expressive Language
Peer reviewed
Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019
Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…
Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders
Peer reviewed
Olsen, Jacob; Preston, Angela I.; Algozzine, Bob; Algozzine, Kate; Cusumano, Dale – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2018
Although it is widely agreed that there is no universally accepted definition for school climate, most professionals ground it in shared beliefs, values, and attitudes reflecting the quality and character of life in schools. In this article, we review and analyze measures accessible to school personnel charged with documenting and monitoring…
Descriptors: Educational Environment, Measures (Individuals), School Personnel, Test Format
Peer reviewed
Al-Owidha, Amjed A. – Language Testing in Asia, 2018
Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…
Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension
Peer reviewed
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
Peer reviewed
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Peer reviewed
Otoyo, Lucia; Bush, Martin – Practical Assessment, Research & Evaluation, 2018
This article presents the results of an empirical study of "subset selection" tests, which are a generalisation of traditional multiple-choice tests in which test takers are able to express partial knowledge. Similar previous studies have mostly been supportive of subset selection, but the deduction of marks for incorrect responses has…
Descriptors: Multiple Choice Tests, Grading, Test Reliability, Test Format
Peer reviewed
Flett, Gordon L.; Nepon, Taryn; Hewitt, Paul L.; Zaki-Azat, Justeena; Rose, Alison L.; Swiderski, Kristina – Journal of Psychoeducational Assessment, 2020
In the current article, we describe the development and validation of the Mistake Rumination Scale as a supplement to existing trait and cognitive measures of perfectionism. The Mistake Rumination Scale is a seven-item inventory that taps the tendency to ruminate about a past personal mistake. Psychometric analyses confirmed that the Mistake…
Descriptors: Personality Traits, Cognitive Processes, Test Construction, Cognitive Tests