Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022
We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…
Descriptors: Science Tests, Test Validity, Test Items, Test Construction
He, Qingping; Meadows, Michelle; Black, Beth – Research Papers in Education, 2022
A potential negative consequence of high-stakes testing is inappropriate test behaviour involving individuals and/or institutions. Inappropriate test behaviour and test collusion can result in aberrant response patterns and anomalous test scores and invalidate the intended interpretation and use of test results. A variety of statistical techniques…
Descriptors: Statistical Analysis, High Stakes Tests, Scores, Response Style (Tests)
Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022
A feature of the presented study is a comprehensive approach to the problem of the reliability of linguistic testing results under the impact of several functional and variable factors. Contradictory and ambiguous views among scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…
Descriptors: Student Evaluation, Language Tests, Test Format, Test Items
Designing Computer-Based Tests: Design Guidelines from Multimedia Learning Studied with Eye Tracking
Dirkx, K. J. H.; Skuballa, I.; Manastirean-Zijlstra, C. S.; Jarodzka, H. – Instructional Science: An International Journal of the Learning Sciences, 2021
The use of computer-based tests (CBTs), for both formative and summative purposes, has greatly increased over the past years. One major advantage of CBTs is the easy integration of multimedia. It is unclear, though, how to design such CBT environments with multimedia. The purpose of the current study was to examine whether guidelines for designing…
Descriptors: Test Construction, Computer Assisted Testing, Multimedia Instruction, Eye Movements
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
Jerrim, John – Assessment in Education: Principles, Policy & Practice, 2016
The Programme for International Student Assessment (PISA) is an important cross-national study of 15-year-olds' academic achievement. Although it has traditionally been conducted using paper-and-pencil tests, the vast majority of countries will use computer-based assessment from 2015. In this paper, we consider how cross-country comparisons of children's…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Warner, Zachary B. – ProQuest LLC, 2013
This study compared an expert-based cognitive model of domain mastery with student-based cognitive models of task performance for Integrated Algebra. Interpretations of student test results are limited by experts' hypotheses of how students interact with the items. In reality, the cognitive processes that students use to solve each item may be…
Descriptors: Comparative Analysis, Algebra, Test Results, Measurement
Davies, Alan – Language Testing, 2010
This article presents the author's response to Xiaoming Xi's paper titled "How do we go about investigating test fairness?" In the paper, Xi offers "a means to fully integrate fairness investigations and practice". Given the current importance accorded to fairness in the language testing community, Xi makes a case for viewing fairness as an aspect…
Descriptors: Investigations, Testing, Language Tests, Validity
Hooker, Giles; Finkelman, Matthew – Psychometrika, 2010
Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…
Descriptors: Models, Scores, Item Response Theory, Psychometrics
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Wood, Timothy J. – Advances in Health Sciences Education, 2009
Reusing questions on an examination is a concern because test administrators do not want to unfairly aid examinees by exposing them to questions they have seen on previous examinations. The purpose of this study was to investigate the effect that prior exposure of questions has on the performance of repeat examinees. Two recent administrations of…
Descriptors: Item Response Theory, Multiple Choice Tests, Memory, Test Results
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage with a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Chen, Li-Ju; Ho, Rong-Guey; Yen, Yung-Chin – Educational Technology & Society, 2010
This study aimed to explore the effects of marking and metacognition-evaluated feedback (MEF) in computer-based testing (CBT) on student performance and review behavior. Marking is a strategy in which students place a question mark next to a test item to indicate an uncertain answer. The MEF provided students with feedback on test results…
Descriptors: Feedback (Response), Test Results, Test Items, Testing
Moses, Tim; Yang, Wen-Ling; Wilson, Christine – Journal of Educational Measurement, 2007
This study explored the use of kernel equating for integrating and extending two procedures proposed for assessing item order effects in test forms that have been administered to randomly equivalent groups. When these procedures are used together, they can provide complementary information about the extent to which item order effects impact test…
Descriptors: Advanced Placement, Equated Scores, Test Items, Item Analysis
Wilhelm, Jennifer – International Journal of Science Education, 2009
This paper reports an examination of gender differences in the understanding of lunar phases among 123 students (70 females and 53 males). Middle-level students interacted with the Moon through observations, sketching, journalling, two-dimensional and three-dimensional modelling, and classroom discussions. These lunar lessons were adapted from the Realistic…
Descriptors: Test Results, Test Items, Females, Astronomy