ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	35

Descriptor

Test Items	35
Item Response Theory	32
Grade 7	31
Difficulty Level	22
Grade 6	17
Grade 5	15
Mathematics Tests	15
Test Construction	14
Grade 8	12
Grade 3	11
Middle School Students	10
Test Validity	10
Grade 4	9
Item Analysis	8
Foreign Countries	7
Test Reliability	7
Achievement Tests	6
Computer Assisted Testing	6
Grade 9	6
Multiple Choice Tests	6
Pilot Projects	6
Reading Comprehension	6
Reading Tests	6
Test Theory	6
Achievement Gains	5
More ▼

Publication Type

Reports - Research	22
Journal Articles	18
Numerical/Quantitative Data	12
Reports - Descriptive	5
Reports - Evaluative	5
Dissertations/Theses -…	3

Education Level

Grade 7	35
Middle Schools	27
Elementary Education	26
Junior High Schools	26
Secondary Education	25
Grade 6	19
Grade 5	18
Grade 3	15
Grade 8	14
Intermediate Grades	13
Grade 4	12
Early Childhood Education	6
Primary Education	6
Elementary Secondary Education	5
Grade 9	5
High Schools	5
Grade 1	1
Grade 2	1
More ▼

Audience

Location

Oregon	4
Idaho	3
Massachusetts	3
Arkansas	2
California	2
Illinois	2
Missouri	2
Turkey (Ankara)	2
Washington	2
Alabama	1
Arizona	1
Colorado	1
Connecticut	1
District of Columbia	1
Georgia	1
Germany	1
Indiana	1
Iowa	1
Kentucky	1
Maryland	1
Mississippi	1
Nevada	1
New Jersey	1
New Mexico	1
Ohio	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…

Assessments and Surveys

Trends in International…	2
Wisconsin Knowledge and…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Comparison of G and Phi Coefficients Estimated in Generalizability Theory with Real Cases

Peer reviewed
PDF on ERIC

Download full text

Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021

This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…

Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability

A Factor Mixture Model for Item Responses and Certainty of Response Indices to Identify Student Knowledge Profiles

Peer reviewed

Direct link

Chen, Chia-Wen; Andersson, Björn; Zhu, Jinxin – Journal of Educational Measurement, 2023

The certainty of response index (CRI) measures respondents' confidence level when answering an item. In conjunction with the answers to the items, previous studies have used descriptive statistics and arbitrary thresholds to identify student knowledge profiles with the CRIs. Whereas this approach overlooked the measurement error of the observed…

Descriptors: Item Response Theory, Factor Analysis, Psychometrics, Test Items

Examining Differential Item Functions of Different Item Ordered Test Forms According to Item Difficulty Levels

Peer reviewed
PDF on ERIC

Download full text

Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem – Educational Sciences: Theory and Practice, 2016

The study aims to examine whether differential item function is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…

Descriptors: Test Bias, Test Items, Difficulty Level, Test Theory

Item Characteristic Curve Asymmetry: A Better Way to Accommodate Slips and Guesses than a Four-Parameter Model?

Peer reviewed

Direct link

Liao, Xiangyi; Bolt, Daniel M. – Journal of Educational and Behavioral Statistics, 2021

Four-parameter models have received increasing psychometric attention in recent years, as a reduced upper asymptote for item characteristic curves can be appealing for measurement applications such as adaptive testing and person-fit assessment. However, applications can be challenging due to the large number of parameters in the model. In this…

Descriptors: Test Items, Models, Mathematics Tests, Item Response Theory

The Development and Validation of the Middle School-Life Science Concept Inventory (MS-LSCI) Using Rasch Analysis

Direct link

Stammen, Andria – ProQuest LLC, 2018

The aim of this research is to develop a measurement instrument that is valid and reliable, called the Middle School-Life Science Concept Inventory (MS-LSCI), for the purpose of measuring the life science conceptual understanding of middle school-level students. Although there are several existing concept inventories related to biology concepts…

Descriptors: Science Tests, Biological Sciences, Middle School Students, Scientific Concepts

Considerations for Using Mathematical Learning Progressions to Design Diagnostic Assessments

Peer reviewed

Direct link

Ketterlin-Geller, Leanne R.; Shivraj, Pooja; Basaraba, Deni; Yovanoff, Paul – Measurement: Interdisciplinary Research and Perspectives, 2019

Diagnostic assessments are intended to support teachers' instructional decision making by providing instructionally relevant information. In this article, we propose that using cognitive theories of learning to design diagnostic assessments can provide teachers with two diagnostic outcomes: (a) the location of a student's thinking within the…

Descriptors: Diagnostic Tests, Learning Theories, Test Construction, Learning Processes

Application of the IRT and TRT Models to a Reading Comprehension Test

Direct link

Kim, Weon H. – ProQuest LLC, 2017

The purpose of the present study is to apply the item response theory (IRT) and testlet response theory (TRT) models to a reading comprehension test. This study applied the TRT models and the traditional IRT model to a seventh-grade reading comprehension test (n = 8,815) with eight testlets. These three models were compared to determine the best…

Descriptors: Item Response Theory, Test Items, Correlation, Reading Tests

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Download full text

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Direct link

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

Mode Comparability Study Based on Spring 2015 Operational Test Data

Download full text

Liu, Junhui; Brown, Terran; Chen, Jianshen; Ali, Usama; Hou, Likun; Costanzo, Kate – Partnership for Assessment of Readiness for College and Careers, 2016

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium working to develop next-generation assessments that more accurately, compared to previous assessments, measure student progress toward college and career readiness. The PARCC assessments include both English Language Arts/Literacy (ELA/L) and…

Descriptors: Testing, Achievement Tests, Test Items, Test Bias

Construct Definition Using Cognitively Based Evidence: A Framework for Practice

Peer reviewed

Direct link

Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Jung, EunJu; Liu, Kimy; Geller, Josh – Educational Assessment, 2013

In this article, we highlight the need for a precisely defined construct in score-based validation and discuss the contribution of cognitive theories to accurately and comprehensively defining the construct. We propose a framework for integrating cognitively based theoretical and empirical evidence to specify and evaluate the construct. We apply…

Descriptors: Test Validity, Construct Validity, Scores, Evidence

Effects of Item Parameter Drift on Vertical Scaling with the Nonequivalent Groups with Anchor Test (NEAT) Design

Peer reviewed

Direct link

Ye, Meng; Xin, Tao – Educational and Psychological Measurement, 2014

The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…

Descriptors: Scaling, Test Items, Equated Scores, Achievement Gains

The Impact of Sub-Skills and Item Content on Students' Skills with Regard to the Control-of-Variables Strategy

Peer reviewed

Direct link

Schwichow, Martin; Christoph, Simon; Boone, William J.; Härtig, Hendrik – International Journal of Science Education, 2016

The so-called control-of-variables strategy (CVS) incorporates the important scientific reasoning skills of designing controlled experiments and interpreting experimental outcomes. As CVS is a prominent component of science standards appropriate assessment instruments are required to measure these scientific reasoning skills and to evaluate the…

Descriptors: Thinking Skills, Science Instruction, Science Experiments, Science Tests

Spring 2015 Digital Devices Comparability Research Study

Download full text

Steedle, Jeffrey; McBride, Malena; Johnson, Marc; Keng, Leslie – Partnership for Assessment of Readiness for College and Careers, 2016

The first operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) took place during the 2014-2015 school year. In addition to the traditional paper-and-pencil format, the assessments were available for administration on a variety of electronic devices, including desktop computers, laptop computers,…

Descriptors: Computer Assisted Testing, Difficulty Level, Test Items, Scores

New Meridian Technical Report 2018-2019

Download full text

New Meridian Corporation, 2020

The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…

Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Behavioral Research and…	10
ProQuest LLC	3
Applied Measurement in…	2
Educational and Psychological…	2
International Journal of…	2
New Meridian Corporation	2
Partnership for Assessment of…	2
African Journal of Research…	1
ETS Research Report Series	1
Educational Assessment	1
Educational Research and…	1
Educational Sciences: Theory…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Measurement:…	1
School Science and Mathematics	1
Universal Journal of…	1
More ▼

Tindal, Gerald	9
Ketterlin-Geller, Leanne R.	6
Liu, Kimy	6
Alonzo, Julie	4
Yovanoff, Paul	4
Anderson, Daniel	2
Geller, Josh	2
Jung, Eunju	2
Park, Bitnara Jasmine	2
Sundstrom-Hebert, Krystal	2
Albano, Anthony D.	1
Ali, Usama	1
Andersson, Björn	1
Arce, Alvaro J.	1
Basaraba, Deni	1
Bejar, Isaac I.	1
Bolt, Daniel M.	1
Boone, William J.	1
Brown, Terran	1
Cai, Li	1
Carling, Kristy	1
Chen, Chia-Wen	1
Chen, Jianshen	1
Chi, Eunlim	1
Cho, Hyun-Jeong	1
More ▼