ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	12

Descriptor

Foreign Countries	17
Test Length	17
Item Response Theory	15
Computer Assisted Testing	5
Statistical Analysis	5
Test Items	5
Adaptive Testing	4
Error of Measurement	4
Models	4
Sample Size	4
Test Bias	4
Ability	3
Comparative Analysis	3
Mathematics Tests	3
Measurement	3
Simulation	3
Test Format	3
Test Reliability	3
Accuracy	2
Achievement Tests	2
College Entrance Examinations	2
Computation	2
Correlation	2
Evaluation Methods	2
High Stakes Tests	2
More ▼

Source

Applied Psychological…	3
Educational and Psychological…	3
Educational Sciences: Theory…	1
Eurasian Journal of…	1
European Journal of Science…	1
International Journal of…	1
Journal of Educational…	1
Measurement:…	1
Physical Review Physics…	1
ProQuest LLC	1
Psychometrika	1
Research Matters	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	13
Reports - Evaluative	2
Dissertations/Theses -…	1
Reports - Descriptive	1

Education Level

Higher Education	3
Postsecondary Education	3
Secondary Education	3
Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Taiwan	4
Netherlands	2
Turkey	2
Australia	1
Colombia	1
Germany	1
Indonesia	1
Israel	1
Japan	1
Jordan	1
Peru	1
Qatar	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Center for Epidemiologic…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Prevalence of Random Responders as a Function of Scale Position and Questionnaire Length in the TIMSS 2015 Eighth-Grade Student Questionnaire

Peer reviewed

Direct link

Saskia van Laar; Johan Braeken – International Journal of Testing, 2024

This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…

Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Download full text

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

Estimation of Mixture Rasch Models from Skewed Latent Ability Distributions

Peer reviewed

Direct link

Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020

Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…

Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size

Optimizing the Length of Computerized Adaptive Testing for the Force Concept Inventory

Peer reviewed

Direct link

Yasuda, Jun-ichiro; Mae, Naohiro; Hull, Michael M.; Taniguchi, Masa-aki – Physical Review Physics Education Research, 2021

As a method to shorten the test time of the Force Concept Inventory (FCI), we suggest the use of computerized adaptive testing (CAT). CAT is the process of administering a test on a computer, with items (i.e., questions) selected based upon the responses of the examinee to prior items. In so doing, the test length can be significantly shortened.…

Descriptors: Foreign Countries, College Students, Student Evaluation, Computer Assisted Testing

Profile Analyses as Feedback by Evaluating the Balance in Exam Scores

Peer reviewed
PDF on ERIC

Download full text

Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019

In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…

Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores

Mixture IRT Model with a Higher-Order Structure for Latent Traits

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2017

Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Comparing Performances (Type I Error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014

This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…

Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias

Optimal and Most Exact Confidence Intervals for Person Parameters in Item Response Theory Models

Peer reviewed

Direct link

Doebler, Anna; Doebler, Philipp; Holling, Heinz – Psychometrika, 2013

The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…

Descriptors: Foreign Countries, Item Response Theory, Computation, Hypothesis Testing

Curtailment and Stochastic Curtailment to Shorten the CES-D

Peer reviewed

Direct link

Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth – Applied Psychological Measurement, 2012

The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

Descriptors: Measures (Individuals), Depression (Psychology), Test Length, Computer Assisted Testing

Three Essays on Teacher Education Programs and Test-Takers' Response Times on Test Items

Direct link

Qian, Hong – ProQuest LLC, 2013

This dissertation includes three essays: one essay focuses on the effect of teacher preparation programs on teacher knowledge while the other two focus on test-takers' response times on test items. Essay One addresses the problem of how opportunities to learn in teacher preparation programs influence future elementary mathematics teachers'…

Descriptors: Teacher Education Programs, Pedagogical Content Knowledge, Preservice Teacher Education, Preservice Teachers

Multidimensional Rasch Analysis of a Psychological Test with Multiple Subtests: A Statistical Solution for the Bandwidth-Fidelity Dilemma

Peer reviewed

Direct link

Cheng, Ying-Yao; Wang, Wen-Chung; Ho, Yi-Hui – Educational and Psychological Measurement, 2009

Educational and psychological tests are often composed of multiple short subtests, each measuring a distinct latent trait. Unfortunately, short subtests suffer from low measurement precision, which makes the bandwidth-fidelity dilemma inevitable. In this study, the authors demonstrate how a multidimensional Rasch analysis can be employed to take…

Descriptors: Item Response Theory, Measurement, Correlation, Measures (Individuals)

Application of Computerized Adaptive Testing to Entrance Examination for Graduate Studies in Turkey

Peer reviewed
PDF on ERIC

Download full text

Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012

Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…

Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students

Test Length and Validity.

Peer reviewed

Bell, Richard; Lumsden, James – Applied Psychological Measurement, 1980

The effect of test length on predictive validity is examined empirically. For four tests, the curve of validity against test length had a very gentle slope for the longer tests and all tests could be reduced by more than 60 percent without appreciable decreases in validity. (Author/BW)

Descriptors: Foreign Countries, High School Seniors, High Schools, Mathematical Models

Multidimensional Test Assembly Based on Lagrangian Relaxation Techniques. Research Report 98-08.

Download full text

Veldkamp, Bernard P. – 1998

In this paper, a mathematical programming approach is presented for the assembly of ability tests measuring multiple traits. The values of the variance functions of the estimators of the traits are minimized, while test specifications are met. The approach is based on Lagrangian relaxation techniques and provides good results for the two…

Descriptors: Ability, Estimation (Mathematics), Foreign Countries, Item Banks

Factors Influencing the Mantel and Generalized Mantel-Haenszel Methods for the Assessment of Differential Item Functioning in Polytomous Items

Peer reviewed

Direct link

Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004

Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…

Descriptors: Test Length, Test Bias, Simulation, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

Wang, Wen-Chung	3
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Bell, Richard	1
Benton, Tom	1
Budescu, David V.	1
Bulut, Okan	1
Chen, Hsueh-Chu	1
Cheng, Ying-Yao	1
Cohen, Allan S.	1
Doebler, Anna	1
Doebler, Philipp	1
Eggen, T.J.H.M.	1
Finkelman, Matthew D.	1
Gök, Bilge	1
Ho, Yi-Hui	1
Holling, Heinz	1
Huang, Hung-Yu	1
Hull, Michael M.	1
Johan Braeken	1
Kan, Adnan	1
Karadavut, Tugba	1
Kelecioglu, Hülya	1
Kim, Seock-Ho	1
Kim, Wonsuk	1
More ▼