ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	34
Since 2006 (last 20 years)	69

Descriptor

Models	86
Test Items	86
Test Reliability	56
Item Response Theory	30
Test Validity	29
Test Construction	27
Item Analysis	23
Goodness of Fit	22
Reliability	21
Foreign Countries	20
Psychometrics	18
Scores	15
Difficulty Level	13
Scoring	13
Correlation	11
Factor Analysis	10
Test Format	10
Computation	9
Construct Validity	9
Elementary School Students	9
Interrater Reliability	9
Comparative Analysis	8
Factor Structure	8
Student Attitudes	8
Validity	8
More ▼

Publication Type

Journal Articles	65
Reports - Research	57
Reports - Evaluative	20
Speeches/Meeting Papers	8
Reports - Descriptive	6
Tests/Questionnaires	5
Dissertations/Theses -…	2
Numerical/Quantitative Data	2
Guides - Classroom - Teacher	1

Education Level

Elementary Education	11
Higher Education	10
Postsecondary Education	10
Secondary Education	8
Middle Schools	7
Elementary Secondary Education	6
High Schools	4
Junior High Schools	4
Early Childhood Education	2
Intermediate Grades	2
Primary Education	2
Adult Education	1
Grade 1	1
Grade 2	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Kindergarten	1
More ▼

Audience

Practitioners	2
Administrators	1

Location

Canada	2
China	2
Georgia	2
Germany	2
Taiwan	2
Asia	1
Australia	1
California	1
France	1
Hong Kong	1
Indonesia	1
Iran	1
Italy	1
Malaysia	1
Netherlands	1
Saudi Arabia	1
Saudi Arabia (Riyadh)	1
Singapore	1
Thailand	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	2
Alberta Grade Twelve Diploma…	1
Center for Epidemiologic…	1
Eysenck Personality Inventory	1
Hidden Figures Test	1
Program for International…	1
Trends in International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 86 results Save | Export

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Direct link

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

Violation of Conditional Independence in the Many-Facets Rasch Model

Peer reviewed

Direct link

DeMars, Christine E. – Applied Measurement in Education, 2021

Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…

Descriptors: Item Response Theory, Test Items, Ability, Scores

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

A Comparison of Polytomous Rasch Models for the Analysis of C-Tests

Peer reviewed
PDF on ERIC

Download full text

Dhyaaldian, Safa Mohammed Abdulridah; Kadhim, Qasim Khlaif; Mutlak, Dhameer A.; Neamah, Nour Raheem; Kareem, Zaidoon Hussein; Hamad, Doaa A.; Tuama, Jassim Hassan; Qasim, Mohammed Saad – International Journal of Language Testing, 2022

A C-Test is a gap-filling test for measuring language competence in the first and second language. C-Tests are usually analyzed with polytomous Rasch models by considering each passage as a super-item or testlet. This strategy helps overcome the local dependence inherent in C-Test gaps. However, there is little research on the best polytomous…

Descriptors: Item Response Theory, Cloze Procedure, Reading Tests, Language Tests

Accounting for Rater Effects with the Hierarchical Rater Model Framework When Scoring Simple Structured Constructed Response Tests

Peer reviewed

Direct link

Nieto, Ricardo; Casabianca, Jodi M. – Journal of Educational Measurement, 2019

Many large-scale assessments are designed to yield two or more scores for an individual by administering multiple sections measuring different but related skills. Multidimensional tests, or more specifically, simple structured tests, such as these rely on multiple multiple-choice and/or constructed responses sections of items to generate multiple…

Descriptors: Tests, Scoring, Responses, Test Items

Development and Validation of the 'Mentoring for Effective Teaching Practicum Instrument'

Peer reviewed
PDF on ERIC

Download full text

Mateja Ploj Virtic; Andre Du Plessis; Andrej Šorgo – Center for Educational Policy Studies Journal, 2023

In the context of improving the quality of teacher education, the focus of the present work was to adapt the Mentoring for Effective Primary Science Teaching instrument to become more universal and have the potential to be used beyond the elementary science mentoring context. The adapted instrument was renamed the Mentoring for Effective Teaching…

Descriptors: Test Construction, Test Validity, Test Reliability, Measures (Individuals)

Establishing the Validity and Reliability of the LOCUS Assessments

Peer reviewed
PDF on ERIC

Download full text

Tim Jacobbe; Bob delMas; Brad Hartlaub; Jeff Haberstroh; Catherine Case; Steven Foti; Douglas Whitaker – Numeracy, 2023

The development of assessments as part of the funded LOCUS project is described. The assessments measure students' conceptual understanding of statistics as outlined in the GAISE PreK-12 Framework. Results are reported from a large-scale administration to 3,430 students in grades 6 through 12 in the United States. Items were designed to assess…

Descriptors: Statistics Education, Common Core State Standards, Student Evaluation, Elementary School Students

Automatic Multiple Choice Question Generation From Text: A Survey

Peer reviewed

Direct link

Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020

Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 90's.…

Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software

Use of Full Hierarchy Consistency Index to Assess Response Consistency

Peer reviewed
PDF on ERIC

Download full text

Akbay, Lokman; Kilinç, Mustafa – International Journal of Assessment Tools in Education, 2018

Measurement models need to properly delineate the real aspect of examinees' response processes for measurement accuracy purposes. To avoid invalid inferences, fit of examinees' response data to the model is studied through "person-fit" statistics. Misfit between the examinee response data and measurement model may be due to invalid…

Descriptors: Reliability, Goodness of Fit, Cognitive Measurement, Models

An Application of Reliability Estimation in Longitudinal Designs through Modeling Item-Specific Error Variance

Peer reviewed

Direct link

Sideridis, Georgios D.; Tsaousis, Ioannis; Al-Sadaawi, Abdullah – Educational and Psychological Measurement, 2019

The purpose of the present study was to apply the methodology developed by Raykov on modeling item-specific variance for the measurement of internal consistency reliability with longitudinal data. Participants were a randomly selected sample of 500 individuals who took on a professional qualifications test in Saudi Arabia over four different…

Descriptors: Test Reliability, Test Items, Longitudinal Studies, Foreign Countries

Developing an Assessment Framework of Multidimensional Scientific Competencies

Peer reviewed
PDF on ERIC

Download full text

Intasoi, Sasima; Junpeng, Putcharee; Tang, Keow Ngang; Ketchatturat, Jatuphum; Zhang, Yidan; Wilson, Mark – International Journal of Evaluation and Research in Education, 2020

The study aimed to develop and validate an assessment framework of multidimensional scientific competencies for seventh-grade students in the northeastern region of Thailand. A total of 289 samples with three different scientific competency levels were randomly selected to participate as test-takers. The design-based research encompassing four…

Descriptors: Science Tests, Grade 7, Foreign Countries, Science Process Skills

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Direct link

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

Impact of Both Local Item Dependencies and Cut-Point Locations on Examinee Classifications

Peer reviewed

Direct link

Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2018

Performance assessments, scenario-based tasks, and other groups of items carry a risk of violating the local item independence assumption made by unidimensional item response theory (IRT) models. Previous studies have identified negative impacts of ignoring such violations, most notably inflated reliability estimates. Still, the influence of this…

Descriptors: Performance Based Assessment, Item Response Theory, Models, Test Reliability

Development of Information Functions and Indices for the GGUM-RANK Multidimensional Forced Choice IRT Model

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok; Stark, Stephen – Journal of Educational Measurement, 2018

This research derived information functions and proposed new scalar information indices to examine the quality of multidimensional forced choice (MFC) items based on the RANK model. We also explored how GGUM-RANK information, latent trait recovery, and reliability varied across three MFC formats: pairs (two response alternatives), triplets (three…

Descriptors: Item Response Theory, Models, Item Analysis, Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Educational and Psychological…	6
Applied Psychological…	5
Journal of Educational…	5
ETS Research Report Series	4
Journal of Educational and…	3
Applied Measurement in…	2
Assessment & Evaluation in…	2
Assessment in Education:…	2
Education and Information…	2
Grantee Submission	2
International Journal of…	2
International Journal of…	2
Journal of Psychoeducational…	2
ProQuest LLC	2
Society for Research on…	2
Advances in Health Sciences…	1
American Journal of…	1
Behavioral Research and…	1
Center for Educational Policy…	1
College Board	1
Current Issues in Education	1
Educational Measurement:…	1
Educational Psychology	1
Educational Research and…	1
Educational Sciences: Theory…	1
More ▼

Burton, Richard F.	2
Champagne, Zachary M.	2
DeMars, Christine E.	2
Farina, Kristy	2
Hambleton, Ronald K.	2
LaVenia, Mark	2
Lee, Won-Chan	2
Schoen, Robert C.	2
Trevisan, Michael S.	2
Wang, Wen-Chung	2
Aditya Shah	1
Ajay Devmane	1
Akbay, Lokman	1
Al-Jarf, Reima	1
Al-Sadaawi, Abdullah	1
Allan, Marjorie	1
Alonzo, Julie	1
Altintas, Kerim Hakan	1
Anderson, Daniel	1
Andre Du Plessis	1
Andrej Šorgo	1
Baghaei, Purya	1
Barefah, Allaa	1
Baron, Simon	1
More ▼