ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	25
Since 2006 (last 20 years)	109

Descriptor

Test Items	228
Test Reliability	144
Test Construction	90
Test Validity	88
Reliability	58
Scoring	46
Scores	45
Item Response Theory	42
Psychometrics	40
Item Analysis	36
Difficulty Level	34
Foreign Countries	28
Interrater Reliability	26
Multiple Choice Tests	25
Estimation (Mathematics)	24
Computer Assisted Testing	22
Factor Analysis	21
Comparative Analysis	20
Correlation	20
Models	20
Test Format	20
Validity	17
Classification	16
Error of Measurement	16
Mathematics Tests	16

Publication Type

Reports - Evaluative	228
Journal Articles	146
Speeches/Meeting Papers	41
Numerical/Quantitative Data	8
Tests/Questionnaires	7
Opinion Papers	5
Book/Product Reviews	2
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Research	1

Education Level

Higher Education	21
Secondary Education	16
Elementary Secondary Education	14
Postsecondary Education	11
High Schools	9
Middle Schools	9
Elementary Education	7
Grade 8	7
Grade 5	5
Grade 7	4
Junior High Schools	4
Adult Education	2
Grade 6	2
Intermediate Grades	2
Kindergarten	2
Early Childhood Education	1
Grade 11	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 9	1
Preschool Education	1

Audience

Practitioners	4
Researchers	3
Teachers	3

Location

California	6
Canada	4
Nebraska	4
United Kingdom	4
China	3
New York	3
Alabama	2
Malaysia	2
Oregon	2
Taiwan	2
Texas	2
Washington	2
Arkansas	1
Asia	1
Botswana	1
Colombia	1
Dominica	1
Florida	1
Germany	1
Grenada	1
Idaho	1
Illinois	1
Maryland	1
Mexico	1
Michigan	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 228 results

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Addressing Issues of Missing Values in the Survey Research of High School Mathematics Teachers' Digital Competencies

Peer reviewed
PDF on ERIC

Jafri, Mairaj – Waikato Journal of Education, 2022

This paper reports how I addressed the issue of extensive missing values in my PhD study, "Digital Competencies of High School Mathematics Teachers". I collected data using an online survey. Several methods exist to address the issue of missing values. I utilised multiple imputation (MI) as it provides more accurate results. The mean…

Descriptors: Data Collection, Research Problems, Doctoral Dissertations, Online Surveys

Evaluating Human Scoring Using Generalizability Theory

Peer reviewed

Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020

Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…

Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries

Thanks Coefficient Alpha, We Still Need You!

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019

This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…

Descriptors: Test Validity, Test Reliability, Test Items, Correlation

Test Review: Hammill, D. D., McGhee, R. L., & Ehrler, D. J. (2018). "Detroit Tests of Learning Abilities--Fifth Edition." Austin, TX: Pro-Ed.

Peer reviewed

Rigney, Alexander M. – Journal of Psychoeducational Assessment, 2019

The "Detroit Tests of Learning Aptitude" has been in use for more than three quarters of a century (Baker & Leland, 1935). Its longevity in the field speaks to its popularity as a broad measure of cognitive abilities. Its most recent iteration, in the form of the "Detroit Tests of Learning Abilities--Fifth Edition" (DTLA-5;…

Descriptors: Aptitude Tests, Cognitive Ability, Test Construction, Test Items

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

A Validation Trajectory for the Washington Assessment of Risks and Needs of Students

Peer reviewed

Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020

The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…

Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques

Test Review: Stroud, K. C., & Reynolds, C. R. (2006), "School Motivation and Learning Strategies Inventory (SMALSI): College Form [Manual]." Torrance, CA: Western Psychological Services

Peer reviewed

Babcock, Sarah E.; Wilson, Claire A.; Lau, Chloe – Canadian Journal of School Psychology, 2018

This article describes and reviews The School Motivation and Learning Strategies Inventory (SMALSI™; Stroud & Reynolds, 2006), published by Western Psychological Services, a self-report inventory designed to assess academic motivation, as well as learning and study strategies. The test identifies 10 primary constructs, referred to broadly as…

Descriptors: Motivation, Measures (Individuals), Test Anxiety, Test Wiseness

Assessing Instructional Sensitivity Using the Pre-Post Difference Index: A Nontechnical Tool for Extension Educators

Peer reviewed

Adedokun, Omolola A. – Journal of Extension, 2018

This article provides an illustrative description of the pre-post difference index (PPDI), a simple, nontechnical yet robust tool for examining the instructional sensitivity of assessment items. Extension educators often design pretest-posttest instruments to assess the impact of their curricula on participants' knowledge and understanding of the…

Descriptors: Extension Education, Extension Agents, Pretests Posttests, Curriculum Evaluation

Loosening Psychometric Constraints on Educational Assessments

Peer reviewed

Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017

In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…

Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability

Reviewing the IELTS Speaking Test in East Asia: Theoretical and Practice-Based Insights

Peer reviewed

Quaid, Ethan Douglas – Language Testing in Asia, 2018

This paper reviews the International English Language Testing System's speaking sub-test in the East Asia region with reference to theoretical and practice-based perspectives and identifies future research opportunities to enhance the measures of test qualities found. The test's construct validity was seen to accurately measure the abilities…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Speech Tests

Spring 2021 NSCAS Phase I Pilot ELA, Mathematics, and Science Technical Report

Nebraska Department of Education, 2021

This technical report documents the processes and procedures implemented to support the Spring 2021 Nebraska Student-Centered Assessment System (NSCAS) Phase I Pilot in English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…

Descriptors: Psychometrics, Standard Setting, English, Language Arts

Designing Language Assessments in Context: Theoretical, Technical, and Institutional Considerations

Peer reviewed
PDF on ERIC

Giraldo, Frank – HOW, 2019

The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…

Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning

Source

Educational and Psychological…	15
Journal of Psychoeducational…	15
Applied Measurement in…	11
Applied Psychological…	8
Journal of Educational…	6
Online Submission	6
Psychometrika	4
Research in Developmental…	4
Assessment & Evaluation in…	3
Assessment in Education:…	3
National Center for Research…	3
Nebraska Department of…	3
Psychological Methods	3
Behavioral Research and…	2
Canadian Journal of School…	2
Educational Measurement:…	2
Educational Research and…	2
International Journal of…	2
Practical Assessment,…	2
Research in the Schools	2
Studies in Educational…	2
ACT, Inc.	1
Advances in Physiology…	1
Asia-Pacific Forum on Science…	1
Assessment for Effective…	1

Author

Lee, Guemin	4
Meijer, Rob R.	4
Feldt, Leonard S.	3
Frisbie, David A.	3
Nicewander, W. Alan	3
Alonzo, Julie	2
Bock, R. Darrell	2
Bramley, Tom	2
Budescu, David V.	2
Davis-Becker, Susan L.	2
Gierl, Mark J.	2
Gustafsson, Jan-Eric	2
Hogan, Thomas P.	2
Lai, Cheng-Fei	2
Lunz, Mary E.	2
Park, Bitnara Jasmine	2
Plake, Barbara S.	2
Raykov, Tenko	2
Su, Chwen-Yng	2
Tindal, Gerald	2
Trevisan, Michael S.	2
Wainer, Howard	2
Wise, Steven L.	2
Wuang, Yee-Pay	2

Assessments and Surveys

SAT (College Admission Test)	6
ACT Assessment	2
Advanced Placement…	2
Armed Services Vocational…	2
Graduate Record Examinations	2
National Assessment of…	2
Work Keys (ACT)	2
ACT Interest Inventory	1
Acculturation Rating Scale…	1
Alberta Grade Twelve Diploma…	1
Armed Forces Qualification…	1
Autism Diagnostic Observation…	1
Differential Aptitude Test	1
Expressive One Word Picture…	1
Hidden Figures Test	1
International English…	1
Kaufman Test of Educational…	1
Marlowe Crowne Social…	1
Minnesota Multiphasic…	1
Peabody Developmental Motor…	1
Preliminary Scholastic…	1
Program for International…	1
Raven Progressive Matrices	1
Rosenberg Self Esteem Scale	1
Schools and Staffing Survey…	1