ERIC - Search Results

Publication Date

In 2025	5
Since 2024	49
Since 2021 (last 5 years)	235
Since 2016 (last 10 years)	570
Since 2006 (last 20 years)	917

Descriptor

Test Items	917
Test Reliability	701
Test Validity	445
Test Construction	362
Foreign Countries	352
Item Response Theory	225
Psychometrics	218
Difficulty Level	199
Scores	181
Factor Analysis	169
Reliability	166
Correlation	156
Item Analysis	152
Statistical Analysis	114
Measures (Individuals)	104
Scoring	98
Goodness of Fit	95
Multiple Choice Tests	92
Undergraduate Students	90
Interrater Reliability	82
Construct Validity	80
Mathematics Tests	79
Factor Structure	78
Science Tests	78
Comparative Analysis	76
More ▼

Publication Type

Journal Articles	796
Reports - Research	715
Reports - Evaluative	109
Tests/Questionnaires	76
Reports - Descriptive	44
Dissertations/Theses -…	33
Numerical/Quantitative Data	22
Speeches/Meeting Papers	20
Information Analyses	7
Guides - Non-Classroom	5
Opinion Papers	5
Collected Works - General	3
Books	2
Guides - General	2
Multilingual/Bilingual…	1
Non-Print Media	1
Reference Materials - General	1
Reports - General	1
More ▼

Education Level

Higher Education	267
Postsecondary Education	220
Secondary Education	160
Elementary Education	143
High Schools	76
Middle Schools	75
Junior High Schools	54
Elementary Secondary Education	48
Early Childhood Education	47
Intermediate Grades	36
Primary Education	35
Grade 8	28
Grade 5	22
Grade 7	22
Grade 6	20
Kindergarten	20
Grade 2	18
Grade 3	17
Grade 4	17
Grade 1	13
Grade 9	13
Preschool Education	7
Adult Education	6
Grade 10	5
Grade 12	4
More ▼

Audience

Teachers	6
Administrators	5
Support Staff	3
Researchers	2
Counselors	1
Parents	1
Policymakers	1
Practitioners	1

Location

Turkey	76
Indonesia	29
Germany	24
China	17
Florida	17
Canada	16
Australia	15
India	12
California	11
United States	11
Malaysia	10
Netherlands	10
Taiwan	10
New York	9
Nigeria	8
United Kingdom	8
Illinois	7
Iran	7
South Korea	7
Turkey (Ankara)	7
Turkey (Istanbul)	7
Jordan	6
Maryland	6
Nebraska	6
Singapore	6
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	4
No Child Left Behind Act 2001	4
Every Student Succeeds Act…	3
Rehabilitation Act 1973…	3
Head Start	1
United Nations Convention on…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 917 results Save | Export

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Estimating the Psychometric Properties ("Item Difficulty, Discrimination and Reliability Indices") of Test Items Using Kuder-Richardson Approach (KR-20)

Peer reviewed
PDF on ERIC

Download full text

Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023

There is no need or point to testing of knowledge, attributes, traits, behaviours or abilities of an individual if information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classroomshas been completely ignored otherwise dying slowly in most testing environments. In…

Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

The Impact of Inconsistent Responders to Mixed-Worded Scales on Inferences in International Large-Scale Assessments

Peer reviewed

Direct link

Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022

Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…

Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed
PDF on ERIC

Download full text

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Direct link

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design

Peer reviewed

Direct link

Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024

To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…

Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Direct link

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

We Before Me: Developing a Self-Referent Measure of Cultural Humility for Postsecondary Students

Peer reviewed

Direct link

Melissa Whatley; Dominique Foster; Stephen Paul – Journal of Studies in International Education, 2024

The purpose of this study was to develop a measurement instrument that scholars and practitioners in international education can use as a means of exploring whether and how individuals who come into contact with international education programs develop a greater sense of cultural humility. Specifically, the study described here outlines the four…

Descriptors: Foreign Students, Cultural Awareness, Consciousness Raising, Test Construction

Development of the Inventory of Biotic Climate Literacy (IBCL)

Peer reviewed

Direct link

Emily A. Holt; Jessica Duke; Ryan Dunk; Krystal Hinerman – Environmental Education Research, 2024

Student understanding of climate change is an active and growing area of research, but little research has documented undergraduate students' knowledge about the biotic impacts of climate change. Here, we address this literature gap by presenting the Inventory of Biotic Climate Literacy (IBCL), a concept inventory developed to assess undergraduate…

Descriptors: Climate, Undergraduate Students, Knowledge Level, Test Construction

The Development of Epistemological Understanding Revisited: Enhancing Reliability of the Tool by Using Only Abstract Items

Peer reviewed

Direct link

Zyluk, Natalia; Karpe, Karolina; Urbanski, Mariusz – SAGE Open, 2022

The aim of this paper is to describe the process of modification of the research tool designed for measuring the development of personal epistemology--"Standardized Epistemological Understanding Assessment" (SEUA). SEUA was constructed as an improved version of the instrument initially proposed by Kuhn et al. SEUA was proved to be a more…

Descriptors: Epistemology, Research Tools, Beliefs, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 62

Online Submission	34
Journal of Psychoeducational…	33
ProQuest LLC	33
Educational and Psychological…	32
ETS Research Report Series	26
Grantee Submission	20
International Journal of…	19
Applied Measurement in…	15
International Journal of…	12
SAGE Open	12
International Journal of…	11
Educational Measurement:…	10
International Journal of…	10
Educational Sciences: Theory…	9
Journal of Educational…	9
Practical Assessment,…	9
Chemistry Education Research…	8
Eurasian Journal of…	8
Journal of Education and…	8
Journal of Educational and…	8
Journal of Speech, Language,…	8
Language Assessment Quarterly	8
Measurement and Evaluation in…	8
Physical Review Physics…	8
Applied Psychological…	7
More ▼

Schoen, Robert C.	12
Anderson, Daniel	6
Guo, Hongwen	6
Liu, Ou Lydia	6
Alonzo, Julie	5
LaVenia, Mark	5
Baghaei, Purya	4
Bauduin, Charity	4
Brennan, Robert L.	4
Farina, Kristy	4
Lee, Won-Chan	4
Petscher, Yaacov	4
Sijtsma, Klaas	4
Tindal, Gerald	4
Yang, Xiaotong	4
Almehrizi, Rashid S.	3
Boone, William J.	3
Dogan, Nuri	3
Edwards, Michael C.	3
Emons, Wilco H. M.	3
Herman, Joan L.	3
Kim, Sooyeon	3
Kyllonen, Patrick	3
Liu, Jinghua	3
Metsämuuronen, Jari	3
More ▼

SAT (College Admission Test)	9
Program for International…	8
Trends in International…	8
Raven Progressive Matrices	6
ACT Assessment	5
Test of English as a Foreign…	5
Marlowe Crowne Social…	4
Peabody Picture Vocabulary…	4
Dynamic Indicators of Basic…	3
Graduate Record Examinations	3
Measures of Academic Progress	3
Progress in International…	3
Rosenberg Self Esteem Scale	3
Strengths and Difficulties…	3
Test of English for…	3
Autism Diagnostic Observation…	2
Center for Epidemiologic…	2
Child Behavior Checklist	2
Flesch Kincaid Grade Level…	2
International English…	2
Iowa Tests of Basic Skills	2
Peabody Developmental Motor…	2
Stanford Achievement Tests	2
Test of Nonverbal Intelligence	2
ACT Interest Inventory	1
More ▼