ERIC - Search Results

Publication Date

In 2025	5
Since 2024	49
Since 2021 (last 5 years)	235

Descriptor

Test Items	235
Test Reliability	207
Test Validity	130
Test Construction	118
Foreign Countries	108
Difficulty Level	61
Item Response Theory	53
Psychometrics	50
Factor Analysis	39
Item Analysis	39
Scores	38
Elementary School Students	32
Multiple Choice Tests	31
Factor Structure	28
Undergraduate Students	28
Construct Validity	27
Goodness of Fit	27
Measures (Individuals)	26
Reliability	25
Science Tests	25
Language Tests	22
Correlation	20
English (Second Language)	19
High School Students	19
Questionnaires	19
More ▼

Publication Type

Reports - Research	222
Journal Articles	217
Tests/Questionnaires	24
Speeches/Meeting Papers	5
Dissertations/Theses -…	4
Information Analyses	4
Reports - Descriptive	3
Reports - Evaluative	3
Numerical/Quantitative Data	2

Education Level

Higher Education	72
Postsecondary Education	72
Secondary Education	51
Elementary Education	46
High Schools	23
Middle Schools	23
Junior High Schools	16
Intermediate Grades	13
Early Childhood Education	10
Primary Education	9
Elementary Secondary Education	7
Grade 4	5
Grade 5	5
Grade 6	5
Grade 2	4
Grade 3	4
Grade 7	4
Grade 8	4
Adult Education	2
Grade 12	2
Grade 9	2
Grade 1	1
Grade 10	1
Kindergarten	1
Preschool Education	1
More ▼

Audience

Counselors	1
Practitioners	1

Location

Turkey	23
Indonesia	14
Malaysia	7
China	6
Germany	6
Turkey (Istanbul)	5
Thailand	4
India	3
Philippines	3
South Korea	3
Canada	2
Europe	2
Iran	2
Nebraska	2
Netherlands	2
Oman	2
Singapore	2
South Africa	2
Turkey (Ankara)	2
United Kingdom	2
United States	2
Vietnam	2
Australia	1
Bosnia and Herzegovina	1
Bosnia and Herzegovina…	1
More ▼

Laws, Policies, & Programs

Head Start

What Works Clearinghouse Rating

Showing 1 to 15 of 235 results Save | Export

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Estimating the Psychometric Properties ("Item Difficulty, Discrimination and Reliability Indices") of Test Items Using Kuder-Richardson Approach (KR-20)

Peer reviewed
PDF on ERIC

Download full text

Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023

There is no need or point to testing of knowledge, attributes, traits, behaviours or abilities of an individual if information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classroomshas been completely ignored otherwise dying slowly in most testing environments. In…

Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

The Impact of Inconsistent Responders to Mixed-Worded Scales on Inferences in International Large-Scale Assessments

Peer reviewed

Direct link

Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022

Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…

Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Direct link

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design

Peer reviewed

Direct link

Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024

To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…

Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Direct link

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

We Before Me: Developing a Self-Referent Measure of Cultural Humility for Postsecondary Students

Peer reviewed

Direct link

Melissa Whatley; Dominique Foster; Stephen Paul – Journal of Studies in International Education, 2024

The purpose of this study was to develop a measurement instrument that scholars and practitioners in international education can use as a means of exploring whether and how individuals who come into contact with international education programs develop a greater sense of cultural humility. Specifically, the study described here outlines the four…

Descriptors: Foreign Students, Cultural Awareness, Consciousness Raising, Test Construction

Development of the Inventory of Biotic Climate Literacy (IBCL)

Peer reviewed

Direct link

Emily A. Holt; Jessica Duke; Ryan Dunk; Krystal Hinerman – Environmental Education Research, 2024

Student understanding of climate change is an active and growing area of research, but little research has documented undergraduate students' knowledge about the biotic impacts of climate change. Here, we address this literature gap by presenting the Inventory of Biotic Climate Literacy (IBCL), a concept inventory developed to assess undergraduate…

Descriptors: Climate, Undergraduate Students, Knowledge Level, Test Construction

The Development of Epistemological Understanding Revisited: Enhancing Reliability of the Tool by Using Only Abstract Items

Peer reviewed

Direct link

Zyluk, Natalia; Karpe, Karolina; Urbanski, Mariusz – SAGE Open, 2022

The aim of this paper is to describe the process of modification of the research tool designed for measuring the development of personal epistemology--"Standardized Epistemological Understanding Assessment" (SEUA). SEUA was constructed as an improved version of the instrument initially proposed by Kuhn et al. SEUA was proved to be a more…

Descriptors: Epistemology, Research Tools, Beliefs, Test Items

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

International Journal of…	12
SAGE Open	9
Educational and Psychological…	7
Grantee Submission	6
Journal of Psychoeducational…	6
Online Submission	5
Participatory Educational…	5
Education and Information…	4
International Journal of…	4
Journal of Educational and…	4
Journal of Speech, Language,…	4
Language Testing	4
Language Testing in Asia	4
ProQuest LLC	4
International Journal of…	3
International Journal of…	3
International Journal of…	3
Journal of Baltic Science…	3
Journal of Intelligence	3
Measurement and Evaluation in…	3
NWEA	3
Pegem Journal of Education…	3
Practical Assessment,…	3
Science Insights Education…	3
Applied Measurement in…	2
More ▼

Sachin Nedungadi	3
Al-Jarf, Reima	2
Almehrizi, Rashid S.	2
Boone, William J.	2
Braeken, Johan	2
Che Lah, Noor Hidayah	2
Chew, Cheng Meng	2
Chin, Huan	2
Ji-young Shin	2
Jumaat, Nurul Farhana	2
Metsämuuronen, Jari	2
Retnawati, Heri	2
Steinmann, Isa	2
Tasir, Zaidatun	2
Acar Guvendir, Meltem	1
Achmad Rante Suparman	1
Acikgul, Kubra	1
Acosta-Prado, Julio César	1
Adadan, Emine	1
Adam Carreon	1
Aditya Shah	1
Agbenyo, Sheilla	1
Aguilar-Rodriguez, Adriana	1
Ahmad, Jamilah	1
Ahmadi, Alireza	1
More ▼

Measures of Academic Progress	3
Program for International…	2
Progress in International…	2
Raven Progressive Matrices	2
Trends in International…	2
Big Five Inventory	1
Child Behavior Checklist	1
Computer Attitude Scale	1
General Social Survey	1
Mayer Salovey Caruso…	1
Peabody Developmental Motor…	1
Peabody Picture Vocabulary…	1
Social Skills Improvement…	1
Student Teacher Relationship…	1
Test of English as a Foreign…	1
Test of English for…	1
Test of Gross Motor…	1
Test of Nonverbal Intelligence	1
Watson Glaser Critical…	1
Wechsler Individual…	1
Woodcock Johnson Tests of…	1
More ▼