Publication Date
In 2025: 1
Since 2024: 5
Since 2021 (last 5 years): 38
Since 2016 (last 10 years): 103
Since 2006 (last 20 years): 181
Descriptor
Scores: 246
Test Items: 246
Test Reliability: 164
Test Validity: 83
Test Construction: 72
Reliability: 67
Foreign Countries: 57
Item Response Theory: 57
Correlation: 56
Psychometrics: 54
Difficulty Level: 52
Audience
Researchers: 4
Practitioners: 2
Teachers: 1
Location
Turkey: 10
Indonesia: 4
Israel: 4
New York: 4
Canada: 3
China: 3
Colorado: 3
United Kingdom (England): 3
Alabama: 2
Australia: 2
California: 2
What Works Clearinghouse Rating
Meets WWC Standards without Reservations: 1
Meets WWC Standards with or without Reservations: 1
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability, such as coefficients alpha, theta, omega, and rho (maximal reliability), are prone to radically underestimate reliability for the kinds of tests common in educational achievement testing. Such tests are often built from items with widely deviating difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
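As background for the coefficients this entry names, coefficient alpha is straightforward to compute from a persons-by-items score matrix. A minimal NumPy sketch (the function name and example data are illustrative, not taken from the paper):

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a (persons x items) score matrix."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                         # number of items
    item_vars = scores.var(axis=0, ddof=1)      # per-item sample variances
    total_var = scores.sum(axis=1).var(ddof=1)  # variance of summed scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)
```

With perfectly parallel items alpha reaches 1; with uncorrelated items it drops toward 0, which is one reason it can understate reliability for tests whose items vary widely in difficulty.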
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis
Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022
Questionnaire scales that are mixed-worded, i.e., include both positively and negatively worded items, often suffer from issues such as low reliability and more complex latent structures than intended. Part of the problem might be that some respondents fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…
Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale were developed: 10 positive items were included in the first form (Form-P), and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Kamau Oginga Siwatu; Kara Page; Narges Hadi – College Teaching, 2024
The purpose of this article is to document the development of a new measure of teaching self-efficacy -- "The College Teaching Self-Efficacy (CTSE) Scale." We designed the CTSE scale to examine individuals' beliefs in their abilities to perform specific teaching tasks in a college classroom successfully. We developed an instrument that…
Descriptors: Self Efficacy, Beliefs, Psychometrics, Measures (Individuals)
Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025
Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…
Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores
Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of the difficulty and discriminating values of binary items, and of tests consisting of such items, and finds their relationships, including estimation of the test error variance and thereby of test reliability as per its definition, using cosine similarities. The measures use the entire data. The difficulty value of a test and an item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
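For context on the quantities this entry proposes to replace, the classical difficulty of a binary item is the proportion correct, and a common discrimination index is the item-total correlation. A minimal sketch of these classical measures (not the paper's cosine-similarity measures):

```python
import numpy as np

def item_stats(responses):
    """Classical difficulty (proportion correct) and item-total
    (point-biserial) discrimination for a (persons x items) 0/1 matrix."""
    X = np.asarray(responses, dtype=float)
    difficulty = X.mean(axis=0)   # p-value per item
    total = X.sum(axis=1)         # total test score per person
    # Correlation of each item with the total score.
    discrimination = np.array(
        [np.corrcoef(X[:, j], total)[0, 1] for j in range(X.shape[1])]
    )
    return difficulty, discrimination
```

Note that because each item contributes to the total score, these item-total correlations are inflated for short tests; that is one of the limitations work like this entry addresses.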
DeMars, Christine E. – Applied Measurement in Education, 2021
Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…
Descriptors: Item Response Theory, Test Items, Ability, Scores
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
Acikgul, Kubra; Sad, Suleyman Nihat; Altay, Bilal – International Journal of Assessment Tools in Education, 2023
This study aimed to develop a useful test to measure university students' spatial abilities validly and reliably. Following a sequential explanatory mixed methods research design, first, qualitative methods were used to develop the trial items for the test; next, the psychometric properties of the test were analyzed through quantitative methods…
Descriptors: Spatial Ability, Scores, Multiple Choice Tests, Test Validity
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
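The classical test theory formula for the reliability of a difference score D = Y - X makes this issue concrete. A minimal sketch (the standard textbook formula, not necessarily the estimator studied in the paper):

```python
def difference_score_reliability(sx, sy, rxx, ryy, rxy):
    """Classical reliability of a difference score D = Y - X, given
    pretest/posttest SDs (sx, sy), reliabilities (rxx, ryy), and the
    pretest-posttest correlation (rxy)."""
    num = sx**2 * rxx + sy**2 * ryy - 2 * rxy * sx * sy
    den = sx**2 + sy**2 - 2 * rxy * sx * sy
    return num / den
```

As the pretest-posttest correlation approaches the test reliabilities, the reliability of the difference score falls sharply, which is the interpretive hazard for intraindividual change scores.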
Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022
This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…
Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction