Publication Date
  In 2025: 0
  Since 2024: 1
  Since 2021 (last 5 years): 3
  Since 2016 (last 10 years): 63
  Since 2006 (last 20 years): 114
Descriptor
  Statistical Analysis: 156
  Test Items: 156
  Test Reliability: 108
  Test Validity: 69
  Test Construction: 48
  Correlation: 47
  Foreign Countries: 44
  Item Analysis: 41
  Reliability: 41
  Difficulty Level: 35
  Psychometrics: 35
Author
  Farina, Kristy: 3
  LaVenia, Mark: 3
  Schoen, Robert C.: 3
  Barbera, Jack: 2
  Benson, Jeri: 2
  Champagne, Zachary M.: 2
  Guo, Hongwen: 2
  Kim, Sooyeon: 2
  Liu, Ou Lydia: 2
  Aedo-Saravia, Jaime: 1
  Ahmed, Tamim: 1
Audience
  Researchers: 5
  Practitioners: 1
  Teachers: 1
Location
  Turkey: 15
  Germany: 4
  Australia: 3
  India: 3
  Maryland: 3
  California: 2
  Colorado: 2
  Florida: 2
  Jordan: 2
  Singapore: 2
  Texas: 2
What Works Clearinghouse Rating
  Meets WWC Standards without Reservations: 1
  Meets WWC Standards with or without Reservations: 1
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha remains the most commonly reported reliability coefficient in research. The assumptions for its use are, however, not well understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis
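For context (a standard textbook expression, not quoted from Almehrizi's paper), the summed-score form of coefficient alpha that such critiques take as their starting point is

$$\alpha \;=\; \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma^2_{Y_i}}{\sigma^2_X}\right),$$

where $k$ is the number of items, $\sigma^2_{Y_i}$ is the variance of item $i$, and $\sigma^2_X$ is the variance of the summed score $X = \sum_i Y_i$.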
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many of them did not learn during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
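As a minimal sketch of the kind of method at issue (illustrative only, not taken from Brysbaert's article; the function name and participants-by-trials layout are assumptions), permutation-based split-half reliability with a Spearman-Brown correction is one common way to estimate how reliably a trial-based task separates participants:

```python
import numpy as np

def split_half_reliability(scores, n_splits=1000, seed=0):
    """Permutation-based split-half reliability for a participants x trials
    score matrix, stepped up with the Spearman-Brown formula."""
    rng = np.random.default_rng(seed)
    n_trials = scores.shape[1]
    estimates = []
    for _ in range(n_splits):
        perm = rng.permutation(n_trials)
        half_a = scores[:, perm[: n_trials // 2]].mean(axis=1)
        half_b = scores[:, perm[n_trials // 2 :]].mean(axis=1)
        r = np.corrcoef(half_a, half_b)[0, 1]
        estimates.append(2 * r / (1 + r))  # Spearman-Brown correction
    return float(np.mean(estimates))
```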
Olvera Astivia, Oscar Lorenzo; Kroc, Edward; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020
Simulation findings concerning the distributional assumptions of coefficient alpha are contradictory. To provide a more principled theoretical framework, this article relies on the Fréchet-Hoeffding bounds to show that the distribution of the items plays a role in the estimation of correlations and covariances. More specifically, these bounds…
Descriptors: Test Items, Test Reliability, Computation, Correlation
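For reference, the Fréchet-Hoeffding bounds the authors invoke are the classical constraints on any joint distribution $H$ with fixed margins $F$ and $G$:

$$\max\{F(x) + G(y) - 1,\; 0\} \;\le\; H(x,y) \;\le\; \min\{F(x),\, G(y)\},$$

which is why the marginal distributions of two items limit the covariances, and hence the correlations, that can enter an alpha calculation.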
Fatih Orcan – International Journal of Assessment Tools in Education, 2023
Among available coefficients, Cronbach's alpha and McDonald's omega are the most commonly used for reliability estimation. Alpha uses inter-item correlations, while omega is based on a factor analysis result. This study uses simulated ordinal data sets to test whether alpha and omega produce different estimates. Their performances were compared according to the…
Descriptors: Statistical Analysis, Monte Carlo Methods, Correlation, Factor Analysis
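A minimal simulation sketch in the spirit of this comparison (the loading values, cut points, and sample size are assumptions, and a real analysis would estimate the loadings with a factor-analysis routine rather than reuse the generating values):

```python
import numpy as np

rng = np.random.default_rng(42)
n, k = 1000, 6
loadings = np.full(k, 0.7)  # one-factor generating model (assumed values)

# Simulate continuous one-factor responses, then cut into 5 ordinal categories.
theta = rng.standard_normal(n)
noise = rng.standard_normal((n, k)) * np.sqrt(1 - loadings**2)
ordinal = np.digitize(theta[:, None] * loadings + noise, [-1.5, -0.5, 0.5, 1.5])

# Cronbach's alpha from item variances and summed-score variance.
alpha = k / (k - 1) * (1 - ordinal.var(axis=0, ddof=1).sum()
                       / ordinal.sum(axis=1).var(ddof=1))

# McDonald's omega from loadings and uniquenesses (standardized form).
omega = loadings.sum() ** 2 / (loadings.sum() ** 2 + (1 - loadings**2).sum())
print(f"alpha = {alpha:.3f}, omega = {omega:.3f}")
```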
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
The Pearson product-moment correlation between item g and test score X, known as the item-test or item-total correlation ("Rit"), and the item-rest correlation ("Rir") are two of the most widely used classical estimators of item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…
Descriptors: Correlation, Test Items, Scores, Difficulty Level
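A small sketch of the two estimators (the function name and persons-by-items layout are assumptions; the formulas themselves are the standard definitions):

```python
import numpy as np

def item_discrimination(scores):
    """Item-test (Rit) and item-rest (Rir) correlations for a
    persons x items matrix of scored item responses."""
    total = scores.sum(axis=1)
    rit = [np.corrcoef(scores[:, g], total)[0, 1]
           for g in range(scores.shape[1])]
    rir = [np.corrcoef(scores[:, g], total - scores[:, g])[0, 1]
           for g in range(scores.shape[1])]
    return np.array(rit), np.array(rir)
```

Because Rir removes the item's own contribution from the total, it typically runs lower than Rit.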
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al. (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Welch, Adam C.; Karpen, Samuel C.; Cross, L. Brian; LeBlanc, Brandie N. – Research & Practice in Assessment, 2017
The aims of this study were to determine faculty's ability to accurately and reliably categorize exam questions using Bloom's Taxonomy, and whether modified versions would improve the accuracy and reliability. Faculty experience and affiliation with a health sciences discipline were also considered. Faculty at one university were asked to categorize 30…
Descriptors: College Faculty, Medical School Faculty, Health Sciences, Test Items
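The abstract does not name a specific statistic, but chance-corrected agreement such as Cohen's kappa is a standard way to quantify how reliably raters assign categories like Bloom's levels; a minimal sketch:

```python
import numpy as np

def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two categorical ratings,
    e.g. one faculty member's Bloom's-level labels vs. a reference key."""
    a, b = np.asarray(rater_a), np.asarray(rater_b)
    cats = np.union1d(a, b)
    p_obs = np.mean(a == b)
    p_exp = sum(np.mean(a == c) * np.mean(b == c) for c in cats)
    return (p_obs - p_exp) / (1 - p_exp)
```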
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
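As a toy illustration of what CPT parameters do in a scoring model (a hypothetical two-node network, not the procedure developed in the dissertation):

```python
# Hypothetical network: latent Skill -> observed item response.
# The CPT entries below are exactly the parameters a fitting
# procedure would adjust to match observed response data.
p_skill = {"master": 0.5, "nonmaster": 0.5}        # prior over the latent node
p_correct = {"master": 0.85, "nonmaster": 0.25}    # CPT: P(correct | skill)

def posterior_after_correct_response():
    """P(skill | item answered correctly), by Bayes' rule."""
    joint = {s: p_skill[s] * p_correct[s] for s in p_skill}
    z = sum(joint.values())
    return {s: v / z for s, v in joint.items()}

print(posterior_after_correct_response())
```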
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Kelly, William E.; Daughtry, Don – College Student Journal, 2018
This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…
Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores
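An illustrative version of this kind of item-reduction loop (the stopping rule and function name are assumptions; the authors' exact procedure may differ):

```python
import numpy as np

def shorten_scale(responses, target_k):
    """Iteratively drop the item with the lowest corrected item-total
    correlation until target_k items remain."""
    keep = list(range(responses.shape[1]))
    while len(keep) > target_k:
        sub = responses[:, keep]
        total = sub.sum(axis=1)
        rirs = [np.corrcoef(sub[:, j], total - sub[:, j])[0, 1]
                for j in range(len(keep))]
        keep.pop(int(np.argmin(rirs)))
    return keep  # column indices of the retained items
```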
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature, and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
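One commonly reported quantity of the kind discussed here is the empirical (marginal) reliability of EAP ability estimates,

$$\bar\rho \;=\; \frac{\operatorname{Var}(\hat\theta)}{\operatorname{Var}(\hat\theta) + \overline{\mathrm{SE}^2}},$$

where $\operatorname{Var}(\hat\theta)$ is the variance of the ability estimates and $\overline{\mathrm{SE}^2}$ is the mean squared standard error (a standard expression, not a formula quoted from the paper).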
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane examines how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
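Automated-scoring comparisons of this kind typically rest on an agreement statistic such as quadratically weighted kappa; a minimal sketch (function name and integer-score layout assumed):

```python
import numpy as np

def quadratic_weighted_kappa(a, b, n_cats):
    """Quadratically weighted kappa between two integer score vectors
    in {0, ..., n_cats - 1}, e.g. an automated rater vs. a human rater."""
    a, b = np.asarray(a), np.asarray(b)
    observed = np.zeros((n_cats, n_cats))
    for i, j in zip(a, b):
        observed[i, j] += 1
    observed /= len(a)
    expected = np.outer(np.bincount(a, minlength=n_cats),
                        np.bincount(b, minlength=n_cats)) / len(a) ** 2
    weights = np.subtract.outer(np.arange(n_cats), np.arange(n_cats)) ** 2
    return 1 - (weights * observed).sum() / (weights * expected).sum()
```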
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2016
This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…
Descriptors: Test Reliability, Test Format, Test Items, Differences