ERIC - Search Results

Publication Date

In 2025	0
Since 2024	7
Since 2021 (last 5 years)	25
Since 2016 (last 10 years)	64
Since 2006 (last 20 years)	166

Descriptor

Reliability	242
Test Items	242
Validity	74
Scores	67
Test Construction	59
Correlation	54
Foreign Countries	52
Item Response Theory	48
Psychometrics	42
Statistical Analysis	41
Item Analysis	40
Factor Analysis	35
Comparative Analysis	34
Difficulty Level	31
Measurement	25
Classification	22
Scoring	22
Models	21
Error of Measurement	20
Goodness of Fit	20
Questionnaires	19
Computation	18
Likert Scales	18
Mathematics Tests	18
Student Evaluation	18
More ▼

Publication Type

Journal Articles	179
Reports - Research	155
Reports - Evaluative	58
Speeches/Meeting Papers	29
Reports - Descriptive	14
Tests/Questionnaires	11
Dissertations/Theses -…	8
Numerical/Quantitative Data	8
Opinion Papers	3
Collected Works - General	2
Reports - General	2
Books	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Non-Classroom	1
Historical Materials	1
Information Analyses	1
More ▼

Education Level

Higher Education	41
Postsecondary Education	38
Secondary Education	21
Elementary Education	20
High Schools	13
Middle Schools	10
Elementary Secondary Education	9
Early Childhood Education	7
Junior High Schools	7
Grade 5	5
Grade 8	5
Intermediate Grades	5
Grade 4	4
Primary Education	4
Grade 3	3
Grade 6	3
Grade 9	3
Kindergarten	3
Grade 10	2
Grade 11	2
Grade 2	2
Grade 7	2
Preschool Education	2
Grade 1	1
Grade 12	1
More ▼

Audience

Researchers	4
Teachers	1

Location

Turkey	12
Australia	6
Canada	4
Germany	3
Jordan	3
United Kingdom	3
United Kingdom (England)	3
United States	3
Florida	2
India	2
Israel	2
Maryland	2
Netherlands	2
New York	2
Spain	2
Austria	1
Belgium	1
California	1
Canada (Toronto)	1
Chile	1
China	1
Cyprus	1
Czech Republic	1
Denmark	1
Egypt	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Showing 1 to 15 of 242 results Save | Export

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

The Impact of Inconsistent Responders to Mixed-Worded Scales on Inferences in International Large-Scale Assessments

Peer reviewed

Direct link

Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022

Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…

Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Direct link

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

A Simulation Study on the Performance of Different Reliability Estimation Methods

Peer reviewed

Direct link

Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021

The accuracy of certain internal consistency estimators have been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…

Descriptors: Reliability, Computation, Accuracy, Sample Size

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Violation of Conditional Independence in the Many-Facets Rasch Model

Peer reviewed

Direct link

DeMars, Christine E. – Applied Measurement in Education, 2021

Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…

Descriptors: Item Response Theory, Test Items, Ability, Scores

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Practical Randomly Selected Question Exam Design to Address Replicated and Sequential Questions in Online Examinations

Peer reviewed

Direct link

Elkhatat, Ahmed M. – International Journal for Educational Integrity, 2022

Examinations form part of the assessment processes that constitute the basis for benchmarking individual educational progress, and must consequently fulfill credibility, reliability, and transparency standards in order to promote learning outcomes and ensure academic integrity. A randomly selected question examination (RSQE) is considered to be an…

Descriptors: Integrity, Monte Carlo Methods, Credibility, Reliability

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Relating Pictorial and Verbal Forms of Assessments of the Particle Model of Matter in Two Communities of Students

Peer reviewed

Direct link

Langbeheim, Elon; Akaygun, Sevil; Adadan, Emine; Hlatshwayo, Manzini; Ramnarain, Umesh – International Journal of Science and Mathematics Education, 2023

Linking assessment and curriculum in science education, particularly within the topic of matter and its changes, is often taken for granted. Some of the fundamental elements of the assessment, such as the choice of wording and visual representations, as well as its relation to the curricular sequence, remain understudied. In addition, very few…

Descriptors: Student Evaluation, Evaluation Methods, Science Education, Test Items

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Development of Ecology Achievement Test for Secondary School Students

Peer reviewed
PDF on ERIC

Download full text

Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024

This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…

Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests

Development and Application of an Instrument for Assessing Upper-Secondary School Biology Teachers' Pedagogical Content Knowledge of Scientific Thinking

Peer reviewed
PDF on ERIC

Download full text

Shan Lin; Jian Wang – Journal of Baltic Science Education, 2024

Scientific thinking constitutes a vital component of scientific competencies, crucial for citizens to adapt to the evolving societal landscape. To cultivate students' scientific thinking, teachers should possess an adequate professional knowledge foundation, which encompasses pedagogical content knowledge (PCK). Assessing teachers' PCK of…

Descriptors: Secondary School Teachers, Teacher Attitudes, Biology, Pedagogical Content Knowledge

Using Rasch Analysis to Examine Raters' Expertise Turkish Teacher Candidates' Competency Levels in Writing Different Types of Test Items

Peer reviewed
PDF on ERIC

Download full text

Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022

The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…

Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 17

Educational and Psychological…	19
ETS Research Report Series	10
Applied Measurement in…	9
Applied Psychological…	9
ProQuest LLC	8
International Journal of…	6
Journal of Educational…	6
Educational Measurement:…	5
Journal of Education and…	5
Journal of Educational and…	4
Online Submission	4
Educational Research and…	3
Grantee Submission	3
Journal of Psychoeducational…	3
Measurement:…	3
Psychometrika	3
Advances in Health Sciences…	2
Assessment & Evaluation in…	2
Assessment in Education:…	2
CBE - Life Sciences Education	2
Educational Assessment	2
Educational Research and…	2
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Intelligence	2
More ▼

Lee, Guemin	4
Plake, Barbara S.	4
Barnette, J. Jackson	3
Guo, Hongwen	3
Haberman, Shelby J.	3
Impara, James C.	3
Kolen, Michael J.	3
Liu, Jinghua	3
Almehrizi, Rashid S.	2
Bezruczko, Nikolaus	2
Bradshaw, Laine	2
Braeken, Johan	2
Bramley, Tom	2
Braun, Henry I.	2
Brennan, Robert L.	2
DeMars, Christine E.	2
Douglas, Jeff	2
Emons, Wilco H. M.	2
Fan, Xitao	2
Frisbie, David A.	2
Gustafsson, Jan-Eric	2
Kannan, Priya	2
Kim, Sooyeon	2
Meijer, Rob R.	2
More ▼

SAT (College Admission Test)	5
Program for International…	4
Trends in International…	3
Progress in International…	2
Raven Progressive Matrices	2
Test of English as a Foreign…	2
ACT Assessment	1
Acculturation Rating Scale…	1
Advanced Placement…	1
Armed Services Vocational…	1
Center for Epidemiologic…	1
Eysenck Personality Inventory	1
International English…	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
Marlowe Crowne Social…	1
Mayer Salovey Caruso…	1
National Assessment of…	1
National Longitudinal Study…	1
National Longitudinal Study…	1
Peabody Developmental Motor…	1
Peabody Individual…	1
Peabody Picture Vocabulary…	1
Rosenberg Self Esteem Scale	1
Schools and Staffing Survey…	1
More ▼