ERIC - Search Results

Publication Date

In 2025	3
Since 2024	27
Since 2021 (last 5 years)	92
Since 2016 (last 10 years)	265
Since 2006 (last 20 years)	1867

Descriptor

Test Reliability	1499
Reliability	1416
Test Validity	1053
Validity	622
Foreign Countries	531
Evaluation Methods	530
Measures (Individuals)	530
Interrater Reliability	514
Psychometrics	511
Test Construction	444
Scores	363
Factor Analysis	331
Correlation	327
Higher Education	298
Comparative Analysis	286
Questionnaires	253
Scoring	253
Elementary Secondary Education	246
Student Evaluation	243
Test Items	228
Measurement Techniques	225
Construct Validity	191
Research Methodology	190
Academic Achievement	188
Rating Scales	188
More ▼

Education Level

Higher Education	385
Elementary Secondary Education	193
Postsecondary Education	186
Elementary Education	152
Secondary Education	122
High Schools	120
Middle Schools	62
Early Childhood Education	57
Grade 5	50
Grade 3	39
Grade 4	39
Preschool Education	39
Grade 1	36
Grade 6	35
Grade 8	35
Adult Education	34
Kindergarten	34
Junior High Schools	30
Grade 7	27
Grade 2	26
Primary Education	20
Intermediate Grades	18
Grade 9	14
Grade 10	7
Grade 11	7
More ▼

Audience

Researchers	95
Practitioners	84
Teachers	32
Administrators	28
Policymakers	15
Counselors	5
Community	2
Parents	2
Students	2
Media Staff	1
Support Staff	1
More ▼

Location

Australia	62
United Kingdom	49
Canada	47
United States	44
California	41
United Kingdom (England)	30
Turkey	29
Florida	26
Texas	26
China	25
Taiwan	24
Netherlands	23
New York	21
Germany	15
Pennsylvania	15
Spain	14
Hong Kong	12
Illinois	12
Michigan	12
North Carolina	12
New Zealand	11
Nigeria	11
South Africa	11
South Korea	11
Tennessee	11
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	71
Individuals with Disabilities…	12
Race to the Top	8
Individuals with Disabilities…	3
American Recovery and…	1
Americans with Disabilities…	1
Debra P v Turlington	1
Education Amendments 1972	1
Education Amendments 1974	1
Education Consolidation…	1
Elementary and Secondary…	1
Emergency School Aid Act 1972	1
Improving Americas Schools…	1
Kentucky Education Reform Act…	1
Reading Excellence Act	1
Rehabilitation Act 1973…	1
Safe and Drug Free Schools…	1
Title IX Education Amendments…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	5
Meets WWC Standards with or without Reservations	5

Reports - Evaluative X

Showing 1 to 15 of 3,311 results Save | Export

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Educating with Style? Rethinking the Pedagogical Significance of (In)consistency between Calvino and Deleuze

Peer reviewed

Direct link

Wiebe Koopal – Studies in Philosophy and Education, 2024

In this paper I try to 'rethink' consistency as an educational quality for the 3rd millennium, following Italo Calvino's choice to take it up in his lecture series Memos for the Next Millennium, and despite the fact that the (final) lecture devoted to this quality remained unwritten. After reflecting on how consistency already plays a certain role…

Descriptors: Reliability, Education, Instruction, Lecture Method

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Direct link

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers, a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

Chasing Rainbows? Ofsted's Quest for Inter-Inspector Reliability

Peer reviewed

Direct link

Pearson, Terry – FORUM: for promoting 3-19 comprehensive education, 2023

Ofsted has frequently defended the judgements made during inspections by claiming that inspection ratings are reliable, as shown by the results from the collection of studies the inspectorate has conducted. I outline the inspectorate's view of reliability and problematise the studies that it has carried out, noting that these provide insufficient…

Descriptors: Inspection, Interrater Reliability, Decision Making, Value Judgment

"Rater Training" Re-Imagined for Work-Based Assessment in Medical Education

Peer reviewed

Direct link

Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023

In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…

Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training

Inferential Theories of Retrospective Confidence

Peer reviewed

Direct link

Bennett L. Schwartz – Metacognition and Learning, 2024

Retrospective confidence refers to the phenomenological experience of the level of certainty that retrieved information is, in fact, correct. Retrospective confidence judgments are examined across a range of sub-disciplines in psychology from perception to memory research, and in education and legal applications. This paper focuses on…

Descriptors: Memory, Recall (Psychology), Cues, Learning Processes

Statistical Inference for G-Indices of Agreement

Peer reviewed

Direct link

Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022

The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…

Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design

Can Attempts to Make Schools More Reliable Render Them Less Trustworthy?

Peer reviewed

Direct link

Atli Harðarson – Educational Philosophy and Theory, 2024

This paper has two aims. One is to draw a distinction between two types of trust. The other is to argue for its applicability in academic discourse on educational policies. One of the two types of trust is "ethical trust" that rests on beliefs about others' ethical virtues. The other is "institutional trust" that typically…

Descriptors: Trust (Psychology), Ethics, Reliability, Schools

Drawing on Siedentop's Legacy to Revisit Systematic Observation of Teaching: Identifying and Addressing Key Validity and Reliability Questions

Peer reviewed

Direct link

Tsangaridou, Niki; Charalambous, Charalambos Y. – Quest, 2023

Focusing on systematic observation, one of the most potent methods of studying teaching quality, represents one of the numerous contributions of Daryl Siedentop to the profession. While he had a clear focus on issues of validity and reliability concerning systematic observation, over the past decades, attention to such issues appears to have…

Descriptors: Physical Education Teachers, Observation, Validity, Reliability

Measuring Returns to Experience Using Supervisor Ratings of Observed Performance: The Case of Classroom Teachers

Peer reviewed

Direct link

Courtney Bell; Jessalynn James; Eric S. Taylor; James Wyckoff – Journal of Policy Analysis and Management, 2025

We study the returns to experience in teaching, estimated using supervisor ratings from classroom observations. We describe the assumptions required to interpret changes in observation ratings over time as the causal effect of experience on performance. We compare two difference-in-differences strategies: the two-way fixed effects estimator common…

Descriptors: Lesson Observation Criteria, Teaching Experience, Teacher Evaluation, Supervisors

Evaluating the Evaluators: Analysis of the Structure and Processes of Seven United States Health Professions Education Accreditors

Peer reviewed

Direct link

Robert H. Eaglen; Steven J. Durning; Holly S. Meyer; Christopher S. Candler – Quality in Higher Education, 2024

Higher education accreditation has spread internationally as a vehicle for quality assurance and improvement but is strongly influenced by accreditation practices in the United States. The organisational structure and processes of seven United States health professions accreditors were analysed to identify common characteristics that reflect…

Descriptors: Accreditation (Institutions), Quality Assurance, Evaluators, Evaluation Methods

Assessing the Assessment: Evidence of Reliability and Validity in the edTPA

Peer reviewed

Direct link

Gitomer, Drew H.; Martínez, José Felipe; Battey, Dan; Hyland, Nora E. – American Educational Research Journal, 2021

The Educative Teacher Performance Assessment (edTPA) is a system of standardized portfolio assessments of teaching performance mandated for use by educator preparation programs in 18 states, and approved in 21 others, as part of initial certification for preservice teachers. Because of the high stakes involved for examinees, it is critical that…

Descriptors: Evaluation, Performance Based Assessment, Test Reliability, Test Validity

On the Pitfalls of Estimating and Using Standardized Reliability Coefficients

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2021

The population discrepancy between unstandardized and standardized reliability of homogeneous multicomponent measuring instruments is examined. Within a latent variable modeling framework, it is shown that the standardized reliability coefficient for unidimensional scales can be markedly higher than the corresponding unstandardized reliability…

Descriptors: Test Reliability, Computation, Measures (Individuals), Research Problems

The Discussions of Positivism and Interpretivism

Download full text

Junjie, Ma; Yingxin, Ma – Online Submission, 2022

This paper aims to explore the philosophical theoretical foundations of two basic research paradigms, namely positivism and interpretivism. In the discussion process, literature in the relevant fields including academic papers and books is reviewed and used as support for the analysis. Firstly, the paper explores the differences between the…

Descriptors: Ideology, Bias, Credibility, Research Methodology

The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores

Peer reviewed

Direct link

Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022

Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…

Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 221

Educational and Psychological…	124
Journal of Psychoeducational…	110
Psychological Assessment	60
Online Submission	56
Applied Psychological…	51
Alliance for Excellent…	50
Journal of Autism and…	41
Applied Measurement in…	39
Assessment	38
Journal of Educational…	36
Research in Developmental…	36
Canadian Journal of School…	25
Educational Measurement:…	25
Assessment & Evaluation in…	23
Language Testing	23
Academic Medicine	21
Behavioral Research and…	21
Social Indicators Research	21
Advances in Health Sciences…	20
Psychometrika	20
Research on Social Work…	20
School Psychology Review	19
Assessment for Effective…	18
Assessment and Evaluation in…	17
Measurement and Evaluation in…	17
More ▼

Tindal, Gerald	25
Alonzo, Julie	20
Feldt, Leonard S.	14
Lai, Cheng-Fei	14
Park, Bitnara Jasmine	13
Matson, Johnny L.	12
Raykov, Tenko	12
Anderson, Daniel	11
Brennan, Robert L.	9
Marsh, Herbert W.	9
McCrimmon, Adam W.	9
Nicewander, W. Alan	8
Reckase, Mark D.	8
Thompson, Bruce	8
Erford, Bradley T.	7
Irvin, P. Shawn	7
Lunz, Mary E.	7
Alsawalmeh, Yousef M.	6
Baker, Eva L.	6
Epstein, Michael H.	6
Lee, Guemin	6
Marcoulides, George A.	6
Wainer, Howard	6
Zimmerman, Donald W.	6
More ▼

Reports - Evaluative	3311
Journal Articles	2545
Speeches/Meeting Papers	336
Information Analyses	116
Opinion Papers	74
Numerical/Quantitative Data	68
Tests/Questionnaires	67
Reports - Research	47
Reports - Descriptive	23
Guides - Non-Classroom	20
Book/Product Reviews	12
Books	8
Collected Works - General	7
Collected Works - Proceedings	5
Guides - Classroom - Teacher	4
Collected Works - Serials	3
Dissertations/Theses -…	2
Collected Works - Serial	1
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - General	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Wechsler Intelligence Scale…	24
National Assessment of…	21
SAT (College Admission Test)	21
Wechsler Adult Intelligence…	12
Beck Depression Inventory	11
General Educational…	11
Program for International…	11
Stanford Achievement Tests	10
Trends in International…	10
ACT Assessment	9
Minnesota Multiphasic…	9
Test of English as a Foreign…	9
Raven Progressive Matrices	8
Advanced Placement…	7
Conners Rating Scales	7
Peabody Picture Vocabulary…	7
Woodcock Johnson Tests of…	7
Adaptive Behavior Scale	6
Bayley Scales of Infant…	6
Behavior Assessment System…	6
Child Behavior Checklist	6
Childhood Autism Rating Scale	6
Graduate Record Examinations	6
Myers Briggs Type Indicator	6
National Household Education…	6
More ▼