ERIC - Search Results

Publication Date

In 2025	67
Since 2024	901
Since 2021 (last 5 years)	3415
Since 2016 (last 10 years)	7595
Since 2006 (last 20 years)	14761

Descriptor

Test Reliability	14547
Test Validity	9865
Reliability	9544
Foreign Countries	6751
Test Construction	4608
Validity	4120
Measures (Individuals)	3750
Factor Analysis	3720
Psychometrics	3393
Interrater Reliability	3054
Correlation	3009
Evaluation Methods	2674
Statistical Analysis	2527
Higher Education	2475
Questionnaires	2412
Scores	2324
College Students	2141
Student Attitudes	2060
Comparative Analysis	1930
Factor Structure	1755
Student Evaluation	1647
Rating Scales	1580
Measurement Techniques	1539
Elementary Secondary Education	1478
Test Items	1467
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	18597
Reports - Research	16784
Reports - Evaluative	3311
Speeches/Meeting Papers	1851
Reports - Descriptive	1526
Tests/Questionnaires	1520
Information Analyses	923
Opinion Papers	645
Dissertations/Theses -…	625
Guides - Non-Classroom	323
Numerical/Quantitative Data	249
Books	118
Guides - Classroom - Teacher	80
Reports - General	71
Guides - General	56
Reference Materials -…	53
Collected Works - General	39
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Dissertations/Theses	21
ERIC Digests in Full Text	20
Guides - Classroom - Learner	15
More ▼

Education Level

Higher Education	4474
Postsecondary Education	3487
Secondary Education	2126
Elementary Education	2086
High Schools	1018
Middle Schools	980
Elementary Secondary Education	851
Early Childhood Education	833
Junior High Schools	678
Primary Education	403
Intermediate Grades	374
Preschool Education	374
Grade 5	325
Grade 8	322
Grade 4	305
Grade 6	291
Grade 7	273
Grade 3	263
Kindergarten	257
Adult Education	205
Grade 1	197
Grade 2	165
Grade 9	152
Grade 10	137
Grade 11	101
More ▼

Audience

Researchers	703
Practitioners	447
Teachers	204
Administrators	121
Policymakers	62
Counselors	42
Students	37
Parents	11
Community	7
Media Staff	5
Support Staff	5
More ▼

Location

Turkey	1246
Australia	428
Canada	371
China	329
United States	264
United Kingdom	246
Taiwan	221
Netherlands	217
Indonesia	214
California	208
Spain	201
United Kingdom (England)	188
Germany	187
Malaysia	164
Florida	159
Hong Kong	159
Nigeria	146
Iran	145
Texas	130
South Korea	124
India	117
New York	117
Pennsylvania	109
South Africa	107
Greece	103
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 631 to 645 of 26,352 results Save | Export

Evaluating the Evaluators: A Comparative Study of AI and Teacher Assessments in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024

This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…

Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests

Translation and Validation of Music Performance Self-Efficacy Scale into Turkish

Peer reviewed

Direct link

Alper Börekci; Esra Dalkiran; Zeki Nacakci – International Journal of Music Education, 2024

The Music Performance Self-Efficacy Scale (MPSES) is an important scale designed to reflect the four sources of self-efficacy of Bandura by Zelenak, and has been used in many studies of music education in the international literature in recent years. This study was carried out to ensure the validity and reliability of the Turkish translation of…

Descriptors: Translation, Test Validity, Test Reliability, Music

All Types of Experience Are Equal, but Some Are More Equal: The Effect of Different Types of Experience on Rater Severity and Rater Consistency

Peer reviewed

Direct link

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to different types of rater experience over a long period of time. The article is based on longitudinal data collected from 2009 to 2019 from the second language Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. The study investigated…

Descriptors: Foreign Countries, Interrater Reliability, Error of Measurement, Experience

Developing and Evaluating an Assessment of Preschoolers' Science and Engineering Knowledge

Direct link

Lauren Westerberg – ProQuest LLC, 2024

A major challenge to promoting effective early science and engineering education is the lack of reliable and validated assessments that align with current educational guidelines for science and engineering. Existing early science and engineering assessments either cover a narrow range of concepts and practices and/or are not designed in a way to…

Descriptors: Preschool Curriculum, Preschool Education, Preschool Evaluation, Preschool Tests

Weighting Opt-In Surveys to Accommodate the Effects of Nonresponse

Peer reviewed

Direct link

Ashani Jayasekera; Laura Stapleton – Society for Research on Educational Effectiveness, 2024

Background: A growing number of surveys are conducted online where respondents can choose to complete the questionnaire (Lehdonvirta et al., 2020). As respondents are self-selected, there is potential that the respondents will not be an accurate representation of the population. For example, white people are disproportionately more likely to…

Descriptors: Online Surveys, Test Construction, Test Validity, Test Reliability

Accuracy Assessment of Two Electromagnetic Articulographs: Northern Digital Inc. WAVE and Northern Digital Inc. VOX

Peer reviewed

Direct link

Rebernik, Teja; Jacobi, Jidde; Tiede, Mark; Wieling, Martijn – Journal of Speech, Language, and Hearing Research, 2021

Purpose: This study compares two electromagnetic articulographs manufactured by Northern Digital, Inc.: the NDI Wave System (from 2008) and the NDI Vox-EMA System (from 2020). Method: Four experiments were completed: (1) comparison of statically positioned sensors; (2) tracking dynamic movements of sensors manipulated using a motor-driven LEGO…

Descriptors: Measurement Equipment, Articulation (Speech), Accuracy, Reliability

Does Reviewing Experience Reduce Disagreement in Proposals Evaluation? Insights from Marie Sklodowska-Curie and COST Actions

Peer reviewed

Direct link

Seeber, Marco; Vlegels, Jef; Reimink, Elwin; Marusic, Ana; Pina, David G. – Research Evaluation, 2021

We have limited understanding of why reviewers tend to strongly disagree when scoring the same research proposal. Thus far, research that explored disagreement has focused on the characteristics of the proposal or the applicants, while ignoring the characteristics of the reviewers themselves. This article aims to address this gap by exploring…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Research Proposals

Reviewing the Test Reviews: Quality Judgments and Reviewer Agreements in the Mental Measurements Yearbook

Peer reviewed

Direct link

Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021

Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…

Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing

The Problem of Reliability of Information in the Global Network

Peer reviewed
PDF on ERIC

Download full text

Kazimi, Parviz Firudin Oqlu – Journal of Practical Studies in Education, 2021

The reliability of information in the global information space is one of the most important problems of globalization. The credibility of various information resources is currently being studied and considered in different ways. In some cases, the problem of the reliability of information can be assessed as harmful and dangerous. This article,…

Descriptors: Information Sources, Reliability, Credibility, Classification

Examining Inter-Rater Reliability of Evaluators Judging Teacher Performance: Proposing an Alternative to Cohen's Kappa. CEME Technical Report. CEMETR-2021-06

Download full text

Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021

The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…

Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods

Development and Validation of a 21st Century Skills Assessment: Using an Iterative Multimethod Approach

Peer reviewed

Direct link

Sondergeld, Toni A.; Johnson, Carla C. – School Science and Mathematics, 2019

In response to the call for more rigorously validated educational assessments, this study used an iterative multimethod validation process to develop and validate outcomes from the 21st Century Skills Assessment global rating scale. Qualitative and quantitative data sources were used to inform four types of validity evidence: content, response…

Descriptors: 21st Century Skills, Test Construction, Test Validity, Educational Assessment

The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments. REL 2020-039

Peer reviewed
PDF on ERIC

Download full text

Gersten, Russell; Jayanthi, Madhavi; Newman-Gonchar, Rebecca; Anderson, Daniel; Spallone, Samantha; Taylor, Mary Jo – Regional Educational Laboratory Southeast, 2020

Several school districts in Georgia use two teacher-administered diagnostic assessments of student knowledge of mathematics as part of their multi-tiered system of support in grades K-8: the Global Strategy Stage (GloSS; New Zealand Ministry of Education, 2012) and the Individual Knowledge Assessment of Number (IKAN; New Zealand Ministry of…

Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity

The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments. Appendixes. REL 2020-039

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Southeast, 2020

This document are the appendixes for the report, "The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments." Rather than relying on occasional testimonials from the field, decisions about using diagnostic assessments across the state should be based on psychometric data from an…

Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity

The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments. Study Snapshot. REL 2020-039

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Southeast, 2020

Teachers need to assess their students' current level of mathematical understanding to provide appropriate interventions for students who are struggling. Several school districts in Georgia currently use two assessments for this purpose--the Global Strategy Stage (GloSS) and the Individual Knowledge Assessment of Number (IKAN). The IKAN is…

Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity

Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory

Peer reviewed

Direct link

Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020

Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…

Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | ... | 1757

Educational and Psychological…	810
ProQuest LLC	612
Journal of Psychoeducational…	378
Online Submission	324
Journal of Educational…	242
Measurement and Evaluation in…	230
Journal of Autism and…	219
Psychology in the Schools	210
Psychological Assessment	180
Grantee Submission	177
Journal of Speech, Language,…	170
Measurement in Physical…	161
Applied Psychological…	149
Assessment for Effective…	134
Journal of Consulting and…	131
Educational Research and…	130
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Assessment & Evaluation in…	118
Language Testing	115
International Journal of…	112
Applied Measurement in…	111
ETS Research Report Series	101
Assessment	100
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	43
Race to the Top	27
Elementary and Secondary…	19
Every Student Succeeds Act…	19
Elementary and Secondary…	15
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Education Consolidation…	4
Education for All Handicapped…	4
Head Start	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	173
Peabody Picture Vocabulary…	87
SAT (College Admission Test)	85
Test of English as a Foreign…	78
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	62
Program for International…	58
Child Behavior Checklist	57
National Assessment of…	55
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
ACT Assessment	49
Beck Depression Inventory	48
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	44
Autism Diagnostic Observation…	43
Motivated Strategies for…	43
Behavior Assessment System…	42
Raven Progressive Matrices	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Kaufman Assessment Battery…	38
Vineland Adaptive Behavior…	36
More ▼