ERIC - Search Results

Publication Date

In 2025	67
Since 2024	212
Since 2021 (last 5 years)	703
Since 2016 (last 10 years)	1248
Since 2006 (last 20 years)	1684

Descriptor

Test Reliability	1706
Test Validity	1279
Foreign Countries	950
Test Construction	700
College Students	569
Factor Analysis	502
Undergraduate Students	436
Psychometrics	389
Measures (Individuals)	343
Student Attitudes	308
Preservice Teachers	275
Factor Structure	246
Questionnaires	213
Correlation	207
Construct Validity	201
Test Items	190
Scores	161
College Faculty	149
Statistical Analysis	147
Higher Education	142
Likert Scales	136
Student Evaluation	134
Attitude Measures	126
Self Efficacy	123
Evaluation Methods	121

Education Level

Postsecondary Education	1706
Higher Education	1679
Secondary Education	117
High Schools	70
Elementary Education	69
Elementary Secondary Education	40
Early Childhood Education	25
Two Year Colleges	25
Middle Schools	19
Adult Education	16
Junior High Schools	16
Preschool Education	13
Intermediate Grades	6
Grade 10	5
Grade 11	5
Grade 5	5
Grade 12	4
Grade 4	4
Primary Education	4
Grade 7	3
Grade 8	3
Grade 9	3
Kindergarten	3
Adult Basic Education	2
Grade 1	2

Audience

Teachers	4
Administrators	3
Policymakers	3
Practitioners	2
Counselors	1
Researchers	1
Students	1

Location

Turkey	275
China	63
Australia	44
Indonesia	39
Iran	36
Spain	36
Malaysia	33
Germany	30
Canada	29
Taiwan	29
United Kingdom	26
United States	22
Hong Kong	20
South Africa	16
Japan	15
Netherlands	15
Mexico	14
Saudi Arabia	14
India	13
Brazil	12
Italy	12
Nigeria	12
South Korea	12
Texas	12
Chile	11

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1
Title IX Education Amendments…	1
United Nations Convention on…	1

Showing 1 to 15 of 1,706 results

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Engaging Classroom Observation: A Brief Measure of Active Learning in the College Classroom

Peer reviewed

Chase Young; Benjamin Mitchell-Yellin; George Kevin Randall – Active Learning in Higher Education, 2025

The purpose of this study was to develop a valid, reliable, and brief measure of active learning in college classrooms that is cheap and easy to complete and yields results that faculty can easily use to inform their development as instructors. Initial construct and face validity was achieved by modifying existing instruments and creating a draft…

Descriptors: College Faculty, College Students, Active Learning, Classroom Observation Techniques

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Synthesizing Validity and Reliability Evidence for the Draw-A-Scientist Test

Peer reviewed
PDF on ERIC

Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024

The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…

Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures

Different Methods for Assessing Preservice Teachers' Instruction: Why Measures Matter

Peer reviewed

Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024

Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…

Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers with a roadmap for gathering evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

Design of a Simple Rubric to Peer-Evaluate the Teamwork Skills of Engineering Students

Peer reviewed

Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024

Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…

Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork

The Value of Expanding Perspectives on Assessment

Peer reviewed

Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024

In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…

Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods

The Bank Robbery: A Behavioral Observation Exercise for Enhancing Understanding of Reliability

Peer reviewed

Strelan, Peter – Teaching of Psychology, 2022

Background: The concept of reliability is central to conducting--and understanding--research in Psychology. Students' understanding of concepts is strengthened when they learn by applying concepts. Objective: This article describes initial evidence of an activity for teaching reliability. Method: Students watched a short video of a staged bank…

Descriptors: Learning Activities, Psychology, Recall (Psychology), Crime

Is It Actually Reliable? Examining Statistical Methods for Inter-Rater Reliability of a Rubric in Graduate Education

Peer reviewed
PDF on ERIC

Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023

When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…

Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics

Developing and Validating Instruments for Measuring English-as-a-Second/Foreign-Language (L2) Learners' Metaphor Awareness

Peer reviewed

Ting Ma; Lawrence Jun Zhang; Judy M. Parr – Language Awareness, 2025

Studies have shown that raising L2 learners' metaphor awareness contributes to the acquisition of figurative language, which fosters students' development of language skills. However, the instruments measuring metaphor awareness, in the majority of relevant research, did not seem to have undergone proper methodological procedures for checking…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Figurative Language

Reliability of Measuring Constructs in Applied Linguistics Research: A Comparative Study of Domestic and International Graduate Theses

Peer reviewed

Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022

The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…

Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility

Reliability Analysis of Flipped Classroom Lesson Design and Evaluation Rubric

Peer reviewed

Unal, Zafer – Journal of Interactive Learning Research, 2022

Despite over fifteen years of flipped classroom implementation, current literature does not provide any reliable, standardized rubric as a guideline to create or evaluate flipped classroom lessons based on effective flipped classroom design principles. In fact, at the time of this study, when an internet search for existing rubrics was conducted,…

Descriptors: Flipped Classroom, Lesson Plans, Scoring Rubrics, Graduate Students

Examining the Wording Effect: What Are We Measuring?

Peer reviewed

Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025

In this study, the extent to which wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that are semantically reversed, explains 21.5% more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…

Descriptors: Test Items, Factor Structure, Test Reliability, Semantics

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Source

ProQuest LLC	70
Journal of Psychoeducational…	48
International Journal of…	32
Online Submission	30
SAGE Open	30
Measurement and Evaluation in…	25
Physical Review Physics…	22
Education and Information…	20
CBE - Life Sciences Education	18
ETS Research Report Series	18
Educational Research and…	16
Advances in Health Sciences…	15
International Journal of…	15
Measurement in Physical…	15
Assessment & Evaluation in…	14
Educational Sciences: Theory…	14
Eurasian Journal of…	14
International Journal of…	14
International Journal of…	14
Chemistry Education Research…	13
Journal of American College…	13
Learning Environments Research	13
Turkish Online Journal of…	13
International Education…	12
International Journal of…	12

Author

Ward, Phillip	6
Barbera, Jack	5
He, Yaohui	5
Liu, Ou Lydia	5
Lowe, Patricia A.	5
Bretz, Stacey Lowery	4
Bridgeman, Brent	4
Erford, Bradley T.	4
Liu, Xiufeng	4
Nordin, Mohamad Sahari	4
Sherman, Martin F.	4
Sriken, Julie	4
Tsuda, Emi	4
Yin, Hongbiao	4
Çetin, Filiz	4
Abell, Neil	3
Baghaei, Purya	3
Bao, Lei	3
Chan, Cecilia K. Y.	3
Chan, Fong	3
Cooper, Melanie M.	3
Fauzi, Ahmad	3
Flett, Gordon L.	3
Gleason, Jim	3
Greiff, Samuel	3

Publication Type

Journal Articles	1576
Reports - Research	1494
Tests/Questionnaires	218
Reports - Evaluative	84
Dissertations/Theses -…	70
Reports - Descriptive	39
Speeches/Meeting Papers	25
Information Analyses	20
Numerical/Quantitative Data	7
Collected Works - Proceedings	6
Multilingual/Bilingual…	4
Guides - Non-Classroom	3
Books	2
Non-Print Media	2
Reference Materials - General	2
Collected Works - General	1
Guides - General	1
Opinion Papers	1
Translations	1

Assessments and Surveys

SAT (College Admission Test)	13
ACT Assessment	11
Marlowe Crowne Social…	11
Motivated Strategies for…	10
Rosenberg Self Esteem Scale	9
Center for Epidemiologic…	8
Test of English as a Foreign…	8
Beck Depression Inventory	7
Graduate Record Examinations	7
Praxis Series	6
State Trait Anxiety Inventory	6
UCLA Loneliness Scale	5
Beck Anxiety Inventory	4
Multidimensional…	4
National Survey of Student…	4
Program for International…	4
Satisfaction With Life Scale	4
Academic Motivation Scale	3
Brief Symptom Inventory	3
Defining Issues Test	3
Student Adaptation to College…	3
edTPA (Teacher Performance…	3
Behavior Assessment System…	2
Behavioral Risk Factor…	2
Draw a Person Test	2