Showing 1 to 15 of 183 results
Peer reviewed
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown with advances in technology and the flexibility it offers. Online examinations measure students' knowledge and skills. Traditional question papers suffer from inconsistent difficulty levels, arbitrary question allocation, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Peer reviewed
Beseiso, Majdi; Alzubi, Omar A.; Rashaideh, Hasan – Journal of Computing in Higher Education, 2021
E-learning is gradually gaining prominence in higher education, with universities expanding provision and more students enrolling. The effectiveness of automated essay scoring (AES) thus holds strong appeal for universities seeking to manage growing learning interest and to reduce the costs associated with human raters. The growth in…
Descriptors: Automation, Scoring, Essays, Writing Tests
Peer reviewed
Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023
Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…
Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy
Peer reviewed
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
Peer reviewed
Jin, Kuan-Yu; Wang, Wen-Chung – Journal of Educational Measurement, 2018
The Rasch facets model was developed to account for facet data, such as student essays graded by raters, but it accounts for only one kind of rater effect (severity). In practice, raters may exhibit various tendencies such as using middle or extreme scores in their ratings, which is referred to as the rater centrality/extremity response style. To…
Descriptors: Scoring, Models, Interrater Reliability, Computation
Peer reviewed
Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020
More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes for which they were devised. Most DCM research is either methodological, aimed at model development and refinement, or involves retrofitting to existing nondiagnostic tests and, in the latter case, basically…
Descriptors: Classification, Models, Diagnostic Tests, Test Construction
Peer reviewed
Robert Schoen; Lanrong Li; Xiaotong Yang; Ahmet Guven; Claire Riddell – Society for Research on Educational Effectiveness, 2021
Many classroom-observation instruments have been developed (e.g., Gleason et al., 2017; Nava et al., 2019; Sawada et al., 2002), but a very small number of studies published in refereed journals have rigorously examined the quality of the ratings and the instrument using measurement models. For example, Gleason et al. developed a mathematics…
Descriptors: Item Response Theory, Models, Measurement, Mathematics Instruction
Peer reviewed
Srour, F. Jordan; Karkoulian, Silva – International Journal of Social Research Methodology, 2022
The literature provides multiple measures of diversity along a single demographic dimension, but when it comes to studying the interaction of multiple diversity types (e.g., age, gender, and race), the field of usable measures diminishes. We present the use of decision trees as a machine learning technique to automatically identify the…
Descriptors: Diversity, Decision Making, Artificial Intelligence, Correlation
Peer reviewed
Mulligan, Shelley – Journal of Occupational Therapy, Schools & Early Intervention, 2017
Occupational performance assessments of children are essential for guiding occupational therapy intervention and for measuring the effectiveness of occupational therapy services for children. A review of relevant research and of occupational performance assessments designed for children was conducted to determine and describe how the occupational…
Descriptors: Occupational Therapy, Performance Tests, Children, Measurement
Peer reviewed
O'Brien, Edward J.; Cook, Anne E. – Discourse Processes: A Multidisciplinary Journal, 2016
Common to all models of reading comprehension is the assumption that a reader's level of comprehension is heavily influenced by their standards of coherence (van den Broek, Risden, & Husbye-Hartman, 1995). Our discussion focuses on a subcomponent of the readers' standards of coherence: the coherence threshold. We situate this discussion within…
Descriptors: Reading Comprehension, Models, Rhetoric, Reading Ability
Peer reviewed
Climie, Emma; Henley, Laura – British Journal of Special Education, 2016
School-based practitioners are often called upon to provide assessment and recommendations for struggling students. These assessments often open doors to specialised services or interventions and provide opportunities for students to build competencies in areas of need. However, these assessments often fail to highlight the abilities of these…
Descriptors: Student Evaluation, Alternative Assessment, Relevance (Education), Models
Peer reviewed
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's alpha, Feldt-Raju, stratified alpha, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
Peer reviewed
Katz, Daniel S. – Kappa Delta Pi Record, 2016
Including growth models based on student test scores in teacher evaluations effectively holds teachers individually accountable for students improving their test scores. While an attractive policy for state administrators and advocates of education reform, value-added measures have been fraught with problems, and their use in teacher evaluation is…
Descriptors: Teacher Evaluation, Models, Scores, Evaluation Criteria
Peer reviewed
Raykov, Tenko; Dimitrov, Dimiter M.; von Eye, Alexander; Marcoulides, George A. – Educational and Psychological Measurement, 2013
A latent variable modeling method for evaluation of interrater agreement is outlined. The procedure is useful for point and interval estimation of the degree of agreement among a given set of judges evaluating a group of targets. In addition, the approach allows one to test for identity in underlying thresholds across raters as well as to identify…
Descriptors: Interrater Reliability, Models, Statistical Analysis, Computation
Mahalingam, Sheila; Abdollah, Faizal Mohd; Sahib, Shahrin – International Association for Development of the Information Society, 2014
The paper focuses on learner-centric attributes in an m-learning environment as they encounter security measures. In order to build a systematic set of threats and countermeasures for protecting learners, while providing awareness and satisfaction in utilizing the mobile learning system, the security model needs to be overhauled. The brief literature…
Descriptors: Electronic Learning, Computer Security, Reliability, Trust (Psychology)