ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	12

Descriptor

Reliability	76
Item Response Theory	24
Validity	24
Generalizability Theory	23
Scores	15
Test Items	13
Error of Measurement	10
Higher Education	10
Mathematical Models	10
Test Construction	10
Test Theory	10
Elementary Secondary Education	8
Estimation (Mathematics)	8
Foreign Countries	8
Adults	7
Measurement Techniques	7
Correlation	6
Psychometrics	6
Scoring	6
Statistical Analysis	6
Ability	5
Comparative Analysis	5
Content Validity	5
Elementary School Students	5
Evaluation Methods	5
More ▼

Source

Online Submission	7
Educational Testing Service	2
Grantee Submission	1
International Educational…	1
Mathematics Education…	1
Mid-Western Educational…	1
North American Chapter of the…	1

Publication Type

Speeches/Meeting Papers	76
Reports - Research	41
Reports - Evaluative	24
Reports - Descriptive	7
Numerical/Quantitative Data	6
Opinion Papers	5
Information Analyses	2
Tests/Questionnaires	2
Journal Articles	1

Education Level

Higher Education	4
Early Childhood Education	2
Elementary Education	2
Postsecondary Education	2
Elementary Secondary Education	1
High Schools	1
Primary Education	1
Secondary Education	1

Audience

Researchers

Location

South Korea	2
United States	2
Australia	1
California (Berkeley)	1
Canada	1
China	1
Florida	1
France	1
Germany	1
Hawaii	1
Hong Kong	1
Israel	1
Japan	1
Louisiana	1
Ohio	1
South Dakota	1
Spain	1
West Germany	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Reading Excellence Act	1

Assessments and Surveys

Test of English as a Foreign…	2
ACT Assessment	1
Armed Services Vocational…	1
Comprehensive Tests of Basic…	1
Iowa Tests of Basic Skills	1
Stanford Achievement Tests	1
Strong Interest Inventory	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 76 results Save | Export

Knowledge Tracing over Time: A Longitudinal Analysis

Peer reviewed
PDF on ERIC

Download full text

Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023

The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…

Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies

Developing a Tool for Measuring Student Orientations with Respect to Understanding in Mathematical Learning

Peer reviewed
PDF on ERIC

Download full text

Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…

Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability

Validating a Claim-Evidence-Science Idea-Reasoning (CESR) Framework for Use in NGSS Assessment Tasks

Peer reviewed
PDF on ERIC

Download full text

Hardcastle, Joseph M.; Herrmann Abell, Cari F.; DeBoer, George E. – Grantee Submission, 2021

We developed assessment tasks aligned to the Next Generation Science Standards (NGSS) that require students to use argumentation and explanation practices along with disciplinary core ideas and crosscutting concepts to make sense of energy-related phenomena. Scoring rubrics were created to evaluate students' ability to make accurate claims, cite…

Descriptors: Academic Standards, Energy, Scientific Concepts, Persuasive Discourse

Reliability and Validity of Inferences about Teachers Based on Student Scores. William H. Angoff Memorial Lecture Series

Download full text

Haertel, Edward H. – Educational Testing Service, 2013

Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…

Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness

Considerations in the Measurement of Awareness

Download full text

Gafoor, Kunnathodi Abdul – Online Submission, 2012

Awareness is one of the most frequently measured construct by masters' students in education for their dissertation work. The author has observed that within the jurisdiction of his home university frequency of dissertations in education using "Awareness of" some social scientific or educational topic will be anywhere between 10 to…

Descriptors: Metacognition, Perception, Educational Theories, Measures (Individuals)

Entering the "New Frontier" of Mathematics Assessment: Designing and Trialling the PVAT-O (Online)

Download full text

Rogers, Angela – Mathematics Education Research Group of Australasia, 2013

As we move into the 21st century, educationalists are exploring the myriad of possibilities associated with Computer Based Assessment (CBA). At first glance this mode of assessment seems to provide many exciting opportunities in the mathematics domain, yet one must question the validity of CBA and whether our school systems, students and teachers…

Descriptors: Mathematics Tests, Student Evaluation, Computer Assisted Testing, Test Validity

Contemporary Treatment of Reliability and Validity in Educational Assessment

Peer reviewed

Direct link

Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 2010

The focus of this presidential address is on the contemporary treatment of reliability and validity in educational assessment. Highlights on reliability are provided under the classical true-score model using tools from latent trait modeling to clarify important assumptions and procedures for reliability estimation. In addition to reliability,…

Descriptors: Educational Assessment, Validity, Item Response Theory, Reliability

Errors of Measurement, Theory, and Public Policy. William H. Angoff Memorial Lecture Series

Download full text

Kane, Michael – Educational Testing Service, 2010

The 12th annual William H. Angoff Memorial Lecture was presented by Dr. Michael T. Kane, ETS's (Educational Testing Service) Samuel J. Messick Chair in Test Validity and the former Director of Research at the National Conference of Bar Examiners. Dr. Kane argues that it is important for policymakers to recognize the impact of errors of measurement…

Descriptors: Error of Measurement, Scores, Public Policy, Test Theory

A Generalizability Investigation of Cognitive Demand and Rigor Ratings of Items and Standards in an Alignment Study

Download full text

Lombardi, Allison; Seburn, Mary; Conley, David; Snow, Eric – Online Submission, 2010

In alignment studies, expert raters evaluate assessment items against standards and ratings are used to compute various alignment indices. Questions about rater reliability, however, are often ignored or inadequately addressed. This paper reports the results of a generalizability theory study of cognitive demand and rigor ratings of assessment…

Descriptors: Generalizability Theory, Test Items, College Entrance Examinations, Readiness

Development and Validation of the FYI - A Preliminary Report

Download full text

Baker, Harley E.; Styer, Jane S.; Harmon, Lenore; Pommerich, Mary – Online Submission, 2010

Developed for the Armed Services Vocational Aptitude Battery (ASVAB) Career Exploration Program, the Find Your Interests (FYI) inventory was designed to help students learn about their career-related interests. The FYI is a 90-item interest inventory based on Holland's (1973, 1985, 1997) widely accepted theory and taxonomy of career choice. The…

Descriptors: Interest Inventories, Career Choice, High School Students, Career Exploration

Accuracy vs. Validity, Consistency vs. Reliability, and Fairness vs. Absence of Bias: A Call for Quality

Download full text

Lang, W. Steve; Wilkerson, Judy R. – Online Submission, 2008

The National Council for Accreditation of Teacher Education (NCATE, 2002) requires teacher education units to develop assessment systems and evaluate both the success of candidates and unit operations. Because of a stated, but misguided, fear of statistics, NCATE fails to use accepted terminology to assure the quality of institutional evaluative…

Descriptors: State Standards, Validity, Resource Materials, Reliability

Alternative Methods for Calculating Intercoder Reliability in Content Analysis: Kappa, Weighted Kappa and Agreement Charts Procedures.

Kang, Namjun – 1987

If content analysis is to satisfy the requirement of objectivity, measures and procedures must be reliable. Reliability is usually measured by the proportion of agreement of all categories identically coded by different coders. For such data to be empirically meaningful, a high degree of inter-coder reliability must be demonstrated. Researchers in…

Descriptors: Content Analysis, Interrater Reliability, Measurement Techniques, Media Research

Examining Reliability and Validity of Job Analysis Survey Data.

Download full text

Wang, Ning; Wiser, Randall F.; Newman, Larry S. – 1999

Job analysis has played a fundamental role in developing and validating licensure and certification examinations, but research on what constitutes reliable and valid job analysis data is lacking. This paper examines the reliability and validity of job analysis survey results. Generalizability theory and the multi-facet Rasch item response theory…

Descriptors: Generalizability Theory, Goodness of Fit, Item Response Theory, Job Analysis

The DAATS Model: Initial Psychometric and Statistical Findings. A Top Ten Illustration

Download full text

Lang, W. Steve – Online Submission, 2008

The INTASC Principles, when used as the basis for developing appropriate measurement instruments to assess teacher dispositions, provide a viable approach to the diagnosis and remediation of skill-related affective performance in teacher candidates and also to meeting NCATE requirements for Standard 1. In this symposium, the development and use of…

Descriptors: Computer Software, Teacher Education Programs, Rating Scales, Measurement

A Primer on Coefficient Alpha.

Download full text

Henson, Robin K. – 2000

Because reliability is a function of scores, and not tests per se, it is inaccurate to hold that a given test will yield scores with the same reliability across samples. Therefore, score reliability should always be reported and interpreted in both measurement and substantive studies. In an effort to facilitate this outcome, this paper is intended…

Descriptors: Reliability, Scores, Test Results, Test Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Lee, Yong-Won	4
Dimitrov, Dimiter M.	3
Kantor, Robert	2
Lang, W. Steve	2
Mollaun, Pam	2
Wilkerson, Judy R.	2
Allen, Bem P.	1
Baker, C. Scott	1
Baker, Harley E.	1
Barke, Charles R.	1
Barton, Karen	1
Basturk, Ramazan	1
Bergstrom, Betty A.	1
Bezruczko, Nikolaus	1
Borst, Richard Alan	1
Botelho, Anthony F.	1
Butler, E. Dean	1
Campbell, Kathleen Taylor	1
Carey, Jill	1
Chandler, Theodore A.	1
Chason, Walter M.	1
Chevalier, Shirley A.	1
Chin-Chance, Selvin	1
Clonts, Jean G.	1
More ▼