ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	9

Descriptor

Educational Research	40
Test Theory	40
Evaluation Methods	10
Test Construction	9
Educational Testing	8
Higher Education	8
Elementary Secondary Education	7
Models	7
Student Evaluation	7
Test Validity	7
Measurement Techniques	6
Psychometrics	6
Research Methodology	6
Test Reliability	6
Measures (Individuals)	5
Scores	5
Test Items	5
Estimation (Mathematics)	4
Evaluation Research	4
Item Response Theory	4
Multiple Choice Tests	4
Standardized Tests	4
Statistical Analysis	4
Test Bias	4
Test Format	4
More ▼

Publication Type

Journal Articles	26
Reports - Research	23
Reports - Evaluative	9
Opinion Papers	7
Information Analyses	4
Speeches/Meeting Papers	4
Tests/Questionnaires	4
Reports - Descriptive	3
Collected Works - Proceedings	1
Dissertations/Theses -…	1

Education Level

Elementary Secondary Education	4
Higher Education	2
Adult Education	1

Audience

Researchers	3
Practitioners	2

Location

New York	2
Australia	1
California	1
Canada	1
Israel	1
Michigan	1
United Kingdom	1
Wisconsin	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Learning and Study Strategies…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 40 results Save | Export

Examining Rating Quality in Rater-Mediated Activities for Standard-Item Alignment Research

Direct link

Yvette Jackson – ProQuest LLC, 2023

Rater-mediated activities in educational research occur when an expert judge or rater utilizes an instrument to judge persons or items and generates scale scores. Scale scores are from a subjective judgment and must undergo a quality control measure called rating quality. Rating quality in this study is broadly defined as the extent to which…

Descriptors: Educational Research, Evaluators, Test Theory, Item Response Theory

Expertise on Offer: Why Isn't Anyone Buying?

Peer reviewed

Direct link

Braun, Henry – Journal of Educational and Behavioral Statistics, 2023

It is a much-lamented fact that research with the potential to inform or influence education policy instead remains policy inert. There are many reasons for this frustrating state of affairs, including a lack of strategic thinking on the part of researchers on how to successfully accomplish outreach--as opposed to communication with peers…

Descriptors: Educational Policy, Educational Research, Educational Researchers, Persuasive Discourse

Using IRT Trait Estimates versus Summated Scores in Predicting Outcomes

Peer reviewed

Direct link

Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012

It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…

Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Direct link

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

Prenotification, Incentives, and Survey Modality: An Experimental Test of Methods to Increase Survey Response Rates of School Principals

Peer reviewed

Direct link

Jacob, Robin Tepper; Jacob, Brian – Journal of Research on Educational Effectiveness, 2012

Teacher and principal surveys are among the most common data collection techniques employed in education research. Yet there is remarkably little research on survey methods in education, or about the most cost-effective way to raise response rates among teachers and principals. In an effort to explore various methods for increasing survey response…

Descriptors: Principals, Data Collection, Test Theory, Response Rates (Questionnaires)

Approaches to Data Analysis of Multiple-Choice Questions

Peer reviewed

Direct link

Ding, Lin; Beichner, Robert – Physical Review Special Topics - Physics Education Research, 2009

This paper introduces five commonly used approaches to analyzing multiple-choice test data. They are classical test theory, factor analysis, cluster analysis, item response theory, and model analysis. Brief descriptions of the goals and algorithms of these approaches are provided, together with examples illustrating their applications in physics…

Descriptors: Multiple Choice Tests, Factor Analysis, Data Interpretation, Item Response Theory

A "Conditional" Sense of Fairness in Assessment

Peer reviewed

Direct link

Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013

Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…

Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics

Evaluating Alignment between Curriculum, Assessment, and Instruction

Peer reviewed

Direct link

Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009

The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…

Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development

Measuring Effect Sizes: The Effect of Measurement Error. Working Paper 19

Download full text

Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008

Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

Recontextualizing Mental Measurement.

Peer reviewed

Goldstein, Harvey – Educational Measurement: Issues and Practice, 1994

This article examines how psychometric models based on certain assumptions have come to be used counterproductively by many practitioners in ways that limit the kinds of conclusions that can be made. The general problem of the context's influence on performance is discussed, and some implications are drawn. (SLD)

Descriptors: Context Effect, Educational Research, Evaluation Methods, Measurement Techniques

The Improvement of Measurement in Education and Psychology: Contributions of Latent Trait Theories.

Download full text

Spearritt, Donald, Ed. – 1982

Educational and psychological measurement has been a main area of work for the Australian Council for Educational (ACER) since its inception. The theoretical and practical contributions of latent trait measurement and commentary on the relatively recent use of these models in Australia were the focus of a seminar celebrating the 50th anniversary…

Descriptors: Aptitude Tests, Cognitive Measurement, Educational Research, Educational Testing

Use of the Rasch Model in Communication Education: An Explanation and Example Application.

Warfel, Katherine Ann – 1984

The goal of test design is to devise an instrument that will provide a stable and accurate assessment of student ability in some area. One means of reaching this goal is through the use of latent trait models, which determine the relationship between the unobservable trait or ability and the observable test performance. Three common latent trait…

Descriptors: Educational Research, Item Analysis, Latent Trait Theory, Measurement Techniques

Prototype Measures of the Domain of Learning in Literature. Report Series 3.3.

Download full text

Purves, Alan; And Others – 1990

A study examined the results of an administration of a series of theoretically based prototype tests to 857 high school students in California, New York, and Wisconsin. By revising the existing framework of a prior study, tests were devised which attempted to measure three interrelated aspects of school literature: background knowledge, the…

Descriptors: Educational Research, Educational Testing, High Schools, Literature

Assessing for Learning: Some Dimensions Underlying New Approaches to Educational Assessment.

Peer reviewed

Biggs, John – Alberta Journal of Educational Research, 1995

Different models of performance assessment arise from interactions of three dimensions of assessment: the measurement versus the standards model of testing, quantitative and qualitative assumptions concerning the nature of learning, and whether learning and testing are situated or decontextualized. Addresses difficulties in implementing…

Descriptors: Competency Based Education, Educational Change, Educational Practices, Educational Research

Estimating the Reliability of Criterion-Referenced Tests before Administration.

Peer reviewed

Chase, Clint – Mid-Western Educational Researcher, 1996

Classical procedures for calculating the two indices of decision consistency (P and Kappa) for criterion-referenced tests require two testings on each child. Huynh, Peng, and Subkoviak have presented one-testing procedures for these indices. These indices can be estimated without any test administration using Ebel's estimates of the mean, standard…

Descriptors: Criterion Referenced Tests, Educational Research, Educational Testing, Estimation (Mathematics)

Previous Page | Next Page »

Pages: 1 | 2 | 3

Alberta Journal of…	3
Clearing House	2
Educational Research and…	2
Educational and Psychological…	2
Journal for Research in…	2
American Psychologist	1
Educational Measurement:…	1
Educational Studies	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Research on…	1
Mid-Western Educational…	1
National Center for Analysis…	1
Physical Review Special…	1
ProQuest LLC	1
Reading Research and…	1
Research Quarterly	1
Review of Educational Research	1
Sociology of Education	1
Teaching English in the…	1
Teaching of Psychology	1
More ▼

Mislevy, Robert J.	3
Santee, Phillip	2
Whitehead, Bruce	2
Balch, William R.	1
Barnett-Foster, Debora	1
Beichner, Robert	1
Biggs, John	1
Bos, Wilfried	1
Boyd, Donald	1
Braun, Henry	1
Burry-Stock, Judith A.	1
Chase, Clint	1
Cheng, Britte H.	1
Colker, Alexis M.	1
Collis, Kevin F.	1
DeBarger, Angela	1
Ding, Lin	1
Glaser, Robert	1
Goldstein, Harvey	1
Gonzalez-Tamayo, Eulogio	1
Good, Frances	1
Goy, Martin	1
Gravel, Jenna	1
Grossman, Pamela	1
More ▼