Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 9 |
Descriptor
Educational Research | 40 |
Test Theory | 40 |
Evaluation Methods | 10 |
Test Construction | 9 |
Educational Testing | 8 |
Higher Education | 8 |
Elementary Secondary Education | 7 |
Models | 7 |
Student Evaluation | 7 |
Test Validity | 7 |
Measurement Techniques | 6 |
More ▼ |
Source
Author
Mislevy, Robert J. | 3 |
Santee, Phillip | 2 |
Whitehead, Bruce | 2 |
Balch, William R. | 1 |
Barnett-Foster, Debora | 1 |
Beichner, Robert | 1 |
Biggs, John | 1 |
Bos, Wilfried | 1 |
Boyd, Donald | 1 |
Braun, Henry | 1 |
Burry-Stock, Judith A. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 4 |
Higher Education | 2 |
Adult Education | 1 |
Audience
Researchers | 3 |
Practitioners | 2 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
Learning and Study Strategies… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Yvette Jackson – ProQuest LLC, 2023
Rater-mediated activities in educational research occur when an expert judge or rater utilizes an instrument to judge persons or items and generates scale scores. Scale scores are from a subjective judgment and must undergo a quality control measure called rating quality. Rating quality in this study is broadly defined as the extent to which…
Descriptors: Educational Research, Evaluators, Test Theory, Item Response Theory
Braun, Henry – Journal of Educational and Behavioral Statistics, 2023
It is a much-lamented fact that research with the potential to inform or influence education policy instead remains policy inert. There are many reasons for this frustrating state of affairs, including a lack of strategic thinking on the part of researchers on how to successfully accomplish outreach--as opposed to communication with peers…
Descriptors: Educational Policy, Educational Research, Educational Researchers, Persuasive Discourse
Xu, Ting; Stone, Clement A. – Educational and Psychological Measurement, 2012
It has been argued that item response theory trait estimates should be used in analyses rather than number right (NR) or summated scale (SS) scores. Thissen and Orlando postulated that IRT scaling tends to produce trait estimates that are linearly related to the underlying trait being measured. Therefore, IRT trait estimates can be more useful…
Descriptors: Educational Research, Monte Carlo Methods, Measures (Individuals), Item Response Theory
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing
Jacob, Robin Tepper; Jacob, Brian – Journal of Research on Educational Effectiveness, 2012
Teacher and principal surveys are among the most common data collection techniques employed in education research. Yet there is remarkably little research on survey methods in education, or about the most cost-effective way to raise response rates among teachers and principals. In an effort to explore various methods for increasing survey response…
Descriptors: Principals, Data Collection, Test Theory, Response Rates (Questionnaires)
Ding, Lin; Beichner, Robert – Physical Review Special Topics - Physics Education Research, 2009
This paper introduces five commonly used approaches to analyzing multiple-choice test data. They are classical test theory, factor analysis, cluster analysis, item response theory, and model analysis. Brief descriptions of the goals and algorithms of these approaches are provided, together with examples illustrating their applications in physics…
Descriptors: Multiple Choice Tests, Factor Analysis, Data Interpretation, Item Response Theory
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Martone, Andrea; Sireci, Stephen G. – Review of Educational Research, 2009
The authors (a) discuss the importance of alignment for facilitating proper assessment and instruction, (b) describe the three most common methods for evaluating the alignment between state content standards and assessments, (c) discuss the relative strengths and limitations of these methods, and (d) discuss examples of applications of each…
Descriptors: Teaching Methods, Alignment (Education), Student Evaluation, Curriculum Development
Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008
Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…
Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

Goldstein, Harvey – Educational Measurement: Issues and Practice, 1994
This article examines how psychometric models based on certain assumptions have come to be used counterproductively by many practitioners in ways that limit the kinds of conclusions that can be made. The general problem of the context's influence on performance is discussed, and some implications are drawn. (SLD)
Descriptors: Context Effect, Educational Research, Evaluation Methods, Measurement Techniques
Spearritt, Donald, Ed. – 1982
Educational and psychological measurement has been a main area of work for the Australian Council for Educational (ACER) since its inception. The theoretical and practical contributions of latent trait measurement and commentary on the relatively recent use of these models in Australia were the focus of a seminar celebrating the 50th anniversary…
Descriptors: Aptitude Tests, Cognitive Measurement, Educational Research, Educational Testing
Warfel, Katherine Ann – 1984
The goal of test design is to devise an instrument that will provide a stable and accurate assessment of student ability in some area. One means of reaching this goal is through the use of latent trait models, which determine the relationship between the unobservable trait or ability and the observable test performance. Three common latent trait…
Descriptors: Educational Research, Item Analysis, Latent Trait Theory, Measurement Techniques
Purves, Alan; And Others – 1990
A study examined the results of an administration of a series of theoretically based prototype tests to 857 high school students in California, New York, and Wisconsin. By revising the existing framework of a prior study, tests were devised which attempted to measure three interrelated aspects of school literature: background knowledge, the…
Descriptors: Educational Research, Educational Testing, High Schools, Literature

Biggs, John – Alberta Journal of Educational Research, 1995
Different models of performance assessment arise from interactions of three dimensions of assessment: the measurement versus the standards model of testing, quantitative and qualitative assumptions concerning the nature of learning, and whether learning and testing are situated or decontextualized. Addresses difficulties in implementing…
Descriptors: Competency Based Education, Educational Change, Educational Practices, Educational Research

Chase, Clint – Mid-Western Educational Researcher, 1996
Classical procedures for calculating the two indices of decision consistency (P and Kappa) for criterion-referenced tests require two testings on each child. Huynh, Peng, and Subkoviak have presented one-testing procedures for these indices. These indices can be estimated without any test administration using Ebel's estimates of the mean, standard…
Descriptors: Criterion Referenced Tests, Educational Research, Educational Testing, Estimation (Mathematics)