Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Author
Bachman, Lyle F. | 2 |
Haladyna, Tom | 2 |
Roid, Gale | 2 |
Shohamy, Elana | 2 |
Abramson, Theodore | 1 |
Armour-Thomas, Eleanor | 1 |
Austin, James T. | 1 |
Bachor, Dan G. | 1 |
Badjadi, Nour El Imane | 1 |
Banchick, Gail | 1 |
Barron, Frank | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 3 |
Postsecondary Education | 1 |
Audience
Practitioners | 4 |
Researchers | 3 |
Teachers | 3 |
Counselors | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Wechsler Intelligence Scale… | 3 |
Minnesota Multiphasic… | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
Daniel M. Settlage; Jim R. Wollscheid – Journal of the Scholarship of Teaching and Learning, 2024
The examination of the testing mode effect has received increased attention as higher education has shifted to remote testing during the COVID-19 pandemic. We believe the testing mode effect consists of four components: the ability to physically write on the test, the method of answer recording, the proctoring/testing environment, and the effect…
Descriptors: College Students, Macroeconomics, Tests, Answer Sheets
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Opitz, Ansgar; Heene, Moritz; Fischer, Frank – Educational Research and Evaluation, 2017
Education systems increasingly emphasize the importance of scientific reasoning skills such as "generating hypotheses" and "evaluating evidence." Despite this importance, we do not know which tests of scientific reasoning exist, which skills they emphasize, how they conceptualize scientific reasoning, and how well they are…
Descriptors: Thinking Skills, Logical Thinking, Science Process Skills, Science Instruction
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Sinharay, Sandip – Educational Testing Service, 2010
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008) suggested a method based on classical test theory to determine whether subscores have added value over total scores. This paper provides a literature review and reports when subscores were found to have added value for…
Descriptors: Scores, Correlation, Reliability, Item Response Theory
Gump, Steven E. – Educational Research Quarterly, 2007
This review presents an overview of selected articles on the leniency hypothesis: the idea that students give higher evaluations to instructors who grade more leniently. Such articles comprise a small subset of the voluminous research on student evaluations of teaching (SETs). In this diverse literature, research methods and aims have frequently…
Descriptors: Student Evaluation of Teacher Performance, Research Methodology, Meta Analysis, Research Problems

Muraki, Eiji; Hombo, Catherine M.; Lee, Yong-Won – Applied Psychological Measurement, 2000
Presents an overview of linking methods applied to performance assessment and discusses major issues and recent developments in linking performance assessments. Compares three common linking designs and two major linking methodologies (classical and item response theory (IRT)). Describes two classical equating methods and several IRT equating…
Descriptors: Equated Scores, Item Response Theory, Performance Based Assessment, Test Theory
Hambleton, Ronald K.; Swaminathan, H. – 1985
Comments are made on the review papers presented by six Dutch psychometricians: Ivo Molenaar, Wim van der Linden, Ed Roskam, Arnold Van den Wollenberg, Gideon Mellenbergh, and Dato de Gruijter. Molenaar has embraced a pragmatic viewpoint on Bayesian methods, using both empirical and pure approaches to solve educational research problems. Molenaar…
Descriptors: Bayesian Statistics, Decision Making, Elementary Secondary Education, Foreign Countries

Shohamy, Elana – Annual Review of Applied Linguistics, 1995
Reviews recent trends in performance testing, focusing on different definitions of performance testing; the extent to which performance tests have drawn upon the theoretical discussions of competence and performance; research on performance tests; and future developmental and research questions. (66 references) (MDM)
Descriptors: Definitions, Evaluation Methods, Language Proficiency, Language Tests

Gresham, Frank M. – School Psychology Review, 1984
The evidence for the psychometric adequacy of behavioral interviews in terms of traditional psychometric theory and generalizability theory are reviewed. The review resulted in the conclusion that behavioral interviews have some evidence for interrater reliability, content validity, and criterion-related validity. Additional research in several…
Descriptors: Behavior Patterns, Behavior Problems, Functional Behavioral Assessment, Generalizability Theory

Roid, Gale; Haladyna, Tom – Review of Educational Research, 1980
A continuum of item-writing methods is proposed ranging from informal-subjective methods to algorithmic-objective methods. Examples of techniques include objective-based item writing, amplified objectives, item forms, facet design, domain-referenced concept testing, and computerized techniques. (Author/CP)
Descriptors: Achievement Tests, Algorithms, Computer Assisted Testing, Criterion Referenced Tests

Whitely, Susan E. – Intelligence, 1980
This article examines the potential contribution of latent trait models to the study of intelligence. Nontechnical introductions to both unidimensional and multidimensional latent trait models are given. Multidimensional latent trait models can be used to test alternative multiple component theories of test item processing. (Author/CTM)
Descriptors: Ability, Aptitude Tests, Cognitive Processes, Intelligence

Watkins, Marley W.; Kush, Joseph C. – School Psychology Review, 1994
Study compares Wechsler (WISC-R) profiles of special-education students to seven core types distinguished primarily by levels of global ability. More than 96% of these students were found to be similar to one of the core types considered to be common variants of normal intellectual ability. Based on data, it is recommended that "no way"…
Descriptors: Ability, Achievement Tests, Special Education, Test Theory

Leary, Linda F.; Dorans, Neil J. – Review of Educational Research, 1985
Research on the potential effects of different item arrangement schemes on item statistics is reviewed for three separate periods. Earliest studies investigated the simple main effect of item order on test performance. The late 1960s emphasized interactions between item order and examinees' characteristics. Current concern focuses on item…
Descriptors: Achievement Tests, Aptitude Tests, Item Analysis, Latent Trait Theory