Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024
A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…
Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
Raykov, Tenko; Pusic, Martin – Educational and Psychological Measurement, 2023
This note is concerned with evaluation of location parameters for polytomous items in multiple-component measuring instruments. A point and interval estimation procedure for these parameters is outlined that is developed within the framework of latent variable modeling. The method permits educational, behavioral, biomedical, and marketing…
Descriptors: Item Analysis, Measurement Techniques, Computer Software, Intervals
Oon, Pey-Tee; Fan, Xitao – International Journal of Science Education, 2017
Students' attitude towards science (SAS) is often a subject of investigation in science education research. Rating-scale surveys are commonly used in the study of SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information about SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an…
Descriptors: Item Response Theory, Psychometrics, Attitude Measures, Rating Scales
Kuechler, William L.; Simkin, Mark G. – Decision Sciences Journal of Innovative Education, 2010
Both professional certification and academic tests rely heavily on multiple-choice questions, despite the widespread belief that alternate, constructed-response questions are superior measures of a test taker's understanding of the underlying material. Empirically, the search for a link between these two assessment metrics has met with limited…
Descriptors: Multiple Choice Tests, Performance Based Assessment, Alternative Assessment, Knowledge Level
Mo, Lun; Yang, Fang; Hu, Xiangen – Educational Research and Evaluation, 2011
School climate surveys are widely applied in school districts across the nation to collect information about teacher efficacy, principal leadership, school safety, students' activities, and so forth. They enable school administrators to understand and address many issues on campus when used in conjunction with other student and staff data.…
Descriptors: Evidence, Academic Achievement, Questionnaires, Item Response Theory
Spooren, Pieter; Brockx, Bert; Mortelmans, Dimitri – Review of Educational Research, 2013
This article provides an extensive overview of the recent literature on student evaluation of teaching (SET) in higher education. The review is based on the SET meta-validation model, drawing upon research reports published in peer-reviewed journals since 2000. Through the lens of validity, we consider both the more traditional research themes in…
Descriptors: Student Evaluation of Teacher Performance, Teacher Evaluation, Test Validity, Educational Research
Anderson, Trevor R.; Rogan, John M. – Biochemistry and Molecular Biology Education, 2010
Student assessment is central to the educational process and can be used for multiple purposes, including to promote student learning, to grade student performance, and to evaluate the educational quality of qualifications. It is, therefore, of utmost importance that assessment instruments are of a high quality. In this article, we present various…
Descriptors: Educational Assessment, Educational Quality, Student Evaluation, Educational Research
Vendlinski, Terry P.; Delacruz, Girlie C.; Buschang, Rebecca E.; Chung, Gregory K. W. K.; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
The evaluation of educational interventions requires assessments that consistently (reliably) produce data from which accurate (valid) inferences about the test subjects can be made for some stated purpose. Despite codified definitions of all these terms, there remains vibrant debate about the assessment design process and how measures of…
Descriptors: Learning Theories, Student Evaluation, Educational Research, Video Games
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Royal, Kenneth D. – Online Submission, 2009
Quality measurement is essential in every form of research, including institutional research and assessment. Unfortunately, most survey research today (both published and unpublished) is lacking with regard to quality measurement. Reporting means and standard deviations based on ordinal measures is an inappropriate, yet widespread, practice in the…
Descriptors: Higher Education, Institutional Research, Measurement Techniques, Item Response Theory
Warfel, Katherine Ann – 1984
The goal of test design is to devise an instrument that will provide a stable and accurate assessment of student ability in some area. One means of reaching this goal is through the use of latent trait models, which determine the relationship between the unobservable trait or ability and the observable test performance. Three common latent trait…
Descriptors: Educational Research, Item Analysis, Latent Trait Theory, Measurement Techniques

Barcikowski, Robert S.; Olsen, Henry – Journal of Psychology, 1975
Provides evidence that students perceive test items arranged in a hard-medium-easy order as being easier than the same items arranged in the reverse order; the students' perceptions, however, did not influence their scores. (RB)
Descriptors: Adaptation Level Theory, College Students, Educational Research, Higher Education
Wise, Lauress L. – 1986
A primary goal of this study was to determine the extent to which item difficulty was related to item position and, if a significant relationship was found, to suggest adjustments to predicted item difficulty that reflect differences in item position. Item response data from the Medical College Admission Test (MCAT) were analyzed. A data set was…
Descriptors: College Entrance Examinations, Difficulty Level, Educational Research, Error of Measurement
Allal, Linda – 1986
This paper discusses the theoretical scope and practical applicability of generalizability (G) theory through the principle of symmetry. Major ideas are summarized and factors hindering applications of G theory in research conducted in French-speaking Europe are presented. The principle of symmetry affirms that any factor of a design can be…
Descriptors: Data Collection, Educational Research, Factor Analysis, Factor Structure