Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 36 |
Descriptor
Measurement Techniques | 63 |
Test Items | 63 |
Test Reliability | 44 |
Test Validity | 34 |
Test Construction | 22 |
Item Analysis | 16 |
Psychometrics | 15 |
Reliability | 14 |
Scoring | 14 |
Item Response Theory | 13 |
Correlation | 11 |
More ▼ |
Source
Author
Aiken, Lewis R. | 1 |
Allan S. Cohen | 1 |
Almehrizi, Rashid S. | 1 |
Alonzo, Julie | 1 |
Aman, Michael G. | 1 |
Anderson, Daniel | 1 |
Arnold, L. Eugene | 1 |
Aryadoust, Vahid | 1 |
Balbuena, Sherwin | 1 |
Batorowicz, Beata | 1 |
Brennan, Robert L. | 1 |
More ▼ |
Publication Type
Education Level
Audience
Teachers | 2 |
Practitioners | 1 |
Researchers | 1 |
Location
Canada | 1 |
Canada (Toronto) | 1 |
Georgia | 1 |
India | 1 |
New York | 1 |
Oregon | 1 |
Philippines | 1 |
Portugal | 1 |
Turkey | 1 |
Uganda | 1 |
Washington | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Raven Progressive Matrices | 1 |
Stanford Binet Intelligence… | 1 |
Teaching and Learning… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021
Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…
Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Aryadoust, Vahid; Ng, Li Ying; Sayama, Hiroki – Language Testing, 2021
Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied linguistics journals for multiple features. We found that seven Rasch models and 23 software packages were adopted in these papers, with many-facet…
Descriptors: Language Tests, Testing, Test Items, Network Analysis
Gary A. Troia; Frank R. Lawrence; Julie S. Brehmer; Kaitlin Glause; Heather L. Reichmuth – Grantee Submission, 2023
Much of the research that has examined the writing knowledge of school-age students has relied on interviews to ascertain this information, which is problematic because interviews may underestimate breadth and depth of writing knowledge, require lengthy interactions with participants, and do not permit a direct evaluation of a prescribed array of…
Descriptors: Writing Tests, Writing Evaluation, Knowledge Level, Elementary School Students
Nurnberger-Haag, Julie; Kratky, Joseph; Karpinski, Aryn C. – International Electronic Journal of Mathematics Education, 2022
Skills and understanding of operations with negative numbers, which are typically taught in middle school, are crucial aspects of numerical competence necessary for all subsequent mathematics. To more swiftly and coherently develop the field's understanding of how to foster this critical competence, we need shared measures that allow us to compare…
Descriptors: Numbers, Number Concepts, Middle School Students, Secondary School Mathematics
Datta, Sumona; Dutta Roy, Debdulal – Journal of Cognitive Education and Psychology, 2021
Measurement of mental rotation presents a serious challenge to cognitive researchers owing to the lack of a single comprehensive measure that can be applied across the developing age groups. Objective of the present study was to develop and validate a new measure of mental rotation for preadolescent and adolescent age groups. Items were…
Descriptors: Spatial Ability, Visualization, Preadolescents, Adolescents
Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020
The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…
Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques
Edwards, Andrew S.; Edwards, Kinsey E.; Wesolowski, Brian C. – Research Studies in Music Education, 2019
The purpose of this study was to develop a valid and reliable rubric to be used for the evaluation of large ensemble wind band performances. The guiding questions for this study were: (a) what are the psychometric qualities (i.e., reliability and validity) of the scale developed to assess wind band ensemble performance at the high school level?…
Descriptors: Scoring Rubrics, Performance Based Assessment, Psychometrics, Musicians
Nilsen, Trude; Slot, Pauline; Cigler, Hynek; Chen, Minge – OECD Publishing, 2020
Situational Judgement Questions (SJQs) measuring process quality were included in the OECD Starting Strong Teaching and Learning International Survey 2018 (TALIS Starting Strong 2018) to address concerns of self-report bias in large-scale international surveys. These SJQs provide the staff in early childhood education and care with situations…
Descriptors: Educational Quality, Situational Tests, Administrator Surveys, Teacher Surveys
Murawska, Jaclyn M.; Walker, David A. – Mid-Western Educational Researcher, 2017
In this commentary, we offer a set of visual tools that can assist education researchers, especially those in the field of mathematics, in developing cohesiveness from a mixed methods perspective, commencing at a study's research questions and literature review, through its data collection and analysis, and finally to its results. This expounds…
Descriptors: Mixed Methods Research, Research Methodology, Visual Aids, Research Tools
Raker, Jeffrey R.; Trate, Jaclyn M.; Holme, Thomas A.; Murphy, Kristen – Journal of Chemical Education, 2013
Experts use their domain expertise and knowledge of examinees' ability levels as they write test items. The expert test writer can then estimate the difficulty of the test items subjectively. However, an objective method for assigning difficulty to a test item would capture the cognitive demands imposed on the examinee as well as be…
Descriptors: Organic Chemistry, Test Items, Item Analysis, Difficulty Level