Publication Date
In 2025 | 1 |
Since 2024 | 21 |
Since 2021 (last 5 years) | 80 |
Since 2016 (last 10 years) | 236 |
Since 2006 (last 20 years) | 782 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 73 |
Practitioners | 67 |
Teachers | 47 |
Administrators | 20 |
Policymakers | 14 |
Counselors | 5 |
Media Staff | 2 |
Students | 1 |
Location
Australia | 24 |
United Kingdom | 23 |
United Kingdom (England) | 17 |
United States | 17 |
New York | 15 |
California | 14 |
Florida | 12 |
Canada | 10 |
Illinois | 8 |
Massachusetts | 8 |
Vermont | 8 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Li, Shirong; Guo, Jianzhong; Wang, Kewang; Chen, Lin; Hu, Daodao; Bai, Yunshan – Journal of Chemical Education, 2017
An improved apparatus for measuring freezing points has been developed. Compared to the traditional Beckmann freezing point instrument, the improved one overcame prior difficulties with solidification of liquid and made the solid-liquid equilibrium reversible with heat compensation from a heating tube. The reliability and accuracy were carefully…
Descriptors: Chemistry, Physics, College Science, Undergraduate Students
Perrin, Charles L. – Journal of Chemical Education, 2017
The disadvantages of the usual linear least-squares analysis of first- and second-order kinetic data are described, and nonlinear least-squares fitting is recommended as an alternative.
Descriptors: Kinetics, Least Squares Statistics, Alternative Assessment, Goodness of Fit
Banerjee, Heidi Liu – Working Papers in TESOL & Applied Linguistics, 2016
Fairness, an essential quality of a test, has been broadly defined as equitable treatment of all test-takers during the testing process, absence of measurement bias, equitable access to the constructs being measured, and justifiable validity of test score interpretation for the intended purpose(s) (AREA, APA, & NCME, 2014). Given that test…
Descriptors: Second Language Programs, Language Tests, Test Reliability, Test Validity
Feinberg, Richard A.; Wainer, Howard – Educational Measurement: Issues and Practice, 2014
Subscores are often used to indicate test-takers' relative strengths and weaknesses and so help focus remediation. But a subscore is not worth reporting if it is too unreliable to believe or if it contains no information that is not already contained in the total score. It is possible, through the use of a simple linear equation provided in…
Descriptors: Scores, Equations (Mathematics), Prediction, Reliability
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
Descriptors: Computation, Statistical Analysis, Reliability, Models
VanDerHeyden, Amanda M.; Burns, Matthew K. – School Psychology Review, 2018
Assessment is fundamental to school psychology, but its purpose has shifted from making predictions about children to improving outcomes for children. This commentary on the special issue focuses on screening and progress-monitoring decisions that can be used to solve student problems. We outline several psychometric and practical issues that…
Descriptors: School Psychology, Decision Making, Psychological Evaluation, Screening Tests
Regional Educational Laboratory Mid-Atlantic, 2023
This Snapshot highlights key findings from a study that used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI) or Additional Targeted Support and Improvement (ATSI). The…
Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Eggen, Per-Odd; Persson, Jonas; Jacobsen, Elisabeth Egholm; Hafskjold, Bjørn – LUMAT: International Journal on Math, Science and Technology Education, 2017
A chemistry concept inventory (Chemical Concept Inventory 3.0/CCI 3.0) has been developed for assessing students learning and identifying the alternative conceptions that students may have in general chemistry. The conceptions in question are assumed to be mainly learned in school and to a less degree in student's daily life. The inventory…
Descriptors: Chemistry, Misconceptions, Scientific Concepts, Science Tests
Warfa, Abdi-Rizak M. – CBE - Life Sciences Education, 2016
Educational research often requires mixing different research methodologies to strengthen findings, better contextualize or explain results, or minimize the weaknesses of a single method. This article provides practical guidelines on how to conduct such research in biology education, with a focus on mixed-methods research (MMR) that uses both…
Descriptors: Mixed Methods Research, Research Methodology, Biology, Science Education
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016
The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…
Descriptors: Educational Assessment, Reliability, Validity, Test Construction
Noei, Nima; Imani, Iman Mohammadi; Wilson, Lee D.; Azizian, Saeid – Journal of Chemical Education, 2019
A low-cost and simple setup to measure the densities of liquids is introduced herein. The results and reliability of this setup were evaluated for pure liquids, water-ethanol binary mixtures, and aqueous NaCl solutions. The constructed densitometer provided density values with acceptable relative errors (less than ±3.0%), which were compared to…
Descriptors: Chemistry, Science Education, Science Instruction, Laboratory Experiments
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Earle, Sarah – Primary Science, 2017
Moderation is put forward as they key strategy for improving the reliability of teacher assessment. However, for many teachers the word "moderation" conjures up ideas of uncomfortable situations in which marking is being checked by others and there are prolonged arguments about tiny features of individual work. In this article, the…
Descriptors: Grading, Interrater Reliability, Faculty Development, Professional Continuing Education
Leckie, George – Journal of Educational and Behavioral Statistics, 2018
The traditional approach to estimating the consistency of school effects across subject areas and the stability of school effects across time is to fit separate value-added multilevel models to each subject or cohort and to correlate the resulting empirical Bayes predictions. We show that this gives biased correlations and these biases cannot be…
Descriptors: Value Added Models, Reliability, Statistical Bias, Computation