Milanowski, Anthony – Phi Delta Kappan, 2011
Managing the human capital in education requires measuring teacher performance. To measure performance, administrators need to combine measures of practice with measures of outcomes, such as value-added measures, and three measurement systems are needed: classroom observations, performance assessments or work samples, and classroom walkthroughs.…
Descriptors: Human Capital, Classroom Observation Techniques, Teacher Evaluation, Teacher Competency Testing
Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010
In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of p_obs-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…
Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance
Meara, Paul M.; Alcoy, Juan Carlos Olmos – Reading in a Foreign Language, 2010
This paper addresses the issue of how we might be able to assess productive vocabulary size in second language learners. It discusses some previous attempts to develop measures of this sort, and argues that a fresh approach is needed in order to overcome some persistent problems that dog research in this area. The paper argues that there might be…
Descriptors: Vocabulary Development, Evaluation Methods, Second Language Learning, Measurement Techniques
Almy, Sarah – Education Trust, 2011
In schools across America, teachers know who among their peers is doing the best work and who is not. Yet the nation's evaluation systems tend to foster the notion that all teachers perform the same way, with the same results for students. Indeed, in an attempt at equality--uniform treatment for everyone--current evaluation systems often end up…
Descriptors: Classroom Observation Techniques, Teacher Effectiveness, Evaluators, Teacher Evaluation
Luecht, Richard M.; Sireci, Stephen G. – College Board, 2011
Over the past four decades, there has been incremental growth in computer-based testing (CBT) as a viable alternative to paper-and-pencil testing. However, the transition to CBT is neither easy nor inexpensive. As Drasgow, Luecht, and Bennett (2006) noted, many design engineering, test development, operations/logistics, and psychometric changes…
Descriptors: College Entrance Examinations, Computer Assisted Testing, Educational Technology, Evaluation Methods
Behizadeh, Nadia; Engelhard, George, Jr. – Assessing Writing, 2011
The purpose of this study is to examine the interactions among measurement theories, writing theories, and writing assessments in the United States from a historical perspective. The assessment of writing provides a useful framework for examining how theories influence, and in some cases fail to influence, actual practice. Two research traditions…
Descriptors: Writing (Composition), Intellectual Disciplines, Writing Evaluation, Writing Tests
Rogge, Nicky – International Journal of Educational Management, 2011
Purpose: This paper proposes a benefit of the doubt (BoD) approach to construct and analyse teacher effectiveness scores (i.e. SET scores). Design/methodology/approach: The BoD approach is related to data envelopment analysis (DEA), a linear programming tool for evaluating the relative efficiency performance of a set of similar units (e.g. firms,…
Descriptors: Teacher Effectiveness, Foreign Countries, Teacher Evaluation, Evaluation Methods
Ertesvag, Sigrun K. – Teaching and Teacher Education: An International Journal of Research and Studies, 2011
High quality measurements are important to evaluate interventions. The study reports on the development of a measurement to investigate authoritative teaching understood as a two-dimensional construct of warmth and control. Through the application of confirmatory factor analysis (CFA) and structural equation modelling (SEM) the factor structure…
Descriptors: Factor Structure, Factor Analysis, Psychometrics, Intervention
Zhang, Bin – ProQuest LLC, 2012
Social scientists are usually more interested in consumers' dichotomous choices, such as whether to purchase a product or adopt a technology. To date, however, almost no model can help solve the problem of comparing multi-network effects with a dichotomous dependent variable. Furthermore, the study of multi-network…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Population Groups
Ruscio, John – Assessment, 2009
Determining whether individuals belong to different latent classes (taxa) or vary along one or more latent factors (dimensions) has implications for assessment. For example, no instrument can simultaneously maximize the efficiency of categorical and continuous measurement. Methods such as taxometric analysis can test the relative fit of taxonic…
Descriptors: Classification, Measurement, Measurement Techniques, Evaluation Research
Feingold, Alan – Psychological Methods, 2009
The use of growth-modeling analysis (GMA)--including hierarchical linear models, latent growth models, and general estimating equations--to evaluate interventions in psychology, psychiatry, and prevention science has grown rapidly over the last decade. However, an effect size associated with the difference between the trajectories of the…
Descriptors: Control Groups, Effect Size, Raw Scores, Models
Reznitskaya, Alina; Kuo, Li-jen; Glina, Monica; Anderson, Richard C. – Learning and Individual Differences, 2009
The aim of this paper is to develop a more thorough, empirically-based understanding of the differences in measurement of written argumentation when alternative scoring frameworks are employed. Reflective compositions of 127 elementary school children were analyzed using analytic and holistic scales. The scales were derived from Argument Schema…
Descriptors: Elementary School Students, Persuasive Discourse, Academic Achievement, Scoring
Kim, Jiseon – ProQuest LLC, 2010
Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…
Descriptors: Test Length, Computer Assisted Testing, Classification, Probability
Helding, Brandon Alan – ProQuest LLC, 2010
The purpose of this dissertation is to demonstrate one iterate of a process for developing a measurement instrument for student knowledge within educational interventions. Student mathematical knowledge is framed within Cognitively Guided Instruction (CGI) and its tenets. That is, the construct underlying the measurement instrument corresponded…
Descriptors: Mathematics Education, Definitions, Grade 1, Item Response Theory
Zeller-Berkman, Sarah – New Directions for Evaluation, 2010
A critical theory lens is used to explore the role of evaluation in youth development, a field aimed at recognizing youth as assets. A theory of change in the field is questioned for its emphasis on individual youth outcomes as programmatic outcome measures. A review of 209 evaluations of 131 programs in the Harvard Family Research Project's…
Descriptors: Critical Theory, Youth Programs, Evaluation, Outcomes of Education