Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 39 |
Descriptor
Data Analysis | 91 |
Evaluation Methods | 91 |
Reliability | 51 |
Validity | 31 |
Test Reliability | 30 |
Data Collection | 24 |
Research Methodology | 24 |
Test Validity | 20 |
Interrater Reliability | 18 |
Measurement Techniques | 17 |
Comparative Analysis | 16 |
More ▼ |
Source
Author
Mudford, Oliver C. | 2 |
Aldridge, Jill M. | 1 |
Alfonso, Vincent C. | 1 |
Algozzine, Bob | 1 |
Algozzine, Kate | 1 |
Amrein-Beardsley, Audrey | 1 |
Andrews, David M. | 1 |
Austin, G. | 1 |
Bates, S. | 1 |
Bell, Lisa | 1 |
Bers, Trudy H. | 1 |
More ▼ |
Publication Type
Education Level
Location
Australia | 3 |
Florida | 3 |
United Kingdom (England) | 3 |
United States | 3 |
California | 2 |
Canada | 2 |
Connecticut | 2 |
Turkey | 2 |
Asia | 1 |
Brazil | 1 |
Denmark | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
What Works Clearinghouse Rating
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Raj, Gaurav; Mahajan, Manish; Singh, Dheerendra – International Journal of Web-Based Learning and Teaching Technologies, 2020
In secure web application development, the role of web services will not continue if it is not trustworthy. Retaining customers with applications is one of the major challenges if the services are not reliable and trustworthy. This article proposes a trust evaluation and decision model where the authors have defined indirect attribute, trust,…
Descriptors: Trust (Psychology), Models, Decision Making, Computer Software
Geiger, Tray J.; Amrein-Beardsley, Audrey – AASA Journal of Scholarship & Practice, 2017
In this commentary, we discuss three types of data manipulations that can occur within teacher evaluation methods: artificial inflation, artificial deflation, and artificial conflation. These types of manipulation are more popularly known in the education profession as instances of Campbell's Law (1976), which states that the higher the…
Descriptors: Teacher Evaluation, Evaluation Methods, Data Analysis, Personnel Policy
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017
Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full extent, they require the resolution of new issues like assessing students at scale. A feasible approach to tackle this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…
Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses
Mearman, Kimberly A. – ProQuest LLC, 2013
Because of the critical function of the IEP in the planning and implementation of effective instruction for students with disabilities, educators need a reference to determine the standards of a quality IEP and a process by which to compare an IEP to those standards. A rubric can support educators in examining the quality of IEPs. This study used…
Descriptors: Construct Validity, Reliability, Scoring Rubrics, Individualized Education Programs
Goodwin, Adam; Chittle, Laura; Dixon, Jess C.; Andrews, David M. – Assessment & Evaluation in Higher Education, 2018
A multi-disciplinary academic unit at a Canadian university completed an evaluation of course syllabi used in its undergraduate programmes over the previous five years. This paper examines the reasons for the evaluation, the processes employed to collect and analyse the data, and how the results will be incorporated into the next steps of the…
Descriptors: Foreign Countries, College Curriculum, Curriculum Evaluation, Course Descriptions
National Centre for Vocational Education Research (NCVER), 2016
This work asks one simple question: "how reliable is the method used by the National Centre for Vocational Education Research (NCVER) to estimate projected rates of VET program completion?" In other words, how well do early projections align with actual completion rates some years later? Completion rates are simple to calculate with a…
Descriptors: Vocational Education, Graduation Rate, Predictive Measurement, Predictive Validity
Hailstone, Jono; Kilding, Andrew E. – Measurement in Physical Education and Exercise Science, 2011
The Zephyr[TM] BioHarness[TM] (Zephyr Technology, Auckland, New Zealand) is a wireless physiological monitoring system that has the ability to measure respiratory rate unobtrusively. However, the ability of the BioHarness[TM] to accurately and reproducibly determine respiratory rate across a range of intensities is currently unknown. The aim of…
Descriptors: Validity, Test Reliability, Foreign Countries, Data Analysis
Khan, R. Nazim – International Journal of Mathematical Education in Science and Technology, 2015
Open book assessment is not a new idea, but it does not seem to have gained ground in higher education. In particular, not much literature is available on open book examinations in mathematics and statistics in higher education. The objective of this paper is to investigate the appropriateness of open book assessments in a first-year business…
Descriptors: Evaluation Methods, Higher Education, Mathematics Tests, Statistics
Pelanek, Radek – Journal of Educational Data Mining, 2015
Researchers use many different metrics for evaluation of performance of student models. The aim of this paper is to provide an overview of commonly used metrics, to discuss properties, advantages, and disadvantages of different metrics, to summarize current practice in educational data mining, and to provide guidance for evaluation of student…
Descriptors: Models, Data Analysis, Data Processing, Evaluation Criteria
Sood, Vishal – Journal on Educational Psychology, 2013
For identifying children with four major kinds of verbal learning disabilities viz. reading disability, speech and language comprehension disability, writing disability and mathematics disability, the present task was undertaken to construct and standardize verbal learning disabilities checklist. This checklist was developed by keeping in view the…
Descriptors: Verbal Learning, Learning Disabilities, Children, Disability Identification
Swank, Jacqueline M.; Lambie, Glenn W.; Witta, E. Lea – Counselor Education and Supervision, 2012
The authors examined the psychometric properties of the Counseling Competencies Scale (CCS; University of Central Florida Counselor Education Faculty, 2009), an instrument designed to assess trainee competencies as measured in their counseling skills, dispositions, and behaviors. There was strong internal consistency for the 4-factor model for…
Descriptors: Test Validity, Interrater Reliability, Counselor Training, Measures (Individuals)
Riley-Ayers, Shannon – Center on Enhancing Early Learning Outcomes, 2014
This policy report provides a guide and framework to early childhood policymakers considering formative assessment. The report defines formative assessment and outlines its process and application in the context of early childhood. The substance of this document is the issues for consideration in the implementation of the formative assessment…
Descriptors: Formative Evaluation, Early Childhood Education, Educational Policy, Policy Formation
Castillo, Jose M.; Dedrick, Robert F.; Stockslager, Kevin M.; March, Amanda L.; Hines, Constance V.; Tan, Sim Yin – Journal of Applied School Psychology, 2015
This article presents information on the development and initial validation of the 16-item Response to Intervention (RTI) Beliefs Scale. The scale is designed to measure the extent to which educators working in schools hold beliefs consistent with the tenets of RTI. The authors administered the instrument to 2,430 educators in 62 elementary…
Descriptors: Response to Intervention, Teacher Attitudes, Test Construction, Test Validity