Showing 31 to 45 of 378 results
Stevens, Olinger; Leigh, Erika – ProQuest LLC, 2012
Scope and Method of Study: The purpose of the study is to use an empirical approach to identify a simple, economical, efficient, and technically adequate performance measure that teachers can use to assess student growth in mathematics. The current study has been designed to expand the body of research for math CBM to further examine technical…
Descriptors: Mathematics Instruction, Evaluation Methods, Student Evaluation, Measurement Techniques
Peer reviewed
McCarty, Allison M.; Christ, Theodore J. – Assessment for Effective Intervention, 2010
This article reviews the "Developmental Reading Assessment--Second Edition" (DRA2), a teacher-administered assessment that identifies students' instructional level, along with their strengths and weaknesses in reading. Once teachers calculate and interpret scores, the data can purportedly be used to inform, and possibly individualize,…
Descriptors: Reading Tests, Oral Reading, Reading Fluency, Criterion Referenced Tests
Peer reviewed
Hosp, John L.; Hosp, Michelle A.; Dole, Janice K. – School Psychology Review, 2011
Universal screening measures are an integral component of any tiered system of instructional delivery. Recent studies of screening measures have often excluded examinations of bias in predictive validity. The present study examined a common screening instrument for evidence of bias in predictive validity across the four disaggregation categories…
Descriptors: Evidence, Reading Fluency, Federal Legislation, Predictive Validity
Peer reviewed
Spooren, Pieter; Brockx, Bert; Mortelmans, Dimitri – Review of Educational Research, 2013
This article provides an extensive overview of the recent literature on student evaluation of teaching (SET) in higher education. The review is based on the SET meta-validation model, drawing upon research reports published in peer-reviewed journals since 2000. Through the lens of validity, we consider both the more traditional research themes in…
Descriptors: Student Evaluation of Teacher Performance, Teacher Evaluation, Test Validity, Educational Research
Peer reviewed
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
Peer reviewed
Fox, Janna; Fairbairn, Shelley – Language Testing, 2011
This article reviews Assessing Comprehension and Communication in English State-to-State for English Language Learners ("ACCESS for ELLs"®), which is a large-scale, high-stakes, standards-based, and criterion-referenced English language proficiency test administered in the USA annually to more than 840,000 English Language Learners (ELLs), in…
Descriptors: Test Preparation, Feedback (Response), Instructional Design, Testing Accommodations
Peer reviewed
Meisels, Samuel J.; Wen, Xiaoli; Beachy-Quick, Kristy – Applied Developmental Science, 2010
This study used a mixed methods methodology to investigate the reliability and validity of the Ounce Scale, an authentic, observational assessment of infants' and toddlers' development from birth through 42 months of age. Quantitative cross-sectional data were collected from 287 children and 124 teachers in seven urban Early Head Start programs;…
Descriptors: Performance Based Assessment, Measures (Individuals), Infants, Toddlers
Setzer, J. Carl – GED Testing Service, 2009
The GED® English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…
Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills
Peer reviewed
MacQuarrie, David; Applegate, Brooks; Lacefield, Warren – Journal of Career and Technical Education, 2008
Career and Technical Education (CTE) is a nationwide program that emphasizes training for primary, secondary, and postsecondary educational stages for the career and workforce needs of today's and tomorrow's society. Mandated indicators of success have been set in place and secondary schools are expected to improve students' skill levels in…
Descriptors: Criterion Referenced Tests, Content Validity, Test Validity, Test Reliability
Peer reviewed
Mahar, Matthew T.; Rowe, David A. – Measurement in Physical Education and Exercise Science, 2008
Accurate measures of youth fitness are needed by researchers and practitioners. Evidence of validity and reliability are essential before results of youth fitness tests can be used to make sound decisions. This article describes a three-stage paradigm for validation research and provides guidance for conducting and understanding norm-referenced…
Descriptors: Test Reliability, Test Validity, Guidelines, Physical Education Teachers
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests
Peer reviewed
Hall, John D.; Howerton, D. Lynn; Jones, Craig H. – Research in the Schools, 2008
The No Child Left Behind Act and the accountability movement in public education caused many states to develop criterion-referenced academic achievement tests. Scores from these tests are often used to make high stakes decisions. Even so, these tests typically do not receive independent psychometric scrutiny. We evaluated the 2005 Arkansas…
Descriptors: Criterion Referenced Tests, Achievement Tests, High Stakes Tests, Public Education
Peer reviewed
Vivo, Juana-Maria; Franco, Manuel – International Journal of Mathematical Education in Science and Technology, 2008
This article attempts to present a novel application of a method of measuring accuracy for academic success predictors that could be used as a standard. This procedure is known as the receiver operating characteristic (ROC) curve, which comes from statistical decision techniques. The statistical prediction techniques provide predictor models and…
Descriptors: Academic Achievement, Item Response Theory, Criterion Referenced Tests, Predictor Variables
Peer reviewed
Meisels, Samuel J.; Xue, Yange; Shamblott, Melissa – Early Education and Development, 2008
Research Findings: We examined the reliability and validity of the language, literacy, and mathematics domains of "Work Sampling for Head Start" (WSHS), an observational assessment designed for 3- and 4-year-olds. Participants included 112 children who were enrolled over a two-year period in Head Start and a number of other programs…
Descriptors: Preschool Children, Preschool Education, Early Intervention, Criterion Referenced Tests
Peer reviewed
Wilcox, Rand R. – Educational and Psychological Measurement, 1981
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability