NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 72 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Areekkuzhiyil, Santhosh – Online Submission, 2021
Assessment is an integral part of any teaching learning process. Assessment has large number of functions to perform, whether it is formative or summative. This paper analyse the issues involved and the areas of concern in the classroom assessment practice and discusses the recent reforms take place. [This paper was published in Edutracks v20 n8…
Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Cimbricz, Sandra K.; McConn, Matthew L. – Changing English: Studies in Culture and Education, 2015
This article explores the intersection of new, large-scale standards-based testing, teacher accountability policy, and secondary curriculum and instruction in the United States. Two federally funded consortia--the Smarter Balanced Assessment Consortium and the Partnership for Readiness of College and Careers--prove focal to this paper, as these…
Descriptors: Group Testing, English Instruction, Secondary School Curriculum, Accountability
Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016
Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…
Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives
Peer reviewed Peer reviewed
Direct linkDirect link
St Clair-Thompson, Helen – Journal of Psychoeducational Assessment, 2014
The aim of the present study was to investigate the reliability and validity of a brief standardized assessment of children's working memory; "Lucid Recall." Although there are many established assessments of working memory, "Lucid Recall" is fully automated and can therefore be administered in a group setting. It is therefore…
Descriptors: Test Reliability, Test Validity, Computer Assisted Testing, Cognitive Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yaratan, Huseyin; Suphi, Nilgun – Turkish Online Journal of Educational Technology - TOJET, 2013
Questionnaires administered manually can cause surreptitious peer pressure on the candidate to finish when 'the others" have completed theirs, forcing students to rush or skip individual items or may hinder the ability of noticing participants who may be having difficulty understanding certain items. These drawbacks can have serious…
Descriptors: Synchronous Communication, Questionnaires, Computer Assisted Testing, Undergraduate Students
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Peer reviewed Peer reviewed
Direct linkDirect link
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zacharis, Nick Z. – Contemporary Issues in Education Research, 2010
Although summative assessment is indispensable for determining whether or not students meet the content standards, it alone is insufficient for providing teachers and administrators with the information necessary to make ongoing decisions about instruction. This article looks at the motivational impact of the assessment on students' achievement…
Descriptors: Educational Practices, Educational Innovation, Alternative Assessment, Summative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Macqueen, Susy; Harding, Luke – Language Testing, 2009
In 2002 the University of Cambridge Local Examinations Syndicate (UCLES) implemented a revised version of the Certificate of Proficiency in English (CPE). CPE, which is the highest level of the Main Suite of Cambridge ESOL exams, comprises five modules, "Reading," "Writing," "Use of English," "Listening" and "Speaking," the latter of which is the…
Descriptors: Speech Communication, Test Reviews, Examiners, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Horton, Simon; de Lourdes Drachler, Maria; Fuller, Alison; de Carvalho Leite, Jose Carlos – International Journal of Language & Communication Disorders, 2008
Background: In the UK clinical supervision is regarded as an essential process supporting quality improvement within the clinical governance framework, and the Royal College of Speech and Language Therapists regards it as a tool for promoting critical reflective practice. There is limited evidence of the impact on practice or improvements in…
Descriptors: Health Personnel, Supervision, Questionnaires, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Keating, Daniel P. – Early Education and Development, 2007
This article is a commentary for the special issue on the Early Development Instrument (EDI), a community tool to assess children's school readiness and developmental outcomes at a group level. The EDI is administered by kindergarten teachers, who assess their kindergarten students on 5 developmental domains: physical health and well-being, social…
Descriptors: School Readiness, Formative Evaluation, Kindergarten, Cognitive Development
Costin, Frank – Educ Psychol Meas, 1969
Descriptors: Group Testing, Hostility, Measurement, Neurosis
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5