Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 15 |
Descriptor
Group Testing | 87 |
Test Validity | 87 |
Test Reliability | 45 |
Test Construction | 28 |
Achievement Tests | 18 |
Intelligence Tests | 16 |
Individual Testing | 14 |
Testing Problems | 14 |
Higher Education | 12 |
Standardized Tests | 12 |
Educational Testing | 11 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 4 |
Higher Education | 2 |
Early Childhood Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 2 |
Administrators | 1 |
Practitioners | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Esther Hong; Johnny L. Matson – Journal of Developmental and Physical Disabilities, 2021
Despite the early onset of challenging behaviors, there is a lack of research investigating the function of challenging behavior in toddlers with autism spectrum disorder (ASD) and developmental disabilities (DDs). The current study evaluated group differences in the frequency and severity of five functions of challenging behavior (i.e.,…
Descriptors: Behavior Problems, Toddlers, Autism Spectrum Disorders, Functional Behavioral Assessment
Areekkuzhiyil, Santhosh – Online Submission, 2021
Assessment is an integral part of any teaching learning process. Assessment has large number of functions to perform, whether it is formative or summative. This paper analyse the issues involved and the areas of concern in the classroom assessment practice and discusses the recent reforms take place. [This paper was published in Edutracks v20 n8…
Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Cimbricz, Sandra K.; McConn, Matthew L. – Changing English: Studies in Culture and Education, 2015
This article explores the intersection of new, large-scale standards-based testing, teacher accountability policy, and secondary curriculum and instruction in the United States. Two federally funded consortia--the Smarter Balanced Assessment Consortium and the Partnership for Readiness of College and Careers--prove focal to this paper, as these…
Descriptors: Group Testing, English Instruction, Secondary School Curriculum, Accountability
Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016
Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…
Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives
St Clair-Thompson, Helen – Journal of Psychoeducational Assessment, 2014
The aim of the present study was to investigate the reliability and validity of a brief standardized assessment of children's working memory; "Lucid Recall." Although there are many established assessments of working memory, "Lucid Recall" is fully automated and can therefore be administered in a group setting. It is therefore…
Descriptors: Test Reliability, Test Validity, Computer Assisted Testing, Cognitive Tests
Yaratan, Huseyin; Suphi, Nilgun – Turkish Online Journal of Educational Technology - TOJET, 2013
Questionnaires administered manually can cause surreptitious peer pressure on the candidate to finish when 'the others" have completed theirs, forcing students to rush or skip individual items or may hinder the ability of
noticing participants who may be having difficulty understanding certain items. These drawbacks can have serious…
Descriptors: Synchronous Communication, Questionnaires, Computer Assisted Testing, Undergraduate Students
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Li, Ying; Jiao, Hong; Lissitz, Robert W. – Journal of Applied Testing Technology, 2012
This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…
Descriptors: Achievement Tests, Science Tests, Item Response Theory, Measures (Individuals)
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Zacharis, Nick Z. – Contemporary Issues in Education Research, 2010
Although summative assessment is indispensable for determining whether or not students meet the content standards, it alone is insufficient for providing teachers and administrators with the information necessary to make ongoing decisions about instruction. This article looks at the motivational impact of the assessment on students' achievement…
Descriptors: Educational Practices, Educational Innovation, Alternative Assessment, Summative Evaluation

Cromack, Theodore R.; Stone, Meredith K. – Perceptual and Motor Skills, 1980
Repeating validation procedures used for the Children's Group Embedded Figures Test Level II (ages 9-11), this Level I test of cognitive style was administered to a second grade sample. It proved reliable and significantly related to the individual Children's Embedded Figures Test and the Portable Rod-and-Frame Test. (Author/SJL)
Descriptors: Cognitive Style, Group Testing, Primary Education, Test Validity
Keating, Daniel P. – Early Education and Development, 2007
This article is a commentary for the special issue on the Early Development Instrument (EDI), a community tool to assess children's school readiness and developmental outcomes at a group level. The EDI is administered by kindergarten teachers, who assess their kindergarten students on 5 developmental domains: physical health and well-being, social…
Descriptors: School Readiness, Formative Evaluation, Kindergarten, Cognitive Development
Costin, Frank – Educ Psychol Meas, 1969
Descriptors: Group Testing, Hostility, Measurement, Neurosis