ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	15

Descriptor

Group Testing	72
Test Reliability	72
Test Validity	45
Test Construction	24
Intelligence Tests	16
Testing Problems	14
Testing	12
Standardized Tests	11
Individual Testing	10
Test Bias	10
Scoring	9
Test Interpretation	8
Aptitude Tests	7
Computer Assisted Testing	7
Factor Analysis	7
Higher Education	7
Elementary Education	6
Item Analysis	6
Test Format	6
Academic Achievement	5
Achievement Tests	5
Comparative Analysis	5
Elementary Secondary Education	5
Evaluation Methods	5
Exceptional Child Research	5
More ▼

Publication Type

Reports - Research	27
Journal Articles	19
Reports - Evaluative	8
Speeches/Meeting Papers	6
Tests/Questionnaires	4
Reports - Descriptive	2
Books	1
Collected Works - General	1
Guides - Non-Classroom	1
Information Analyses	1
Opinion Papers	1
Reference Materials -…	1
More ▼

Education Level

Elementary Secondary Education	3
Early Childhood Education	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners	3
Researchers	2
Administrators	1
Teachers	1

Location

Canada	2
Cyprus	1
New Jersey	1
United Kingdom	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Stanford Binet Intelligence…	3
Wechsler Intelligence Scale…	2
Developmental Test of Visual…	1
Early Childhood Longitudinal…	1
Group Embedded Figures Test	1
Manifest Anxiety Scale	1
Minnesota Tests of Creative…	1
Nowicki Strickland Locus of…	1
Peabody Picture Vocabulary…	1
State Trait Anxiety Inventory	1
Stroop Color Word Test	1
Test of Logical Thinking	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Test Reliability X

Showing 1 to 15 of 72 results Save | Export

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Direct link

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Issues and Concerns in Classroom Assessment Practices

Download full text

Areekkuzhiyil, Santhosh – Online Submission, 2021

Assessment is an integral part of any teaching learning process. Assessment has large number of functions to perform, whether it is formative or summative. This paper analyse the issues involved and the areas of concern in the classroom assessment practice and discusses the recent reforms take place. [This paper was published in Edutracks v20 n8…

Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity

Hybrid Computerized Adaptive Testing: From Group Sequential Design to Fully Sequential Design

Peer reviewed

Direct link

Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016

Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach

Changing the English Classroom: When Large-Scale "Common" Testing Meets Secondary Curriculum and Instruction in the United States

Peer reviewed

Direct link

Cimbricz, Sandra K.; McConn, Matthew L. – Changing English: Studies in Culture and Education, 2015

This article explores the intersection of new, large-scale standards-based testing, teacher accountability policy, and secondary curriculum and instruction in the United States. Two federally funded consortia--the Smarter Balanced Assessment Consortium and the Partnership for Readiness of College and Careers--prove focal to this paper, as these…

Descriptors: Group Testing, English Instruction, Secondary School Curriculum, Accountability

The Role of Multiple-Group Measurement Invariance in Family Psychology Research

Peer reviewed
PDF on ERIC

Download full text

Direct link

Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016

Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…

Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives

Establishing the Reliability and Validity of a Computerized Assessment of Children's Working Memory for Use in Group Settings

Peer reviewed

Direct link

St Clair-Thompson, Helen – Journal of Psychoeducational Assessment, 2014

The aim of the present study was to investigate the reliability and validity of a brief standardized assessment of children's working memory; "Lucid Recall." Although there are many established assessments of working memory, "Lucid Recall" is fully automated and can therefore be administered in a group setting. It is therefore…

Descriptors: Test Reliability, Test Validity, Computer Assisted Testing, Cognitive Tests

Synchronous Technological Administration of Data Collection Instruments: An Ergonomic Method for Group Administration

Peer reviewed
PDF on ERIC

Download full text

Yaratan, Huseyin; Suphi, Nilgun – Turkish Online Journal of Educational Technology - TOJET, 2013

Questionnaires administered manually can cause surreptitious peer pressure on the candidate to finish when 'the others" have completed theirs, forcing students to rush or skip individual items or may hinder the ability of noticing participants who may be having difficulty understanding certain items. These drawbacks can have serious…

Descriptors: Synchronous Communication, Questionnaires, Computer Assisted Testing, Undergraduate Students

Improving the Utility of Large-Scale Assessments in Canada

Peer reviewed
PDF on ERIC

Download full text

Direct link

Rogers, W. Todd – Canadian Journal of Education, 2014

Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…

Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Innovative Assessment for Learning Enhancement: Issues and Practices

Peer reviewed
PDF on ERIC

Download full text

Zacharis, Nick Z. – Contemporary Issues in Education Research, 2010

Although summative assessment is indispensable for determining whether or not students meet the content standards, it alone is insufficient for providing teachers and administrators with the information necessary to make ongoing decisions about instruction. This article looks at the motivational impact of the assessment on students' achievement…

Descriptors: Educational Practices, Educational Innovation, Alternative Assessment, Summative Evaluation

Test Review: Review of the Certificate of Proficiency in English (CPE) Speaking Test

Peer reviewed

Direct link

Macqueen, Susy; Harding, Luke – Language Testing, 2009

In 2002 the University of Cambridge Local Examinations Syndicate (UCLES) implemented a revised version of the Certificate of Proficiency in English (CPE). CPE, which is the highest level of the Main Suite of Cambridge ESOL exams, comprises five modules, "Reading," "Writing," "Use of English," "Listening" and "Speaking," the latter of which is the…

Descriptors: Speech Communication, Test Reviews, Examiners, English (Second Language)

Development and Preliminary Validation of a Measure for Assessing Staff Perspectives on the Quality of Clinical Group Supervision

Peer reviewed

Direct link

Horton, Simon; de Lourdes Drachler, Maria; Fuller, Alison; de Carvalho Leite, Jose Carlos – International Journal of Language & Communication Disorders, 2008

Background: In the UK clinical supervision is regarded as an essential process supporting quality improvement within the clinical governance framework, and the Royal College of Speech and Language Therapists regards it as a tool for promoting critical reflective practice. There is limited evidence of the impact on practice or improvements in…

Descriptors: Health Personnel, Supervision, Questionnaires, Factor Analysis

Formative Evaluation of the Early Development Instrument: Progress and Prospects

Peer reviewed

Direct link

Keating, Daniel P. – Early Education and Development, 2007

This article is a commentary for the special issue on the Early Development Instrument (EDI), a community tool to assess children's school readiness and developmental outcomes at a group level. The EDI is administered by kindergarten teachers, who assess their kindergarten students on 5 developmental domains: physical health and well-being, social…

Descriptors: School Readiness, Formative Evaluation, Kindergarten, Cognitive Development

The Scrambled Sentence Test: A Group Measure of Hostility

Costin, Frank – Educ Psychol Meas, 1969

Descriptors: Group Testing, Hostility, Measurement, Neurosis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Journal of Educational…	2
Psychology in the Schools	2
American Educational Research…	1
Applied Measurement in…	1
Canadian Journal of Education	1
Changing English: Studies in…	1
Contemporary Issues in…	1
Early Education and…	1
Educ Psychol Meas	1
Educational Evaluation and…	1
Educational Research and…	1
Grantee Submission	1
Harvard Educational Review	1
International Journal of…	1
J Exp Educ	1
J Sch Psychol	1
Journal of Black Studies	1
Journal of Counseling &…	1
Journal of Educational…	1
Journal of Personality…	1
Journal of Psychoeducational…	1
Journal of Research and…	1
Journal of Research in…	1
Language Testing	1
Meas Evaluation Guidance	1
More ▼

Capie, William	2
Hopkins, Kenneth D.	2
Tobin, Kenneth G.	2
Alliger, R. J.	1
Areekkuzhiyil, Santhosh	1
Backman, Margaret E.	1
Barnette, J. Jackson	1
Batemen, Barbara	1
Becker, John T.	1
Betz, Nancy E.	1
Bracht, Glenn H.	1
Brennan, Robert L., Ed.	1
Broman, Harvey J.	1
Camilli, Gregory	1
Chang, Hua-Hua	1
Chissom, Brad	1
Chletsos, Peter N.	1
Cimbricz, Sandra K.	1
Cliff, Norman	1
Colligan, Robert C.	1
Costin, Frank	1
Danqi Zhu	1
Deck, Dennis	1
Denson, Teri A.	1
More ▼