Publication Date
In 2025 | 1 |
Since 2024 | 26 |
Since 2021 (last 5 years) | 75 |
Since 2016 (last 10 years) | 199 |
Since 2006 (last 20 years) | 410 |
Descriptor
Test Content | 820 |
Test Construction | 283 |
Test Items | 262 |
Test Validity | 186 |
Foreign Countries | 167 |
Test Format | 156 |
Student Evaluation | 137 |
Test Reliability | 134 |
Elementary Secondary Education | 125 |
Testing | 110 |
Standardized Tests | 105 |
Author
Sireci, Stephen G. | 9 |
Kitao, Kenji | 4 |
Kitao, S. Kathleen | 4 |
Papageorgiou, Spiros | 4 |
Thurlow, Martha L. | 4 |
Winnick, Joseph P. | 4 |
van der Linden, Wim J. | 4 |
Chang, Hua-Hua | 3 |
Donovan, Jenny | 3 |
Ewing, Maureen | 3 |
Hau, Kit-Tai | 3 |
Audience
Teachers | 68 |
Practitioners | 59 |
Administrators | 20 |
Students | 15 |
Policymakers | 9 |
Researchers | 7 |
Parents | 6 |
Counselors | 3 |
Community | 2 |
Support Staff | 1 |
Location
Australia | 18 |
California | 15 |
Canada | 14 |
China | 12 |
United States | 12 |
Massachusetts | 9 |
Europe | 8 |
Georgia | 8 |
Japan | 8 |
Rhode Island | 8 |
Turkey | 8 |
Zhang, Xinxin; Gierl, Mark – Journal of Educational Issues, 2016
The purpose of this study is to describe a methodology to recover the item model used to generate multiple-choice test items with a novel graph theory approach. Beginning with the generated test items and working backward to recover the original item model provides a model-based method for validating the content used to automatically generate test…
Descriptors: Test Items, Automation, Content Validity, Test Validity
Harlacher, Jason – Regional Educational Laboratory Central, 2016
Educators have many decisions to make and it's important that they have the right data to inform those decisions and access to questionnaires that can gather that data. This guide, developed by REL Central and based on work done through separate projects with the Wyoming Office of Public Instruction and the Nebraska Department of Education,…
Descriptors: Questionnaires, Test Construction, Student Surveys, Teacher Surveys
Frost, Kellie; Clothier, Josh; Huisman, Annemiek; Wigglesworth, Gillian – Language Testing, 2020
Integrated speaking tasks requiring test takers to read and/or listen to stimulus texts and to incorporate their content into oral performances are now used in large-scale, high-stakes tests, including the TOEFL iBT. These tasks require test takers to identify, select, and combine relevant source text information to recognize key relationships…
Descriptors: Discourse Analysis, Scoring Rubrics, Speech Communication, English (Second Language)
Updated Assessment Principles and Guidelines for English Learners with Disabilities. NCEO Report 424
Liu, Kristin K.; Lazarus, Sheryl S.; Thurlow, Martha L.; Jarmin, Jaime; Ward, Jenna; Christensen, Laurene – National Center on Educational Outcomes, 2020
This report is an update of the assessment principles and guidelines for English language learners published in 2013 (Thurlow, Liu, Ward, & Christensen). That report, which was developed by the Improving the Validity of Assessment Results for English Language Learners with Disabilities (IVARED) project, presented essential principles of…
Descriptors: English Language Learners, Students with Disabilities, Student Evaluation, Evaluation Methods
Continual Improvement of a Student Evaluation of Teaching over Seven Semesters at a State University
Rates, Christopher; Liu, Xiufeng; Vanzile-Tamzen, Carol; Morreale, Cathleen – AERA Online Paper Repository, 2017
In the fall of 2014, the University at Buffalo created a new universal Student Evaluation of Teaching (SET). The purpose of the present study was to establish the construct validity of SET items. Rasch analyses of data from 7 semesters (N=203,194 students) revealed problems with item fit indices and threshold distances. Changes to items and…
Descriptors: Student Evaluation of Teacher Performance, State Universities, College Students, Teacher Effectiveness
Venticinque, Danilo; Whitworth, Andrew – Journal of Media Literacy Education, 2018
This article discusses the outcomes of research into the media literacy aspects of ENEM ("Exame Nacional do Ensino Médio"), Brazil's unified university entrance exam, which contains a significant number of exam questions based on excerpts from newspaper articles, online news and other media sources. Through content analysis, these…
Descriptors: Foreign Countries, College Entrance Examinations, Media Literacy, Test Content
Jenna A. Altherr Flores – Educational Linguistics, 2021
This study is a critical analysis of a low-stakes in-house English as a Second Language (ESL) and English literacy test from a local program in a large city in the southwestern United States. From a critical multimodal social semiotic perspective (Kress G. Multimodality: A social semiotic approach to contemporary communication. Routledge, 2010;…
Descriptors: Adult Literacy, Refugees, English (Second Language), Second Language Learning
Papageorgiou, Spiros; Xu, Xiaoqiu; Timpe-Laughlin, Veronika; Dugdale, Deborah M. – Educational Testing Service, 2020
The purpose of this study is to examine the appropriateness of using the "TOEFL Primary®" tests to evaluate the language abilities of students learning English as a foreign language (EFL) through an online-delivered curriculum, the VIPKid Major Course (MC). Data include student test scores on the TOEFL Primary Listening and Reading tests…
Descriptors: Alignment (Education), Language Tests, English (Second Language), Second Language Learning
Wolkowitz, Amanda A.; Davis-Becker, Susan L.; Gerrow, Jack D. – Journal of Applied Testing Technology, 2016
The purpose of this study was to investigate the impact of a cheating prevention strategy employed for a professional credentialing exam that involved releasing over 7,000 active and retired exam items. This study evaluated: 1) If any significant differences existed between examinee performance on released versus non-released items; 2) If item…
Descriptors: Cheating, Test Content, Test Items, Foreign Countries
Confrey, Jere; Shah, Meetal; Persson, Jennifer; Ciliano, Dagmara – North American Chapter of the International Group for the Psychology of Mathematics Education, 2019
This paper reports on a design-based implementation study of the use of a diagnostic classroom assessment tool framed on learning trajectories (LTs) for middle grades mathematics, where teachers and students are provided immediate data on students' progress along LTs. The study answers the question: "How can one characterize the challenges…
Descriptors: Middle School Students, Mathematics Instruction, Barriers, Diagnostic Tests
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
Förster, Manuel; Happ, Roland; Molerov, Dimitar – Journal of Economic Education, 2017
In this article, the authors present the adaptation and validation processes conducted to render the American "Test of Financial Literacy" (TFL) suitable for use in Germany (TFL-G). First, they outline the translation procedure followed and the various cultural adjustments made in line with international standards. Next, they present…
Descriptors: Money Management, Tests, Scores, Test Content
Komperda, Regis; Hosbein, Kathryn N.; Barbera, Jack – Chemistry Education Research and Practice, 2018
Increased understanding of the importance of the affective domain in chemistry education research has led to the development and adaptation of instruments to measure chemistry-specific affective traits, including motivation. Many of these instruments are adapted from other fields by using the word "chemistry" in place of other…
Descriptors: Chemistry, Affective Measures, Student Motivation, Majors (Students)
Ramírez-Uclés, Rafael; Castro-Rodríguez, Elena; Piñeiro, Juan Luis; Ruiz-Hidalgo, Juan F. – European Early Childhood Education Research Journal, 2018
This article begins with a theoretical discussion of the characteristics that a task should feature to be regarded as a mathematics problem suitable for pre-primary students. Those considerations are followed by a report of a classroom experience in which three problems involving quotative or partitive division were posed to pre-primary school…
Descriptors: Early Childhood Education, Task Analysis, Arithmetic, Class Activities
Catalano, Corinne Gaffney – ProQuest LLC, 2018
This is a multi-method study to develop and validate an instrument to measure teachers' self-efficacy for teaching students with autism spectrum disorder (ASD) in inclusive early childhood classrooms, "Teacher Self-efficacy for Teaching Students with ASD Inclusive Classrooms Scale": TSE-ASDI. I conducted literature and expert reviews as…
Descriptors: Teacher Effectiveness, Self Efficacy, Inclusion, Special Education