Publication Date
In 2025 | 1 |
Since 2024 | 38 |
Since 2021 (last 5 years) | 137 |
Since 2016 (last 10 years) | 279 |
Since 2006 (last 20 years) | 500 |
Descriptor
Source
Author
Powers, Donald E. | 10 |
Scruggs, Thomas E. | 7 |
Liu, Ou Lydia | 6 |
Wise, Steven L. | 6 |
Bridgeman, Brent | 5 |
Rios, Joseph A. | 5 |
Rogers, W. Todd | 5 |
Crisp, Victoria | 4 |
Lewandowski, Lawrence J. | 4 |
Moon, Jung Aa | 4 |
Petchauer, Emery | 4 |
More ▼ |
Publication Type
Education Level
Location
Canada | 17 |
United Kingdom | 16 |
China | 14 |
Germany | 13 |
Australia | 12 |
United States | 11 |
Taiwan | 9 |
United Kingdom (England) | 9 |
Texas | 8 |
California | 7 |
Iran | 7 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 10 |
Americans with Disabilities… | 1 |
Elementary and Secondary… | 1 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Baird, Jo-Anne; Caro, Daniel H.; Hopfenbeck, Therese N. – Irish Educational Studies, 2016
Entirely predictable examinations are ones for which the questions are known in advance. Some assessments are designed this way, but in public examinations, predictability is subtler. Students familiarise themselves with the requirements broadly: likely topics that will come up, question formats and how to maximise their marks. If students can…
Descriptors: Foreign Countries, High Stakes Tests, Student Attitudes, Test Wiseness
Guo, Qian; Kim, Young-Suk Grace; Yang, Li; Liu, Lihui – Reading and Writing: An Interdisciplinary Journal, 2016
Previewing answer-choice options before finishing reading the text is a widely employed test-taking behavior. In the present study we examined whether previewing is related to item response accuracy and response time, using data from Chinese learners of varying English proficiency levels and English native speakers. We examined eye movement…
Descriptors: English Language Learners, Reading Comprehension, Test Wiseness, Response Style (Tests)
Rios, Joseph A.; Liu, Ou Lydia; Bridgeman, Brent – New Directions for Institutional Research, 2014
This chapter describes a study that compares two approaches (self-reported effort [SRE] and response time effort [RTE]) for identifying low-effort examinees in student learning outcomes assessment. Although both approaches equally discriminated from measures of ability (e.g., SAT scores), RTE was found to have a stronger relationship with test…
Descriptors: Student Evaluation, Educational Assessment, Reaction Time, Measures (Individuals)
Cavendish, Wendy; Márquez, Adrián; Roberts, Mary; Suarez, Kristen; Lima, Wesley – Penn GSE Perspectives on Urban Education, 2017
In a nationwide effort to create standardized performance criteria, there has been an emphasis on testing data as the strict measurement of teacher and student success or failure (Volante & Sonia, 2010). These testing accountability systems, developed under No Child Left Behind (2001), were based on assumptions that high-stakes assessments…
Descriptors: Learner Engagement, High Stakes Tests, Accountability, Educational Legislation
Penk, Christiane; Richter, Dirk – Educational Assessment, Evaluation and Accountability, 2017
Since the turn of the century, an increasing number of low-stakes assessments (i.e., assessments without direct consequences for the test-takers) are being used to evaluate the quality of educational systems. Internationally, research has shown that low-stakes test results can be biased due to students' low test-taking motivation and that…
Descriptors: Test Wiseness, Student Motivation, Academic Achievement, Cognitive Tests
Zaidi, Nikki B.; Hwang, Charles; Scott, Sara; Stallard, Stefanie; Purkiss, Joel; Hortsch, Michael – Anatomical Sciences Education, 2017
Bloom's taxonomy was adopted to create a subject-specific scoring tool for histology multiple-choice questions (MCQs). This Bloom's Taxonomy Histology Tool (BTHT) was used to analyze teacher- and student-generated quiz and examination questions from a graduate level histology course. Multiple-choice questions using histological images were…
Descriptors: Taxonomy, Anatomy, Graduate Students, Scoring Formulas
Winke, Paula; Lim, Hyojung – Language Assessment Quarterly, 2017
To examine the effects of listening test preparation, we had three groups, two experimental and one control (63 learners total), partake in three types of instruction sandwiched between two equally difficult listening tests (pretests and posttests). The first experimental group took four practice tests and received "explicit"…
Descriptors: Listening Comprehension Tests, Test Preparation, Pretests Posttests, Experimental Groups
Predictability in High-Stakes Examinations: Students' Perspectives on a Perennial Assessment Dilemma
Elwood, Jannette; Hopfenbeck, Therese; Baird, Jo-Anne – Research Papers in Education, 2017
Key debates within educational assessment continuously encourage us to reflect on the design, delivery and implementation of examination systems as well as their relevance to students. In more recent times, such reflections have also required a rethinking of who is authoritative about assessment issues and whose views we seek in order to better…
Descriptors: Foreign Countries, Prediction, High Stakes Tests, Student Attitudes
Tempel, Tobias; Neumann, Roland – Journal of Experimental Education, 2016
We investigated processes underlying performance decrements of highly test-anxious persons. Three experiments contrasted conditions that differed in the degree of activation of concepts related to failure. Participants memorized a list of words either containing words related to failure or containing no words related to failure in Experiment 1. In…
Descriptors: Test Anxiety, Cognitive Tests, Test Wiseness, Foreign Countries
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Dawadi, Saraswati; Shrestha, Prithvi N. – Educational Assessment, 2018
There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…
Descriptors: Foreign Countries, Construct Validity, Reading Tests, English (Second Language)
Chou, Mu-Hsuan – Journal of Educational Research, 2019
Considerable evidence indicates that self-efficacy, task value, anxiety, and the use of language learning strategies are related. However, there is currently an insufficient understanding about their relations in high-stakes testing contexts. The author aimed to investigate how well social factors, test value, anxiety, test performance, and…
Descriptors: Foreign Countries, High School Students, Grade 12, Predictor Variables
Hamid, M. Obaidul; Hoang, Ngoc T. H. – TESL-EJ, 2018
Test-takers' voices in relation to high-stakes language tests have received growing attention in recent years. While the perspectives of this stakeholder group can be utilised to improve test quality, test-taking experience, and test impact, we argue that this goal needs to be achieved by considering a fundamental shift in our conceptualisation of…
Descriptors: Language Tests, High Stakes Tests, Testing, Humanization
Couchman, Justin J.; Miller, Noelle E.; Zmuda, Shaun J.; Feather, Kathryn; Schwartzmeyer, Tina – Metacognition and Learning, 2016
Students often gauge their performance before and after an exam, usually in the form of rough grade estimates or general feelings. Are these estimates accurate? Should they form the basis for decisions about study time, test-taking strategies, revisions, subject mastery, or even general competence? In two studies, undergraduates took a real…
Descriptors: Higher Education, College Students, Tests, Metacognition