ERIC - Search Results

Showing all 10 results

Evaluating Human Scoring Using Generalizability Theory

Peer reviewed

Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020

Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…

Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

The Ego Resiliency Scale-Revised: Confirmatory Factor Analysis and Rasch Models

Peer reviewed

Denovan, Andrew; Dagnall, Neil; Drinkwater, Ken – Journal of Psychoeducational Assessment, 2022

This study examined the psychometric properties of the Ego Resiliency Scale-Revised (ER89-R). Though support exists for a multidimensional conceptualisation using classical test theory approaches (i.e., a higher-order model comprising Openness to Life Experiences and Optimal Regulation factors), this measure has not been subjected to Rasch…

Descriptors: Likert Scales, Self Concept, Resilience (Psychology), Factor Analysis

The Word Part Levels Test

Peer reviewed

Sasao, Yosuke; Webb, Stuart – Language Teaching Research, 2017

Knowledge of English affixes plays a significant role in increasing knowledge of words. However, few attempts have been made to create a valid and reliable measure of affix knowledge. The Word Part Levels Test (WPLT) was developed to measure three aspects of affix knowledge: form (recognition of written affix forms), meaning (knowledge of affix…

Descriptors: English (Second Language), Second Language Learning, Language Tests, Morphemes

Offshore and Onsite Placement Testing for English Pathway Programmes

Peer reviewed

Roche, Thomas; Harrington, Michael – Journal of Further and Higher Education, 2018

English language programmes provide established pathways for international students seeking university admission in countries such as Australia and the United Kingdom. In order to refer international applicants to appropriate levels and durations of English language support prior to matriculation into their main course of study, pathway providers…

Descriptors: Student Placement, College Admission, College Students, Foreign Students

Using Oral Exams to Assess Psychological Literacy: The Final Year Research Project Interview

Peer reviewed

Turner, Mark; Davila-Ross, Marina – Psychology Teaching Review, 2015

The ability to reason scientifically and communicate research appropriately is central to psychological literacy. Scientific research has little value unless scientists are able to convey results and their consequences clearly to others. In this study, we outline a method of assessing the development of psychological literacy in undergraduate…

Descriptors: Interviews, Research Projects, Psychological Studies, Verbal Communication

The Interrelations of Features of Questions, Mark Schemes and Examinee Responses and Their Impact upon Marker Agreement

Peer reviewed

Black, Beth; Suto, Irenka; Bramley, Tom – Assessment in Education: Principles, Policy & Practice, 2011

In this paper we develop an evidence-based framework for considering many of the factors affecting marker agreement in GCSEs and A levels. A logical analysis of the demands of the marking task suggests a core grouping comprising: (i) question features; (ii) mark scheme features; and (iii) examinee response features. The framework synthesises…

Descriptors: Interrater Reliability, Grading, Scoring, High Stakes Tests

Technical Report of the Survey of Adult Skills (PIAAC)

OECD Publishing, 2013

The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…

Descriptors: International Assessment, Adults, Skills, Test Construction

The Question Tariff Problem in GCSE Mathematics.

Peer reviewed

Bramley, Tom – Evaluation & Research in Education, 2001

Analyzed data from a session of the General Certificate of Secondary Education (GCSE) mathematics examination to identify items displaying a bi-modal expected score distribution, try to explain the bi-modality, rescore the items to remove under-used middle categories, and determine the effect on test reliability of rescoring the data. Discusses…

Descriptors: Foreign Countries, Mathematics Tests, Reliability, Scores

Setting Objective Tests.

Peer reviewed

Jones, Allan – Journal of Geography in Higher Education, 1997

Examines the increase in popularity of objective testing in the United Kingdom and addresses some of the accompanying academic issues. Reports on a case study of test production and implementation to illustrate issues of time costs and benefits. Discusses question styles, marking schemes, and the problem of guesswork. (MJP)

Descriptors: Comparative Testing, Educational Practices, Educational Trends, Foreign Countries