ERIC - Search Results

Showing all 5 results

Digital-First Assessments: A Security Framework

Peer reviewed

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process, from design and development to score reporting and evaluation, to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

Rater Certification Tests: A Psychometric Approach

Peer reviewed

Attali, Yigal – Educational Measurement: Issues and Practice, 2019

Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…

Descriptors: Evaluators, Certification, High Stakes Tests, Scoring

Immediate Feedback and Opportunity to Revise Answers to Open-Ended Questions

Peer reviewed

Attali, Yigal; Powers, Don – Educational and Psychological Measurement, 2010

Two experiments examine the psychometric effects of providing immediate feedback on the correctness of answers to open-ended questions, and allowing participants to revise their answers following feedback. Participants answering verbal and math questions are able to correct many of their initial incorrect answers, resulting in higher revised…

Descriptors: Feedback (Response), Psychometrics, Test Anxiety, Error Correction

Reliability of Speeded Number-Right Multiple-Choice Tests. Research Report. RR-04-15

Peer reviewed

Attali, Yigal – ETS Research Report Series, 2004

Contrary to common belief, reliability estimates of number-right multiple-choice tests are not inflated by speededness. Because examinees guess on questions when they run out of time, the responses to these questions show less consistency with the responses of other questions, and the reliability of the test will be decreased. The surprising…

Descriptors: Multiple Choice Tests, Timed Tests, Test Reliability, Guessing (Tests)

Construct Validity of "e-rater"® in Scoring TOEFL® Essays. Research Report. ETS RR-07-21

Peer reviewed

Attali, Yigal – ETS Research Report Series, 2007

This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…

Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)