NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)7
Since 2006 (last 20 years)24
What Works Clearinghouse Rating
Showing 1 to 15 of 252 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hsiao-Hui Lin; Tzeng, Yuh-Tsuen; Chen, Hsueh-Chih; Huang, Yao-Hsuan – Reading & Writing: Journal of the Reading Association of South Africa, 2020
Background: The issue of science is seldom brought into focus because of the way developing assessments of students' multiple text reading comprehension. Objectives: This study tested the sequential mediation model of scientific multi-text reading comprehension (SMTRC) by means of structural equation modelling (SEM), and aimed to advance the…
Descriptors: Science Education, Reading Comprehension, Reading Tests, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Schulz, Andreas; Leuders, Timo; Rangel, Ulrike – Journal of Psychoeducational Assessment, 2020
We provide evidence of validity for a newly developed diagnostic competence model of operation sense, by both (a) describing the theoretically substantiated development of the competence model in close association with its use within a large-scale formative assessment and (b) providing empirical evidence for the theoretically described cognitive…
Descriptors: Diagnostic Tests, Models, Criterion Referenced Tests, Cognitive Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Avsec, Stanislav; Jamšek, Janez – International Journal of Technology and Design Education, 2016
Technological literacy is identified as a vital achievement of technology- and engineering-intensive education. It guides the design of technology and technical components of educational systems and defines competitive employment in technological society. Existing methods for measuring technological literacy are incomplete or complicated,…
Descriptors: Technological Literacy, Elementary School Students, Secondary School Students, Evaluation Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Al-Habashneh, Maher Hussein; Najjar, Nabil Juma – Journal of Education and Practice, 2017
This study aimed at constructing a criterion-reference test to measure the research and statistical competencies of graduate students at the Jordanian governmental universities, the test has to be in its first form of (50) multiple choice items, then the test was introduced to (5) arbitrators with competence in measurement and evaluation to…
Descriptors: Foreign Countries, Criterion Referenced Tests, Graduate Students, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Combrinck, Celeste; Scherman, Vanessa; Maree, David – Perspectives in Education, 2016
This study describes how criterion-referenced feedback was produced from English language, mathematics and natural sciences monitoring assessments. The assessments were designed for grades 8 to 11 to give an overall indication of curriculum-standards attained in a given subject over the course of a year (N = 1113). The Rasch Item Map method was…
Descriptors: Item Response Theory, Feedback (Response), Criterion Referenced Tests, Academic Standards
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014
The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…
Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
McElhiney, Danielle; Kang, Minsoo; Starkey, Chad; Ragan, Brian – Measurement in Physical Education and Exercise Science, 2014
The purpose of the study was to improve the immediate and delayed memory sections of the Standardized Assessment of Concussion (SAC) by identifying a list of more psychometrically sound items (words). A total of 200 participants with no history of concussion in the previous six months (aged 19.60 ± 2.20 years; N?=?93 men, N?=?107 women)…
Descriptors: Head Injuries, Athletes, Item Analysis, Observation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mileff, Milo – Bulgarian Comparative Education Society, 2013
In the present paper and the discussion that follows, the author presents aspects of test construction and a careful description of instructional objectives. Constructing tests involves several stages such as describing language objectives, selecting appropriate test task, devising and assembling test tasks, and devising a scoring system for…
Descriptors: Behavioral Objectives, Test Construction, Norm Referenced Tests, Criterion Referenced Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Lieneck, Cristian; Morrison, Eileen; Price, Larry – Current Issues in Education, 2013
The Texas State University-San Marcos undergraduate healthcare administration program requires all bachelors of health administration (BHA) students to pass a comprehensive examination to demonstrate their knowledge of specific core competencies. This also demonstrates completion of their didactic coursework in order to enter a practical…
Descriptors: Exit Examinations, Health Services, Administrator Education, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016
The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…
Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Rivera, Jennifer E. – Career and Technical Education Research, 2011
The State of New York Agriculture Science Education secondary program is required to have a certification exam for students to assess their agriculture science education experience as a Regent's requirement towards graduation. This paper focuses on the procedure used to develop and validate two content sub-test questions within a…
Descriptors: Test Items, Item Banks, Test Construction, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Sooyeon; Livingston, Samuel A. – Journal of Educational Measurement, 2010
Score equating based on small samples of examinees is often inaccurate for the examinee populations. We conducted a series of resampling studies to investigate the accuracy of five methods of equating in a common-item design. The methods were chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating,…
Descriptors: Equated Scores, Test Items, Item Sampling, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2012
This was a study of differential item functioning (DIF) for grades 4, 7, and 10 reading and mathematics items from state criterion-referenced tests. The tests were composed of multiple-choice and constructed-response items. Gender DIF was investigated using POLYSIBTEST and a Rasch procedure. The Rasch procedure flagged more items for DIF than did…
Descriptors: Test Bias, Gender Differences, Reading Tests, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Educational and Psychological Measurement, 2011
Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on a RP criterion. This study investigates whether…
Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  17