ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	21

Descriptor

Correlation	35
Test Bias	35
Test Reliability	25
Test Items	16
Test Validity	15
Scores	14
Factor Analysis	10
Scoring	9
Statistical Analysis	9
College Students	8
Test Construction	8
Factor Structure	7
Foreign Countries	7
Psychometrics	7
Interrater Reliability	6
Reliability	6
College Entrance Examinations	4
Measures (Individuals)	4
Observation	4
Rating Scales	4
Response Style (Tests)	4
Simulation	4
Undergraduate Students	4
Construct Validity	3
Culture Fair Tests	3
More ▼

Source

Educational and Psychological…	9
ETS Research Report Series	4
ACT, Inc.	1
Educational Assessment	1
Eurasian Journal of…	1
Grantee Submission	1
Journal of Educational and…	1
Language Testing	1
Measurement and Evaluation in…	1
National Center for Education…	1
National Center for Education…	1
OECD Publishing	1
Partnership for Assessment of…	1
ProQuest LLC	1
Regional Educational…	1
Regional Educational…	1
Society for Research on…	1
More ▼

Publication Type

Reports - Research	24
Journal Articles	18
Reports - Evaluative	7
Numerical/Quantitative Data	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	11
Postsecondary Education	9
Early Childhood Education	2
Grade 7	2
Grade 8	2
High Schools	2
Kindergarten	2
Middle Schools	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 3	1
Primary Education	1
More ▼

Audience

Researchers

Location

California	1
Canada	1
China	1
Colorado (Denver)	1
Florida	1
New Mexico	1
New York (New York)	1
North Carolina (Charlotte)	1
South Africa	1
Taiwan	1
Tennessee (Memphis)	1
Texas	1
Texas (Dallas)	1
Turkey	1
United Arab Emirates	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	2
SAT (College Admission Test)	2
Stanford Binet Intelligence…	2
Wechsler Intelligence Scale…	2
ACT Interest Inventory	1
Beck Depression Inventory	1
Dynamic Indicators of Basic…	1
Early Childhood Longitudinal…	1
Personality Research Form	1
Rosenberg Self Esteem Scale	1
Sixteen Personality Factor…	1
Teaching and Learning…	1
Test of English as a Foreign…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 35 results Save | Export

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Peer reviewed

Direct link

Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…

Descriptors: Test Bias, Interrater Reliability, Responses, Correlation

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed
PDF on ERIC

Download full text

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Mountain or Molehill? A Simulation Study on the Impact of Response Styles

Peer reviewed

Direct link

Plieninger, Hansjörg – Educational and Psychological Measurement, 2017

Even though there is an increasing interest in response styles, the field lacks a systematic investigation of the bias that response styles potentially cause. Therefore, a simulation was carried out to study this phenomenon with a focus on applied settings (reliability, validity, scale scores). The influence of acquiescence and extreme response…

Descriptors: Response Style (Tests), Test Bias, Item Response Theory, Correlation

Development of a Tool to Assess Inference-Making and Reasoning in Biology

Peer reviewed
PDF on ERIC

Download full text

Direct link

Cromley, Jennifer G.; Dai, Ting; Fechter, Tia; Nelson, Frank E.; Van Boekel, Martin; Du, Yang – Grantee Submission, 2021

Making inferences and reasoning with new scientific information is critical for successful performance in biology coursework. Thus, identifying students who are weak in these skills could allow the early provision of additional support and course placement recommendations to help students develop their reasoning abilities, leading to better…

Descriptors: Science Tests, Multiple Choice Tests, Logical Thinking, Inferences

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed
PDF on ERIC

Download full text

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Operational Study 4: Accessibility of New Items/Functionality. Component 3 Report

Download full text

Steedle, Jeffrey; LaSalle, Amy – Partnership for Assessment of Readiness for College and Careers, 2016

Partnership for Assessment of Readiness for College and Careers (PARCC) Operational Study 4 Component 3 was designed to compare performance on PARCC mathematics field-test items for grade 3 taken with and without a drawing tool. For the 2016 testing window, five field-test items were selected to have the directions edited to allow students to…

Descriptors: Grade 3, Mathematics Tests, Test Items, Freehand Drawing

Measuring Process Quality in Early Childhood Education and Care through Situational Judgement Questions: Findings from TALIS Starting Strong 2018 Field Trial. OECD Education Working Papers, No. 217

Direct link

Nilsen, Trude; Slot, Pauline; Cigler, Hynek; Chen, Minge – OECD Publishing, 2020

Situational Judgement Questions (SJQs) measuring process quality were included in the OECD Starting Strong Teaching and Learning International Survey 2018 (TALIS Starting Strong 2018) to address concerns of self-report bias in large-scale international surveys. These SJQs provide the staff in early childhood education and care with situations…

Descriptors: Educational Quality, Situational Tests, Administrator Surveys, Teacher Surveys

Scientific Evidence for the Validity of the New Mexico Kindergarten Observation Tool. REL 2018-281

Peer reviewed
PDF on ERIC

Download full text

Dahlke, Katie; Yang, Rui; Martínez, Carmen; Chavez, Suzette; Martin, Alejandra; Hawkinson, Laura; Shields, Joseph; Garland, Marshall; Carle, Jill – Regional Educational Laboratory Southwest, 2017

The New Mexico Public Education Department developed the Kindergarten Observation Tool (KOT) as a multidimensional observational measure of students' knowledge and skills at kindergarten entry. The primary purpose of the KOT is to inform instruction, so that kindergarten teachers can use the information about their students' knowledge and skills…

Descriptors: Test Validity, Observation, Measures (Individuals), Kindergarten

On the Bias-Amplifying Effect of Near Instruments in Observational Studies

Peer reviewed
PDF on ERIC

Download full text

Steiner, Peter M.; Kim, Yongnam – Society for Research on Educational Effectiveness, 2014

In contrast to randomized experiments, the estimation of unbiased treatment effects from observational data requires an analysis that conditions on all confounding covariates. Conditioning on covariates can be done via standard parametric regression techniques or nonparametric matching like propensity score (PS) matching. The regression or…

Descriptors: Observation, Research Methodology, Test Bias, Regression (Statistics)

Pilot Testing the Chinese Version of the ETS® Proficiency Profile Critical Thinking Test. Research Report. ETS RR-16-37

Peer reviewed
PDF on ERIC

Download full text

Liu, Ou Lydia; Mao, Liyang; Zhao, Tingting; Yang, Yi; Xu, Jun; Wang, Zhen – ETS Research Report Series, 2016

Chinese higher education is experiencing rapid development and growth. With tremendous resources invested in higher education, policy makers have requested more direct evidence of student learning. However, assessment tools that can be used to measure college-level learning are scarce in China. To mitigate this situation, we translated the…

Descriptors: Foreign Countries, Higher Education, Critical Thinking, College Students

Improving the Factor Structure of Psychological Scales: The Expanded Format as an Alternative to the Likert Scale Format

Peer reviewed

Direct link

Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016

Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…

Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items

Applying Longitudinal Mean and Covariance Structures (LMACS) Analysis to Assess Construct Stability Over Two Time Points: An Example Using Psychological Entitlement

Peer reviewed

Direct link

Bashkov, Bozhidar M.; Finney, Sara J. – Measurement and Evaluation in Counseling and Development, 2013

Traditional methods of assessing construct stability are reviewed and longitudinal mean and covariance structures (LMACS) analysis, a modern approach, is didactically illustrated using psychological entitlement data. Measurement invariance and latent variable stability results are interpreted, emphasizing substantive implications for educators and…

Descriptors: Statistical Analysis, Longitudinal Studies, Reliability, Psychological Patterns

Investigating ESL Students' Performance on Outcomes Assessments in Higher Education

Peer reviewed

Direct link

Lakin, Joni M.; Elliott, Diane Cardenas; Liu, Ou Lydia – Educational and Psychological Measurement, 2012

Outcomes assessments are gaining great attention in higher education because of increased demand for accountability. These assessments are widely used by U.S. higher education institutions to measure students' college-level knowledge and skills, including students who speak English as a second language (ESL). For the past decade, the increasing…

Descriptors: College Outcomes Assessment, Achievement Tests, English Language Learners, College Students

Validity, Reliability, and Potential Bias of Short Forms of Students' Evaluation of Teaching: The Case of UAE University

Peer reviewed

Direct link

Dodeen, Hamzeh – Educational Assessment, 2013

Students' opinions continue to be a significant factor in the evaluation of teaching in higher education institutions. The purpose of this study was to psychometrically assess short students evaluation of teaching (SET) forms using the UAE University form as a model. The study evaluated the form validity, reliability, the overall question, and…

Descriptors: Foreign Countries, Student Evaluation of Teacher Performance, Test Validity, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3

Liu, Ou Lydia	3
Alliger, George M.	1
Bashkov, Bozhidar M.	1
Brown, R. L.	1
Carle, Jill	1
Chavez, Suzette	1
Chen, Minge	1
Cheng, Ying-Yao	1
Cigler, Hynek	1
Coen, Thomas	1
Cromley, Jennifer G.	1
Dahlke, Katie	1
Dai, Ting	1
Demir, Ergul	1
Dodeen, Hamzeh	1
Dorans, Neil J.	1
Du, Yang	1
Elliott, Diane Cardenas	1
Emons, Wilco H. M.	1
Fechter, Tia	1
Finney, Sara J.	1
Frary, Robert B.	1
Gallagher, Carole	1
Garland, Marshall	1
More ▼