ERIC - Search Results

Showing 1 to 15 of 40 results

Neutrosophic Estimators for Estimating the Population Mean in Survey Sampling

Peer reviewed

Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024

In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biased results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…

Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Survey Research and Analysis, 2nd Edition

Vaske, Jerry J. – Sagamore-Venture, 2019

Data collected from surveys can result in hundreds of variables and thousands of respondents. This implies that time and energy must be devoted to (a) carefully entering the data into a database, (b) running preliminary analyses to identify any problems (e.g., missing data, potential outliers), (c) checking the reliability and validity of the…

Descriptors: Surveys, Theories, Hypothesis Testing, Effect Size

Use of Jackknifing to Evaluate Effects of Anchor Item Selection on Equating with the Nonequivalent Groups with Anchor Test (NEAT) Design. Research Report. ETS RR-15-10

Peer reviewed

Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015

In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…

Descriptors: Test Construction, Equated Scores, Test Items, Sampling

Reliability and Validity of International Large-Scale Assessment: Understanding IEA's Comparative Studies of Student Achievement. IEA Research for Education. Volume 10

Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020

Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…

Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis

Measuring Understanding of Nanoscience and Nanotechnology: Development and Validation of the Nano-Knowledge Instrument (NanoKI)

Peer reviewed

Schönborn, K. J.; Höst, G. E.; Lundin Palmerius, K. E. – Chemistry Education Research and Practice, 2015

As the application of nanotechnology in everyday life impacts society, it becomes critical for citizens to have a scientific basis upon which to judge their perceived hopes and fears of 'nano'. Although multiple instruments have been designed for assessing attitudinal and affective aspects of nano, surprisingly little work has focused on…

Descriptors: Molecular Structure, Technology, Test Construction, Test Validity

Sources of Score Scale Inconsistency. Research Report. ETS RR-11-10

Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011

For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, sample sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…

Descriptors: Scores, Reliability, Equated Scores, Test Construction

Technical Report of the Survey of Adult Skills (PIAAC)

OECD Publishing, 2013

The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…

Descriptors: International Assessment, Adults, Skills, Test Construction

PISA 2012 Technical Report

OECD Publishing, 2014

The "PISA 2012 Technical Report" describes the methodology underlying the PISA 2012 survey, which tested 15-year-olds' competencies in mathematics, reading and science and, in some countries, problem solving and financial literacy. It examines the design and implementation of the project at a level of detail that allows researchers to…

Descriptors: International Assessment, Secondary School Students, Foreign Countries, Achievement Tests

Development and Psychometric Properties Gender Roles Attitude Scale

Peer reviewed

Zeyneloglu, Simge; Terzioglu, Fusun – Hacettepe University Journal of Education, 2011

This research was conducted for the purpose of developing a scaling tool to determine university students' attitudes towards gender roles. University students' attitudes should first be determined in order to change this traditional view to gender and to achieve a more egalitarian view. The research sample was comprised of one university's…

Descriptors: Student Attitudes, Sex Role, Measures (Individuals), Sampling

Introduction to the Development of the ISPCAN Child Abuse Screening Tools

Peer reviewed

Runyan, Desmond K.; Dunne, Michael P.; Zolotor, Adam J. – Child Abuse & Neglect: The International Journal, 2009

The "World Report on Children and Violence", (Pinheiro, 2006) was produced at the request of the UN Secretary General and the UN General Assembly. This report recommended improvement in research on child abuse. ISPCAN representatives took this charge and developed 3 new instruments. We describe this background and introduce three new measures…

Descriptors: Child Abuse, Screening Tests, Child Welfare, Test Construction

PISA 2006 Technical Report

OECD Publishing (NJ1), 2009

The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…

Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability

The Effects of Test Construction Variables Upon Test Reliability and Validity

Wofford, J. C.; Willoughby, T. L. – Calif J Educ Res, 1969

Descriptors: Correlation, Item Analysis, Sampling, Test Construction

Costs of Matrix Sampling of Test Items. ERIC Digest.

Childs, Ruth A.; Jaciw, Andrew P. – 2003

Matrix sampling of test items, the division of a set of items into different versions of a test form, is used by several large-scale testing programs. This Digest discusses nine categories of costs associated with matrix sampling. These categories are: (1) development costs; (2) materials costs; (3) administration costs; (4) educational costs; (5)…

Descriptors: Costs, Matrices, Reliability, Sampling

An Investigation of Full-And Subscale Reliabilities of Criterion-Referenced Tests.

Haladyna, Thomas M. – 1974

Classical test theory has been rejected for application to criterion-referenced (CR) tests by most psychometricians due to an expected lack of variance in scores and other difficulties. The present study was conceived to resolve the variance problem and explore the possibility that classical test theory is both appropriate and desirable for some…

Descriptors: Criterion Referenced Tests, Error of Measurement, Sampling, Test Construction
