Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024
In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biased results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…
Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement
Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022
When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…
Descriptors: Item Response Theory, Test Construction, Scoring, Testing
Vaske, Jerry J. – Sagamore-Venture, 2019
Data collected from surveys can result in hundreds of variables and thousands of respondents. This implies that time and energy must be devoted to (a) carefully entering the data into a database, (b) running preliminary analyses to identify any problems (e.g., missing data, potential outliers), (c) checking the reliability and validity of the…
Descriptors: Surveys, Theories, Hypothesis Testing, Effect Size
Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015
In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…
Descriptors: Test Construction, Equated Scores, Test Items, Sampling
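The abstract above is truncated, so the authors' exact procedure is not shown here. As a rough illustration of the general idea only, the following is a minimal sketch of a delete-one jackknife over anchor items. It assumes a deliberately simple "equating constant" (the mean difference in anchor-item difficulty between two forms); the function name `jackknife_se` and the difficulty differences are hypothetical, not taken from the report.

```python
import math


def jackknife_se(values, statistic):
    """Delete-one jackknife standard error of `statistic` over `values`.

    Each anchor item is dropped in turn, the statistic is recomputed on
    the remaining items, and the spread of those recomputed values is
    turned into a standard error.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two anchor items")
    leave_one_out = [statistic(values[:i] + values[i + 1:]) for i in range(n)]
    mean_loo = sum(leave_one_out) / n
    variance = (n - 1) / n * sum((t - mean_loo) ** 2 for t in leave_one_out)
    return math.sqrt(variance)


def mean(xs):
    return sum(xs) / len(xs)


# Hypothetical per-item difficulty differences (new form minus old form)
# for five anchor items. Illustrative numbers only.
anchor_diffs = [0.10, -0.20, 0.30, 0.00, 0.05]

shift = mean(anchor_diffs)                   # assumed equating constant
shift_se = jackknife_se(anchor_diffs, mean)  # sensitivity to anchor choice
print(f"shift = {shift:.3f}, jackknife SE = {shift_se:.3f}")
```

A large jackknife standard error relative to the shift would suggest that the result depends heavily on which anchor items are included. A real equating study would replace `mean` with the full equating function.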
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Schönborn, K. J.; Höst, G. E.; Lundin Palmerius, K. E. – Chemistry Education Research and Practice, 2015
As the application of nanotechnology in everyday life impacts society, it becomes critical for citizens to have a scientific basis upon which to judge their perceived hopes and fears of 'nano'. Although multiple instruments have been designed for assessing attitudinal and affective aspects of nano, surprisingly little work has focused on…
Descriptors: Molecular Structure, Technology, Test Construction, Test Validity
Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011
For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, sample sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…
Descriptors: Scores, Reliability, Equated Scores, Test Construction
OECD Publishing, 2013
The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…
Descriptors: International Assessment, Adults, Skills, Test Construction
OECD Publishing, 2014
The "PISA 2012 Technical Report" describes the methodology underlying the PISA 2012 survey, which tested 15-year-olds' competencies in mathematics, reading and science and, in some countries, problem solving and financial literacy. It examines the design and implementation of the project at a level of detail that allows researchers to…
Descriptors: International Assessment, Secondary School Students, Foreign Countries, Achievement Tests
Zeyneloglu, Simge; Terzioglu, Fusun – Hacettepe University Journal of Education, 2011
This research was conducted for the purpose of developing a scaling tool to determine university students' attitudes towards gender roles. University students' attitudes should first be determined in order to change this traditional view of gender and to achieve a more egalitarian view. The research sample was comprised of one university's…
Descriptors: Student Attitudes, Sex Role, Measures (Individuals), Sampling
Runyan, Desmond K.; Dunne, Michael P.; Zolotor, Adam J. – Child Abuse & Neglect: The International Journal, 2009
The "World Report on Children and Violence" (Pinheiro, 2006) was produced at the request of the UN Secretary General and the UN General Assembly. This report recommended improvement in research on child abuse. ISPCAN representatives took this charge and developed three new instruments. We describe this background and introduce three new measures…
Descriptors: Child Abuse, Screening Tests, Child Welfare, Test Construction
OECD Publishing (NJ1), 2009
The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…
Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability
Wofford, J. C.; Willoughby, T. L. – California Journal of Educational Research, 1969
Descriptors: Correlation, Item Analysis, Sampling, Test Construction
Childs, Ruth A.; Jaciw, Andrew P. – 2003
Matrix sampling of test items, the division of a set of items into different versions of a test form, is used by several large-scale testing programs. This Digest discusses nine categories of costs associated with matrix sampling. These categories are: (1) development costs; (2) materials costs; (3) administration costs; (4) educational costs; (5)…
Descriptors: Costs, Matrices, Reliability, Sampling
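The definition of matrix sampling given above (dividing a set of items into different versions of a test form) can be illustrated with a small sketch. This is an assumption-laden toy, not the Digest's procedure: it deals items round-robin into disjoint forms, whereas operational programs often use overlapping or balanced block designs. The function name `matrix_sample` and the item labels are hypothetical.

```python
def matrix_sample(items, n_forms):
    """Deal `items` round-robin into `n_forms` disjoint test forms.

    Every item appears on exactly one form, and form lengths differ by
    at most one item.
    """
    if n_forms < 1:
        raise ValueError("n_forms must be at least 1")
    forms = [[] for _ in range(n_forms)]
    for index, item in enumerate(items):
        forms[index % n_forms].append(item)
    return forms


# Hypothetical 12-item pool split across 3 forms.
item_pool = [f"item{k:02d}" for k in range(1, 13)]
forms = matrix_sample(item_pool, 3)

for number, form in enumerate(forms, start=1):
    print(f"Form {number}: {form}")
```

Each examinee then answers only one form (4 items here instead of 12), which is the source of the trade-offs in cost and reliability that the Digest's categories address.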
Haladyna, Thomas M. – 1974
Classical test theory has been rejected for application to criterion-referenced (CR) tests by most psychometricians due to an expected lack of variance in scores and other difficulties. The present study was conceived to resolve the variance problem and explore the possibility that classical test theory is both appropriate and desirable for some…
Descriptors: Criterion Referenced Tests, Error of Measurement, Sampling, Test Construction