Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 15 |
Descriptor
Test Items | 24 |
Foreign Countries | 23 |
Item Response Theory | 17 |
Test Bias | 7 |
Latent Trait Theory | 6 |
Mathematical Models | 5 |
Models | 5 |
Computer Assisted Testing | 4 |
Equations (Mathematics) | 4 |
Scores | 4 |
Adaptive Testing | 3 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 16 |
Reports - Research | 12 |
Reports - Evaluative | 9 |
Speeches/Meeting Papers | 3 |
Collected Works - Proceedings | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 3 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Secondary Education | 1 |
Audience
Researchers | 2 |
Location
Netherlands | 24 |
Indonesia | 2 |
Italy | 2 |
South Korea | 2 |
Turkey | 2 |
Australia | 1 |
Belgium | 1 |
Canada | 1 |
Chile | 1 |
Czech Republic | 1 |
Finland | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Avsar, Asiye Sengül – Participatory Educational Research, 2022
It is necessary to supply proof regarding the construct validity of the scales. Especially, when new scales are developed the construct validity is researched by the Exploratory Factor Analysis (EFA). Generally, factor extraction is performed via the Principal Component Analysis (PCA) which is not exactly factor analysis and the Principal Axis…
Descriptors: Factor Analysis, Automation, Construct Validity, Item Response Theory
Roelofs, Erik C.; Emons, Wilco H. M.; Verschoor, Angela J. – International Journal of Testing, 2021
This study reports on an Evidence Centered Design (ECD) project in the Netherlands, involving the theory exam for prospective car drivers. In particular, we illustrate how cognitive load theory, task-analysis, response process models, and explanatory item-response theory can be used to systematically develop and refine task models. Based on a…
Descriptors: Foreign Countries, Psychometrics, Test Items, Evidence Based Practice
Blijd-Hoogewys, Els M. A.; Bulgarelli, Daniela; Molina, Paola; van Geert, Paul L. C. – European Journal of Developmental Psychology, 2022
The extent to which Theory of Mind (ToM) performance is influenced by cultural and gender differences remains a subject of debate. A sample of 324 Dutch and 511 Italian children (52% boys; 2.8-11.7 years; 50% boys; 2.6-10.3 years; respectively) was administered the ToM Storybooks. Analysis focused on indicators of nonlinearity: moving standard…
Descriptors: Theory of Mind, Child Development, Cross Cultural Studies, Gender Differences
Agelink van Rentergem, Joost A.; Lever, Anne Geeke; Geurts, Hilde M. – Autism: The International Journal of Research and Practice, 2019
The Autism Spectrum Quotient is a widely used instrument for the detection of autistic traits. However, the validity of comparisons of Autism Spectrum Quotient scores between groups may be threatened by differential item functioning. Differential item functioning entails a bias in items, where participants with equal values of the latent trait…
Descriptors: Autism, Pervasive Developmental Disorders, Measures (Individuals), Test Validity
van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021
This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…
Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness
Vaheoja, Monika; Verhelst, N. D.; Eggen, T.J.H.M. – European Journal of Science and Mathematics Education, 2019
In this article, the authors applied profile analysis to Maths exam data to demonstrate how different exam forms, differing in difficulty and length, can be reported and easily interpreted. The results were presented for different groups of participants and for different institutions in different Maths domains by evaluating the balance. Some…
Descriptors: Feedback (Response), Foreign Countries, Statistical Analysis, Scores
Egberink, Iris J. L.; Meijer, Rob R.; Tendeiro, Jorge N. – Educational and Psychological Measurement, 2015
A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often only reported in terms of statistical significance, and researchers proposed different methods to empirically select anchor items. It is unclear, however, how many…
Descriptors: Personality Measures, Computer Assisted Testing, Measurement, Test Items
Deunk, Marjolein I.; van Kuijk, Mechteld F.; Bosker, Roel J. – Applied Measurement in Education, 2014
Standard setting methods, like the Bookmark procedure, are used to assist education experts in formulating performance standards. Small group discussion is meant to help these experts in setting more reliable and valid cutoff scores. This study is an analysis of 15 small group discussions during two standards setting trajectories and their effect…
Descriptors: Cutting Scores, Standard Setting, Group Discussion, Reading Tests
Hessen, David J. – Psychometrika, 2012
A multinormal partial credit model for factor analysis of polytomously scored items with ordered response categories is derived using an extension of the Dutch Identity (Holland in "Psychometrika" 55:5-18, 1990). In the model, latent variables are assumed to have a multivariate normal distribution conditional on unweighted sums of item…
Descriptors: Foreign Countries, Factor Analysis, Testing, Scoring
Tendeiro, Jorge N.; Meijer, Rob R. – Applied Psychological Measurement, 2012
This article extends the work by Armstrong and Shi on CUmulative SUM (CUSUM) person-fit methodology. The authors present new theoretical considerations concerning the use of CUSUM person-fit statistics based on likelihood ratios for the purpose of detecting cheating and random guessing by individual test takers. According to the Neyman-Pearson…
Descriptors: Cheating, Individual Testing, Adaptive Testing, Statistics
Veldkamp, Bernard P.; Verschoor, Angela J.; Eggen, Theo J. H. M. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010
Overexposure and underexposure of items in the bank are serious problems in operational computerized adaptive testing (CAT) systems. These exposure problems might result in item compromise, or point at a waste of investments. The exposure control problem can be viewed as a test assembly problem with multiple objectives. Information in the test has…
Descriptors: Adaptive Testing, Item Analysis, Computer Assisted Testing, Test Items
Korobko, Oksana B.; Glas, Cees A. W.; Bosker, Roel J.; Luyten, Johan W. – Journal of Educational Measurement, 2008
Methods are presented for comparing grades obtained in a situation where students can choose between different subjects. It must be expected that the comparison between the grades is complicated by the interaction between the students' pattern and level of proficiency on one hand, and the choice of the subjects on the other hand. Three methods…
Descriptors: Item Response Theory, Test Items, Comparative Analysis, Grades (Scholastic)
Keuning, Jos; Verhoeven, Ludo – Learning and Individual Differences, 2008
The purpose of the present study was to explore Dutch spelling development throughout the elementary grades. Two issues were considered (a) dimensional structure over time, and (b) rate of change. Whether the rate of change differs depending on gender, ethnicity, or word reading skill was examined in particular. A pseudolongitudinal dataset with…
Descriptors: Spelling, Reading Skills, Item Response Theory, Foreign Countries
Hambleton, Ronald K.; Swaminathan, H. – 1985
Comments are made on the review papers presented by six Dutch psychometricians: Ivo Molenaar, Wim van der Linden, Ed Roskam, Arnold Van den Wollenberg, Gideon Mellenbergh, and Dato de Gruijter. Molenaar has embraced a pragmatic viewpoint on Bayesian methods, using both empirical and pure approaches to solve educational research problems. Molenaar…
Descriptors: Bayesian Statistics, Decision Making, Elementary Secondary Education, Foreign Countries
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing
Previous Page | Next Page »
Pages: 1 | 2