Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 19 |
Descriptor
Mathematics Tests | 41 |
Reading Tests | 41 |
Test Validity | 41 |
Test Reliability | 17 |
Achievement Tests | 14 |
Test Items | 12 |
Elementary Secondary Education | 11 |
Comparative Analysis | 10 |
Elementary School Students | 10 |
Scores | 9 |
Test Construction | 9 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Education | 6 |
Elementary Secondary Education | 6 |
Grade 4 | 6 |
Early Childhood Education | 5 |
Grade 3 | 4 |
Intermediate Grades | 4 |
Grade 10 | 3 |
Grade 5 | 3 |
Middle Schools | 3 |
Postsecondary Education | 3 |
Primary Education | 3 |
More ▼ |
Audience
Researchers | 4 |
Location
Australia | 2 |
Maryland | 2 |
Massachusetts | 2 |
Minnesota | 2 |
Nevada | 2 |
New Jersey | 2 |
New Mexico | 2 |
Rhode Island | 2 |
Alabama | 1 |
Arizona | 1 |
California | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meyer, J. Patrick; Hu, Ann; Li, Sylvia – NWEA, 2023
The Content Proximity Project was designed to improve the content validity of the MAP® Growth™ assessments while retaining the ability for the test to adapt off-grade and meet students wherever they are in their learning. Two main features of the project were the development of an enhanced item selection algorithm, and a spring pilot study…
Descriptors: Achievement Tests, Mathematics Achievement, Content Validity, Mathematics Tests
Van Norman, Ethan R.; Forcht, Emily R. – Assessment for Effective Intervention, 2023
This study explored the validity of growth on two computer adaptive tests, Star Reading and Star Math, in explaining performance on an end-of-year achievement test for a sample of students in Grades 3 through 6. Results from quantile regression analyses indicate that growth on Star Reading explained a statistically significant amount of variance…
Descriptors: Test Validity, Computer Assisted Testing, Adaptive Testing, Grade Prediction
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Anselmo, Giancarlo A.; Yarbrough, Jamie L.; Tran, Van Vi N. – Journal of Psychoeducational Assessment, 2021
This study analyzed the relationship between benchmark scores from the newly published Dynamic Indicators of Basic Early Literacy Skills Math (i.e., Acadience™) math probes and student performance on math and reading sections of a state-mandated high-stakes test. Participants were 420 students enrolled in third, fourth, and fifth grades in a rural…
Descriptors: Elementary School Students, Mathematics Tests, Benchmarking, High Stakes Tests
Clark, Judy – set: Research Information for Teachers, 2019
The PM Benchmark Reading Assessment Resource (PM Benchmarks) provides information about primary students' level of achievement in reading accurately, fluently, and in understanding unseen texts. As part of the assessment, students have to orally retell what they have read and orally answer comprehension questions. This oral use of language…
Descriptors: Benchmarking, Elementary School Students, Reading Achievement, Reading Fluency
Rogers, Christopher M.; Thurlow, Martha L.; Lazarus, Sheryl S.; Liu, Kristin K. – National Center on Educational Outcomes, 2019
The purpose of this report is to present a synthesis of the research on test accommodations published in 2015 and 2016. We summarize the research to review current research trends and enhance understanding of the implications of accommodations use in the development of future policy directions, to highlight implementation of current and new…
Descriptors: Testing Accommodations, Students with Disabilities, Elementary Secondary Education, Postsecondary Education
Steedle, Jeffrey; Quesen, Sarah; Boyd, Aimee – Partnership for Assessment of Readiness for College and Careers, 2017
On the Partnership for Assessment of Readiness for College and Careers (PARCC) assessments, the attainment of performance level 4 is intended to indicate college readiness or being "on track" to college and career readiness. Students who achieve Level 4 should have a 0.75 probability of attaining at least a C in entry-level,…
Descriptors: College Readiness, Career Readiness, Test Validity, Longitudinal Studies
Rindermann, Heiner; Baumeister, Antonia E. E. – International Journal of Testing, 2015
Scholastic tests regard cognitive abilities to be domain-specific competences. However, high correlations between competences indicate either high task similarity or a dependence on common factors. The present rating study examined the validity of 12 Programme for International Student Assessment (PISA) and Third or Trends in International…
Descriptors: Test Validity, Test Interpretation, Competence, Reading Tests
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Halpin, Peter F.; Torrente, Catalina – Society for Research on Educational Effectiveness, 2014
Using reliable and valid measures of students' outcomes which are sensitive to change is critical for obtaining interpretable and therefore useful results from evaluations of school-based interventions. While measurement development for use in experimental evaluations receives a great deal of attention in the U.S., it lags behind in low-income…
Descriptors: Foreign Countries, Outcome Measures, Outcomes of Education, Cluster Grouping
Frame, Laura B.; Vidrine, Stephanie M.; Hinojosa, Ryan – Journal of Psychoeducational Assessment, 2016
The Kaufman Test of Educational Achievement, Third Edition (KTEA-3) is a revised and updated comprehensive academic achievement test (Kaufman & Kaufman, 2014). Authored by Drs. Alan and Nadeen Kaufman and published by Pearson, the KTEA-3 remains an individual achievement test normed for individuals of ages 4 through 25 years, or for those in…
Descriptors: Achievement Tests, Elementary Secondary Education, Test Validity, Test Reliability
Stancavage, Frances B., Ed.; Bohrnstedt, George W., Ed. – American Institutes for Research, 2013
Since its inception more than four decades ago, the National Assessment of Educational Progress (NAEP) has served as a key indicator of what the nation's students know and can do in academic subjects. NAEP assessments provide a mechanism for putting the achievements of students in all states on a common scale; the assessments also serve as…
Descriptors: National Competency Tests, State Standards, Academic Standards, Academic Achievement
Styles, Irene; Wildy, Helen; Pepper, Vivienne; Faulkner, Joanne; Berman, Ye'Elah – International Research in Early Childhood Education, 2014
The assessment of literacy and numeracy skills of students as they enter school for the first time is not yet established nation-wide in Australia. However, a large proportion of primary schools have chosen to assess their starting students on the Performance Indicators in Primary Schools-Baseline Assessment (PIPS-BLA). This series of three…
Descriptors: Foreign Countries, Indigenous Knowledge, Performance Based Assessment, Test Bias
Thissen, David; Norton, Scott – American Institutes for Research, 2013
Development of the Common Core State Standards (CCSS), and the creation of the Smarter Balanced Assessment Consortium (Smarter Balanced) and the Partnership for Assessment of Readiness for College and Careers (PARCC), changes the pattern of accountability testing. These changes raise the question: "How should NAEP's validity and utility be…
Descriptors: National Competency Tests, Psychometrics, State Standards, Academic Standards
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis