Rehfeld, David M.; Padgett, R. Noah – Journal of Psychoeducational Assessment, 2019
This article presents a review of the Comprehensive Assessment of Spoken Language--Second Edition (CASL-2), in which reliability, utility, and validity are analyzed and discussed. Some limited recommendations for practice are made based on a review of the information provided by the publisher for clinicians.
Descriptors: Oral Language, Language Tests, Receptive Language, Expressive Language
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Welch, Catherine J.; Dunbar, Stephen B. – Educational Measurement: Issues and Practice, 2020
The use of assessment results to inform school accountability relies on the assumption that the test design appropriately represents the content and cognitive emphasis reflected in the state's standards. Since the passage of the Every Student Succeeds Act and the certification of accountability assessments through federal peer review practices,…
Descriptors: Accountability, Test Construction, State Standards, Content Validity
Oliveri, María Elena; Nastal, Jessica; Slomp, David – ETS Research Report Series, 2020
This report discusses frameworks and assessment development approaches to consider fairness, opportunity to learn, and consequences of test use in the design and use of assessments administered to diverse populations. Examples include the integrated design and appraisal framework and the sociocognitively based evidence-centered design approach.…
Descriptors: Culture Fair Tests, Guidelines, Test Use, Test Construction
Leonard, Jack – Education Policy Analysis Archives, 2018
This paper introduces the new Massachusetts Performance Assessment for Leaders (PAL) and uses critical policy analysis to re-examine the validity evidence (using the 2014 Standards for Educational and Psychological Testing and a theory of multicultural validity) for the use and interpretation of the PAL in regards to emerging school leadership.…
Descriptors: Performance Based Assessment, Test Validity, High Stakes Tests, School Administration
Förster, Manuel; Happ, Roland; Molerov, Dimitar – Journal of Economic Education, 2017
In this article, the authors present the adaptation and validation processes conducted to render the American "Test of Financial Literacy" (TFL) suitable for use in Germany (TFL-G). First, they outline the translation procedure followed and the various cultural adjustments made in line with international standards. Next, they present…
Descriptors: Money Management, Tests, Scores, Test Content
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Smith, Michael K. – Phi Delta Kappan, 2010
A national test can be designed for everyone--students, workers, etc.--that would measure their achievement in mathematics and other subjects and provide a score, normed at various levels from preschool to graduate school. The Internet and computer technology would enable both widespread administration of the test and access to scores that can be…
Descriptors: Test Use, Mathematics Achievement, Test Construction, National Competency Tests
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items

Eignor, Daniel R. – Journal of Educational Measurement, 1997
The authors of the "Guidelines," a task force of eight, intend to present an organized list of features to be considered in reporting or evaluating computerized-adaptive assessments. Apart from a few weaknesses, the book is a useful and complete document that will be very helpful to test developers. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Guidelines

Tienken, Christopher; Wilson, Michael – Practical Assessment, Research & Evaluation, 2001
Describes a program used by two New Jersey educators to help teachers understand and use their state's standards and test specifications to improve classroom instruction and raise achievement. Suggests it is important for teachers to understand the entirety of each subject and where state test content fits within each area so that they can align…
Descriptors: Academic Achievement, Administrators, Instructional Improvement, State Programs

Raphael, Dennis; Brown, Ivan; Renwick, Rebecca – International Journal of Disability, Development and Education, 1999
A study examined the reliability and validity of the Quality of Life Instrument Package using data from 500 persons with developmental disabilities in Ontario. Data indicate that most of the instruments found in the package met acceptable psychometric standards. Appropriate uses for the full and short version are discussed. (Author/CR)
Descriptors: Adults, Evaluation Methods, Foreign Countries, Mental Retardation

Haney, Walt; Fowler, Clarke; Wheelock, Anne; Bebell, Damian; Malec, Nicole – Education Policy Analysis Archives, 1999
Using data from state and academic reports, an independent committee of researchers has evaluated the Massachusetts Teacher Tests. Scores are found to be highly unreliable, and the tests are found to contain questionable content. Suspending use of the tests is recommended. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation

Wainer, Howard – Education Policy Analysis Archives, 1999
The critique of the Massachusetts Teacher Tests by W. Haney and others points out some flaws in the tests but ignores the fact that the tests provide some useful information to guide teacher selection decisions. Calls for additional study of these teacher evaluation instruments. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation