Publication Date
In 2025 | 1 |
Since 2024 | 18 |
Since 2021 (last 5 years) | 47 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 204 |
Descriptor
Test Content | 308 |
Test Items | 115 |
Foreign Countries | 96 |
Test Construction | 78 |
Test Validity | 65 |
Scores | 47 |
Language Tests | 45 |
Second Language Learning | 42 |
Student Evaluation | 42 |
Test Format | 40 |
Comparative Analysis | 38 |
More ▼ |
Source
Author
Sireci, Stephen G. | 4 |
Solano-Flores, Guillermo | 3 |
Steffen, Manfred | 3 |
Abedi, Jamal | 2 |
Agarwal, Pooja K. | 2 |
Bauer, Scott C. | 2 |
Binkley, Marilyn | 2 |
Borman, Walter C. | 2 |
Chang, Hua-Hua | 2 |
Cox, Shawna | 2 |
Dorans, Neil J. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Teachers | 7 |
Practitioners | 5 |
Researchers | 2 |
Administrators | 1 |
Location
Australia | 8 |
Canada | 8 |
Turkey | 8 |
California | 7 |
Europe | 6 |
China | 5 |
United States | 5 |
Germany | 4 |
Hong Kong | 4 |
Iran | 4 |
Japan | 4 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating

Hamilton, Laura S. – Educational Evaluation and Policy Analysis, 1998
Gender differences on the National Education Longitudinal Study of 1988 science tests were explored through statistical analyses and interviews with 25 high school students. Results show the importance of studying the validity of the outcome measure and suggest that conclusions about group differences and correlates of achievement depend on the…
Descriptors: Achievement Tests, Correlation, High School Students, High Schools

Raphael, Dennis; Brown, Ivan; Renwick, Rebecca – International Journal of Disability, Development and Education, 1999
A study examined the reliability and validity of the Quality of Life Instrument Package using data from 500 persons with developmental disabilities in Ontario. Data indicate that most of the instruments found in the package met acceptable psychometric standards. Appropriate uses for the full and short version are discussed. (Author/CR)
Descriptors: Adults, Evaluation Methods, Foreign Countries, Mental Retardation
Schierloh, Jane M. – 1993
A qualitative study investigated the test-taking behaviors, knowledge, and perceptions of 20 urban, adult basic education students reading at third to fifth grade equivalency levels. The entire reading comprehension subtest of the Test of Adult Basic Education, levels E and M, was administered under standardized conditions. A combination of…
Descriptors: Adult Basic Education, Construct Validity, Reading Comprehension, Scores
Dorans, Neil J. – College Entrance Examination Board, 2000
Distinctions were made between three classes of statistical linkage: equivalence, concordance, and prediction. These distinctions were based on rational content considerations and empirical statistical relationships. A large database involving SAT I and ACT scores was used to determine which type of linkage was best suited for different scores and…
Descriptors: Statistical Analysis, Prediction, Scores, Standardized Tests

Kyriacou, Chris; Wilkins, Michael – Educational Research, 1993
Survey of 43 British secondary teachers showed that the National Curriculum positively influenced teaching methods, encouraging greater variety in delivery and more active learning. Concern was expressed that these practices may be hindered if national assessment tests are narrow in nature and content. (SK)
Descriptors: British National Curriculum, Classroom Techniques, Educational Assessment, Foreign Countries

Sabol, F. Robert – Visual Arts Research, 1998
Observes that emphasis on educational accountability in visual arts has led to the acceptance of various assessment tools. Describes the format and content of existing state visual arts achievement tests and clarifies what is being done nationally in assessment through the use of state art achievement tests. (DSK)
Descriptors: Academic Standards, Art Education, Educational Policy, Elementary Secondary Education

Walters, Amy S.; Merrell, Kenneth W. – Psychology in the Schools, 1995
Examined administration method (standard written administration versus oral administration by an examiner) as a variable influencing children's self-report test scores. Subjects included 139 students in grades 3-6, randomly assigned to an administration condition. Results suggest that method of administration did not affect test performance. (JBJ)
Descriptors: Elementary Education, Grade 3, Grade 4, Grade 5

Chang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges

Clapham, Caroline – System, 2000
Discusses research into the effect of background knowledge on English for Academic Purposes (EAP) tests and discusses EAP tests in which the content of at least some of the test components is related to students' fields of academic study. Suggests that for international EAP tests, English for specific academic purposes testing be abandoned.…
Descriptors: English for Academic Purposes, Higher Education, Language Aptitude, Language Research

Papajohn, Dean – Language Testing, 1999
This study investigated topic features and the effect of topic variation on performance in a test designed to assess the language skills of international teaching assistants in chemistry. Results suggest a relationship between topic of input (as defined by the topic features of concepts, math, and calculations) and test scores. (Author/MSE)
Descriptors: Chemistry, Classroom Communication, English (Second Language), Foreign Students
House, Ernest R.; Lawrence, Nancy – 1990
This content assessment project is designed to determine what social studies content should be tested on national standardized tests and how that content should be defined. Sixteen historians, political scientists, and social studies educators were interviewed to identify key concepts. In a second phase, the cultural literacy rationale for content…
Descriptors: Content Analysis, High Schools, History, Interviews
Ross, Steven; Hua, Te-Fang – 1994
A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…
Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education
Gafni, Naomi – 1991
Items in the verbal (Hebrew and English) sections of the Psychometric Entrance Test (PET) administered for university admission in Israel were studied for differential item functioning (DIF) between the sexes. Analyses were conducted for 4,354 males and 4,901 females taking Form 3 of the PET in April 1984, and 3,786 males and 3,815 females taking…
Descriptors: College Entrance Examinations, Comparative Testing, Foreign Countries, Higher Education
Sireci, Stephen G.; And Others – 1990
Although some researchers have argued against use of the term "content validity," the ability of a test item to adequately represent the domain of knowledge tested continues to be an issue of paramount importance in test construction. The present paper reviews previous analyses of test content and proposes a new empirical method for…
Descriptors: Cluster Analysis, Content Analysis, Content Validity, Evaluators
Buckendahl, Chad W.; Plake, Barbara S.; Impara, James C.; Irwin, Patrick M. – 2000
Test publishers have promoted their commercially available, norm-referenced achievement tests as viable solutions to assessment challenges faced by states. They argue that their tests are developed professionally, and, therefore, possess sound psychometric properties not often found in state-specific efforts. This study compared judgments from two…
Descriptors: Achievement Tests, Elementary Secondary Education, Norm Referenced Tests, Standardized Tests