Kraft, Robert E. – 1991
A study was conducted to determine how fitness scores may be affected by varying the method of administration. Five fitness tests were administered to 426 children in grades 1-3. The children were randomly assigned to one of three treatment groups: traditional instruction, competition, and encouragement. The testing team administered five fitness…
Descriptors: Competition, Motivation Techniques, Physical Education, Physical Fitness
Bruce, Bertram C.; And Others – 1993
One of 10 reports commissioned by the National Academy of Education, this report investigates topics related to an assessment program piloted by the National Assessment of Educational Progress (NAEP) designed to support state-by-state and state-to-nation comparisons of student performance in reading. The report focuses on three issues: (1) the…
Descriptors: Content Validity, Elementary Secondary Education, Reading Achievement, Reading Research
Dings, Jonathan; Childs, Ruth; Kingston, Neal – 2002
This study examined matrix sampling of test content, the practice of giving various students in the same school differing test questions. This often-used approach to large-scale assessment allows for relatively broad coverage of the curriculum, but with fewer comparable individual student scores than a conventional test. One can be sure that…
Descriptors: Junior High School Students, Junior High Schools, Performance Based Assessment, Reliability
Milewski, Glenn B.; Patelis, Thanos – 2001
The 1999 Advanced Placement® (AP®) Psychology Examination contains items drawn from 13 factors related to the study of psychology. This factor structure had not been explored previously. This study focuses on evaluating the fit of confirmatory factor analysis (CFA) models to examination items. Since examination items were dichotomous and…
Descriptors: Advanced Placement, Factor Structure, Goodness of Fit, High School Students

Hollwitz, John C.; Pawlowski, Donna R. – Journal of Business Communication, 1997
Describes the development and testing of a structured ethical integrity interview that can be used in the selection of applicants. Examines directions for future testing and study. (SR)
Descriptors: Communication Research, Employment Interviews, Ethics, Higher Education

Brady, David; Hostetter, Carol; Milkie, Melissa A.; Pescosolido, Bernice A. – Teaching Sociology, 2001
Describes the institutional practices related to qualifying examinations in U.S. sociology graduate departments (n=178). Indicates that Ph.D.-granting and M.A.-granting programs differ in the structure of their examinations, while the structures within departments are consistent. Includes references. (CMK)
Descriptors: Comparative Analysis, Departments, Doctoral Programs, Educational Research
What's on the Test? An Analytical Framework and Findings from an Examination of Teachers' Math Tests
Archbald, Douglas A.; Grant, Theresa J. – Educational Assessment, 2000
Reports results from research that developed and applied a content analysis instrument to measure the content of 12 middle school mathematics teachers' tests and quizzes. Results shed light on content and methods and on "enacted" curriculum. One finding is the large preponderance of single-path/single-solution problems related to number…
Descriptors: Content Analysis, Mathematics Instruction, Mathematics Teachers, Middle School Teachers

Kostin, Irene; Freedle, Roy – Language Testing, 1999
A study investigated whether examinees taking the Test of English as a Foreign Language (TOEFL) attended to the text passages in the "minitalks" when answering the multiple-choice items (n=337) testing listening comprehension. Results support the construct validity of the minitalks, and also allow comparison between reading and listening…
Descriptors: Construct Validity, English (Second Language), Language Tests, Listening Comprehension
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills

Schraw, Gregory – Journal of Experimental Education, 1997
The basis of students' confidence in their answers to test items was studied with 95 undergraduates. Results support the domain-general hypothesis that predicts that confidence judgments will be related to performance on a particular test and also to confidence judgments and performance on unrelated tests. (SLD)
Descriptors: Higher Education, Metacognition, Performance Factors, Scores

Quattlebaum, Judith A. – Language Quarterly, 1994
Argues that formal English is a prestige dialect containing select constructions so unnatural as to be outside the domain of normal language acquisition. Among these are nominative pronouns used as conjoined subjects. Prestige usage is unavailable for consistent use. While formal education may have some effect on normal usage, that effect is…
Descriptors: Case (Grammar), English, Language Patterns, Language Usage

Stecher, Brian M.; Klein, Stephen P.; Solano-Flores, Guillermo; McCaffrey, Dan; Robyn, Abby; Shavelson, Richard J.; Haertel, Edward – Applied Measurement in Education, 2000
Studied content domain, format, and level of inquiry as factors contributing to the large variation in student performance across open-ended measures. Results for more than 1,200 eighth graders do not support the hypothesis that tasks similar in content, format, and level of inquiry would correlate higher with each other than with measures…
Descriptors: Correlation, Inquiry, Junior High School Students, Junior High Schools

Straus, Murray A.; Hamby, Sherry L.; Finkelhor, David; Moore, David W.; Runyan, Desmond – Child Abuse & Neglect: The International Journal, 1998
A study of 1,000 children examined the effectiveness of the Parent-Child Conflict Tactics Scales (CTSPC) in measuring parental psychological and physical maltreatment of children, as well as nonviolent modes of discipline. The CTSPC was found to be better suited to measuring child maltreatment than the original Conflict Tactics Scales. (Author/CR)
Descriptors: Child Abuse, Child Neglect, Discipline, Evaluation Methods
Robin, Frédéric; van der Linden, Wim J.; Eignor, Daniel R.; Steffen, Manfred; Stocking, Martha L. – ETS Research Report Series, 2005
The relatively new shadow test approach (STA) to computerized adaptive testing (CAT) proposed by Wim van der Linden is a potentially attractive alternative to the weighted deviation algorithm (WDA) implemented at ETS. However, it has not been evaluated under testing conditions representative of current ETS testing programs. Of interest was whether…
Descriptors: Test Construction, Computer Assisted Testing, Simulation, Evaluation Methods
Zhang, Liru – 2000
A study investigated possible reasons for the low performance in 2000 on the writing portion of the Delaware Student Testing Program (DSTP) by students, especially in grades 3 and 5. The study also investigated ways to improve classroom instruction in writing. A panel of teachers reviewed the anchor papers and the process of testing. Panel members…
Descriptors: Elementary Secondary Education, Student Evaluation, Test Construction, Test Content