NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Park, Yena; Lee, Senyung; Shin, Sun-Young – Language Testing, 2022
Despite consistent calls for authentic stimuli in listening tests for better construct representation, unscripted texts have been rarely adopted in high-stakes listening tests due to perceived inefficiency. This study details how a local academic listening test was developed using authentic unscripted audio-visual texts from the local target…
Descriptors: Listening Comprehension Tests, English for Academic Purposes, Test Construction, Foreign Students
Patrick, Helen; French, Brian F.; Mantzicopoulos, Panayota – Journal of Psychoeducational Assessment, 2020
We evaluated the score stability of the Framework for Teaching (FFT), a prominent observation instrument used for teacher evaluation. Three raters each scored 200 reading and mathematics lessons taught by 20 kindergarten teachers. Using Generalizability theory analyses, we decomposed the FFT's Classroom Environment, Instruction, and Total scores…
Descriptors: Teacher Evaluation, Observation, Scores, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Byram, Jessica N.; Seifert, Mark F.; Brooks, William S.; Fraser-Cotlin, Laura; Thorp, Laura E.; Williams, James M.; Wilson, Adam B. – Anatomical Sciences Education, 2017
With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in…
Descriptors: Anatomy, Science Tests, Test Items, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Riley, Ellyn A. – Communication Disorders Quarterly, 2017
The purpose of this study was to measure speech-language pathologists' (SLPs) perceptions of fatigue in clients with aphasia and identify strategies used to manage client fatigue during speech and language therapy. SLPs completed a short online survey containing a series of questions related to their perceptions of patient fatigue. Of 312…
Descriptors: Speech Language Pathology, Fatigue (Biology), Aphasia, Intervention
Peer reviewed Peer reviewed
Direct linkDirect link
Harrell-Williams, Leigh M.; Lovett, Jennifer N.; Lee, Hollylynne S.; Pierce, Rebecca L.; Lesser, Lawrence M.; Sorto, M. Alejandra – Journal of Psychoeducational Assessment, 2019
Recently adopted state standards for middle grades and high school mathematics content have an increased emphasis on statistical topics. With this change, teacher education programs may need to adapt how they prepare preservice secondary mathematics teachers (PSMTs) to teach statistics and require measures related to statistics teaching to assess…
Descriptors: Program Validation, Scores, Secondary Education, Preservice Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Tahiroglu, Deniz; Moses, Louis J.; Carlson, Stephanie M.; Mahy, Caitlin E. V.; Olofson, Eric L.; Sabbagh, Mark A. – Developmental Psychology, 2014
Children's theory of mind (ToM) is typically measured with laboratory assessments of performance. Although these measures have generated a wealth of informative data concerning developmental progressions in ToM, they may be less useful as the sole source of information about individual differences in ToM and their relation to other facets of…
Descriptors: Measures (Individuals), Theory of Mind, Individual Differences, Parents
Knobloch, Neil A.; Brady, Colleen M.; Orvis, Kathryn S.; Carroll, Natalie J. – Journal of Agricultural Education, 2016
Career development events develop career and life skills in youth, but limited work has been done to assess the motivation of students who participate in these events. The purpose of this study was to validate an instrument developed to measure youth motivation to participate in career development events. An instrument grounded in expectancy-value…
Descriptors: Test Construction, Program Validation, Career Development, Youth Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Ling, Guangming – International Journal of Testing, 2016
To investigate possible iPad related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…
Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers
Peer reviewed Peer reviewed
Direct linkDirect link
Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011
Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…
Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability
Yoon, So Yoon – ProQuest LLC, 2011
Working under classical test theory (CTT) and item response theory (IRT) frameworks, this study investigated psychometric properties of the Revised Purdue Spatial Visualization Tests: Visualization of Rotations (Revised PSVT:R). The original version, the PSVT:R was designed by Guay (1976) to measure spatial visualization ability in…
Descriptors: Undergraduate Students, Test Bias, Guessing (Tests), Construct Validity
Center for Innovation in Assessment (NJ1), 2007
Research was conducted to evaluate how well the "Indiana Reading Assessment--Kindergarten" evaluates various reading skills of kindergarten students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. All correlations were…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 1" evaluates various reading skills of grade one students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. All the correlations were…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 2" evaluates various reading skills of grade two students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. Correlations were either…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Indiana Association of Area Vocational Districts, Inc., Speedway. – 1990
An Indiana project developed a model diagnostic basic skills testing program with statewide application for teachers certified under the Occupational Specialist Rules. Fifteen commercially produced tests were reviewed for adoption. The set of criteria that guided the selection process included technical aspects and application factors. The TABE…
Descriptors: Basic Skills, Beginning Teachers, Diagnostic Tests, Field Tests