NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 2 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shermis, Mark D. – Educational Assessment, 2015
This study compared short-form constructed responses evaluated by both human raters and machine scoring algorithms. The context was a public competition on which both public competitors and commercial vendors vied to develop machine scoring algorithms that would match or exceed the performance of operational human raters in a summative high-stakes…
Descriptors: Test Scoring Machines, Responses, Interrater Reliability, High Stakes Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Shermis, Mark D.; Long, Susanne K. – Journal of Psychoeducational Assessment, 2009
This study investigated the convergent and discriminant validity of the high-stakes Florida Comprehensive Assessment Test (FCAT) in both reading and writing at grade levels 4, 8, and 10. The data from the 2006 FCAT administration were analyzed via traditional multitrait-multimethod (MTMM) analysis to identify the factor structure and structural…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Writing Tests, Validity