NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Szczytko, Rachel; Stevenson, Kathryn; Peterson, M. Nils; Nietfeld, John; Strnad, Renee L. – Environmental Education Research, 2019
Environmental education (EE) practitioners struggle to consistently and rigorously evaluate their programs, particularly when little time is available for evaluation. Since environmental literacy (EL) is the goal of environmental education, a very short EL instrument -- amenable to use when longer tests are not practical for practitioners -- would…
Descriptors: Adolescents, Environmental Education, Literacy, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Pennell, Adam; Patey, Matthew; Fisher, Jenna; Brian, Ali – Measurement in Physical Education and Exercise Science, 2022
Falls are a significant medical and economical concern worldwide. Younger individuals with visual impairment (VI) may be more susceptible to falling and fall-related injuries when compared to peers without a VI. Self-perceived balance confidence is a psychological construct that may predict and/or mediate fall- or other health-related outcomes in…
Descriptors: Psychomotor Skills, Self Efficacy, Accidents, Injuries
Peer reviewed Peer reviewed
Direct linkDirect link
Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015
This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…
Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Harrell-Williams, Leigh M.; Lovett, Jennifer N.; Lee, Hollylynne S.; Pierce, Rebecca L.; Lesser, Lawrence M.; Sorto, M. Alejandra – Journal of Psychoeducational Assessment, 2019
Recently adopted state standards for middle grades and high school mathematics content have an increased emphasis on statistical topics. With this change, teacher education programs may need to adapt how they prepare preservice secondary mathematics teachers (PSMTs) to teach statistics and require measures related to statistics teaching to assess…
Descriptors: Program Validation, Scores, Secondary Education, Preservice Teachers
Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2015
Loglinear smoothing (LLS) estimates the latent trait distribution while making fewer assumptions about its form and maintaining parsimony, thus leading to more precise item response theory (IRT) item parameter estimates than standard marginal maximum likelihood (MML). This article provides the expectation-maximization algorithm for MML estimation…
Descriptors: Item Response Theory, Maximum Likelihood Statistics, Computation, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Shamir, Haya – Journal of Educational Multimedia and Hypermedia, 2018
Assessing students' emerging literacy skills is crucial for identifying areas where a child may be falling behind and can lead directly to an increased chance of reading success. The Waterford Assessment of Core Skills (WACS), a computerized adaptive test of early literacy for students in prekindergarten through 2nd grade, addresses this need.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Reading Tests, Preschool Children
Peer reviewed Peer reviewed
Direct linkDirect link
Williamson, Gary L. – Cogent Education, 2018
Individual growth curves yield insights about growth that are not available from any other methodology; and developmental scales based on conjoint measurement models provide unique interpretive advantages for investigations of academic growth. The advantages are apparent when: (1) 15 consecutive statewide reading growth curves are annotated with…
Descriptors: Reading Achievement, Mathematics Achievement, Psychometrics, Educational Innovation
Royal, Kenneth D.; Gonzalez, Liara M. – Online Submission, 2016
The purpose of this study was to evaluate the psychometric properties of a newly developed instrument intended to measure faculty competence as it pertains to their role as advisors, particularly in medical and professional programs. A total of 166 students completed the Faculty Advisor's Skills and Behaviors Inventory (FASBI). The psychometric…
Descriptors: Psychometrics, Medical Students, College Faculty, Teacher Competencies
Peer reviewed Peer reviewed
Direct linkDirect link
Willoughby, Michael T.; Wirth, R. J.; Blair, Clancy B. – Psychological Assessment, 2012
This study tested the longitudinal measurement invariance and developmental changes of a newly developed battery of executive function (EF) tasks for use in early childhood. The battery was administered in the Family Life Project--a prospective longitudinal study (N = 1,292) of families who were oversampled from low-income and African American…
Descriptors: Executive Function, Young Children, Child Development, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Wu, Johnny; King, Kevin M.; Witkiewitz, Katie; Racz, Sarah Jensen; McMahon, Robert J. – Psychological Assessment, 2012
Research has shown that boys display higher levels of childhood conduct problems than girls, and Black children display higher levels than White children, but few studies have tested for scalar equivalence of conduct problems across gender and race. The authors conducted a 2-parameter item response theory (IRT) model to examine item…
Descriptors: Item Analysis, Test Bias, Test Items, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dadey, Nathan; Briggs, Derek C. – Practical Assessment, Research & Evaluation, 2012
A vertical scale, in principle, provides a common metric across tests with differing difficulties (e.g., spanning multiple grades) so that statements of "absolute" growth can be made. This paper compares 16 states' 2007-2008 effect size growth trends on vertically scaled reading and math assessments across grades 3 to 8. Two patterns…
Descriptors: Meta Analysis, Scaling, Effect Size, Reading Tests
Sgammato, Adrienne N. – ProQuest LLC, 2009
This study examined the applicability of a relatively new unidimensional, unfolding item response theory (IRT) model called the generalized graded unfolding model (GGUM; Roberts, Donoghue, & Laughlin, 2000). A total of four scaling methods were applied. Two commonly used cumulative IRT models for polytomous data, the Partial Credit Model and…
Descriptors: Item Response Theory, Scaling, Models, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Woods, Carol M. – Multivariate Behavioral Research, 2009
Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item…
Descriptors: Test Bias, Structural Equation Models, Item Response Theory, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Goldhaber, Dan; Hansen, Michael – American Educational Research Journal, 2010
Virtually all states require teachers to undergo licensure testing before participation in the public school labor market. This article analyzes the information these tests provide about teacher effectiveness. The authors find that licensure tests have different predicative validity for student achievement by teacher race. They also find that…
Descriptors: Teacher Effectiveness, Teacher Competency Testing, Academic Achievement, Licensing Examinations (Professions)
Peer reviewed Peer reviewed
Direct linkDirect link
Fox, J.-P.; Wyrick, Cheryl – Journal of Educational and Behavioral Statistics, 2008
The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…
Descriptors: Item Response Theory, Models, Markov Processes, Monte Carlo Methods