Publication Date
In 2025 | 6 |
Since 2024 | 59 |
Since 2021 (last 5 years) | 268 |
Since 2016 (last 10 years) | 781 |
Since 2006 (last 20 years) | 1698 |
Descriptor
Scores | 2324 |
Test Reliability | 1083 |
Reliability | 1051 |
Test Validity | 596 |
Foreign Countries | 572 |
Correlation | 529 |
Validity | 456 |
Psychometrics | 436 |
Measures (Individuals) | 411 |
Factor Analysis | 392 |
Statistical Analysis | 329 |
More ▼ |
Source
Author
Thompson, Bruce | 21 |
Erford, Bradley T. | 13 |
Henson, Robin K. | 11 |
Zimmerman, Donald W. | 11 |
Haberman, Shelby J. | 10 |
Worrell, Frank C. | 10 |
Lee, Yong-Won | 9 |
Sinharay, Sandip | 9 |
Gill, Brian | 8 |
Petscher, Yaacov | 8 |
Wainer, Howard | 8 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 33 |
Practitioners | 21 |
Teachers | 9 |
Administrators | 4 |
Counselors | 2 |
Parents | 2 |
Policymakers | 2 |
Community | 1 |
Students | 1 |
Location
Turkey | 88 |
Canada | 42 |
China | 37 |
United States | 35 |
Australia | 31 |
Florida | 24 |
Netherlands | 24 |
California | 21 |
Spain | 21 |
United Kingdom | 21 |
United Kingdom (England) | 21 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 1 |
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018
The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…
Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures
Barker, Brittan A.; Jones, Hannah D.; Daquanno, Chelsi G. – Volta Review, 2018
The Infant-Toddler Meaningful Auditory Integration Scale (IT-MAIS) is used to assess auditory development in young children with hearing loss. Despite being widely used, previous research showed that its psychometric properties are not ideal. As a first step toward psychometric advancements of the IT-MAIS, this study aimed to create videos with…
Descriptors: Video Technology, Infants, Toddlers, Auditory Perception
Kelly, William E.; Daughtry, Don – College Student Journal, 2018
This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…
Descriptors: Undergraduate Students, Self Concept Measures, Test Length, Scores
Plieninger, Hansjörg – Educational and Psychological Measurement, 2017
Even though there is an increasing interest in response styles, the field lacks a systematic investigation of the bias that response styles potentially cause. Therefore, a simulation was carried out to study this phenomenon with a focus on applied settings (reliability, validity, scale scores). The influence of acquiescence and extreme response…
Descriptors: Response Style (Tests), Test Bias, Item Response Theory, Correlation
Carli, Marta; Lippiello, Stefania; Pantano, Ornella; Perona, Mario; Tormen, Giuseppe – Physical Review Physics Education Research, 2020
In this article, we discuss the development and the administration of a multiple-choice test, which we named "Test of Calculus and Vectors in Mathematics and Physics" (TCV-MP), aimed at comparing students' ability to answer questions on derivatives, integrals, and vectors in a purely mathematical context and in the context of physics.…
Descriptors: Mathematics Tests, Science Tests, Multiple Choice Tests, Calculus
Li, Yixian; Babcock, Sarah E.; Stewart, Shannon L.; Hirdes, John P.; Schwean, Vicki L. – Child & Youth Care Forum, 2021
Background: While empirical testing of many of the interRAI Child and Youth Mental Health (ChYMH) assessment scales have been found to be reliable and valid across both child and adult samples, other scales have yet to be psychometrically evaluated within child populations. Objective: The current study evaluates the psychometric properties of the…
Descriptors: Psychometrics, Depression (Psychology), Measures (Individuals), Severity (of Disability)
Wu, Shu-Ling; Tio, Yee Pin; Ortega, Lourdes – Studies in Second Language Acquisition, 2022
Elicited imitation (EI), a short-cut measure of global proficiency in second language (L2) research, requires participants to listen to sentences and repeat them as closely as possible. To support instrument sharing and assessment of L2 proficiency for longitudinal and crosslinguistic research, we created a parallel form of an EI task (EIT) for L2…
Descriptors: Imitation, Second Language Learning, Second Language Instruction, Language Proficiency
de Mendonça Filho, Euclides José; da Silva, Mônia Aparecida; Koziol, Natalie; Hawley, Leslie; Bandeira, Denise Ruschel – European Journal of Developmental Psychology, 2022
Substantial evidence endorses the early assessment of cognitive development to promote children's developmental health and well-being. Especially in the Brazilian context, there is a paucity of standardized screening and assessment tools with normative data to evaluate young children. This study provided initial reliability and validity evidence…
Descriptors: Mother Attitudes, Cognitive Development, Item Analysis, Reliability
Abdelsamea, Mohammed Abdelhady; Bart, William – International Journal of Teaching and Learning in Higher Education, 2019
Although there is a robust body of research that has addressed the psychometric properties of the Learning and Study Strategies Inventory (LASSI) in different populations, no study has yet investigated the factor structure and congeneric reliability of the Arabic version of the Learning and Study Strategies Inventory, 2nd edition (LASSI-II) among…
Descriptors: Semitic Languages, Undergraduate Students, Factor Analysis, Factor Structure
Briggs, Derek C.; Alzen, Jessica L. – Educational and Psychological Measurement, 2019
Observation protocol scores are commonly used as status measures to support inferences about teacher practices. When multiple observations are collected for the same teacher over the course of a year, some portion of a teacher's score on each occasion may be attributable to the rater, lesson, and the time of year of the observation. All three of…
Descriptors: Observation, Inferences, Generalizability Theory, Scores
Chalmers, Kerry A.; Freeman, Emily E. – Journal of Psychoeducational Assessment, 2019
Low working memory (WM) capacity has been linked to poor academic performance and problem behavior. Availability of easy-to-administer screening tests would facilitate early detection of WM deficits. This study investigated the psychometric properties of the Working Memory Power Test for Children (WMPT) in 170 Australian schoolchildren (8½-11…
Descriptors: Short Term Memory, Academic Achievement, Behavior Problems, Correlation
Lowe, Patricia A. – Canadian Journal of School Psychology, 2018
The psychometric properties of a new multidimensional measure of test anxiety, the Test Anxiety Measure for College Students (TAM-C), based on theory and current research were examined in a sample of 312 Canadian college students online. The TAM-C consists of a Facilitating Anxiety scale and five test anxiety (Cognitive Interference, Physiological…
Descriptors: Foreign Countries, Test Anxiety, Measures (Individuals), College Students
Chow, Peter; Chalmers, R. Philip; Flynn, Deborah M.; McLandress, Adam J.; Steadman, Victoria G. L. – College Student Journal, 2018
With the intent of amending the 21-item BDI-II to improve its reliability and validity when administering the scale to nonclinical populations, a survey package consisting of 19 positive items with semantically reflected response options to mirror the negative scenario options in the original BDI-II (excluding items 16 and 18) was created. These…
Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Test Validity
Pivovarova, Margarita; Amrein-Beardsley, Audrey – Educational Assessment, 2018
While states are no longer required to set up teacher evaluation systems based in significant part on student test scores, quite a few continue to use value-added (VAMs) or student growth percentile (SGP) models for that purpose. In this study, we analyzed three years of teacher data to illustrate the performance of teachers' median growth…
Descriptors: Growth Models, Teacher Evaluation, Value Added Models, Reliability
Uslu, Baris – Higher Education: The International Journal of Higher Education Research, 2020
Despite some theoretical and technical criticism, scholars largely acknowledge the influence of universities' ranking positions on the preferences of fund providers, academics and students, nationally and internationally. Considering their noticeable contribution to university rankings, prominent indicators can guide university leaders to develop…
Descriptors: Universities, Institutional Evaluation, Reputation, Financial Support