Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 10 |
Descriptor
Test Reliability | 53 |
Test Validity | 40 |
Elementary Secondary Education | 27 |
Reliability | 16 |
Student Evaluation | 13 |
Evaluation Methods | 10 |
Higher Education | 10 |
Handicap Identification | 9 |
Screening Tests | 9 |
Adults | 8 |
Validity | 8 |
More ▼ |
Source
Author
Publication Type
Reports - Descriptive | 73 |
Journal Articles | 50 |
Speeches/Meeting Papers | 12 |
Opinion Papers | 4 |
Tests/Questionnaires | 3 |
Information Analyses | 2 |
Reports - Research | 2 |
Books | 1 |
Guides - Non-Classroom | 1 |
Reports - Evaluative | 1 |
Education Level
Early Childhood Education | 3 |
Elementary Education | 2 |
Elementary Secondary Education | 1 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
High Schools | 1 |
Middle Schools | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 73 |
Practitioners | 16 |
Teachers | 5 |
Administrators | 4 |
Policymakers | 4 |
Media Staff | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Mikkel Helding Vembye; James Eric Pustejovsky; Therese Deocampo Pigott – Research Synthesis Methods, 2024
Sample size and statistical power are important factors to consider when planning a research synthesis. Power analysis methods have been developed for fixed effect or random effects models, but until recently these methods were limited to simple data structures with a single, independent effect per study. Recent work has provided power…
Descriptors: Sample Size, Robustness (Statistics), Effect Size, Social Science Research
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Boller, Kimberly; Kisker, Ellen Eliason – Regional Educational Laboratory, 2014
This guide is designed to help researchers make sure that their research reports include enough information about study measures so that readers can assess the quality of the study's methods and results. The guide also provides examples of write-ups about measures and suggests resources for learning more about these topics. The guide assumes…
Descriptors: Research Reports, Research Methodology, Educational Research, Check Lists
Raykov, Tenko – Multivariate Behavioral Research, 2007
A method for point and interval estimation of change in criterion validity of multiple-component measuring instruments as a result of revision is outlined. The procedure is developed within the framework of covariance structure modeling, which complements earlier methods for testing change in composite reliability due to addition or deletion of…
Descriptors: Predictive Validity, Computation, Models, Reliability
Hayduk, Leslie A.; Robinson, Hannah Pazderka; Cummings, Greta G.; Boadu, Kwame; Verbeek, Eric L.; Perks, Thomas A. – Structural Equation Modeling: A Multidisciplinary Journal, 2007
Researchers using structural equation modeling (SEM) aspire to learn about the world by seeking models with causal specifications that match the causal forces extant in the world. This quest for a model matching existing worldly causal forces constitutes an ontology that orients, or perhaps reorients, thinking about measurement validity. This…
Descriptors: Validity, Structural Equation Models, Reliability, Causal Models
Nieminen, Timo A.; Choi, Serene Hyun-Jin – International Journal of Research & Method in Education, 2008
Quantitative behaviour analysis requires the classification of behaviour to produce the basic data. This can be challenging when the theoretical taxonomy does not match observational limitations, or if a theoretical taxonomy is unavailable. Binary keys allow qualitative observation to be used to modify a theoretical taxonomy to produce a practical…
Descriptors: Developmental Disabilities, Behavioral Science Research, Classification, Identification
Flood, Mirjam; Weinstein, Debra; Halle, Tamara; Martin, Laurie; Tout, Kathryn; Wandner, Laura; Vick, Jessica; Sherman, Juli; Hair, Elizabeth – Child Trends, 2007
Quality measures were originally developed for research aimed at describing the settings that children spend time in and identifying the characteristics of these environments that contribute to children's development. They were also developed to guide improvements in practice. Increasingly, however, measures of quality are being used for further…
Descriptors: Validity, Reliability, Child Care, Educational Quality
Beers, Pieter J.; Boshuizen, Henny P. A.; Kirschner, Paul A.; Gijselaers, Wim H. – Learning and Instruction, 2007
CSCL research has given rise to a plethora of analysis methods, all with specific analysis goals, units of analysis, and for specific types of data (chat, threaded discussions, etc.). This article describes some challenges of CSCL-analysis. The development of an analysis method for negotiation processes in multidisciplinary teams serves as an…
Descriptors: Content Analysis, Computer Mediated Communication, Research Methodology, Data Analysis
Guthrie, Abbie C. – 2000
Too many researchers speak of "the reliability of the test," thus indicating their basic misunderstanding of reliability. This paper explains classical reliability and the score features that influence coefficient alpha. It explains when coefficient alpha can be negative, even though it is conceptually a variance-accounted-for statistic.…
Descriptors: Effect Size, Measurement Techniques, Reliability, Scores
Lembke, Erica S.; Stecker, Pamela M. – Center on Instruction, 2007
One of the best methods of formative assessment in academic areas and a method that exemplifies the characteristics of good measures is Curriculum-Based Measurement (CBM; Deno, 1985). Developed at the University of Minnesota in the early 1970's, CBM has been researched in academic areas including mathematics computation, concepts, and…
Descriptors: Curriculum Based Assessment, Formative Evaluation, Mathematics Education, Educational Research

Lundgren, Terry D.; Garrett, Norman A. – Computers in the Schools, 1984
Briefly describes the objectives and design of the MicroRead Index, a microcomputer-based index which utilizes word and sentence length to measure readability of written materials. The program computes a reading level and a consistency index for comparative analysis and determines whether the sample size used was adequate. (MBR)
Descriptors: Computer Software, Literature Reviews, Microcomputers, Programing

Humphrey, Suzanne; Miller, Nancy E. – Journal of the American Society for Information Science, 1987
Describes the National Library of Medicine's (NLM) Indexing Aid Project for conducting research in knowledge representation and indexing for information retrieval, whose goal is to develop interactive knowledge-based systems for computer-assisted indexing of the periodical medical literature. Appendices include background information on NLM…
Descriptors: Expert Systems, Indexing, Information Retrieval, Medicine
Butler, E. Dean – 1984
This paper examines the metatheoretical concepts associated with ethnographic/qualitative educational inquiry and overviews the more commonly utilized research designs, data collection methods, and analytical approaches. The epistemological and ontological assumptions of this newer approach differ greatly from those of the traditional educational…
Descriptors: Cultural Context, Data Collection, Educational Research, Ethnography

Gross, Edward J.; And Others – Research in Developmental Disabilities, 1994
This study describes the development of the Active Treatment Client Rights checklist (ATCR), which was designed to facilitate the assessment, monitoring, and implementation of readily observable client active treatment services for adults with developmental disabilities. The ATCR was found to be highly reliable, valid, and useful in enhancing…
Descriptors: Adults, Check Lists, Developmental Disabilities, Evaluation Methods