Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 5 |
Descriptor
Test Reliability | 51 |
Test Validity | 31 |
Test Construction | 19 |
Reliability | 18 |
Validity | 13 |
Evaluation Methods | 10 |
Foreign Countries | 9 |
Elementary Secondary Education | 8 |
Higher Education | 8 |
Scoring | 8 |
Psychometrics | 7 |
More ▼ |
Source
Author
Roessler, Richard | 2 |
Achenbach, Thomas M. | 1 |
Anastasi, Anne | 1 |
Andrews, Jac | 1 |
Arslan, Burcu | 1 |
Ashmore, Robert J. | 1 |
Bachman, Lyle F. | 1 |
Becker-Schutte, Ann M. | 1 |
Bernknopf, Stan | 1 |
Bijani, Houman | 1 |
Black, Paul | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Adult Basic Education | 1 |
Adult Education | 1 |
Early Childhood Education | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Audience
Practitioners | 7 |
Researchers | 4 |
Counselors | 1 |
Location
Canada | 5 |
Australia | 1 |
Austria | 1 |
Belgium | 1 |
Chile | 1 |
China | 1 |
Cyprus | 1 |
Czech Republic | 1 |
Denmark | 1 |
Estonia | 1 |
France | 1 |
More ▼ |
Laws, Policies, & Programs
United States Constitution | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Rafael Burgueño; Ángel Abós; Javier Sevil-Serrano; Leen Haerens; Katrien De Cocker; Luis García-González – Measurement in Physical Education and Exercise Science, 2024
Building upon self-determination theory and the circumplex approach, the objective of this study was to adapt the Situations-in-School-Physical Education (SIS-PE) questionnaire and to gather validity and reliability evidence in the Spanish PE context. Three samples of 1441 students (46.43% girls), 473 in-service teachers (35.73% women), and 654…
Descriptors: Physical Education, Physical Education Teachers, Preservice Teachers, Teaching Styles
Finn, Bridgid; Wendler, Cathy; Ricker-Pedley, Kathryn L.; Arslan, Burcu – ETS Research Report Series, 2018
This report investigates whether the time between scoring sessions has an influence on operational and nonoperational scoring accuracy. The study evaluates raters' scoring accuracy on constructed-response essay responses for the "GRE"® General Test. Binomial linear mixed-effect models are presented that evaluate how the effect of various…
Descriptors: Intervals, Scoring, Accuracy, Essay Tests
Wu, Zhongling; Hu, Bi Ying; Fan, Xitao – Journal of Psychoeducational Assessment, 2019
This study investigated the cross-cultural validity of the Preschool Learning Behavior Scale (PLBS) in the Chinese cultural context. Multiple approaches were used for this purpose, including exploratory factor analysis, confirmatory factor analysis, criterion-related validity evidence, and internal consistency reliability estimates. The findings…
Descriptors: Test Validity, Cultural Relevance, Test Reliability, Factor Structure
Bijani, Houman – Cogent Education, 2019
An Oral Proficiency Interview (OPI) may be evaluated either during the interview procedure (direct-method) or from a tape-made oral interaction (semi-direct method). Such variety in methods of assessment can influence test takers' scores extensively. However, it is not conclusive whether such differences are due to test format or something else.…
Descriptors: Program Effectiveness, Oral Language, Language Proficiency, Language Tests
OECD Publishing, 2013
The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…
Descriptors: International Assessment, Adults, Skills, Test Construction

Goodman, Gail S.; And Others – Journal of Social Issues, 1984
Reviews research on juror, witness, and courtroom factors that influence a child's credibility on the witness stand. Presents results of studies of juror reactions to child witnesses. Observes that the influence of children's testimony may be great, but also that corroborating evidence may significantly determine the influence of children's…
Descriptors: Children, Court Litigation, Reliability

Jobes, David A.; And Others – Suicide and Life-Threatening Behavior, 1987
Notes that, while researchers seek accurate causative factors for suicidal behavior, validity and reliability of certification of suicide deaths by coroners and medical examiners have been questioned. Provides overview of existing vital statistics registry system, proposes innovations that could improve quality of officially reported suicide…
Descriptors: Death, Recordkeeping, Reliability, Statistics

Cyr, J. J.; Brooker, Barry H. – Journal of Consulting and Clinical Psychology, 1984
Considers both validity and reliability simultaneously in selecting the best short forms (SFs) of the Wechsler Adult Intelligence Scale-Revised (WAIS-R). Results indicate that incorporating reliability as a criterion has a dramatic impact on the obtained best SFs. (LLL)
Descriptors: Test Reliability, Test Selection, Test Validity

Silverstein, A. B. – Journal of Clinical Psychology, 1985
Reports the validities and reliabilities of two short forms of the Wechsler Adult Intelligence Scale (Revised) (Vocabulary and Block Design, and Arithmetic and Picture Arrangement) for each of nine age groups, together with standard errors of estimate and measurement. Results support the use of these forms for their intended purpose. (BH)
Descriptors: Age Differences, Test Reliability, Test Validity

Feingold, Alan – Journal of Clinical Psychology, 1984
Reports reliability data for Wechsler Subtest comparisons to supplement the data in the Wechsler Intelligence Scale for Children-Revised and Wechsler Adult Intelligence Scale-Revised manuals. Results indicated that the reliabilities of the differences between Wechsler Subtest scores are low enough to warrant the exercise of caution in interpreting…
Descriptors: Intelligence Tests, Scores, Test Manuals, Test Reliability

Sackett, Paul R.; Harris, Michael M. – Personnel Psychology, 1984
Describes paper and pencil predictions of employee theft and examines studies of validity, reliability, and adverse impact of these tests. Results showed consistently positive correlations, but identified a variety of methodological differences which make the direct comparison of test validities suspect. (LLL)
Descriptors: Employees, Honesty, Predictor Variables, Test Reliability

Anastasi, Anne – Journal of Counseling & Development, 1985
Describes the role of information on score reliabilities, significance of score differences, intercorrelations of scores, and differential validity of score patterns on the interpretation of results from multiscore batteries. (Author)
Descriptors: Psychological Testing, Scoring, Test Interpretation, Test Reliability

Gibbons, Jean D.; And Others – Psychometrika, 1979
On a multiple-choice test in which each item has k alternative responses, the test taker is permitted to choose any subset which he believes contains the one correct answer. A scoring system is devised. (Author/CTM)
Descriptors: Confidence Testing, Efficiency, Multiple Choice Tests, Scoring

O'Carroll, Patrick W. – Suicide and Life-Threatening Behavior, 1989
Briefly outlines problems associated with definition and official certification of suicide and reviews literature pertaining to validity and reliability of suicide statistics. Considers process of suicide certification as a test, estimating its sensitivity, specificity, and predictive value, using data from studies reviewed. (NB)
Descriptors: Attrition (Research Studies), Death, Evaluation Problems, Reliability

McCrae, Robert R.; Costa, Paul T., Jr. – Journal of Counseling and Development, 1991
Reviews NEO Personality Inventory (NEO-PI), which is based on Five-Factor Model taxonomy of personality traits. Summarizes characteristics of test, features for administration and scoring, and studies of reliability, stability, and validity. Claims NEO-PI may be particularly appropriate for use in counseling because it is brief,…
Descriptors: Personality Measures, Personality Traits, Test Reliability, Test Use