Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Regional Educational Laboratory West, 2020
These are the appendixes for the report, "The Association between Teachers' Use of Formative Assessment Practices and Students' Use of Self-Regulated Learning Strategies." Two appendixes are included in this document. Appendix A presents the methods of the study, including the reliability of the teacher and student surveys and the…
Descriptors: Formative Evaluation, Learning Strategies, Elementary School Students, Elementary School Teachers
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
Miciak, Jeremy; Taylor, W. Pat; Stuebing, Karla K.; Fletcher, Jack M. – Journal of Psychoeducational Assessment, 2018
We investigated the classification accuracy of learning disability (LD) identification methods premised on the identification of an intraindividual pattern of processing strengths and weaknesses (PSW) method using multiple indicators for all latent constructs. Known LD status was derived from latent scores; values at the observed level identified…
Descriptors: Accuracy, Learning Disabilities, Classification, Identification
Tan, Shiu Kuan; Chellappan, Kalaivani – Measurement and Evaluation in Counseling and Development, 2018
This study investigated the validity and reliability of scores on the instrument employing Rasch analysis in a sample of 299 Malaysian adolescents aged between 16 and 19 and provided further evidence for the validity among the sub-constructs: social self-efficacy, academic self-efficacy, and emotional self-efficacy.
Descriptors: Test Validity, Test Reliability, Self Efficacy, Questionnaires
Correia, Edgar A.; Sartóris, Vítor; Fernandes, Tiago; Cooper, Mick; Berdondini, Lucia; Sousa, Daniel; Pires, Branca Sá; da Fonseca, João – British Journal of Guidance & Counselling, 2018
Within the major therapeutic paradigms, observational instruments have been developed to assess orientation-specific interventions or processes. However, to date, no such instrument exists to assess existential practices. Recent research indicates the key practices of existential therapists, and forms an empirical basis on which to develop an…
Descriptors: Foreign Countries, Psychotherapy, Allied Health Personnel, Observation
Ssemakula, Mukasa E.; Liao, Gene Y.; Sawilowsky, Shlomo – American Journal of Engineering Education, 2018
There is a major trend in engineering education to provide students with realistic hands-on learning experiences. This paper reports on the results of work done to develop standardized test instruments to use for student learning outcomes assessment in an experiential hands-on manufacturing engineering and technology environment. The specific…
Descriptors: Test Construction, Psychometrics, Test Validity, Standardized Tests
Al Rabadi, Wail Minwer; Salem, Rifqa Khleif – International Education Studies, 2018
The study was designed to identify the effect of high-order thinking on the quality of life among Ajloun University students. The study used the associative method. The randomly selected sample consisted of 147 students from Ajloun University College. The study used two tools: The two measures were applied to the sample of the current study after…
Descriptors: Thinking Skills, Quality of Life, Foreign Countries, Correlation
Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017
The purpose of the present study was to determine the reliability of a 5 km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5 km time trials on a treadmill following a standardized warm-up. Runners were partially blinded to their running speed and distance covered. Total time to complete the…
Descriptors: Athletics, Physical Activities, Athletes, Test Reliability
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Badru, Ademola K. – Journal of Education and Practice, 2016
The study investigated Problem-based Instructional Strategy and Numerical ability as determinants of Senior Secondary Achievement in Mathematics. This study used 4 x 2 x 2 non-randomised control group Pretest-Posttest Quasi-experimental Factorial design. It consisted of two independent variables (treatment and Numerical ability) and one moderating…
Descriptors: Teaching Methods, Problem Based Learning, Secondary School Students, Mathematics Achievement
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2016
This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…
Descriptors: Test Reliability, Test Format, Test Items, Differences
Del Valle, Milenko; Matos, Lennia; Díaz, Alejandro; Pérez, María Victoria; Vergara, Jorge – Journal of Educational Psychology - Propositos y Representaciones, 2018
This research work aims to analyze the psychometric properties of the Basic Psychological Needs Satisfaction and Frustration Scale (BPNSFS)--autonomy, competence and relatedness--identified by the self-determination theory (Deci & Ryan, 2000b), in a sample of 297 university students from different faculties and programs belonging to a Chilean…
Descriptors: Foreign Countries, Likert Scales, Psychometrics, Psychological Needs