Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 11 |
Descriptor
Source
Author
Nimmer, Donald N. | 2 |
Aiken, Lewis R. | 1 |
Alderson, J. Charles | 1 |
Amster, Judith B. | 1 |
Arndt, Stephan | 1 |
Bachman, Lyle F. | 1 |
Bates, Gary W. | 1 |
Bobie, Allen | 1 |
Brittain, Clay V. | 1 |
Brittain, Mary M. | 1 |
Brown, Linda | 1 |
More ▼ |
Publication Type
Education Level
Adult Education | 1 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Practitioners | 3 |
Researchers | 2 |
Teachers | 2 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Wagner, Elvis; Krylova, Anna – Language Assessment Quarterly, 2021
When the COVID-19 pandemic made it impossible to do in-person, on campus testing, we were forced to create a new system to screen International Teaching Assistants (ITA) for Temple university. We used this opportunity to address many of the concerns and problems that we had identified with the previous test, and created a new test that could be…
Descriptors: Placement Tests, COVID-19, Pandemics, Computer Assisted Testing
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016
The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…
Descriptors: Educational Assessment, Reliability, Validity, Test Construction
Skaggs, Gary – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a particularly good way to approach instrument development, and this author states that he was delighted to read Adam Wyse's thoughts about how to use construct maps for standard setting. For a number of popular standard-setting methods, Wyse shows how typical feedback to panelists fits within a construct map framework.…
Descriptors: Standard Setting (Scoring), Maps, Test Construction, Measurement
Johnson, Alyce O. – Journal of Psychoeducational Assessment, 2015
The "Parenting Stress Index, Fourth Edition" (PSI-4) is a 120-item measure used to explore parental stress levels considering a parent's relationship with one of his or her children between the ages of 1 month and 12 years. The main purpose of the test is to define these stress levels and from where they originate in order to identify…
Descriptors: Anxiety, Measures (Individuals), Parents, Child Rearing
Eliasson, Ann-Christin – Physical & Occupational Therapy in Pediatrics, 2012
Assessments used for both clinical practice and research should show evidence of validity and reliability for the target group of people. It is easy to agree with this statement, but it is not always easy to choose the right assessment for the right purpose. Recently there have been increasing numbers of studies which investigate further the…
Descriptors: Psychometrics, Test Construction, Test Reliability, Test Validity
Lissitz, Robert W.; Calico, Tiago – Measurement: Interdisciplinary Research and Perspectives, 2012
This paper presents the authors' critique on "Clarifying the Consensus Definition of Validity" by Paul E. Newton (this issue). There are serious differences of opinion regarding the topic of validity. Newton is aware of these differences, as made clear by his choice of references and particularly his effort to respond to the various Borsboom…
Descriptors: Concept Formation, Test Construction, Test Validity, Scores
Greathouse, Dan; Shaughnessy, Michael F. – Journal of Psychoeducational Assessment, 2016
Whenever a major intelligence or achievement test is revised, there is always renewed interest in the underlying structure of the test as well as a renewed interest in the scoring, administration, and interpretation changes. In this interview, Amy Gabel discusses the most recent revision of the "Wechsler Intelligence Scale for Children-Fifth…
Descriptors: Children, Intelligence Tests, Test Use, Test Validity
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
Hall, Graham – ELT Journal, 2010
Uysal's article provides a research agenda for IELTS and lists numerous issues concerning the test's reliability and validity. She asks useful questions, but her analysis ignores the uncertainties inherent in all language test development and the wider social and political context of international high-stakes language testing. In this response, I…
Descriptors: Testing, Language Tests, English, High Stakes Tests
DeBacker, Teresa K.; Crowson, H. Michael – Contemporary Educational Psychology, 2008
Need for closure, as formulated by Kruglanski and colleagues [Kruglanski, A. W. (1990). Lay epistemic theory in social-cognitive psychology. "Psychological Inquiry," 1(3), 181-197; Kruglanski, A. W., & Webster, D. M. (1996). Motivated closing of the mind: Seizing and freezing. "Psychological Review," 103, 263-283; Webster,…
Descriptors: Construct Validity, Factor Analysis, Psychometrics, Cognitive Processes

Streiner, David L.; Miller, Harold R. – Journal of Clinical Psychology, 1986
Numerous short forms of the Minnesota Multiphasic Personality Inventory have been proposed in the last 15 years. In each case, the initial enthusiasm has been replaced by the questions about the clinical utility of the abbreviated version. Argues that the statistical properties of the test and reduced reliability due to shortening the scales…
Descriptors: Test Construction, Test Format, Test Length, Test Reliability
Salies, Tania Gastao – 1998
A discussion of the evaluation of writing, particularly in English as a Second Language, argues for a communicative approach reflecting the current approach to language teaching and learning. The movement toward more communication-oriented and more valid language testing is examined briefly, and direct assessment is chosen as the preferred format…
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Foreign Countries

Chambers, William V. – Social Behavior and Personality, 1985
Personal construct psychologists have suggested various psychological functions explain differences in the stability of constructs. Among these functions are constellatory and loose construction. This paper argues that measurement error is a more parsimonious explanation of the differences in construct stability reported in these studies. (Author)
Descriptors: Error of Measurement, Test Construction, Test Format, Test Reliability
Henson, Robin K. – 2000
The purpose of this paper is to highlight some psychometric cautions that should be observed when seeking to develop short form versions of tests. Several points are made: (1) score reliability is impacted directly by the characteristics of the sample and testing conditions; (2) sampling error has a direct influence on reliability and factor…
Descriptors: Factor Structure, Psychometrics, Reliability, Sampling
Haladyna, Thomas M. – Educational Horizons, 2006
This article argues that the validity of standardized achievement test-score interpretation and use is problematic; consequently, confidence and trust in such test scores may often be unwarranted. The problem is particularly severe in high-stakes situations. This essay provides a context for understanding standardized achievement testing, then…
Descriptors: Validity, Testing, Achievement Tests, Standardized Tests