Publication Date
In 2025 | 1 |
Since 2024 | 18 |
Since 2021 (last 5 years) | 47 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 204 |
Descriptor
Test Content | 308 |
Test Items | 115 |
Foreign Countries | 96 |
Test Construction | 78 |
Test Validity | 65 |
Scores | 47 |
Language Tests | 45 |
Second Language Learning | 42 |
Student Evaluation | 42 |
Test Format | 40 |
Comparative Analysis | 38 |
More ▼ |
Source
Author
Sireci, Stephen G. | 4 |
Solano-Flores, Guillermo | 3 |
Steffen, Manfred | 3 |
Abedi, Jamal | 2 |
Agarwal, Pooja K. | 2 |
Bauer, Scott C. | 2 |
Binkley, Marilyn | 2 |
Borman, Walter C. | 2 |
Chang, Hua-Hua | 2 |
Cox, Shawna | 2 |
Dorans, Neil J. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Teachers | 7 |
Practitioners | 5 |
Researchers | 2 |
Administrators | 1 |
Location
Australia | 8 |
Canada | 8 |
Turkey | 8 |
California | 7 |
Europe | 6 |
China | 5 |
United States | 5 |
Germany | 4 |
Hong Kong | 4 |
Iran | 4 |
Japan | 4 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
No Child Left Behind Act 2001 | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
BijanKhan, Mahmood; ShayesteFar, Parvaneh; Mohebbi, Hassan – Language Testing in Asia, 2023
Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted "Peykare," a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test…
Descriptors: Indo European Languages, Language Tests, Test Construction, Test Validity
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Tugçe Duran; Musa Dikmenli – Journal of Education in Science, Environment and Health, 2024
This study aimed to comprehensively examine the articles in which multi-tier concept diagnostic tests, which are among the alternative assessment methods frequently used in recent years to identify misconceptions, were used in biology education between 2000 and 2022. For this purpose, systematic review steps were followed and summarized in the…
Descriptors: Diagnostic Tests, Biology, Misconceptions, Science Education
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Timothy S. Faith – Teaching and Learning Excellence through Scholarship, 2024
This study compared traditional methods of college-level instruction, including lecture and class discussion followed by assessment via course content exams, with a variety of other instructional techniques. The intent was to evaluate whether more contemporary instructional techniques are significantly correlated with improved average exam scores…
Descriptors: Community College Students, Business Administration Education, Teaching Methods, Alternative Assessment
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Schaefer, Stephanie; Moore-Russo, Deborah – North American Chapter of the International Group for the Psychology of Mathematics Education, 2022
As standards documents have been introduced over the past 20 years, many states have seen an evolution in both the standards and related high stakes exams. For many teachers across the U.S., the rollout of standards and exams has not been an experience that builds trust in state education leaders. In this study, we consider three major changes in…
Descriptors: Mathematics Tests, High School Freshmen, Academic Standards, State Standards
Carrie L. Bonilla – Hispania, 2024
This article details the challenges and best practices of evaluating second language learners for placement into postsecondary Spanish language courses. The literature on testing for placement purposes in second language acquisition and language testing provides a great deal of insight, but language programs must make many decisions as well that…
Descriptors: Spanish, Language Tests, Placement Tests, Test Validity
Joseph Smith – Journal of Curriculum Studies, 2024
This paper offers an analysis of Modern Studies, a school subject unique to Scotland. First taught in the 1960s, Modern Studies was originally conceived as an option for students discontinuing their studies in history and geography. Since then, though, Modern Studies has carved a distinctive curricular niche and has become one of the most popular…
Descriptors: Foreign Countries, Philosophy, Modern History, Social Studies
Wu, Haiyan; Liang, Xinya; Yürekli, Hülya; Becker, Betsy Jane; Paek, Insu; Binici, Salih – Journal of Psychoeducational Assessment, 2020
The demand for diagnostic feedback has triggered extensive research on cognitive diagnostic models (CDMs), such as the deterministic input, noisy output "and" gate (DINA) model. This study explored two Q-matrix specifications with the DINA model in a statewide large-scale mathematics assessment. The first Q-matrix was developed based on…
Descriptors: Mathematics Tests, Cognitive Measurement, Models, Test Items
Nabor C. Mendonça – ACM Transactions on Computing Education, 2024
The recent integration of visual capabilities into Large Language Models (LLMs) has the potential to play a pivotal role in science and technology education, where visual elements such as diagrams, charts, and tables are commonly used to improve the learning experience. This study investigates the performance of ChatGPT-4 Vision, OpenAI's most…
Descriptors: Artificial Intelligence, Natural Language Processing, Technology Uses in Education, Foreign Countries
da Silva, Mônia Aparecida; de Mendonça Filho, Euclides J.; Mônego, Bruna G.; Bandeira, Denise R. – Early Child Development and Care, 2020
This study is a systematic review designed to identify the instruments most frequently used to evaluate children's development, describe their operational and psychometric characteristics and determine which are the most accurate. We carried out a systematic search of the online databases PsycINFO and PubMed Central using the descriptors…
Descriptors: Child Development, Measures (Individuals), Psychometrics, Accuracy
Noori, Mahdieh – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2022
Shifting focus of asocial language tests toward social considerations recalls their ideological basis (Mirhosseini, De Costa, 2020). The recurrent exposure to and involvement in discursive constructions of high stakes' contents, may bring along certain sociocultural conceptualizations and values by test audiences (van Dijk, 1998). Thus, unless a…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Test Content