Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 21 |
Descriptor
Error Patterns | 30 |
Evaluation Methods | 30 |
Statistical Analysis | 30 |
Simulation | 9 |
Comparative Analysis | 8 |
Computation | 6 |
Correlation | 6 |
Research Methodology | 6 |
Educational Research | 5 |
Hypothesis Testing | 5 |
Measurement Techniques | 5 |
More ▼ |
Source
Author
Guo, Jiin-Huarng | 2 |
Luh, Wei-Ming | 2 |
Abraham, W. Todd | 1 |
Almoied, Ayed | 1 |
Baldwin, Scott A. | 1 |
Bentler, Peter M. | 1 |
Bird, Kevin D. | 1 |
Black, Ken | 1 |
Bulte, Isis | 1 |
Burstein, Leigh | 1 |
Cancino, Eduardo | 1 |
More ▼ |
Publication Type
Education Level
Elementary Education | 2 |
Higher Education | 2 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
More ▼ |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
Program for International… | 1 |
Stroop Color Word Test | 1 |
What Works Clearinghouse Rating
Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019
Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…
Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques
Almoied, Ayed – ProQuest LLC, 2017
Classical statistical tests are used in many disciplines such as education and psychology. Such tests are based on certain assumptions (e.g., normality and homoscedasticity) that are must to be met in order to produce accurate results. Violation of such assumptions is a common problem researchers encounter, particularly when analyzing real data.…
Descriptors: Evaluation, Statistical Analysis, Evaluation Methods, Simulation
Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016
In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…
Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods
Lin, Olivia Y.-H.; MacLeod, Colin M. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
Three experiments investigated the learning of simple associations in a color-word contingency task. Participants responded manually to the print colors of 3 words, with each word associated strongly to 1 of the 3 colors and weakly to the other 2 colors. Despite the words being irrelevant, response times to high-contingency stimuli and to…
Descriptors: Associative Learning, Learning Processes, Contingency Management, Color
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
Castilla-Earls, Anny; Pérez-Leroux, Ana Teresa; Restrepo, Maria Adelaida; Gaile, Daniel; Chen, Ziqiang – Language Acquisition: A Journal of Developmental Linguistics, 2018
This study investigates the use of the Spanish subjunctive in bilingual children with and without specific language impairments (SLI). Using an elicitation task, we examine: (i) the potential of the subjunctive as a grammatical marker of SLI in Spanish-English bilingual children, (ii) the extent to which degree of bilingualism affects performance,…
Descriptors: Spanish, Bilingualism, English (Second Language), Second Language Learning
Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013
Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Descriptors: Sample Size, Test Length, Correlation, Test Format
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Li, Hang; He, Lianzhen – Language Assessment Quarterly, 2015
This study used think-aloud protocols to compare essay-rating processes across holistic and analytic rating scales in the context of China's College English Test Band 6 (CET-6). A group of 9 experienced CET-6 raters scored the same batch of 10 CET-6 essays produced in an operational CET-6 administration twice, using both the CET-6 holistic…
Descriptors: Protocol Analysis, English (Second Language), Second Language Learning, Classification
Bird, Kevin D. – Psychological Methods, 2011
Any set of confidence interval inferences on J - 1 linearly independent contrasts on J means, such as the two comparisons [mu][subscript 1] - [mu][subscript 2] and [mu][subscript 2] - [mu][subscript 3] on 3 means, provides a basis for the deduction of interval inferences on all other contrasts, such as the redundant comparison [mu][subscript 1] -…
Descriptors: Intervals, Statistical Analysis, Inferences, Comparative Analysis
Joost C. F. de Winter; Dimitra Dodou – Practical Assessment, Research & Evaluation, 2010
Likert questionnaires are widely used in survey research, but it is unclear whether the item data should be investigated by means of parametric or nonparametric procedures. This study compared the Type I and II error rates of the "t" test versus the Mann-Whitney-Wilcoxon (MWW) for five-point Likert items. Fourteen population…
Descriptors: Evaluation Methods, Questionnaires, Likert Scales, Statistical Analysis
Murayama, Kou; Sakaki, Michiko; Yan, Veronica X.; Smith, Garry M. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2014
In order to examine metacognitive accuracy (i.e., the relationship between metacognitive judgment and memory performance), researchers often rely on by-participant analysis, where metacognitive accuracy (e.g., resolution, as measured by the gamma coefficient or signal detection measures) is computed for each participant and the computed values are…
Descriptors: Metacognition, Memory, Accuracy, Statistical Analysis
Haardorfer, Regine; Gagne, Phill – Focus on Autism and Other Developmental Disabilities, 2010
Some researchers have argued for the use of or have attempted to make use of randomization tests in single-subject research. To address this tide of interest, the authors of this article describe randomization tests, discuss the theoretical rationale for applying them to single-subject research, and provide an overview of the methodological…
Descriptors: Research Design, Researchers, Evaluation Methods, Research Methodology
Manolov, Rumen; Solanas, Antonio; Bulte, Isis; Onghena, Patrick – Journal of Experimental Education, 2010
This study deals with the statistical properties of a randomization test applied to an ABAB design in cases where the desirable random assignment of the points of change in phase is not possible. To obtain information about each possible data division, the authors carried out a conditional Monte Carlo simulation with 100,000 samples for each…
Descriptors: Monte Carlo Methods, Effect Size, Simulation, Evaluation Methods
Luh, Wei-Ming; Guo, Jiin-Huarng – Journal of Experimental Education, 2009
The sample size determination is an important issue for planning research. However, limitations in size have seldom been discussed in the literature. Thus, how to allocate participants into different treatment groups to achieve the desired power is a practical issue that still needs to be addressed when one group size is fixed. The authors focused…
Descriptors: Sample Size, Research Methodology, Evaluation Methods, Simulation
Previous Page | Next Page »
Pages: 1 | 2