ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	21

Descriptor

Error Patterns	30
Evaluation Methods	30
Statistical Analysis	30
Simulation	9
Comparative Analysis	8
Computation	6
Correlation	6
Research Methodology	6
Educational Research	5
Hypothesis Testing	5
Measurement Techniques	5
Inferences	4
Prediction	4
Error of Measurement	3
Experimental Psychology	3
Foreign Countries	3
Grammar	3
Models	3
Monte Carlo Methods	3
Research Design	3
Sample Size	3
Sampling	3
Student Evaluation	3
Test Items	3
Visual Stimuli	3
More ▼

Publication Type

Journal Articles	22
Reports - Research	17
Reports - Descriptive	6
Reports - Evaluative	3
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Tests/Questionnaires	1

Education Level

Elementary Education	2
Higher Education	2
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Researchers

Location

Canada	1
China	1
Finland	1
France	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Program for International…	1
Stroop Color Word Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Using the Standard Wald Confidence Interval for a Population Proportion Hypothesis Test Is a Common Mistake

Peer reviewed

Direct link

Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019

Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…

Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques

Robustness and Comparative Power of Welch-Aspin, Alexander-Govern and Yuen Tests under Non-Normality and Variance Heteroscedasticity

Direct link

Almoied, Ayed – ProQuest LLC, 2017

Classical statistical tests are used in many disciplines such as education and psychology. Such tests are based on certain assumptions (e.g., normality and homoscedasticity) that are must to be met in order to produce accurate results. Violation of such assumptions is a common problem researchers encounter, particularly when analyzing real data.…

Descriptors: Evaluation, Statistical Analysis, Evaluation Methods, Simulation

Estimating Statistical Power When Making Adjustments for Multiple Tests

Peer reviewed
PDF on ERIC

Download full text

Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016

In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…

Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods

The Acquisition of Simple Associations as Observed in Color-Word Contingency Learning

Peer reviewed

Direct link

Lin, Olivia Y.-H.; MacLeod, Colin M. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018

Three experiments investigated the learning of simple associations in a color-word contingency task. Participants responded manually to the print colors of 3 words, with each word associated strongly to 1 of the 3 colors and weakly to the other 2 colors. Despite the words being irrelevant, response times to high-contingency stimuli and to…

Descriptors: Associative Learning, Learning Processes, Contingency Management, Color

Optimal Weighting for Exam Composition

Peer reviewed
PDF on ERIC

Download full text

Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018

A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…

Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests

The Complexity of the Spanish Subjunctive in Bilingual Children with SLI

Peer reviewed

Direct link

Castilla-Earls, Anny; Pérez-Leroux, Ana Teresa; Restrepo, Maria Adelaida; Gaile, Daniel; Chen, Ziqiang – Language Acquisition: A Journal of Developmental Linguistics, 2018

This study investigates the use of the Spanish subjunctive in bilingual children with and without specific language impairments (SLI). Using an elicitation task, we examine: (i) the potential of the subjunctive as a grammatical marker of SLI in Spanish-English bilingual children, (ii) the extent to which degree of bilingualism affects performance,…

Descriptors: Spanish, Bilingualism, English (Second Language), Second Language Learning

An Investigation of Sample Size Splitting on ATFIND and DIMTEST

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013

Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…

Descriptors: Sample Size, Test Length, Correlation, Test Format

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Direct link

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis

A Comparison of EFL Raters' Essay-Rating Processes across Two Types of Rating Scales

Peer reviewed

Direct link

Li, Hang; He, Lianzhen – Language Assessment Quarterly, 2015

This study used think-aloud protocols to compare essay-rating processes across holistic and analytic rating scales in the context of China's College English Test Band 6 (CET-6). A group of 9 experienced CET-6 raters scored the same batch of 10 CET-6 essays produced in an operational CET-6 administration twice, using both the CET-6 holistic…

Descriptors: Protocol Analysis, English (Second Language), Second Language Learning, Classification

Deduced Inference in the Analysis of Experimental Data

Peer reviewed

Direct link

Bird, Kevin D. – Psychological Methods, 2011

Any set of confidence interval inferences on J - 1 linearly independent contrasts on J means, such as the two comparisons [mu][subscript 1] - [mu][subscript 2] and [mu][subscript 2] - [mu][subscript 3] on 3 means, provides a basis for the deduction of interval inferences on all other contrasts, such as the redundant comparison [mu][subscript 1] -…

Descriptors: Intervals, Statistical Analysis, Inferences, Comparative Analysis

Five-Point Likert Items: t Test versus Mann-Whitney-Wilcoxon

Peer reviewed

Direct link

Joost C. F. de Winter; Dimitra Dodou – Practical Assessment, Research & Evaluation, 2010

Likert questionnaires are widely used in survey research, but it is unclear whether the item data should be investigated by means of parametric or nonparametric procedures. This study compared the Type I and II error rates of the "t" test versus the Mann-Whitney-Wilcoxon (MWW) for five-point Likert items. Fourteen population…

Descriptors: Evaluation Methods, Questionnaires, Likert Scales, Statistical Analysis

Type I Error Inflation in the Traditional By-Participant Analysis to Metamemory Accuracy: A Generalized Mixed-Effects Model Perspective

Peer reviewed

Direct link

Murayama, Kou; Sakaki, Michiko; Yan, Veronica X.; Smith, Garry M. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2014

In order to examine metacognitive accuracy (i.e., the relationship between metacognitive judgment and memory performance), researchers often rely on by-participant analysis, where metacognitive accuracy (e.g., resolution, as measured by the gamma coefficient or signal detection measures) is computed for each participant and the computed values are…

Descriptors: Metacognition, Memory, Accuracy, Statistical Analysis

The Use of Randomization Tests in Single-Subject Research

Peer reviewed

Direct link

Haardorfer, Regine; Gagne, Phill – Focus on Autism and Other Developmental Disabilities, 2010

Some researchers have argued for the use of or have attempted to make use of randomization tests in single-subject research. To address this tide of interest, the authors of this article describe randomization tests, discuss the theoretical rationale for applying them to single-subject research, and provide an overview of the methodological…

Descriptors: Research Design, Researchers, Evaluation Methods, Research Methodology

Data-Division-Specific Robustness and Power of Randomization Tests for ABAB Designs

Peer reviewed

Direct link

Manolov, Rumen; Solanas, Antonio; Bulte, Isis; Onghena, Patrick – Journal of Experimental Education, 2010

This study deals with the statistical properties of a randomization test applied to an ABAB design in cases where the desirable random assignment of the points of change in phase is not possible. To obtain information about each possible data division, the authors carried out a conditional Monte Carlo simulation with 100,000 samples for each…

Descriptors: Monte Carlo Methods, Effect Size, Simulation, Evaluation Methods

The Sample Size Needed for the Trimmed "t" Test when One Group Size Is Fixed

Peer reviewed

Direct link

Luh, Wei-Ming; Guo, Jiin-Huarng – Journal of Experimental Education, 2009

The sample size determination is an important issue for planning research. However, limitations in size have seldom been discussed in the literature. Thus, how to allocate participants into different treatment groups to achieve the desired power is a practical issue that still needs to be addressed when one group size is fixed. The authors focused…

Descriptors: Sample Size, Research Methodology, Evaluation Methods, Simulation

Previous Page | Next Page »

Pages: 1 | 2

Educational and Psychological…	4
Journal of Experimental…	3
Journal of Experimental…	2
ProQuest LLC	2
Cognitive Psychology	1
Diagnostique	1
Education Sciences	1
Focus on Autism and Other…	1
International Educational…	1
Journal of Consulting and…	1
Journal of Counseling…	1
Journal of Verbal Learning…	1
Language Acquisition: A…	1
Language Assessment Quarterly	1
Mathematica Policy Research,…	1
Practical Assessment,…	1
Psychological Methods	1
Society for Research on…	1
Structural Equation Modeling:…	1
Teaching Statistics: An…	1
More ▼

Guo, Jiin-Huarng	2
Luh, Wei-Ming	2
Abraham, W. Todd	1
Almoied, Ayed	1
Baldwin, Scott A.	1
Bentler, Peter M.	1
Bird, Kevin D.	1
Black, Ken	1
Bulte, Isis	1
Burstein, Leigh	1
Cancino, Eduardo	1
Carr, Sonya C.	1
Castilla-Earls, Anny	1
Chen, Ziqiang	1
Coleman, Edmund B.	1
DeMars, Christine E.	1
Dimitra Dodou	1
Ferreres, Doris	1
Fidalgo, Angel M.	1
Gagne, Phill	1
Gaile, Daniel	1
Ganzfried, Sam	1
Haardorfer, Regine	1
He, Lianzhen	1
Joost C. F. de Winter	1
More ▼