Showing posts with label Effect Size. Show all posts
Showing posts with label Effect Size. Show all posts

Wednesday, July 20, 2022

Effect Sizes (ES) in statistics

In statistics, an effect size (ES) indicates the strength of the relationship between two variables.

In psychological experiments, researchers are interested in the strength of the effect of the Independent Variable on the Dependent Variable.

In psychotherapy studies, researchers may be interested in the effects of treatment on a measure of the dependent variable. A research questions may be framed: How effective is a set of 6 CBT sessions on the reduction of depression?

Psychologists have often described effect sizes as small, medium, or large.

Cohen's d

Cohen's d is a measure of effect size between two groups. The mean of one group is subtracted from a second group and divided by the pooled standard deviation of the two groups.

ES = (M1 - M2) / SD

Effect Size  Label

0.2     Small

0.5     Medium

0.8     Large

Pearson Correlation Coefficient (r)

0.1 to 0.3  Small

0.3 to 0.5  Medium

0.5 to 1.0   Large

Converting Cohen's d to the correlation coefficient

r =  d/ d2 + 4

Note. Negative values also indicate strength of a relationship.

Cite this post

Sutton, G. (2022, July 20). Effect sizes (ES) in statistics. Assessment, Statistics, and Research. Retrieved from


Cohen, J. (1992). A power primer. Psychological Bulletin, 112(1), 155–159.

Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Lawrence Erlbaum Associates.

Sutton, G. W. (2020). Applied statistics: Concepts for counselors, second edition. Springfield, MO: Sunflower.

 Link to an Index of Statistical Concepts in Psychology, Counseling, and Education

 Reference for using scales in research:

Creating Surveys on AMAZON      or     GOOGLE


Reference for clinicians on understanding assessment

Applied Statistics Concepts for Counselors on




Resource Link:  A-Z Statistical Terms

Resource Link:  A – Z Test Index

Related Terms

Standard Deviation


Links to Connections

Checkout My Website


See my Books





FOLLOW me on

   FACEBOOK   Geoff W. Sutton  


   TWITTER  @Geoff.W.Sutton




Read published articles:


  Academia   Geoff W Sutton   


  ResearchGate   Geoffrey W Sutton 

Tuesday, January 5, 2021

ANOVA in Counseling & Psychology Research

There are several types of ANOVA procedures. The term ANOVA refers to Analysis of Variance. Variance is a statistical term we will review later. Variance refers to differences, so the ANOVA procedures examine differences in scores among groups of people who complete a survey, a test, or produce a scorable response.

For example, an ANOVA can be used to assess the effects of three temperatures on math. The Independent Variable is temperature varies three ways (75, 85, 95 degrees F). The dependent variable is math. The dependent measure of math is a math test.

When there is only one independent variable (IV), the ANOVA is called a one-way ANOVA. If there are two IVs the ANOVA is a two-way ANOVA, and so forth. It is rare to go beyond a four-way because the interpretation of interactions is complicated.

The ANOVA procedure is usually reported with an F value. The larger the F value, the more likely it is that the differences the researchers found are not due to chance.

There may be several independent variables in a project. The effect of each variable is tested with an F test. When there are two or more variables, researchers also test for possible interaction effects, which results in additional F tests for each interaction. Interactions refer to the possibility that two or more variables combine to produce a change in the dependent variable. As with t tests, researchers include a probability (p) value with each F test. A common effect size associated with F tests is partial eta squared. An ANOVA is used when there are one or more independent variables but only one dependent variable.

Independent or grouping Variable = 1 or more IV

Dependent or criterion Variable = 1 DV

Dependent Measure = DM = a test score or quantitative measure of the DV

IV Groups or levels = 2 or more. An IV may have many levels (e.g., temperature, dosages) or groups (e.g., therapy groups, learning groups, work groups).

Overall tests are used to determine significant effects or differences among the groups or levels of the IVs.

An test indicates significance overall and for specific effects or relationships.

A commonly reported measure of effect size is eta squared. In psychology, researchers have been encouraged to focus on effect sizes rather than p values when analyzing and reporting research results.

value reveals the probability of a significant relationship-- one that is not due to chance factors. The level of significance is set by the researchers. A common level is p is .05. F values yielding a probability below 0.05 are commonly considered significant in psychology and education.

The concept or idea is that a difference between means yields a large F value that would only occur 95 times out of a hundred due to chance. When researchers analyze the differences, they are analyzing variance hence, ANOVA.

If the overall F test is significant, then researchers may compare group means two at a time to determine possible significant differences between pairs of groups. There are many tests of pairs. For example t tests, Tukey HSD, Bonferroni, Neuman-Keuls. These tests are called post hoc tests because they are used only if the overall F test is significant.

Read more about ANOVA and data analyses in the following books.

Applied Statistics Concepts for Counselors on AMAZON   or   GOOGLE

Creating Surveys on AMAZON    or   GOOGLE  Worldwide

Illustration of a three group design project. Volunteers in a math class are randomly assigned to three levels of room temperature (Fahrenheit) to determine if room temperature has an effect on the score. The random assignment should mean that individual characteristics of the participants would not be a factor in the score.

(The temperature is in the 90s today and expected to reach the 100s this week!)




75 degrees

Math score

85 degrees

Math score

95 degrees

Math Score

If the researchers worried that randomization did not truly even out the math skills of the participant's then they could give a math  pretest.

The IV is a research design term. In statistics, the IV is often represented by the letter X and the three groups would be X1, X2, and X3.

The DV if often referred to using the letter Y in statistics. In this study, there would be three Y values Y1, Y2 and Y3.

If the overall F value indicated a significant difference for the variation in scores, then the researchers could compare each pair. There would be three comparisons.

Y1 and Y2
Y2 and Y3
Y1 and Y3

What is compared?

In ANOVA, the differences among the mean scores are analyzed.
In the post hoc tests, the differences between the pairs of means are analyzed.

Serious stats help

If you need help with detailed analysis, one of the best statistics books for details is Discovering Statistics by Andy Field. I've used an earlier edition as a course textbook. It is of course not needed for those who desire only a conceptual understanding. See Discovering Statistics on AMAZON for more information.

Other notes
The letter F in the F value comes from the surname of the British statistician, Sir Ronald Fisher (1890-1962) born in my hometown, London, England.

Daniel Fahrenheit of temperature fame was born in Poland to a German family.

Links to Connections


Checkout My Website


See my Books






JOIN me on


   FACEBOOK   Geoff W. Sutton  


   TWITTER  @Geoff.W.Sutton




Read many published articles:


  Academia   Geoff W Sutton   


  ResearchGate   Geoffrey W Sutton 

Identity Salience Questionnaire (ISQ)

  Assessment name: Identity Salience Questionnaire (ISQ) Scale overview: The Identity Salience Questionnaire (ISQ) is a 6-item self-repor...