Effect sizes and p values: What should be reported and what should be replicated?

1 March 1996

journal article
Published by Wiley in Psychophysiology

Vol. 33 (2), 175-183
https://doi.org/10.1111/j.1469-8986.1996.tb02121.x

Abstract

Despite publication of many well-argued critiques of null hypothesis testing (NHT). behavioral science researchers continue to rely heavily on this set of practices. Although we agree with most critics' catalogs of NHT's flaws, this article also takes the unusual stance of identifying virtues that may explain why NHT continues to he so extensively used. These virtues include providing results in the form of a dichotomous (yes/no) hypothesis evaluation and providing an index (p value) Mini has a justifiable mapping onto confidence in repeatability of a null hypothesis rejection. The most-criticized flaws of NHT can be avoided when the importance of a hypothesis, rather than the p value of its test, is used to determine that a finding is worthy of report, and when p=.05 is treated as insufficient basis for confidence in the replicability of an isolated non-null finding. Together with many recent critics of NHT, we also urge reporting of important hypothesis tests in enough descriptive detail to permit secondary uses such as meta-analysis.

Keywords

This publication has 36 references indexed in Scilit:

The earth is round (p < .05).
American Psychologist, 1994
Half a minute: Predicting teacher evaluations from thin slices of nonverbal behavior and physical attractiveness.
Journal of Personality and Social Psychology, 1993
Does contact lead to similarity or similarity to contact?
Behavior Genetics, 1990
Things I have learned (so far).
American Psychologist, 1990
Hindsight bias: An interaction of automatic and motivational factors?
Memory & Cognition, 1988
Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology.
Journal of Consulting and Clinical Psychology, 1978
Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology.
Journal of Consulting and Clinical Psychology, 1978
Data-Dredging Procedures in Survey Analysis
The American Statistician, 1966
The test of significance in psychological research.
Psychological Bulletin, 1966
The Place of Statistics in Psychology
Educational and Psychological Measurement, 1960

Cited by 204 articles