How Statistics Can Be Used to Drive Business Decisions
Source: Dev.to
Introduction
In today’s competitive business landscape, intuition is no longer sufficient for making critical decisions. Companies that leverage statistical analysis to inform their strategies consistently outperform those that rely on experience or instinct. The story below demonstrates how a systematic statistical approach—from descriptive analytics to hypothesis testing—can provide clear, evidence‑based answers to complex business questions. It also shows how understanding concepts such as effect size, statistical power, and potential errors can prevent costly mistakes and unlock growth opportunities.
A retail company operating both online and physical stores wanted to answer three key questions:
- How are sales performing over time?
- How reliable are insights drawn from the data?
- Does running a marketing campaign actually increase revenue per transaction?
The company had three years of transaction data, including revenue, store type, region, and whether a marketing campaign was used. The goal was to use statistics to support decision‑making.
Descriptive Statistics
Central Tendency (The “Average”)
- Mean revenue: 8,272 per transaction
- Median revenue: 7,723 per transaction
The mean is higher than the median, indicating the presence of high‑value outliers. The median is often more “typical” for the bulk of transactions.
Distribution Shape
Skewness and kurtosis reveal that most transactions are low to moderate, but a few very high transactions pull the average up. The distribution is right‑skewed with a long tail of large values.
Visualizations
- Revenue Over Time (Line Chart) – Shows seasonal peaks (December) and valleys (January).
- Revenue by Store Type (Bar Chart) – Highlights differences between online and physical locations.
- Revenue by Region (Box Plot) – Indicates that a single marketing strategy won’t fit all regions; customization is needed.
- Units Sold vs. Revenue (Scatter Plot) – Illustrates the relationship between volume and monetary value.
Types of Bias
Selection Bias
- Urban areas systematically differ from rural areas.
- Higher income and different shopping behaviors.
- Better infrastructure and internet connectivity.
Geographic Bias
- Rural regions completely excluded.
- Findings cannot be generalized to the entire market.
Socioeconomic Bias
- Urban customers have different purchasing power.
- Product preferences may differ.
Business Impact
- Revenue estimates could be overstated.
- Marketing effectiveness might be overestimated.
- Regional strategy would be incomplete.
- Expansion decisions would lack an empirical foundation.
Recommended Sampling Method
Divide the population into strata (e.g., regions, store types) and randomly sample proportionally from each stratum. This approach:
- Ensures all segments are represented.
- Maintains the natural population distribution.
- Allows both overall and stratum‑specific analysis.
Fundamental Concepts
Law of Large Numbers
As the sample size grows, the sample mean converges to the true population mean, providing more reliable estimates.
Central Limit Theorem
Regardless of the underlying distribution, the sampling distribution of the mean approaches a normal distribution as the sample size increases, enabling the use of parametric tests.
Hypothesis Testing
A one‑tailed independent‑samples t‑test compared revenues from transactions with and without a marketing campaign.
- Result: Large t‑statistic; p‑value far below the 5 % significance level.
- Decision: Reject the null hypothesis; conclude that marketing campaigns significantly increase average revenue per transaction.
Error Considerations
- Type II error is especially concerning because lost revenue is permanent, competitors can gain market share, and recovery is expensive.
Effect Size
Although the effect was statistically significant, Cohen’s d indicated a small‑to‑medium effect size, meaning the impact per transaction is modest.
A statistically insignificant result could still matter in practice if:
- The effect is small but consistent.
- The business operates at a large scale.
- The sample size is insufficient.
Conclusion
- Understand performance realistically.
- Measure risk and variability.
- Test strategic decisions objectively.
- Avoid costly cognitive and sampling biases.
By integrating descriptive statistics, visualization, sampling theory, probability laws, and hypothesis testing, organizations can make evidence‑based decisions that are both statistically sound and commercially meaningful. In an increasingly competitive environment, businesses that leverage statistics effectively gain a decisive advantage—not by predicting the future perfectly, but by making better decisions under uncertainty.