Unbiasedness Statistics
1. **Stating the problem:** We want to understand the concept of unbiasedness in statistics and see examples.
2. **Definition:** An estimator \(\hat{\theta}\) of a parameter \(\theta\) is **unbiased** if its expected value equals the true parameter, i.e., $$E(\hat{\theta}) = \theta.$$ This means on average, the estimator hits the true value.
3. **Important rule:** Bias is defined as $$\text{Bias}(\hat{\theta}) = E(\hat{\theta}) - \theta.$$ If bias = 0, the estimator is unbiased.
4. **Example 1: Sample mean as estimator of population mean**
- Problem: Estimate population mean \(\mu\) using sample mean \(\bar{X}\).
- Formula: $$\bar{X} = \frac{1}{n} \sum_{i=1}^n X_i$$
- Expected value: $$E(\bar{X}) = \mu$$
- Conclusion: Sample mean is an unbiased estimator of population mean.
5. **Example 2: Sample variance (with denominator \(n-1\)) as estimator of population variance**
- Problem: Estimate population variance \(\sigma^2\) using
$$S^2 = \frac{1}{n-1} \sum_{i=1}^n (X_i - \bar{X})^2$$
- Expected value: $$E(S^2) = \sigma^2$$
- Conclusion: This sample variance is unbiased.
6. **Example 3: Sample variance with denominator \(n\) is biased**
- Problem: Use $$S_n^2 = \frac{1}{n} \sum_{i=1}^n (X_i - \bar{X})^2$$
- Expected value: $$E(S_n^2) = \frac{n-1}{n} \sigma^2 < \sigma^2$$
- Conclusion: This estimator underestimates variance, so it is biased.
7. **Example 4: Estimator for population proportion**
- Problem: Estimate population proportion \(p\) by sample proportion \(\hat{p} = \frac{X}{n}\) where \(X\) is number of successes.
- Expected value: $$E(\hat{p}) = p$$
- Conclusion: Sample proportion is unbiased.
8. **Example 5: Estimator for population mean using a biased estimator**
- Problem: Use \(\tilde{X} = \frac{1}{n+1} \sum_{i=1}^n X_i\)
- Expected value: $$E(\tilde{X}) = \frac{n}{n+1} \mu < \mu$$
- Conclusion: This estimator is biased (underestimates mean).
**Summary:** Unbiasedness means the estimator's expected value equals the true parameter. Sample mean and sample variance with denominator \(n-1\) are classic unbiased estimators, while some others can be biased.