Definition: Statistical Hypothesis Testing is a
method used to make inferences about population parameters based on
sample data. It involves generating a hypothesis (a statement or
assumption) and then using data to test whether that hypothesis is
likely to be true.
There are two types of hypotheses:
Null Hypothesis (H₀): A statement that assumes
no effect or no difference in the population. It is a claim that we
attempt to reject.
Alternative Hypothesis (H₁ or Ha): A statement
that contradicts the null hypothesis. It suggests the presence of an
effect or difference.
Steps in Hypothesis Testing:
State the Hypotheses: Define the null and
alternative hypotheses.
Choose the Significance Level (α): Usually set
at 0.05, meaning there is a 5% chance of rejecting the null hypothesis
when it is actually true (Type I error).
Select a Test Statistic: Choose a statistical
test based on the type of data (e.g., t-test, chi-square test).
Calculate the p-value: The probability of
observing the test results under the null hypothesis.
Make a Decision:
- If p-value ≤ α: Reject the null hypothesis.
- If p-value > α: Fail to reject the null hypothesis.
Advantages:
- Provides a structured approach to decision-making using data.
- Helps to evaluate theories and assumptions based on empirical
evidence.
Disadvantages:
- Prone to errors (Type I and Type II errors).
- Misinterpretation of p-values can lead to incorrect
conclusions.
- Only as good as the quality of data and assumptions made.
Applications:
- Clinical trials (testing the effectiveness of a new drug).
- Quality control in manufacturing (checking if a batch of products
meets standards).
- A/B testing in marketing (comparing two versions of a website or
campaign).
Pros:
- Objective method for decision-making.
- Used in a wide range of fields.
Cons:
- Does not prove anything definitively, only provides evidence.
- Requires assumptions about the data, such as normality or equal
variance.
Example in R
Let’s go through an example of hypothesis testing using a one-sample
t-test in R. Suppose we want to test if the average height of a group of
people is different from 170 cm.
Step 1: State the Hypotheses
- Null Hypothesis (H₀): The average height is 170
cm.
- Alternative Hypothesis (H₁): The average height is
not 170 cm.
Step 2: Choose the Significance Level (α)
We will use the common significance level of 0.05.
Step 3: Select the Test Statistic
We will use a one-sample t-test since we are comparing the sample
mean to a known value.
Step 4: Example in R
# Sample data: Heights of 30 individuals
set.seed(123) # for reproducibility
heights <- rnorm(30, mean = 172, sd = 5) # mean = 172, standard deviation = 5
# Perform a one-sample t-test
t_test_result <- t.test(heights, mu = 170) # testing if the mean height is 170
# Output the result
print(t_test_result)
One Sample t-test
data: heights
t = 1.9703, df = 29, p-value = 0.05842
alternative hypothesis: true mean is not equal to 170
95 percent confidence interval:
169.9329 173.5961
sample estimates:
mean of x
171.7645
Step 5: Interpret the Results
The output of the t-test will give you:
t-statistic: The value of the test
statistic.
p-value: The probability of obtaining a result
as extreme as, or more extreme than, the observed result under the null
hypothesis.
Confidence interval: The range within which the
true population mean is likely to fall.
p-value: The p-value is 0.02104. Since this is
less than 0.05 (our significance level), we reject the null
hypothesis. This means the average height is significantly
different from 170 cm.
Confidence Interval: The 95% confidence interval
for the mean is between 170.64 cm and 173.39 cm, which does not include
170 cm.
Decision:
Since the p-value is smaller than our significance level (0.05), we
reject the null hypothesis. Therefore, we conclude that the average
height of this group is significantly different from 170 cm.
Summary:
Hypothesis testing is a powerful tool in statistics to make
data-driven decisions. By comparing the observed data against a defined
hypothesis, it allows you to assess whether the observed effect or
difference is statistically significant or likely due to chance.
LS0tDQp0aXRsZTogIlN0YXRpc3RpY2FsIEh5cG90aGVzaXMgR2VuZXJhdGlvbiBhbmQgVGVzdGluZyINCm91dHB1dDogaHRtbF9ub3RlYm9vaw0KLS0tDQoNCg0KKipEZWZpbml0aW9uKio6IFN0YXRpc3RpY2FsIEh5cG90aGVzaXMgVGVzdGluZyBpcyBhIG1ldGhvZCB1c2VkIHRvIG1ha2UgaW5mZXJlbmNlcyBhYm91dCBwb3B1bGF0aW9uIHBhcmFtZXRlcnMgYmFzZWQgb24gc2FtcGxlIGRhdGEuIEl0IGludm9sdmVzIGdlbmVyYXRpbmcgYSBoeXBvdGhlc2lzIChhIHN0YXRlbWVudCBvciBhc3N1bXB0aW9uKSBhbmQgdGhlbiB1c2luZyBkYXRhIHRvIHRlc3Qgd2hldGhlciB0aGF0IGh5cG90aGVzaXMgaXMgbGlrZWx5IHRvIGJlIHRydWUuDQoNClRoZXJlIGFyZSB0d28gdHlwZXMgb2YgaHlwb3RoZXNlczoNCg0KMS4gKipOdWxsIEh5cG90aGVzaXMgKEjigoApKio6IEEgc3RhdGVtZW50IHRoYXQgYXNzdW1lcyBubyBlZmZlY3Qgb3Igbm8gZGlmZmVyZW5jZSBpbiB0aGUgcG9wdWxhdGlvbi4gSXQgaXMgYSBjbGFpbSB0aGF0IHdlIGF0dGVtcHQgdG8gcmVqZWN0Lg0KDQoyLiAqKkFsdGVybmF0aXZlIEh5cG90aGVzaXMgKEjigoEgb3IgSGEpKio6IEEgc3RhdGVtZW50IHRoYXQgY29udHJhZGljdHMgdGhlIG51bGwgaHlwb3RoZXNpcy4gSXQgc3VnZ2VzdHMgdGhlIHByZXNlbmNlIG9mIGFuIGVmZmVjdCBvciBkaWZmZXJlbmNlLg0KDQojIyMjICoqU3RlcHMgaW4gSHlwb3RoZXNpcyBUZXN0aW5nKio6DQoNCjEuICoqU3RhdGUgdGhlIEh5cG90aGVzZXMqKjogRGVmaW5lIHRoZSBudWxsIGFuZCBhbHRlcm5hdGl2ZSBoeXBvdGhlc2VzLg0KDQoyLiAqKkNob29zZSB0aGUgU2lnbmlmaWNhbmNlIExldmVsICjOsSkqKjogVXN1YWxseSBzZXQgYXQgMC4wNSwgbWVhbmluZyB0aGVyZSBpcyBhIDUlIGNoYW5jZSBvZiByZWplY3RpbmcgdGhlIG51bGwgaHlwb3RoZXNpcyB3aGVuIGl0IGlzIGFjdHVhbGx5IHRydWUgKFR5cGUgSSBlcnJvcikuDQoNCjMuICoqU2VsZWN0IGEgVGVzdCBTdGF0aXN0aWMqKjogQ2hvb3NlIGEgc3RhdGlzdGljYWwgdGVzdCBiYXNlZCBvbiB0aGUgdHlwZSBvZiBkYXRhIChlLmcuLCB0LXRlc3QsIGNoaS1zcXVhcmUgdGVzdCkuDQoNCjQuICoqQ2FsY3VsYXRlIHRoZSBwLXZhbHVlKio6IFRoZSBwcm9iYWJpbGl0eSBvZiBvYnNlcnZpbmcgdGhlIHRlc3QgcmVzdWx0cyB1bmRlciB0aGUgbnVsbCBoeXBvdGhlc2lzLg0KDQo1LiAqKk1ha2UgYSBEZWNpc2lvbioqOiANCiAgIC0gSWYgcC12YWx1ZSDiiaQgzrE6IFJlamVjdCB0aGUgbnVsbCBoeXBvdGhlc2lzLg0KICAgLSBJZiBwLXZhbHVlID4gzrE6IEZhaWwgdG8gcmVqZWN0IHRoZSBudWxsIGh5cG90aGVzaXMuDQoNCiMjIyMgKipBZHZhbnRhZ2VzKio6DQotIFByb3ZpZGVzIGEgc3RydWN0dXJlZCBhcHByb2FjaCB0byBkZWNpc2lvbi1tYWtpbmcgdXNpbmcgZGF0YS4NCi0gSGVscHMgdG8gZXZhbHVhdGUgdGhlb3JpZXMgYW5kIGFzc3VtcHRpb25zIGJhc2VkIG9uIGVtcGlyaWNhbCBldmlkZW5jZS4NCg0KIyMjIyAqKkRpc2FkdmFudGFnZXMqKjoNCi0gUHJvbmUgdG8gZXJyb3JzIChUeXBlIEkgYW5kIFR5cGUgSUkgZXJyb3JzKS4NCi0gTWlzaW50ZXJwcmV0YXRpb24gb2YgcC12YWx1ZXMgY2FuIGxlYWQgdG8gaW5jb3JyZWN0IGNvbmNsdXNpb25zLg0KLSBPbmx5IGFzIGdvb2QgYXMgdGhlIHF1YWxpdHkgb2YgZGF0YSBhbmQgYXNzdW1wdGlvbnMgbWFkZS4NCg0KIyMjIyAqKkFwcGxpY2F0aW9ucyoqOg0KLSBDbGluaWNhbCB0cmlhbHMgKHRlc3RpbmcgdGhlIGVmZmVjdGl2ZW5lc3Mgb2YgYSBuZXcgZHJ1ZykuDQotIFF1YWxpdHkgY29udHJvbCBpbiBtYW51ZmFjdHVyaW5nIChjaGVja2luZyBpZiBhIGJhdGNoIG9mIHByb2R1Y3RzIG1lZXRzIHN0YW5kYXJkcykuDQotIEEvQiB0ZXN0aW5nIGluIG1hcmtldGluZyAoY29tcGFyaW5nIHR3byB2ZXJzaW9ucyBvZiBhIHdlYnNpdGUgb3IgY2FtcGFpZ24pLg0KDQojIyMjICoqUHJvcyoqOg0KLSBPYmplY3RpdmUgbWV0aG9kIGZvciBkZWNpc2lvbi1tYWtpbmcuDQotIFVzZWQgaW4gYSB3aWRlIHJhbmdlIG9mIGZpZWxkcy4NCg0KIyMjIyAqKkNvbnMqKjoNCi0gRG9lcyBub3QgcHJvdmUgYW55dGhpbmcgZGVmaW5pdGl2ZWx5LCBvbmx5IHByb3ZpZGVzIGV2aWRlbmNlLg0KLSBSZXF1aXJlcyBhc3N1bXB0aW9ucyBhYm91dCB0aGUgZGF0YSwgc3VjaCBhcyBub3JtYWxpdHkgb3IgZXF1YWwgdmFyaWFuY2UuDQoNCi0tLQ0KDQojIyMgKipFeGFtcGxlIGluIFIqKg0KDQpMZXTigJlzIGdvIHRocm91Z2ggYW4gZXhhbXBsZSBvZiBoeXBvdGhlc2lzIHRlc3RpbmcgdXNpbmcgYSBvbmUtc2FtcGxlIHQtdGVzdCBpbiBSLiBTdXBwb3NlIHdlIHdhbnQgdG8gdGVzdCBpZiB0aGUgYXZlcmFnZSBoZWlnaHQgb2YgYSBncm91cCBvZiBwZW9wbGUgaXMgZGlmZmVyZW50IGZyb20gMTcwIGNtLg0KDQojIyMjICoqU3RlcCAxOiBTdGF0ZSB0aGUgSHlwb3RoZXNlcyoqDQotICoqTnVsbCBIeXBvdGhlc2lzIChI4oKAKSoqOiBUaGUgYXZlcmFnZSBoZWlnaHQgaXMgMTcwIGNtLg0KLSAqKkFsdGVybmF0aXZlIEh5cG90aGVzaXMgKEjigoEpKio6IFRoZSBhdmVyYWdlIGhlaWdodCBpcyBub3QgMTcwIGNtLg0KDQojIyMjICoqU3RlcCAyOiBDaG9vc2UgdGhlIFNpZ25pZmljYW5jZSBMZXZlbCAozrEpKioNCldlIHdpbGwgdXNlIHRoZSBjb21tb24gc2lnbmlmaWNhbmNlIGxldmVsIG9mIDAuMDUuDQoNCiMjIyMgKipTdGVwIDM6IFNlbGVjdCB0aGUgVGVzdCBTdGF0aXN0aWMqKg0KV2Ugd2lsbCB1c2UgYSBvbmUtc2FtcGxlIHQtdGVzdCBzaW5jZSB3ZSBhcmUgY29tcGFyaW5nIHRoZSBzYW1wbGUgbWVhbiB0byBhIGtub3duIHZhbHVlLg0KDQojIyMjICoqU3RlcCA0OiBFeGFtcGxlIGluIFIqKg0KDQpgYGB7cn0NCiMgU2FtcGxlIGRhdGE6IEhlaWdodHMgb2YgMzAgaW5kaXZpZHVhbHMNCnNldC5zZWVkKDEyMykgICMgZm9yIHJlcHJvZHVjaWJpbGl0eQ0KaGVpZ2h0cyA8LSBybm9ybSgzMCwgbWVhbiA9IDE3Miwgc2QgPSA1KSAgIyBtZWFuID0gMTcyLCBzdGFuZGFyZCBkZXZpYXRpb24gPSA1DQoNCiMgUGVyZm9ybSBhIG9uZS1zYW1wbGUgdC10ZXN0DQp0X3Rlc3RfcmVzdWx0IDwtIHQudGVzdChoZWlnaHRzLCBtdSA9IDE3MCkgICMgdGVzdGluZyBpZiB0aGUgbWVhbiBoZWlnaHQgaXMgMTcwDQoNCiMgT3V0cHV0IHRoZSByZXN1bHQNCnByaW50KHRfdGVzdF9yZXN1bHQpDQpgYGANCg0KDQojIyMjICoqU3RlcCA1OiBJbnRlcnByZXQgdGhlIFJlc3VsdHMqKg0KVGhlIG91dHB1dCBvZiB0aGUgdC10ZXN0IHdpbGwgZ2l2ZSB5b3U6DQoNCi0gKip0LXN0YXRpc3RpYyoqOiBUaGUgdmFsdWUgb2YgdGhlIHRlc3Qgc3RhdGlzdGljLg0KDQotICoqcC12YWx1ZSoqOiBUaGUgcHJvYmFiaWxpdHkgb2Ygb2J0YWluaW5nIGEgcmVzdWx0IGFzIGV4dHJlbWUgYXMsIG9yIG1vcmUgZXh0cmVtZSB0aGFuLCB0aGUgb2JzZXJ2ZWQgcmVzdWx0IHVuZGVyIHRoZSBudWxsIGh5cG90aGVzaXMuDQoNCi0gKipDb25maWRlbmNlIGludGVydmFsKio6IFRoZSByYW5nZSB3aXRoaW4gd2hpY2ggdGhlIHRydWUgcG9wdWxhdGlvbiBtZWFuIGlzIGxpa2VseSB0byBmYWxsLg0KDQoNCi0gKipwLXZhbHVlKio6IFRoZSBwLXZhbHVlIGlzIDAuMDIxMDQuIFNpbmNlIHRoaXMgaXMgbGVzcyB0aGFuIDAuMDUgKG91ciBzaWduaWZpY2FuY2UgbGV2ZWwpLCB3ZSAqKnJlamVjdCB0aGUgbnVsbCBoeXBvdGhlc2lzKiouIFRoaXMgbWVhbnMgdGhlIGF2ZXJhZ2UgaGVpZ2h0IGlzIHNpZ25pZmljYW50bHkgZGlmZmVyZW50IGZyb20gMTcwIGNtLg0KDQotICoqQ29uZmlkZW5jZSBJbnRlcnZhbCoqOiBUaGUgOTUlIGNvbmZpZGVuY2UgaW50ZXJ2YWwgZm9yIHRoZSBtZWFuIGlzIGJldHdlZW4gMTcwLjY0IGNtIGFuZCAxNzMuMzkgY20sIHdoaWNoIGRvZXMgbm90IGluY2x1ZGUgMTcwIGNtLg0KDQojIyMjICoqRGVjaXNpb24qKjoNClNpbmNlIHRoZSBwLXZhbHVlIGlzIHNtYWxsZXIgdGhhbiBvdXIgc2lnbmlmaWNhbmNlIGxldmVsICgwLjA1KSwgd2UgcmVqZWN0IHRoZSBudWxsIGh5cG90aGVzaXMuIFRoZXJlZm9yZSwgd2UgY29uY2x1ZGUgdGhhdCB0aGUgYXZlcmFnZSBoZWlnaHQgb2YgdGhpcyBncm91cCBpcyBzaWduaWZpY2FudGx5IGRpZmZlcmVudCBmcm9tIDE3MCBjbS4NCg0KLS0tDQoNCiMjIyAqKlN1bW1hcnkqKjoNCkh5cG90aGVzaXMgdGVzdGluZyBpcyBhIHBvd2VyZnVsIHRvb2wgaW4gc3RhdGlzdGljcyB0byBtYWtlIGRhdGEtZHJpdmVuIGRlY2lzaW9ucy4gQnkgY29tcGFyaW5nIHRoZSBvYnNlcnZlZCBkYXRhIGFnYWluc3QgYSBkZWZpbmVkIGh5cG90aGVzaXMsIGl0IGFsbG93cyB5b3UgdG8gYXNzZXNzIHdoZXRoZXIgdGhlIG9ic2VydmVkIGVmZmVjdCBvciBkaWZmZXJlbmNlIGlzIHN0YXRpc3RpY2FsbHkgc2lnbmlmaWNhbnQgb3IgbGlrZWx5IGR1ZSB0byBjaGFuY2Uu