
A statement that could be true, which might then be tested.

Example: Sam has a hypothesis that "large dogs are better at catching tennis balls than small dogs". We can test that hypothesis by having hundreds of different sized dogs try to catch tennis balls.

Sometimes the hypothesis won't be tested; it is simply a good explanation (which could be wrong). Conjecture is a better word for this.

Example: you notice the temperature drops just as the sun rises. Your hypothesis is that the sun warms the air high above you, which rises up and then cooler air comes from the sides.

Note: when someone says "I have a theory" they should say "I have a hypothesis", because in mathematics a theory is actually well proven.

hypothesis definition for math

A hypothesis is a proposition that is consistent with known data, but has been neither verified nor shown to be false.

In statistics, a hypothesis (sometimes called a statistical hypothesis) refers to a statement on which hypothesis testing will be based. Particularly important statistical hypotheses include the null hypothesis and alternative hypothesis.

In symbolic logic, a hypothesis is the first part of an implication (with the second part being known as the predicate).

In general mathematical usage, "hypothesis" is roughly synonymous with "conjecture."


Cite this as:

Weisstein, Eric W. "Hypothesis." From MathWorld --A Wolfram Web Resource. https://mathworld.wolfram.com/Hypothesis.html


Hypothesis Definition

In statistics, hypothesis testing is used to determine whether the variation between groups of data reflects a true difference rather than chance. Sample data are drawn from a population, and assumptions are made about the population parameters. Hypotheses can be classified into several types. This article covers the definition of a hypothesis, the various types of hypothesis, and the significance of hypothesis testing.

Hypothesis Definition in Statistics

In statistics, a hypothesis is a formal statement that explains the relationship between two or more variables of a specified population. It helps the researcher translate the given problem into a clear explanation of the expected outcome of the study. It states and predicts that outcome, indicates the type of experimental design, and directs the research process.

Types of Hypothesis

The hypothesis can be broadly classified into different types. They are:

Simple Hypothesis

A simple hypothesis is a hypothesis that there exists a relationship between two variables. One is called a dependent variable, and the other is called an independent variable.

Complex Hypothesis

A complex hypothesis states that there is a relationship among more than two variables. In this hypothesis, there may be more than one dependent variable, more than one independent variable, or both.

Null Hypothesis

The null hypothesis states that there is no significant difference between the populations specified in the experiment, and that any observed difference is due to experimental or sampling error. The null hypothesis is denoted by H0.

Alternative Hypothesis

The alternative hypothesis states that the observations are influenced by some non-random cause. It is denoted by Ha or H1.

Empirical Hypothesis

An empirical hypothesis is formed by the experiments and based on the evidence.

Statistical Hypothesis

In a statistical hypothesis, the statement may be logical or illogical, and the hypothesis is verified statistically.

Apart from these types, there are also directional and non-directional hypotheses, associative hypotheses, and causal hypotheses.

Characteristics of Hypothesis

The important characteristics of the hypothesis are:

  • The hypothesis should be short and precise
  • It should be specific
  • A hypothesis must be related to the existing body of knowledge
  • It should be capable of verification


Hypothesis Testing


A hypothesis test is a statistical inference method used to test the significance of a proposed (hypothesized) relation between population statistics (parameters) and their corresponding sample estimators. In other words, hypothesis tests are used to determine whether there is enough evidence in a sample to support a hypothesis about the entire population.

The test considers two hypotheses: the null hypothesis, which is a statement meant to be tested, usually something like "there is no effect" with the intention of proving this false, and the alternate hypothesis, which is the statement meant to stand after the test is performed. The two hypotheses must be mutually exclusive; moreover, in most applications, the two are complementary (one being the negation of the other). The test works by comparing the \(p\)-value to the level of significance (a chosen target). If the \(p\)-value is less than or equal to the level of significance, then the null hypothesis is rejected.

When analyzing data, only samples of a certain size might be manageable as efficient computations. In some situations the error terms follow a continuous or infinite distribution, hence the use of samples to suggest accuracy of the chosen test statistics. The method of hypothesis testing gives an advantage over guessing what distribution or which parameters the data follows.

Definitions and Methodology

Hypothesis test and confidence intervals.

In statistical inference, properties (parameters) of a population are analyzed by sampling data sets. Given assumptions on the distribution, i.e. a statistical model of the data, certain hypotheses can be deduced from the known behavior of the model. These hypotheses must be tested against sampled data from the population.

The null hypothesis (denoted \(H_0\)) is a statement that is assumed to be true. If the null hypothesis is rejected, then there is enough evidence (statistical significance) to accept the alternate hypothesis (denoted \(H_1\)). Before doing any test for significance, both hypotheses must be clearly stated and non-conflictive, i.e. mutually exclusive, statements.

Rejecting the null hypothesis, given that it is true, is called a type I error and it is denoted \(\alpha\), which is also its probability of occurrence. Failing to reject the null hypothesis, given that it is false, is called a type II error and it is denoted \(\beta\), which is also its probability of occurrence. Also, \(\alpha\) is known as the significance level, and \(1-\beta\) is known as the power of the test.

\[
\begin{array}{l|c|c}
 & H_0 \textbf{ is true} & H_0 \textbf{ is false} \\ \hline
\textbf{Reject } H_0 & \text{Type I error} & \text{Correct Decision} \\
\textbf{Reject } H_1 & \text{Correct Decision} & \text{Type II error}
\end{array}
\]

The test statistic is the standardized value following the sampled data under the assumption that the null hypothesis is true, and a chosen particular test. These tests depend on the statistic to be studied and the assumed distribution it follows, e.g. the population mean following a normal distribution.

The \(p\)-value is the probability of observing an extreme test statistic in the direction of the alternate hypothesis, given that the null hypothesis is true.

The critical value is the value of the assumed distribution of the test statistic such that the probability of making a type I error is small.

Methodologies: Given an estimator \(\hat \theta\) of a population statistic \(\theta\), following a probability distribution \(P(T)\), computed from a sample \(\mathcal{S},\) and given a significance level \(\alpha\) and test statistic \(t^*,\) define \(H_0\) and \(H_1;\) compute the test statistic \(t^*.\)

  • \(p\)-value Approach (most prevalent): Find the \(p\)-value using \(t^*\) (right-tailed). If the \(p\)-value is at most \(\alpha,\) reject \(H_0\). Otherwise, reject \(H_1\).
  • Critical Value Approach: Find the critical value solving the equation \(P(T\geq t_\alpha)=\alpha\) (right-tailed). If \(t^*>t_\alpha\), reject \(H_0\). Otherwise, reject \(H_1\).

Note: Failing to reject \(H_0\) only means inability to accept \(H_1\), and it does not mean to accept \(H_0\).

Assume a normally distributed population has recorded cholesterol levels with various statistics computed. From a sample of 100 subjects in the population, the sample mean was 214.12 mg/dL (milligrams per deciliter), with a sample standard deviation of 45.71 mg/dL. Perform a hypothesis test, with significance level 0.05, to test if there is enough evidence to conclude that the population mean is larger than 200 mg/dL.

Hypothesis Test

We will perform a hypothesis test using the \(p\)-value approach with significance level \(\alpha=0.05:\)

  • Define \(H_0\): \(\mu=200\).
  • Define \(H_1\): \(\mu>200\).
  • Since our values are normally distributed, the test statistic is \(z^*=\frac{\bar X - \mu_0}{\frac{s}{\sqrt{n}}}=\frac{214.12 - 200}{\frac{45.71}{\sqrt{100}}}\approx 3.09\).
  • Using a standard normal distribution, we find that our \(p\)-value is approximately \(0.001\).
  • Since the \(p\)-value is at most \(\alpha=0.05,\) we reject \(H_0\).

Therefore, we can conclude that the test shows sufficient evidence to support the claim that \(\mu\) is larger than \(200\) mg/dL.
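
The arithmetic in the 100-subject example can be checked with a few lines of Python. This is only a minimal sketch using the standard library; the function name `z_test_right_tailed` is an illustrative choice and does not come from the original text. It assumes a right-tailed test of \(H_0\colon \mu=\mu_0\) against \(H_1\colon \mu>\mu_0\) with the normal approximation used above.

```python
import math

def z_test_right_tailed(sample_mean, mu0, sample_sd, n):
    """Return (z statistic, right-tailed p-value) for H0: mu = mu0 vs H1: mu > mu0."""
    standard_error = sample_sd / math.sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # Upper-tail probability of the standard normal: P(Z >= z) = erfc(z / sqrt(2)) / 2
    p_value = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_value

# Numbers from the cholesterol example: n = 100, mean 214.12, s = 45.71, mu0 = 200
z_star, p_value = z_test_right_tailed(214.12, 200, 45.71, 100)
alpha = 0.05
reject_h0 = p_value <= alpha  # p-value approach: reject H0 when p <= alpha
print(f"z* = {z_star:.2f}, p-value = {p_value:.4f}, reject H0: {reject_h0}")
```

The decision rule in the last lines is the \(p\)-value approach: \(H_0\) is rejected exactly when the \(p\)-value is at most \(\alpha\).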

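If the sample size is smaller, the normal and \(t\)-distributions behave differently, so the \(t\)-distribution is used. Also, the next question asks whether the mean differs from a value in either direction, so it must be handled by a two-tailed test instead.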

Assume a population's cholesterol levels are recorded and various statistics are computed. From a sample of 25 subjects, the sample mean was 214.12 mg/dL (milligrams per deciliter), with a sample standard deviation of 45.71 mg/dL. Perform a hypothesis test, with significance level 0.05, to test if there is enough evidence to conclude that the population mean is not equal to 200 mg/dL.

Hypothesis Test

We will perform a hypothesis test using the \(p\)-value approach with significance level \(\alpha=0.05\) and the \(t\)-distribution with 24 degrees of freedom:

  • Define \(H_0\): \(\mu=200\).
  • Define \(H_1\): \(\mu\neq 200\).
  • Using the \(t\)-distribution, the test statistic is \(t^*=\frac{\bar X - \mu_0}{\frac{s}{\sqrt{n}}}=\frac{214.12 - 200}{\frac{45.71}{\sqrt{25}}}\approx 1.54\).
  • Using a \(t\)-distribution with 24 degrees of freedom, we find that our \(p\)-value is approximately \(2(0.068)=0.136\). We have multiplied by two since this is a two-tailed argument, i.e. the mean can be smaller than or larger than.
  • Since the \(p\)-value is larger than \(\alpha=0.05,\) we fail to reject \(H_0\).

Therefore, the test does not show sufficient evidence to support the claim that \(\mu\) is not equal to \(200\) mg/dL.
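
The two-tailed \(t\)-test above can also be reproduced numerically. The sketch below avoids external libraries by integrating the \(t\) density with Simpson's rule. That is an illustrative shortcut and not the method of the original text, and the helper names `t_pdf` and `t_upper_tail` are assumptions made for this example. In practice a statistics library would normally supply the \(t\) distribution function.

```python
import math

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """P(T >= t) for t >= 0, via Simpson's rule on [0, t] (steps must be even)."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    # By symmetry, half of the probability lies above 0
    return 0.5 - total * h / 3

# Numbers from the 25-subject example: mean 214.12, s = 45.71, mu0 = 200
n, df = 25, 24
t_star = (214.12 - 200) / (45.71 / math.sqrt(n))
p_value = 2 * t_upper_tail(abs(t_star), df)  # doubled for the two-tailed test
alpha = 0.05
reject_h0 = p_value <= alpha
print(f"t* = {t_star:.2f}, p-value = {p_value:.3f}, reject H0: {reject_h0}")
```

Because the computed \(p\)-value exceeds \(\alpha=0.05\), the sketch reaches the same decision as the worked example: fail to reject \(H_0\).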

The complement of the rejection on a two-tailed hypothesis test (with significance level \(\alpha\)) for a population parameter \(\theta\) is equivalent to finding a confidence interval \((\)with confidence level \(1-\alpha)\) for the population parameter \(\theta\). If the assumption on the parameter \(\theta\) falls inside the confidence interval, then the test has failed to reject the null hypothesis \((\)with \(p\)-value greater than \(\alpha).\) Otherwise, if \(\theta\) does not fall in the confidence interval, then the null hypothesis is rejected in favor of the alternate \((\)with \(p\)-value at most \(\alpha).\)
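
As a rough illustration of this equivalence, the sketch below builds the 95% confidence interval for the 25-subject cholesterol example and checks whether the hypothesized mean of 200 mg/dL falls inside it. The critical value 2.064 is an assumption taken from a standard \(t\)-table (24 degrees of freedom, two-sided 5%), and the variable names are illustrative.

```python
import math

# Summary statistics from the 25-subject cholesterol example
n, sample_mean, sample_sd, mu0 = 25, 214.12, 45.71, 200.0

# Assumption: two-sided 5% critical value for 24 degrees of freedom,
# read from a standard t-table and rounded to three decimals
t_crit = 2.064

margin = t_crit * sample_sd / math.sqrt(n)
ci_low, ci_high = sample_mean - margin, sample_mean + margin

# mu0 inside the interval <=> the two-tailed test fails to reject H0
fail_to_reject_h0 = ci_low <= mu0 <= ci_high
print(f"95% CI: ({ci_low:.2f}, {ci_high:.2f}), fail to reject H0: {fail_to_reject_h0}")
```

Since 200 lies inside the interval, the conclusion agrees with the two-tailed test, which failed to reject \(H_0\).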


9.1 Null and Alternative Hypotheses

The actual test begins by considering two hypotheses. They are called the null hypothesis and the alternative hypothesis. These hypotheses contain opposing viewpoints.

H0, the null hypothesis: a statement of no difference between sample means or proportions or no difference between a sample mean or proportion and a population mean or proportion. In other words, the difference equals 0.

Ha, the alternative hypothesis: a claim about the population that is contradictory to H0 and what we conclude when we reject H0.

Since the null and alternative hypotheses are contradictory, you must examine evidence to decide if you have enough evidence to reject the null hypothesis or not. The evidence is in the form of sample data.

After you have determined which hypothesis the sample supports, you make a decision. There are two options for a decision: reject H0 if the sample information favors the alternative hypothesis, or do not reject H0 (decline to reject H0) if the sample information is insufficient to reject the null hypothesis.

Mathematical Symbols Used in H0 and Ha:

H0 always has a symbol with an equal in it. Ha never has a symbol with an equal in it. The choice of symbol depends on the wording of the hypothesis test. However, be aware that many researchers use = in the null hypothesis, even with > or < as the symbol in the alternative hypothesis. This practice is acceptable because we only make the decision to reject or not reject the null hypothesis.

Example 9.1

H0: No more than 30 percent of the registered voters in Santa Clara County voted in the primary election. p ≤ 0.30

Ha: More than 30 percent of the registered voters in Santa Clara County voted in the primary election. p > 0.30

A medical trial is conducted to test whether or not a new medicine reduces cholesterol by 25 percent. State the null and alternative hypotheses.

Example 9.2

We want to test whether the mean GPA of students in American colleges is different from 2.0 (out of 4.0). The null and alternative hypotheses are the following:

H0: μ = 2.0
Ha: μ ≠ 2.0

We want to test whether the mean height of eighth graders is 66 inches. State the null and alternative hypotheses. Fill in the correct symbol (=, ≠, ≥, <, ≤, >) for the null and alternative hypotheses.

  • H0: μ __ 66
  • Ha: μ __ 66

Example 9.3

We want to test if college students take fewer than five years to graduate from college, on the average. The null and alternative hypotheses are the following:

H0: μ ≥ 5
Ha: μ < 5

We want to test if it takes fewer than 45 minutes to teach a lesson plan. State the null and alternative hypotheses. Fill in the correct symbol (=, ≠, ≥, <, ≤, >) for the null and alternative hypotheses.

  • H0: μ __ 45
  • Ha: μ __ 45

Example 9.4

An article on school standards stated that about half of all students in France, Germany, and Israel take advanced placement exams and a third of the students pass. The same article stated that 6.6 percent of U.S. students take advanced placement exams and 4.4 percent pass. Test if the percentage of U.S. students who take advanced placement exams is more than 6.6 percent. State the null and alternative hypotheses.

H0: p ≤ 0.066
Ha: p > 0.066

On a state driver’s test, about 40 percent pass the test on the first try. We want to test if more than 40 percent pass on the first try. Fill in the correct symbol (=, ≠, ≥, <, ≤, >) for the null and alternative hypotheses.

  • H0: p __ 0.40
  • Ha: p __ 0.40

Collaborative Exercise

Bring to class a newspaper, some news magazines, and some internet articles. In groups, find articles from which your group can write null and alternative hypotheses. Discuss your hypotheses with the rest of the class.


This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute Texas Education Agency (TEA). The original material is available at: https://www.texasgateway.org/book/tea-statistics . Changes were made to the original material, including updates to art, structure, and other content updates.

Access for free at https://openstax.org/books/statistics/pages/1-introduction
  • Authors: Barbara Illowsky, Susan Dean
  • Publisher/website: OpenStax
  • Book title: Statistics
  • Publication date: Mar 27, 2020
  • Location: Houston, Texas
  • Book URL: https://openstax.org/books/statistics/pages/1-introduction
  • Section URL: https://openstax.org/books/statistics/pages/9-1-null-and-alternative-hypotheses

© Jan 23, 2024 Texas Education Agency (TEA). The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.



Explanation of Hypothesis


A hypothesis is a claim or statement that makes sense in the context of some information or data at hand but hasn’t been established as true or false through experimentation or proof.

In mathematics, any statement or equation that describes some relationship between certain variables can be termed a hypothesis if it is consistent with some initial supporting data or information but is yet to be proven true or false by some definite and trustworthy experiment or mathematical law.

The following example illustrates one such hypothesis to shed some light on this fundamental concept, which is often used in different areas of mathematics.

Figure 1: Example of Hypothesis

Here we have considered an example of a young startup company that manufactures state-of-the-art batteries. The hypothesis, or claim, of the company is that their batteries have a mean life of more than 1000 hours. It is easy to see that they can support this claim with a testing experiment in their lab.

However, the statement can only be proven if at least one batch of their production batteries has actually been deployed in the real world for more than 1000 hours. After 1000 hours, data needs to be collected and used to assess the probability of this statement being true.

The following paragraphs further explain this concept.

As explained with the help of an example earlier, a hypothesis in mathematics is an untested claim that is backed up by the known data, by other discoveries, or by some weak experiments.

In any mathematical discovery, we first start by assuming something or some relationship. This assumed statement is called a supposition. A supposition, however, becomes a hypothesis when it is supported by the available data and by a large number of corroborating findings.

The hypothesis is an important part of the scientific method, which is widely used today for making new discoveries. The field of mathematics inherited this process. The following figure shows this cycle as a graphic:

Figure 2: Role of Hypothesis in the Scientific Method 

The above figure shows a simplified version of the scientific method. It shows that whenever a supposition is supported by some data, it is termed a hypothesis. Once a hypothesis is proven by some well-known and widely accepted experiment or proof, it becomes a law. If the hypothesis is rejected by some contradictory results, then the supposition is changed and the cycle continues.

Let's try to understand the scientific method and the hypothesis concept with the help of an example. Suppose a teacher wants to analyze the relationship between the students' performance in a certain subject, call it A, and whether or not they studied a minor course, call it B.

The teacher puts forth a supposition that students taking course B prior to course A should perform better in the latter because of the obvious linkages in the key concepts. Because of this linkage, the supposition can be termed a hypothesis.

However, to test the hypothesis, the teacher has to collect data from all of the students to find out which students have taken course B and which haven’t. Then, at the end of the semester, the performance of the students must be measured and compared with their course B enrollments.

If the students who took course B prior to course A perform better, then the hypothesis is supported. Otherwise, the supposition may need revision.

The following figure explains this problem graphically.

Figure 3: Teacher and Course Example of Hypothesis

Important Terms Related to Hypothesis

To further elaborate the concept of hypothesis, we first need to understand a few key terms that are widely used in this area such as conjecture, contradiction and some special types of hypothesis (simple, complex, null, alternative, empirical, statistical). These terms are briefly explained below:

A conjecture is a term used to describe a mathematical assertion that has not been proved. While testing may occasionally turn up millions of examples in favour of a conjecture, most experts in the area will typically only accept a proof. In mathematics, this term is synonymous with the term hypothesis.

In mathematics, a contradiction occurs if the results of an experiment or proof go against some hypothesis. In other words, a contradiction discredits a hypothesis.

A simple hypothesis claims that there is a correlation between two variables. The first is known as the dependent variable while the second is known as the independent variable.

A complex hypothesis claims that there is a correlation between more than two variables. There may be more than one dependent variable and more than one independent variable in this hypothesis.

A null hypothesis, usually denoted by H0, claims that there is no statistically significant relationship between two sets of observed data. In short, the variables are independent.

An alternative hypothesis, usually denoted by H1 or Ha, claims that the variables may be statistically influenced by some unknown factors or variables. In short, the variables are dependent on some unknown phenomena.

An empirical hypothesis is built on top of some empirical data, experiment, or formulation.

A statistical hypothesis is built on top of some statistical data, experiment, or formulation. It may be logical or illogical in nature.

According to the Riemann hypothesis, the Riemann zeta function has its zeros only at the negative even integers and at complex numbers with real part 1/2. It is regarded by many as the most significant open problem in pure mathematics.

Figure 4: Riemann Hypothesis

The Riemann hypothesis is the most well-known mathematical conjecture, and it has been the subject of innumerable proof efforts.

Numerical Examples

Identify the conclusions and hypothesis in the following given statements. Also state if the conclusion supports the hypothesis or not.

Part (a): If 30x = 30, then x = 1

Part (b): if 10x + 2 = 50, then x = 24

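Solution Part (a)

Hypothesis: 30x = 30

Conclusion: x = 1

Supports Hypothesis: Yes

Solution Part (b)

Hypothesis: 10x + 2 = 50

Conclusion: x = 24

Supports Hypothesis: No, since 10x + 2 = 50 gives x = 4.8, not 24.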
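
One way to check answers like these is to solve the hypothesis equation exactly and compare the result with the stated conclusion. The following is a small sketch for equations of the form ax + b = c; the helper names `solve_linear` and `conclusion_follows` are illustrative assumptions and are not part of the original exercise.

```python
from fractions import Fraction

def solve_linear(a, b, c):
    """Solve a*x + b = c exactly for x (assumes a != 0)."""
    return Fraction(c - b, a)

def conclusion_follows(a, b, c, claimed_x):
    """True if the claimed value of x is the solution of a*x + b = c."""
    return solve_linear(a, b, c) == claimed_x

# Part (a): if 30x = 30, then x = 1
part_a = conclusion_follows(30, 0, 30, 1)
# Part (b): if 10x + 2 = 50, then x = 24
part_b = conclusion_follows(10, 2, 50, 24)

print("Part (a) conclusion follows:", part_a)
print("Part (b) conclusion follows:", part_b, "| actual solution:", solve_linear(10, 2, 50))
```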

All images/mathematical drawings were created with GeoGebra.


Published 2018 Revised 2019

What Is a Hypothesis Test?

The null hypothesis significance testing (NHST) framework

Our simple scenario

  • Our null hypothesis is $H_0\colon \pi=\frac{1}{2}$.  This says that the proportion is what we believe it should be.
  • Our alternative hypothesis is $H_1\colon \pi\ne\frac{1}{2}$.  This says that the proportion has changed.

Testing our hypotheses

  • We can work out the critical region for $X$, that is, those extreme values of $X$ which would lead us to reject the null hypothesis at 5% significance.  (This can be done even before performing the experiment.)  The probability of $X$ taking a value in this critical region, assuming that the null hypothesis is true, should be 5%, or as close as we can get to 5% without going over it.  In symbols, we can say: $$\mathrm{P}(\text{$X$ in critical region} | \text{$H_0$ is true}) \le 0.05.$$ Then we reject the null hypothesis if $X$ lies in that region.
  • We can work out the probability of $X$ taking the value it did or a more extreme value, assuming that the null hypothesis is true.  This is known as the p-value.  If the p-value is less than 0.05, then we will reject the null hypothesis at 5% significance. [note 1]  In symbols, we can write $$\text{p-value} = \mathrm{P}(\text{$X$ taking this or a more extreme value} | \text{$H_0$ is true}).$$

Other types of scenario

  • Does this drug/treatment/intervention/... have any effect?
  • Which of these drugs/... is more effective, or are they equally effective?
  • Is the mean height/mass/intelligence/test score/... of this population equal to some predicted value?
  • Is the standard deviation of the height/mass/... equal to some predicted value?
  • For two distinct groups of people, is their mean height/mass/... of each group the same?
  • Does this group of people's heights/masses/... appear to be following the probability distribution we expect?
  • Do these two populations' heights/masses/... appear to have the same distribution as each other?
  • Do this population's heights and weights appear to be correlated?

Interpreting the results

The key question that hypothesis testing (NHST) answers

What a hypothesis test does not tell us

A non-significant result

  • It could be that the null hypothesis is true.  In this case, we would have to be unlucky to get a significant p-value, so most of the time, we will end up accepting the null hypothesis.  (If the null hypothesis is true, we would reject it with a probability of only 0.05.)  
  • On the other hand, it could be that the alternative hypothesis is true, but we did not use a large enough sample to obtain a significant result (or we were just unlucky).  In such a case, we could say that our test was insensitive.  In this situation (the alternative hypothesis is true but we do not reject the null hypothesis), we say that we have made a Type II error.  The probability of this happening depends on the sample size and on how different the true $\pi$ is from $\frac{1}{2}$ (or whatever our null hypothesis says), as is explored in Powerful Hypothesis Testing.

A significant result

  • It could be that the null hypothesis is true.  In this case, we reject the null hypothesis with a probability of $0.05=\frac{1}{20}$, that is, one time in 20 (at a significance level of 5%), so we were just unlucky.  
  • On the other hand, the alternative hypothesis could indeed be true.  Either the sample was large enough to obtain a significant result, or the sample size wasn't that large, but we were just lucky.

Using this tree diagram, we can work out the probabilities of $H_0$ being true or $H_1$ being true given our experimental results.  To avoid the expressions becoming unwieldy, we will write $H_0$ for "$\text{$H_0$ true}$", $H_1$ for "$\text{$H_1$ true}$" and "$\text{p}^+$" for "observed p-value or more extreme".  Then we can write (conditional) probabilities on the branches of the tree diagram leading to our observed p-value: [ note 2 ]

The two routes which give our observed p-value (or more extreme) have the following probabilities: $$\begin{align*} \mathrm{P}(H_0\cap \text{p}^+) &= \mathrm{P}(H_0) \times \mathrm{P}(\text{p}^+ | H_0) \\ \mathrm{P}(H_1\cap \text{p}^+) &= \mathrm{P}(H_1) \times \mathrm{P}(\text{p}^+ | H_1) \end{align*}$$ (Recall that $\mathrm{P}(H_0\cap \text{p}^+)$ means "the probability of $H_0$ being true and the p-value being that observed or more extreme".) We can therefore work out the probability of the alternative hypothesis being true given the observed p-value, using conditional probability: $$\begin{align*} \mathrm{P}(H_1|\text{p}^+) &= \frac{\mathrm{P}(H_1\cap \text{p}^+)}{\mathrm{P}(\text{p}^+)} \\ &= \frac{\mathrm{P}(H_1\cap \text{p}^+)}{\mathrm{P}(H_0\cap\text{p}^+)+\mathrm{P}(H_1\cap\text{p}^+)} \\ &= \frac{\mathrm{P}(H_1) \times \mathrm{P}(\text{p}^+ | H_1)}{\mathrm{P}(H_0) \times \mathrm{P}(\text{p}^+ | H_0) + \mathrm{P}(H_1) \times \mathrm{P}(\text{p}^+ | H_1)} \end{align*}$$ Though this is a mouthful, it is a calculation which only involves the four probabilities on the above tree diagram.  (This is an example of Bayes' Theorem , discussed further in this resource .) However, we immediately hit a big difficulty if we try to calculate this for a given experiment.  We know $\mathrm{P}(\text{p}^+ | H_0)$: this is just the p-value itself.  (The p-value tells us the probability of obtaining a result at least this extreme given that the null hypothesis is true.)  But we don't know the probability of the null hypothesis being true or false (that is, $\mathrm{P}(H_0)$ and $\mathrm{P}(H_1)=1-\mathrm{P}(H_0)$), nor do we know the probability of the observed result if the alternative hypothesis is true ($P(\text{p}^+|H_1)$), as knowing that the proportion of greens is not $\frac{1}{2}$ does not tell us what it actually is.  (Similar issues apply to all the other contexts of hypothesis testing listed above.)  
So we are quite stuck: in the null hypothesis significance testing model, it is impossible to give a numerical answer to our key question: "Given our results, what is the probability that the alternative hypothesis is true?"  This is because we don't know two of the three probabilities that we need in order to answer the question. An example might highlight the issue a little better.  Let us suppose that we are trying to work out whether a coin is biased (alternative hypothesis), or whether the probability of heads is exactly $\frac{1}{2}$ (null hypothesis).  We toss the coin 50 times and obtain a p-value of 0.02.  Do we now believe that the coin is biased?  Most people believe that coins are not biased, and so are much more likely to attribute this result to chance or poor coin-tossing technique than to the coin being biased. On the other hand, consider a case of a road planner who introduces a traffic-calming feature to reduce the number of fatalities along a certain stretch of road.  The null hypothesis is that there is no change in fatality rate, while the alternative hypothesis is that the fatality rate has decreased.  A hypothesis test is performed on data collected for 24 months before and 24 months after the feature is built.  Again, the p-value was 0.02.  Do we believe that the alternative hypothesis is true?  In this case, we are more likely to believe that the alternative hypothesis is true, because it makes a lot of sense that this feature will reduce the number of fatalities. Our "instinctive" responses to these results are tied up with assigning values to the unknown probabilities in the formula above.  For the coin, we would probably take $\mathrm{P}(H_0)$ to be close to 1, say $0.99$, as we think it is very unlikely that the coin is biased, and $\mathrm{P}(\text{p}^+|H_1)$ will be, say, $0.1$: if the coin is biased, the bias is not likely to be very large, and so it is only a bit more likely that the result will be significant in this case.  
Putting these figures into the formula above gives:

$$\mathrm{P}(H_1|\text{p}^+) = \frac{0.01 \times 0.1}{0.99 \times 0.02 + 0.01 \times 0.1} \approx 0.05,$$

that is, we are still very doubtful that this coin is biased, even after performing the experiment. Note that in this case, the probability of these results given that the null hypothesis is true is 0.02, whereas the probability that the null hypothesis is true given these results is $1-0.05=0.95$, which is very different. This shows how dramatically different the answers to the two questions can be.

On the other hand, for the fatalities situation, we might assume quite the opposite: we are pretty confident that the traffic-calming feature will help, so we might take $\mathrm{P}(H_0)$ to be $0.4$, and $\mathrm{P}(\text{p}^+|H_1)$ will be, say, $0.25$ (though the traffic-calming may help, the impact may be relatively small). Putting these figures into the formula gives:

$$\mathrm{P}(H_1|\text{p}^+) = \frac{0.6 \times 0.25}{0.4 \times 0.02 + 0.6 \times 0.25} \approx 0.95,$$

so we are now much more convinced that the traffic-calming feature is helping than we were before we had the data. This time, the probability of these results given that the null hypothesis is true is still 0.02, whereas the probability that the null hypothesis is true given these results is $1-0.95=0.05$, which is not that different.

This approach may seem very disturbing, as we have to make assumptions about what we believe before we do the hypothesis test. But as we have seen, we cannot answer our key question without making such assumptions.
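The two calculations above can be reproduced with a short Python sketch. The function name `posterior_h1` and its parameter names are illustrative choices rather than anything from the original, and the inputs are the same assumed figures used in the text for $\mathrm{P}(H_0)$, the p-value, and $\mathrm{P}(\text{p}^+|H_1)$.

```python
def posterior_h1(prior_h0: float, p_given_h0: float, p_given_h1: float) -> float:
    """Return P(H1 | p+) from the tree-diagram probabilities (Bayes' Theorem)."""
    prior_h1 = 1.0 - prior_h0
    route_h0 = prior_h0 * p_given_h0   # P(H0 and p+)
    route_h1 = prior_h1 * p_given_h1   # P(H1 and p+)
    return route_h1 / (route_h0 + route_h1)


# Assumed inputs taken from the coin and traffic-calming examples in the text.
coin = posterior_h1(prior_h0=0.99, p_given_h0=0.02, p_given_h1=0.1)
road = posterior_h1(prior_h0=0.4, p_given_h0=0.02, p_given_h1=0.25)
print(f"coin: {coin:.3f}, road: {road:.3f}")
```

The same p-value of 0.02 gives a posterior of roughly 0.05 for the coin and roughly 0.95 for the road, purely because the assumed priors differ.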

Other approaches and some warnings

  • Because our test is two-tailed (in the alternative hypothesis, the true proportion could be less than $\frac{1}{2}$ or more than $\frac{1}{2}$), we must be careful when calculating the p-value: we calculate the probability of the observed outcome or more extreme occurring, and then double the answer to account for the other tail.  We could also compare the probability of the value or more extreme to 0.025 instead of 0.05, but that would not be called a p-value. Likewise, when we determine the critical region, we will have two parts: a tail with large values of $X$ and a tail with small values of $X$; we require that the probability of $X$ lying in the large-value tail is as close as possible to 0.025 without going over it, and the same for the probability of $X$ lying in the small-value tail.  
  • There are complications here when working with two-tail tests as opposed to one-tail tests.  We will ignore this problem, as it does not significantly affect the overall discussion.  
  • "Likelihood" is a technical term.  For a discrete test statistic $X$, the likelihood of $H_0$ given the data $X=x$ means $P(X=x|H_0)$, in other words, how likely would this data be if $H_0$ were true.  It is not the probability of $H_0$ being true given the data.
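As a rough illustration of the first note, the doubling rule for a two-tailed test can be sketched in Python for a binomial test statistic. The helper names are hypothetical, and capping the doubled value at 1 is an added assumption so that the result remains a probability.

```python
from math import comb


def binomial_tail_le(x: int, n: int, p: float) -> float:
    """P(X <= x) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(0, x + 1))


def two_tailed_p_value(x: int, n: int, p0: float = 0.5) -> float:
    """Double the smaller tail probability, capped at 1 (an assumed convention)."""
    lower = binomial_tail_le(x, n, p0)            # P(X <= x)
    upper = 1.0 - binomial_tail_le(x - 1, n, p0)  # P(X >= x)
    return min(1.0, 2.0 * min(lower, upper))


# Example: 33 successes in 50 trials under the null proportion 1/2.
print(two_tailed_p_value(33, 50))
```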



Statistics LibreTexts

9.1: Introduction to Hypothesis Testing


  • Kyle Siegrist
  • University of Alabama in Huntsville via Random Services


Basic Theory


As usual, our starting point is a random experiment with an underlying sample space and a probability measure \(\P\). In the basic statistical model, we have an observable random variable \(\bs{X}\) taking values in a set \(S\). In general, \(\bs{X}\) can have quite a complicated structure. For example, if the experiment is to sample \(n\) objects from a population and record various measurements of interest, then \[ \bs{X} = (X_1, X_2, \ldots, X_n) \] where \(X_i\) is the vector of measurements for the \(i\)th object. The most important special case occurs when \((X_1, X_2, \ldots, X_n)\) are independent and identically distributed. In this case, we have a random sample of size \(n\) from the common distribution.

The purpose of this section is to define and discuss the basic concepts of statistical hypothesis testing. Collectively, these concepts are sometimes referred to as the Neyman-Pearson framework, in honor of Jerzy Neyman and Egon Pearson, who first formalized them.

A statistical hypothesis is a statement about the distribution of \(\bs{X}\). Equivalently, a statistical hypothesis specifies a set of possible distributions of \(\bs{X}\): the set of distributions for which the statement is true. A hypothesis that specifies a single distribution for \(\bs{X}\) is called simple; a hypothesis that specifies more than one distribution for \(\bs{X}\) is called composite.

In hypothesis testing, the goal is to see if there is sufficient statistical evidence to reject a presumed null hypothesis in favor of a conjectured alternative hypothesis. The null hypothesis is usually denoted \(H_0\) while the alternative hypothesis is usually denoted \(H_1\).

An hypothesis test is a statistical decision; the conclusion will either be to reject the null hypothesis in favor of the alternative, or to fail to reject the null hypothesis. The decision that we make must, of course, be based on the observed value \(\bs{x}\) of the data vector \(\bs{X}\). Thus, we will find an appropriate subset \(R\) of the sample space \(S\) and reject \(H_0\) if and only if \(\bs{x} \in R\). The set \(R\) is known as the rejection region or the critical region. Note the asymmetry between the null and alternative hypotheses. This asymmetry is due to the fact that we assume the null hypothesis, in a sense, and then see if there is sufficient evidence in \(\bs{x}\) to overturn this assumption in favor of the alternative.

An hypothesis test is a statistical analogy to proof by contradiction, in a sense. Suppose for a moment that \(H_1\) is a statement in a mathematical theory and that \(H_0\) is its negation. One way that we can prove \(H_1\) is to assume \(H_0\) and work our way logically to a contradiction. In an hypothesis test, we don't prove anything of course, but there are similarities. We assume \(H_0\) and then see if the data \(\bs{x}\) are sufficiently at odds with that assumption that we feel justified in rejecting \(H_0\) in favor of \(H_1\).

Often, the critical region is defined in terms of a statistic \(w(\bs{X})\), known as a test statistic, where \(w\) is a function from \(S\) into another set \(T\). We find an appropriate rejection region \(R_T \subseteq T\) and reject \(H_0\) when the observed value \(w(\bs{x}) \in R_T\). Thus, the rejection region in \(S\) is \(R = w^{-1}(R_T) = \left\{\bs{x} \in S: w(\bs{x}) \in R_T\right\}\). As usual, the use of a statistic often allows significant data reduction when the dimension of the test statistic is much smaller than the dimension of the data vector.

The ultimate decision may be correct or may be in error. There are two types of errors, depending on which of the hypotheses is actually true.

Types of errors:

  • A type 1 error is rejecting the null hypothesis \(H_0\) when \(H_0\) is true.
  • A type 2 error is failing to reject the null hypothesis \(H_0\) when the alternative hypothesis \(H_1\) is true.

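Similarly, there are two ways to make a correct decision: we could reject \(H_0\) when \(H_1\) is true or we could fail to reject \(H_0\) when \(H_0\) is true. The possibilities are summarized in the following table:

\[ \begin{array}{l|l|l} \text{State} & \text{Fail to reject } H_0 & \text{Reject } H_0 \\ \hline H_0 \text{ true} & \text{Correct decision} & \text{Type 1 error} \\ H_1 \text{ true} & \text{Type 2 error} & \text{Correct decision} \end{array} \]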

Of course, when we observe \(\bs{X} = \bs{x}\) and make our decision, either we will have made the correct decision or we will have committed an error, and usually we will never know which of these events has occurred. Prior to gathering the data, however, we can consider the probabilities of the various errors.

If \(H_0\) is true (that is, the distribution of \(\bs{X}\) is specified by \(H_0\)), then \(\P(\bs{X} \in R)\) is the probability of a type 1 error for this distribution. If \(H_0\) is composite, then \(H_0\) specifies a variety of different distributions for \(\bs{X}\) and thus there is a set of type 1 error probabilities.

The maximum probability of a type 1 error, over the set of distributions specified by \( H_0 \), is the significance level of the test or the size of the critical region.

The significance level is often denoted by \(\alpha\). Usually, the rejection region is constructed so that the significance level is a prescribed, small value (typically 0.1, 0.05, 0.01).

If \(H_1\) is true (that is, the distribution of \(\bs{X}\) is specified by \(H_1\)), then \(\P(\bs{X} \notin R)\) is the probability of a type 2 error for this distribution. Again, if \(H_1\) is composite then \(H_1\) specifies a variety of different distributions for \(\bs{X}\), and thus there will be a set of type 2 error probabilities. Generally, there is a tradeoff between the type 1 and type 2 error probabilities. If we reduce the probability of a type 1 error, by making the rejection region \(R\) smaller, we necessarily increase the probability of a type 2 error because the complementary region \(S \setminus R\) is larger.

The extreme cases can give us some insight. First consider the decision rule in which we never reject \(H_0\), regardless of the evidence \(\bs{x}\). This corresponds to the rejection region \(R = \emptyset\). A type 1 error is impossible, so the significance level is 0. On the other hand, the probability of a type 2 error is 1 for any distribution defined by \(H_1\). At the other extreme, consider the decision rule in which we always reject \(H_0\) regardless of the evidence \(\bs{x}\). This corresponds to the rejection region \(R = S\). A type 2 error is impossible, but now the probability of a type 1 error is 1 for any distribution defined by \(H_0\). In between these two worthless tests are meaningful tests that take the evidence \(\bs{x}\) into account.

If \(H_1\) is true, so that the distribution of \(\bs{X}\) is specified by \(H_1\), then \(\P(\bs{X} \in R)\), the probability of rejecting \(H_0\), is the power of the test for that distribution.

Thus the power of the test for a distribution specified by \( H_1 \) is the probability of making the correct decision.
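To make the error probabilities and the power concrete, here is a minimal Python sketch for a coin-tossing experiment. Everything specific in it is an assumption for illustration and not part of the original text: \(X\) is the number of heads in 20 tosses, \(H_0\) says the probability of heads is 0.5, the rejection region is a hand-picked two-sided set, and 0.8 is just one distribution allowed by \(H_1\).

```python
from math import comb


def prob_in_region(region, n: int, p: float) -> float:
    """P(X in region) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in region)


n = 20
# Hypothetical two-sided rejection region: very few or very many heads.
region = [k for k in range(n + 1) if k <= 5 or k >= 15]

alpha = prob_in_region(region, n, 0.5)   # type 1 error probability under H0: p = 0.5
power = prob_in_region(region, n, 0.8)   # power for one distribution in H1: p = 0.8
type2 = 1.0 - power                      # type 2 error probability for p = 0.8
print(f"alpha={alpha:.4f}, power={power:.4f}, type 2={type2:.4f}")
```

Passing an empty region or the whole range `range(n + 1)` reproduces the two extreme tests described above, with rejection probabilities 0 and 1 respectively.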

Suppose that we have two tests, corresponding to rejection regions \(R_1\) and \(R_2\), respectively, each having significance level \(\alpha\). The test with region \(R_1\) is uniformly more powerful than the test with region \(R_2\) if \[ \P(\bs{X} \in R_1) \ge \P(\bs{X} \in R_2) \text{ for every distribution of } \bs{X} \text{ specified by } H_1 \]

Naturally, in this case, we would prefer the first test. Often, however, two tests will not be uniformly ordered; one test will be more powerful for some distributions specified by \(H_1\) while the other test will be more powerful for other distributions specified by \(H_1\).

If a test has significance level \(\alpha\) and is uniformly more powerful than any other test with significance level \(\alpha\), then the test is said to be a uniformly most powerful test at level \(\alpha\).

Clearly a uniformly most powerful test is the best we can do.


In most cases, we have a general procedure that allows us to construct a test (that is, a rejection region \(R_\alpha\)) for any given significance level \(\alpha \in (0, 1)\). Typically, \(R_\alpha\) decreases (in the subset sense) as \(\alpha\) decreases.

The \(P\)-value of the observed value \(\bs{x}\) of \(\bs{X}\), denoted \(P(\bs{x})\), is defined to be the smallest \(\alpha\) for which \(\bs{x} \in R_\alpha\); that is, the smallest significance level for which \(H_0\) is rejected, given \(\bs{X} = \bs{x}\).

Knowing \(P(\bs{x})\) allows us to test \(H_0\) at any significance level for the given data \(\bs{x}\): If \(P(\bs{x}) \le \alpha\) then we would reject \(H_0\) at significance level \(\alpha\); if \(P(\bs{x}) \gt \alpha\) then we fail to reject \(H_0\) at significance level \(\alpha\). Note that \(P(\bs{X})\) is a statistic. Informally, \(P(\bs{x})\) can often be thought of as the probability of an outcome as or more extreme than the observed value \(\bs{x}\), where extreme is interpreted relative to the null hypothesis \(H_0\).
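The definition of the \(P\)-value as the smallest \(\alpha\) with \(\bs{x} \in R_\alpha\) can be illustrated with a small Python sketch. The setting is assumed for illustration: a right-tailed test on a binomial count with \(H_0\) probability 0.5, and a hypothetical family of rejection regions \(R_\alpha = \{x : \P(X \ge x) \le \alpha\}\), which shrinks as \(\alpha\) decreases.

```python
from math import comb


def upper_tail(x: int, n: int, p0: float) -> float:
    """P(X >= x) for X ~ Binomial(n, p0)."""
    return sum(comb(n, k) * p0**k * (1 - p0) ** (n - k) for k in range(x, n + 1))


def rejection_region(alpha: float, n: int, p0: float) -> set:
    """Assumed family R_alpha = {x : P(X >= x) <= alpha}."""
    return {x for x in range(n + 1) if upper_tail(x, n, p0) <= alpha}


def p_value(x: int, n: int, p0: float) -> float:
    """Smallest alpha with x in R_alpha; for this family it is the upper tail probability."""
    return upper_tail(x, n, p0)


n, p0, x = 20, 0.5, 15
p = p_value(x, n, p0)
for alpha in (0.1, 0.05, 0.01):
    # Rejecting at level alpha is the same as P(x) <= alpha.
    assert (x in rejection_region(alpha, n, p0)) == (p <= alpha)
```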

Analogy with Justice Systems

There is a helpful analogy between statistical hypothesis testing and the criminal justice system in the US and various other countries. Consider a person charged with a crime. The presumed null hypothesis is that the person is innocent of the crime; the conjectured alternative hypothesis is that the person is guilty of the crime. The test of the hypotheses is a trial with evidence presented by both sides playing the role of the data. After considering the evidence, the jury delivers the decision as either not guilty or guilty. Note that innocent is not a possible verdict of the jury, because it is not the point of the trial to prove the person innocent. Rather, the point of the trial is to see whether there is sufficient evidence to overturn the null hypothesis that the person is innocent in favor of the alternative hypothesis that the person is guilty. A type 1 error is convicting a person who is innocent; a type 2 error is acquitting a person who is guilty. Generally, a type 1 error is considered the more serious of the two possible errors, so in an attempt to hold the chance of a type 1 error to a very low level, the standard for conviction in serious criminal cases is beyond a reasonable doubt.

Tests of an Unknown Parameter

Hypothesis testing is a very general concept, but an important special class occurs when the distribution of the data variable \(\bs{X}\) depends on a parameter \(\theta\) taking values in a parameter space \(\Theta\). The parameter may be vector-valued, so that \(\bs{\theta} = (\theta_1, \theta_2, \ldots, \theta_k)\) and \(\Theta \subseteq \R^k\) for some \(k \in \N_+\). The hypotheses generally take the form \[ H_0: \theta \in \Theta_0 \text{ versus } H_1: \theta \notin \Theta_0 \] where \(\Theta_0\) is a prescribed subset of the parameter space \(\Theta\). In this setting, the probabilities of making an error or a correct decision depend on the true value of \(\theta\). If \(R\) is the rejection region, then the power function \( Q \) is given by \[ Q(\theta) = \P_\theta(\bs{X} \in R), \quad \theta \in \Theta \] The power function gives a lot of information about the test.

The power function satisfies the following properties:

  • \(Q(\theta)\) is the probability of a type 1 error when \(\theta \in \Theta_0\).
  • \(\max\left\{Q(\theta): \theta \in \Theta_0\right\}\) is the significance level of the test.
  • \(1 - Q(\theta)\) is the probability of a type 2 error when \(\theta \notin \Theta_0\).
  • \(Q(\theta)\) is the power of the test when \(\theta \notin \Theta_0\).
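The properties listed above can be explored numerically. The following Python sketch assumes, purely for illustration, a binomial count \(X\) from 20 trials, the composite null \(H_0: \theta \le 0.5\), and a rejection region of the form \(\{x \ge 15\}\); the grids only approximate the parameter sets.

```python
from math import comb


def power_function(theta: float, n: int, critical_value: int) -> float:
    """Q(theta) = P_theta(X >= critical_value) for X ~ Binomial(n, theta)."""
    return sum(
        comb(n, k) * theta**k * (1 - theta) ** (n - k)
        for k in range(critical_value, n + 1)
    )


n, critical_value = 20, 15
null_grid = [i / 100 for i in range(0, 51)]     # grid over Theta_0 = [0, 0.5]
alt_grid = [i / 100 for i in range(51, 101)]    # grid over theta > 0.5

# Significance level: largest type 1 error probability over Theta_0 (on the grid).
significance = max(power_function(t, n, critical_value) for t in null_grid)
# Type 2 error probabilities 1 - Q(theta) for theta outside Theta_0.
type2 = {t: 1.0 - power_function(t, n, critical_value) for t in alt_grid}
print(f"significance level is about {significance:.4f}")
```

In this assumed setting \(Q\) is increasing in \(\theta\), so the maximum over \(\Theta_0\) is attained at the boundary value 0.5.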

If we have two tests, we can compare them by means of their power functions.

Suppose that we have two tests, corresponding to rejection regions \(R_1\) and \(R_2\), respectively, each having significance level \(\alpha\). The test with rejection region \(R_1\) is uniformly more powerful than the test with rejection region \(R_2\) if \( Q_1(\theta) \ge Q_2(\theta)\) for all \( \theta \notin \Theta_0 \).

Most hypothesis tests of an unknown real parameter \(\theta\) fall into three special cases:

Suppose that \( \theta \) is a real parameter and \( \theta_0 \in \Theta \) a specified value. The tests below are respectively the two-sided test, the left-tailed test, and the right-tailed test.

  • \(H_0: \theta = \theta_0\) versus \(H_1: \theta \ne \theta_0\)
  • \(H_0: \theta \ge \theta_0\) versus \(H_1: \theta \lt \theta_0\)
  • \(H_0: \theta \le \theta_0\) versus \(H_1: \theta \gt \theta_0\)

Thus the tests are named after the conjectured alternative. Of course, there may be other unknown parameters besides \(\theta\) (known as nuisance parameters).

Equivalence Between Hypothesis Tests and Confidence Sets

There is an equivalence between hypothesis tests and confidence sets for a parameter \(\theta\).

Suppose that \(C(\bs{x})\) is a \(1 - \alpha\) level confidence set for \(\theta\). The following test has significance level \(\alpha\) for the hypothesis \( H_0: \theta = \theta_0 \) versus \( H_1: \theta \ne \theta_0 \): Reject \(H_0\) if and only if \(\theta_0 \notin C(\bs{x})\)

By definition, \(\P[\theta \in C(\bs{X})] = 1 - \alpha\). Hence if \(H_0\) is true so that \(\theta = \theta_0\), then the probability of a type 1 error is \(\P[\theta_0 \notin C(\bs{X})] = \alpha\).

Equivalently, we fail to reject \(H_0\) at significance level \(\alpha\) if and only if \(\theta_0\) is in the corresponding \(1 - \alpha\) level confidence set. In particular, this equivalence applies to interval estimates of a real parameter \(\theta\) and the common tests for \(\theta\) given above.

In each case below, the confidence interval has confidence level \(1 - \alpha\) and the test has significance level \(\alpha\).

  • Suppose that \(\left[L(\bs{X}), U(\bs{X})\right]\) is a two-sided confidence interval for \(\theta\). Reject \(H_0: \theta = \theta_0\) versus \(H_1: \theta \ne \theta_0\) if and only if \(\theta_0 \lt L(\bs{X})\) or \(\theta_0 \gt U(\bs{X})\).
  • Suppose that \(L(\bs{X})\) is a confidence lower bound for \(\theta\). Reject \(H_0: \theta \le \theta_0\) versus \(H_1: \theta \gt \theta_0\) if and only if \(\theta_0 \lt L(\bs{X})\).
  • Suppose that \(U(\bs{X})\) is a confidence upper bound for \(\theta\). Reject \(H_0: \theta \ge \theta_0\) versus \(H_1: \theta \lt \theta_0\) if and only if \(\theta_0 \gt U(\bs{X})\).
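The two-sided case can be checked numerically. The sketch below assumes a setting that is not in the original text: a random sample from a normal distribution with unknown mean \(\theta\) and known standard deviation `sigma`, with the usual \(z\)-interval and \(z\)-test. The data values are made up, and the function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean


def z_interval(xs, sigma: float, alpha: float):
    """Two-sided 1 - alpha confidence interval for the mean, sigma assumed known."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * sigma / sqrt(len(xs))
    m = mean(xs)
    return m - half_width, m + half_width


def z_test_rejects(xs, sigma: float, theta_0: float, alpha: float) -> bool:
    """Two-sided z-test of H0: theta = theta_0 at significance level alpha."""
    z_stat = (mean(xs) - theta_0) / (sigma / sqrt(len(xs)))
    return abs(z_stat) > NormalDist().inv_cdf(1 - alpha / 2)


xs = [4.9, 5.3, 5.8, 5.1, 6.0, 5.6, 5.4, 5.7]   # made-up illustrative data
lower, upper = z_interval(xs, sigma=0.5, alpha=0.05)
for theta_0 in (5.0, 5.5, 6.0):
    outside = theta_0 < lower or theta_0 > upper
    # The test rejects exactly when theta_0 falls outside the interval.
    assert z_test_rejects(xs, 0.5, theta_0, 0.05) == outside
```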

Pivot Variables and Test Statistics

Recall that confidence sets of an unknown parameter \(\theta\) are often constructed through a pivot variable, that is, a random variable \(W(\bs{X}, \theta)\) that depends on the data vector \(\bs{X}\) and the parameter \(\theta\), but whose distribution does not depend on \(\theta\) and is known. In this case, a natural test statistic for the basic tests given above is \(W(\bs{X}, \theta_0)\).


Mathematics LibreTexts

1.1: Statements and Conditional Statements


  • Ted Sundstrom
  • Grand Valley State University via ScholarWorks @Grand Valley State University


Much of our work in mathematics deals with statements. In mathematics, a statement is a declarative sentence that is either true or false but not both. A statement is sometimes called a proposition. The key is that there must be no ambiguity. To be a statement, a sentence must be true or false, and it cannot be both. So a sentence such as "The sky is beautiful" is not a statement since whether the sentence is true or not is a matter of opinion. A question such as "Is it raining?" is not a statement because it is a question and is not declaring or asserting that something is true.

Some sentences that are mathematical in nature often are not statements because we may not know precisely what a variable represents. For example, the equation 2\(x\)+5 = 10 is not a statement since we do not know what \(x\) represents. If we substitute a specific value for \(x\) (such as \(x\) = 3), then the resulting equation, 2\(\cdot\)3 +5 = 10 is a statement (which is a false statement). Following are some more examples:

  • There exists a real number \(x\) such that 2\(x\)+5 = 10. This is a statement because either such a real number exists or such a real number does not exist. In this case, this is a true statement since such a real number does exist, namely \(x\) = 2.5.
  • For each real number \(x\), \(2x +5 = 2 \left( x + \dfrac{5}{2}\right)\). This is a statement since either the sentence \(2x +5 = 2 \left( x + \dfrac{5}{2}\right)\) is true when any real number is substituted for \(x\) (in which case, the statement is true) or there is at least one real number that can be substituted for \(x\) and produce a false statement (in which case, the statement is false). In this case, the given statement is true.
  • Solve the equation \(x^2 - 7x +10 =0\). This is not a statement since it is a directive. It does not assert that something is true.
  • \((a+b)^2 = a^2+b^2\) is not a statement since it is not known what \(a\) and \(b\) represent. However, the sentence, “There exist real numbers \(a\) and \(b\) such that \((a+b)^2 = a^2+b^2\)” is a statement. In fact, this is a true statement since there are such real numbers. For example, if \(a=1\) and \(b=0\), then \((a+b)^2 = a^2+b^2\).
  • Compare the statement in the previous item to the statement, “For all real numbers \(a\) and \(b\), \((a+b)^2 = a^2+b^2\).” This is a false statement since there are values for \(a\) and \(b\) for which \((a+b)^2 \ne a^2+b^2\). For example, if \(a=2\) and \(b=3\), then \((a+b)^2 = 5^2 = 25\) and \(a^2 + b^2 = 2^2 +3^2 = 13\).
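The last two examples can be checked directly. This small Python sketch (the function name `identity_holds` is a hypothetical helper) evaluates the equation \((a+b)^2 = a^2+b^2\) for the two specific pairs used above.

```python
def identity_holds(a: float, b: float) -> bool:
    """Check whether (a + b)^2 equals a^2 + b^2 for one specific pair."""
    return (a + b) ** 2 == a**2 + b**2


print(identity_holds(1, 0))   # True: an example for the "there exist" statement
print(identity_holds(2, 3))   # False: a counterexample to the "for all" statement
```

One pair that works is enough to make the "there exist" statement true, and one pair that fails is enough to make the "for all" statement false.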

Progress Check 1.1: Statements

Which of the following sentences are statements? Do not worry about determining whether a statement is true or false; just determine whether each sentence is a statement or not.

  • 2\(\cdot\)7 + 8 = 22.
  • \((x-1) = \sqrt{x + 11}\).
  • \(2x + 5y = 7\).
  • There are integers \(x\) and \(y\) such that \(2x + 5y = 7\).
  • There are integers \(x\) and \(y\) such that \(23x + 27y = 52\).
  • Given a line \(L\) and a point \(P\) not on that line, there is a unique line through \(P\) that does not intersect \(L\).
  • \((a + b)^3 = a^3 + 3a^2b + 3ab^2 + b^3\).
  • \((a + b)^3 = a^3 + 3a^2b + 3ab^2 + b^3\) for all real numbers \(a\) and \(b\).
  • The derivative of \(f(x) = \sin x\) is \(f' (x) = \cos x\).
  • Does the equation \(3x^2 - 5x - 7 = 0\) have two real number solutions?
  • If \(ABC\) is a right triangle with right angle at vertex \(B\), and if \(D\) is the midpoint of the hypotenuse, then the line segment connecting vertex \(B\) to \(D\) is half the length of the hypotenuse.
  • There do not exist three integers \(x\), \(y\), and \(z\) such that \(x^3 + y^2 = z^3\).


How Do We Decide If a Statement Is True or False?

In mathematics, we often establish that a statement is true by writing a mathematical proof. To establish that a statement is false, we often find a so-called counterexample. (These ideas will be explored later in this chapter.) So mathematicians must be able to discover and construct proofs. In addition, once the discovery has been made, the mathematician must be able to communicate this discovery to others who speak the language of mathematics. We will be dealing with these ideas throughout the text.

For now, we want to focus on what happens before we start a proof. One thing that mathematicians often do is to make a conjecture beforehand as to whether the statement is true or false. This is often done through exploration. The role of exploration in mathematics is often difficult because the goal is not to find a specific answer but simply to investigate. Following are some techniques of exploration that might be helpful.

Techniques of Exploration

  • Guesswork and conjectures. Formulate and write down questions and conjectures. When we make a guess in mathematics, we usually call it a conjecture.

For example, if someone makes the conjecture that \(\sin(2x) = 2 \sin(x)\), for all real numbers \(x\), we can test this conjecture by substituting specific values for \(x\). One way to do this is to choose values of \(x\) for which \(\sin(x)\) is known. Using \(x = \frac{\pi}{4}\), we see that

\(\sin(2(\frac{\pi}{4})) = \sin(\frac{\pi}{2}) = 1,\) and

\(2\sin(\frac{\pi}{4}) = 2(\frac{\sqrt2}{2}) = \sqrt2\).

Since \(1 \ne \sqrt2\), these calculations show that this conjecture is false. However, if we do not find a counterexample for a conjecture, we usually cannot claim the conjecture is true. The best we can say is that our examples indicate the conjecture is true. As an example, consider the conjecture that

If \(x\) and \(y\) are odd integers, then \(x + y\) is an even integer.

We can do lots of calculations, such as \(3 + 7 = 10\) and \(5 + 11 = 16\), and find that every time we add two odd integers, the sum is an even integer. However, it is not possible to test every pair of odd integers, and so we can only say that the conjecture appears to be true. (We will prove that this statement is true in the next section.)

  • Use of prior knowledge. This also is very important. We cannot start from square one every time we explore a statement. We must make use of our acquired mathematical knowledge. For the conjecture that \(\sin (2x) = 2 \sin(x)\), for all real numbers \(x\), we might recall that there are trigonometric identities called “double angle identities.” We may even remember the correct identity for \(\sin (2x)\), but if we do not, we can always look it up. We should recall (or find) that for all real numbers \(x\), \[\sin(2x) = 2 \sin(x)\cos(x).\] We could use this identity to argue that the conjecture “for all real numbers \(x\), \(\sin (2x) = 2 \sin(x)\)” is false, but if we do, it is still a good idea to give a specific counterexample as we did before.
  • Cooperation and brainstorming. Working together is often more fruitful than working alone. When we work with someone else, we can compare notes and articulate our ideas. Thinking out loud is often a useful brainstorming method that helps generate new ideas.
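The two conjectures explored above can also be tested by computer. The following Python sketch is only an exploration aid: it uses floating-point comparison with `math.isclose`, and the range of odd integers tested is an arbitrary choice.

```python
import math

# A single counterexample refutes the conjecture sin(2x) = 2 sin(x) for all real x.
x = math.pi / 4
lhs = math.sin(2 * x)      # sin(pi/2), which is 1
rhs = 2 * math.sin(x)      # 2 * (sqrt(2)/2), which is sqrt(2)
conjecture_fails = not math.isclose(lhs, rhs)

# The double angle identity agrees at the same point (up to rounding).
identity_agrees = math.isclose(lhs, 2 * math.sin(x) * math.cos(x))

# Checking many pairs of odd integers supports, but does not prove, the second conjecture.
odds = range(-99, 100, 2)
all_sums_even = all((a + b) % 2 == 0 for a in odds for b in odds)

print(conjecture_fails, identity_agrees, all_sums_even)
```

The finite check over `odds` finds no counterexample, which indicates that the conjecture about sums of odd integers appears to be true, but a proof is still needed to cover every pair.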

Progress Check 1.2: Explorations

Use the techniques of exploration to investigate each of the following statements. Can you make a conjecture as to whether the statement is true or false? Can you determine whether it is true or false?

  • \((a + b)^2 = a^2 + b^2\), for all real numbers \(a\) and \(b\).
  • There are integers \(x\) and \(y\) such that \(2x + 5y = 41\).
  • If \(x\) is an even integer, then \(x^2\) is an even integer.
  • If \(x\) and \(y\) are odd integers, then \(x \cdot y\) is an odd integer.

Conditional Statements

One of the most frequently used types of statements in mathematics is the so-called conditional statement. Given statements \(P\) and \(Q\), a statement of the form “If \(P\) then \(Q\)” is called a conditional statement. It seems reasonable that the truth value (true or false) of the conditional statement “If \(P\) then \(Q\)” depends on the truth values of \(P\) and \(Q\). The statement “If \(P\) then \(Q\)” means that \(Q\) must be true whenever \(P\) is true. The statement \(P\) is called the hypothesis of the conditional statement, and the statement \(Q\) is called the conclusion of the conditional statement. Since conditional statements are probably the most important type of statement in mathematics, we give a more formal definition.

A conditional statement is a statement that can be written in the form “If \(P\) then \(Q\),” where \(P\) and \(Q\) are sentences. For this conditional statement, \(P\) is called the hypothesis and \(Q\) is called the conclusion.

Intuitively, “If \(P\) then \(Q\)” means that \(Q\) must be true whenever \(P\) is true. Because conditional statements are used so often, a symbolic shorthand notation is used to represent the conditional statement “If \(P\) then \(Q\).” We will use the notation \(P \to Q\) to represent “If \(P\) then \(Q\).” When \(P\) and \(Q\) are statements, it seems reasonable that the truth value (true or false) of the conditional statement \(P \to Q\) depends on the truth values of \(P\) and \(Q\). There are four cases to consider:

  • \(P\) is true and \(Q\) is true.
  • \(P\) is false and \(Q\) is true.
  • \(P\) is true and \(Q\) is false.
  • \(P\) is false and \(Q\) is false.

The conditional statement \(P \to Q\) means that \(Q\) is true whenever \(P\) is true. It says nothing about the truth value of \(Q\) when \(P\) is false. Using this as a guide, we define the conditional statement \(P \to Q\) to be false only when \(P\) is true and \(Q\) is false, that is, only when the hypothesis is true and the conclusion is false. In all other cases, \(P \to Q\) is true. This is summarized in Table 1.1, which is called a truth table for the conditional statement \(P \to Q\). (In Table 1.1, T stands for “true” and F stands for “false.”)

Table 1.1: Truth Table for \(P \to Q\)

\[ \begin{array}{c|c|c} P & Q & P \to Q \\ \hline \text{T} & \text{T} & \text{T} \\ \text{T} & \text{F} & \text{F} \\ \text{F} & \text{T} & \text{T} \\ \text{F} & \text{F} & \text{T} \end{array} \]

The important thing to remember is that the conditional statement \(P \to Q\) has its own truth value. It is either true or false (and not both). Its truth value depends on the truth values for \(P\) and \(Q\), but some find it a bit puzzling that the conditional statement is considered to be true when the hypothesis P is false. We will provide a justification for this through the use of an example.
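The definition of the truth value of \(P \to Q\) translates directly into a short Python sketch. The function name `implies` is a hypothetical helper; it encodes the rule that the conditional is false only when the hypothesis is true and the conclusion is false.

```python
from itertools import product


def implies(p: bool, q: bool) -> bool:
    """Truth value of P -> Q: false only when P is true and Q is false."""
    return (not p) or q


# Enumerate the four cases in the order TT, TF, FT, FF.
rows = [(p, q, implies(p, q)) for p, q in product([True, False], repeat=2)]
for p, q, value in rows:
    print(f"P={p!s:5}  Q={q!s:5}  P->Q={value}")
```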

Example 1.3:

Suppose that I say

“If it is not raining, then Daisy is riding her bike.”

We can represent this conditional statement as \(P \to Q\) where \(P\) is the statement, “It is not raining” and \(Q\) is the statement, “Daisy is riding her bike.”

Although it is not a perfect analogy, think of the statement \(P \to Q\) as being false to mean that I lied and think of the statement \(P \to Q\) as being true to mean that I did not lie. We will now check the truth value of \(P \to Q\) based on the truth values of \(P\) and \(Q\).

  • Suppose that both \(P\) and \(Q\) are true. That is, it is not raining and Daisy is riding her bike. In this case, it seems reasonable to say that I told the truth and that \(P \to Q\) is true.
  • Suppose that \(P\) is true and \(Q\) is false. That is, it is not raining and Daisy is not riding her bike. It would appear that by making the statement, “If it is not raining, then Daisy is riding her bike,” I have not told the truth. So in this case, the statement \(P \to Q\) is false.
  • Now suppose that \(P\) is false and \(Q\) is true or that it is raining and Daisy is riding her bike. Did I make a false statement by stating that if it is not raining, then Daisy is riding her bike? The key is that I did not make any statement about what would happen if it was raining, and so I did not tell a lie. So we consider the conditional statement, “If it is not raining, then Daisy is riding her bike,” to be true in the case where it is raining and Daisy is riding her bike.
  • Finally, suppose that both \(P\) and \(Q\) are false. That is, it is raining and Daisy is not riding her bike. As in the previous situation, since my statement was \(P \to Q\), I made no claim about what would happen if it was raining, and so I did not tell a lie. So the statement \(P \to Q\) cannot be false in this case and so we consider it to be true.

Progress Check 1.4: Explorations with Conditional Statements

1. Consider the following sentence:

If \(x\) is a positive real number, then \(x^2 + 8x\) is a positive real number.

Although the hypothesis and conclusion of this conditional sentence are not statements, the conditional sentence itself can be considered to be a statement as long as we know what possible numbers may be used for the variable \(x\). From the context of this sentence, it seems that we can substitute any positive real number for \(x\). We can also substitute 0 for \(x\) or a negative real number for \(x\) provided that we are willing to work with a false hypothesis in the conditional statement. (In Chapter 2, we will learn how to be more careful and precise with these types of conditional statements.)

(a) Notice that if \(x = -3\), then \(x^2 + 8x = -15\), which is negative. Does this mean that the given conditional statement is false?

(b) Notice that if \(x = 4\), then \(x^2 + 8x = 48\), which is positive. Does this mean that the given conditional statement is true?

(c) Do you think this conditional statement is true or false? Record the results for at least five different examples where the hypothesis of this conditional statement is true.

2. “If \(n\) is a positive integer, then \(n^2 - n + 41\) is a prime number.” (Remember that a prime number is a positive integer greater than 1 whose only positive factors are 1 and itself.) To explore whether or not this statement is true, try (and record your results for) \(n = 1\), \(n = 2\), \(n = 3\), \(n = 4\), \(n = 5\), and \(n = 10\). Then record the results for at least four other values of \(n\). Does this conditional statement appear to be true?

Further Remarks about Conditional Statements

Suppose that Ed has exactly $52 in his wallet. The following four statements will use the four possible truth combinations for the hypothesis and conclusion of a conditional statement.

  • If Ed has exactly $52 in his wallet, then he has $20 in his wallet. This is a true statement. Notice that both the hypothesis and the conclusion are true.
  • If Ed has exactly $52 in his wallet, then he has $100 in his wallet. This statement is false. Notice that the hypothesis is true and the conclusion is false.
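  • If Ed has $100 in his wallet, then he has at least $50 in his wallet. This statement is true regardless of how much money he has in his wallet. In this case, the hypothesis is false and the conclusion is true.
  • If Ed has $100 in his wallet, then he has at least $80 in his wallet. This statement is true regardless of how much money he has in his wallet. In this case, the hypothesis is false and the conclusion is false.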

This is admittedly a contrived example but it does illustrate that the conventions for the truth value of a conditional statement make sense. The message is that in order to be complete in mathematics, we need to have conventions about when a conditional statement is true and when it is false.

In Progress Check 1.4, we also explored the following conditional statement:

If \(n\) is a positive integer, then \((n^2 - n + 41)\) is a prime number.

Perhaps for all of the values you tried for \(n\), \((n^2 - n + 41)\) turned out to be a prime number. However, if we try \(n = 41\), we get

\(n^2 - n + 41 = 41^2 - 41 + 41 = 41^2.\)

So in the case where \(n = 41\), the hypothesis is true (41 is a positive integer) and the conclusion is false (\(41^2\) is not prime). Therefore, 41 is a counterexample for this conjecture and the conditional statement “If \(n\) is a positive integer, then \((n^2 - n + 41)\) is a prime number” is false. There are other counterexamples (such as \(n = 42\), \(n = 45\), and \(n = 50\)), but only one counterexample is needed to prove that the statement is false.
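A search for counterexamples like these can be automated. The sketch below is one possible way to do it in Python; the helper names `is_prime` and `counterexamples` are hypothetical, and trial division is used only because the numbers involved are small.

```python
def is_prime(m: int) -> bool:
    """Trial division; adequate for the small values explored here."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def counterexamples(limit: int) -> list[int]:
    """Positive integers n <= limit for which n^2 - n + 41 is not prime."""
    return [n for n in range(1, limit + 1) if not is_prime(n * n - n + 41)]

print(counterexamples(50))  # [41, 42, 45, 50]
```

An empty result for a given limit would not prove the conditional statement, since only finitely many values are checked, but a single value in the list is enough to show that the statement is false.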

  • Although one example can be used to prove that a conditional statement is false, in most cases, we cannot use examples to prove that a conditional statement is true. For example, in Progress Check 1.4 , we substituted values for \(x\) for the conditional statement “If \(x\) is a positive real number, then \(x^2 + 8x\) is a positive real number.” For every positive real number used for \(x\), we saw that \(x^2 + 8x\) was positive. However, this does not prove the conditional statement to be true because it is impossible to substitute every positive real number for \(x\). So, although we may believe this statement is true, to be able to conclude it is true, we need to write a mathematical proof. Methods of proof will be discussed in Section 1.2 and Chapter 3 .

Progress Check 1.5: Working with a Conditional Statement

The following statement is a true statement, which is proven in many calculus texts.

If the function \(f\) is differentiable at \(a\), then the function \(f\) is continuous at \(a\).

Using only this true statement, is it possible to make a conclusion about the function in each of the following cases?

  • It is known that the function \(f\), where \(f(x) = \sin x\), is differentiable at 0.
  • It is known that the function \(f\), where \(f(x) = \sqrt[3]{x}\), is not differentiable at 0.
  • It is known that the function \(f\), where \(f(x) = |x|\), is continuous at 0.
  • It is known that the function \(f\), where \(f(x) = \dfrac{|x|}{x}\), is not continuous at 0.

Closure Properties of Number Systems

The primary number system used in algebra and calculus is the real number system. We usually use the symbol \(\mathbb{R}\) to stand for the set of all real numbers. The real numbers consist of the rational numbers and the irrational numbers. The rational numbers are those real numbers that can be written as a quotient of two integers (with a nonzero denominator), and the irrational numbers are those real numbers that cannot be written as a quotient of two integers. That is, a rational number can be written in the form of a fraction, and an irrational number cannot be written in the form of a fraction. Some common irrational numbers are \(\sqrt{2}\), \(\pi\), and \(e\). We usually use the symbol \(\mathbb{Q}\) to represent the set of all rational numbers. (The letter \(\mathbb{Q}\) is used because rational numbers are quotients of integers.) There is no standard symbol for the set of all irrational numbers.

Perhaps the most basic number system used in mathematics is the set of natural numbers . The natural numbers consist of the positive whole numbers such as 1, 2, 3, 107, and 203. We will use the symbol \(\mathbb{N}\) to stand for the set of natural numbers. Another basic number system that we will be working with is the set of integers . The integers consist of zero, the positive whole numbers, and the negatives of the positive whole numbers. If \(n\) is an integer, we can write \(n = \dfrac{n}{1}\). So each integer is a rational number and hence also a real number.

We will use the letter \(\mathbb{Z}\) to stand for the set of integers. (The letter \(\mathbb{Z}\) is from the German word, Zahlen, for numbers.) Three of the basic properties of the integers are that the set \(\mathbb{Z}\) is closed under addition, the set \(\mathbb{Z}\) is closed under multiplication, and the set \(\mathbb{Z}\) is closed under subtraction. This means that

  • If \(x\) and \(y\) are integers, then \(x + y\) is an integer;
  • If \(x\) and \(y\) are integers, then \(x \cdot y\) is an integer; and
  • If \(x\) and \(y\) are integers, then \(x - y\) is an integer.

Notice that these so-called closure properties are defined in terms of conditional statements. This means that if we can find one instance where the hypothesis is true and the conclusion is false, then the conditional statement is false.

Example 1.6: Closure

  • In order for the set of natural numbers to be closed under subtraction, the following conditional statement would have to be true: If \(x\) and \(y\) are natural numbers, then \(x - y\) is a natural number. However, since 5 and 8 are natural numbers and \(5 - 8 = -3\), which is not a natural number, this conditional statement is false. Therefore, the set of natural numbers is not closed under subtraction.
  • We can use the rules for multiplying fractions and the closure rules for the integers to show that the rational numbers are closed under multiplication. If \(\dfrac{a}{b}\) and \(\dfrac{c}{d}\) are rational numbers (so \(a\), \(b\), \(c\), and \(d\) are integers and \(b\) and \(d\) are not zero), then \(\dfrac{a}{b} \cdot \dfrac{c}{d} = \dfrac{ac}{bd}.\) Since the integers are closed under multiplication, we know that \(ac\) and \(bd\) are integers and since \(b \ne 0\) and \(d \ne 0\), \(bd \ne 0\). Hence, \(\dfrac{ac}{bd}\) is a rational number and this shows that the rational numbers are closed under multiplication.
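Because a closure property is a conditional statement, a single counterexample shows that it fails. The following sketch searches a finite sample of natural numbers for such a counterexample; the names `closure_counterexample` and `is_natural` are our own, and the sample `range(1, 11)` is an arbitrary choice. Python's standard `fractions.Fraction` type is used to illustrate the product of two rational numbers.

```python
from fractions import Fraction
from itertools import product

def closure_counterexample(elements, op, is_member):
    """Return the first pair (x, y) with op(x, y) outside the set, or None."""
    for x, y in product(elements, repeat=2):
        if not is_member(op(x, y)):
            return (x, y)
    return None

def is_natural(v):
    return isinstance(v, int) and v >= 1

naturals = range(1, 11)  # a finite sample of the natural numbers
print(closure_counterexample(naturals, lambda x, y: x - y, is_natural))  # (1, 1)
print(closure_counterexample(naturals, lambda x, y: x + y, is_natural))  # None

# Fraction keeps products of rationals exact: (a/b)(c/d) = ac/bd
print(Fraction(2, 3) * Fraction(5, 7))  # 10/21
```

A result of `None` only means that no counterexample was found in the sample. It does not prove closure, which still requires an argument like the one given above for the rational numbers.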

Progress Check 1.7: Closure Properties

Answer each of the following questions.

  • Is the set of rational numbers closed under addition? Explain.
  • Is the set of integers closed under division? Explain.
  • Is the set of rational numbers closed under subtraction? Explain.
  • Which of the following sentences are statements? (a) \(3^2 + 4^2 = 5^2.\) (b) \(a^2 + b^2 = c^2.\) (c) There exist integers \(a\), \(b\), and \(c\) such that \(a^2 + b^2 = c^2.\) (d) If \(x^2 = 4\), then \(x = 2.\) (e) For each real number \(x\), if \(x^2 = 4\), then \(x = 2.\) (f) For each real number \(t\), \(\sin^2 t + \cos^2 t = 1.\) (g) \(\sin x < \sin (\frac{\pi}{4}).\) (h) If \(n\) is a prime number, then \(n^2\) has three positive factors. (i) \(1 + \tan^2 \theta = \sec^2 \theta.\) (j) Every rectangle is a parallelogram. (k) Every even natural number greater than or equal to 4 is the sum of two prime numbers.
  • Identify the hypothesis and the conclusion for each of the following conditional statements. (a) If \(n\) is a prime number, then \(n^2\) has three positive factors. (b) If \(a\) is an irrational number and \(b\) is an irrational number, then \(a \cdot b\) is an irrational number. (c) If \(p\) is a prime number, then \(p = 2\) or \(p\) is an odd number. (d) If \(p\) is a prime number and \(p \ne 2\), then \(p\) is an odd number. (e) If \(p \ne 2\) and \(p\) is an even number, then \(p\) is not prime.
  • Determine whether each of the following conditional statements is true or false. (a) If \(10 < 7\), then \(3 = 4\). (b) If \(7 < 10\), then \(3 = 4\). (c) If \(10 < 7\), then \(3 + 5 = 8\). (d) If \(7 < 10\), then \(3 + 5 = 8\).
  • Determine the conditions under which each of the following conditional sentences will be a true statement. (a) If \(a + 2 = 5\), then \(8 < 5\). (b) If \(5 < 8\), then \(a + 2 = 5\).
  • Let \(P\) be the statement “Student X passed every assignment in Calculus I,” and let \(Q\) be the statement “Student X received a grade of C or better in Calculus I.” (a) What does it mean for \(P\) to be true? What does it mean for \(Q\) to be true? (b) Suppose that Student X passed every assignment in Calculus I and received a grade of B-, and that the instructor made the statement \(P \to Q\). Would you say that the instructor lied or told the truth? (c) Suppose that Student X passed every assignment in Calculus I and received a grade of C-, and that the instructor made the statement \(P \to Q\). Would you say that the instructor lied or told the truth? (d) Now suppose that Student X did not pass two assignments in Calculus I and received a grade of D, and that the instructor made the statement \(P \to Q\). Would you say that the instructor lied or told the truth? (e) How are Parts (b), (c), and (d) related to the truth table for \(P \to Q\)?

Theorem. If \(f\) is a quadratic function of the form \(f(x) = ax^2 + bx + c\) and \(a < 0\), then the function \(f\) has a maximum value when \(x = \dfrac{-b}{2a}\). Using only this theorem, what can be concluded about the functions given by the following formulas? (a) \(g (x) = -8x^2 + 5x - 2\) (b) \(h (x) = -\dfrac{1}{3}x^2 + 3x\) (c) \(k (x) = 8x^2 - 5x - 7\) (d) \(j (x) = -\dfrac{71}{99}x^2 +210\) (e) \(f (x) = -4x^2 - 3x + 7\) (f) \(F (x) = -x^4 + x^3 + 9\)

Theorem. If \(f\) is a quadratic function of the form \(f(x) = ax^2 + bx + c\) and \(ac < 0\), then the function \(f\) has two \(x\)-intercepts.

Using only this theorem, what can be concluded about the functions given by the following formulas? (a) \(g (x) = -8x^2 + 5x - 2\) (b) \(h (x) = -\dfrac{1}{3}x^2 + 3x\) (c) \(k (x) = 8x^2 - 5x - 7\) (d) \(j (x) = -\dfrac{71}{99}x^2 +210\) (e) \(f (x) = -4x^2 - 3x + 7\) (f) \(F (x) = -x^4 + x^3 + 9\)

Theorem A. If \(f\) is a cubic function of the form \(f (x) = x^3 - x + b\) and \(b > 1\), then the function \(f\) has exactly one \(x\)-intercept. Following is another theorem about \(x\)-intercepts of functions: Theorem B. If \(f\) and \(g\) are functions with \(g (x) = k \cdot f (x)\), where \(k\) is a nonzero real number, then \(f\) and \(g\) have exactly the same \(x\)-intercepts.

Using only these two theorems and some simple algebraic manipulations, what can be concluded about the functions given by the following formulas? (a) \(f (x) = x^3 -x + 7\) (b) \(g (x) = x^3 + x +7\) (c) \(h (x) = -x^3 + x - 5\) (d) \(k (x) = 2x^3 + 2x + 3\) (e) \(r (x) = x^4 - x + 11\) (f) \(F (x) = 2x^3 - 2x + 7\)

  • (a) Is the set of natural numbers closed under division? (b) Is the set of rational numbers closed under division? (c) Is the set of nonzero rational numbers closed under division? (d) Is the set of positive rational numbers closed under division? (e) Is the set of positive real numbers closed under subtraction? (f) Is the set of negative rational numbers closed under division? (g) Is the set of negative integers closed under addition?

Explorations and Activities

  • Exploring Propositions. In Progress Check 1.2, we used exploration to show that certain statements were false and to make conjectures that certain statements were true. We can also use exploration to formulate a conjecture that we believe to be true. For example, if we calculate successive powers of 2 \((2^1, 2^2, 2^3, 2^4, 2^5, \ldots)\) and examine the units digits of these numbers, we could make the following conjectures (among others): \(\bullet\) If \(n\) is a natural number, then the units digit of \(2^n\) must be 2, 4, 6, or 8. \(\bullet\) The units digits of the successive powers of 2 repeat according to the pattern “2, 4, 8, 6.” (a) Is it possible to formulate a conjecture about the units digits of successive powers of 4 \((4^1, 4^2, 4^3, 4^4, 4^5, \ldots)\)? If so, formulate at least one conjecture. (b) Is it possible to formulate a conjecture about the units digit of numbers of the form \(7^n - 2^n\), where \(n\) is a natural number? If so, formulate a conjecture in the form of a conditional statement in the form “If \(n\) is a natural number, then ... .” (c) Let \(f (x) = e^{2x}\). Determine the first eight derivatives of this function. What do you observe? Formulate a conjecture that appears to be true. The conjecture should be written as a conditional statement in the form, “If \(n\) is a natural number, then ... .”
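Explorations of this kind are easy to carry out with a few lines of code. The sketch below reproduces the units digits of the successive powers of 2 mentioned above; the function name `units_digits` is our own, and the same function could be reused for other bases when exploring further conjectures.

```python
def units_digits(base: int, count: int) -> list[int]:
    """Units digits of base^1, base^2, ..., base^count."""
    # pow(base, n, 10) computes base^n modulo 10, which is the units digit.
    return [pow(base, n, 10) for n in range(1, count + 1)]

print(units_digits(2, 8))  # [2, 4, 8, 6, 2, 4, 8, 6]
```

As with any exploration, the output supports a conjecture but does not prove it.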

17 Introduction to Hypothesis Testing

Jenna Lehmann

What is Hypothesis Testing?

Hypothesis testing is a central part of inferential statistics. It’s a procedure and set of rules that allow us to move from descriptive statistics to inferences about a population based on sample data. In other words, it is a statistical method that uses sample data to evaluate a hypothesis about a population.

This type of test is usually used within the context of research. When we compare a treated group with an untreated group (in some cases the untreated group is represented by the parameters we know about the population), we expect the treatment to change the mean but leave the standard deviation the same, as if each individual score has had a constant value added to or subtracted from it.

Steps of Hypothesis Testing

The following steps are tailored to the first kind of hypothesis test we will learn: the single-sample z-test. There are many other kinds of tests, so keep this in mind.

  • Null Hypothesis (H0): states that in the general population there is no change, no difference, or no relationship, or in the context of an experiment, it predicts that the independent variable has no effect on the dependent variable.
  • Alternative Hypothesis (H1): states that there is a change, a difference, or a relationship for the general population, or in the context of an experiment, it predicts that the independent variable has an effect on the dependent variable.

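  • Alpha level (level of significance): the probability of a Type I error that we are willing to accept if the null hypothesis is true, for example \(\alpha = 0.05\).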

  • Critical Region: Composed of the extreme sample values that are very unlikely to be obtained if the null hypothesis is true. It is determined by the alpha level. If sample data fall in the critical region, the null hypothesis is rejected, because it’s very unlikely they’ve fallen there by chance.
  • After collecting the data, we find the sample mean. Now we can compare the sample mean with the null hypothesis by computing a z-score that describes where the sample mean is located relative to the hypothesized population mean. We use the z-score formula \(z = \frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\), where \(\overline{x}\) is the sample mean, \(\mu\) is the hypothesized population mean, \(\sigma\) is the population standard deviation, and \(n\) is the sample size.
  • We decided previously what the two z-score boundaries of the critical region are. If the z-score we get after plugging the numbers into the formula falls beyond those boundaries, that is, inside the critical region, we reject the null hypothesis. Otherwise, we say that we failed to reject the null hypothesis.
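The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `single_sample_z_test` and all of the numbers are made up for the example, the test is two-tailed, and the critical value comes from the standard library's `statistics.NormalDist` rather than from a unit normal table.

```python
from math import sqrt
from statistics import NormalDist

def single_sample_z_test(sample_mean, mu0, sigma, n, alpha=0.05):
    """Two-tailed single-sample z-test; returns (z, critical value, reject H0?)."""
    standard_error = sigma / sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # The critical region is both tails, with alpha / 2 in each.
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return z, z_crit, abs(z) > z_crit

# Hypothetical numbers: mu0 = 80, sigma = 20, n = 25, sample mean = 89
z, z_crit, reject = single_sample_z_test(89, 80, 20, 25)
print(round(z, 2), round(z_crit, 2), reject)  # 2.25 1.96 True
```

Here the z-score of 2.25 lies beyond the boundary of 1.96, so with these made-up numbers we would reject the null hypothesis.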

Regions of the Distribution

Because we’re making judgments based on probability and proportion, our normal distributions and certain regions within them come into play.

The Critical Region is composed of the extreme sample values that are very unlikely to be obtained if the null hypothesis is true. It is determined by the alpha level. If sample data fall in the critical region, the null hypothesis is rejected, because it’s very unlikely they’ve fallen there by chance.

These regions come into play when talking about different errors.

A Type I Error occurs when a researcher rejects a null hypothesis that is actually true; the researcher concludes that a treatment has an effect when it actually doesn’t. This happens when a researcher unknowingly obtains an extreme, non-representative sample. This goes back to alpha level: it’s the probability that the test will lead to a Type I error if the null hypothesis is true.
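One way to see the link between the alpha level and Type I errors is a small simulation. The sketch below is an illustration only: the function name, the number of trials, the sample size, and the seed are all arbitrary choices. Every sample is drawn from a population where the null hypothesis really is true, so every rejection is a Type I error.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean

def type_one_error_rate(trials=2000, n=25, mu0=0.0, sigma=1.0, alpha=0.05, seed=0):
    """Fraction of simulated samples that reject H0 even though H0 is true."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    rejections = 0
    for _ in range(trials):
        # Draw the sample from the null population, so H0 really holds.
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        z = (fmean(sample) - mu0) / (sigma / sqrt(n))
        if abs(z) > z_crit:
            rejections += 1
    return rejections / trials

print(type_one_error_rate())
```

The printed proportion should land close to the alpha level of 0.05, although it will vary somewhat with the seed and the number of trials.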


A result is said to be significant or statistically significant if it is very unlikely to occur when the null hypothesis is true. That is, the result is sufficient to reject the null hypothesis. For instance, two means can be significantly different from one another.

Factors that Influence and Assumptions of Hypothesis Testing

Assumptions of Hypothesis Testing:

  • Random sampling: it is assumed that the participants used in the study were selected randomly so that we can confidently generalize our findings from the sample to the population.
  • Independent observations: two observations are independent if there is no consistent, predictable relationship between the first observation and the second.
  • The value of σ is unchanged by the treatment: if the population standard deviation is unknown, we assume that the standard deviation for the unknown population (after treatment) is the same as it was for the population before treatment. There are ways of checking to see if this is true in SPSS or Excel.
  • Normal sampling distribution: in order to use the unit normal table to identify the critical region, we need the distribution of sample means to be normal (which means we need the population to be distributed normally and/or each sample size needs to be 30 or greater based on what we know about the central limit theorem).

Factors that influence hypothesis testing:

  • The variability of the scores, which is measured by either the standard deviation or the variance. The variability influences the size of the standard error in the denominator of the z-score.
  • The number of scores in the sample. This value also influences the size of the standard error in the denominator.

Test statistic: indicates that the sample data are converted into a single, specific statistic that is used to test the hypothesis (in this case, the z-score statistic).

Directional Hypotheses and Tailed Tests

In a directional hypothesis test, also known as a one-tailed test, the statistical hypotheses specify either an increase or a decrease in the population mean. That is, they make a statement about the direction of the effect.

The Hypotheses for a Directional Test:

  • H0: The test scores are not increased/decreased (the treatment doesn’t work)
  • H1: The test scores are increased/decreased (the treatment works as predicted)

Because we’re only worried about scores that are either greater or less than the scores predicted by the null hypothesis, we only worry about what’s going on in one tail, meaning that the critical region only exists within one tail. This means that all of the alpha is contained in one tail rather than split between both (so the whole 5% is located in the tail we care about, rather than 2.5% in each tail). So before, we looked up the 0.025 mark of the unit normal table to cover both tails, but now we look up 0.05 because we’re only looking at one tail.
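To make the difference concrete, here is a small sketch that computes the critical z-value for each kind of test using `statistics.NormalDist` from the Python standard library. The function name `critical_z` is an assumption made for this example.

```python
from statistics import NormalDist

def critical_z(alpha=0.05, tails=2):
    """Positive critical z-value for a one- or two-tailed test."""
    if tails not in (1, 2):
        raise ValueError("tails must be 1 or 2")
    # One tail holds all of alpha; two tails hold alpha / 2 each.
    return NormalDist().inv_cdf(1 - alpha / tails)

print(round(critical_z(tails=1), 3))  # 1.645
print(round(critical_z(tails=2), 3))  # 1.96
```

The one-tailed boundary is closer to the center of the distribution, which is why a smaller difference in the predicted direction can be enough to reject the null hypothesis.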

A one-tailed test allows you to reject the null hypothesis when the difference between the sample and the population is relatively small, as long as that difference is in the direction that you predicted. A two-tailed test, on the other hand, requires a relatively large difference independent of direction. In practice, researchers hypothesize using a one-tailed method but base their findings off of whether the results fall into the critical region of a two-tailed method. For the purposes of this class, make sure to calculate your results using the test that is specified in the problem.

Effect Size

A measure of effect size is intended to provide a measurement of the absolute magnitude of a treatment effect, independent of the size of the sample(s) being used. It is usually measured with Cohen’s d. Imagine the two distributions layered over one another. The more they overlap, the smaller the effect size (the means of the two distributions are close). The more they are spread apart, the greater the effect size (the means of the two distributions are farther apart).
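For a single-sample z-test, Cohen’s d is commonly computed as the mean difference divided by the standard deviation. The sketch below assumes that form; the function name and the numbers are made up for illustration.

```python
def cohens_d(sample_mean, population_mean, sigma):
    """Cohen's d for a single sample: mean difference in standard deviation units."""
    return (sample_mean - population_mean) / sigma

# Hypothetical numbers: a 9-point difference with sigma = 20
print(cohens_d(89, 80, 20))  # 0.45
```

Notice that the sample size does not appear anywhere in the calculation, which is what makes this a measure of effect size rather than of statistical significance.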

Statistical Power

The power of a statistical test is the probability that the test will correctly reject a false null hypothesis. It’s usually what we’re hoping to get when we run an experiment. It’s displayed in the table posted above. Power and effect size are connected. So, we know that the greater the distance between the means, the greater the effect size. If the two distributions overlapped very little, there would be a greater chance of selecting a sample that leads to rejecting the null hypothesis.
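The connection between power and the distance between the means can be sketched numerically. The example below assumes a right-tailed single-sample z-test with a known σ; the function name `power_right_tailed` and all of the numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def power_right_tailed(mu0, mu1, sigma, n, alpha=0.05):
    """Power of a right-tailed single-sample z-test when the true mean is mu1."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)
    # Under the alternative, z is centered at this shift instead of at 0.
    shift = (mu1 - mu0) / (sigma / sqrt(n))
    return 1 - std_normal.cdf(z_crit - shift)

# Hypothetical numbers: mu0 = 80, sigma = 20, n = 25
print(power_right_tailed(80, 84, 20, 25))  # small treatment effect
print(power_right_tailed(80, 92, 20, 25))  # larger treatment effect
```

The second value is much larger than the first: the farther apart the two means are, the less the distributions overlap and the more likely the test is to reject a false null hypothesis.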

This chapter was originally posted to the Math Support Center blog at the University of Baltimore on June 11, 2019.

Math and Statistics Guides from UB's Math & Statistics Center Copyright © by Jenna Lehmann is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.

Hypothesis Testing

Hypothesis testing is a tool for making statistical inferences about the population data. It is an analysis tool that tests assumptions and determines how likely something is within a given standard of accuracy. Hypothesis testing provides a way to verify whether the results of an experiment are valid.

A null hypothesis and an alternative hypothesis are set up before performing the hypothesis testing. This helps to arrive at a conclusion regarding the sample obtained from the population. In this article, we will learn more about hypothesis testing, its types, steps to perform the testing, and associated examples.

What is Hypothesis Testing in Statistics?

Hypothesis testing uses sample data from the population to draw useful conclusions regarding the population probability distribution . It tests an assumption made about the data using different types of hypothesis testing methodologies. The hypothesis testing results in either rejecting or not rejecting the null hypothesis.

Hypothesis Testing Definition

Hypothesis testing can be defined as a statistical tool that is used to identify if the results of an experiment are meaningful or not. It involves setting up a null hypothesis and an alternative hypothesis. These two hypotheses will always be mutually exclusive. This means that if the null hypothesis is true then the alternative hypothesis is false and vice versa. An example of hypothesis testing is setting up a test to check if a new medicine works on a disease in a more efficient manner.

Null Hypothesis

The null hypothesis is a concise mathematical statement that is used to indicate that there is no difference between two possibilities. In other words, there is no difference between certain characteristics of data. This hypothesis assumes that the outcomes of an experiment are based on chance alone. It is denoted as \(H_{0}\). Hypothesis testing is used to conclude if the null hypothesis can be rejected or not. Suppose an experiment is conducted to check if girls are shorter than boys at the age of 5. The null hypothesis will say that they are the same height.

Alternative Hypothesis

The alternative hypothesis is an alternative to the null hypothesis. It is used to show that the observations of an experiment are due to some real effect. It indicates that there is a statistically significant difference between two possible outcomes and can be denoted as \(H_{1}\) or \(H_{a}\). For the above-mentioned example, the alternative hypothesis would be that girls are shorter than boys at the age of 5.

Hypothesis Testing P Value

In hypothesis testing, the p value is used to indicate whether the results obtained after conducting a test are statistically significant or not. It is the probability, computed under the assumption that the null hypothesis is true, of obtaining a result at least as extreme as the one observed. This value is always a number between 0 and 1. The p value is compared to an alpha level, \(\alpha\), also called the significance level. The alpha level can be defined as the acceptable risk of incorrectly rejecting the null hypothesis. The alpha level is usually chosen between 1% and 5%.
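As an illustration, the p value for a z statistic can be computed from the standard normal distribution. The sketch below assumes a z test and uses `statistics.NormalDist` from the Python standard library; the function name `p_value` and the z statistic of 2.25 are made up for the example.

```python
from statistics import NormalDist

def p_value(z, tail="two"):
    """p value for a z statistic under the standard normal null distribution."""
    cdf = NormalDist().cdf
    if tail == "right":
        return 1 - cdf(z)
    if tail == "left":
        return cdf(z)
    if tail == "two":
        return 2 * (1 - cdf(abs(z)))
    raise ValueError("tail must be 'left', 'right', or 'two'")

alpha = 0.05
p = p_value(2.25)
print(round(p, 4), p < alpha)  # 0.0244 True
```

Since the p value is smaller than \(\alpha\), the null hypothesis would be rejected in this hypothetical case.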

Hypothesis Testing Critical region

All sets of values that lead to rejecting the null hypothesis lie in the critical region. Furthermore, the value that separates the critical region from the non-critical region is known as the critical value.

Hypothesis Testing Formula

Depending upon the type of data available and the size, different types of hypothesis testing are used to determine whether the null hypothesis can be rejected or not. The hypothesis testing formulas for some important test statistics are given below:

  • z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\). \(\overline{x}\) is the sample mean, \(\mu\) is the population mean, \(\sigma\) is the population standard deviation and n is the size of the sample.
  • t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\). s is the sample standard deviation.
  • \(\chi ^{2} = \sum \frac{(O_{i}-E_{i})^{2}}{E_{i}}\). \(O_{i}\) is the observed value and \(E_{i}\) is the expected value.

We will learn more about these test statistics in the upcoming section.

Types of Hypothesis Testing

Selecting the correct test for performing hypothesis testing can be confusing. These tests are used to determine a test statistic on the basis of which the null hypothesis can either be rejected or not rejected. Some of the important tests used for hypothesis testing are given below.

Hypothesis Testing Z Test

A z test is a method of hypothesis testing that is used for a large sample size (n ≥ 30). It is used to determine whether there is a difference between the population mean and the sample mean when the population standard deviation is known. It can also be used to compare the means of two samples. The formulas for the z test statistic are given as follows:

  • One sample: z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\).
  • Two samples: z = \(\frac{(\overline{x_{1}}-\overline{x_{2}})-(\mu_{1}-\mu_{2})}{\sqrt{\frac{\sigma_{1}^{2}}{n_{1}}+\frac{\sigma_{2}^{2}}{n_{2}}}}\).
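The two formulas translate directly into code. The following is a minimal sketch; the function names and the numbers are hypothetical, and `mu_diff` stands for the value of \(\mu_{1}-\mu_{2}\) assumed under the null hypothesis (often 0).

```python
from math import sqrt

def one_sample_z(x_bar, mu, sigma, n):
    """z statistic comparing a sample mean with a population mean."""
    return (x_bar - mu) / (sigma / sqrt(n))

def two_sample_z(x_bar1, x_bar2, sigma1, sigma2, n1, n2, mu_diff=0.0):
    """z for the difference of two sample means; mu_diff is mu1 - mu2 under H0."""
    standard_error = sqrt(sigma1**2 / n1 + sigma2**2 / n2)
    return ((x_bar1 - x_bar2) - mu_diff) / standard_error

# Hypothetical numbers
print(one_sample_z(52, 50, 10, 100))         # 2.0
print(two_sample_z(52, 50, 10, 10, 50, 50))  # 1.0
```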

Hypothesis Testing t Test

The t test is another method of hypothesis testing that is used for a small sample size (n < 30). It is also used to compare the sample mean and population mean. However, the population standard deviation is not known. Instead, the sample standard deviation is used. The means of two samples can also be compared using the t test.

  • One sample: t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\).
  • Two samples: t = \(\frac{(\overline{x_{1}}-\overline{x_{2}})-(\mu_{1}-\mu_{2})}{\sqrt{\frac{s_{1}^{2}}{n_{1}}+\frac{s_{2}^{2}}{n_{2}}}}\).
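The one-sample formula can be sketched with the Python standard library, where `statistics.stdev` computes the sample standard deviation with an \(n - 1\) denominator. The function name `one_sample_t` and the data are made up for illustration.

```python
from math import sqrt
from statistics import mean, stdev

def one_sample_t(sample, mu):
    """t statistic for a single sample; stdev uses the n - 1 denominator."""
    n = len(sample)
    return (mean(sample) - mu) / (stdev(sample) / sqrt(n))

# Hypothetical data
sample = [2, 4, 4, 4, 5, 5, 7, 9]
print(round(one_sample_t(sample, 4), 3))  # 1.323
```

The resulting statistic would then be compared with a critical value from the t distribution with \(n - 1\) degrees of freedom, which is 7 for this sample.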

Hypothesis Testing Chi Square

The Chi square test is a hypothesis testing method that is used to check whether the variables in a population are independent or not. It is used when the test statistic is chi-squared distributed.
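For a test of independence, the expected count in each cell is usually taken to be (row total × column total) / grand total. The sketch below computes the chi-square statistic under that assumption; the function name and the table of counts are hypothetical.

```python
def chi_square_statistic(table):
    """Chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat

# Hypothetical 2 x 2 table of observed counts
print(chi_square_statistic([[20, 30], [30, 20]]))  # 4.0
```

The statistic is compared with a critical value from the chi-square distribution with (rows − 1)(columns − 1) degrees of freedom, which is 1 for a 2 × 2 table.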

One Tailed Hypothesis Testing

One tailed hypothesis testing is done when the rejection region is only in one direction. It can also be known as directional hypothesis testing because the effects can be tested in one direction only. This type of testing is further classified into the right tailed test and left tailed test.

Right Tailed Hypothesis Testing

The right tail test is also known as the upper tail test. This test is used to check whether the population parameter is greater than some value. The null and alternative hypotheses for this test are given as follows:

\(H_{0}\): The population parameter is ≤ some value

\(H_{1}\): The population parameter is > some value.

If the test statistic has a greater value than the critical value, then the null hypothesis is rejected.

[Figure: Right Tail Hypothesis Testing]

Left Tailed Hypothesis Testing

The left tail test is also known as the lower tail test. It is used to check whether the population parameter is less than some value. The hypotheses for this hypothesis testing can be written as follows:

\(H_{0}\): The population parameter is ≥ some value

\(H_{1}\): The population parameter is < some value.

The null hypothesis is rejected if the test statistic has a value less than the critical value.

[Figure: Left Tail Hypothesis Testing]

Two Tailed Hypothesis Testing

In this hypothesis testing method, the critical region lies on both sides of the sampling distribution. It is also known as a non-directional hypothesis testing method. The two-tailed test is used to determine whether the population parameter differs from some value. The hypotheses can be set up as follows:

\(H_{0}\): the population parameter = some value

\(H_{1}\): the population parameter ≠ some value

The null hypothesis is rejected if the absolute value of the test statistic is greater than the critical value, that is, if the test statistic falls in either tail.
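The three rejection rules can be summarised in one small function. This is a sketch under the usual convention that the critical value is passed as a positive magnitude and that a two-tailed test rejects when the absolute value of the statistic exceeds it. The function name and the `tail` labels are hypothetical.

```python
def reject_null(test_statistic, critical_value, tail):
    """Decide whether to reject H0 given a positive critical value.

    tail: "right", "left", or "two".
    """
    if tail == "right":
        return test_statistic > critical_value
    if tail == "left":
        return test_statistic < -critical_value
    if tail == "two":
        return abs(test_statistic) > critical_value
    raise ValueError("tail must be 'right', 'left', or 'two'")

print(reject_null(2.10, 1.645, "right"))   # statistic lies beyond the upper cutoff
print(reject_null(-1.20, 1.645, "left"))   # statistic does not reach the lower cutoff
print(reject_null(-2.30, 1.960, "two"))    # |statistic| exceeds the cutoff
```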


Hypothesis Testing Steps

Hypothesis testing can be easily performed in five simple steps. The most important step is to correctly set up the hypotheses and identify the right method for hypothesis testing. The basic steps to perform hypothesis testing are as follows:

  • Step 1: Set up the null hypothesis by correctly identifying whether it is the left-tailed, right-tailed, or two-tailed hypothesis testing.
  • Step 2: Set up the alternative hypothesis.
  • Step 3: Choose the correct significance level, \(\alpha\), and find the critical value.
  • Step 4: Calculate the correct test statistic (z, t or \(\chi^{2}\)) and p-value.
  • Step 5: Compare the test statistic with the critical value or compare the p-value with \(\alpha\) to arrive at a conclusion. In other words, decide if the null hypothesis is to be rejected or not.

Hypothesis Testing Example

The best way to solve a problem on hypothesis testing is by applying the 5 steps mentioned in the previous section. Suppose a researcher claims that the average weight of men is greater than 100 kg, with a standard deviation of 15 kg. A sample of 30 men is chosen, and their average weight is 112.5 kg. Using hypothesis testing, check if there is enough evidence to support the researcher's claim. The confidence level is given as 95%.

Step 1: This is an example of a right-tailed test. Set up the null hypothesis as \(H_{0}\): \(\mu\) = 100.

Step 2: The alternative hypothesis is given by \(H_{1}\): \(\mu\) > 100.

Step 3: As this is a one-tailed test, \(\alpha\) = 100% - 95% = 5%. This can be used to determine the critical value.

1 - \(\alpha\) = 1 - 0.05 = 0.95

0.95 gives the required area under the curve. Now using a normal distribution table, the area 0.95 is at z = 1.645. A similar process can be followed for a t-test. The only additional requirement is to calculate the degrees of freedom given by n - 1.

Step 4: Calculate the z test statistic. The z test is used because the sample size is 30 and the population standard deviation is known, along with the sample and population means.

z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\).

\(\mu\) = 100, \(\overline{x}\) = 112.5, n = 30, \(\sigma\) = 15

z = \(\frac{112.5-100}{\frac{15}{\sqrt{30}}}\) = 4.56

Step 5: Conclusion. As 4.56 > 1.645, the null hypothesis can be rejected.
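The five steps of this example can be reproduced with Python's standard library. This is a sketch that assumes a standard normal reference distribution, as in the example. The variable names are illustrative, and the p-value is an extra that the worked example does not compute.

```python
import math
from statistics import NormalDist

# Values from the worked example
mu0 = 100        # hypothesized population mean
x_bar = 112.5    # sample mean
sigma = 15       # population standard deviation
n = 30           # sample size
alpha = 0.05     # significance level (100% - 95%)

standard_normal = NormalDist()  # mean 0, standard deviation 1

z = (x_bar - mu0) / (sigma / math.sqrt(n))
z_critical = standard_normal.inv_cdf(1 - alpha)   # right-tailed cutoff
p_value = 1 - standard_normal.cdf(z)              # area to the right of z
reject = z > z_critical

print(f"z = {z:.2f}, critical value = {z_critical:.3f}, reject H0: {reject}")
```

Comparing the p-value with \(\alpha\) leads to the same decision as comparing z with the critical value.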

Hypothesis Testing and Confidence Intervals

Confidence levels are closely tied to hypothesis testing because the alpha level can be determined from a given confidence level. Suppose the confidence level is given as 95%. Subtracting it from 100% gives 100% - 95% = 5%, so \(\alpha\) = 0.05. In a one-tailed test, the entire 0.05 lies in a single tail. In a two-tailed test, \(\alpha\) is split equally between the two tails, so the area of each rejection region is 0.05 / 2 = 0.025.
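The relationship between the confidence level, the tail area, and the critical value can be checked numerically. This sketch assumes a z test (standard normal distribution); the function name `critical_z` is hypothetical.

```python
from statistics import NormalDist

def critical_z(confidence_level, two_tailed):
    """Positive z critical value for a given confidence level."""
    alpha = 1 - confidence_level
    # A two-tailed test places alpha / 2 in each tail
    tail_area = alpha / 2 if two_tailed else alpha
    return NormalDist().inv_cdf(1 - tail_area)

one_tailed = critical_z(0.95, two_tailed=False)
two_tailed = critical_z(0.95, two_tailed=True)
print(f"one-tailed: {one_tailed:.3f}, two-tailed: {two_tailed:.3f}")
```

For a 95% confidence level this gives roughly 1.645 for a one-tailed test and 1.960 for a two-tailed test, matching the values read from a normal distribution table.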


Important Notes on Hypothesis Testing

  • Hypothesis testing is a technique that is used to verify whether the results of an experiment are statistically significant.
  • It involves the setting up of a null hypothesis and an alternate hypothesis.
  • There are three types of tests that can be conducted under hypothesis testing - z test, t test, and chi square test.
  • Hypothesis testing can be classified as right tail, left tail, and two tail tests.

Examples on Hypothesis Testing

  • Example 1: The average weight of a dumbbell in a gym is 90 lbs. However, a physical trainer believes that the average weight might be higher. A random sample of 5 dumbbells has an average weight of 110 lbs and a standard deviation of 18 lbs. Using hypothesis testing, check if the physical trainer's claim can be supported at a 95% confidence level. Solution: As the sample size is less than 30 and the population standard deviation is unknown, the t-test is used. \(H_{0}\): \(\mu\) = 90, \(H_{1}\): \(\mu\) > 90. \(\overline{x}\) = 110, \(\mu\) = 90, n = 5, s = 18, \(\alpha\) = 0.05. Using the t-distribution table with n - 1 = 4 degrees of freedom, the critical value is 2.132. t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\) = \(\frac{110-90}{\frac{18}{\sqrt{5}}}\) = 2.485. As 2.485 > 2.132, the null hypothesis is rejected. Answer: The average weight of the dumbbells may be greater than 90 lbs.
  • Example 2: The average score on a test is 80 with a standard deviation of 10. With a new teaching curriculum introduced, it is believed that this score will change. On randomly testing the scores of 36 students, the mean was found to be 88. With a 0.05 significance level, is there any evidence to support this claim? Solution: This is an example of two-tail hypothesis testing. The z test will be used. \(H_{0}\): \(\mu\) = 80, \(H_{1}\): \(\mu\) ≠ 80. \(\overline{x}\) = 88, \(\mu\) = 80, n = 36, \(\sigma\) = 10. \(\alpha\) = 0.05, so the area in each tail is 0.05 / 2 = 0.025. The critical value using the normal distribution table is 1.96. z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\) = \(\frac{88-80}{\frac{10}{\sqrt{36}}}\) = 4.8. As 4.8 > 1.96, the null hypothesis is rejected. Answer: There is a difference in the scores after the new curriculum was introduced.
  • Example 3: The average score of a class is 90. However, a teacher believes that the average score might be lower. The scores of 6 students were randomly measured. The mean was 82 with a standard deviation of 18. With a 0.05 significance level, use hypothesis testing to check if this claim is true. Solution: The t test will be used. \(H_{0}\): \(\mu\) = 90, \(H_{1}\): \(\mu\) < 90. \(\overline{x}\) = 82, \(\mu\) = 90, n = 6, s = 18. The critical value from the t table with n - 1 = 5 degrees of freedom is -2.015. t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\) = \(\frac{82-90}{\frac{18}{\sqrt{6}}}\) = -1.089. As -1.089 > -2.015, we fail to reject the null hypothesis. Answer: There is not enough evidence to support the claim.
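The arithmetic in the three examples can be re-checked with a short script. This is only a sketch: the critical values are copied from the tables quoted in the examples rather than computed, Example 2 uses n = 36 and Example 3 uses a sample mean of 82 as in their worked calculations, and the helper names are hypothetical.

```python
import math

def standardized(sample_mean, pop_mean, sd, n):
    """Shared form of the z and t statistics: (x_bar - mu) / (sd / sqrt(n))."""
    return (sample_mean - pop_mean) / (sd / math.sqrt(n))

def reject_null(stat, critical, tail):
    """critical is a positive magnitude taken from a z or t table."""
    if tail == "right":
        return stat > critical
    if tail == "left":
        return stat < -critical
    return abs(stat) > critical  # two-tailed

cases = [
    ("Example 1", standardized(110, 90, 18, 5), 2.132, "right"),
    ("Example 2", standardized(88, 80, 10, 36), 1.960, "two"),
    ("Example 3", standardized(82, 90, 18, 6), 2.015, "left"),
]

results = {}
for label, stat, critical, tail in cases:
    results[label] = (stat, reject_null(stat, critical, tail))
    print(f"{label}: statistic = {stat:.3f}, reject H0: {results[label][1]}")
```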


FAQs on Hypothesis Testing

What is Hypothesis Testing?

Hypothesis testing in statistics is a tool that is used to make inferences about the population data. It is also used to check if the results of an experiment are valid.

What is the z Test in Hypothesis Testing?

The z test in hypothesis testing is used to find the z test statistic for normally distributed data. The z test is used when the standard deviation of the population is known and the sample size is greater than or equal to 30.

What is the t Test in Hypothesis Testing?

The t test in hypothesis testing is used when the test statistic follows a Student's t-distribution. It is used when the sample size is less than 30 and the standard deviation of the population is not known.

What is the formula for z test in Hypothesis Testing?

The formula for a one sample z test in hypothesis testing is z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\) and for two samples is z = \(\frac{(\overline{x_{1}}-\overline{x_{2}})-(\mu_{1}-\mu_{2})}{\sqrt{\frac{\sigma_{1}^{2}}{n_{1}}+\frac{\sigma_{2}^{2}}{n_{2}}}}\).

What is the p Value in Hypothesis Testing?

The p value helps to determine if the test results are statistically significant or not. In hypothesis testing, the null hypothesis can either be rejected or not rejected based on the comparison between the p value and the alpha level.

What is One Tail Hypothesis Testing?

When the rejection region is only on one side of the distribution curve then it is known as one tail hypothesis testing. The right tail test and the left tail test are two types of directional hypothesis testing.

What is the Alpha Level in Two Tail Hypothesis Testing?

In two tail hypothesis testing, the overall alpha level is unchanged, but it is split between the two rejection regions in the curve, so the area in each tail is \(\alpha\) / 2.

Definition Of Hypothesis

Hypothesis is the part of a conditional statement just after the word if.

Examples of Hypothesis

In the conditional, "If all four sides of a quadrilateral measure the same, then the quadrilateral is a square" the hypothesis is "all four sides of a quadrilateral measure the same".


Solved Example on Hypothesis

Ques: In the example above, is the hypothesis "all four sides of a quadrilateral measure the same" always, never, or sometimes true?

A. always B. never C. sometimes Correct Answer: C

Step 1: The hypothesis is sometimes true, because it holds only for a square and a rhombus, not for other quadrilaterals such as a general rectangle, parallelogram, or trapezoid.
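To make the "sometimes true" answer concrete, the sketch below evaluates the hypothesis and the conclusion of the conditional on a few sample quadrilaterals. The `Quadrilateral` class, the side lengths, and the angles are all hypothetical illustrations.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Quadrilateral:
    name: str
    sides: tuple    # four side lengths
    angles: tuple   # four interior angles in degrees

def hypothesis(q):
    """The 'if' part: all four sides measure the same."""
    return len(set(q.sides)) == 1

def conclusion(q):
    """The 'then' part: the quadrilateral is a square."""
    return hypothesis(q) and all(angle == 90 for angle in q.angles)

shapes = [
    Quadrilateral("square", (2, 2, 2, 2), (90, 90, 90, 90)),
    Quadrilateral("rhombus", (2, 2, 2, 2), (60, 120, 60, 120)),
    Quadrilateral("rectangle", (2, 3, 2, 3), (90, 90, 90, 90)),
]

hypothesis_holds = [q.name for q in shapes if hypothesis(q)]
counterexamples = [q.name for q in shapes if hypothesis(q) and not conclusion(q)]
print("hypothesis holds for:", hypothesis_holds)
print("hypothesis holds but conclusion fails for:", counterexamples)
```

The hypothesis holds for some shapes and not others, and the rhombus shows that a true hypothesis does not by itself guarantee the conclusion.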


How to Write a Great Hypothesis

Hypothesis Definition, Format, Examples, and Tips

Kendra Cherry, MS, is a psychosocial rehabilitation specialist, psychology educator, and author of the "Everything Psychology Book."


Amy Morin, LCSW, is a psychotherapist and international bestselling author. Her books, including "13 Things Mentally Strong People Don't Do," have been translated into more than 40 languages. Her TEDx talk,  "The Secret of Becoming Mentally Strong," is one of the most viewed talks of all time.


A hypothesis is a tentative statement about the relationship between two or more variables. It is a specific, testable prediction about what you expect to happen in a study. It is a preliminary answer to your question that helps guide the research process.

Consider a study designed to examine the relationship between sleep deprivation and test performance. The hypothesis might be: "This study is designed to assess the hypothesis that sleep-deprived people will perform worse on a test than individuals who are not sleep-deprived."

At a Glance

A hypothesis is crucial to scientific research because it offers a clear direction for what the researchers are looking to find. This allows them to design experiments to test their predictions and add to our scientific knowledge about the world. This article explores how a hypothesis is used in psychology research, how to write a good hypothesis, and the different types of hypotheses you might use.

The Hypothesis in the Scientific Method

In the scientific method, whether it involves research in psychology, biology, or some other area, a hypothesis represents what the researchers think will happen in an experiment. The scientific method involves the following steps:

  • Forming a question
  • Performing background research
  • Creating a hypothesis
  • Designing an experiment
  • Collecting data
  • Analyzing the results
  • Drawing conclusions
  • Communicating the results

The hypothesis is a prediction, but it involves more than a guess. Most of the time, the hypothesis begins with a question which is then explored through background research. At this point, researchers then begin to develop a testable hypothesis.

Unless you are creating an exploratory study, your hypothesis should always explain what you expect to happen.

In a study exploring the effects of a particular drug, the hypothesis might be that researchers expect the drug to have some type of effect on the symptoms of a specific illness. In psychology, the hypothesis might focus on how a certain aspect of the environment might influence a particular behavior.

Remember, a hypothesis does not have to be correct. While the hypothesis predicts what the researchers expect to see, the goal of the research is to determine whether this guess is right or wrong. When conducting an experiment, researchers might explore numerous factors to determine which ones might contribute to the ultimate outcome.

In many cases, researchers may find that the results of an experiment do not support the original hypothesis. When writing up these results, the researchers might suggest other options that should be explored in future studies.

In many cases, researchers might draw a hypothesis from a specific theory or build on previous research. For example, prior research has shown that stress can impact the immune system. So a researcher might hypothesize: "People with high-stress levels will be more likely to contract a common cold after being exposed to the virus than people who have low-stress levels."

In other instances, researchers might look at commonly held beliefs or folk wisdom. "Birds of a feather flock together" is one example of a folk adage that a psychologist might try to investigate. The researcher might pose a specific hypothesis that "People tend to select romantic partners who are similar to them in interests and educational level."

Elements of a Good Hypothesis

So how do you write a good hypothesis? When trying to come up with a hypothesis for your research or experiments, ask yourself the following questions:

  • Is your hypothesis based on your research on a topic?
  • Can your hypothesis be tested?
  • Does your hypothesis include independent and dependent variables?

Before you come up with a specific hypothesis, spend some time doing background research. Once you have completed a literature review, start thinking about potential questions you still have. Pay attention to the discussion section in the journal articles you read. Many authors will suggest questions that still need to be explored.

How to Formulate a Good Hypothesis

To form a hypothesis, you should take these steps:

  • Collect as many observations about a topic or problem as you can.
  • Evaluate these observations and look for possible causes of the problem.
  • Create a list of possible explanations that you might want to explore.
  • After you have developed some possible hypotheses, think of ways that you could confirm or disprove each hypothesis through experimentation. This is known as falsifiability.

In the scientific method, falsifiability is an important part of any valid hypothesis. In order to test a claim scientifically, it must be possible that the claim could be proven false.

Students sometimes confuse the idea of falsifiability with the idea that it means that something is false, which is not the case. What falsifiability means is that if something was false, then it is possible to demonstrate that it is false.

One of the hallmarks of pseudoscience is that it makes claims that cannot be refuted or proven false.

The Importance of Operational Definitions

A variable is a factor or element that can be changed and manipulated in ways that are observable and measurable. However, the researcher must also define how the variable will be manipulated and measured in the study.

Operational definitions are specific definitions for all relevant factors in a study. This process helps make vague or ambiguous concepts detailed and measurable.

For example, a researcher might operationally define the variable "test anxiety" as the results of a self-report measure of anxiety experienced during an exam. A "study habits" variable might be defined by the amount of studying that actually occurs as measured by time.

These precise descriptions are important because many things can be measured in various ways. Clearly defining these variables and how they are measured helps ensure that other researchers can replicate your results.


One of the basic principles of any type of scientific research is that the results must be replicable.

Replication means repeating an experiment in the same way to produce the same results. By clearly detailing the specifics of how the variables were measured and manipulated, other researchers can better understand the results and repeat the study if needed.

Some variables are more difficult than others to define. For example, how would you operationally define a variable such as aggression? For obvious ethical reasons, researchers cannot create a situation in which a person behaves aggressively toward others.

To measure this variable, the researcher must devise a measurement that assesses aggressive behavior without harming others. The researcher might utilize a simulated task to measure aggressiveness in this situation.

Hypothesis Checklist

  • Does your hypothesis focus on something that you can actually test?
  • Does your hypothesis include both an independent and dependent variable?
  • Can you manipulate the variables?
  • Can your hypothesis be tested without violating ethical standards?

The hypothesis you use will depend on what you are investigating and hoping to find. Some of the main types of hypotheses that you might use include:

  • Simple hypothesis: This type of hypothesis suggests there is a relationship between one independent variable and one dependent variable.
  • Complex hypothesis: This type suggests a relationship between three or more variables, such as two independent variables and one dependent variable.
  • Null hypothesis: This hypothesis suggests no relationship exists between two or more variables.
  • Alternative hypothesis: This hypothesis states the opposite of the null hypothesis.
  • Statistical hypothesis: This hypothesis uses statistical analysis to evaluate a representative population sample and then generalizes the findings to the larger group.
  • Logical hypothesis: This hypothesis assumes a relationship between variables without collecting data or evidence.

A hypothesis often follows a basic format of "If {this happens} then {this will happen}." One way to structure your hypothesis is to describe what will happen to the dependent variable if you change the independent variable.

The basic format might be: "If {these changes are made to a certain independent variable}, then we will observe {a change in a specific dependent variable}."

A few examples of simple hypotheses:

  • "Students who eat breakfast will perform better on a math exam than students who do not eat breakfast."
  • "Students who experience test anxiety before an English exam will get lower scores than students who do not experience test anxiety."
  • "Motorists who talk on the phone while driving will be more likely to make errors on a driving course than those who do not talk on the phone."
  • "Children who receive a new reading intervention will have higher reading scores than students who do not receive the intervention."

Examples of a complex hypothesis include:

  • "People with high-sugar diets and sedentary activity levels are more likely to develop depression."
  • "Younger people who are regularly exposed to green, outdoor areas have better subjective well-being than older adults who have limited exposure to green spaces."

Examples of a null hypothesis include:

  • "There is no difference in anxiety levels between people who take St. John's wort supplements and those who do not."
  • "There is no difference in scores on a memory recall task between children and adults."
  • "There is no difference in aggression levels between children who play first-person shooter games and those who do not."

Examples of an alternative hypothesis:

  • "People who take St. John's wort supplements will have less anxiety than those who do not."
  • "Adults will perform better on a memory task than children."
  • "Children who play first-person shooter games will show higher levels of aggression than children who do not." 

Collecting Data on Your Hypothesis

Once a researcher has formed a testable hypothesis, the next step is to select a research design and start collecting data. The research method depends largely on exactly what they are studying. There are two basic types of research methods: descriptive research and experimental research.

Descriptive Research Methods

Descriptive research methods such as case studies, naturalistic observations, and surveys are often used when conducting an experiment is difficult or impossible. These methods are best used to describe different aspects of a behavior or psychological phenomenon.

Once a researcher has collected data using descriptive methods, a correlational study can examine how the variables are related. This research method might be used to investigate a hypothesis that is difficult to test experimentally.

Experimental Research Methods

Experimental methods are used to demonstrate causal relationships between variables. In an experiment, the researcher systematically manipulates a variable of interest (known as the independent variable) and measures the effect on another variable (known as the dependent variable).

Unlike correlational studies, which can only be used to determine if there is a relationship between two variables, experimental methods can be used to determine the actual nature of the relationship: whether changes in one variable actually cause another to change.

The hypothesis is a critical part of any scientific exploration. It represents what researchers expect to find in a study or experiment. In situations where the hypothesis is unsupported by the research, the research still has value. Such research helps us better understand how different aspects of the natural world relate to one another. It also helps us develop new hypotheses that can then be tested in the future.


By Kendra Cherry, MSEd

Hypothesis Definition (Science)


A hypothesis is an explanation that is proposed for a phenomenon. Formulating a hypothesis is a step of the scientific method.

Alternate Spellings: plural: hypotheses

Examples: Upon observing that a lake appears blue under a blue sky, you might propose the hypothesis that the lake is blue because it is reflecting the sky. One alternate hypothesis would be that the lake is blue because water is blue.

Hypothesis Versus Theory

Although in common usage the terms hypothesis and theory are used interchangeably, the two words mean something different from each other in science. Like a hypothesis, a theory is testable and may be used to make predictions. However, a theory has been tested using the scientific method many times. Testing a hypothesis may, over time, lead to the formulation of a theory.


Hypothesis test

A significance test, also referred to as a statistical hypothesis test, is a method of statistical inference in which observed data is compared to a claim (referred to as a hypothesis) in order to assess the truth of the claim. For example, one might wonder whether age affects the number of apples a person can eat, and may use a significance test to determine whether there is any evidence to suggest that it does.

Generally, the process of statistical hypothesis testing involves the following steps:

  • State the null hypothesis.
  • State the alternative hypothesis.
  • Select the appropriate test statistic and select a significance level.
  • Compute the observed value of the test statistic and its corresponding p-value.
  • Reject the null hypothesis in favor of the alternative hypothesis, or do not reject the null hypothesis.

The null hypothesis

The null hypothesis, \(H_{0}\), is the claim that is being tested in a statistical hypothesis test. It typically is a statement that there is no difference between the populations being studied, or that there is no evidence to support a claim being made. For example, "age has no effect on the number of apples a person can eat."

A significance test is designed to test the evidence against the null hypothesis. This is because it is easier to prove that a claim is false than to prove that it is true; demonstrating that the claim is false in one case is sufficient, while proving that it is true requires that the claim be true in all cases.

The alternative hypothesis

The alternative hypothesis is the opposite of the null hypothesis in that it is a statement that there is some difference between the populations being studied. For example, "younger people can eat more apples than older people."

The alternative hypothesis is typically the hypothesis that researchers are trying to prove. A significance test is meant to determine whether there is sufficient evidence to reject the null hypothesis in favor of the alternative hypothesis. Note that the result of a significance test should either be to reject the null hypothesis in favor of the alternative hypothesis, or to not reject the null hypothesis. The result should not be to reject the alternative hypothesis or to accept the null hypothesis.

Test statistics and significance level

A test statistic is a statistic that is calculated as part of hypothesis testing that compares the distribution of observed data to the expected distribution, based on the null hypothesis. Examples of test statistics include the Z-score, T-statistic, F-statistic, and the Chi-square statistic. The test statistic used is dependent on the significance test used, which is dependent on the type of data collected and the type of relationship to be tested.

In many cases, the chosen significance level is 0.05, though 0.01 is also used. A significance level of 0.05 indicates that there is a 5% chance of rejecting the null hypothesis when the null hypothesis is actually true. Thus, a smaller selected significance level will require more evidence if the null hypothesis is to be rejected in favor of the alternative hypothesis.
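The meaning of the significance level can be illustrated with a small simulation. This sketch assumes a two-tailed z test on samples drawn from a population where the null hypothesis is actually true (mean 0, standard deviation 1). The function name, sample size, trial count, and seed are arbitrary choices.

```python
import math
import random
from statistics import NormalDist, fmean

def type_one_error_rate(alpha=0.05, n=30, trials=4000, seed=0):
    """Fraction of simulated samples in which a true H0 is rejected."""
    rng = random.Random(seed)
    z_critical = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed cutoff
    rejections = 0
    for _ in range(trials):
        # H0 is true by construction: the population mean really is 0
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        z = (fmean(sample) - 0.0) / (1.0 / math.sqrt(n))
        if abs(z) > z_critical:
            rejections += 1
    return rejections / trials

rate = type_one_error_rate()
print(f"observed rejection rate under a true H0: {rate:.3f}")
```

The observed rate should land close to the chosen \(\alpha\), and lowering \(\alpha\) lowers it accordingly.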

After the test statistic is computed, the p-value can be determined from it. The p-value is the probability of obtaining test results that are at least as extreme as the observed results, under the assumption that the null hypothesis is correct. The smaller the p-value, the less compatible the observed data are with the null hypothesis. For example, a p-value of 0.01 means that, if the null hypothesis were true, a result at least as extreme as the one observed would occur 1% of the time; a p-value of 0.90 means that such a result would occur 90% of the time. The p-value is not the probability that the null hypothesis is true.
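For a z statistic, the p-value can be computed directly from the standard normal distribution. This sketch assumes the test statistic is standard normal under the null hypothesis; the function name and `tail` labels are hypothetical.

```python
from statistics import NormalDist

def p_value_from_z(z, tail):
    """Probability, under H0, of a result at least as extreme as z."""
    cdf = NormalDist().cdf  # standard normal cumulative distribution
    if tail == "right":
        return 1 - cdf(z)
    if tail == "left":
        return cdf(z)
    if tail == "two":
        return 2 * (1 - cdf(abs(z)))
    raise ValueError("tail must be 'right', 'left', or 'two'")

for z_value, tail in [(1.645, "right"), (-1.645, "left"), (1.96, "two")]:
    print(f"z = {z_value:+.3f} ({tail}-tailed): p = {p_value_from_z(z_value, tail):.4f}")
```

Each of these three cases gives a p-value of about 0.05, which is why 1.645 and 1.96 are the familiar critical values at that significance level.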

A p-value is strongly affected by sample size. When a true difference exists, larger samples tend to produce smaller p-values, even if the difference between populations is too small to be meaningful. On the other hand, if a sample size is too small, a meaningful difference may not be detected.
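The effect of sample size can be seen by holding a small observed difference fixed and varying n. This is a sketch assuming a two-tailed z test; the means, standard deviation, and sample sizes are made-up values.

```python
import math
from statistics import NormalDist

def two_sided_p(sample_mean, pop_mean, sigma, n):
    """Two-tailed p-value of a one-sample z test."""
    z = (sample_mean - pop_mean) / (sigma / math.sqrt(n))
    return 2 * (1 - NormalDist().cdf(abs(z)))

# Same tiny difference (100.5 vs 100) observed at increasing sample sizes
sample_sizes = (30, 300, 3000, 30000)
p_values = {n: two_sided_p(100.5, 100, 15, n) for n in sample_sizes}

for n, p in p_values.items():
    print(f"n = {n:>5}: p = {p:.4g}")
```

The same half-unit difference is far from significant at n = 30 but becomes highly significant at n = 30000, even though its practical importance has not changed.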

The last step in a significance test is to determine whether the p-value provides evidence that the null hypothesis should be rejected in favor of the alternative hypothesis. This is based on the selected significance level. If the p-value is less than or equal to the selected significance level, the null hypothesis is rejected in favor of the alternative hypothesis, and the result is deemed statistically significant. If the p-value is greater than the selected significance level, the null hypothesis is not rejected, and the result is deemed not statistically significant.


  1. 13 Different Types of Hypothesis (2024)

    hypothesis definition for math

  2. What is an Hypothesis

    hypothesis definition for math

  3. hypothesis math definition example

    hypothesis definition for math

  4. Hypothesis

    hypothesis definition for math

  5. Research Hypothesis: Definition, Types, Examples and Quick Tips

    hypothesis definition for math

  6. Hypothesis Meaning

    hypothesis definition for math



  2. What Is A Hypothesis?

  3. Hypothesis|Meaning|Definition|Characteristics|Source|Types|Sociology|Research Methodology|Notes

  4. Differences Between Hypothesis Formulation and Hypothesis Development

  5. Hypothesis

  6. What does hypothesis mean?


  1. Hypothesis Definition (Illustrated Mathematics Dictionary)

    Hypothesis. A statement that could be true, which might then be tested. Example: Sam has a hypothesis that "large dogs are better at catching tennis balls than small dogs". We can test that hypothesis by having hundreds of different sized dogs try to catch tennis balls. Sometimes the hypothesis won't be tested, it is simply a good explanation ...

  2. Hypothesis -- from Wolfram MathWorld

    A hypothesis is a proposition that is consistent with known data, but has been neither verified nor shown to be false. In statistics, a hypothesis (sometimes called a statistical hypothesis) refers to a statement on which hypothesis testing will be based. Particularly important statistical hypotheses include the null hypothesis and alternative hypothesis. In symbolic logic, a hypothesis is the ...

  3. Hypothesis Definition

    Types of Hypothesis. The hypothesis can be broadly classified into different types. They are: Simple Hypothesis. A simple hypothesis is a hypothesis that there exists a relationship between two variables. One is called a dependent variable, and the other is called an independent variable. Complex Hypothesis.

  4. 8.1: The Elements of Hypothesis Testing

    Definition: statistical procedure. Hypothesis testing is a statistical procedure in which a choice is made between a null hypothesis and an alternative hypothesis based on information in a sample. The end result of a hypotheses testing procedure is a choice of one of the following two possible conclusions: Reject H0.

  5. Understanding Hypotheses

    A hypothesis is a statement or idea which gives an explanation to a series of observations. Sometimes, following observation, a hypothesis will clearly need to be refined or rejected. This happens if a single contradictory observation occurs. For example, suppose that a child is trying to understand the concept of a dog.

  6. Examples of null and alternative hypotheses

    The null hypothesis is what happens at baseline. It is the uninteresting hypothesis--the boring hypothesis. Usually, it is the hypothesis that assumes no difference. It is the opposite of your research hypothesis. The alternative hypothesis--that is, the research hypothesis--is the idea, phenomenon, observation that you want to prove.

  7. Hypothesis Testing

    A hypothesis test is a statistical inference method used to test the significance of a proposed (hypothesized) relation between population statistics (parameters) and their corresponding sample estimators. In other words, hypothesis tests are used to determine if there is enough evidence in a sample to prove a hypothesis true for the entire population. The test considers two hypotheses: the ...

  8. 9.1 Null and Alternative Hypotheses

    The actual test begins by considering two hypotheses. They are called the null hypothesis and the alternative hypothesis. These hypotheses contain opposing viewpoints. H0, the null hypothesis: a statement of no difference between sample means or proportions or no difference between a sample mean or proportion and a population mean or proportion. In other words, the difference equals 0.

  9. Significance tests (hypothesis testing)

    Significance tests give us a formal process for using sample data to evaluate the likelihood of some claim about a population value. Learn how to conduct significance tests and calculate p-values to see how likely a sample result is to occur by random chance. You'll also see how we use p-values to make conclusions about hypotheses.

  10. Hypothesis

    Definition. A hypothesis is a claim or statement that makes sense in the context of some information or data at hand but hasn't been established as true or false through experimentation or proof. In mathematics, any statement or equation that describes some relationship between certain variables can be termed as hypothesis if it is consistent ...

  11. hypothesis

    hypothesis. In mathematics, a hypothesis is an unproven statement which is supported by all the available data and by many weaker results. An unproven mathematical statement is usually called a "conjecture", and while experimentation can sometimes produce millions of examples to support a conjecture, usually nothing short of a proof can ...

  12. What Is a Hypothesis Test?

    The null hypothesis significance testing (NHST) framework. The general situation is this: we want to find out about some aspect of the real world, and we do this by performing an experiment. From the data collected in the experiment, we want to make a deduction about reality, a process known as statistical inference.

  13. 9.1: Introduction to Hypothesis Testing

    In hypothesis testing, the goal is to see if there is sufficient statistical evidence to reject a presumed null hypothesis in favor of a conjectured alternative hypothesis. The null hypothesis is usually denoted \(H_0\) while the alternative hypothesis is usually denoted \(H_1\). A hypothesis test is a statistical decision; the conclusion will either be to reject the null hypothesis in favor ...

  14. Hypothesis

    A hypothesis (pl.: hypotheses) is a proposed explanation for a phenomenon. For a hypothesis to be a scientific hypothesis, the scientific method requires that one can test it. Scientists generally base scientific hypotheses on previous observations that cannot satisfactorily be explained with the available scientific theories.

  15. What Is a Hypothesis? The Scientific Method

    A hypothesis (plural hypotheses) is a proposed explanation for an observation. The definition depends on the subject. In science, a hypothesis is part of the scientific method. It is a prediction or explanation that is tested by an experiment. Observations and experiments may disprove a scientific hypothesis, but can never entirely prove one.

  16. 1.1: Statements and Conditional Statements

    The statement \(P\) is called the hypothesis of the conditional statement, and the statement \(Q\) is called the conclusion of the conditional statement. Since conditional statements are probably the most important type of statement in mathematics, we give a more formal definition.

  17. Introduction to Hypothesis Testing

    Step 3: Collect Data and Compute Sample Statistics. After collecting the data, we find the sample mean. Now we can compare the sample mean with the null hypothesis by computing a z-score that describes where the sample mean is located relative to the hypothesized population mean. We use the z-score formula. Step 4: Make a Decision.

  18. Hypothesis Testing

    Hypothesis testing is a technique that is used to verify whether the results of an experiment are statistically significant. It involves the setting up of a null hypothesis and an alternate hypothesis. There are three types of tests that can be conducted under hypothesis testing - z test, t test, and chi square test.

  19. Definition and examples of hypothesis

    Solution: Step 1: The hypothesis is sometimes true, because it is true only for a square and a rhombus, not for the other quadrilaterals (rectangle, parallelogram, or trapezoid). Hypothesis is the part of a conditional statement... Complete information about the hypothesis, definition of a hypothesis, examples of a hypothesis, step by step ...

  20. Hypothesis: Definition, Examples, and Types

    A hypothesis is a tentative statement about the relationship between two or more variables. It is a specific, testable prediction about what you expect to happen in a study. It is a preliminary answer to your question that helps guide the research process. Consider a study designed to examine the relationship between sleep deprivation and test ...

  21. Hypothesis Definition (Science)

    A hypothesis is an explanation that is proposed for a phenomenon. Formulating a hypothesis is a step of the scientific method. Examples: Upon observing that a lake appears blue under a blue sky, you might propose the hypothesis that the lake is blue because it is reflecting the sky. One alternate hypothesis would be that the lake is blue ...

  22. Hypothesis test

    Hypothesis test. A significance test, also referred to as a statistical hypothesis test, is a method of statistical inference in which observed data is compared to a claim (referred to as a hypothesis) in order to assess the truth of the claim. For example, one might wonder whether age affects the number of apples a person can eat, and may use a significance test to determine whether there is ...

  23. hypothesis

    Hypothesis. A hypothesis is the 'if' part (antecedent) of a conditional statement.
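
Several of the snippets above (items 4, 13, and 17) describe the same procedure: state a null hypothesis, compute a z-score that locates the sample mean relative to the hypothesized population mean, and then decide whether to reject the null hypothesis. The following is a minimal sketch of that procedure, not taken from any of the listed sources. It assumes a two-sided test with a known population standard deviation; the function name `z_test`, the significance level of 0.05, and the sample numbers are all made up for illustration.

```python
from math import sqrt
from statistics import NormalDist


def z_test(sample_mean, mu0, sigma, n, alpha=0.05):
    """Two-sided one-sample z-test (illustrative sketch).

    Assumes the population standard deviation `sigma` is known.
    H0: the population mean equals mu0.
    H1: the population mean differs from mu0.
    """
    # Standard error of the sample mean under H0.
    standard_error = sigma / sqrt(n)
    # z-score: how many standard errors the sample mean lies from mu0.
    z = (sample_mean - mu0) / standard_error
    # Two-sided p-value from the standard normal distribution.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    # Decision: reject H0 when the p-value is below alpha.
    reject_h0 = p_value < alpha
    return z, p_value, reject_h0


# Hypothetical data: 100 observations with mean 103, tested against mu0 = 100.
z, p_value, reject_h0 = z_test(sample_mean=103.0, mu0=100.0, sigma=15.0, n=100)
print(f"z = {z:.2f}, p = {p_value:.4f}, reject H0: {reject_h0}")
```

Note that a result of "do not reject" is not proof that the null hypothesis is true; it only means the sample did not provide enough evidence against it at the chosen significance level.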