Summarize this article:
Last updated on September 26, 2025
Binomial distribution refers to the probability of achieving a specific number of successes in a fixed number of independent trials. It follows a binomial distribution, a type of discrete probability distribution used for events with two possible outcomes.
The discrete probability distribution that an experiment results in only two possible outcomes, that is, success or failure, is known as the binomial distribution in probability theory and statistics. For example, when a coin is tossed, there are only 2 possible outcomes, which are heads or tails. Similarly, if a test is administered, there are only two possible outcomes: pass or fail. This distribution is also known as the binomial distribution.
The binomial distribution differs from the normal distribution in several key ways. The following table highlights the main differences and traits. Key differences between binomial and normal distributions.
Aspect |
Binomial Distribution |
Normal Distribution |
Type |
Discrete probability distribution |
Continuous probability distribution. |
Outcomes |
Two possible outcomes per trial (success or failure) |
A continuous range with an infinite number of possible outcomes. |
Parameters |
N (number of trials), p (probability of success) |
The mean (𝜇) and the standard deviation (σ) |
Shape |
Varies according to n and p; skewed unless n is large and p=0.5 |
Symmetrical bell-shaped curve |
Support |
x is an integer from 0 to n. |
Any real value between -∞ and +∞ can be represented by x. |
Mean |
𝜇=np |
𝜇 |
Variance |
σ2=np(1-p) |
σ2 |
Applicability |
Used to simulate the number of successes in a predetermined number of trials. |
Used to model continuous data around a central mean. |
Approximation |
When n is large and p is not close to 0 or 1, it approximates normal. |
Considered the limit of the binomial distribution as n becomes large and p is near 0.5 |
When the following criteria are met, the binomial distribution can be applied.
Fixed number of trials: An experiment, such as flipping a coin ten times, has a fixed number of trials, denoted by n.
Only two possible outcomes: Each trial has two possible outcomes, often called “success” and “failure”. For example, flipping a coin results in either heads or tails.
Independent trials: Each trial’s verdict is distinct from the others, so the outcome of one does not influence the outcome of another.
Constant Probability: For every trial, the chance of success
(represented by p) stays constant. When you flip a fair coin, for example, the probability of getting heads is always 0.5, meaning there’s an equal chance of getting heads or tails.
Properties of Binomial Distribution
The binomial distribution, formula for a random variable X is given by :
P(X=k)=nkpk(1-p)n-k
Where,
n=number of trials
k=number of successes
p=probability of success on a single trial
1-p=probability of failure
nk=n!k!(n-k)! is the number of combinations
For a given probability of success, a binomial distribution table shows the odds of obtaining varying numbers of successes in a predetermined number of trials.
n is the number of trials. p is the likelihood of success. P(X=x) is displayed in the table for all x values between 0 and n.
For example, (n =4, p = 0.5):
x | P(X=x) |
0 | 0.0625 |
1 | 0.2500 |
2 | 0.3750 |
3 | 0.2500 |
4 | 0.0625 |
There is a 0.3750 chance of achieving precisely two successes in four trials.
The binomial distribution graph shows the possibilities of obtaining different numbers of successes (X) in a given number of trials (N), which makes it a useful visualization tool. In the graph below, the distribution plot determines the probability of rolling exactly 0 sixes, 1 sixes, 2 sixes, 3 sixes, ..., and up to 10 sixes in the 10 dice rolls. This method allows the binomial distribution graph to include the entire range of potential successes up to the total number of trials.
Each bar in the chart shows the likelihood of rolling a particular number of sixes out of ten dice rolls. Because the probability of rolling seven or more sixes in ten dice rolls is very low, the graph does not show those values.
According to the binomial distribution graph, there is a roughly 16% chance of rolling no sixes. Rolling exactly one six in ten fair dice rolls is the most likely outcome, with a probability of about (32%). However, it often happens to roll two sixes. After three sixes, the odds always decrease. Furthermore, the three sixes bar corresponds to our previous result of 0.155095.
In statistics, the binomial distribution is used to model the number of successes in a predetermined number of independent trials, every trial has only two possible outcomes: one is the probability of success or the probability of failure.
Formula:
P(X=x)=nxpx(1-p)n-x
Where,
X=The number of successes
n=The number of trials
p=The probability of success
nx=The number of combinations
For example,
The binomial distribution can be used to calculate the probability of getting exactly three heads when flipping a fair coin five times.
The number of trials or observations in a binomial distribution applies when each trial has the same probability of success, and is interested in the number of successes over a fixed number of independent trials. In other words, it calculates the probability of a certain number of successes in a fixed number of trials.
A binomial distribution’s mean is:
The mean is represented by 𝜇=np, where n is the number of observations and p is the likelihood of success.
The distribution is symmetric about the mean for the instant p=0.5. The distribution is skewed to the left if p>0.5 and to the right if p<0.5.
A binomial distribution’s variance is represented as:
Variance σ2=npq or σ2=np(1-p)
A binomial distribution’s standard deviation is represented as:
Standard Deviation σ=npq or σ = np(1-p)
The following formula provides the coefficient of variation is represented as:
Coefficient of Variation=qnp or (1-p)np
Binomial Distribution Standard Deviation
The standard deviation is a common measure used to determine how the numbers are from the mean value.
Standard deviation = (Variance)1/2
= (npq)1/2
The central tendency of the dataset can be determined using three key measures: mean, median, and mode.
Mean
The mean denotes the average value of the dataset. The mean can be determined by dividing the total of all values in the dataset by the quantity of values. It is generally regarded as the arithmetic mean. Additional metrics of mean utilized
to ascertain central tendency, include the following:
Formula: Mean=x1 + x2 + .. + xnn
Median
The median is the central value of a dataset, arranged in either ascending or descending order. When the dataset comprises an even number of values, the median is determined by calculating the mean of the two central values.
Examine the provided dataset comprising an odd number of observations organized in descending order: 24, 22, 19, 17, 16, 14, 13, 11, 10, 8, 7, 6, and 3.
The number 13 is the median, with 6 values above and 6 values below it.
Mode
The mode denotes the most frequently occurring value within the dataset. Occasionally, the dataset may exhibit multiple modes, while in certain instances, it may lack any mode entirely.
Examine the dataset: 6, 5, 3, 4, 3, 2, 6, 5, 6
Let us arrange it in descending order 6, 6, 6, 5, 5, 4, 3, 3, 2
The mode represents the most common value occurring in a data set. Hence, the most frequently repeated value in the given dataset is 6.
Negative Binomial Distribution
The number of trials required to reach a specific number of successes in a series of independent trials, where the probability of success in each trial is constant, is known as the negative binomial distribution. Think about a scenario where the outcome of a die toss is 5. Now, it's a failure if we roll a die and don’t get five. We throw again now, but we don’t get 5. We throw again now, but we don’t get 5. The negative binomial distribution models the number of trials needed to achieve a fixed number of successes, like getting five on a die multiple times. For example, if we don’t get 5 for three consecutive tries and five is obtained on the fourth attempt or more, then the binomial distribution of the number of times we get 5 is known as the negative binomial distribution.
Formula
P(x)=Cr-1n+r-1prqn
Where,
n=Total number of trials
r=Number of trials in which we get the first success
p=Probability of success in each trial
q=Probability of failure in each trial
There are two possible outcomes, such as yes / no, head / tail, or even / odd. This means there are two possible outcomes, one that represents the event occurring and one that does not. If a random experiment meets the following criteria, it is referred to as a Bernoulli trial:
Two possible outcomes, such as “success” and binomial “failure”, can be used to define a binomial random variable. Take, for example, rolling a fair six-sided die focuses on which number appears, not just the value of the face. The probability of success for binomial distributions can be determined using the binomial distribution formula. It frequently says to “plug” in the numbers into the formula, to calculate the required values.
The following features serve as the foundation for the binomial distribution:
The binomial distribution is used in real life to predict outcomes and more. Let us see how binomial distribution helps.
Manufacturing quality control
To help guarantee product quality, the binomial distribution is used to calculate the probability of finding a certain number of defective items in a sample.
Clinical trials
The binomial distribution is used to estimate how often a new medication is successful by calculating the probability of a certain number of positive outcomes in clinical trials.
Advertising Campaigns
Companies forecast the number of consumers who will open a marketing email or reply to an advertisement using the binomial distribution.
Analytics for sports
The number of successful attempts a player may make, like a basketball free throw, is estimated by coaches and analysts using the binomial distribution.
Elections and Polling
To determine the proportion of survey participants who favor a specific candidate or policy, pollsters use the binomial distribution.
Students often make some common mistakes in binomial distribution. Let us look at the mistakes and how to quickly correct them.
Five times, a fair coin is tossed. How likely is it that you will get precisely three heads?
P(X = 3) = 0.3125
Here,
Let success = getting a head, p = 0.5, n = 5, x = 3
P(X=3)=53(0.5)3(0.5)2=10 × 0.125 × 0.25=0.3125
Using the binomial formula with, n=5, x=3 and p=0.5, the probability of receiving exactly three heads when a fair coin is tossed five times is 0.3125.
10% of a machine’s output is faulty. In a sample of six, what is the likelihood that precisely two of the items are defective?
P(X=2)=0.0984
Here,
p = 0.1, q = 0.9, n = 6, x = 2
P(X=2)=62(0.1)2(0.9)4=15 × 0.01 × 0.6561 = 0.0984
Using the binomial formula with, p=0.1, n=6, x=2 The probability of finding exactly 2 defective items when 6 items are sampled from a machine that produces 10% defective items is 0.0984.
A customer’s likelihood of opening a promotional email is said to be 20%. How likely is it that one email out of eight will be opened?
P(X=1)=0.3355
Here,
p = 0.2, q = 0.8, n = 8, x = 1
P(X=1)=81(0.2)1(0.8)7=8 × 0.2 ×0.2097=0.3355
Here, by using the binomial formula with p=0.2, n=8, and x=1, which finds the chance of 1 success and 7 failures, the probability that exactly 1 email is opened, when 8 promotional emails are sent. There is a 20% chance of a customer opens one is 0.3355
The success rate of a basketball player’s free throws is 80%. How likely is it that they will make exactly four or five shots?
P(X=4)=0.4096
p = 0.8, q = 0.2, n = 5, x = 4
P(X=4)=54(0.8)4(0.2)1=5 × 0.4096 × 0.2 = 0.4096
By using the binomial formula with p=0.8, n=5, and x=4, which represents 4 successful shots and 1 miss, the probability of a basketball player with an 80% of the success rate making exactly 4 shots when he takes 5 free throws that is resulted as 0.4096.
A student’s chances of correctly answering a quiz question are 75%. What is the likelihood that a student will correctly answer exactly three of the four questions on the quiz?
P(X=3)=0.421875
p = 0.75, q = 0.25, n = 4, x = 3
P(X=3)=43(0.75)3(0.25)1=4 × 0.421875 × 0.25=0.421875
Here, using the binomial formula for three successes and one failure, the probability of a student receiving exactly three correct answers out of four questions is said to be 0.4219 when they have a 75% chance of answering the question correctly.