8.3 A Population Proportion - Statistics | OpenStax (2024)

During an election year, we see articles in the newspaper that state confidence intervals in terms of proportions or percentages. For example, a poll for a particular candidate running for president might show that the candidate has 40 percent of the vote within 3 percentage points (if the sample is large enough). Often, election polls are calculated with 95 percent confidence, so the pollsters would be 95 percent confident that the true proportion of voters who favored the candidate would be between 0.37 and 0.43 (0.40 – 0.03, 0.40 + 0.03).

Investors in the stock market are interested in the true proportion of stocks that go up and down each week. Businesses that sell personal computers are interested in the proportion of households in the United States that own personal computers. Confidence intervals can be calculated for the true proportion of stocks that go up or down each week and for the true proportion of households in the United States that own personal computers.

The procedure to find the confidence interval, the sample size, the error bound for a population proportion (EBP), and the confidence level for a proportion is similar to that for the population mean, but the formulas are different.

How do you know you are dealing with a proportion problem? First, the data that you are collecting is categorical, consisting of two categories: Success or Failure, Yes or No. Examples of situations where you are trying to estimate the true population proportion are the following: What proportion of the population smokes? What proportion of the population will vote for candidate A? What proportion of the population has a college-level education?

The distribution of the sample proportions (based on samples of size n) is denoted by P′ (read “P prime”).

The central limit theorem for proportions asserts that the sample proportion distribution P′ approximately follows a normal distribution with mean value p and standard deviation $\sqrt{\frac{pq}{n}}$, where p is the population proportion and q = 1 – p.

The confidence interval has the form (p′ – EBP, p′ + EBP), where EBP is the error bound for the proportion.

$$p' = \frac{x}{n}$$

p′ = the estimated proportion of successes (p′ is a point estimate for p, the true proportion.)

x = the number of successes

n = the size of the sample

The error bound for a proportion is

$$EBP = \left(z_{\alpha/2}\right)\left(\sqrt{\frac{p'q'}{n}}\right),$$

where q′ = 1 – p′.

This formula is similar to the error bound formula for a mean, except that the "appropriate standard deviation" is different. For a mean, when the population standard deviation is known, the appropriate standard deviation that we use is $\frac{\sigma}{\sqrt{n}}$. For a proportion, the appropriate standard deviation is $\sqrt{\frac{pq}{n}}$.

However, in the error bound formula, we use $\sqrt{\frac{p'q'}{n}}$ as the standard deviation, instead of $\sqrt{\frac{pq}{n}}$.

In the error bound formula, the sample proportions p′ and q′, are estimates of the unknown population proportions p and q. The estimated proportions p′ and q′ are used because p and q are not known. The sample proportions p′ and q′ are calculated from the data: p′ is the estimated proportion of successes, and q′ is the estimated proportion of failures.

The confidence interval can be used only if the number of successes np′ and the number of failures nq′ are both greater than five.

That is, in order to use the formula for confidence intervals for proportions, you need to verify that both np′ > 5 and nq′ > 5.
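The pieces above (p′, q′, the critical value, EBP, and the check on np′ and nq′) can be combined in a short Python sketch. The function name `proportion_ci` is hypothetical, and the critical value comes from the standard library's `statistics.NormalDist` rather than a calculator or table; treat this as one possible illustration, not a reference implementation.

```python
from math import sqrt
from statistics import NormalDist


def proportion_ci(x, n, confidence=0.95):
    """Return (lower, upper) for the interval p' - EBP to p' + EBP.

    Hypothetical helper for illustration only.
    """
    p_hat = x / n          # p', the sample proportion
    q_hat = 1 - p_hat      # q' = 1 - p'
    # The normal approximation needs more than five successes and failures.
    if n * p_hat <= 5 or n * q_hat <= 5:
        raise ValueError("need np' > 5 and nq' > 5")
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)   # critical value z_{alpha/2}
    ebp = z * sqrt(p_hat * q_hat / n)         # error bound for the proportion
    return p_hat - ebp, p_hat + ebp
```

For instance, `proportion_ci(421, 500, 0.95)` returns bounds that round to 0.810 and 0.874.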

Example 8.10

Problem

Suppose that a market research firm is hired to estimate the percentage of adults living in a large city who have cell phones. Five hundred randomly selected adult residents in this city are surveyed to determine whether they have cell phones. Of the 500 people surveyed, 421 responded yes, they own cell phones. Using a 95 percent confidence level, compute a confidence interval estimate for the true proportion of adult residents of this city who have cell phones.

Solution

  • The first solution is step-by-step (Solution A).
  • The second solution uses a function of the TI-83, 83+, or 84 calculators (Solution B).

Solution A

Let X = the number of people in the sample who have cell phones. X is binomial: $X \sim B\left(500, \frac{421}{500}\right)$.

To calculate the confidence interval, you must find p′, q′, and EBP.

n = 500

x = the number of successes = 421

$$p' = \frac{x}{n} = \frac{421}{500} = 0.842$$

p′ = 0.842 is the sample proportion; this is the point estimate of the population proportion.

$$q' = 1 - p' = 1 - 0.842 = 0.158$$

Because CL = 0.95, then α = 1 – CL = 1 – 0.95 = 0.05 and $\frac{\alpha}{2}$ = 0.025.

Then, $z_{\alpha/2} = z_{0.025} = 1.96$.

Use the TI-83, 83+, or 84+ calculator command invNorm(0.975,0,1) to find $z_{0.025}$. Remember that the area to the right of $z_{0.025}$ is 0.025, and the area to the left of $z_{0.025}$ is 0.975. This can also be found using appropriate commands on other calculators, using a computer, or using a standard normal probability table.
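As one way of finding the critical value on a computer, the following sketch uses Python's standard library. It assumes `statistics.NormalDist().inv_cdf` as the counterpart of the calculator's invNorm; the variable names are illustrative.

```python
from statistics import NormalDist

confidence_level = 0.95
alpha = 1 - confidence_level

# The area to the left of z_{alpha/2} is 1 - alpha/2 = 0.975,
# so the inverse CDF of the standard normal gives the critical value.
z = NormalDist(mu=0, sigma=1).inv_cdf(1 - alpha / 2)
print(round(z, 2))  # 1.96
```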

$$EBP = \left(z_{\alpha/2}\right)\sqrt{\frac{p'q'}{n}} = (1.96)\sqrt{\frac{(0.842)(0.158)}{500}} = 0.032$$

$$p' - EBP = 0.842 - 0.032 = 0.810$$

$$p' + EBP = 0.842 + 0.032 = 0.874$$

The confidence interval for the true binomial population proportion is (p′ – EBP, p′ + EBP) = (0.810, 0.874).

Interpretation

We estimate with 95 percent confidence that between 81 percent and 87.4 percent of all adult residents of this city have cell phones.

Explanation of 95 percent Confidence Level

Ninety-five percent of the confidence intervals constructed in this way would contain the true value for the population proportion of all adult residents of this city who have cell phones.
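The meaning of the confidence level can be illustrated with a small simulation. The sketch below assumes a hypothetical true proportion of 0.84 (in practice the true value is unknown), draws many samples of size 500, builds the interval p′ ± EBP for each, and reports the fraction of intervals that contain the assumed true value. The function name `coverage`, the number of trials, and the seed are all illustrative choices.

```python
import random
from math import sqrt
from statistics import NormalDist


def coverage(p_true, n, confidence=0.95, trials=2000, seed=1):
    """Fraction of simulated intervals p' +/- EBP that contain p_true."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    hits = 0
    for _ in range(trials):
        # Draw one sample of size n and count the successes.
        x = sum(rng.random() < p_true for _ in range(n))
        p_hat = x / n
        ebp = z * sqrt(p_hat * (1 - p_hat) / n)
        if p_hat - ebp <= p_true <= p_hat + ebp:
            hits += 1
    return hits / trials


# p_true = 0.84 is an assumed value used only for this illustration.
print(coverage(p_true=0.84, n=500))
```

The printed fraction should land near 0.95, though it varies with the seed and the number of trials.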

Solution

Solution B

Using the TI-83, 83+, 84, 84+ Calculator

Press STAT and arrow over to TESTS.
Arrow down to A:1-PropZint. Press ENTER.
Arrow down to x and enter 421.
Arrow down to n and enter 500.
Arrow down to C-Level and enter .95.
Arrow down to Calculate and press ENTER.
The confidence interval is (0.81003, 0.87397).

Try It 8.10

Suppose 250 randomly selected people are surveyed to determine whether they own tablets. Of the 250 surveyed, 98 reported owning tablets. Using a 95 percent confidence level, compute a confidence interval estimate for the true proportion of people who own tablets.

Example 8.11

Problem

For a class project, a political science student at a large university wants to estimate the percentage of students who are registered voters. He surveys 500 students and finds that 300 are registered voters. Compute a 90 percent confidence interval for the true percentage of students who are registered voters, and interpret the confidence interval.

Solution

  • The first solution is step-by-step (Solution A).
  • The second solution uses a function of the TI-83, 83+, or 84 calculators (Solution B).

Solution A

x = 300 and n = 500

$$p' = \frac{x}{n} = \frac{300}{500} = 0.600$$

$$q' = 1 - p' = 1 - 0.600 = 0.400$$

Because CL = 0.90, then α = 1 – CL = 1 – 0.90 = 0.10 and $\frac{\alpha}{2}$ = 0.05.

$$z_{\alpha/2} = z_{0.05} = 1.645$$

Use the TI-83, 83+, or 84+ calculator command invNorm(0.95,0,1) to find $z_{0.05}$. Remember that the area to the right of $z_{0.05}$ is 0.05, and the area to the left of $z_{0.05}$ is 0.95. This can also be found using appropriate commands on other calculators, using a computer, or using a standard normal probability table.

$$EBP = \left(z_{\alpha/2}\right)\sqrt{\frac{p'q'}{n}} = (1.645)\sqrt{\frac{(0.60)(0.40)}{500}} = 0.036$$

$$p' - EBP = 0.60 - 0.036 = 0.564$$

$$p' + EBP = 0.60 + 0.036 = 0.636$$

The confidence interval for the true binomial population proportion is (p′ – EBP, p′ + EBP) = (0.564, 0.636).

Interpretation

  • We estimate with 90 percent confidence that the true percentage of all students who are registered voters is between 56.4 percent and 63.6 percent.
  • Alternate wording: We estimate with 90 percent confidence that between 56.4 percent and 63.6 percent of all students are registered voters.

Explanation of 90 percent Confidence Level

Ninety percent of all confidence intervals constructed in this way contain the true value for the population percentage of students who are registered voters.

Solution

Solution B

Using the TI-83, 83+, 84, 84+ Calculator

Press STAT and arrow over to TESTS.
Arrow down to A:1-PropZint. Press ENTER.
Arrow down to x and enter 300.
Arrow down to n and enter 500.
Arrow down to C-Level and enter 0.90.
Arrow down to Calculate and press ENTER.
The confidence interval is (0.564, 0.636).

Try It 8.11

A student polls her school to determine whether students in the school district are for or against the new legislation regarding school uniforms. She surveys 600 students and finds that 480 are against the new legislation.

a. Compute a 90 percent confidence interval for the true percentage of students who are against the new legislation, and interpret the confidence interval.

b. In a sample of 300 students, 68 percent said they own an iPod and a smartphone. Compute a 97 percent confidence interval for the true percentage of students who own an iPod and a smartphone.

Plus-Four Confidence Interval for p

There is a certain amount of error introduced into the process of calculating a confidence interval for a proportion. Because we do not know the true proportion for the population, we are forced to use point estimates to calculate the appropriate standard deviation of the sampling distribution. Studies have shown that the resulting estimation of the standard deviation can be flawed.

Fortunately, there is a simple adjustment that allows us to produce more accurate confidence intervals: We simply pretend that we have four additional observations. Two of these observations are successes, and two are failures. The new sample size, then, is n + 4, and the new count of successes is x + 2.

Computer studies have demonstrated the effectiveness of the plus-four confidence interval for p method. It should be used when the confidence level desired is at least 90 percent and the sample size is at least ten.

Example 8.12

Problem

A random sample of 25 statistics students was asked: “Have you used a product in the past week?” Six students reported using the product within the past week. Use the plus-four method to find a 95 percent confidence interval for the true proportion of statistics students who use the product weekly.

Solution

Solution A

Six students out of 25 reported using a product within the past week, so x = 6 and n = 25. Because we are using the plus-four method, we will use x = 6 + 2 = 8, and n = 25 + 4 = 29.

$$p' = \frac{x}{n} = \frac{8}{29} \approx 0.276$$

$$q' = 1 - p' = 1 - 0.276 = 0.724$$

Because CL = 0.95, we know α = 1 – 0.95 = 0.05, and $\frac{\alpha}{2}$ = 0.025.

$$z_{0.025} = 1.96$$

$$EBP = \left(z_{\alpha/2}\right)\sqrt{\frac{p'q'}{n}} = (1.96)\sqrt{\frac{0.276(0.724)}{29}} \approx 0.163$$

$$p' - EBP = 0.276 - 0.163 = 0.113$$

$$p' + EBP = 0.276 + 0.163 = 0.439$$

We are 95 percent confident that the true proportion of all statistics students who use the product is between 0.113 and 0.439.

Solution

Solution B

Using the TI-83, 83+, 84, 84+ Calculator

Press STAT and arrow over to TESTS.
Arrow down to A:1-PropZint. Press ENTER.

Arrow down to x and enter 8.
Arrow down to n and enter 29.
Arrow down to C-Level and enter 0.95.
Arrow down to Calculate and press ENTER.
The confidence interval is (0.113, 0.439).

Reminder

Remember that the plus-four method assumes an additional four trials: two successes and two failures. You do not need to change the process for calculating the confidence interval; simply update the values of x and n to reflect these additional trials.
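The adjustment described in the reminder can be sketched in Python as follows. The function name `plus_four_ci` is hypothetical, and the critical value is taken from the standard library's `statistics.NormalDist` instead of a calculator; the only change from the ordinary interval is that x and n are replaced by x + 2 and n + 4.

```python
from math import sqrt
from statistics import NormalDist


def plus_four_ci(x, n, confidence=0.95):
    """Plus-four interval: pretend there are two extra successes and two extra failures."""
    x_adj = x + 2                  # two additional successes
    n_adj = n + 4                  # four additional observations in total
    p_hat = x_adj / n_adj          # adjusted p'
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    ebp = z * sqrt(p_hat * (1 - p_hat) / n_adj)
    return p_hat - ebp, p_hat + ebp


lower, upper = plus_four_ci(6, 25, 0.95)   # data from Example 8.12
print(round(lower, 3), round(upper, 3))    # 0.113 0.439
```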

Try It 8.12

Out of a random sample of 65 freshmen at State University, 31 students have declared their majors. Use the plus-four method to find a 96 percent confidence interval for the true proportion of freshmen at State University who have declared their majors.

Example 8.13

Problem

A group of researchers recently conducted a study analyzing the privacy management habits of teen internet users. In a group of 50 teens, 13 reported having more than 500 friends on a social media site. Use the plus-four method to find a 90 percent confidence interval for the true proportion of teens who would report having more than 500 online friends.

Solution

Using plus-four, we have x = 13 + 2 = 15, and n = 50 + 4 = 54.

$$p' = \frac{15}{54} \approx 0.278$$

$$q' = 1 - p' = 1 - 0.278 = 0.722$$

Because CL = 0.90, we know α = 1 – 0.90 = 0.10, and $\frac{\alpha}{2}$ = 0.05.

$$z_{0.05} = 1.645$$

$$EBP = \left(z_{\alpha/2}\right)\left(\sqrt{\frac{p'q'}{n}}\right) = (1.645)\left(\sqrt{\frac{(0.278)(0.722)}{54}}\right) \approx 0.100$$

$$p' - EBP = 0.278 - 0.100 = 0.178$$

$$p' + EBP = 0.278 + 0.100 = 0.378$$

We are 90 percent confident that between 17.8 percent and 37.8 percent of all teens would report having more than 500 friends on a social media site.

Solution

Using the TI-83, 83+, 84, 84+ Calculator


Press STAT and arrow over to TESTS.
Arrow down to A:1-PropZint. Press ENTER.
Arrow down to x and enter 15.
Arrow down to n and enter 54.
Arrow down to C-Level and enter 0.90.
Arrow down to Calculate and press ENTER.
The confidence interval is (0.178, 0.378).

Try It 8.13

The research group referenced in Example 8.13 talked to teens in smaller focus groups but also interviewed additional teens over the phone. When the study was complete, 588 teens had answered the question about their social media site friends, with 159 saying that they have more than 500 friends. Use the plus-four method to find a 90 percent confidence interval for the true proportion of teens who would report having more than 500 online friends based on this larger sample. Compare the results to those in Example 8.13.

Calculating the Sample Size n

If researchers desire a specific margin of error, then they can use the error bound formula to calculate the required sample size.

The margin of error formula for a population proportion is

  • $EBP = z_{\alpha/2} \times \sqrt{\frac{p'q'}{n}}$, where p′ is the sample proportion, q′ = 1 – p′, and n is the sample size.
  • Solving for n gives you an equation for the sample size.
  • $n = \frac{\left(z_{\alpha/2}\right)^{2}\left(p'q'\right)}{EBP^{2}}$. This formula tells us that we can compute the sample size n required for a confidence level of CL = 1 – α by taking the square of the critical value $z_{\alpha/2}$, multiplying by the point estimate p′ and by q′ = 1 – p′, and finally dividing the result by the square of the margin of error. Always remember to round up the value of n.
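The sample size formula can be sketched in Python as shown below. The function name `required_sample_size` and its parameter names are hypothetical, and the default p′ = 0.5 is an assumption made here because it gives the largest value of p′q′ and therefore the most cautious sample size when no estimate is available.

```python
from math import ceil
from statistics import NormalDist


def required_sample_size(ebp, confidence, p_hat=0.5):
    """Smallest n with error bound at most ebp, using n = z^2 * p'q' / EBP^2."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)   # z_{alpha/2}
    n = z ** 2 * p_hat * (1 - p_hat) / ebp ** 2
    return ceil(n)   # always round up to the next whole observation


print(required_sample_size(ebp=0.03, confidence=0.90))  # 752
```

Rounding up with `ceil` matters because rounding down would leave the error bound slightly larger than requested.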

Example 8.14

Problem

Suppose a mobile phone company wants to determine the current percentage of customers ages 50+ who use text messaging on their cell phones. How many customers ages 50+ should the company survey in order to be 90 percent confident that the estimated (sample) proportion is within 3 percentage points of the true population proportion of customers ages 50+ who use text messaging on their cell phones? Assume that p′ = 0.5.

Solution

From the problem, we know that EBP = 0.03 (3 percent = 0.03), and $z_{\alpha/2} = z_{0.05} = 1.645$ because the confidence level is 90 percent.

To calculate the sample size n, use the formula and make the substitutions.

$$n = \frac{z^{2}p'q'}{EBP^{2}} \text{ gives } n = \frac{1.645^{2}(0.5)(0.5)}{0.03^{2}} = 751.7$$

Round the answer to the next higher value. The sample size should be 752 cell phone customers ages 50+ in order to be 90 percent confident that the estimated (sample) proportion is within 3 percentage points of the true population proportion of all customers ages 50+ who use text messaging on their cell phones.

Try It 8.14

An internet marketing company wants to determine the current percentage of customers who click on ads on their smartphones. How many customers should the company survey in order to be 90 percent confident that the estimated proportion is within 5 percentage points of the true population proportion of customers who click on ads on their smartphones? Assume that the sample proportion p′ is 0.50.
