8.3: Estimating a Population Proportion (1 of 3) (2024)

  1. Last updated
  2. Save as PDF
  • Page ID
    14131
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vectorC}[1]{\textbf{#1}}\)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}}\)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}}\)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Learning Objectives

    • Construct a confidence interval to estimate a population proportion when conditions are met. Interpret the confidence interval in context.

    Introduction

    In “Estimating a Population Proportion,” we continue our discussion of estimating a population proportion with a confidence interval. Recall that the purpose of a confidence interval is to use a sample proportion to construct an interval of values that we can be reasonably confident contains the true population proportion.

    The basic idea is summarized here:

    • When we select a random sample from the population of interest, we expect the sample proportion to be a good estimate of the population proportion. But we also know that sample proportions vary, so we expect some error. (Remember that the error here is due to chance. It is not due to a mistake that anyone made.)
    • For a given sample proportion, we will not know the amount of error, so we use the standard error as an estimate for the average amount of error we expect in sample proportions. (Recall that the standard error is the expected standard deviation of sample proportions when we take many, many random samples.)
    • If a normal model is a good fit for the sampling distribution, then about 95% of sample proportions estimate the population proportion within 2 standard errors. We say that we are 95% confident that the following interval contains the population proportion.

    8.3: Estimating a Population Proportion (1 of 3) (1)

    You may realize that this formula for the confidence interval is a bit odd, since our goal in calculating the confidence interval is to estimate the population proportion p. Yet the formula requires that we know p. In the section “Introduction to Statistical Inference,” we used an estimate for p from a previous study when calculating the confidence interval. This is not the usual way statisticians estimate the standard error, but it captured the main idea and allowed us to practice finding and interpreting confidence intervals. Now, we develop a different way to estimate standard error that is commonly used in statistical practice.

    Example

    Community College Students and Gender

    According to a 2010 report from the American Council on Education, females make up 57% of the college population in the United States. Students in a statistics class at Tallahassee Community College want to determine the proportion of female students at TCC. They select a random sample of 135 TCC students and find that 72 are female, which is a sample proportion of 72 / 135 ≈ 0.533. So 53.3% of the students in the sample are female.

    What can they conclude about the proportion of females at the college? How confident can they be in their estimate?

    To answer these questions, we need to find a confidence interval.

    Checking conditions:

    We learned in Linking Probability to Statistical Inference that a confidence interval comes from a normal model of the sampling distribution. Let’s first make sure that a normal model is appropriate here. Recall the two conditions for using a normal model for sample proportions:

    • The sample must be random.
    • The expected number of successes in the sample, np, and the expected number of failures, n(1 – p), are both greater than or equal to 10. In symbols, this is np ≥ 10 and n(1 − p) ≥ 10. Recall that success doesn’t mean good and failure doesn’t mean bad. A success is just what we are counting.

    When we try to check these conditions, we have a problem. We do not know p, the population proportion. In fact, p is what we are trying to estimate! So we cannot determine the expected number of successes and failures. Our solution to this problem is to adjust these conditions. Advanced theory tells us that if the actual number of successes and failures in the sample are greater than or equal to 10, then a normal model is still a good fit.

    This sample contains 72 successes (female students) and 63 failures (male students). Both are greater than 10. We therefore use the normal model for the sampling distribution.

    Finding the margin of error:

    We know that a sample proportion is only an estimate for the population proportion. We do not expect the sample proportion to equal the population proportion, so there is some error due to random chance. We use the standard deviation of the sample proportions to describe the amount of error we can expect in random samples. We call this the standard error.

    In Linking Probability to Statistical Inference, we learned that the standard error of the sample proportion depends on the population proportion and sample size. Here is the formula for the standard error:

    8.3: Estimating a Population Proportion (1 of 3) (2)

    When we use a normal model for the sampling distribution, 95% of sample proportions estimate the population proportion within approximately 2 standard errors. So the margin of error is the following:

    8.3: Estimating a Population Proportion (1 of 3) (3)

    Now let’s calculate the margin of error for the TCC estimate of 53.3%. Notice that we have the same problem we had earlier. We don’t know p, the population proportion. So we can’t calculate the margin of error! Our solution to this problem is to estimate the standard error using the sample proportion in place of p. We call this the estimated standard error, and the formula is:

    8.3: Estimating a Population Proportion (1 of 3) (4)

    For this example, the estimated standard error is

    8.3: Estimating a Population Proportion (1 of 3) (5)

    So the margin of error for the 95% confidence interval is:

    8.3: Estimating a Population Proportion (1 of 3) (6)

    Finding the confidence interval:

    We can interpret the margin of error by saying we are 95% confident that the proportion of all students at TCC who are female is within 0.086 of our sample proportion of 0.533. We can then write the interval in the following form:

    8.3: Estimating a Population Proportion (1 of 3) (7)

    When we add and subtract the margin of error from the sample proportion, the confidence interval is 0.447 to 0.619.

    Conclusion:

    We are 95% confident that the proportion of all TCC students who are female is between 0.447 and 0.619. We can also make this statement using percentages. We are 95% confident that the percentage of all TCC students who are female is between 44.7% and 61.9%.

    Recall from Linking Probability to Statistical Inference that 95% confidence means this method captures the population proportion about 95% of the time.

    Summary

    Conditions for using the normal model of the sampling distribution:

    In Linking Probability to Statistical Inference, we saw that a normal model describes the behavior of sample proportions if np ≥ 10 and n(1 − p) ≥ 10. These formulas say that the expected number of successes and failures in the sample must be 10 or greater. In Inference for One Proportion, we will never know the value of the population proportion p, so we estimate p with a sample proportion. Now we will assume that we can use a normal model if 8.3: Estimating a Population Proportion (1 of 3) (8).

    These formulas say that the actual number of successes and failures in the sample are 10 or greater.

    95% confidence interval for estimating population proportion p:

    In Linking Probability to Statistical Inference, we learned that the error in an estimate is related to the spread in the sampling distribution. We saw that the standard error of the sampling distribution of sample proportions is given by this formula:

    8.3: Estimating a Population Proportion (1 of 3) (9)

    In Inference for One Proportion, we are estimating the population proportion p. So we estimate the standard error by replacing p with the sample proportion, which affects the margin of error in the confidence interval. We have the following adjustment to the confidence interval formula:

    8.3: Estimating a Population Proportion (1 of 3) (10)

    Try It

    Foothill College’s athletic department wants to calculate the proportion of students who have attended a women’s basketball game at the college. They use student email addresses, randomly choose 220 students, and email them. Of the 145 who responded, 22 had attended a women’s basketball game.

    https://assessments.lumenlearning.co...sessments/3592

    https://assessments.lumenlearning.co...sessments/3593

    https://assessments.lumenlearning.co...sessments/3903

    Contributors and Attributions

    CC licensed content, Shared previously

    8.3: Estimating a Population Proportion (1 of 3) (2024)

    FAQs

    How to estimate population proportion? ›

    p′ = x / n where x represents the number of successes and n represents the sample size. The variable p′ is the sample proportion and serves as the point estimate for the true population proportion. The variable p′ has a binomial distribution that can be approximated with the normal distribution shown here.

    How to calculate sample proportion? ›

    The sample proportion P is given by P=X/N, where X denotes the number of successes and N denotes the size of the sample in question. This sample proportion reflects that particular sample, and other samples of the population may result in different sample proportions.

    What is the population proportion? ›

    The population proportion is a number representing a part of a population and is therefore an example of a population parameter. The Greek letter π (pi) is a common symbol for the population proportion.

    How to find population proportion in statistics calculator? ›

    To calculate a population proportion, divide the number of successes by the population size, then multiply by 100 to express the result as a percentage.

    How to calculate the proportion? ›

    Answer: To find the proportion in statistics, divide the count of occurrences of a particular outcome by the total number of observations. In statistics, a proportion represents the fraction or percentage of a specific outcome relative to the total number of observations.

    How do you calculate population estimate? ›

    How to you estimate population size from a sample? To estimate the population size (n) from a sample, you can extrapolate your data from the area you sampled (a quadrat, plot, or any sample unit) with the formula n = (total area of habitat / area of sample unit) x mean number of individuals per sample unit.

    How to estimate the population mean? ›

    The sum of all observations in the sample, divided by the size of the sample, N. The sample mean is an estimate of the population mean, ("mu") which is one of two parameters defining the normal distribution (the other is , see below).

    What is the population formula? ›

    Answer: Start by identifying the components of the population growth formula P=Poert, where P represents the final population, Po represents the initial population, e is the base, which is approximated as 2.71828, r represents the rate of change (as a decimal), and t represents time.

    What is population mean proportion? ›

    What is the difference between population mean and population proportion? In summary, the population mean describes the average value of a variable, while the population proportion describes the frequency with which a particular characteristic occurs in a population.

    How do you calculate population per? ›

    How to calculate per capita
    1. Determine the number that correlates with what you are trying to calculate. ...
    2. Determine how many people are in the population that you want to measure. ...
    3. Divide the measurement by the total number of people in the population. ...
    4. For smaller measurements, multiply the total by 100,000.
    Jul 1, 2024

    How to estimate sample size? ›

    For sample size estimation, researchers need to (1) provide information regarding the statistical analysis to be applied, (2) determine acceptable precision levels, (3) decide on study power, (4) specify the confidence level, and (5) determine the magnitude of practical significance differences (effect size).

    What is the equation for proportional to the population? ›

    The rate of change of a certain population is proportional to the square root of its size. Model this situation with a differential equation. The solution is said to be dPdt=k√P, where k>0 is the proportionality constant.

    What is the test for population proportion formula? ›

    Statistics - One Proportion Z Test

    z=(p−P)σ where P is the hypothesized value of population proportion in the null hypothesis, p is the sample proportion, and σ is the standard deviation of the sampling distribution.

    What is the formula to calculate population rate? ›

    A general formula for calculating the population growth rate is Gr = N / t. Gr is the growth rate measured in individuals, N is the change in population, and t is the period of time.

    References

    Top Articles
    Latest Posts
    Article information

    Author: Rubie Ullrich

    Last Updated:

    Views: 6249

    Rating: 4.1 / 5 (72 voted)

    Reviews: 95% of readers found this page helpful

    Author information

    Name: Rubie Ullrich

    Birthday: 1998-02-02

    Address: 743 Stoltenberg Center, Genovevaville, NJ 59925-3119

    Phone: +2202978377583

    Job: Administration Engineer

    Hobby: Surfing, Sailing, Listening to music, Web surfing, Kitesurfing, Geocaching, Backpacking

    Introduction: My name is Rubie Ullrich, I am a enthusiastic, perfect, tender, vivacious, talented, famous, delightful person who loves writing and wants to share my knowledge and understanding with you.