Lecture 20

Large-Sample Estimation

Published

November 30, 2023

8.1 - Where We’ve Been and Where We’re Going

Statistical Inference

  • Inference - specifically, decision making and prediciton - plays a very important role in many people’s’ lives. For example:

    • The government needs to predict short- and long-term interest rates.

    • An engineering needs to decide if a new typs of steel is more resistant to high temperatures

    • Student needs to decide to take the subway or bus depending on how long it takes for each.

    • FDA needs to decide if a new drug helps patients

    • A fan needs to decide if the Warriors are going to blow a 20 point lead at halftime.

  • The job of the mathemetical statistician is to provide methods to both make these guesses and provide a measure of accuracy around those guesses (goodness of the inference)

  • This is done by making decisions or predictions about parameters e.g. the mean, standard deviation.

  • Two different ways to make inferences:

    • Estimation
    • Hypothesis testing
  • To estimate the value of a population parameter, you use an estimator - a statistic calculated using informatino fro the sample

    • Point Estimation
    • Interval estimation

8.2 Point Estimation

https://www.researchgate.net/profile/Junhua-Ding/publication/318432363/figure/fig3/AS:667604972761097@1536180837464/Visualizing-bias-and-variance-tradeoff-using-a-bulls-eye-diagram.ppm

How to estimate a population mean or Proportion

  • To estimate the population mean \(\mu\) for a quantitative population, the point estimator \(\bar x\) is unbiased with standard error estimated as:

    \[ \text{SE} = \frac{s}{\sqrt{n}} \]

  • The 95% margin of error when \(n \ge 30\) is estimated as

    \[ \pm 1.96 \times \frac{s}{\sqrt{n}} \]

  • For a population proportion $p$ for a binomial population, the point estimator \(\hat p = x/n\) is unbiased. with standard error estimated as

\[ \text{SE} = \sqrt{\frac{\hat p \hat q}{n}} \]

  • The 95% margin of error when \(n\hat p \gt 5 \text{ and } n \hat q \gt 5\) is estimated as

    \[ \pm 1.96 \times \sqrt{\frac{\hat p \hat q}{n}} \]

Example 8.4

A scientist is studying a species of polar bear, found in and around the Arctic Ocean.

A random sample of \(n = 50\) polar bears produced an average weight of 980 pounds with a standard deviation of 105 pounds. Use this information to estimate the average weight of all Arctic polar bears.








You can be fairly confident that the sample estimate of 980 pounds is within \(\pm 29\) lbs of the population mean.








Example 8.5

Estimate the true population proportion of adults who believe that global warming is a very serious problem, and find the margin of error for the estimate. 100 adults are sampled and fine that 73% of the sampled adults think global warming is a very serious problem.








Margin of error is about .09, p can be as small as .64 or as large as .82

SE is related to numerator and denominator, but numerator doesn’t change very much.

p

pq

sqrt(pq)

0.1

0.09

0.30

0.2

0.16

0.40

0.3

0.21

0.46

0.4

0.24

0.49

0.5

0.25

0.50

0.6

0.24

0.49

0.7

0.21

0.46

0.8

0.16

0.40

0.9

0.09

0.30

Therefore, margin is mostly dictated by sample size.

8.3 Interval Estimation

  • A confidence interval:

95% confidence interval can be shown below. What are the tail. The probabiltiy remaining in the two tails is noted as \(\alpha\). How much does each tail have?

How does it looks drawn on the standard normal?

A (1 - \(\alpha\))100% Large-Sample Confidence Interval

\[ \text{(Point estimator) } \pm z_{\alpha/2} \times \text{SE} \]

This formula generates two values: the lower confidence limit and the upper confidence limit.

Table 8.2 Values of _z_ Commonly Used for Confidence Intervals
Confidence Coefficient, (1 - \(\alpha\)) \(\alpha\) \(\alpha/2\) \(z_{\alpha/2}\)
.90 .10 .05 1.645
.95 .05 .025 1.96
.98 .02 .01 2.33
.99 .01 .005 2.58

Example 8.6

The average daily intake of dairy products for a random sample of \(n = 50\) adult males was \(\bar x = 756\) with standard deviation \(s = 35\) grams per days.

Construct a 95% and 99% confidence interval.

Example 8.8

A random sample of 985 likely voters - were polled by the Republican party. Of those survey, 592 indicated that they intended to vote for the Republican candidate. Construct a 90% confidence interval for \(p\).

Interpreting the Confidence Interval

Let’s pretend we are interested in the heights of students, if we were all knowing we could have the heights of every student - let’s say there are 100,000:

   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  4.000   5.091   5.670   5.639   6.219   7.000 

Here is a sample (rounded to .1 feet):

 [1] 5.2 6.4 6.8 6.5 6.0 4.2 5.4 5.5 5.8 5.2 5.3 5.0 5.1 6.4 6.4 5.8 7.0 4.9 6.2
[20] 4.9 6.6 6.5 5.7 5.6 5.9 6.9 5.4 5.3 5.1 6.8 5.5 5.9 5.1 5.3 6.0 4.5 4.8 6.1
[39] 6.7 5.3 5.0 5.0 6.4 5.8 4.6 6.7 4.9 6.2 7.0 4.9

Mean:

[1] 5.71

SD:

[1] 0.7318963

SE:

[1] 0.104

Interval:

[1] 5.606 5.814

Homework

[1] "8.2.30, 8.2.32, 8.3.18-20, 8.3.31, 8.3.37"

Answers: Section 8.2 and 8.3