Sampling Distributions#

Sample Proportion#

Difference of Proportions#

Sample Mean#

Difference of Means#

Problems#

Sampling Distributions#

  1. Devore & Berk, 2007, Chapter 6.1 #2

There are two traffic lights on the way to work. Let \mathcal{X}_1 be the number of lights that are red, requiring a stop, and suppose the distribution of \mathcal{X}_1 is as follows,

x_1

0

1

2

p(x_1)

0.2

0.5

0.3

Let \mathcal{X}_2 be the numebr of lights that are red on the way home; \mathcal{X}_2 is independent of \mathcal{X}_1. Assume that \mathcal{X}_2 has the same distribution as \mathcal{X}_1, so that \mathcal{X}_1 and \mathcal{X}_2 is a random sample of size n=2.

  1. Let \mathcal{T} = \mathcal{X}_1 + \mathcal{X}_2. Find the probability distribution of \mathcal{T}.

  2. Calculate \mu_{T}. How does it relate to \mu, the population mean?

  3. Calculate {\sigma_T}^2. How does it relate to \sigma^2, the population variance?

  1. Absence Makes The Heart Grow Fonder

Let \mathcal{X} be the number of students absent from AP Statistics on a given day. Suppose the distribution of \mathcal{X} is as follows,

x

1

2

3

4

p(x)

0.4

0.3

0.2

0.1

  1. Consider a random sample of size n=2 (two days), and let \bar{\mathcal{X}} be the sample mean of the number of absent students. Obtain the probability distribution of \bar{\mathcal{X}}.

  2. What is the probability the average number of absent students for a sample of 2 is atleast 3 absent students?

  3. Is the sample mean an unbiased estimator?

  4. Consider again a random sample of size n=2, and let \mathcal{R} be the sample range of the number of absent students. Obtain the probability distribution of \mathcal{R}.

  5. What is the probability the sample range for a sample of 2 is at most 2 absent students?

  1. Is the sample range an unbiased estimator?

Sample Proportions#

  1. Handedness

Approximately 10% of the world’s population is left handed. There are 347 students in the Bishop Walsh student body. Let us assume the Bishop Walsh student body is an exact representative sample of the population, i.e. 10% of the 347 students are left-handed. You decide to draw a simple random sample of students from the study body and measure whether or not they are left-handed.

  1. What are the parameters (expected value and standard deviation) of the population distribution?

  2. How many samples do you need to make to ensure the conditions for inference are met?

  3. What are the parameters (expected value and standard deviation) of the sampling distribution?

  4. Describe the shape of the samping distribution for \hat{p}.

  5. Assuming the Wikipedia article referenced above is correct, what is the probability more than half of your sample of Bishop Walsh students are left-handed?

  6. Based on the answer to part b and part e, if you drew a sample of this size and estimated a value of \hat{p} = 0.5 for this sample, how would this evidence affect your belief in the claim from the Wikipedia article?

  1. Illuminati

According to a 2019 report by Business Insider, 15% of registered voters in the United States believe the Illuminati secretly controls world events.

  1. How many samples do you need to ensure the conditions for inference are met?

  2. Suppose you randomly sample 50 registered voters living in the US. How likely is it this sample will contain less than 10% of people who believe in the Illuminati?

  3. In a sample of 50 registered voters, how many people who believe in the Illuminati do you need to observe before you can be reasonably convinced the percentage of registered voters who believe in the Illuminati is not equal to the claimed percentage?

  1. DVD Players

According to a survey conducted in 2022, 38% of 18-29 year olds from the United States live in a household that owns a DVD player. Suppose you randomly sample 40 Americans between the ages of 18 and 29 years old. What is the probability the proportion of the people from this sample who live in a household with a DVD player will be greater than 45%?

  1. Coins

Suppose you flip a coin 100 times. If you allow for a 5% probability of being wrong, how many heads in a sample of 100 would you need to observe before you had convincing evidence the probability of heads for a single coin flip is greater than 50%?

Difference of Sample Proportions#

  1. Athletics

At Bishop Walsh, the administration claims 75% of high school students participate in atleast one extra curricular activity, while 69% of middle students participate in atleast one extracurricular activity.

During middle school lunch, Mr. Moore asks a random sample of 35 students whether or not they participate in atleast one extracurricular activity. During highschool lunch, Mr. Moore asks the same question to a random sample of 38 high school students.

Let \hat{p}_{\text{hs}} represent the proportion of high school students in Mr. Moore’s sample that participate in atleast one extracurricular activity. Let \hat{p}_{\text{ms}} represent the proporition of middle school students in Mr. Moore’s sample that participate in atleast one extracurricular activity.

  1. What is the shape of the sampling distribution \hat{p}_{\text{hs}} - \hat{p}_{\text{ms}}? Justify your answer.

  2. If the administration’s claim is true, find and interpret the mean of the sampling distribution for the difference of sample proportions between highschool and middleschool.

  3. If the administration’s claim is true, find and interpret the standard deviation of the sampling distribution for the difference of sample proportions between highschool and middleschool.

  4. Assuming the administration’s claim is true, what is the probability that the proportion of highschool students who participate in atleast one extracurricular is atleast 10% higher than the proportion of middle-school students who participate in atleast one extracurricular?

  5. Suppose the difference in proportions is exactly 0.10. Based on your result to part d does this give you reason to doubt the administration’s claim?

  1. Skittles and M&Ms

The manufacturers of Skittles claim 20% of each bag of Skittles is red. The manufacturers of M&M’s claim 25% of each bag of M&Ms is red. Each bag of M&Ms contains 56 pieces. Each bag of Skittles contains 54 pieces. Assuming the manufacturers’ claim to be true, what is the probability the proportion of red M&Ms in a bag will exceed the proportion of red Skittles in a bag?

Sample Means#

  1. Devore & Berk, 2007, Chapter 6.2 #17

The lifetime of a certain brand of battery is normally distributed with a mean of 10 hours and a standard deviation of 1 hour. There are four batteries to a package. If you buy a single package, what is the probability the average lifetime of all of the batteries in the package exceeds 11 hours?

  1. Exams

There are five students in Mr. Moore’s AP Statistics. On the basis of his many years of experience, Mr. Moore knows the time it takes to grade a randomly chosen examination paper is normally distributed with a mean of 10 minutes and a standard deviation of 2.5 minutes.

  1. What is the probability the average amount of time Mr. Moore spends grading the latest exams is less than 8 minutes?

  2. The latest Dragonball Z is on at 8 pm on the Cartoon Network. If Mr. Moore starts grading at 7:15 PM, what is the probability Mr. Moore misses part of the epic battle between Goku and Frieza?

Difference of Means#

  1. Chemistry

Mr. Myers notices the chemistry students in his morning class are measuring different pH values for a chemical compound than the students in his afternoon class. There are 10 students in his morning class and 15 students in his afternoon class. Mr. Myers decides to calculate the average pH of each class and compare them.

Suppose the pH measured by a randomly selected chemistry student is normally distributed with a mean pH of 5.00 and a standard deviation of 0.2. What is the probability the difference of the class averages for the pH will be within 0.1 of each other?

A.P. Exam Practice#

  1. 2008, Free Response, #2

Four different statistics have been proposed as estimators of a population parameter. To investigate the behavior of these estimators, 500 random samples are selected from a known population and each statistic is calculated for each sample. The true value of the population parameter is 75. The graphs below show the distribution of values for each statistic.

../../../_images/2008_apstats_frp_02.png

Use the information to solve the following problems.

  1. Which of the statistics appear to be unbiased estimators of the population parameter? How can you tell?

  2. Which of statistics A or B would be a better estimator of the population parameter? Explain your choice.

  3. Which of statistics C or D would be a better estimator of the population parameter? Explain your choice.

  1. 2006, Free Response, #3

The depth from the surface of Earth to a refracting layer beneath the surface can be estimated using methods developed by seismologists. One method is based on the time required for vibrations to travel from a distant explosion to a receiving point. The depth measurement M is the sum of the true depth D and the random measurement error E. That is, M = D + E. The measurement error E is assumed to be normally distributed with mean 0 feet and standard deviation 1.5 feet.

  1. If the true depth at a certain point is 2 feet, what is the probability that the depth measurement will be negative?

  2. Suppose three independent depth measurements are taken at the point where the true depth is 2 feet. What is the probability that at least one of these measurements will be negative?

Hint

Draw a Venn Diagram! Use the Multiplication Law for Independent Events to calculate the intersections!

  1. What is the probability that the mean of the three independent depth measurements taken at the point where the true depth is 2 feet will be negative?

  1. 2004, Free Response Form B, #3

Trains carry bauxite ore from a mine in Canada to an aluminum processing plant in northern New York state in hopper cars. Filling equipment is used to load ore into the hopper cars. When functioning properly, the actual weights of ore loaded into each car by the filling equipment at the mine are approximately normally distributed with a mean of 70 tons and a standard deviation of 0.9 ton. If the mean is greater than 70 tons, the loading mechanism is overfilling.

  1. If the filling equipment is functioning properly, what is the probability that the weight of the ore in a randomly selected car will be 70.7 tons or more? Show your work.

  2. Suppose that the weight of ore in a randomly selected car is 70.7 tons. Would that fact make you suspect that the loading mechanism is overfilling the cars? Justify your answer.

  3. If the filling equipment is functioning properly, what is the probability that a random sample of 10 cars will have a mean ore weight of 70.7 tons or more? Show your work.

  4. Based on your answer in part (c), if a random sample of 10 cars had a mean ore weight of 70.7 tons, would you suspect that the loading mechanism was overfilling the cars? Justify your answer.

  1. 2007, Free Response, #3

Big Town Fisheries recently stocked a new lake in a city park with 2,000 fish of various sizes. The distribution of the lengths of these fish is approximately normal.

  1. Big Town Fisheries claims that the mean length of the fish is 8 inches. If the claim is true, which of the following would be more likely?

    A random sample of 15 fish having a mean length that is greater than 10 inches

or

A random sample of 50 fish having a mean length that is greater than 10 inches

Justify your answer.

  1. Suppose the standard deviation of the sampling distribution of the sample mean for random samples of size 50 is 0.3 inch. If the mean length of the fish is 8 inches, use the normal distribution to compute the probability that a random sample of 50 fish will have a mean length less than 7.5 inches.

  2. Suppose the distribution of fish lengths in this lake was nonnormal but had the same mean and standard deviation. Would it still be appropriate to use the normal distribution to compute the probability in part b ? Justify your answer.

  1. 2009, Free Response, #2

A tire manufacturer designed a new tread pattern for its all-weather tires. Repeated tests were conducted on cars of approximately the same weight traveling at 60 miles per hour. The tests showed that the new tread pattern enables the cars to stop completely in an average distance of 125 feet with a standard deviation of 6.5 feet and that the stopping distances are approximately normally distributed.

  1. What is the 70th percentile of the distribution of stopping distances?

  2. What is the probability that at least 2 cars out of 5 randomly selected cars in the study will stop in a distance that is greater than the distance calculated in part a?

  3. What is the probability that a randomly selected sample of 5 cars in the study will have a mean stopping distance of at least 130 feet?

  1. 2010, Free Response, #2

A local radio station plays 40 rock-and-roll songs during each 4-hour show. The program director at the station needs to know the total amount of airtime for the 40 songs so that time can also be programmed during the show for news and advertisements. The distribution of the lengths of rock-and-roll songs, in minutes, is roughly symmetric with a mean length of 3.9 minutes and a standard deviation of 1.1 minutes.

  1. Describe the sampling distribution of the sample mean song lengths for random samples of 40 rock-and-roll songs.

  2. If the program manager schedules 80 minutes of news and advertisements for the 4-hour (240-minute) show, only 160 minutes are available for music. Approximately what is the probability that the total amount of time needed to play 40 randomly selected rock-and-roll songs exceeds the available airtime?