Statistics: Quarter Four


Celebration of Knowledge

  1. 2009, Free Response, #1

A simple random sample of 100 high school seniors was selected from a large school district. The gender of each student was recorded, and each student was asked the following questions.

  1. Have you ever had a part-time job?

  2. If you answered yes to the previous question, was your part-time job in the summer only?

The results are summarized in the following table.

../../../_images/2009_apstats_frp_01.png

Use this information to answer the following questions.

  a. Construct a graphical display that represents the association between gender and job experience for the students in the sample.

  b. Write a few sentences summarizing what the display in part (a) reveals about the association between gender and job experience for the students in the sample.

  c. Which test of significance should be used to test if there is an association between gender and job experience for the population of high school seniors in the district? State the null and alternative hypotheses for the test, but do not perform the test.

  2. 2007, Free Response, #6

TODO

  3. 2011, Free Response, #6

TODO

  4. 2019, Free Response, #6

Emma is moving to a large city and is investigating typical monthly rental prices of available one-bedroom apartments. She obtained a random sample of rental prices for 50 one-bedroom apartments taken from a Web site where people voluntarily list available apartments.

  a. Describe the population for which it is appropriate for Emma to generalize the results from her sample.

The distribution of the 50 rental prices of the available apartments is shown in the following histogram.

../../../_images/2019_apstats_frp_06a.png

Use this histogram to answer the following questions.

  b. Emma wants to estimate the typical rental price of a one-bedroom apartment in the city. Based on the distribution shown, what is a disadvantage of using the mean rather than the median as an estimate of the typical rental price?

  c. Instead of using the sample median as the point estimate for the population median, Emma wants to use an interval estimate. However, computing an interval estimate requires knowing the sampling distribution of the sample median for samples of size 50. Emma has one point, her sample median, in that sampling distribution. Using information about rental prices that are available on the Web site, describe how someone could develop a theoretical sampling distribution of the sample median for samples of size 50.

Because Emma does not have the resources to develop the theoretical sampling distribution, she estimates the sampling distribution of the sample median using a process called bootstrapping. In the bootstrapping process, a computer program performs the following steps:

  • Take a random sample, with replacement, of size 50 from the original sample.

  • Calculate and record the median of the sample.

  • Repeat the process to obtain a total of 15,000 medians.
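The three steps above can be sketched in a few lines of Python. This is only an illustration: the function name `bootstrap_medians` and the prices in `example_sample` are made up for the sketch and are not Emma's data or the program she used.

```python
import random
import statistics


def bootstrap_medians(sample, n_resamples=15_000, seed=None):
    """Return one median per bootstrap resample of `sample`."""
    rng = random.Random(seed)
    n = len(sample)
    medians = []
    for _ in range(n_resamples):
        # Step 1: draw n values from the original sample, with replacement.
        resample = rng.choices(sample, k=n)
        # Step 2: calculate and record the median of this resample.
        medians.append(statistics.median(resample))
    # Step 3: the loop repeats until n_resamples medians are recorded.
    return medians


# Hypothetical rental prices in dollars (50 values), NOT the data from the problem.
_price_rng = random.Random(0)
example_sample = [_price_rng.randrange(1500, 6000, 25) for _ in range(50)]

example_medians = bootstrap_medians(example_sample, n_resamples=15_000, seed=1)
```

Tallying how often each value appears in `example_medians` (for instance with `collections.Counter`) would give a frequency table of the same kind as the one described next.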

Emma ran the bootstrap process, and the following frequency table is the bootstrap distribution showing her results of generating 15,000 medians.

../../../_images/2019_apstats_frp_06b.png

The bootstrap distribution provides an approximation of the sampling distribution of the sample median. A confidence interval for the median can be constructed using a percentage of the values in the middle of the bootstrap distribution.
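As a rough illustration of "the middle of the bootstrap distribution," the sketch below reads percentiles off a frequency table and then measures how much of the distribution lies between them. The table `example_table` is hypothetical, the helper names are invented for this sketch, and the percentile rule used (smallest value whose cumulative count reaches the requested percentage) is one common convention, assumed here rather than taken from the problem.

```python
def percentile_from_table(table, pct):
    """Smallest value whose cumulative frequency is at least pct percent of the total.

    `table` is a list of (value, count) pairs.
    """
    total = sum(count for _, count in table)
    cumulative = 0
    for value, count in sorted(table):
        cumulative += count
        if cumulative * 100 >= pct * total:
            return value
    raise ValueError("pct must be between 0 and 100")


def percent_between(table, low, high):
    """Percentage of all counts whose value lies in [low, high], endpoints included."""
    total = sum(count for _, count in table)
    inside = sum(count for value, count in table if low <= value <= high)
    return 100 * inside / total


# Hypothetical bootstrap frequency table: (median in dollars, frequency).
example_table = [
    (2400, 30), (2450, 170), (2500, 300),
    (2550, 350), (2600, 120), (2650, 30),
]

p5 = percentile_from_table(example_table, 5)
p95 = percentile_from_table(example_table, 95)
coverage = percent_between(example_table, p5, p95)
```

Because a frequency table groups many identical medians together, the percentage between the two percentiles is generally not exactly 90%, which is why the coverage is computed from the counts rather than assumed.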

  d. Use the frequency table to find the following.

    1. Value of the 5th percentile:

    2. Value of the 95th percentile:

  e. Find the percentage of bootstrap medians in the table that are equal to or between the values found in part d.

  f. Use your values from parts d and e to construct and interpret a confidence interval for the median rental price.