Random Variables#

Definition#

  1. A random variable \mathcal{X} is defined as the number of heads in three coin flips.

    1. List the elements of the sample space for this random variable.

    2. Write the elements of the sample space that belong to the event \mathcal{X}=0 in set notation.

    3. Write the elements of the sample space that belong to the event \mathcal{X}=1 in set notation.

    4. Write the elements of the sample space that belong to the event \mathcal{X}=2 in set notation.

    5. Write the elements of the sample space that belong to the event \mathcal{X}=3 in set notation.

    6. Find the probability distribution P(\mathcal{X}=x) for all values of x.

    7. Using the probability distribution derived in part e, find the expectation E(\mathcal{X}) of the sum of two die rolls.

    8. Using the probability distribution derived in part e, find the variance Var(\mathcal{X}) of the sum of two die rolls.

  2. A random variable \mathcal{X} is defined as the sum of two six-sided die rolls.

    1. List the elements of the sample space for this random variable.

    2. Write the elements of the sample space that belong to the event \mathcal{X}=6 in set notation.

    3. Write the elements of the sample space that belong to the event \mathcal{X}=7 in set notation.

    4. Write the elements of the sample space that belong to the event \mathcal{X}=8 in set notation.

    5. Find the probability distribution P(\mathcal{X}=x) for all values of x.

    6. Using the probability distribution derived in part e, find the expectation E(\mathcal{X}) of the sum of two die rolls.

    7. Using the probability distribution derived in part e, find the variance Var(\mathcal{X}) of the sum of two die rolls.

  3. The probability distribution for \mathcal{X} = \text{number of major defects on a randomly selected toaster oven} is given by the following table,

x

p(x)

0

0.08

1

0.15

2

0.45

3

0.27

4

0.05

Compute the following:

  1. E(\mathcal{X})

  2. Var(\mathcal{X}) directly from the definition.

  3. Var(\mathcal{X}) from the shortcut formula.

  4. sigma_{\mathcal{X}}

Expectations#

  1. In a lottery every week, 2,000,000 tickets are sold for $1 apiece. If 4000 of these tickets pay off $30 each, 500 pay off $800 each, one ticket pays off \$ 1,200,000, and no ticket pays off more than one prize, what is the expected value of the winning amount for a player with a single ticket?

  2. In the Maryland Lottery, players pick six different integers between 1 and 49, the order of selection being irrelevant. The lottery commission then selects six of these numbers at random as the winning numbers. A player wins the grand prize of $1,200,000 if all six numbers that he or she has selected match the winning numbers. The second and third prizes of $800 and $35, respectively, are awarded if exactly five and four of the selected numbers match the winning numbers. Lottery tickets cost $5. What is the expected value of the amount a player wins in one game?

  3. An urn contains five balls, two of which are marked $1 , two marked $5 and one marked $15. A game is played by paying $10 for winning the sum of the amounts marked on two balls selected randomly from the urn. Is this a fair game?

Theorems#

Scalar Properties#

  1. A box contains 10 frisbes of radii 1, 2, …, 10, respectively. What is the expected value of the circumference of a frisbee selected at random from this box?

  2. Margaret Margot owns the boutique bakery Cupcake Connoisseur. She makes cupcakes and sells them for $15 a piece (they are really good cupcakes). After several years of tracking the seasonal variations in cupcake sales, she has estimated the following probability distribution for the number of cupcakes she sells a day.

Number of Cupcakes

P(\mathcal{X}=x_i)

15

0.2

25

0.3

35

0.4

45

0.1

  1. Use this table to find Margaret’s expected revenue over the next month. Assume a month is 30 days.

  2. The monthly cost of running a bakery is $2000. What is Margaret’s expected profit over the next month?

  1. A small market orders copies of a certain magazine for its magazine rack each week. Let \mathcal{X} represent the number of magazine demanded by customers over the course of a week. The probability density for the magazine deman is given by,

x

1

2

3

4

5

6

p(x)

\frac{1}{15}

\frac{2}{15}

\frac{3}{15}

\frac{4}{15}

\frac{3}{15}

\frac{2}{15}

Suppose the store owner actually pays $1.00 for each copy of the magazine and the price to the customers is $2.00. If the magazines left at the end of the week have no salvage value, is it better to order three or four copies of the magazine each week?

Hint

For both three and four copies ordered, express net revenue as a function of demand \mathcal{X} and then compute the expected revenue.

  1. Let \mathcal{X} be the amount of damage in dollars incurred due to flooding in a certain region every year. Possible \mathcal{X} values are $0, $1000, $5000 and $10000, with probability 0.80, 0.1, 0.08 and 0.02, respectively. Seedy Insurance Company, Inc. offers a $500 deductible policy. If the company wishes its expected project to be $100 per policy, what premium amount should it charge?

Linearity of Expectations#

  1. Use the linearity of expectations to solve the following problems.

    1. What is the expected number of heads in three coin flips?

    2. What is the expected sum of outcomes for two die rolls?

  2. Clavius Hecklesnotz is playing a game of chance where he flips 10 fair coins. If Clavius Hecklesnotz gets 1 head, he will be paid $1. If Clavius Hecklesnotz gets 2 heads, he will be paid $2. In general, if Clavius Hecklesnotz gets n heads, he will be paid $*n*. What is the expected value of Clavius Hecklesnotz’ payout?

Hint

This problem can be solved with a Binomial Distribution, but it is much easier to understand through the Linearity of Expectations.

  1. A box contains a yellow ball, an orange ball, a green ball, and a blue ball. Matilda Weierstrauss randomly selects 4 balls from the box (with replacement). What is the expected value for the number of distinct colored balls Matilda Weierstrauss will select?

  2. Every time one of the AP Statistics students buys a Happy Meal from McDonalds, they receive a Strawberry Shortcake plush doll. There are five dolls in total: Strawberry Shortcake, Cherry Jam, Raspberry Torte, Blueberry Muffin and Lemon Meringue. The toys in every Happy Meal are awarded at random. What is the expected number of Happy Meals an AP Statistics Students must buy in order to get all the Strawberry Shortcake plush dolls?

A.P. Exam Practice#

  1. 2019, Free Response, #5

A company that manufactures smartphones developed a new battery that has a longer life span than that of a traditional battery. From the date of purchase of a smartphone, the distribution of the life span of the new battery is approximately normal with mean 30 months and standard deviation 8 months. For the price of $50, the company offers a two-year warranty on the new battery for customers who purchase a smartphone. The warranty guarantees that the smartphone will be replaced at no cost to the customer if the battery no longer works within 24 months from the date of purchase.

  1. In how many months from the date of purchase is it expected that 25 percent of the batteries will no longer work? Justify your answer.

  2. Suppose one customer who purchases the warranty is selected at random. What is the probability that the customer selected will require a replacement within 24 months from the date of purchase because the battery no longer works?

  3. The company has a gain of $50 for each customer who purchases a warranty but does not require a replacement. The company has a loss (negative gain) of $150 for each customer who purchases a warranty and does require a replacement. What is the expected value of the gain for the company for each warranty purchased?

  1. 2015, Free Response, #3

A shopping mall has three automated teller machines (ATMs). Because the machines receive heavy use, they sometimes stop working and need to be repaired. Let the random variable X represent the number of ATMs that are working when the mall opens on a randomly selected day. The table shows the probability distribution of X. Number of ATMs working when

../../../_images/2015_apstats_frp_3.png
  1. What is the probability that at least one ATM is working when the mall opens?

  2. What is the expected value of the number of ATMs that are working when the mall opens?

  3. What is the probability that all three ATMs are working when the mall opens, given that at least one ATM is working?

  4. Given that at least one ATM is working when the mall opens, would the expected value of the number of ATMs that are working be less than, equal to, or greater than the expected value from part b? Explain.

  1. 2014, Free Response, #3

Schools in a certain state receive funding based on the number of students who attend the school. To determine the number of students who attend a school, one school day is selected at random and the number of students in attendance that day is counted and used for funding purposes. The daily number of absences at High School A in the state is approximately normally distributed with mean of 120 students and standard deviation of 10.5 students.

  1. If more than 140 students are absent on the day the attendance count is taken for funding purposes, the school will lose some of its state funding in the subsequent year. Approximately what is the probability that High School A will lose some state funding?

  2. The principals’ association in the state suggests that instead of choosing one day at random, the state should choose 3 days at random. With the suggested plan, High School A would lose some of its state funding in the subsequent year if the mean number of students absent for the 3 days is greater than 140. Would High School A be more likely, less likely, or equally likely to lose funding using the suggested plan compared to the plan described in part a? Justify your choice.

  3. A typical school week consists of the days Monday, Tuesday, Wednesday, Thursday, and Friday. The principal at High School A believes that the number of absences tends to be greater on Mondays and Fridays, and there is concern that the school will lose state funding if the attendance count occurs on a Monday or Friday. If one school day is chosen at random from each of 3 typical school weeks, what is the probability that none of the 3 days chosen is a Tuesday, Wednesday, or Thursday?

  1. 2019, Free Response, #5

A company that manufactures smartphones developed a new battery that has a longer life span than that of a traditional battery. From the date of purchase of a smartphone, the distribution of the life span of the new battery is approximately normal with mean 30 months and standard deviation 8 months. For the price of $50, the company offers a two-year warranty on the new battery for customers who purchase a smartphone. The warranty guarantees that the smartphone will be replaced at no cost to the customer if the battery no longer works within 24 months from the date of purchase.

  1. In how many months from the date of purchase is it expected that 25 percent of the batteries will no longer work? Justify your answer.

  2. Suppose one customer who purchases the warranty is selected at random. What is the probability that the customer selected will require a replacement within 24 months from the date of purchase because the battery no longer works?

  3. The company has a gain of $50 for each customer who purchases a warranty but does not require a replacement. The company has a loss (negative gain) of $150 for each customer who purchases a warranty and does require a replacement. What is the expected value of the gain for the company for each warranty purchased?

  1. 2003, Free Response Form B, #5

../../../_images/2003_apstats_frp_formb_05.png

Contestants on a game show spin a wheel like the one shown in the figure above. Each of the four outcomes on this wheel is equally likely and outcomes are independent from one spin to the next.

  • The contestant spins the wheel.

  • If the result is a skunk, no money is won and the contestant’s turn is finished.

  • If the result is a number, the corresponding amount in dollars is won. The contestant can then stop with those winnings or can choose to spin again, and his or her turn continues.

  • If the contestant spins again and the result is a skunk, all of the money earned on that turn is lost and the turn ends.

  • The contestant may continue adding to his or her winnings until he or she chooses to stop or until a spin results in a skunk.

  1. What is the probability that the result will be a number on all of the first three spins of the wheel?

  2. Suppose a contestant has earned $800 on his or her first three spins and chooses to spin the wheel again. What is the expected value of his or her total winnings for the four spins?

  3. A contestant who lost at this game alleges that the wheel is not fair. In order to check on the fairness of the wheel, the data in the table below were collected for 100 spins of this wheel.

Result

Skunk

$100

$200

$500

Frequency

33

21

20

26

Based on these data, can you conclude that the four outcomes on this wheel are not equally likely? Give appropriate statistical evidence to support your answer.

  1. 2005, Free Response Form B, #2

For an upcoming concert, each customer may purchase up to 3 child tickets and 3 adult tickets. Let C be the number of child tickets purchased by a single customer. The probability distribution of the number of child tickets purchased by a single customer is given in the table below.

c

0

1

2

3

p(c)

0.4

0.3

0.2

0.1

  1. Compute the mean and the standard deviation of C.

  2. Suppose the mean and the standard deviation for the number of adult tickets purchased by a single customer are 2 and 1.2, respectively. Assume that the numbers of child tickets and adult tickets purchased are independent random variables. Compute the mean and the standard deviation of the total number of adult and child tickets purchased by a single customer.

  3. Suppose each child ticket costs $15 and each adult ticket costs $25. Compute the mean and the standard deviation of the total amount spent per purchase.

  1. 2008, Free Response Form B, #5

Flooding has washed out one of the tracks of the Snake Gulch Railroad. The railroad has two parallel tracks from Bullsnake to Copperhead, but only one usable track from Copperhead to Diamondback, as shown in the figure below. Having only one usable track disrupts the usual schedule. Until it is repaired, the washed-out track will remain unusable. If the train leaving Bullsnake arrives at Copperhead first, it has to wait until the train leaving Diamondback arrives at Copperhead.

../../../_images/2008_apstats_frp_formb_05.png

Every day at noon a train leaves Bullsnake heading for Diamondback and another leaves Diamondback heading for Bullsnake.

Assume that the length of time, X, it takes the train leaving Bullsnake to get to Copperhead is normally distributed with a mean of 170 minutes and a standard deviation of 20 minutes.

Assume that the length of time, Y, it takes the train leaving Diamondback to get to Copperhead is normally distributed with a mean of 200 minutes and a standard deviation of 10 minutes.

These two travel times are independent.

  1. What is the distribution of Y - X?

  2. Over the long run, what proportion of the days will the train from Bullsnake have to wait at Copperhead for the train from Diamondback to arrive?

  3. How long should the Snake Gulch Railroad delay the departure of the train from Bullsnake so that the probability that it has to wait is only 0.01?

  1. 2008, Free Response, #3

A local arcade is hosting a tournament in which contestants play an arcade game with possible scores ranging from 0 to 20. The arcade has set up multiple game tables so that all contestants can play the game at the same time; thus contestant scores are independent. Each contestant’s score will be recorded as he or she finishes, and the contestant with the highest score is the winner.

After practicing the game many times, Josephine, one of the contestants, has established the probability distribution of her scores, shown in the table below.

Crystal, another contestant, has also practiced many times. The probability distribution for her scores is shown in the table below.

  1. Calculate the expected score for each player.

  2. Suppose that Josephine scores 16 and Crystal scores 17. The difference (Josephine minus Crystal) of their scores is -1. List all combinations of possible scores for Josephine and Crystal that will produce a difference (Josephine minus Crystal) of -1, and calculate the probability for each combination.

  3. Find the probability that the difference (Josephine minus Crystal) in their scores is -1.

  4. The table below lists all the possible differences in the scores between Josephine and Crystal and some associated probabilities.

Complete the table and calculate the probability that Crystal’s score will be higher than Josephine’s score.

  1. 2010, Free Response Form B, #3

A test consisting of 25 multiple-choice questions with 5 answer choices for each question is administered. For each question, there is only 1 correct answer.

  1. Let \mathcal{X} be the number of correct answers if a student guesses randomly from the 5 choices for each of the 25 questions. What is the probability distribution of \mathcal{X}?

This test, like many multiple-choice tests, is scored using a penalty for guessing. The test score is determined by awarding 1 point for each question answered correctly, deducting 0.25 point for each question answered incorrectly, and ignoring any question that is omitted. That is, the test score is calculated using the following formula.

Score = (1 x number of correct answers) – (0.25 x number of incorrect answers) + (0 x number of omits)

For example, the score for a student who answers 17 questions correctly, answers 3 questions incorrectly, and omits 5 questions is

Score = (1 x 17) - (0.25 x 3) + (0 x 5) = 16.25.

  1. Suppose a student knows the correct answers for 18 questions, answers those 18 questions correctly, and chooses randomly from the 5 choices for each of the other 7 questions. Show that the expected value of the student’s score is 18 when using the scoring formula above.

  2. A score of at least 20 is needed to pass the test. Suppose a student knows the correct answers for 18 questions, answers those 18 questions correctly, and chooses randomly from the 5 choices for each of the other 7 questions. What is the probability that the student will pass the test?

  1. 2012, Free Response, #2

A charity fundraiser has a Spin the Pointer game that uses a spinner like the one illustrated in the figure below.

../../../_images/2012_apstats_frp_02.png

A donation of $2 is required to play the game. For each $2 donation, a player spins the pointer once and receives the amount of money indicated in the sector where the pointer lands on the wheel. The spinner has an equal probability of landing in each of the 10 sectors.

  1. Let X represent the net contribution to the charity when one person plays the game once. Complete the table for the probability distribution of X.

x

$2

$1

-$8

P(x)

  1. What is the expected value of the net contribution to the charity for one play of the game?

  2. The charity would like to receive a net contribution of $500 from this game. What is the fewest number of times the game must be played for the expected value of the net contribution to be at least $500 ?

  3. Based on last year’s event, the charity anticipates that the Spin the Pointer game will be played 1,000 times. The charity would like to know the probability of obtaining a net contribution of at least $500 in 1,000 plays of the game. The mean and standard deviation of the net contribution to the charity in 1,000 plays of the game are $700 and $92.79, respectively. Use the normal distribution to approximate the probability that the charity would obtain a net contribution of at least $500 in 1,000 plays of the game.