Statistics Chapter 6 Normal Distributions

February 11, 2018 | Author: Anonymous | Category: Math, Statistics And Probability, Normal Distribution
Share Embed Donate


Short Description

Download Statistics Chapter 6 Normal Distributions...

Description

Chapter 6 Normal Distributions

What is Normal Distribution?

Characteristics of Normal Distribution

ο‚› The curve is bell-shaped, with the highest point over the mean πœ‡ ο‚› The curve is symmetrical about a vertical line through πœ‡ ο‚› The curve approaches the horizontal axis but never touches or crosses it ο‚› The inflection (transition) points between cupping upward and downward occur above πœ‡ + 𝜎 π‘Žπ‘›π‘‘ πœ‡ βˆ’ 𝜎

Let’s test your knowledge A) Do these distribution s have the same mean? B) Compare the two graphs, what can you tell me about the standard deviation?

Answer

ο‚› A) Both graphs have the same mean ο‚› B) Red curve has a smaller standard deviation because the data is more compressed. Black curve has a bigger standard deviation because the data is more spread out.

Empirical rule

ο‚› For a distribution that is symmetrical and bell-shaped (normal distribution): ο‚› Approximately 68% of the data values will lie within one standard deviation on each side of the mean

ο‚› Approximately 95% of the data values will lie within two standard deviations on each side of the mean ο‚› Approximately 99.7% of the data values will lie within three standard deviations on each side of the mean

Again look at the graph

Group Work

ο‚› Check Chebyshev’s theorem. What are the similarities and differences between them?

Potential Answer

ο‚› Empirical rule gives a stronger statement than Chebyshev’s theorem in it gives definite percentages, not just lower limits.

Group Work: Creating the normal distribution curve

ο‚› The yearly corn per acre on a particular farm is normally distributed with mean = 60 and standard deviation of 7. Create the normal distribution curve.

Answer

ο‚› 49

56

63

60

67

74

81

Group Work: Empirical Rule

ο‚› The playing life of a Sunshine radio is normally distributed with mean 600 hours and standard deviation of 100 hours. What is the probability that a radio selected at random will last from 600 to 700 hours?

ο‚› 34.1%

Technology

ο‚› If you have TI-83/TI-84 ο‚› Press the Y= key. Then, under DISTR, select 1:normalpdf (π‘₯, πœ‡, 𝜎) and fill in desired πœ‡ π‘Žπ‘›π‘‘ 𝜎 values. Press the WINDOW key. Set Xmin to πœ‡ βˆ’ 3𝜎 π‘Žπ‘›π‘‘ π‘‹π‘šπ‘Žπ‘₯ π‘‘π‘œ πœ‡ + 3𝜎. Finally, press the ZOOM key and select option 0: ZoomFit

What are some applications of normal distribution curve?

ο‚› It gives us how reliable are the data

ο‚› Used for control charts

Control Charts

ο‚› It is useful when we are examining data over a period of equally spaced time interval or in some sequential order.

Example of Control Chart

How to Make a control chart of the random variable x ο‚› 1) Find the mean and standard deviation of the x distribution by ο‚› A) Using past data from a period during which the process was β€œin control” or ο‚› B) using specified β€œtarget” values for πœ‡ π‘Žπ‘›π‘‘ 𝜎

ο‚› 2) Create a graph in which the vertical axis represents x values and the horizontal axis represents time. ο‚› 3) Draw a horizontal line at height πœ‡ π‘Žπ‘›π‘‘ β„Žπ‘œπ‘Ÿπ‘–π‘§π‘œπ‘›π‘‘π‘Žπ‘™, π‘‘π‘Žπ‘ β„Žπ‘’π‘‘ π‘π‘œπ‘›π‘‘π‘Ÿπ‘œπ‘™ βˆ’ π‘™π‘–π‘šπ‘–π‘‘ 𝑙𝑖𝑛𝑒𝑠 π‘Žπ‘‘ πœ‡ Β± 2𝜎 π‘Žπ‘›π‘‘ πœ‡ Β± 3𝜎 ο‚› 4) Plot the variable x on the graph in time sequence order. Use line segments to connect the points in time sequence order

Example: Day 1 x

2

3

11 20 25

4

5

6

7

8

9

10 11 12 13 14 15

23

16

19

8

25

17

20

23

29

A) Find the mean and the standard deviation (population) B) Find control limit πœ‡ Β± 2𝜎 C) Find control limit πœ‡ Β± 3𝜎

18

14

10

Answer

ο‚› Mean = 18.53 ο‚› Standard deviation = 5.82 πœ‡ Β± 2𝜎 = 6.89 π‘Žπ‘›π‘‘ 30.17 πœ‡ Β± 3𝜎 = 1.07 π‘Žπ‘›π‘‘ 35.99

Interpreting control chart

ο‚› Ms. Tamara of the Antlers Lodge examines the control chart for housekeeping. During the staff meeting, she makes recommendations about improving service or, if all is going well, she gives her staff a well-deserved β€œpat in the back”. Determine if the housekeeping process is out of control

Answer

ο‚› Graph to the left: there are 9 consecutive days on one side of the mean. It is a warning signal. It would mean that the mean is slowly drifting upward from 19.3. ο‚› Graph to the right: We have data value beyond πœ‡ + 3𝜎. We have two of three data values beyond πœ‡ βˆ’ 2𝜎. Both are warning signals.

Homework Practice

ο‚› Pg 244 #1-7, 11,13

Standard Units and Areas Under the Standard Normal Distribution

Note:

ο‚› Normal distributions vary from one another in two ways: The mean πœ‡ may be located anywhere on the x axis, and the bell shape may be more or less spread according to the size of the standard deviation 𝜎

Scenario ο‚› Suppose you have two students comparing their test result in two different classes. ο‚› Jack got a 76 while the class average was a 66 ο‚› Jill got an 82 while the class scored 72 ο‚› Both scored 10 points above the mean. ο‚› How do we determine who did better in respect to their peers in the class?

The Answer

ο‚› You have to use the z test or z score because you want to know the position in term of the standard deviation away from the mean.

What is z test or z score?

ο‚› Z value or z score gives the number of standard deviations between the original measurement x and the mean πœ‡ of the x distribution

𝑧 =

π‘₯βˆ’πœ‡ 𝜎

Note:

ο‚› Unless otherwise stated, in the remainder of the book we will take the word average to be either the sample arithmetic mean or population mean.

Calculating z scores ο‚› Suppose the company states that their average large pizza is 10 inches in diameter and the standard deviation is 0.7 inches. A customer ordered a pizza and found out it is of 7 inches. Assume the pizza follows a normal distribution. If the size of the pizza is below three standard deviation, the company would be in danger of getting customer complaints and have a bad name in its company. ο‚› How many standard deviation s from the mean is 7? Is the company going to be in trouble?

Answer

𝑧 =

π‘₯βˆ’πœ‡ 𝜎

=

7βˆ’10 .7

= βˆ’4.28

ο‚›It is 4.28 below the mean, and therefore the company is in trouble.

Group Work

ο‚› (Fake Data) National weigh average for men at 5’8 is 139 lbs with a standard deviation of 15.7. If you are 5’8 and weigh 171 lbs. Find the z score. Should you be concerned?

Answer

ο‚› 𝑧=

π‘₯βˆ’πœ‡ 𝜎

=

171βˆ’139 15.7

= 2.03

ο‚› Answer may vary depending how you think of it. If you consider outlier is when z=2.5 or more then no, but if you say that he is not with the 95% (within 2 standard deviation) then he should. (based on empirical rule)

Question:

ο‚› How would you find the raw score x if you are given the z score, mean and standard deviation?

Answer

ο‚›π‘₯ = π‘§πœŽ + πœ‡

Group Work ο‚› Based on your experience, it takes you on average of 15 minutes to walk to school with standard deviation of 2 minutes. ο‚› A)On one particular day, it took you 12 minutes to get to school. What is the z score? Is the z value positive or negative? Why? ο‚› B) What would be the commuting time corresponding to a standard score of z=-2.5?

Answer

ο‚› A) z=-1.5. It is negative because it is shorter than the expected time. ο‚› B) 10 minutes

Standard normal distribution

ο‚› Standard normal distribution is a normal distribution with mean πœ‡ = 0 and standard deviation 𝜎 = 1

Area under the curve. The left tail rule

ο‚› Look at Appendix II Table 5 on A22

Group Activity ο‚› What is the value for and sketch the graph: ο‚› A) z=1.18 ο‚› B) z=-2.35 ο‚› C) z=0.56 ο‚› D) z=-3.34

Group Activity: challenge ο‚› What’s the value for: ο‚› A) z>1.25 ο‚› B) values in between z=1.21 and z=2.36 ο‚› C) values to the right of 0.95 ο‚› D) values in between z=-2.75 and z=1.38

Homework practice

ο‚› Pg 256 #1-48 eoe

Areas under any normal curve

How do you find the z score/z value?

Answer

ο‚› 𝑧=

π‘₯βˆ’πœ‡ 𝜎

Example:

ο‚› Let x have a normal distribution with πœ‡ = 7 π‘Žπ‘›π‘‘ 𝜎 = 3. Find the probability that an x value selected at random is between (5 and 8). 𝑃(5 ≀ π‘₯ ≀ 8)

Answer ο‚› You have to convert the x into z score:

ο‚› 𝑍=

5βˆ’7 3

= βˆ’3

2

ο‚› 𝑍=

8βˆ’7 3

=3

1

2

ο‚› 𝑃 𝑧 = βˆ’ 3 = .2546 1

ο‚› 𝑃 𝑧 = 3 = .6293 ο‚› .6293-.2546=.3747

Group Work

ο‚› A typical iphone have a battery life that is normally distributed with a mean of 18 hours with a standard deviation of 2 hours. What is the probability that an iphone will have a battery life that is greater than 24 hours?

Answer

ο‚› 𝑃 π‘₯ > 24 = 𝑃 𝑧 > 3 =.0013

Tech notes finding area under the curve in between z values

ο‚› For TI 83/TI 84

ο‚› Press DISTR key, select 2:normalcdf (π‘™π‘œπ‘€π‘’π‘Ÿ π‘π‘œπ‘’π‘›π‘‘ 𝑧 π‘£π‘Žπ‘™π‘’π‘’, π‘’π‘π‘π‘’π‘Ÿ π‘π‘œπ‘’π‘›π‘‘ 𝑧 π‘£π‘Žπ‘™π‘’π‘’, πœ‡, 𝜎)

Find x given probability

ο‚› A company sells a hybrid battery that has a mean life of 8 years with standard deviation of 2 years. How long can the company guarantee its warranty such that they do not want to refund on more than 5% of the hybrid battery?

Answer

ο‚› Since refund only occur when it falls below the mean. It is on the left side of the normal distribution curve. When is z such that it is .05? ο‚› Z=-1.645 π‘₯βˆ’πœ‡

ο‚› Using the formula 𝑧 = 𝜎 = βˆ’1.645 = 4.71, 4.71 π‘¦π‘’π‘Žπ‘Ÿπ‘ . π‘œπ‘Ÿ 55 π‘šπ‘œπ‘›π‘‘β„Žπ‘ 

π‘₯βˆ’8 2

,π‘₯ =

Group Work

ο‚› A company sells a product that has a mean shell life of 60 month with the standard deviation of 5 months. How long can the company guarantee its warranty such that they do not want to refund on more than 7% of the product?

Answer

ο‚› you have to use z=-1.48 ο‚› -1.48*5+60=52.6 months

Example: Find z

ο‚› Find the z value such that 90% of the area under the standard normal curve lies between –z and z

Answer

ο‚›

1βˆ’.90 2

= .05, π‘ π‘œ 𝑧 = 1.645

ο‚› Z values between -1.645 and 1.645

Group work

ο‚› Find the z value such that 80% of the area under the standard normal curve lies between –z and z

Group work

ο‚› Find the z value such that 75% of the area under the standard normal curve lies between –z and z

Tech note for finding x value

ο‚› Example find an x value from a normal distribution with mean 40 and standard deviation 5 such that 97% of the area lies to left of x ο‚› TI 83/TI 84: press DISTR key and select 3:invNorm(π‘Žπ‘Ÿπ‘’π‘Ž, πœ‡, 𝜎)

Check for normality ο‚› How do we tell if data follow a normal distribution?

ο‚› 1) Make a histogram, it should be roughly bell-shaped ο‚› 2) Outliers. Make a box-and-whiskers plot (5 number summary) ο‚› Outliers are either greater than Q3+1.5(IQR) or below Q1-1.5(IQR)

ο‚› 3) Skewness: Pearson’s index ο‚› π‘ƒπ‘’π‘Žπ‘Ÿπ‘ π‘œπ‘›β€² 𝑠 𝑖𝑛𝑑𝑒π‘₯ =

3 π‘₯βˆ’π‘šπ‘’π‘‘π‘–π‘Žπ‘› 𝑠

,

ο‚› π‘›π‘œ π‘ π‘˜π‘’π‘€π‘›π‘’π‘ π‘  𝑖𝑓 𝑖𝑛𝑑𝑒π‘₯ 𝑖𝑠 𝑖𝑛 𝑏𝑒𝑑𝑀𝑒𝑒𝑛 βˆ’ 1 π‘Žπ‘›π‘‘ 1 ο‚› Skewed distributions are not normal

4) Normal quantile plot (or normal probability plot) ο‚›

for TI 83/TI 84, put all the data in the calculator under edit, then turn on STATPLOT then highlight the sixth plot option

ο‚› It should look like a straight line

Check for normality 50

23

17

60

75

73

25

27

30

46

53

13

61

47

87

78

37

81

76

29

36

48

62

55

Homework Practice

ο‚› Pg 267 #1-30 eoe, 31

Normal Approximation to the Binomial Distribution

Normal approximation to the binomial distribution ο‚› Consider a binomial distribution where ο‚› 𝑛 = π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘‘π‘Ÿπ‘–π‘Žπ‘™π‘  ο‚› π‘Ÿ = π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ 𝑠𝑒𝑐𝑐𝑒𝑠𝑠𝑒𝑠 ο‚› 𝑝 = π‘π‘Ÿπ‘œπ‘π‘Žπ‘π‘–π‘™π‘–π‘‘π‘¦ π‘œπ‘“ 𝑠𝑒𝑐𝑐𝑒𝑠𝑠 π‘œπ‘› π‘Ž 𝑠𝑖𝑛𝑔𝑙𝑒 π‘‘π‘Ÿπ‘–π‘Žπ‘™ ο‚› π‘ž = 1 βˆ’ 𝑝 = π‘π‘Ÿπ‘œπ‘π‘Žπ‘π‘–π‘™π‘–π‘‘π‘¦ π‘œπ‘“ π‘“π‘Žπ‘–π‘™π‘’π‘Ÿπ‘’ π‘œπ‘› π‘Ž 𝑠𝑖𝑛𝑔𝑙𝑒 π‘‘π‘Ÿπ‘–π‘Žπ‘™ ο‚› If np>5 and nq>5, then r has a binomial distribution that is approximated by a normal distribution with

ο‚› πœ‡ = 𝑛𝑝 π‘Žπ‘›π‘‘ 𝜎 = π‘›π‘π‘ž ο‚› As n increases, the approximation becomes better

Example:

ο‚› Find the 𝑛, 𝑝, π‘ž, πœ‡, 𝜎 of the binomial distributions for which p=0.25, q=0.75, and the number of trials is first n=3, n=10, then n=25 and then n=50

Answer

ο‚› Find the 𝑛, 𝑝, π‘ž, πœ‡, 𝜎

Important note about binomial approximation

ο‚› Remember how to graph the binomial distribution? Remember how you have to incorporate the 0.5?

ο‚› For example when r=5, you draw the binomial distribution from 4.5 to 5.5.

How to make the Continuity Correction ο‚› Convert the discrete random variable r (number of successes) to the continuous normal random variable x by doing the following: ο‚› 1) If r is a left point of an interval, subtract 0.5 to obtain the corresponding normal variable x; that is, x=r-0.5 ο‚› 2) If r is a right point of an interval, add 0.5 to obtain the correspoinding normal variable x: that is, x=r+0.5 ο‚› For instance, 𝑃 6 ≀ π‘Ÿ ≀ 10 , where r is a binomial random variable is approximated by 𝑃(5.5 ≀ π‘₯ ≀ 10.5), where x is the corresponding normal random variable.

Normal Approximation ο‚› The owner of a new apartment building must install 25 water heaters. From past experience in other apartment buildings, she knows that Quick Hot is a good brand. A Quick hot heater is guaranteed for 5 years only, but from the owner’s past experience, she knows that the probability it will last 10 years is 0.25.

ο‚› A) What is the probability that 8 or more of the 25 water heaters will last at least 10 years? ο‚› B) How does this result compare with the result we can obtain by using the formula for the binomial probability distribution with n=25 and p=0.25?

ο‚› C) How do the results of part a and b compare?

Answer ο‚› A) since n=25 and p=0.25, r is the binomial random variable corresponding to the number of successes out of n=25. We want to find 𝑃(π‘Ÿ β‰₯ 8). ο‚› πœ‡ = 𝑛𝑝 = 25 βˆ— 0.25 = 6.25 ο‚› 𝜎 = π‘›π‘π‘ž = 25 βˆ— 0.25 βˆ— 0.75 = 2.17 ο‚› 𝑠𝑖𝑛𝑐𝑒 π‘Ÿ β‰₯ 8, π‘‘β„Žπ‘’π‘Ÿπ‘’π‘“π‘œπ‘Ÿπ‘’ π‘₯ β‰₯ 7.5

ο‚› 𝑧=

π‘₯βˆ’πœ‡ 𝜎

=

7.5βˆ’6.25 2.17

β‰ˆ 0.58 so, 1-0.7190=0.2810

ο‚› B) Using TI83/TI84 you get 0.2735 ο‚› C) The error is only .0075, which is small.

Group Work: ο‚› The Denver Post stated that 80% of all new products introduced in grocery stores fail (are taken off the market) within 2 years. If a grocery store chain introduces 66 new products, what is the probability that within 2 years ο‚› A) 47 or more fail? ο‚› B) 58 of fewer fail? ο‚› C) 15 or more succeed? ο‚› D) fewer than 10 succeed?

Answer ο‚› π‘›π‘ž = 52.8>5, 𝑛𝑝 = 13.2>5, yes I can use normal approx to binomial distribution ο‚› 1) π‘›π‘ž = 52.8 𝜎 = 3.25, 𝑧 =

.0262=.9738 ο‚› 2) π‘›π‘ž = 52.8 𝜎 = 3.25, 𝑧 =

46.5βˆ’52.8 =-1.94 3.25

= .0262, so 1-

58.5βˆ’52.8 =1.75= 3.25

ο‚› 3) 𝑛𝑝 = 13.2, 𝜎 = 3.25, 𝑧 = .3446

14.5βˆ’13.2 3.25

ο‚› 4) 𝑛𝑝 = 13.2, 𝜎 = 3.25, 𝑧 =

9.5βˆ’13.2 3.25

.9599

= 0.4 = .6554, 1 βˆ’ .6554 =

= βˆ’1.14 = .1271

Group Work

ο‚› It is estimated that 3.5% of the general population will live past their 90th birthday. In a graduating class of 753 high school seniors, what is the probability that ο‚› A) 15 or more will live beyond their 90th birthday? ο‚› B) 30 or more will live beyond their 90th birthday? ο‚› C) Between 25 and 35 will live beyond their 90th birthday? ο‚› D) more than 40 will live beyond their 90th birthday?

Answer ο‚› π‘›π‘ž = 726.645>5, 𝑛𝑝 = 26.355>5, yes I can use normal approx to binomial distribution ο‚› 1) 𝑛𝑝 = 26.355 𝜎 = 5.04, 𝑧 = .0094=.9906 ο‚› 2) 𝑛𝑝 = 26.355 𝜎 = 5.04, 𝑧 =

.7324=.2676 ο‚› 3)𝑛𝑝 = 26.355 𝜎 = 5.04, 𝑧 = 35.5βˆ’26.355 5.04

14.5βˆ’26.355 =-2.35=.0094 5.04 29.5βˆ’26.355 =.62=.7324 5.04

so, 1-

so, 1-

24.5βˆ’26.355 =-.37=.3557, 5.04

,𝑧 =

= 1.81 = .9649, π‘ π‘œ .9649 βˆ’ .3557 = .6092

ο‚› 4)𝑛𝑝 = 26.355 𝜎 = 5.04, 𝑧 = .9975=.0025

40.5βˆ’26.355 =2.81=.9975=15.04

Use the examples and my answers to find out the pattern of the wording!

Homework Practice

ο‚› Pg 278 #1-14 eoo

View more...

Comments

Copyright οΏ½ 2017 NANOPDF Inc.
SUPPORT NANOPDF