Methods for Describing Sets of Data

Size: px
Start display at page:

Download "Methods for Describing Sets of Data"

Transcription

1 6 Chapter Methods for Describig Sets of Data Chapter. I a bar graph, a bar or rectagle is draw above each class of the qualitative variable correspodig to the class frequecy or class relative frequecy. I a pie chart, each slice of the pie correspods to the relative frequecy of a class of the qualitative variable..4 First, we fid the frequecy of the grade A. The sum of the frequecies for all 5 grades must be. Therefore, subtract the sum of the frequecies of the other 4 grades from. The frequecy for grade A is: ( ) = 184 = 16 To fid the relative frequecy for each grade, divide the frequecy by the total sample size,. The relative frequecy for the grade B is 36/ =.18. The rest of the relative frequecies are foud i a similar maer ad appear i the table: Grade o Statistics Exam Frequecy Relative Frequecy A: B: C: D: F: Below Total 1..6 a. The graph show is a pie chart. b. The qualitative variable described i the graph is opiio o library importace. c. The most commo opiio is more importat, with 46.% of the respoders idicatig that they thik libraries have become more importat. d. Usig MINITAB, the Pareto diagram is: 5 Chart of Percet 4 Percet 3 1 More Same Importace Less Of those who respoded to the questio, almost half (46%) believe that libraries have become more importat to their commuity. Oly 18% believe that libraries have become less importat. Copyright 17 Pearso Educatio, Ic.

2 Methods for Describig Sets of Data 7.8 a. From the pie chart, 5.4% or.54 of adults livig i the U.S. use the iteret ad pay to dowload music. From the data, 56 out of 1,3 adults or 56/1,3 =.54 of adults i the U.S. use the iteret ad pay to dowload music. These two results agree. b. Usig MINITAB, a pie chart of the data is: Pie Chart of Dowload-Music Category Pay No Pay No Pay 33.% Pay 67.%.1 a. Data were collected o 3 questios. For questios 1 ad, the resposes were either yes or o. Sice these are ot umbers, the data are qualitative. For questio 3, the resposes iclude character couts, roots of empathy, teacher desiged, other, ad oe. Sice these resposes are ot umbers, the data are qualitative. b. Usig MINITAB, bar charts for the 3 questios are: Chart of Classroom Pets Cout 3 1 Yes Classroom Pets No Copyright 17 Pearso Educatio, Ic.

3 8 Chapter Chart of Pet Visits 4 3 Cout 1 Yes Pet Visits No Chart of Educatio 3 5 Cout Character Couts Roots of Empathy Teacher desiged Educatio Other Noe c. May differet thigs ca be writte. Possible aswers might be: Most of the classroom teachers surveyed (61/ ) keep classroom pets. A little less tha half of the surveyed classroom teachers (35 / ) allow visits by pets..1 Usig MINITAB, the pie chart is: Pie Chart of Loc Rural 5.7% Category Urba Suburba Rural Suburba 3.8% Urba 61.5% Copyright 17 Pearso Educatio, Ic.

4 Methods for Describig Sets of Data 9.14 a. The two qualitative variables graphed i the bar charts are the occupatioal titles of cla idividuals i the cotiued lie ad the occupatioal titles of cla idividuals i the dropout lie. b. I the Cotiued Lie, about 63% were i either the high or the middle grade. Oly about % were i the oofficial category. I the Dropout Lie, oly about % were i either the high or middle grade while about 64% were i the oofficial category. The percetages i the low grade ad provicial official categories were about the same for the two lies..16 Usig MINITAB, the Pareto chart is:.14 Chart of Allocatio.1 Relative frequecy #5 #8 #6 #7 #1 #11 # #3 #4 #9 #1 Track Proportio withi all data. From the graph, it appears that tracks #5 ad #8 were over-utilized ad track #1 is uderutilized..18 a. Usig MINITAB, the Pareto chart of the total aual shootigs ivolvig the Bosto street gag is:.5 Chart of Total Shootigs Proportio of Total Shootigs Year Proportio withi all data. Copyright 17 Pearso Educatio, Ic.

5 1 Chapter b. Usig MINITAB, the Pareto chart of the aual shootigs of the Bosto gag members is:.5 Chart of Gag Member Shootigs Proportio of Gag Members Year Proportio withi all data. c. Because the proportio of shootigs per year dropped drastically after 7 for both the total aual shootigs ad aual shootigs of the Bosto street gag members, it appears that Operatio Ceasefire was effective.. Usig MINITAB, the side-by-side bar graphs showig the distributio of dives for the three match situatios are: Chart of Team behid, Tied, Team ahead Left Middle Right Team behid Tied Proportio.8 Team ahead Left Middle Right Dive Proportio withi all data. From the graphs, it appears that whe a team is tied or ahead, there is o differece i the proportio of times the goal-keeper dives right or left. However, if the team is behid, the goal-keeper teds to dive right much more frequetly tha left. Copyright 17 Pearso Educatio, Ic.

6 Methods for Describig Sets of Data 11. Usig MINITAB, a bar graph of the data is: 9 Chart of Measure Freq Total Visitors Payig visitors Big shows Measure Fuds raised Members The researcher cocluded that there is a large amout of variatio withi the museum commuity with regards to... performace measuremet ad evaluatio. From the data, there are oly 5 differet performace measures. I would ot say that this is a large amout. Withi these 5 categories, the umber of times each is used does ot vary that much. I would disagree with the researcher. There is ot much variatio..4 Usig MINITAB a bar chart for the Extict status versus flight capability is: 8 Chart of Extict, Flight Cout Flight Extict No Abset Yes No No Yes No Yes Yes It appears that extict status is related to flight capability. For birds that do have flight capability, most of them are preset. For those birds that do ot have flight capability, most are extict. Copyright 17 Pearso Educatio, Ic.

7 1 Chapter The bar chart for Extict status versus Nest Desity is: 6 Chart of Extict, Nest Desity 5 4 Cout 3 1 Nest Desity Extict H Abset L H No L H Yes L It appears that extict status is ot related to est desity. The proportio of birds preset, abset, ad extict appears to be very similar for est desity high ad est desity low. The bar chart for Extict status versus Habitat is: Chart of Extict, Habitat 4 3 Cout 1 Habitat Extict A TA Abset TG A TA No TG A TA Yes TG It appears that the extict status is related to habitat. For those i aerial terrestrial (TA), most species are preset. For those i groud terrestrial (TG), most species are extict. For those i aquatic, most species are preset..6 The differece betwee a bar chart ad a histogram is that a bar chart is used for qualitative data ad a histogram is used for quatitative data. For a bar chart, the categories of the qualitative variable usually appear o the horizotal axis. The frequecy or relative frequecy for each category usually appears o the vertical axis. For a histogram, values of the quatitative variable usually appear o the horizotal axis ad either frequecy or relative frequecy usually appears o the vertical axis. The quatitative data are grouped ito itervals which appear o the horizotal axis. The umber of observatios appearig i each iterval is the graphed. Bar charts usually leave spaces betwee the bars while histograms do ot..8 I a histogram, a class iterval is a rage of umbers above which the frequecy of the measuremets or relative frequecy of the measuremets is plotted. Copyright 17 Pearso Educatio, Ic.

8 Methods for Describig Sets of Data 13.3 a. This is a frequecy histogram because the umber of observatios is displayed rather tha the relative frequecies. b. There are 14 class itervals used i this histogram. c. The total umber of measuremets i the data set is Usig MINITAB, the relative frequecy histogram is:.5. Relative frequecy Class Iterval a. Usig MINITAB, the relative frequecy histogram is: Histogram of RDER.5. Relative Frequecy RDER Value b. From the graph, the proportio of subjects with RDER values betwee 75 ad 15 is about.18. The exact proportio is13 / b. From the graph, the proportio of subjects with RDER values below 15 is about / The exact proportio is.36 a. Because the label o the vertical axis is Percet, this is a relative frequecy histogram. Copyright 17 Pearso Educatio, Ic.

9 14 Chapter b. From the graph, the percetage of the 99 seior maagers who reported a high level of support for corporate sustaiability is about %..38 Usig MINITAB, the stem-ad-leaf display is: Stem-ad-Leaf Display: Depth Stem-ad-leaf of Depth N = 18 Leaf Uit = (3) The data i the stem-ad-leaf display are displayed to 1 decimal place while the actual data is displayed to decimal places. To 1 decimal place, there are 3 umbers that appear twice 14., 15.7, ad However, to decimal places, oe of these umbers are the same. Thus, o molar depth occurs more frequetly i the data..4 a. Usig MINITAB, the dot plot of the hoey dosage data is: Dotplot of Hoey Dosage Group Improvemet Score b. Both 1 ad 1 occurred 6 times i the hoey dosage group. c. From the graph i part c, 8 of the top 11 scores (7.7%) are from the hoey dosage group. Of the top 3 scores, 18 (6%) are from the hoey dosage group. This supports the coclusios of the researchers that hoey may be a preferable treatmet for the cough ad sleep difficulty associated with childhood upper respiratory tract ifectio. Copyright 17 Pearso Educatio, Ic.

10 Methods for Describig Sets of Data 15.4 a. Usig MINITAB, the stem-ad-leaf display is: Stem-ad-Leaf Display: Spider Stem-ad-leaf of Spider N = 1 Leaf Uit = (3) b. The spiders with a cotrast value of 7 or higher are i bold type i the stem-ad-leaf display i part a. There are 3 spiders i this group. c. The sample proportio of spiders that a bird could detect is 3 /1.3. Thus, we could ifer that a bird could detect a crab-spider sittig o the yellow cetral part of a daisy about 3% of the time..44 a. A stem-ad-leaf display of the data usig MINITAB is: Stem-ad-Leaf Display: FNE Stem-ad-leaf of FNE N = 5 Leaf Uit = () b. The umbers i bold i the stem-ad-leaf display represet the bulimic studets. Those umbers ted to be the larger umbers. The larger umbers idicate a greater fear of egative evaluatio. Thus, the bulimic studets ted to have a greater fear of egative evaluatio. c. A measure of reliability idicates how certai oe is that the coclusio draw is correct. Without a measure of reliability, ayoe could just guess at a coclusio. Copyright 17 Pearso Educatio, Ic.

11 16 Chapter.46 a. Usig MINITAB, histograms of the two sets of SAT scores are: Histogram of TOT11, TOT14 TOT TOT Frequecy It appears that the distributios of both sets of scores are somewhat skewed to the right. Although the distributios are ot idetical for the two years, they are similar. b. Usig MINITAB, a histogram of the differeces of the 14 ad 11 SAT scores is: 35 Histogram of Diff 3 5 Frequecy Diff -5 5 c. It appears that there are more differeces above tha below. Thus, it appears that i geeral, the 14 SAT scores are higher tha the 11 SAT scores. However, there are may differeces that are very close to. d. Wyomig had the largest improvemet i SAT scores from 11 to 14, with a icrease of 65 poits. Copyright 17 Pearso Educatio, Ic.

12 Methods for Describig Sets of Data Usig MINITAB, the side-by-side histograms are: Histogram of ZETA without, ZETA with GYPSUM ZETA without ZETA with GYPSUM Frequecy The additio of calcium/gypsum icreases the values of the zeta potetial of silica. All of the values of zeta potetial for the specimes cotaiig calcium/gypsum are greater tha all of the values of zeta potetial for the specimes without calcium/gypsum..5 A measure of cetral tedecy measures the ceter of the distributio while measures of variability measure how spread out the data are..5 A skewed distributio is a distributio that is ot symmetric ad ot cetered aroud the mea. Oe tail of the distributio is loger tha the other. If the mea is greater tha the media, the the distributio is skewed to the right. If the mea is less tha the media, the distributio is skewed to the left..54 a. For a distributio that is skewed to the left, the mea is less tha the media. b. For a distributio that is skewed to the right, the mea is greater tha the media. c. For a symmetric distributio, the mea ad media are equal..56 Assume the data are a sample. The mode is the observatio that occurs most frequetly. For this sample, the mode is 15, which occurs 3 times. The sample mea is: x x The media is the middle umber whe the data are arraged i order. The data arraged i order are: 1, 11, 1, 13, 15, 15, 15, 16, 17, 18, 18. The middle umber is the 6th umber, which is The media is the middle umber oce the data have bee arraged i order. If is eve, there is ot a sigle middle umber. Thus, to compute the media, we take the average of the middle two umbers. If is odd, there is a sigle middle umber. The media is this middle umber. Copyright 17 Pearso Educatio, Ic.

13 18 Chapter A data set with 5 measuremets arraged i order is 1, 3, 5, 6, 8. The media is the middle umber, which is 5. A data set with 6 measuremets arraged i order is 1, 3, 5, 5, 6, 8. The media is the average of the middle two umbers which is 5..6 a. xi i The mea is x 3.3. This is the average umber of sword shafts buried at each grave site. b. To fid the media, the data must be arraged i order. The data arraged i order are: There are a total of 13 observatios, which is a odd umber. The media is the middle umber which is. Half of the grave sites had or fewer sword shafts buried ad half had or more. c. The mode is the umber that occurs most frequetly. I this case, the mode is..6 a. xi i The mea is x This is the Performace 8 8 Axiety Ivetory scale for participats i 8 differet studies. b. To fid the media, the data must be arraged i order. The data arraged i order are: There are a total of 8 observatios, which is a eve umber. The media is the average of the middle umbers which are 4 ad 43. The media is Half of the studies had a PAI scale less tha 4.5 ad half had a value greater tha 4.5. c. If 39 were elimiated, the mea becomes xi i x The data arraged i order are ow: The media is the middle umber which is 43. The mea icreased by.946 while the media oly icreased by a. There are 35 observatios i the hoey dosage group. Thus, the media is the middle umber, oce the data have bee arraged i order from the smallest to the largest. The middle umber is the 18 th observatio which is 11. Copyright 17 Pearso Educatio, Ic.

14 Methods for Describig Sets of Data 19 b. There are 33 observatios i the DM dosage group. Thus, the media is the middle umber, oce the data have bee arraged i order from the smallest to the largest. The middle umber is the 17 th observatio which is 9. c. There are 37 observatios i the cotrol group. Thus, the media is the middle umber, oce the data have bee arraged i order from the smallest to the largest. The middle umber is the 19 th observatio which is 7. d. Sice the media of the hoey dosage group is the highest, the media of the DM groups is the ext highest, ad the media of the cotrol group is the smallest, we ca coclude that the hoey dosage is the most effective, the DM dosage is the ext most effective, ad othig (cotrol) is the least effective..66 a. x 77.7 The mea of the drivig performace idex values is: x The media is the average of the middle two umbers oce the data have bee arraged i order. After arragig the umbers i order, the th ad 1 st umbers are 1.75 ad The media is: The mode is the umber that occurs the most frequetly ad is 1.4. b. The average drivig performace idex is The media is Half of the players have drivig performace idex values less tha ad half have values greater tha Three of the players have the same idex value of 1.4. c. Sice the mea is greater tha the media, the data are skewed to the right. Usig MINITAB, a histogram of the data is: Histogram of Performace 1 8 Frequecy Performace Copyright 17 Pearso Educatio, Ic.

15 Chapter.68 a. The mea umber of at species discovered is: x x The media is the middle umber oce the data have bee arraged i order: 3, 3, 4, 4, 4, 5, 5, 5, 7, 49, 5. The media is 5. The mode is the value with the highest frequecy. Sice both 4 ad 5 occur 3 times, both 4 ad 5 are modes. b. For this case, we would recommed that the media is a better measure of cetral tedecy tha the mea. There are very large umbers compared to the rest. The mea is greatly affected by these umbers, while the media is ot. c. The mea total plat cover percetage for the Dry Steppe regio is: x x The media is the middle umber oce the data have bee arraged i order: 7, 4, 4, 43, 5. The media is 4. The mode is the value with the highest frequecy. Sice 4 occurs times, 4 is the mode. d. The mea total plat cover percetage for the Gobi Desert regio is: x x The media is the mea of the middle umbers oce the data have bee arraged i order: 14, 16,, 3, 3, 56. The media is The mode is the value with the highest frequecy. Sice 3 occurs times, 3 is the mode. e. Yes, the total plat cover percetage distributios appear to be differet for the regios. The percetage of plat coverage i the Dry Steppe regio is much greater tha that i the Gobi Desert regio. Copyright 17 Pearso Educatio, Ic.

16 Methods for Describig Sets of Data 1.7 a. Usig MINITAB, the simple statistics are: Descriptive Statistics: ZETA without, ZETA with GYPSUM N for Variable N Mea Media Mode Mode ZETA without ZETA with GYPSUM For the liquid solutios prepared without calcium/gypsum, the mea zeta potetial measuremet is -5.7, the media is -5.5 ad the mode is -5.. The average zeta potetial measuremet for liquid solutios prepared without calcium/gypsum is Half of the zeta potetial measuremets for liquid solutios prepared without calcium/gypsum are less tha or equal to -5.5 ad half are greater tha The most commo zeta potetial measuremet for liquid solutios prepared without calcium/gypsum is -5.. b. For the liquid solutios prepared with calcium/gypsum, the mea zeta potetial measuremet is , the media is ad the mode is The average zeta potetial measuremet for liquid solutios prepared with calcium/gypsum is Half of the zeta potetial measuremets for liquid solutios prepared with calcium/gypsum are less tha or equal to ad half are greater tha The most commo zeta potetial measuremet for liquid solutios prepared with calcium/gypsum is c. The iterpretatio remais the same as i Exercise.48. The additio of calcium/gypsum icreases the values of the zeta potetial of silica. The mea, media ad mode of the values of zeta potetial for the specimes cotaiig calcium/gypsum are greater tha the mea, media ad mode of the values of zeta potetial for the specimes without calcium/gypsum..7 a. The mea umber of power plats is: xi i x 3.95 The media is the mea of the middle umbers oce the data have bee arraged i order: 1, 1, 1, 1, 1,,, 3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 7, 9, 11 The media is The umber 1 occurs 5 times. The mode is 1. b. Deletig the largest umber, 11, the ew mea is: xi i x Copyright 17 Pearso Educatio, Ic.

17 Chapter The media is the middle umber oce the data have bee arraged i order: 1, 1, 1, 1, 1,,, 3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 7, 9 The media is 3. The umber 1 occurs 5 times. The mode is 1. By droppig the largest measuremet from the data set, the mea drops from 3.95 to The media drops from 3.5 to 3 ad the mode stays the same. c. Deletig the lowest ad highest measuremets leaves the followig: 1, 1, 1,, 3, 3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 7 The ew mea is: xi i x The trimmed mea has the advatage that some possible outliers have bee elimiated..74 The primary disadvatage of usig the rage to compare variability of data sets is that the two data sets ca have the same rage ad be vastly differet with respect to data variatio. Also, the rage is greatly affected by extreme measures..76 The variace of a data set ca ever be egative. The variace of a sample is the sum of the squared deviatios from the mea divided by 1. The square of ay umber, positive or egative, is always positive. Thus, the variace will be positive. The variace is usually greater tha the stadard deviatio. However, it is possible for the variace to be smaller tha the stadard deviatio. If the data are betwee ad 1, the variace will be smaller tha the stadard deviatio. For example, suppose the data set is.8,.7,.9,.5, ad.3. The sample mea is: x x The sample variace is: s x 3. x The stadard deviatio is s Copyright 17 Pearso Educatio, Ic.

18 Methods for Describig Sets of Data 3.78 a. s x x s b. c. s s x 1 x x 17 x s s a. Rage 4 4 s x 8 x b. Rage 6 6 s x 17 x c. Rage 8 ( ) 1 s x 7 x d. Rage ( 3) 5 s x ( 5) x s s s 8.88 s This is oe possibility for the two data sets. Data Set 1:, 1,, 3, 4, 5, 6, 7, 8, 9 Data Set :,, 1, 1,,, 3, 3, 9, 9 The two sets of data above have the same rage = largest measuremet smallest measuremet 9 9. The meas for the two data sets are: x x Copyright 17 Pearso Educatio, Ic.

19 4 Chapter x x The dot diagrams for the two data sets are show below. Dotplot of Data x-bar1 Group 1 x-bar 4 Data a. s x 8 x s b. c. d. x 55 s x square feet s feet s x ( 15) x x 4 s s x square ouces s ouce.86 a. The rage is the differece betwee the largest ad smallest umbers. For this data set, the rage is b. The sample variace is s x 4 x c. The sample stadard deviatio is s Copyright 17 Pearso Educatio, Ic.

20 Methods for Describig Sets of Data 5 d. Both the rage ad the stadard deviatio have the same uits of measure as the origial variable..88 a. The rage is the differece betwee the largest ad smallest observatios ad is meters. b. The variace is: s x 16.3 x square meters c. The stadard deviatio is s meters..9 a. For Group A, the rage is 67.. From the pritout, the rage is b. For Group A, the stadard deviatio is From the pritout, s s c. Group B has the more variable permeability data. Group B has the largest rage, the largest variace ad the largest stadard deviatio..9 a. Rage s x 79 x b. Droppig the largest measuremet: Rage 91 8 x 68 x 334 s s s s s By droppig the largest observatio from the data set, the rage decreased from 1 to 8, the variace decreased from 7.54 to 5.35 ad the stadard deviatio decreased from.743 to.44. c. Droppig the largest ad smallest measuremets: Rage 91 8 x 67 x 333 s s s By droppig the largest ad smallest observatios from the data set, the rage decreased from 1 to 8, the variace decreased from 7.54 to ad the stadard deviatio decreased from.743 to.18. Copyright 17 Pearso Educatio, Ic.

21 6 Chapter.94 The Empirical Rule applies oly to data sets that are moud-shaped that are approximately symmetric, with a clusterig of measuremets about the midpoit of the distributio ad that tail off as oe moves away from the ceter of the distributio..96 Sice o iformatio is give about the data set, we ca oly use Chebyshev's rule. a. At least of the measuremets will fall betwee x s ad x s. b. At least 3/4 or 75% of the measuremets will fall betwee x s ad x s. c. At least 8/9 or 89% of the measuremets will fall betwee x 3s ad x 3s..98 a. b. x 6 x s x 6 x Iterval x Number of Measuremets i Iterval s s 1.83 Percetage s, or (6.41, 1.17) / 5.7 or 7% x s, or (4.58, 11.9) 4 4 / 5.96 or 96% x 3s, or (.75, 13.73) 5 5 / 5 1 or 1% c. The percetages i part b are i agreemet with Chebyshev's rule ad agree fairly well with the percetages give by the Empirical Rule. d. Rage s rage / 4 7 / The rage approximatio provides a satisfactory estimate of s..1 a. Usig MINITAB, the histogram is: 1 Histogram of Wheels 1 8 Frequecy Wheels Copyright 17 Pearso Educatio, Ic.

22 Methods for Describig Sets of Data 7 Although the distributio is somewhat moud-shaped, the distributio is skewed to the right. b. Usig MINITAB, the mea ad stadard deviatio are: Descriptive Statistics: Wheels Variable N Mea StDev Wheels c. x s 3.14 (1.371) (.47,5.956) d. Accordig to Chebyshev s rule, at least 3 of the measuremets will fall withi 4 stadard deviatios of the mea. e. Accordig to the Empirical Rule, approximately 95% of the measuremets will fall withi stadard deviatios of the mea. f. Twety-six of the twety-eight observatios fall withi the iterval. The proportio is The Empirical Rule does provide a good estimate of the proportio. The 8 actual percetage is 9.9% which is close to 95%..1 a. If the data are symmetric ad moud shaped, the the Empirical Rule will describe the data. About 95% of the observatios will fall withi stadard deviatio of the mea. The iterval two stadard deviatios below ad above the mea is x s 39 (6) 391 (7,51). This rage would be 7 to 51. b. To fid the umber of stadard deviatios above the mea a score of 51 would be, we subtract the mea from 51 ad divide by the stadard deviatio. Thus, a score of 51 is stadard deviatios above the mea. From the Empirical Rule, about.5 of 6 the drug dealers will have WR scores above 51. c. By the Empirical Rule, about 99.7% of the observatios will fall withi 3 stadard deviatios of the mea. Thus, early all the scores will fall withi 3 stadard deviatios of the mea. The iterval three stadard deviatios below ad above the mea is x 3s 39 3(6) 3918 (1,57). This rage would be 1 to a. Because the histogram i Exercise.34 is skewed to the right, Chebyshev s rule is more appropriate for describig the distributio of the RDER values. b. The iterval x s is (63.4) ( 48.9, 4.67). At least ¾ or 75% of the observatios will be betwee ad a. By Chebyshev s rule, at least 75% of the observatios will fall withi stadard deviatios of the mea. This iterval is x s.9 (1.1).9. ( 1.3, 3.1). Copyright 17 Pearso Educatio, Ic.

23 8 Chapter b. No. A value of 7 would be (7.9) / stadard deviatios above the mea. This would be very uusual..18 a. There are observatios with missig values for egg legth, so there are oly 13 useable observatios. x 7,885 x s x (7,885) x 77, , , s s 1, b. The data are ot symmetrical or moud-shaped. Thus, we will use Chebyshev s Rule. We kow that there are at least 8/9 or 88.9% of the observatios withi 3 stadard deviatios of the mea. Thus, at least 88.9% of the observatios will fall i the iterval: x 3s (43.99) ( 71.3, 19.69) Sice it is impossible to have egative egg legths, at least 88.9% of the egg legths will be betwee ad a. The mea ad stadard deviatio for Group A are 73.6 ad The histogram of the data from Group A is skewed to the right so Chebyshev s rule is more appropriate. We kow at least 8/9 or 88.9% of the observatios will fall withi 3 stadard deviatios of the mea. This iterval is x 3s (14.48) (3.18, 117.6). b. The mea ad stadard deviatio for Group B are ad The histogram of the data from Group B is skewed to the left so Chebyshev s rule is more appropriate. We kow at least 8/9 or 88.9% of the observatios will fall withi 3 stadard deviatios of the mea. This iterval is x 3s (1.97) (6.63, ). c. The mea ad stadard deviatio for Group C are 83.7 ad.5. The histogram of the data from Group C is skewed to the right so Chebyshev s rule is more appropriate. We kow at least 8/9 or 88.9% of the observatios will fall withi 3 stadard deviatios of the mea. This iterval is x 3s (.5) (.9, 143.). d. It appears that weatherig type B results i faster decay..11 To decide which group the patiet is most likely to come from, we will compute the z-score for each group. Copyright 17 Pearso Educatio, Ic.

24 Methods for Describig Sets of Data 9 x Group T: z x Group V: z x Group C: z The patiet is most likely to have come from Group T. The z-score for Group T is z This would ot be a uusual z-score if the patiet was i Group T. The z-scores for the other groups are both greater tha. We kow that z-scores greater tha are rather uusual..114 a. The 5 th percetile is also called the media. b. The Q L is the lower quartile. This is also the 5 th percetile or the score which has 5% of the observatios less tha it. c. The Q U is the upper quartile. This is also the 75 th percetile or the score which has 75% of the observatios less tha it..116 For moud-shaped distributios, we ca use the Empirical Rule. About 95% of the observatios will fall withi stadard deviatios of the mea. Thus, about 95% of the measuremets will have z-scores betwee - ad..118 We first compute z-scores for each x value. a. b. c. d. x 1 5 z 5 x 14 z 3 1 x z 1 x 1 5 z The above z-scores idicate that the x value i part a lies the greatest distace above the mea ad the x value of part b lies the greatest distace below the mea..1 The mea score is 153. This is the arithmetic average score of U.S. twelfth graders o the mathematics assessmet test. The 5 th percetile score is 111. This idicates that 5% of the U.S. twelfth graders scored 111 or lower o the assessmet test. The 75 th percetile score is 177. This idicates that 75% of the U.S. twelfth graders scored 177 or lower o the Copyright 17 Pearso Educatio, Ic.

25 3 Chapter assessmet test. The 9 th percetile score is 197. This idicates that 9% of the U.S. twelfth graders scored 197 or lower o the assessmet test..1 a. From Exercise.35, the proportio of fup/fumic ratios that fall above 1 is.34. The percetile rak of 1 is (1.34)1% 96.6 percetile. b. From Exercise.35, the proportio of fup/fumic ratios that fall below.4 is.695. The percetile rak of.4 is (.695)1% 69.5 th percetile..14 a. xx 3 39 The z-score associated with a score of 3 is z 1.5. This meas s 6 that a score of 3 is 1.5 stadard deviatios below the mea. b. xx The z-score associated with a score of 39 is z. Half or.5 of the s 6 observatios are below a score of Sice the 9th percetile of the study sample i the subdivisio was.37 mg/l, which is less tha the USEPA level of.15 mg/l, the water customers i the subdivisio are ot at risk of drikig water with uhealthy lead levels..18 a. If the distributio is moud-shaped ad symmetric, the the Empirical Rule ca be used. Approximately 68% of the scores will fall withi 1 stadard deviatio of the mea or betwee 53% 15% or betwee 38% ad 68%. Approximately 95% of the scores will fall withi stadard deviatios of the mea or betwee 53% (15%) or betwee 3% ad 83%. Approximately all of the scores will fall withi 3 stadard deviatios of the mea or betwee 53% 3(15%) or betwee 8% ad 98%. b. If the distributio is moud-shaped ad symmetric, the the Empirical Rule ca be used. Approximately 68% of the scores will fall withi 1 stadard deviatio of the mea or betwee 39% 1% or betwee 7% ad 51%. Approximately 95% of the scores will fall withi stadard deviatios of the mea or betwee 39% (1%) or betwee 15% ad 63%. Approximately all of the scores will fall withi 3 stadard deviatios of the mea or betwee 39% 3(1%) or betwee 3% ad 75%. c. Sice the scores o the red exam are shifted to the left of those o the blue exam, a score of % is more likely to occur o the red exam tha o the blue exam..13 Yes. From the graph i Exercise.19 c, we ca see that there are 4 observatios with z- scores greater tha 3. There is the a gap dow to.18. Those 4 observatios are quite differet from the rest of the data. After those 4 observatios, the data are fairly similar. We kow that by rakig the data, we ca reduce the ifluece of outliers. But, by doig this, we lose valuable iformatio..13 The iterquartile rage is the distace betwee the upper ad lower quartiles..134 For a moud-shaped distributio, the Empirical Rule ca be used. Almost all of the observatios will fall withi 3 stadard deviatios of the mea. Thus, almost all of the observatios will have z-scores betwee -3 ad 3. Copyright 17 Pearso Educatio, Ic.

26 Methods for Describig Sets of Data The iterquartile rage is IQR Q Q The lower ier fece Q L 1.5(IQR) 6 1.5(5).5. The upper ier fece Q U 1.5(IQR) 6 1.5(5) 1.5. The lower outer fece Q L 3(IQR) 6 3(5) 15. The upper outer fece Q U 3(IQR) 6 3(5) 16. U L With oly this iformatio, the box plot would look somethig like the followig: The whiskers exted to the ier feces uless o data poits are that small or that large. The upper ier fece is 1.5. However, the largest data poit is 1, so the whisker stops at 1. The lower ier fece is.5. The smallest data poit is 18, so the whisker exteds to.5. Sice 18 is betwee the ier ad outer feces, it is desigated with a *. We do ot kow if there is ay more tha oe data poit below.5, so we caot be sure that the box plot is etirely correct..138 To determie if the measuremets are outliers, compute the z-score. a. b. c. d. xx z.77 s 11 Sice this z-score is less tha 3 i magitude, 65 is ot a outlier. xx 157 z 3.73 s 11 Sice this z-score is more tha 3 i magitude, 1 is a outlier. xx 7 57 z s 11 Sice this z-score is less tha 3 i magitude, 7 is ot a outlier. xx z 3.77 s 11 Sice this z-score is more tha 3 i magitude, 98 is a outlier. Copyright 17 Pearso Educatio, Ic.

27 3 Chapter.14 a. Usig MINITAB, the box plot for data is give below. 1 Boxplot of Data Data b. I this data set, there is oe outlier. It correspods to the value 14. xx a. The z-score is z s 3 b. Yes, we would cosider this measuremet a outlier. Ay observatio with a z-score that has a absolute value greater tha 3 is cosidered a highly suspect outlier. xx The z-score correspodig to 155 is: z 3.5. Because this s observatio is more tha 3 stadard deviatios from the mea, it is cosidered a highly suspect outlier. It would ot be cosidered typical of the study sample..146 a. The approximate 5 th percetile PASI score before treatmet is 1. The approximate media before treatmet is 15. The approximate 75 th percetile PASI score before treatmet is 7.5. b. The approximate 5 th percetile PASI score after treatmet is 3.5. The approximate media after treatmet is 5. The approximate 75 th percetile PASI score after treatmet is 7.5. c. Sice the 75 th percetile after treatmet is lower tha the 5 th percetile before treatmet, it appears that the ichthyotherapy is effective i treatig psoriasis..148 Usig MINITAB, a boxplot of the data is: Boxplot of Rockfall Rockfall Copyright 17 Pearso Educatio, Ic.

28 Methods for Describig Sets of Data 33 From the boxplot, there is o idicatio that there are ay outliers. We will ow use the z-score criterio for determiig outliers. From Exercises.61 ad.88, x 9.7 ad s The z-score associated with the miimum value is xx z 1.18 ad the z-score associated with the maximum value is s 4.95 xx z Neither of these idicates there are ay outliers. s a. Usig MINITAB, the boxplots of the three groups are: 18 Boxplot of Hoey, DM, Cotrol Data Hoey DM Cotrol b. The media improvemet score for the hoey dosage group is larger tha the media improvemet scores for the other two groups. The media improvemet score for the DM dosage group is higher tha the media improvemet score for the cotrol group. c. Because the iterquartile rage for the DM dosage group is larger tha the iterquartile rages of the other groups, the variability of the DM group is largest. The variability of the hoey dosage group ad the cotrol group appear to be about the same. d. There appears to be oe outlier i the hoey dosage group ad oe outlier i the cotrol group..15 a. x 47 z 3 1 b. The z-score is low eough to suspect that the libraria's claim is icorrect. Eve without ay kowledge of the shape of the distributio, Chebyshev's rule states that at least 8/9 of the measuremets will fall withi 3 stadard deviatios of the mea (ad, cosequetly, at most 1/9 will be above z 3 or below z 3 ). c. The Empirical Rule states that almost oe of the measuremets should be above z 3 or below z 3. Hece, the libraria's claim is eve more ulikely. d. Whe, x 47 z 1.5 Copyright 17 Pearso Educatio, Ic.

29 34 Chapter This is ot a ulikely occurrece, whether or ot the data are moud-shaped. Hece, we would ot have reaso to doubt the libraria's claim..154 A bivariate relatioship is a relatioship betwee quatitative variables..156 A positive associatio betwee two variables meas that as oe variable icreases, the other variable teds to also icrease. A egative associatio betwee two variables meas that as oe variable icreases, the other variable teds to decrease..158 Usig MINITAB, the scatterplot is as follows: 18 Scatterplot of Variable vs Variable Variable Variable It appears that as variable 1 icreases, variable also icreases..16 Usig MINITAB, a scatter plot of the data is: Scatterplot of SLUGPCT vs ELEVATION SLUGPCT ELEVATION If oe uses the oe obvious outlier (Dever), the there does appear to be a tred i the data. As the elevatio icreases, the sluggig percetage teds to icrease. However, if the outlier is removed, the it does ot look like there is a tred to the data. Copyright 17 Pearso Educatio, Ic.

30 Methods for Describig Sets of Data a. A scattergram of the data is: 9 Scatterplot of Strikes vs Age Strikes Age b. There appears to be a tred. As the age icreases, the umber of strikes teds to decrease..164 Usig MINITAB, a scatterplot of the data is: 7 Scatterplot of Freq vs Resoace 6 5 Freq Resoace 15 5 There is a icreasig tred ad there is very little variatio i the plot. This supports the researcher s theory..166 a. Usig MINITAB, a graph of the Athropogeic Idex agaist the Natural Origi Idex is: 9 Scatterplot of F-Athro vs F-Natural F-Athro F-Natural Copyright 17 Pearso Educatio, Ic.

31 36 Chapter This graph does ot support the theory that there is a straight-lie relatioship betwee the Athropogeic Idex agaist the Natural Origi Idex. There are several poits that do ot lie o a straight lie. b. After deletig the three forests with the largest athropogeic idices, the graph of the data is: Scatterplot of F-Athro vs F-Natural F-Athro F-Natural After deletig the 3 data poits, the relatioship betwee the Athropogeic Idex agaist the Natural Origi Idex is much closer to a straight lie..168 Usig MINITAB, a scattergram of the data is: 7 Scatterplot of Mass vs Time Mass Time Yes, there appears to be a egative tred i this data. As time icreases, the mass teds to decrease. There appears to be a curviliear relatioship. As time icreases, mass decreases at a decreasig rate..17 Oe way the bar graph ca mislead the viewer is that the vertical axis has bee cut off. Istead of startig at, the vertical axis starts at 1. Aother way the bar graph ca mislead the viewer is that as the bars get taller, the widths of the bars also icrease..17 a. This graph is misleadig because it looks like as the days are icreasig, the umber of barrels collected per day is also icreasig. However, the bars are the cumulative umber of barrels collected. The cumulative value ca ever decrease. Copyright 17 Pearso Educatio, Ic.

32 Methods for Describig Sets of Data 37 b. Usig MINITAB, the graph of the daily collectio of oil is: Chart of Barrells 5 Barrells May-16 May-17 May-18 May-19 May- Day May-1 May- May-3 From this graph, it shows that there has ot bee a steady improvemet i the suctioig process. There was a icrease for 3 days, the a levelig off for 3 days, the a decrease..174 The rage ca be greatly affected by extreme measures, while the stadard deviatio is ot as affected..176 The z-score approach for detectig outliers is based o the distributio beig fairly moudshaped. If the data are ot moud-shaped, the the box plot would be preferred over the z- score method for detectig outliers..178 Oe techique for distortig iformatio o a graph is by stretchig the vertical axis by startig the vertical axis somewhere above..18 From part a of Exercise.179, the 3 z-scores are 1, 1 ad. Sice oe of these z-scores are greater tha i absolute value, oe of them are outliers. From part b of Exercise.179, the 3 z-scores are, ad 4. There is oly oe z-score greater tha i absolute value. The score of 8 (associated with the z-score of 4) would be a outlier. Very few observatios are as far away from the mea as 4 stadard deviatios. From part c of Exercise.179, the 3 z-scores are 1, 3, ad 4. Two of these z-scores are greater tha i absolute value. The scores associated with the two z-scores 3 ad 4 (7 ad 8) would be cosidered outliers. From part d of Exercise.179, the 3 z-scores are.1,.3, ad.4. Sice oe of these z-scores are greater tha i absolute value, oe of them are outliers..18 rage / 4 / a. x x x x 6 5 Copyright 17 Pearso Educatio, Ic.

33 38 Chapter s x 3 x b. x x x 5 x s x 5 x c. x x x 49 x 7 7 s x 49 x s s s d. x x x 1 x 3 4 s x 1 x s e. a) x s 6 (5.) ( 4.4, 16.4) All or 1% of the observatios are i this iterval. b) x s 6.5 (5.3) ( 4.39, 16.89) All or 1% of the observatios are i this iterval. c) x s 7 (6.14) ( 5.8, 19.8) All or 1% of the observatios are i this iterval. d) x s 3 () 3 (3,3) All or 1% of the observatios are i this iterval. Copyright 17 Pearso Educatio, Ic.

34 Methods for Describig Sets of Data Suppose we costruct a relative frequecy bar chart for this data. This will allow the archaeologists to compare the differet categories easier. First, we must compute the relative frequecies for the categories. These are foud by dividig the frequecies i each category by the total 837. For the burished category, the relative frequecy is133 / The rest of the relative frequecies are foud i a similar fashio ad are listed i the table. Pot Category Number Foud Computatio Relative Frequecy Burished / Moochrome / Slipped / Curviliear Decoratio / Geometric Decoratio / Naturalistic Decoratio 4 4 / Cycladic White clay 4 4 / Coical cup clay / 837. Total A relative frequecy bar chart is:.6 Chart of Pot Category.5 Relative Frequecy Burished Moochrome Slipped Curviliear Geometric Pot Category Naturalistic Cycladic Coical Proportio withi all data. The most frequetly foud type of pot was the Moochrome. Of all the pots foud, 55% were Moochrome. The ext most frequetly foud type of pot was the Paited i Geometric Decoratio. Of all the pots foud, 19.7% were of this type. Very few pots of the types Paited i aturalistic decoratio, Cycladic white clay, ad Coical cup clay were foud. Copyright 17 Pearso Educatio, Ic.

35 4 Chapter.188 a. Usig MINITAB, the stem-ad-leaf display is as follows. Character Stem-ad-Leaf Display Stem-ad-leaf of Books N = 14 Leaf Uit = (3) b. The leaves that correspod to studets who eared a A grade are highlighted i the graph above. Those studets who eared A s teded to read the most books. xi i c. The mea is x This is the average umber of books read per studet. To fid the media, the data must be arraged i order. I this problem, the data are already arraged i order. There are a total of 14 observatios, which is a eve umber. The media is the average of the middle umbers which are 3 ad 34. The media is Half of the studets read more tha 3 books ad half read fewer. The mode is the observatio appearig the most. I this data set, there are two modes - 34 ad 4 because each appears times i the data set. The most frequet umber of books read is either 34 or 4. d. Sice the mea ad the media are almost the same, the distributio of the data set is approximately symmetric. This ca be verified by the stem-ad-leaf display i part a. e. For those studets who eared A, the mea is xi i x The variace is s x 96 x 11, The stadard deviatio is s s Copyright 17 Pearso Educatio, Ic.

36 Methods for Describig Sets of Data 41 f. For those studets who eared a B or C, the mea is xi i x The variace is s x 147 x 3, The stadard deviatio is s s g. The studets who received A s have a more variable distributio of the umber of books read. The variace ad stadard deviatio for this group are greater tha the correspodig values for the B-C group. xx 4 37 h. The z-score for a score of 4 books is z.345. Thus, someoe who s 8.71 read 4 books read more tha the average umber of books, but that umber is ot very uusual. xx i. The z-score for a score of 4 books is z 1.8. Thus, someoe who s 8.56 read 4 books read may more tha the average umber of books. Very few studets who received a B or a C read more tha 4 books. j. The group of studets who eared A s is more likely to have read 4 books. For this group, the z-score correspodig to 4 books is.34. This is ot uusual. For the B-C group, the z-score correspodig to 4 books is 1.8. This is close to stadard deviatios from the mea. This would be fairly uusual..19 A pie chart of the data is: Pie Chart of Drive Star % 4.1% % Category % More tha half of the cars received 4 star ratigs (6.%). A little less tha a quarter of the cars tested received ratigs of 3 stars or less. Copyright 17 Pearso Educatio, Ic.

37 4 Chapter.19 a. Usig MINITAB, the descriptive statistics are: Descriptive Statistics: Ratio Variable N Mea StDev Miimum Maximum Ratio xx The z-score associated with the largest ratio is z.45 s.634 xx The z-score associated with the smallest ratio is z 1.98 s.634 xx The z-score associated with the mea ratio is z s.634 b. Yes, I would cosider the z-score associated with the largest ratio to be uusually large. We kow if the data are approximately moud-shaped that approximately 95% of the observatios will be withi stadard deviatios of the mea. A z-score of.45 would idicate that less tha.5% of all the measuremets will be larger tha this value. c. Usig MINITAB, the box plot is: Boxplot of Ratio Ratio From this box plot, there are o observatios marked as outliers..194 a. Usig MINITAB, a histogram of the data is: 1 Histogram of ph 1 8 Percet ph Copyright 17 Pearso Educatio, Ic.

38 Methods for Describig Sets of Data 43 From the graph, it looks like the proportio of wells with ph levels less tha 7. is: b. Usig MINITAB, a histogram of the MTBE levels for those wells with detectible levels is: 8 Histogram of MTBE-Level MTBE-Detect = Detect 7 6 Percet MTBE-Level 4 5 From the graph, it looks like the proportio of wells with MTBE levels greater tha 5 is: c. The sample mea is: x i1 x The variace is: i x i , , x 1, s The stadard deviatio is: s s x s 7.47 (.8169) (5.793, 9.68). From the histogram i part a, the data look approximately moud-shaped. From the Empirical Rule, we would expect about 95% of the wells to fall i this rage. I fact, 1 of 3 or 95.1% of the wells have ph levels betwee ad d. The sample mea of the wells with detectible levels of MTBE is: xi i x Copyright 17 Pearso Educatio, Ic.

39 44 Chapter The variace is: s x xi The stadard deviatio is: s s x s (8.756) ( 14.6,.94). From the histogram i part b, the data do ot look moud-shaped. From Chebyshev s Rule, we would expect at least ¾ or 75% of the wells to fall i this rage. I fact, 67 of 7 or 95.7% of the wells have MTBE levels betwee ad a. Usig MINITAB, the dot plot for the 9 measuremets is: Dotplot of Cesium Cesium b. Usig MINITAB, the stem-ad-leaf display is: Character Stem-ad-Leaf Display Stem-ad-leaf of Cesium N = 9 Leaf Uit = (3) Copyright 17 Pearso Educatio, Ic.

40 Methods for Describig Sets of Data 45 c. Usig MINITAB, the histogram is: Histogram of Cesium. 1.5 Frequecy Cesium d. The stem-ad-leaf display appears to be more iformative tha the other graphs. Sice there are oly 9 observatios, the histogram ad dot plot have very few observatios per category. e. There are 4 observatios with radioactivity level of -5. or lower. The proportio of measuremets with a radioactivity level of -5. or lower is 4/ a. From the histogram, the data do ot follow the true moud-shape very well. The itervals i the middle are much higher tha they should be. I additio, there are some extremely large velocities ad some extremely small velocities. Because the data do ot follow a moud-shaped distributio, the Empirical Rule would ot be appropriate. b. Usig Chebyshev's rule, at least11/4 11/1615/16or 93.8% of the velocities will fall withi 4 stadard deviatios of the mea. This iterval is: x 4s 7,117 4(1,8) 7,117 5,1 (1,997, 3,37) At least 93.75% of the velocities will fall betwee 1,997 ad 3,37 km per secod. c. Sice the data look approximately symmetric, the mea would be a good estimate for the velocity of galaxy cluster A14. Thus, this estimate would be 7,117 km per secod.. a. Usig MINITAB, the boxplot is: 35 Boxplot of Freckle 3 5 Freckle Copyright 17 Pearso Educatio, Ic.

Median and IQR The median is the value which divides the ordered data values in half.

Median and IQR The median is the value which divides the ordered data values in half. STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media

More information

2: Describing Data with Numerical Measures

2: Describing Data with Numerical Measures : Describig Data with Numerical Measures. a The dotplot show below plots the five measuremets alog the horizotal axis. Sice there are two s, the correspodig dots are placed oe above the other. The approximate

More information

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements.

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements. CHAPTER 2 umerical Measures Graphical method may ot always be sufficiet for describig data. You ca use the data to calculate a set of umbers that will covey a good metal picture of the frequecy distributio.

More information

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all! ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Solutios Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced

More information

Data Description. Measure of Central Tendency. Data Description. Chapter x i

Data Description. Measure of Central Tendency. Data Description. Chapter x i Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3

More information

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying. Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example

More information

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls Ecoomics 250 Assigmet 1 Suggested Aswers 1. We have the followig data set o the legths (i miutes) of a sample of log-distace phoe calls 1 20 10 20 13 23 3 7 18 7 4 5 15 7 29 10 18 10 10 23 4 12 8 6 (1)

More information

Elementary Statistics

Elementary Statistics Elemetary Statistics M. Ghamsary, Ph.D. Sprig 004 Chap 0 Descriptive Statistics Raw Data: Whe data are collected i origial form, they are called raw data. The followig are the scores o the first test of

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}. 1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

1 Lesson 6: Measure of Variation

1 Lesson 6: Measure of Variation 1 Lesso 6: Measure of Variatio 1.1 The rage As we have see, there are several viable coteders for the best measure of the cetral tedecy of data. The mea, the mode ad the media each have certai advatages

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2.

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2. Lesso 3- Lesso 3- Scale Chages of Data Vocabulary scale chage of a data set scale factor scale image BIG IDEA Multiplyig every umber i a data set by k multiplies all measures of ceter ad the stadard deviatio

More information

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the

More information

(# x) 2 n. (" x) 2 = 30 2 = 900. = sum. " x 2 = =174. " x. Chapter 12. Quick math overview. #(x " x ) 2 = # x 2 "

(# x) 2 n. ( x) 2 = 30 2 = 900. = sum.  x 2 = =174.  x. Chapter 12. Quick math overview. #(x  x ) 2 = # x 2 Chapter 12 Describig Distributios with Numbers Chapter 12 1 Quick math overview = sum These expressios are algebraically equivalet #(x " x ) 2 = # x 2 " (# x) 2 Examples x :{ 2,3,5,6,6,8 } " x = 2 + 3+

More information

Chapter 4 - Summarizing Numerical Data

Chapter 4 - Summarizing Numerical Data Chapter 4 - Summarizig Numerical Data 15.075 Cythia Rudi Here are some ways we ca summarize data umerically. Sample Mea: i=1 x i x :=. Note: i this class we will work with both the populatio mea µ ad the

More information

Chapter 1 (Definitions)

Chapter 1 (Definitions) FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple

More information

Measures of Spread: Variance and Standard Deviation

Measures of Spread: Variance and Standard Deviation Lesso 1-6 Measures of Spread: Variace ad Stadard Deviatio BIG IDEA Variace ad stadard deviatio deped o the mea of a set of umbers. Calculatig these measures of spread depeds o whether the set is a sample

More information

STP 226 EXAMPLE EXAM #1

STP 226 EXAMPLE EXAM #1 STP 226 EXAMPLE EXAM #1 Istructor: Hoor Statemet: I have either give or received iformatio regardig this exam, ad I will ot do so util all exams have bee graded ad retured. PRINTED NAME: Siged Date: DIRECTIONS:

More information

HUMBEHV 3HB3 Measures of Central Tendency & Variability Week 2

HUMBEHV 3HB3 Measures of Central Tendency & Variability Week 2 Describig Data Distributios HUMBEHV 3HB3 Measures of Cetral Tedecy & Variability Week 2 Prof. Patrick Beett Ofte we wish to summarize distributios of data, rather tha showig histograms Two basic descriptios

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

Solution Manual for A First Course in Statistics 11th edition by McClave and Sincich

Solution Manual for A First Course in Statistics 11th edition by McClave and Sincich Solution Manual for A First Course in Statistics 11th edition by McClave and Sincich Link download: https://digitalcontentmarket.org/download/solution-manual-for-a-firstcourse-in-statistics-11th-edition-by-mcclave-and-sincich/

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Analysis of Experimental Data

Analysis of Experimental Data Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

4.3 Growth Rates of Solutions to Recurrences

4.3 Growth Rates of Solutions to Recurrences 4.3. GROWTH RATES OF SOLUTIONS TO RECURRENCES 81 4.3 Growth Rates of Solutios to Recurreces 4.3.1 Divide ad Coquer Algorithms Oe of the most basic ad powerful algorithmic techiques is divide ad coquer.

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

Summarizing Data. Major Properties of Numerical Data

Summarizing Data. Major Properties of Numerical Data Summarizig Data Daiel A. Meascé, Ph.D. Dept of Computer Sciece George Maso Uiversity Major Properties of Numerical Data Cetral Tedecy: arithmetic mea, geometric mea, media, mode. Variability: rage, iterquartile

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

Census. Mean. µ = x 1 + x x n n

Census. Mean. µ = x 1 + x x n n MATH 183 Basic Statistics Dr. Neal, WKU Let! be a populatio uder cosideratio ad let X be a specific measuremet that we are aalyzig. For example,! = All U.S. households ad X = Number of childre (uder age

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

Measures of Variation

Measures of Variation Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio 97814669676 01 Copyright Property of Kedall Hut Publishig CHAPTER Measures of Variatio Key

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

(6) Fundamental Sampling Distribution and Data Discription

(6) Fundamental Sampling Distribution and Data Discription 34 Stat Lecture Notes (6) Fudametal Samplig Distributio ad Data Discriptio ( Book*: Chapter 8,pg5) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye 8.1 Radom Samplig: Populatio:

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Measures of Spread: Standard Deviation

Measures of Spread: Standard Deviation Measures of Spread: Stadard Deviatio So far i our study of umerical measures used to describe data sets, we have focused o the mea ad the media. These measures of ceter tell us the most typical value of

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

The Pendulum. Purpose

The Pendulum. Purpose The Pedulum Purpose To carry out a example illustratig how physics approaches ad solves problems. The example used here is to explore the differet factors that determie the period of motio of a pedulum.

More information

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6) STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated

More information

Methods for Describing Sets of Data Chapter 2

Methods for Describing Sets of Data Chapter 2 Methods for Describig Sets of Data Chapter. a. To fid the frequecy for each class, cout the umber of times each letter occurs. The frequecies for the three classes are: Class Frequecy X 8 Y 9 Z 3 Total

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

4.1 Sigma Notation and Riemann Sums

4.1 Sigma Notation and Riemann Sums 0 the itegral. Sigma Notatio ad Riema Sums Oe strategy for calculatig the area of a regio is to cut the regio ito simple shapes, calculate the area of each simple shape, ad the add these smaller areas

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State Eco411 Lab: Cetral Limit Theorem, Normal Distributio, ad Jourey to Girl State 1. Some studets may woder why the magic umber 1.96 or 2 (called critical values) is so importat i statistics. Where do they

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Statistical Fundamentals and Control Charts

Statistical Fundamentals and Control Charts Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,

More information

Lecture 24 Floods and flood frequency

Lecture 24 Floods and flood frequency Lecture 4 Floods ad flood frequecy Oe of the thigs we wat to kow most about rivers is what s the probability that a flood of size will happe this year? I 100 years? There are two ways to do this empirically,

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Chapter 6. Sampling and Estimation

Chapter 6. Sampling and Estimation Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset

More information

Questions about the Assignment. Describing Data: Distributions and Relationships. Measures of Spread Standard Deviation. One Quantitative Variable

Questions about the Assignment. Describing Data: Distributions and Relationships. Measures of Spread Standard Deviation. One Quantitative Variable Quetio about the Aigmet Read the quetio ad awer the quetio that are aked Experimet elimiate cofoudig variable Decribig Data: Ditributio ad Relatiohip GSS people attitude veru their characteritic ad poue

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i : Error Error & Ucertaity The error is the differece betwee a TRUE value,, ad a MEASURED value, i : E = i There is o error-free measuremet. The sigificace of a measuremet caot be judged uless the associate

More information

BUSINESS STATISTICS (PART-9) AVERAGE OR MEASURES OF CENTRAL TENDENCY: THE GEOMETRIC AND HARMONIC MEANS

BUSINESS STATISTICS (PART-9) AVERAGE OR MEASURES OF CENTRAL TENDENCY: THE GEOMETRIC AND HARMONIC MEANS BUSINESS STATISTICS (PART-9) AVERAGE OR MEASURES OF CENTRAL TENDENCY: THE GEOMETRIC AND HARMONIC MEANS. INTRODUCTION We have so far discussed three measures of cetral tedecy, viz. The Arithmetic Mea, Media

More information

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Activity 3: Length Measurements with the Four-Sided Meter Stick

Activity 3: Length Measurements with the Four-Sided Meter Stick Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Itroductio to Statistics Misleadig Iformatio: Surveys ad advertisig claims ca be biased by urepresetative samples, biased questios, iappropriate

More information

Introduction There are two really interesting things to do in statistics.

Introduction There are two really interesting things to do in statistics. ECON 497 Lecture Notes E Page 1 of 1 Metropolita State Uiversity ECON 497: Research ad Forecastig Lecture Notes E: Samplig Distributios Itroductio There are two really iterestig thigs to do i statistics.

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

The Poisson Distribution

The Poisson Distribution MATH 382 The Poisso Distributio Dr. Neal, WKU Oe of the importat distributios i probabilistic modelig is the Poisso Process X t that couts the umber of occurreces over a period of t uits of time. This

More information

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Induction: Solutions

Induction: Solutions Writig Proofs Misha Lavrov Iductio: Solutios Wester PA ARML Practice March 6, 206. Prove that a 2 2 chessboard with ay oe square removed ca always be covered by shaped tiles. Solutio : We iduct o. For

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

10.2 Infinite Series Contemporary Calculus 1

10.2 Infinite Series Contemporary Calculus 1 10. Ifiite Series Cotemporary Calculus 1 10. INFINITE SERIES Our goal i this sectio is to add together the umbers i a sequece. Sice it would take a very log time to add together the ifiite umber of umbers,

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam 4 will cover.-., 0. ad 0.. Note that eve though. was tested i exam, questios from that sectios may also be o this exam. For practice problems o., refer to the last review. This

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

Formulas and Tables for Gerstman

Formulas and Tables for Gerstman Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary

More information

Homework 5 Solutions

Homework 5 Solutions Homework 5 Solutios p329 # 12 No. To estimate the chace you eed the expected value ad stadard error. To do get the expected value you eed the average of the box ad to get the stadard error you eed the

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

ARITHMETIC PROGRESSIONS

ARITHMETIC PROGRESSIONS CHAPTER 5 ARITHMETIC PROGRESSIONS (A) Mai Cocepts ad Results A arithmetic progressio (AP) is a list of umbers i which each term is obtaied by addig a fixed umber d to the precedig term, except the first

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information