Week 1 Basic Statistical Concepts, Part I

Size: px
Start display at page:

Download "Week 1 Basic Statistical Concepts, Part I"

Transcription

1 Week 1 Basic Statistical Concepts, Part I

2 Week 1 Objectives This week we will give an introduction to the statistical package R, and to statistics. R will be used throughout this class and new features will be introduced as needed. The introduction here will cover arithmetic operations, generating sequences of numbers and storing them in data objects, performing operations on data objects, sampling and importing data sets. After motivating the need for statistics in engineering and scientific research, we will introduce fundamental notions that forms the foundation for the material in later weeks. In particular we will introduce the notions of population, census, sampling, and sampling variability, and will define such basic quantities as the proportion, mean, variance and percentiles.

3 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

4 R is a free download statistical software. In addition to its statistical functions and graphic capabilities, R is a powerful programming language with similar structure as MATLAB. We will mainly use the R statistical functions and graphics. To find out about R go to To download R go to For a more detailed introduction see r-project.org/doc/manuals/r-intro.pdf.

5 Commands can be given in separate lines or in the same line. If in the same line they need to be separated by ; Use # for inserting comments to be ignored by R. See below for examples. R as a calculator: 2*4; 2**3; (10+11)/7; sqrt(16); sin(pi); cos(2*pi); log(exp(1)); log(10,base=10) # Angles for trigonometric functions should be given in radians, not degrees Generate sequences of numbers: 1:10; seq(1,10); seq(1,10,1); seq(2,10, 2) rep(1,5); rep(seq(1,4),2); rep(1:4,2); rep( a,5) Combine sequences: c(2,7,4,6); c(rep(0,5),rep(1,7))

6 Store sequences of numbers in objects : x=c(rep(0,5),rep(1,7)); y=seq(2,10,2) x; y x[2]; x[2:5]; y[3]; x[2]*y[3] Arithmetic on objects: sum(x); mean(x); sum(x)/length(x) x/2; 2*x; x**2; sqrt(x); x+10 c(1,2,3)*c(4,5,6); sum(c(1,2,3)*c(4,5,6)) sort(c(2,7,4,6)) Sampling from an object x: sample(x,n) # for n<length(x) gives a simple random sample of size n from x sample(x,n,replace = T) # gives a sample of size n with replacement from x Try also: x=c( Alex, B., Elela, H., Chris, A., John, D., Tania, E. ); sample(x,3); sample(x,3,replace=t)

7 Define functions: f=function(x){x**2}. Try f(2); f(c(2,3)) Integrate a function: integrate(f, 0, 3). Try also g=function(x){x**(-2)}; integrate(g, 1, Inf) Import and access data: The data we will be using are available in http: //media.pearsoncmg.com/cmg/pmmg_mml_shared/ mathstatsresources/akritas/datasets.html Go to the above site, click on BearsData.txt and copy the url. The following command imports this data set into the R data frame br : br = read.table( paste the BearsData.txt url here, header =T). That is, type: br = read.table( mml shared/ mathstatsresources/akritas/bearsdata.txt, header =T)

8 The object br is a data frame br[1:3,] # shows the first three lines of the data frame br$head.l[1:3] attach(br); Head.L[1:3]; detach(br) Controlling decimal places: options(digits = 3); 2/3 Removing all objects from memory: rm(list=ls(all=true)) # R still remembers the commands you have issued

9 Example (Examples of Engineering/Scientific Studies) Comparing the compressive strength of two or more cement mixtures. Comparing the effectiveness of three cleaning products in removing four different types of stains. Predicting failure time on the basis of stress applied. Assessing the effectiveness of a new traffic regulatory measure in reducing the weekly rate of accidents. Testing a manufacturer s claim regarding a product s quality. Studying the relation between salary increases and employee productivity in a large corporation.

10 These studies require Statistics due to the intrinsic variability: The compressive strength of different preparations of the same cement mixture will differ. The figure in HistComprStrCement.pdf shows 32 compressive strength measurements (MegaPascal units), of test cylinders (6 in. diameter, 12 in. high), using water/cement ratio of 0.4, measured on the 28th day after they are made. Under the same stress, two beams fail at different times. The proportion of defective items of a certain product will differ from batch to batch. Intrinsic variability renders the objectives of the case studies, as stated, ambiguous.

11 The objectives of the case studies can be made precise if stated in terms of averages or means. Comparing the average hardness of two different cement mixtures. Predicting the average failure time on the basis of stress applied. Estimation of the average coefficient of thermal expansion. Estimation of the average proportion of defective items. Moreover, because of variability, the words average and mean have a technical meaning which can be made clear through the concepts of population and sample.

12 Definition Population is a well-defined collection of objects or subjects, of relevance to a particular study, which are exposed to the same treatment or method. Population members are called units. The objective of a study is to investigate certain characteristic(s) of the units of the population(s) of interest.

13 Example (Populations and Unit Characteristics) All water samples taken from a lake. Characteristics: Mercury concentration; Concentration of other pollutants. All items of a certain manufactured product (that have, or will be produced). Characteristic: Proportion of defective items. All students enrolled in Big Ten universities during the academic year. Characteristics: Favorite type of music; Political affiliation. Two types of cleaning products. Characteristic: cleaning effectiveness.

14 Populations consisting of the same type of units but differ in the treatment, or method, applied to them are called treatment populations. Example (Treatment Populations) The concentration of pollutants in water samples is analyzed by two different labs. Water samples sent to Lab 1 constitute population 1, and those sent to Lab 2 constitute population 2. The time to failure of beams is studied under different stress conditions. The beams subjected to each stress condition constitute different populations.

15 Census and Samples Full (i.e., population-level) understanding of a characteristic can only be achieved by examining all population units. This is called census. However, taking a census can be time consuming and expensive: The 2000 U.S. Census costed $6.5 billion, while the 2010 Census costed $13 billion. Moreover, census is not feasible if the population is hypothetical or conceptual, i.e., not all members are available for examination. Because of the above, we typically settle for examining all units in a sample, which is a subset of the population.

16 Due to the intrinsic variability, the sample properties/attributes of the characteristic of interest will differ from the population-level properties/attributes. For example: The average mercury concentration in 25 water samples will differ from the overall mercury concentration in the lake. The proportion in a sample of 100 PSU students who favor expanding the use of solar energy will differ from the corresponding proportion of all PSU students. The relation between bear s chest girth and weight in a sample of 10 bears, will differ from the corresponding relation in the entire population of 50 bears in a forested region.

17 The GOOD NEWS is that, if the sample is suitably drawn, then sample properties approximate the population properties.

18 Weight Chest Girth Figure: Population and sample relationships between chest girth and weight of black bears.

19 Sampling Variability Sample properties of the characteristic of interest also differ from sample to sample. For example: 1 The number of US citizens, in a sample of size 20, who favor expanding solar energy, will (most likely) be different from the corresponding number in a different sample of 20 US citizens. 2 The average mercury concentration in two sets of 25 water samples drawn from a lake will differ. The term sampling variability is used to describe such differences in the properties of the characteristic of interest from sample to sample.

20 Weight Chest Girth Figure: Illustration of Sampling Variability.

21 Parameters and Statistics Population level properties/attributes of characteristic(s) of interest are called (population) parameters. Examples of parameters include averages, proportions, percentiles, and the correlation coefficient. The corresponding sample properties/attributes of characteristics are called statistics. Sample statistics approximate the corresponding population parameters but are not equal to them. Statistical inference deals with the uncertainty issues which arise in approximating parameters by statistics. The tools of statistical inference include point and interval estimation, hypothesis testing and prediction.

22 Outline Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

23 For valid statistical inference the sample must be representative of the population. For example, a sample of PSU basketball players is not representative of PSU students, if the characteristic of interest is height. Typically it is hard to tell whether a sample is representative of the population. So, we define a sample to be representative if... (cyclical definition!!) it allows for valid statistical inference. The only guarantee for that comes from the sampling method, i.e., the method used to collect the sample. The good news is that there are several sampling methods which guarantee representativeness.

24 Definition A sample of size n is a simple random sample (s.r.s.) if the selection process ensures that every sample of size n has equal chance of being selected. In simple random sampling every member of the population has the same chance of being included in the sample. Example A class consists of 35 male and 15 female students. To obtain a sample of 2, one flips a coin and if H selects a female, if T selects a male student. Does every student have the same chance of being included in the sample? Is this s.r.s?

25 The fact that every member of the population has the same chance of being included in the sample does not, by itself, imply that the sample is s.r.s. Example To select a sample of 2 students from a population of 20 male and 20 female students, one selects at random one male and one female student. Does every student have the same chance of being included in the sample? Is this s.r.s?

26 Another sampling method for obtaining a representative sample is called stratified sampling. Definition A stratified sample consists of simple random samples from each of a number of groups (which are non-overlapping and make up the entire population) called strata. Examples of strata include: ethnic groups, age groups, and production facilities.

27 A common method of choosing the within-strata sample sizes assures that the sample representation of each stratum equals to its population representation (proportional allocation). The stratified sample in the last example used proportional allocation. If the units in the different strata differ in terms of the characteristic under study, stratified sampling is preferable to s.r.s. If different production facilities differ in terms of the proportion of defective products, a stratified sample is preferable for estimating the overall proportion of defective items produced.

28 Conceptually, selecting a s.r.s. of size n from a population of N units corresponds to the following steps: STEP 1: Assign to each unit a number from 1 to N. STEP 2: Write each number on a slips of paper, place the N slips of paper in an urn, and shuffle them. STEP 3: Select n slips of paper at random, one at a time and without replacement. Because of STEP 3, simple random sampling is also referred to as sampling without replacement.

29 If the selection of slips of paper in STEP 3 of the previous slide is done with replacement, we say that we have sampled with replacement. For example, rolling a die 10 times gives a sample with replacement from the numbers 1, 2,..., 6. Sampling with replacement is easier to work with from a mathematical point of view. When the population is very large, sampling with and without replacement are practically equivalent.

30 Non-representative Sampling Non-representative samples arise whenever there is selection bias. That is, the sampling process excludes, or leaves under-represented, parts of the population. Two examples of non-representative samples are self-selected and convenience samples. Self-selected samples often occur in opinion surveys or questionnaires. For example, in political surveys, those who feel that things are running smoothly, or without strong feelings, are less likely to respond than activists.

31 Convenience samples are made up from the most easily accessed units. For example, randomly selecting students from your classes will not result in a sample that is representative of all PSU students since your classes are mostly comprised of students with the same major as you. The Literary Digest poll of 1936 is often used to illustrate the misleading potential of self-selected and convenience samples.

32 Example (The Literary Digest poll of 1936) The magazine had been extremely successful in predicting the results in US presidential elections, but in 1936 it predicted a 3-to-2 victory for Republican Alfred Landon over the Democratic incumbent Franklin Delano Roosevelt. This prediction was based on 2.3 million responses to a questionnaire sent to 10 million voters randomly selected from phone books. On the other hand Gallup correctly predicted the outcome of that election by surveying only 50,000 people.

33 Outline Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

34 Variable = a Numerical Characteristic Numerical characteristics such as length, mercury concentration, or number of accidents are called quantitative. Non-quantitative characteristics are called qualitative or categorical. For example, gender, make of car, eye color, political affiliation, or strength of opinion. For statistical purposes, the categories in qualitative characteristics are labeled with numbers, e.g., male = 1, female = +1. For characteristics such as strength of opinion, the chosen numbers should reflect the implicit ordering. A characteristic expressed as a number is called a variable.

35 Types of Variables Quantitative variables expressing measurements on a continuous scale are called continuous. Measurements of length, strength, weight, or time to failure are examples of continuous variables. Qualitative variables, as well as quantitative variables expressing counts, e.g., number of defective items in a batch, are called discrete. When two or more characteristics are measured on each population unit, we have bivariate or multivariate variables. Example of bivariate: Salary increase and productivity. Example of multivariate: Age, income, education level.

36 When a unit is randomly sampled from a population, the value of its variable will be denoted by X (or Y, or Z, etc). Because of the intrinsic variability, X is not known a-priori and thus it is called a random variable (r.v.). The population from which a random variable is drawn is called the underlying population of the r.v. The collection of of the variable values of all population units is called the statistical population. The statistical population of a r.v. should not be confused with the set of values a variable can take.

37 Example (Examples of Statistical Populations) 1 A list of the weight of every PSU student is the statistical population of the r.v. weight. 2 A list of 1s and 0s representing every student s opinion on whether or not solar energy should be expanded is the statistical population of the r.v. expressing opinion on solar energy.

38 Sampling from the Statistical Population It should be intuitively clear that taking a sample of n units form some population and recording the variable of each sampled unit, is equivalent to taking a sample of n units from the statistical population of the random variable. Henceforth, the word sample will be mainly used to denote a sample from the statistical population. Thus, samples will be thought of as a collection of numbers. Moreover, 1 The numbers are not known a-priori, so they are r.v. s. 2 A sample of size n will be denoted by X 1, X 2,..., X n. 3 Once the sample values are recorded they will be denoted by x 1, x 2,..., x n.

39 Outline 1 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

40 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Proportions are relevant whenever the variable of interest is categorical, or has been categorized. Definition 1 If the population has N units, and N i units are in category i, then the population proportion for category i, is p i = #{population units of category i} #{population units} = N i N. 2 If a sample of size n is taken, and n i sample units are in category i, then the sample proportion for category i, is p i = #{sample units of category i} #{sample units} = n i n.

41 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Example 1 In a sample of 1000 adults, 72% favor tougher penalties for drunk driving. Is the correct notation for 0.72 p or p? 2 In a population of 80 engineering majors taking a required statistics class, 40 are enthusiastic about having computer labs. In a s.r. sample of 20 from these students 8 are enthusiastic. What is the correct notation for 40/80 = 0.5 and for 8/20 = 2/5? Always remember that, under s.r. sampling, p approximates, but in general is different from p.

42 Outline 1 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

43 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Consider a population of N units, and let v 1, v 2,..., v N denote the statistical population corresponding to some variable. Then the population average or population mean, denoted by µ, is the arithmetic average of all values in the statistical population. Thus, µ = 1 N N v i. i=1 If the random variable X denotes the value of the variable of a randomly selected population unit, then the population mean is also called expected value of X, or mean value of X, and is denoted by µ X or E(X).

44 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Example In a population of 500 tin plates, the number of plates with 0, 1 and 2 scratches is N 0 = 190, N 1 = 160 and N 2 = 150. Thus, in the statistical population v 1,..., v 500, 190 v i equal 0, 160 equal 1, and 150 equal 2. The population mean is µ = v i = 0 N N N = 0.92 i=1 If a tin plate is selected at random and X is the rv denoting the number of scratches, the mean value of X is 0.92 and we write µ X = 0.92, or E(X) = 0.92.

45 Two important properties of the mean Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles The mean µ satisfies: n (v i µ) = 0. i=1 Indeed, n i=1 (v i µ) = n i=1 v i nµ = nµ nµ = 0. For any number a we have n (v i µ) 2 i=1 n (v i a) 2. i=1 Setting the first derivative of n i=1 (v i a) 2 with respect to a equal to zero and solving for a gives a = µ. Also the second derivative is positive.

46 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Use the statistical population of the variable scratches for the 500 tin plates and the following R commands to illustrate the two properties of the mean: X=c(rep(0,190), rep(1,160), rep(2,150)); sum(x-mean(x)) sum((x-mean(x))**2); sum((x-1)**2)

47 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles If a sample of size n is taken, and x 1, x 2,..., x n denote the variable values of the sample units, then the sample average or sample mean, denoted by x, is x = 1 n x i n Under s.r. sampling, a sample mean approximates, but in general is different from the population mean. Example If a s.r. sample of n = 100 is taken from the 500 tin plates, it could be that there are n 0 = 40, n 1 = 34 and n 2 = 26 plates with 0, 1 and 2 scratches. In this case, x = i=1

48 Proportions are Averages! Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles A proportion is always the mean of a suitably defined random variable. To see this consider the example with the tin plates, where N 1 = 160 out of N = 500 have one scratch. Then: For the random variable X which takes the value 1 if a tin plate has one scratch and the value 0 otherwise, its statistical population, v 1,..., v 500, consists of 160 1s and 340 0s. Thus, µ X = v i = = i=1 But 0.32 = N 1 /N is the proportion of tin plates with 1 scratch.

49 Outline 1 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

50 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Let v 1, v 2,..., v N be a statistical population with mean µ. Definition The population variance, σ 2, is defined as σ 2 = 1 N (v i µ) 2. N i=1 The standard deviation is the positive square root of the variance: σ = σ 2. If the rv X denotes a randomly selected value from the statistical population, then a synonymous terminology for the population variance is variance of X, and is denoted by σx 2, or Var(X). The standard deviation of X is σ X = σx 2.

51 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles A simpler computational formula for the variance is Ėxample σ 2 = 1 N N i=1 v 2 i µ 2 Consider the tin plate example, so the statistical population v 1,..., v 500, has 190 v i equal 0, 160 equal 1, 150 equal 2, and µ = Then, σ 2 = =

52 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles If x 1, x 2,..., x n denotes a sample from the statistical population, the sample variance and its computational formula are: S 2 = 1 n (x i x) 2 = 1 [ n xi 2 1 ( n ) 2 x i ]. n 1 n 1 n i=1 i=1 The sample standard deviation is S = S 2. i=1 Under s.r. sampling, S 2 approximates, but in general is different from σ 2. Example Consider the s.r. sample of n = 100 tin plates, which has 40, 34 and 26 plates with 0, 1 and 2 scratches. Then, S 2 = 1 [ ] =

53 Why Divide by n 1? Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Because this way the mean of the sample variances resulting from all possible samples taken with replacement is equal to the population variance. A simple demonstration of this follows: Take a s.r.s. of size two from the infinite (conceptual) population of all coin tosses, i.e., toss a coin twice, setting 0 for H and 1 for T. The possible samples are {0, 0}, {0, 1}, {1, 0}, {1, 1}. Verify that the four sample variances average to In Chapter 3, we will see that the variance of this infinite population of all coin tosses equals the population variance of {0, 1}, which is (check!) 0.25.

54 Outline 1 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles

55 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Roughly speaking, the (1 α)100th sample percentile separates the part having the (1 α)100% smaller values, from that which has the α100% larger values. Thus: The 90th sample percentile separates the largest 10% from the lower 90% values in the data set. The 50th sample percentile is also called the sample median. The 25th, the 50th and the 75th sample percentiles are also called sample quartiles. The 25th and 75th percentiles are the lower quartile and upper quartile, respectively. The distance between the lower and upper quartiles is called the interquartile range or IQR.

56 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Order Statistics as Sample Percentiles Let X 1,..., X n be a s.r. sample from a continuous population. The ordered sample values are denoted X (1), X (2),..., X (n). Thus, X (1) < X (2) < < X (n). X (i), the ith smallest sample value, is defined to be the 100 i 0.5 n -th sample percentile.

57 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Example A s.r.s. of 10 black bears weights is: Give the order statistics, and state the sample percentiles they correspond to. Solution: The R command sort( c(154, 158, 356, 446, 40, 154, 90, 94, 150, 142) ) returns the order statistics: 40, 90, 94, 142, 150, 154, 154, 158, 356, 446. These order statistics are the 5th, 15th, 25th, 35th, 45th, 55th, 65th, 75th, 85th and 95th sample percentiles. For example, X (3) = 94 is the 100(3 0.5)/10 = 25-th sample percentile. In R these percentiles are obtained with: 100*(1:10-0.5)/10.]

58 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles In the above example none of the order statistics corresponds to the median or the 90th percentile. In general, if n is even, none of the order statistics corresponds to the median. For example, If n = 4 then X (2) is the = 37.5th sample percentile, while X (3) is the = 62.5th sample percentile. Depending on n, the above definition may not identify other percentiles of interest. In such cases, percentiles are defined by the use of interpolations.

59 Hand Calculation of Sample Median Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Definition Let X (1), X (2),..., X (n) denote the ordered sample values in a sample of size n. The sample median is defined as X ( n+1 2 ), if n is odd X = X ( n 2) + X ( n +1) 2, if n is even 2

60 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Example (Relation Between X and X) Find the sample median of X 1 = 2.3, X 2 = 3.2, X 3 = 1.8, X 4 = 2.5, X 5 = 2.7. Solution. Here, X (1) = 1.8, X (2) = 2.3, X (3) = 2.5, X (4) = 2.7, X (5) = 3.2. Since sample size is odd, X = X ( n+1 2 ) = X (3) = 2.5. For this data, X = X = 2.5. If X (5) is changed to 4.2, then X = 2.7 but X = 2.5. Thus X is affected by outliers, where as X is not. In general, if the histogram of the data is positively skewed X > X, and if it is negatively skewed X < X.

61 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Hand Calculation of Sample Quartiles and Sample IQR Definition The sample lower quartile or SLQ is defined as the median of the smallest n/2 values, if n is even the median the smallest (n + 1)/2 values, if n is odd The sample upper quartile or SUQ is defined as the median of the largest n/2 values, if n is even the median the largest (n + 1)/2 values, if n is odd The sample interquartile range, or SIQR, is defined as SIQR = SUQ SLQ,

62 Population Proportions and Sample Proportions Population Averages and Sample Averages Population Variance and Sample Variance Sample Percentiles Example Find the lower and upper quartiles of the n = 9 observations 9.39, 7.04, 7.17, 13.28, 9.00, 7.46, 21.06, 15.19, Solution. Since n is odd, the SLQ is the median of the Smallest 5(= (n + 1)/2) values: 7.04, 7.17, 7.46, 7.50, 9.00 and the SUQ is the median of the Largest 5(= (n + 1)/2) values: 9.00, 9.39, 13.28, 15.19, Thus SLQ = 7.46, and SUQ =

ST 371 (IX): Theories of Sampling Distributions

ST 371 (IX): Theories of Sampling Distributions ST 371 (IX): Theories of Sampling Distributions 1 Sample, Population, Parameter and Statistic The major use of inferential statistics is to use information from a sample to infer characteristics about

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

Lecture 3. Measures of Relative Standing and. Exploratory Data Analysis (EDA)

Lecture 3. Measures of Relative Standing and. Exploratory Data Analysis (EDA) Lecture 3. Measures of Relative Standing and Exploratory Data Analysis (EDA) Problem: The average weekly sales of a small company are $10,000 with a standard deviation of $450. This week their sales were

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

Chapter Goals. To introduce you to data collection

Chapter Goals. To introduce you to data collection Chapter Goals To introduce you to data collection You will learn to think critically about the data collected or presented learn various methods for selecting a sample Formulate Theories Interpret Results/Make

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Outline PMF, CDF and PDF Mean, Variance and Percentiles Some Common Distributions. Week 5 Random Variables and Their Distributions

Outline PMF, CDF and PDF Mean, Variance and Percentiles Some Common Distributions. Week 5 Random Variables and Their Distributions Week 5 Random Variables and Their Distributions Week 5 Objectives This week we give more general definitions of mean value, variance and percentiles, and introduce the first probability models for discrete

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Statistical Inference, Populations and Samples

Statistical Inference, Populations and Samples Chapter 3 Statistical Inference, Populations and Samples Contents 3.1 Introduction................................... 2 3.2 What is statistical inference?.......................... 2 3.2.1 Examples of

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Part 7: Glossary Overview

Part 7: Glossary Overview Part 7: Glossary Overview In this Part This Part covers the following topic Topic See Page 7-1-1 Introduction This section provides an alphabetical list of all the terms used in a STEPS surveillance with

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Business Statistics:

Business Statistics: Department of Quantitative Methods & Information Systems Business Statistics: Chapter 7 Introduction to Sampling Distributions QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing this chapter,

More information

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2 Probability Probability is the study of uncertain events or outcomes. Games of chance that involve rolling dice or dealing cards are one obvious area of application. However, probability models underlie

More information

1 Measures of the Center of a Distribution

1 Measures of the Center of a Distribution 1 Measures of the Center of a Distribution Qualitative descriptions of the shape of a distribution are important and useful. But we will often desire the precision of numerical summaries as well. Two aspects

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Chapter 3: Probability 3.1: Basic Concepts of Probability

Chapter 3: Probability 3.1: Basic Concepts of Probability Chapter 3: Probability 3.1: Basic Concepts of Probability Objectives Identify the sample space of a probability experiment and a simple event Use the Fundamental Counting Principle Distinguish classical

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio. Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency

More information

1. Descriptive stats methods for organizing and summarizing information

1. Descriptive stats methods for organizing and summarizing information Two basic types of statistics: 1. Descriptive stats methods for organizing and summarizing information Stats in sports are a great example Usually we use graphs, charts, and tables showing averages and

More information

Business Statistics:

Business Statistics: Chapter 7 Student Lecture Notes 7-1 Department of Quantitative Methods & Information Systems Business Statistics: Chapter 7 Introduction to Sampling Distributions QMIS 220 Dr. Mohammad Zainal Chapter Goals

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) 3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Topic 2 We next look at quantitative data. Recall that in this case, these data can be subject to the operations of arithmetic. In particular, we can add or subtract observation values, we can sort them

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Fourier and Stats / Astro Stats and Measurement : Stats Notes

Fourier and Stats / Astro Stats and Measurement : Stats Notes Fourier and Stats / Astro Stats and Measurement : Stats Notes Andy Lawrence, University of Edinburgh Autumn 2013 1 Probabilities, distributions, and errors Laplace once said Probability theory is nothing

More information

Statistics 301: Probability and Statistics Introduction to Statistics Module

Statistics 301: Probability and Statistics Introduction to Statistics Module Statistics 301: Probability and Statistics Introduction to Statistics Module 1 2018 Introduction to Statistics Statistics is a science, not a branch of mathematics, but uses mathematical models as essential

More information

Mean/Average Median Mode Range

Mean/Average Median Mode Range Normal Curves Today s Goals Normal curves! Before this we need a basic review of statistical terms. I mean basic as in underlying, not easy. We will learn how to retrieve statistical data from normal curves.

More information

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for BUSINESS STATISTICS - BMGT 571 Chapters 1 through 6 Professor Ahmadi, Ph.D. Department of Management Revised May 005 Glossary of Terms: Statistics Chapter 1 Data Data Set Elements Variable

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22 Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging

More information

and the Sample Mean Random Sample

and the Sample Mean Random Sample MATH 183 Random Samples and the Sample Mean Dr. Neal, WKU Henceforth, we shall assume that we are studying a particular measurement X from a population! for which the mean µ and standard deviation! are

More information

A Brief Review of Statistics Concepts

A Brief Review of Statistics Concepts Brief Review of Statistics Concepts Statistics? set of principles and procedures for collecting, compiling, analyzing and interpreting data in order to assist in making decisions in the presence of uncertainty.

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST

Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST Elisha Mae Kostka 243 Assignment Mock Test 1 due 02/11/2015 at 09:01am PST 1. (1 pt) Luis Gonzalez began his career as a major league baseball player in 1990. You are given a sample of the number of homeruns

More information

Probabilities and Statistics Probabilities and Statistics Probabilities and Statistics

Probabilities and Statistics Probabilities and Statistics Probabilities and Statistics - Lecture 8 Olariu E. Florentin April, 2018 Table of contents 1 Introduction Vocabulary 2 Descriptive Variables Graphical representations Measures of the Central Tendency The Mean The Median The Mode Comparing

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

Now we will define some common sampling plans and discuss their strengths and limitations.

Now we will define some common sampling plans and discuss their strengths and limitations. Now we will define some common sampling plans and discuss their strengths and limitations. 1 For volunteer samples individuals are self selected. Participants decide to include themselves in the study.

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter

More information

An introduction to biostatistics: part 1

An introduction to biostatistics: part 1 An introduction to biostatistics: part 1 Cavan Reilly September 6, 2017 Table of contents Introduction to data analysis Uncertainty Probability Conditional probability Random variables Discrete random

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Chapter 5 Statistical Inference

Chapter 5 Statistical Inference Chapter 5 Statistical Inference CHAPTER OUTLINE Section 1 Why Do We Need Statistics? Section 2 Inference Using a Probability Model Section 3 Statistical Models Section 4 Data Collection Section 5 Some

More information

MATH 250 / SPRING 2011 SAMPLE QUESTIONS / SET 3

MATH 250 / SPRING 2011 SAMPLE QUESTIONS / SET 3 MATH 250 / SPRING 2011 SAMPLE QUESTIONS / SET 3 1. A four engine plane can fly if at least two engines work. a) If the engines operate independently and each malfunctions with probability q, what is the

More information

POPULATION AND SAMPLE

POPULATION AND SAMPLE 1 POPULATION AND SAMPLE Population. A population refers to any collection of specified group of human beings or of non-human entities such as objects, educational institutions, time units, geographical

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

Math 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7

Math 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7 Math 138 Summer 1 2013 Section 412- Unit Test 1 Green Form page 1 of 7 1. Multiple Choice. Please circle your answer. Each question is worth 3 points. (a) Social Security Numbers are illustrations of which

More information

Survey Sample Methods

Survey Sample Methods Survey Sample Methods p. 1/54 Survey Sample Methods Evaluators Toolbox Refreshment Abhik Roy & Kristin Hobson abhik.r.roy@wmich.edu & kristin.a.hobson@wmich.edu Western Michigan University AEA Evaluation

More information

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters. Chapter 9: Sampling Distributions 9.1: Sampling Distributions IDEA: How often would a given method of sampling give a correct answer if it was repeated many times? That is, if you took repeated samples

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

2 Descriptive Statistics

2 Descriptive Statistics 2 Descriptive Statistics Reading: SW Chapter 2, Sections 1-6 A natural first step towards answering a research question is for the experimenter to design a study or experiment to collect data from the

More information

STAT:5100 (22S:193) Statistical Inference I

STAT:5100 (22S:193) Statistical Inference I STAT:5100 (22S:193) Statistical Inference I Week 3 Luke Tierney University of Iowa Fall 2015 Luke Tierney (U Iowa) STAT:5100 (22S:193) Statistical Inference I Fall 2015 1 Recap Matching problem Generalized

More information

Senior Math Circles November 19, 2008 Probability II

Senior Math Circles November 19, 2008 Probability II University of Waterloo Faculty of Mathematics Centre for Education in Mathematics and Computing Senior Math Circles November 9, 2008 Probability II Probability Counting There are many situations where

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

SUMMARIZING MEASURED DATA. Gaia Maselli

SUMMARIZING MEASURED DATA. Gaia Maselli SUMMARIZING MEASURED DATA Gaia Maselli maselli@di.uniroma1.it Computer Network Performance 2 Overview Basic concepts Summarizing measured data Summarizing data by a single number Summarizing variability

More information

dates given in your syllabus.

dates given in your syllabus. Slide 2-1 For exams (MD1, MD2, and Final): You may bring one 8.5 by 11 sheet of paper with formulas and notes written or typed on both sides to each exam. For the rest of the quizzes, you will take your

More information

Introduction to Basic Statistics Version 2

Introduction to Basic Statistics Version 2 Introduction to Basic Statistics Version 2 Pat Hammett, Ph.D. University of Michigan 2014 Instructor Comments: This document contains a brief overview of basic statistics and core terminology/concepts

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

15: CHI SQUARED TESTS

15: CHI SQUARED TESTS 15: CHI SQUARED ESS MULIPLE CHOICE QUESIONS In the following multiple choice questions, please circle the correct answer. 1. Which statistical technique is appropriate when we describe a single population

More information

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias Recap Announcements Lecture 5: Statistics 101 Mine Çetinkaya-Rundel September 13, 2011 HW1 due TA hours Thursday - Sunday 4pm - 9pm at Old Chem 211A If you added the class last week please make sure to

More information

Estadística I Exercises Chapter 4 Academic year 2015/16

Estadística I Exercises Chapter 4 Academic year 2015/16 Estadística I Exercises Chapter 4 Academic year 2015/16 1. An urn contains 15 balls numbered from 2 to 16. One ball is drawn at random and its number is reported. (a) Define the following events by listing

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Conditional Probability

Conditional Probability Conditional Probability Idea have performed a chance experiment but don t know the outcome (ω), but have some partial information (event A) about ω. Question: given this partial information what s the

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Statistical Methods: Introduction, Applications, Histograms, Ch

Statistical Methods: Introduction, Applications, Histograms, Ch Outlines Statistical Methods: Introduction, Applications, Histograms, Characteristics November 4, 2004 Outlines Part I: Statistical Methods: Introduction and Applications Part II: Statistical Methods:

More information

STATISTICS 1 REVISION NOTES

STATISTICS 1 REVISION NOTES STATISTICS 1 REVISION NOTES Statistical Model Representing and summarising Sample Data Key words: Quantitative Data This is data in NUMERICAL FORM such as shoe size, height etc. Qualitative Data This is

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Week 1: Intro to R and EDA

Week 1: Intro to R and EDA Statistical Methods APPM 4570/5570, STAT 4000/5000 Populations and Samples 1 Week 1: Intro to R and EDA Introduction to EDA Objective: study of a characteristic (measurable quantity, random variable) for

More information

Introduction to Probability and Statistics Slides 1 Chapter 1

Introduction to Probability and Statistics Slides 1 Chapter 1 1 Introduction to Probability and Statistics Slides 1 Chapter 1 Prof. Ammar M. Sarhan, asarhan@mathstat.dal.ca Department of Mathematics and Statistics, Dalhousie University Fall Semester 2010 Course outline

More information

Unit 1: Statistics. Mrs. Valentine Math III

Unit 1: Statistics. Mrs. Valentine Math III Unit 1: Statistics Mrs. Valentine Math III 1.1 Analyzing Data Statistics Study, analysis, and interpretation of data Find measure of central tendency Mean average of the data Median Odd # data pts: middle

More information

MAT Mathematics in Today's World

MAT Mathematics in Today's World MAT 1000 Mathematics in Today's World Last Time We discussed the four rules that govern probabilities: 1. Probabilities are numbers between 0 and 1 2. The probability an event does not occur is 1 minus

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score Interpret Standard Deviation Outlier Rule Linear Transformations Describe the Distribution OR Compare the Distributions SOCS Using Normalcdf and Invnorm (Calculator Tips) Interpret a z score What is an

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

Why Sample? Selecting a sample is less time-consuming than selecting every item in the population (census).

Why Sample? Selecting a sample is less time-consuming than selecting every item in the population (census). Why Sample? Selecting a sample is less time-consuming than selecting every item in the population (census). Selecting a sample is less costly than selecting every item in the population. An analysis of

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

Discrete Random Variables

Discrete Random Variables Discrete Random Variables An Undergraduate Introduction to Financial Mathematics J. Robert Buchanan 2014 Introduction The markets can be thought of as a complex interaction of a large number of random

More information

What are the mean, median, and mode for the data set below? Step 1

What are the mean, median, and mode for the data set below? Step 1 Unit 11 Review Analyzing Data Name Per The mean is the average of the values. The median is the middle value(s) when the values are listed in order. The mode is the most common value(s). What are the mean,

More information

Lecture 41 Sections Mon, Apr 7, 2008

Lecture 41 Sections Mon, Apr 7, 2008 Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Mon, Apr 7, 2008 Outline 1 2 3 4 5 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information