University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012
|
|
- Gavin Bryan
- 5 years ago
- Views:
Transcription
1 University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot below (a) What is the median of the data represented in this stemplot? There are 3 data points; the median is the (3 + )/2th largest, or 7th largest, which is 6. (b) The mean of the data represented in this stemplot is (circle one): much smaller than about equal to much larger than the median. Explain your answer without any explicit computations. Since the distribution is right-skewed, the mean is much larger than the median. (c) One of the images below is a boxplot for the data given in the stemplot. Circle that boxplot. No explanation is necessary. The bottom of the three boxplots is the correct one. There is a right outlier (corresponding to 78 in the data) and the boxplot is otherwise typical of a right-skewed distribution.
2 2. [6: 3+3] Below is a histogram for a data set containing nine numbers. Histogram of x Frequency x (a) Can you determine the median of this data set exactly? If you can, do so. If not, explain why not and give the best possible bounds on the median. (For example, the median is clearly between and 8. But you can do better.) There are nine data points, so the median is the fifth smallest. One is between and 2, one between 2 and 3, and two between 3 and 4. The fifth smallest data point is between 4 and 5 but we can t say what it is more precisely than that. (b) Can you determine the mean of this data set exactly? If you can, do so. If not, explain why not and give the best possible bounds on the mean. The smallest data point is between and 2; the second smallest is between 2 and 3; and so on. So the mean is at least ( )/9 = 36/9 = 4 and at most more than this, or 5. 2
3 3. [8: ] Consider the data set of four points given below: x y (a) Find the standard deviation s x. The mean is ( )/4 = 2; the standard deviation is 4 (( 2)2 + (2 2) 2 + (2 2) 2 + (3 2) 2 ) = 2 3. (b) Find the standard deviation s y. The mean is ( )/4 = 3; the standard deviation is 4 ((3 2)2 + (3 2) 2 + (3 4) 2 + (3 4) 2 ) = 4 3. (c) Find the coefficient of correlation r. We have the formula r = n and so Now, plugging in values, r = n i= (4 ) 2/3 4/3 x i x y i ȳ s x s y n (x i 2)(y i 3). r = 8 ( 2)(2 3) + (2 2)(2 3) + (2 2)(4 3) + (3 2)(4 3) = 2 8 = 2. i= (d) What is the equation of the regression line for predicting y from x? The regression line passes through ( x, ȳ) = (2, 3) and has slope rs y /s x =. Thus its equation is y = x +. 3
4 4. [5: 2+2+] Scores on the math section of the SAT are normally distributed with mean 500 and standard deviation 00. (a) What proportion of math SAT scores are between 60 and 680? Standardizing gives z =., z =.8. So we want Φ(.8) Φ(.) = = (b) What score is at the 80th percentile of math SAT scores? From the normal table, Φ (0.8) = Unstandardizing gives (00)(0.84) = 584. (c) The proportion of students scoring less than 350 is (circle one): greater than the number scoring at least 630 between the number scoring at least 630 and the number scoring at least 670 less than the number scoring at least 670 The normal distribution is symmetric around its mean, so the number scoring less than 350 (=500-50) is the same as the number scoring greater than 650 (=500+50). 4
5 Name: 5. [3] Let r M be the coefficient of correlation between the heights and weights of adult men. Let r A be the coefficient of correlation between the heights and weights of all adults. Which of the following is true? Circle one. r M < r A r M = r A r M > r A Explain your answer, using a clearly labeled diagram and/or a few sentences of text. This was intended to be a problem about the restricted range effect. If we know someone s height then knowing their gender doesn t give us much additional information about predicting their weight, so the residuals in the men-only case and in the all-adults case are similar. But the variance of the weights of all adults is much larger than the variance of the heights of men. We recall that r 2 is the variance of the residuals divided by the variance of the response variable. This quotient has larger denominator for all adults, so r 2 M > r2 A ; rearranging (and assuming correlations are positive) gives r M < r A. However, it turns out that there is significant overlap between the distribution of heights and weights of men and that of women, so this doesn t really happen. In fact, from actual data, r M > r A. If you put this, see us and we ll give you back a point. 6. [3] In one study, it was necessary to draw a representative sample of Japanese- Americans resident in San Francisco. The procedure was as follows. After consultation with representative figures in the Japanese community, the four most representative blocks in the Japanese area of the city were chosen. All persons resident in those four blocks were taken for the sample. However, a comparison with Census data shows that the sample did not include a high enough proportion of Japanese with college degrees. How can this be explained? People living within the Japanese community are likely to be less well assimilated to American culture (and perhaps more likely to not be fluent in English). As a result a sample which overrepresents this community will have a lower number of people with college degrees. 5
6 Name: 7. [6: ] The figure below shows a scatter diagram of the high temperatures in San Francisco (SFO) and Los Angeles (LAX) for each day in LAX temps vs. SFO temps high temperature at SFO high temperature at LAX Q R S S R Q (a) Three lines are drawn, and are labeled Q, R, and S. For each description circle the letter of the line it corresponds to. (i) Estimated average high at LAX, for a given high at SFO Q R S (ii) Estimated average high at SFO, for a given high at LAX Q R S (iii) Nearly equal percentile ranks in both data sets Q R S (b) The coefficient of correlation for these 365 points is closest to circle one (c) The average of the 3 high temperatures for January 20 at SFO was 56.7 degrees; the average of the 3 high temperatures for January 20 at LAX was 68.7 degrees. This gives us the point (56.7, 68.7). We could compute similar points for the other eleven months of the year, and compute the correlation coefficient of these twelve points. The coefficient of correlation of the twelve monthly averages is (circle one) less than equal to greater than the coefficient of correlation of the original 365 daily data points. Briefly explain your answer. This is an ecological correlation; the averaging removes the day-to-day fluctuation and just leaves the seasonal trend, namely that both places are cool in the winter and warm in the summer. 6
MATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationare the objects described by a set of data. They may be people, animals or things.
( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationCHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.
(c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals
More informationTEST 1 M3070 Fall 2003
TEST 1 M3070 Fall 2003 Show all work. Name: Problem 1. (10 points Below are the daily high temperatures, in degrees Fahrenheit, for Salt Lake City during July 2003 (31 days. The decimal point is 1 digit(s
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More informationChapter 6 Group Activity - SOLUTIONS
Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More informationResistant Measure - A statistic that is not affected very much by extreme observations.
Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)
More informationPractice Questions for Exam 1
Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon
More informationReview. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24
Midterm Exam Midterm Review AMS-UCSC May 6th, 2015 Spring 2015. Session 1 (Midterm Review) AMS-5 May 6th, 2015 1 / 24 Topics Topics We will talk about... 1 Review Spring 2015. Session 1 (Midterm Review)
More informationChapter 5: Exploring Data: Distributions Lesson Plan
Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The
More informationCh Inference for Linear Regression
Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean
More informationQ 1 = 23.8 M = Q 3 = 29.8 IQR = 6 The numbers are in order and there are 18 pieces of data so the median is the average of the 9th and 10th
Sample Exam #1, Math 01 1. Use the data set given below to answer all of the following questions. 14.0, 18.4, 1.6,.1, 3.8, 4.3, 5.9, 6.5, 7.5, 9., 9.3, 9.4, 9.7, 9.8, 30., 30.8, 31.9, 33.5 HaL Use the
More informationMath 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7
Math 138 Summer 1 2013 Section 412- Unit Test 1 Green Form page 1 of 7 1. Multiple Choice. Please circle your answer. Each question is worth 3 points. (a) Social Security Numbers are illustrations of which
More informationYou have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.
Data 8 Fall 2017 Foundations of Data Science Final INSTRUCTIONS You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. The exam is closed
More informationStatistical View of Least Squares
May 23, 2006 Purpose of Regression Some Examples Least Squares Purpose of Regression Purpose of Regression Some Examples Least Squares Suppose we have two variables x and y Purpose of Regression Some Examples
More informationExam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Exam: practice test MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. ) Using the information in the table on home sale prices in
More informationMultiple Choice Circle the letter corresponding to the best answer for each of the problems below (4 pts each)
Math 221 Hypothetical Exam 1, Wi2008, (Chapter 1-5 in Moore, 4th) April 3, 2063 S. K. Hyde, S. Barton, P. Hurst, K. Yan Name: Show all your work to receive credit. All answers must be justified to get
More informationWhat is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected
What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types
More informationStatistics 100 Exam 2 March 8, 2017
STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF
More informationSTATISTICS/MATH /1760 SHANNON MYERS
STATISTICS/MATH 103 11/1760 SHANNON MYERS π 100 POINTS POSSIBLE π YOUR WORK MUST SUPPORT YOUR ANSWER FOR FULL CREDIT TO BE AWARDED π YOU MAY USE A SCIENTIFIC AND/OR A TI-83/84/85/86 CALCULATOR ONCE YOU
More informationMrs. Poyner/Mr. Page Chapter 3 page 1
Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationThe Normal Distribution. Chapter 6
+ The Normal Distribution Chapter 6 + Applications of the Normal Distribution Section 6-2 + The Standard Normal Distribution and Practical Applications! We can convert any variable that in normally distributed
More informationPractice problems from chapters 2 and 3
Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal,
More information1.3.1 Measuring Center: The Mean
1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations
More informationMath 140 Introductory Statistics
Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The
More informationMath 140 Introductory Statistics
Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The
More informationQ1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74
Lecture 4 This week lab:exam 1! Review lectures, practice labs 1 to 4 and homework 1 to 5!!!!! Need help? See me during my office hrs, or goto open lab or GS 211. Bring your picture ID and simple calculator.(note
More informationChapter 4: Displaying and Summarizing Quantitative Data
Chapter 4: Displaying and Summarizing Quantitative Data This chapter discusses methods of displaying quantitative data. The objective is describe the distribution of the data. The figure below shows three
More informationMeasures of the Location of the Data
Measures of the Location of the Data 1. 5. Mark has 51 films in his collection. Each movie comes with a rating on a scale from 0.0 to 10.0. The following table displays the ratings of the aforementioned
More informationChapter 5. Understanding and Comparing. Distributions
STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationData Set 1A: Algal Photosynthesis vs. Salinity and Temperature
Data Set A: Algal Photosynthesis vs. Salinity and Temperature Statistical setting These data are from a controlled experiment in which two quantitative variables were manipulated, to determine their effects
More informationLecture 30. DATA 8 Summer Regression Inference
DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and
More informationChapter 3. Measuring data
Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring
More informationStat 20 Midterm 1 Review
Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets
More informationName: JMJ April 10, 2017 Trigonometry A2 Trimester 2 Exam 8:40 AM 10:10 AM Mr. Casalinuovo
Name: JMJ April 10, 2017 Trigonometry A2 Trimester 2 Exam 8:40 AM 10:10 AM Mr. Casalinuovo Part 1: You MUST answer this problem. It is worth 20 points. 1) Temperature vs. Cricket Chirps: Crickets make
More informationSem. 1 Review Ch. 1-3
AP Stats Sem. 1 Review Ch. 1-3 Name 1. You measure the age, marital status and earned income of an SRS of 1463 women. The number and type of variables you have measured is a. 1463; all quantitative. b.
More informationUnits. Exploratory Data Analysis. Variables. Student Data
Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as
More informationContinuous distributions
Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement
More informationLecture 1: Description of Data. Readings: Sections 1.2,
Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then
More informationChapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line
Chapter 7 Linear Regression (Pt. 1) 7.1 Introduction Recall that r, the correlation coefficient, measures the linear association between two quantitative variables. Linear regression is the method of fitting
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationSection 5.4 Residuals
Section 5.4 Residuals A residual value is the difference between an actual observed y value and the corresponding predicted y value, y. Residuals are just errors. Residual error = observed value predicted
More informationM 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75
M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-13 13 14 3 15 8 16 4 17 10 18 9 19 7 20 3 21 16 22 2 Total 75 1 Multiple choice questions (1 point each) 1. Look at
More informationChapter 6 Assessment. 3. Which points in the data set below are outliers? Multiple Choice. 1. The boxplot summarizes the test scores of a math class?
Chapter Assessment Multiple Choice 1. The boxplot summarizes the test scores of a math class? Test Scores 3. Which points in the data set below are outliers? 73, 73, 7, 75, 75, 75, 77, 77, 77, 77, 7, 7,
More informationDensity Curves and the Normal Distributions. Histogram: 10 groups
Density Curves and the Normal Distributions MATH 2300 Chapter 6 Histogram: 10 groups 1 Histogram: 20 groups Histogram: 40 groups 2 Histogram: 80 groups Histogram: 160 groups 3 Density Curve Density Curves
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationChapter 5: Exploring Data: Distributions Lesson Plan
Lesson Plan Exploring Data Displaying Distributions: Histograms For All Practical Purposes Mathematical Literacy in Today s World, 7th ed. Interpreting Histograms Displaying Distributions: Stemplots Describing
More informationChapters 1 & 2 Exam Review
Problems 1-3 refer to the following five boxplots. 1.) To which of the above boxplots does the following histogram correspond? (A) A (B) B (C) C (D) D (E) E 2.) To which of the above boxplots does the
More information3.1 Measure of Center
3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects
More information11 Correlation and Regression
Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value
More informationQUANTITATIVE DATA. UNIVARIATE DATA data for one variable
QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE
More information1 Measures of the Center of a Distribution
1 Measures of the Center of a Distribution Qualitative descriptions of the shape of a distribution are important and useful. But we will often desire the precision of numerical summaries as well. Two aspects
More informationPrentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)
National Advanced Placement (AP) Statistics Course Outline (Grades 9-12) Following is an outline of the major topics covered by the AP Statistics Examination. The ordering here is intended to define the
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationSections 6.1 and 6.2: The Normal Distribution and its Applications
Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More informationMath Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values.
Measures of Central Tendency Statistics 1) Mean: The of all data values Mean= x = x 1+x 2 +x 3 + +x n n i.e: Add up all values and divide by the total number of values. 2) Mode: Most data value 3) Median:
More informationFurther Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data
Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)
More informationSTP 420 INTRODUCTION TO APPLIED STATISTICS NOTES
INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make
More information7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.
AP Statistics 15 Inference for Regression I. Regression Review a. r à correlation coefficient or Pearson s coefficient: indicates strength and direction of the relationship between the explanatory variables
More informationQUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%
QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% 1) (6 points). A college has 32 course sections in math. A frequency table for the numbers of students
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More informationAP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam.
AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam. Name: Directions: The questions or incomplete statements below are each followed by
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationRecall that the standard deviation σ of a numerical data set is given by
11.1 Using Normal Distributions Essential Question In a normal distribution, about what percent of the data lies within one, two, and three standard deviations of the mean? Recall that the standard deviation
More informationDetermining the Spread of a Distribution
Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative
More informationSection 2.3: One Quantitative Variable: Measures of Spread
Section 2.3: One Quantitative Variable: Measures of Spread Objectives: 1) Measures of spread, variability a. Range b. Standard deviation i. Formula ii. Notation for samples and population 2) The 95% rule
More informationFrancine s bone density is 1.45 standard deviations below the mean hip bone density for 25-year-old women of 956 grams/cm 2.
Chapter 3 Solutions 3.1 3.2 3.3 87% of the girls her daughter s age weigh the same or less than she does and 67% of girls her daughter s age are her height or shorter. According to the Los Angeles Times,
More informationUnit 6 - Introduction to linear regression
Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,
More informationDetermining the Spread of a Distribution
Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative
More information6.2b Homework: Fit a Linear Model to Bivariate Data
6.2b Homework: Fit a Linear Model to Bivariate Data Directions: For the following problems, draw a line of best fit, write a prediction function, and use your function to make predictions. Prior to drawing
More informationMean, Median, Mode, and Range
Mean, Median, Mode, and Range Mean, median, and mode are measures of central tendency; they measure the center of data. Range is a measure of dispersion; it measures the spread of data. The mean of a data
More informationExample 2. Given the data below, complete the chart:
Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is
More informationEQ: What is a normal distribution?
Unit 5 - Statistics What is the purpose EQ: What tools do we have to assess data? this unit? What vocab will I need? Vocabulary: normal distribution, standard, nonstandard, interquartile range, population
More informationChapter 1. Looking at Data
Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,
More informationChapter 1 - Lecture 3 Measures of Location
Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What
More informationThe empirical ( ) rule
The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%
More informationContinuous distributions
Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement
More informationBasic Statistics Exercises 66
Basic Statistics Exercises 66 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random
More informationHOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74
ANNOUNCEMENTS: Grades available on eee for Week 1 clickers, Quiz and Discussion. If your clicker grade is missing, check next week before contacting me. If any other grades are missing let me know now.
More information(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)
3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions
More informationCHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the
CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful
More informationChapter 6. Exploring Data: Relationships. Solutions. Exercises:
Chapter 6 Exploring Data: Relationships Solutions Exercises: 1. (a) It is more reasonable to explore study time as an explanatory variable and the exam grade as the response variable. (b) It is more reasonable
More informationCHAPTER 1. Introduction
CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing
More informationUNIT 12 ~ More About Regression
***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests
More informationChapter 3: Examining Relationships Review Sheet
Review Sheet 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) the yield of the crop. D) either
More informationAP Statistics Summer Assignment
AP Statistics Summer Assignment David_I_Beck@mcpsmd.org Welcome to AP Statistics. You will need to able to use your graphing calculator with its statistics package to enter data, calculate simple statistics
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures
More informationUnit 6 - Simple linear regression
Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable
More informationMeasures of center. The mean The mean of a distribution is the arithmetic average of the observations:
Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number
More informationChapter 3: The Normal Distributions
Chapter 3: The Normal Distributions http://www.yorku.ca/nuri/econ2500/econ2500-online-course-materials.pdf graphs-normal.doc / histogram-density.txt / normal dist table / ch3-image Ch3 exercises: 3.2,
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationAlgebra Calculator Skills Inventory Solutions
Algebra Calculator Skills Inventory Solutions 1. The equation P = 1.25x 15 represents the profit in dollars when x widgets are sold. Find the profit if 450 widgets are sold. A. $427.50 B. $697.50 C. $562.50
More informationRecall, Positive/Negative Association:
ANNOUNCEMENTS: Remember that discussion today is not for credit. Go over R Commander. Go to 192 ICS, except at 4pm, go to 192 or 174 ICS. TODAY: Sections 5.3 to 5.5. Note this is a change made in the daily
More information