Histograms, Central Tendency, and Variability
|
|
- Miranda Cunningham
- 6 years ago
- Views:
Transcription
1 The Economist, September 6, Histograms, Central Tendency, and Variability Lecture 2 Reading: Sections Includes ALL margin notes and boxes: For Example, Guided Example, Notation Alert, Just Checking, Optional Math Boxes, What Can Go Wrong? and Ethics in Action 2 Histogram graphically describes how a single variable containing interval data is distributed Range of data divided into non-overlapping and equal width classes (bins) that cover range of values Histogram Fraction n = 174 countries Inflation Rate, 211 How many bins? Width of bins? 3 Lecture 2, Page 1 of 11
2 8 n = 174 countries n = 174 countries Frequency Fraction Inflation Rate, Inflation Rate, n = 174 countries Inflation Rate, 211 Frequency histogram: Bar height number of observations in bin Relative frequency histogram: Bar height fraction of obs. in bin histogram: Bar area measures the fraction of observations in bin Inflation Rate, Inflation Rate, Inflation Rate, Number of bins changes the appearance of the histogram Sturges formula: # of bins = 1 + 3*log(n) [Note log base 1.] Inflation Rate, 211 OECD inflation: 1 + 3*log(34) = [but STATA picked 5] 6 Lecture 2, Page 2 of 11
3 Shape of Things Histogram gives overview of a variable with a single picture Can make informal inferences about the shape of population Symmetric: If draw an imaginary line at center, have mirror image on each side Positively skewed: long tail to right (aka skewed to the right) Negatively skewed: long tail to left (aka skewed to the left) Modality: # major peaks Bell/Normal/Gaussian 7 Four Perfectly Symmetric Histograms Four Positively Skewed Histograms Lecture 2, Page 3 of 11
4 Four Negatively Skewed Histograms Four Bimodal Histograms Two Perfectly Bell Shaped Histograms Are these symmetric? Skewed? Unimodal? Take a good look because this is one of the last times you will see perfect bell shaped (normal) histograms 12 Lecture 2, Page 4 of 11
5 .5 n = 185 countries; Source CIA Fraction GDP per capita (PPP), 212 est Samples vs. Populations Sample is a random subset of population Sampling noise: Chance differences between population and a random sample Driven by the sample size, not sample size relative to the population size, which is assumed infinite (pp. 3 31, The Sample Size is What Matters ) Informal inference: consider sample size (n) Never see the perfect forms (Plato): statements about shape always approximate Nearly Normal Condition Population, N = 1,,.5 Sample 1; n = Sample 2; n = 1 How many samples would you have in real life? Why aren t these samples perfectly Bell shaped? Sample 3; n = Lecture 2, Page 5 of 11
6 5 5.5 Population, N = 1,, Sample 1; n = 3 Why are there more bins than last slide? Sample 2; n = 3 Sample 3; n = Population, N = 1,, Sample 1; n = Sample 2; n = 1 Sample 3; n = What to Conclude About Shape? n: X n: Y Is the graph on the left symmetric? Bell shaped? Is the graph on the right symmetric? Bell shaped? Bi-modal? 18 Lecture 2, Page 6 of 11
7 Hsieh and Olken (214) JEP The Missing Missing Middle Summer Appendix, There is a clear bimodality in the distribution of valueadded/capital for the large firms. However, the capital questionnaire for large firms was ambiguous as to whether the results were to be entered in thousands of Rupiah or Rupiah. Our best guess is that approximately half the firms used thousands and half used millions. 2 Summary Statistics Statistics (i.e. summary statistics) give a concise idea of what data look like For a single variable, statistics can give numeric measures of: Central tendency: mean and median Variability: range, variance, standard deviation, coefficient of variation, R Relative standing: percentiles For two variables, also measure relationship 21 Lecture 2, Page 7 of 11
8 Mean and Median Population mean, a parameter: μ = N Sample mean, a statistic: X = n n Which is subject to sampling error? i=1 x i N i=1 x i Median is the middle obs. after sorting if even # of obs., average 2 middle ones Fraction.5 mean = 3, median = Inflation Rate, Normal Distribution Population mu=1., med= Is X (a statistic) equal to µ (a parameter)? Sample, n=49 X-bar=13.9, med= Why is the population mean equal to the population median? Why isn t the sample mean equal to the sample median? 23 A Symmetric Distribution (Uniform).8.6 Population mu=5., med= Book Rating Sample, n=41 X-bar=55, med= Book Rating Why is the population mean equal to the population median? Why is the sample median different from the population median? 24 Lecture 2, Page 8 of 11
9 Fraction n = 174 countries mean = 6.6, median = 5. Why is the mean greater than the median? Inflation Rate, Measures of Variability (Spread) Range: max min Variance: N i=1 σ 2 = x i μ 2 N s 2 = n x 2 i=1 i X n 1 Standard deviation: s = variance Coefficient of variation (textbook) min = -, max = 6.5 var = 1.6, sd = Inflation Rate, Breaking Down Variance Numerator: total sum of squares (TSS) If all sampled countries have 3% inflation (x i = 3 for all i), what would TSS & s 2 be? Denominator: ν ( nu ) Only n 1 free obs left after calculate mean Units of variance? How about s.d.? s 2 = n x 2 i=1 i X n 1 n TSS = x i X 2 i=1 degrees of freedom: ν = n 1 27 Lecture 2, Page 9 of 11
10 Empirical Rule (Normal/Bell) If a random sample is drawn from a Normal population then about: 68% of observations will lie within 1 s.d. of the mean (i.e. between X s and X + s) 95% of observations will lie within 2 s.d. of the mean (i.e. between X 2s and X + 2s) 99.7% of observations will lie within 3 s.d. of the mean (i.e. between X 3s and X + 3s) Empirical Rule only applies if normal 28 n = 246, mean = 99, sd = Histogram #1, n = X.8.6 Histogram #2, n = X Histogram #3, n = Histogram #4, n = e X X Approximately, what is the s.d. of the variable in each histogram? 3 Lecture 2, Page 1 of 11
11 Chebysheff s Theorem At least 1*(1 1/k 2 )% of observations lie within k s.d. s of the mean for k>1 At least 75% of obs. lie within 2 s.d. of mean 1 1/2 2 = 3/4 At least 89% of obs. lie within 3 s.d. of mean 1-1/3 2 = 8/9 Can be applied to all samples no matter how population is distributed What about within one s.d.? 31.5 n = 185 countries mean = 14955, sd = Fraction GDP per capita (PPP), 212 est. 32 Lecture 2, Page 11 of 11
Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.
Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3- Measures of Center 3-3 Measures of Variation Section 3-1: Overview Descriptive Statistics summarize or describe the important
More informationChapter 3. Measuring data
Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationPreliminary Statistics course. Lecture 1: Descriptive Statistics
Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.
More informationChapter 5. Understanding and Comparing. Distributions
STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume
More information3.1 Measure of Center
3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects
More informationChapter 3. Data Description
Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.
More informationStatistical Methods: Introduction, Applications, Histograms, Ch
Outlines Statistical Methods: Introduction, Applications, Histograms, Characteristics November 4, 2004 Outlines Part I: Statistical Methods: Introduction and Applications Part II: Statistical Methods:
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More informationMeelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 03
Meelis Kull meelis.kull@ut.ee Autumn 2017 1 Demo: Data science mini-project CRISP-DM: cross-industrial standard process for data mining Data understanding: Types of data Data understanding: First look
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More information1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.
Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:
More informationDescriptive Univariate Statistics and Bivariate Correlation
ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to
More informationBig Data Analysis with Apache Spark UC#BERKELEY
Big Data Analysis with Apache Spark UC#BERKELEY This Lecture: Relation between Variables An association A trend» Positive association or Negative association A pattern» Could be any discernible shape»
More informationIntroduction to Matrix Algebra and the Multivariate Normal Distribution
Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Structural Equation Modeling Lecture #2 January 18, 2012 ERSH 8750: Lecture 2 Motivation for Learning the Multivariate
More informationSection 5.4. Ken Ueda
Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.
More informationAPPENDIX 1 BASIC STATISTICS. Summarizing Data
1 APPENDIX 1 Figure A1.1: Normal Distribution BASIC STATISTICS The problem that we face in financial analysis today is not having too little information but too much. Making sense of large and often contradictory
More informationBNG 495 Capstone Design. Descriptive Statistics
BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus
More information2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table
2.0 Lesson Plan Answer Questions 1 Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 2. Summary Statistics Given a collection of data, one needs to find representations
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value
More informationHistograms allow a visual interpretation
Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called
More informationMath 141. Quantile-Quantile Plots. Albyn Jones 1. jones/courses/ Library 304. Albyn Jones Math 141
Math 141 Quantile-Quantile Plots Albyn Jones 1 1 Library 304 jones@reed.edu www.people.reed.edu/ jones/courses/141 Outline: Quantile-Quantile Plots Quantiles and Order Statistics Quantile-Quantile Plots
More informationIntroduction to Statistics
Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,
More informationIntroduction to statistics
Introduction to statistics Literature Raj Jain: The Art of Computer Systems Performance Analysis, John Wiley Schickinger, Steger: Diskrete Strukturen Band 2, Springer David Lilja: Measuring Computer Performance:
More informationMA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.
MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at
More informationDensity Curves and the Normal Distributions. Histogram: 10 groups
Density Curves and the Normal Distributions MATH 2300 Chapter 6 Histogram: 10 groups 1 Histogram: 20 groups Histogram: 40 groups 2 Histogram: 80 groups Histogram: 160 groups 3 Density Curve Density Curves
More informationChapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation
Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationContinuous distributions
Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More informationSampling, Frequency Distributions, and Graphs (12.1)
1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which
More informationStatistical Concepts. Constructing a Trend Plot
Module 1: Review of Basic Statistical Concepts 1.2 Plotting Data, Measures of Central Tendency and Dispersion, and Correlation Constructing a Trend Plot A trend plot graphs the data against a variable
More information1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.
1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions
More informationSummary statistics. G.S. Questa, L. Trapani. MSc Induction - Summary statistics 1
Summary statistics 1. Visualize data 2. Mean, median, mode and percentiles, variance, standard deviation 3. Frequency distribution. Skewness 4. Covariance and correlation 5. Autocorrelation MSc Induction
More informationSTT 315 This lecture is based on Chapter 2 of the textbook.
STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their
More informationDescriptive Statistics-I. Dr Mahmoud Alhussami
Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.
More informationChapter 4.notebook. August 30, 2017
Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread
More informationIntroduction to Statistics for Traffic Crash Reconstruction
Introduction to Statistics for Traffic Crash Reconstruction Jeremy Daily Jackson Hole Scientific Investigations, Inc. c 2003 www.jhscientific.com Why Use and Learn Statistics? 1. We already do when ranging
More informationADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes
We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationVariables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010
Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Review Recording observations - Must extract that which is to be analyzed: coding systems,
More informationLecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:
Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots
More informationLecture 30. DATA 8 Summer Regression Inference
DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and
More informationSTP 420 INTRODUCTION TO APPLIED STATISTICS NOTES
INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make
More informationConfidence Intervals. - simply, an interval for which we have a certain confidence.
Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something
More informationDensity Curves & Normal Distributions
Density Curves & Normal Distributions Sections 4.1 & 4.2 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-2311 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationEssentials of Statistics and Probability
May 22, 2007 Department of Statistics, NC State University dbsharma@ncsu.edu SAMSI Undergrad Workshop Overview Practical Statistical Thinking Introduction Data and Distributions Variables and Distributions
More informationBusiness Statistics. Lecture 5: Confidence Intervals
Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean
More informationReview of the Normal Distribution
Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution
More informationInteractietechnologie
Interactietechnologie Statistical Evaluation Remco Veltkamp www.scalable learning.com Select " SIGN IN OR SIGN UP" Select "Use your School/University Account" search "Utrecht" and select "Utrecht University"
More informationChapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.
Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data
More informationSummarizing and Displaying Measurement Data/Understanding and Comparing Distributions
Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions Histograms, Mean, Median, Five-Number Summary and Boxplots, Standard Deviation Thought Questions 1. If you were to
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationMeasures of Central Tendency. Mean, Median, and Mode
Measures of Central Tendency Mean, Median, and Mode Population study The population under study is the 20 students in a class Imagine we ask the individuals in our population how many languages they speak.
More informationContinuous distributions
Continuous distributions In contrast to discrete random variables, like the Binomial distribution, in many situations the possible values of a random variable cannot be counted. For example, the measurement
More informationUnit 1: Statistical Analysis. IB Biology SL
Unit 1: Statistical Analysis IB Biology SL Statistics Why do we use statistics in Biology? o Biologists use the scientific method when performing experiments. o Scientists gather measurable data when performing
More informationLecture 2. Descriptive Statistics: Measures of Center
Lecture 2. Descriptive Statistics: Measures of Center Descriptive Statistics summarize or describe the important characteristics of a known set of data Inferential Statistics use sample data to make inferences
More informationUnit 2: Numerical Descriptive Measures
Unit 2: Numerical Descriptive Measures Summation Notation Measures of Central Tendency Measures of Dispersion Chebyshev's Rule Empirical Rule Measures of Relative Standing Box Plots z scores Jan 28 10:48
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationChapter 6 Group Activity - SOLUTIONS
Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing
More informationDescription of Samples and Populations
Description of Samples and Populations Random Variables Data are generated by some underlying random process or phenomenon. Any datum (data point) represents the outcome of a random variable. We represent
More informationProbability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology Kharagpur
Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology Kharagpur Lecture No. #13 Probability Distribution of Continuous RVs (Contd
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationUnits. Exploratory Data Analysis. Variables. Student Data
Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as
More informationSections 6.1 and 6.2: The Normal Distribution and its Applications
Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution
More informationIntroduction to Statistics
Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the
More informationSampling Populations limited in the scope enumerate
Sampling Populations Typically, when we collect data, we are somewhat limited in the scope of what information we can reasonably collect Ideally, we would enumerate each and every member of a population
More informationChapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67
Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationChapter 6. Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer
Chapter 6 Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer The aim of this chapter is to calculate confidence intervals for the maximum power consumption per customer
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationUnit 4 Probability. Dr Mahmoud Alhussami
Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from
More informationLecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari
DATA 8 Spring 2018 Lecture 27 Sample Averages Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Announcements Questions for This Week How can we quantify natural
More informationP8130: Biostatistical Methods I
P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data
More informationAverages How difficult is QM1? What is the average mark? Week 1b, Lecture 2
Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2 Topics: 1. Mean 2. Mode 3. Median 4. Order Statistics 5. Minimum, Maximum, Range 6. Percentiles, Quartiles, Interquartile Range
More informationPractitioner's Guide To Statistical Sampling: Part 1
Practitioner's Guide To Statistical Sampling: Part 1 By Brian Kriegler January 8, 2018, 11:11 AM EST Below is the first of four think pieces on statistical sampling in practice. I start by demonstrating
More informationStatistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O Halloran Sampling A. Basics 1. Ways to Describe Data Histograms Frequency Tables, etc. 2. Ways to Characterize
More informationChapter 1. Looking at Data
Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,
More informationMeasures of center. The mean The mean of a distribution is the arithmetic average of the observations:
Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number
More informationNumerical Methods Lecture 7 - Statistics, Probability and Reliability
Topics Numerical Methods Lecture 7 - Statistics, Probability and Reliability A summary of statistical analysis A summary of probability methods A summary of reliability analysis concepts Statistical Analysis
More informationChapter 11 Sampling Distribution. Stat 115
Chapter 11 Sampling Distribution Stat 115 1 Definition 11.1 : Random Sample (finite population) Suppose we select n distinct elements from a population consisting of N elements, using a particular probability
More informationLesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table
Lesson Plan Answer Questions Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 1 2. Summary Statistics Given a collection of data, one needs to find representations
More informationStatistics lecture 3. Bell-Shaped Curves and Other Shapes
Statistics lecture 3 Bell-Shaped Curves and Other Shapes Goals for lecture 3 Realize many measurements in nature follow a bell-shaped ( normal ) curve Understand and learn to compute a standardized score
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationEconometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.
1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationUnit Two Descriptive Biostatistics. Dr Mahmoud Alhussami
Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are
More informationEXPERIMENT 2 Reaction Time Objectives Theory
EXPERIMENT Reaction Time Objectives to make a series of measurements of your reaction time to make a histogram, or distribution curve, of your measured reaction times to calculate the "average" or mean
More informationVisualizing and summarizing data
Visualizing and summarizing data Ken Rice, Dept of Biostatistics HUBIO 530 January 2015 Q. What s your talk about? Today I will describe: How to visualize small datasets How to summarize small datasets
More information4.12 Sampling Distributions 183
4.12 Sampling Distributions 183 FIGURE 4.19 Sampling distribution for y Example 4.22 illustrates for a very small population that we could in fact enumerate every possible sample of size 2 selected from
More informationSection 3. Measures of Variation
Section 3 Measures of Variation Range Range = (maximum value) (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation. Sample Standard Deviation The
More informationLecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.
More informationa table or a graph or an equation.
Topic (8) POPULATION DISTRIBUTIONS 8-1 So far: Topic (8) POPULATION DISTRIBUTIONS We ve seen some ways to summarize a set of data, including numerical summaries. We ve heard a little about how to sample
More informationLecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning
More informationSlide 1. Slide 2. Slide 3. Pick a Brick. Daphne. 400 pts 200 pts 300 pts 500 pts 100 pts. 300 pts. 300 pts 400 pts 100 pts 400 pts.
Slide 1 Slide 2 Daphne Phillip Kathy Slide 3 Pick a Brick 100 pts 200 pts 500 pts 300 pts 400 pts 200 pts 300 pts 500 pts 100 pts 300 pts 400 pts 100 pts 400 pts 100 pts 200 pts 500 pts 100 pts 400 pts
More information