Statistics Terminology

Size: px
Start display at page:

Download "Statistics Terminology"

Transcription

1 <run>:\the\world Machine Learning Camp Day 1: Exploring Data with Excel Statistics Terminology Variance Variance is a measure of how spread out a set of data or numbers is from its mean. For a set of numbers x 1, x 2,..., x n, their variance is given by σ 2 x, where σx 2 = (x 1 x) 2 + (x 2 x) (x n x) 2, n and x is the mean of x 1, x 2,..., x n. Since each x i is some distance from the mean, x, we can think of variance as the average of these distances. Here, we measure this distance as always positive, giving greater weight to numbers further away from the mean. 1. Before calculating any variances, lets establish some intuition about variance. What interval of numbers can variance possibly range between? 2. Suppose that five darts are thrown at each dartboard below, so that the mean sample distance from the bullseye is zero. On the left dartboard, draw a distribution of darts with high variance, and on the right dartboard draw a distribution of darts with low variance. 3. Calculate the variance of {28, 29, 30, 31, 32}.

2 4. Calculate the variance of {10, 20, 30, 40, 50}. What do you observe about the variances of this set of numbers and the previous one? 5. In Excel, we can calculate variance using the command var.p. We will calculate the variances of the petal length and petal width variables of the iris data. Select the empty cell next to Variance, problem 5 in the Statistics sheet. Type in =var.p(. Now, navigate to the Iris sheet and click and drag over all the cells in one of the columns of data - for now, just use the data for Iris versicolor. The cell location should populate in the formula - you can see this in the formula bar above the sheet. Then close out the formula by typing ), and press enter. You can do the same for the other column of data to find its variance. 6. Using one of the datasets your explored before, calculate the variance of one of the measures. Does this data have higher or lower variance than the iris data? What can you conclude, if anything, from this?

3 Covariance Covariance is a measure of how two datasets vary with each other. For a set of numbers X = {x 1, x 2,..., x n }, and Y = {y 1, y 2,..., y n }, their covariance is given by cov(x, Y ) = (x 1 x)(y 1 ȳ) + (x 2 x)(y 2 ȳ) + + (x n x)(y n ȳ), n where x and ȳ are the means of X and Y, respectively. If we want to relate this back to the idea of variance, notice that if X = Y, then this precisely gives us the variance. Variance is a measure of how a dataset varies with itself, rather than with another set of data. 1. If large values of one set of data correspond with large values of another set of data, and similarly with small values, then covariance is positive. However, if large values of one dataset correspond with small values of another dataset, then covariance can be negative. What is the possible interval of values that the covariance of two samples of data can range between? 2. On the axes below, sketch a scatter plot of two datasets (one on the x axis and one on the y axis) with (a) positive covariance, (b) negative covariance, and (c) zero covariance. (a) (b) (c) 3. In Excel, we can calculate the covariance of two sets of data by typing =covariance.p( into the empty cell next to Covariance, problem 3. Then select one column of data, type a comma, and select a second column of data. For example, use Iris versicolor petal length and width. Close off the formula with a parenthesis and press enter. What do you observe? 4. Now, calculate the covariance of sepal length and width for Iris versicolor. Is anything different? What about petal width and sepal width for Iris setosa?

4 5. What are some possible drawbacks to using covariance as a statistical measure? To explore this question, consider the following two data sets: (a) What are the variances of A 1 and A 2? A A (b) Use Excel to generate two data sets, B 1 and B 2, so that A 1 = 2B 1 and A 2 = 2B 2. These two pairs of A i and B i have the same relationship to each other, so one might expect them to vary together identically. Calculate their covariance. (c) Why do you think the covariances are so different?

5 Correlation Correlation is a measure of how strongly related two samples of data are. For a set of numbers X = {x 1, x 2,..., x n }, and Y = {y 1, y 2,..., y n }, we can measure their correlation using the sample correlation coefficient, r, which is given by r = cov(x, Y ) (x 1 x)(y 1 ȳ) + (x 2 x)(y 2 ȳ) + + (x n x)(y n ȳ) = σ x σ y (x1 x) 2 + (x 2 x) (x n x) 2 (y 1 ȳ) 2 + (y 2 ȳ) (y n ȳ). 2 As before, x and ȳ are the means of the sets X and Y, respectively. Correlation is similar to covariance, but instead of describing how two data sets vary with each other using all real numbers, correlation normalizes the covariance, in a sense, by dividing the covariance of two data sets by the square root of the variance of each. If X = Y, then we are precisely dividing by the variance. 1. Correlation varies between 1 and 1. Using the definition given above, explain why. 2. The idea of correlation has frequent use in everyday life. List five ideas that should be highly correlated, and five ideas that should have low correlation, or none. Below is an example of each. Feel free to share ideas with your neighbor! High Correlation Your eye color and your relatives eye colors. Low Correlation A dog s name and the type of dog treats they prefer. 3. We can calculate correlation in Excel by typing =correl(, selecting two columns as with covariance, closing the formula, and pressing enter. Can you identify a pairing of the data sets in the iris data that has positive correlation? How about one with no correlation? (There isn t one with negative correlation!)

6 4. Think of the non-iris data you graphed earlier. Were there any pairs of variables with positive, negative, or no correlation? Go back and calculate the correlation for these variables. We ll have a contest to see who can find the variables with the highest correlation. 5. Warning: you may be familiar with the turn of phrase, Correlation does not imply causation. It is tempting, knowing that two data sets are correlated, to conclude that one may imply the other. Here s a funny example, courtesy of tylervigen.com: The correlation coefficient for these two data sets is r 0.952! Think of some examples of things that may be correlated, but a causal relationship would be absurd. Discuss with your neighbors! For example, what about a child s reading ability and shoe size? Kaitlin Hill (hillk@umn.edu)

Covariance and Correlation

Covariance and Correlation Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability

More information

1. The Basic X-Y Scatter Plot

1. The Basic X-Y Scatter Plot 1. The Basic X-Y Scatter Plot EXCEL offers a wide range of plots; however, this discussion will be restricted to generating XY scatter plots in various formats. The easiest way to begin is to highlight

More information

Math Boot Camp Functions and Algebra

Math Boot Camp Functions and Algebra Fall 017 Math Boot Camp Functions and Algebra FUNCTIONS Much of mathematics relies on functions, the pairing (relation) of one object (typically a real number) with another object (typically a real number).

More information

Statistical Concepts. Constructing a Trend Plot

Statistical Concepts. Constructing a Trend Plot Module 1: Review of Basic Statistical Concepts 1.2 Plotting Data, Measures of Central Tendency and Dispersion, and Correlation Constructing a Trend Plot A trend plot graphs the data against a variable

More information

Week 3: Simple Linear Regression

Week 3: Simple Linear Regression Week 3: Simple Linear Regression Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline

More information

2. The Sample Mean and the Law of Large Numbers

2. The Sample Mean and the Law of Large Numbers 1 of 11 7/16/2009 6:05 AM Virtual Laboratories > 6. Random Samples > 1 2 3 4 5 6 7 2. The Sample Mean and the Law of Large Numbers The Sample Mean Suppose that we have a basic random experiment and that

More information

Math Section SR MW 1-2:30pm. Bekki George: University of Houston. Sections

Math Section SR MW 1-2:30pm. Bekki George: University of Houston. Sections Math 3339 Section 21155 - SR 117 - MW 1-2:30pm Bekki George: bekki@math.uh.edu University of Houston Sections 2.1-2.3 Bekki George (UH) Math 3339 Sections 2.1-2.3 1 / 28 Office Hours: Mondays 11am - 12:30pm,

More information

Using Microsoft Excel

Using Microsoft Excel Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.

More information

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression. 10/3/011 Functional Connectivity Correlation and Regression Variance VAR = Standard deviation Standard deviation SD = Unbiased SD = 1 10/3/011 Standard error Confidence interval SE = CI = = t value for

More information

BIOSTATISTICS NURS 3324

BIOSTATISTICS NURS 3324 Simple Linear Regression and Correlation Introduction Previously, our attention has been focused on one variable which we designated by x. Frequently, it is desirable to learn something about the relationship

More information

Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height.

Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height. Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height. Income is associated with education. Functional relationships between

More information

Section 1.8 Matrices as Linear Transformations

Section 1.8 Matrices as Linear Transformations Section.8 Matrices as Linear Transformations Up to this point in the course, we have thought of matrices as stand alone constructions, objects of interest in their own right. We have learned multiple matrix

More information

Experiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement

Experiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement Experiment 0 ~ Introduction to Statistics and Excel Tutorial Many of you already went through the introduction to laboratory practice and excel tutorial in Physics 1011. For that reason, we aren t going

More information

Section 1.4 Circles. Objective #1: Writing the Equation of a Circle in Standard Form.

Section 1.4 Circles. Objective #1: Writing the Equation of a Circle in Standard Form. 1 Section 1. Circles Objective #1: Writing the Equation of a Circle in Standard Form. We begin by giving a definition of a circle: Definition: A Circle is the set of all points that are equidistant from

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Measurement: The Basics

Measurement: The Basics I. Introduction Measurement: The Basics Physics is first and foremost an experimental science, meaning that its accumulated body of knowledge is due to the meticulous experiments performed by teams of

More information

Contents. 13. Graphs of Trigonometric Functions 2 Example Example

Contents. 13. Graphs of Trigonometric Functions 2 Example Example Contents 13. Graphs of Trigonometric Functions 2 Example 13.19............................... 2 Example 13.22............................... 5 1 Peterson, Technical Mathematics, 3rd edition 2 Example 13.19

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Introduction to Linear Regression

Introduction to Linear Regression Introduction to Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Introduction to Linear Regression 1 / 46

More information

An Introduction to Multivariate Methods

An Introduction to Multivariate Methods Chapter 12 An Introduction to Multivariate Methods Multivariate statistical methods are used to display, analyze, and describe data on two or more features or variables simultaneously. I will discuss multivariate

More information

Lecture 16 : Independence, Covariance and Correlation of Discrete Random Variables

Lecture 16 : Independence, Covariance and Correlation of Discrete Random Variables Lecture 6 : Independence, Covariance and Correlation of Discrete Random Variables 0/ 3 Definition Two discrete random variables X and Y defined on the same sample space are said to be independent if for

More information

Review of probability and statistics 1 / 31

Review of probability and statistics 1 / 31 Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Naïve Bayes Introduction to Machine Learning. Matt Gormley Lecture 18 Oct. 31, 2018

Naïve Bayes Introduction to Machine Learning. Matt Gormley Lecture 18 Oct. 31, 2018 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Naïve Bayes Matt Gormley Lecture 18 Oct. 31, 2018 1 Reminders Homework 6: PAC Learning

More information

7. Be able to prove Rules in Section 7.3, using only the Kolmogorov axioms.

7. Be able to prove Rules in Section 7.3, using only the Kolmogorov axioms. Midterm Review Solutions for MATH 50 Solutions to the proof and example problems are below (in blue). In each of the example problems, the general principle is given in parentheses before the solution.

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

PRINCIPAL COMPONENTS ANALYSIS

PRINCIPAL COMPONENTS ANALYSIS PRINCIPAL COMPONENTS ANALYSIS Iris Data Let s find Principal Components using the iris dataset. This is a well known dataset, often used to demonstrate the effect of clustering algorithms. It contains

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

1-1. Expressions and Formulas. Lesson 1-1. What You ll Learn. Active Vocabulary

1-1. Expressions and Formulas. Lesson 1-1. What You ll Learn. Active Vocabulary 1-1 Expressions and Formulas What You ll Learn Skim the lesson. Write two things you already know about expressions and formulas. 1. Active Vocabulary 2. Review Vocabulary Identify the four grouping symbols

More information

Chapter 7: Sampling Distributions

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution? The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 7 Sampling Distributions 7.1 What is a Sampling

More information

The Electric Field. So far, we have Coulomb s Law that gives the electric force between two charges:

The Electric Field. So far, we have Coulomb s Law that gives the electric force between two charges: The Electric Field So far, we have Coulomb s Law that gives the electric force between two charges: Coulomb s Law is written in the language of Newton i.e. Forces, which is OK. However, as quickly as possible,

More information

Assignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran

Assignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran Assignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran 1. Let A m n be a matrix of real numbers. The matrix AA T has an eigenvector x with eigenvalue b. Then the eigenvector y of A T A

More information

Physics 212E Spring 2004 Classical and Modern Physics. Computer Exercise #2

Physics 212E Spring 2004 Classical and Modern Physics. Computer Exercise #2 Physics 212E Spring 2004 Classical and Modern Physics Chowdary Computer Exercise #2 Launch Mathematica by clicking on the Start menu (lower left hand corner of the screen); from there go up to Science

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

Creative Data Mining

Creative Data Mining Creative Data Mining Using ML algorithms in python Artem Chirkin Dr. Daniel Zünd Danielle Griego Lecture 7 0.04.207 /7 What we will cover today Outline Getting started Explore dataset content Inspect visually

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

Boyle s Law and Charles Law Activity

Boyle s Law and Charles Law Activity Boyle s Law and Charles Law Activity Introduction: This simulation helps you to help you fully understand 2 Gas Laws: Boyle s Law and Charles Law. These laws are very simple to understand, but are also

More information

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES VARIABLE Studying the behavior of random variables, and more importantly functions of random variables is essential for both the

More information

Introduction to Computer Tools and Uncertainties

Introduction to Computer Tools and Uncertainties Experiment 1 Introduction to Computer Tools and Uncertainties 1.1 Objectives To become familiar with the computer programs and utilities that will be used throughout the semester. To become familiar with

More information

Lab 2. Electric Fields and Potentials

Lab 2. Electric Fields and Potentials Physics 2020, Fall 2005 Lab 2 page 1 of 8 Lab 2. Electric Fields and Potentials INTRODUCTION: In class we have learned about electric charges and the electrostatic force that charges exert on each other.

More information

Reteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class

Reteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class -3 Graphing Linear Functions Use intercepts to sketch the graph of the function 3x 6y 1. The x-intercept is where the graph crosses the x-axis. To find the x-intercept, set y 0 and solve for x. 3x 6y 1

More information

LAB 05B: Friction 2 times PURPOSE BACKGROUND MATERIALS PRELIMINARY QUESTIONS: (TO DO BEFORE THE LAB!!) Lab 05B: Friction 1/5 Mr.

LAB 05B: Friction 2 times PURPOSE BACKGROUND MATERIALS PRELIMINARY QUESTIONS: (TO DO BEFORE THE LAB!!) Lab 05B: Friction 1/5 Mr. LAB 05B: Friction 2 times PURPOSE To investigate how friction is related to other variable such as the normal force, weight, and surface coefficients. In this experiment we will determine the static and

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

Chapter 7: Sampling Distributions

Chapter 7: Sampling Distributions Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution? The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 7 Sampling Distributions 7.1 What is a Sampling

More information

Causality in Econometrics (3)

Causality in Econometrics (3) Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena

More information

Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time.

Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time. Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time. Simplifying the calculation of variance Notice that we can rewrite the calculation of a

More information

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression. WISE ANOVA and Regression Lab Introduction to the WISE Correlation/Regression and ANOVA Applet This module focuses on the logic of ANOVA with special attention given to variance components and the relationship

More information

Free-Body Diagrams: Introduction

Free-Body Diagrams: Introduction Free-Body Diagrams: Introduction Learning Goal: To learn to draw free-body diagrams for various real-life situations. Imagine that you are given a description of a real-life situation and are asked to

More information

STA 2201/442 Assignment 2

STA 2201/442 Assignment 2 STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution

More information

Unsupervised Learning. k-means Algorithm

Unsupervised Learning. k-means Algorithm Unsupervised Learning Supervised Learning: Learn to predict y from x from examples of (x, y). Performance is measured by error rate. Unsupervised Learning: Learn a representation from exs. of x. Learn

More information

22 Approximations - the method of least squares (1)

22 Approximations - the method of least squares (1) 22 Approximations - the method of least squares () Suppose that for some y, the equation Ax = y has no solutions It may happpen that this is an important problem and we can t just forget about it If we

More information

371 Lab Rybolt Data Analysis Assignment Name

371 Lab Rybolt Data Analysis Assignment Name Data Analysis Assignment 1 371 Lab Rybolt Data Analysis Assignment Name You wake up one morning and feel you may have a fever. You have an oral thermometer marked in Celsius degrees and find your temperature

More information

Prelab: Complete the prelab section BEFORE class Purpose:

Prelab: Complete the prelab section BEFORE class Purpose: Lab: Projectile Motion Prelab: Complete the prelab section BEFORE class Purpose: What is the Relationship between and for the situation of a cannon ball shot off a with an angle of from the horizontal.

More information

Variance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18

Variance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18 Variance reduction p. 1/18 Variance reduction Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Variance reduction p. 2/18 Example Use simulation to compute I = 1 0 e x dx We

More information

Mean, Mode, Median and Range. I know how to calculate the mean, mode, median and range.

Mean, Mode, Median and Range. I know how to calculate the mean, mode, median and range. Starter 1) The small box has a volume of 24cm. The boxes are similar. Find the new volume of the box. 2 3cm 9cm 2 2) What is the turning point of the graph y = (x - 9) + 4 Today we are revising... How

More information

Jointly Distributed Variables

Jointly Distributed Variables Jointly Distributed Variables Sec 2.6, 9.1 & 9.2 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 7-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Classification Methods II: Linear and Quadratic Discrimminant Analysis

Classification Methods II: Linear and Quadratic Discrimminant Analysis Classification Methods II: Linear and Quadratic Discrimminant Analysis Rebecca C. Steorts, Duke University STA 325, Chapter 4 ISL Agenda Linear Discrimminant Analysis (LDA) Classification Recall that linear

More information

EXPERIMENT 2 Reaction Time Objectives Theory

EXPERIMENT 2 Reaction Time Objectives Theory EXPERIMENT Reaction Time Objectives to make a series of measurements of your reaction time to make a histogram, or distribution curve, of your measured reaction times to calculate the "average" or mean

More information

Econ 371 Problem Set #1 Answer Sheet

Econ 371 Problem Set #1 Answer Sheet Econ 371 Problem Set #1 Answer Sheet 2.1 In this question, you are asked to consider the random variable Y, which denotes the number of heads that occur when two coins are tossed. a. The first part of

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Chapter 5. Piece of Wisdom #2: A statistician drowned crossing a stream with an average depth of 6 inches. (Anonymous)

Chapter 5. Piece of Wisdom #2: A statistician drowned crossing a stream with an average depth of 6 inches. (Anonymous) Chapter 5 Deviating from the Average In This Chapter What variation is all about Variance and standard deviation Excel worksheet functions that calculate variation Workarounds for missing worksheet functions

More information

Political Science 6000: Beginnings and Mini Math Boot Camp

Political Science 6000: Beginnings and Mini Math Boot Camp Political Science 6000: Beginnings and Mini Math Boot Camp January 20, 2010 First things first Syllabus This is the most important course you will take. 1. You need to understand these concepts in order

More information

Shifting Reactions B

Shifting Reactions B Shifting Reactions B Name Lab Section Log on to the Internet. Type the following address into the location-input line of your browser: http://introchem.chem.okstate.edu/dcicla/ergbn.htm This will load

More information

Making an H-R diagram Earth & Sky

Making an H-R diagram Earth & Sky Making an H-R diagram Earth & Sky Name: Introduction Astronomers have discovered relationships between the surface temperatures and luminosities (brightnesses) of stars. These relationships are often presented

More information

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section 2.1.1 and 8.1-8.2.6 Overview Scatterplots Explanatory and Response Variables Describing Association The Regression Equation

More information

Data Mining and Analysis

Data Mining and Analysis 978--5-766- - Data Mining and Analysis: Fundamental Concepts and Algorithms CHAPTER Data Mining and Analysis Data mining is the process of discovering insightful, interesting, and novel patterns, as well

More information

Math Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class

Math Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class Matter & Motion Winter 2017 18 Name: Math Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class Goals: 1. Gain exposure to terminology and

More information

AP Statistics. Chapter 9 Re-Expressing data: Get it Straight

AP Statistics. Chapter 9 Re-Expressing data: Get it Straight AP Statistics Chapter 9 Re-Expressing data: Get it Straight Objectives: Re-expression of data Ladder of powers Straight to the Point We cannot use a linear model unless the relationship between the two

More information

Section 20: Arrow Diagrams on the Integers

Section 20: Arrow Diagrams on the Integers Section 0: Arrow Diagrams on the Integers Most of the material we have discussed so far concerns the idea and representations of functions. A function is a relationship between a set of inputs (the leave

More information

PHY 111L Activity 2 Introduction to Kinematics

PHY 111L Activity 2 Introduction to Kinematics PHY 111L Activity 2 Introduction to Kinematics Name: Section: ID #: Date: Lab Partners: TA initials: Objectives 1. Introduce the relationship between position, velocity, and acceleration 2. Investigate

More information

Terminology for Statistical Data

Terminology for Statistical Data Terminology for Statistical Data variables - features - attributes observations - cases (consist of multiple values) In a standard data matrix, variables or features correspond to columns observations

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Galactic Rotation Activity*

Galactic Rotation Activity* Galactic Rotation Activity* Neutral hydrogen atoms (H I) consist of a single proton and a single electron. The electron and proton can spin in the same direction (parallel) or in the opposite direction

More information

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc. Section 13.7 Linear Correlation and Regression What You Will Learn Linear Correlation Scatter Diagram Linear Regression Least Squares Line 13.7-2 Linear Correlation Linear correlation is used to determine

More information

Expectations and Variance

Expectations and Variance 4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance

More information

Quantitative Understanding in Biology Short Course Session 9 Principal Components Analysis

Quantitative Understanding in Biology Short Course Session 9 Principal Components Analysis Quantitative Understanding in Biology Short Course Session 9 Principal Components Analysis Jinhyun Ju Jason Banfelder Luce Skrabanek June 21st, 218 1 Preface For the last session in this course, we ll

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Student Exploration: Cell Types

Student Exploration: Cell Types 3. Names: Date: Student Exploration: Cell Types Prior Knowledge Questions (Do these BEFORE using the Gizmo.) 1. What are you supposed to be learning in this activity? 2. Why do you think learning this

More information

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be

More information

Expectations and Variance

Expectations and Variance 4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance

More information

MLE/MAP + Naïve Bayes

MLE/MAP + Naïve Bayes 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University MLE/MAP + Naïve Bayes Matt Gormley Lecture 19 March 20, 2018 1 Midterm Exam Reminders

More information

Statistical Analysis of Data

Statistical Analysis of Data Statistical Analysis of Data Goals and Introduction In this experiment, we will work to calculate, understand, and evaluate statistical measures of a set of data. In most cases, when taking data, there

More information

MA 1128: Lecture 08 03/02/2018. Linear Equations from Graphs And Linear Inequalities

MA 1128: Lecture 08 03/02/2018. Linear Equations from Graphs And Linear Inequalities MA 1128: Lecture 08 03/02/2018 Linear Equations from Graphs And Linear Inequalities Linear Equations from Graphs Given a line, we would like to be able to come up with an equation for it. I ll go over

More information

Harmonic Motion. Mass on a Spring. Physics 231: General Physics I Lab 6 Mar. 11, Goals:

Harmonic Motion. Mass on a Spring. Physics 231: General Physics I Lab 6 Mar. 11, Goals: Physics 231: General Physics I Lab 6 Mar. 11, 2004 Names: Harmonic Motion Goals: 1. To learn about the basic characteristics of periodic motion period, frequency, and amplitude 2. To study what affects

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Standard Error & Sample Means

Standard Error & Sample Means Math Objectives Students will recognize that samples have smaller variability than the population. Students will recognize that the variability in samples is a function of sample size, n, and that the

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

LEC 4: Discriminant Analysis for Classification

LEC 4: Discriminant Analysis for Classification LEC 4: Discriminant Analysis for Classification Dr. Guangliang Chen February 25, 2016 Outline Last time: FDA (dimensionality reduction) Today: QDA/LDA (classification) Naive Bayes classifiers Matlab/Python

More information

Physics 201 Lab 2 Air Drag Simulation

Physics 201 Lab 2 Air Drag Simulation Physics 201 Lab 2 Air Drag Simulation Jan 28, 2013 Equipment Initial Set Up Type the data from Table 1 into the appropriate cells. By preceding the content of the cell with an equal sign (as in cell A6)

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Fundamentals of Measurement and Error Analysis

Fundamentals of Measurement and Error Analysis Lab 1 Fundamentals of Measurement and Error Analysis 1.1 Overview This first laboratory exercise introduces key concepts and statistical and plotting tools that are used throughout the entire sequence

More information

Probability. Hosung Sohn

Probability. Hosung Sohn Probability Hosung Sohn Department of Public Administration and International Affairs Maxwell School of Citizenship and Public Affairs Syracuse University Lecture Slide 4-3 (October 8, 2015) 1/ 43 Table

More information

University of Cambridge Engineering Part IIB Module 4F10: Statistical Pattern Processing Handout 2: Multivariate Gaussians

University of Cambridge Engineering Part IIB Module 4F10: Statistical Pattern Processing Handout 2: Multivariate Gaussians Engineering Part IIB: Module F Statistical Pattern Processing University of Cambridge Engineering Part IIB Module F: Statistical Pattern Processing Handout : Multivariate Gaussians. Generative Model Decision

More information

Data Mining and Analysis: Fundamental Concepts and Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA Department

More information

Using Scientific Measurements

Using Scientific Measurements Section 3 Main Ideas Accuracy is different from precision. Significant figures are those measured precisely, plus one estimated digit. Scientific notation is used to express very large or very small numbers.

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Purpose: Materials: WARNING! Section: Partner 2: Partner 1:

Purpose: Materials: WARNING! Section: Partner 2: Partner 1: Partner 1: Partner 2: Section: PLEASE NOTE: You will need this particular lab report later in the semester again for the homework of the Rolling Motion Experiment. When you get back this graded report,

More information

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe!

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! http://www.xkcd.com/552/ Significant Concepts We structure the presentation and processing of data to

More information