Statistics Terminology
|
|
- Percival Campbell
- 5 years ago
- Views:
Transcription
1 <run>:\the\world Machine Learning Camp Day 1: Exploring Data with Excel Statistics Terminology Variance Variance is a measure of how spread out a set of data or numbers is from its mean. For a set of numbers x 1, x 2,..., x n, their variance is given by σ 2 x, where σx 2 = (x 1 x) 2 + (x 2 x) (x n x) 2, n and x is the mean of x 1, x 2,..., x n. Since each x i is some distance from the mean, x, we can think of variance as the average of these distances. Here, we measure this distance as always positive, giving greater weight to numbers further away from the mean. 1. Before calculating any variances, lets establish some intuition about variance. What interval of numbers can variance possibly range between? 2. Suppose that five darts are thrown at each dartboard below, so that the mean sample distance from the bullseye is zero. On the left dartboard, draw a distribution of darts with high variance, and on the right dartboard draw a distribution of darts with low variance. 3. Calculate the variance of {28, 29, 30, 31, 32}.
2 4. Calculate the variance of {10, 20, 30, 40, 50}. What do you observe about the variances of this set of numbers and the previous one? 5. In Excel, we can calculate variance using the command var.p. We will calculate the variances of the petal length and petal width variables of the iris data. Select the empty cell next to Variance, problem 5 in the Statistics sheet. Type in =var.p(. Now, navigate to the Iris sheet and click and drag over all the cells in one of the columns of data - for now, just use the data for Iris versicolor. The cell location should populate in the formula - you can see this in the formula bar above the sheet. Then close out the formula by typing ), and press enter. You can do the same for the other column of data to find its variance. 6. Using one of the datasets your explored before, calculate the variance of one of the measures. Does this data have higher or lower variance than the iris data? What can you conclude, if anything, from this?
3 Covariance Covariance is a measure of how two datasets vary with each other. For a set of numbers X = {x 1, x 2,..., x n }, and Y = {y 1, y 2,..., y n }, their covariance is given by cov(x, Y ) = (x 1 x)(y 1 ȳ) + (x 2 x)(y 2 ȳ) + + (x n x)(y n ȳ), n where x and ȳ are the means of X and Y, respectively. If we want to relate this back to the idea of variance, notice that if X = Y, then this precisely gives us the variance. Variance is a measure of how a dataset varies with itself, rather than with another set of data. 1. If large values of one set of data correspond with large values of another set of data, and similarly with small values, then covariance is positive. However, if large values of one dataset correspond with small values of another dataset, then covariance can be negative. What is the possible interval of values that the covariance of two samples of data can range between? 2. On the axes below, sketch a scatter plot of two datasets (one on the x axis and one on the y axis) with (a) positive covariance, (b) negative covariance, and (c) zero covariance. (a) (b) (c) 3. In Excel, we can calculate the covariance of two sets of data by typing =covariance.p( into the empty cell next to Covariance, problem 3. Then select one column of data, type a comma, and select a second column of data. For example, use Iris versicolor petal length and width. Close off the formula with a parenthesis and press enter. What do you observe? 4. Now, calculate the covariance of sepal length and width for Iris versicolor. Is anything different? What about petal width and sepal width for Iris setosa?
4 5. What are some possible drawbacks to using covariance as a statistical measure? To explore this question, consider the following two data sets: (a) What are the variances of A 1 and A 2? A A (b) Use Excel to generate two data sets, B 1 and B 2, so that A 1 = 2B 1 and A 2 = 2B 2. These two pairs of A i and B i have the same relationship to each other, so one might expect them to vary together identically. Calculate their covariance. (c) Why do you think the covariances are so different?
5 Correlation Correlation is a measure of how strongly related two samples of data are. For a set of numbers X = {x 1, x 2,..., x n }, and Y = {y 1, y 2,..., y n }, we can measure their correlation using the sample correlation coefficient, r, which is given by r = cov(x, Y ) (x 1 x)(y 1 ȳ) + (x 2 x)(y 2 ȳ) + + (x n x)(y n ȳ) = σ x σ y (x1 x) 2 + (x 2 x) (x n x) 2 (y 1 ȳ) 2 + (y 2 ȳ) (y n ȳ). 2 As before, x and ȳ are the means of the sets X and Y, respectively. Correlation is similar to covariance, but instead of describing how two data sets vary with each other using all real numbers, correlation normalizes the covariance, in a sense, by dividing the covariance of two data sets by the square root of the variance of each. If X = Y, then we are precisely dividing by the variance. 1. Correlation varies between 1 and 1. Using the definition given above, explain why. 2. The idea of correlation has frequent use in everyday life. List five ideas that should be highly correlated, and five ideas that should have low correlation, or none. Below is an example of each. Feel free to share ideas with your neighbor! High Correlation Your eye color and your relatives eye colors. Low Correlation A dog s name and the type of dog treats they prefer. 3. We can calculate correlation in Excel by typing =correl(, selecting two columns as with covariance, closing the formula, and pressing enter. Can you identify a pairing of the data sets in the iris data that has positive correlation? How about one with no correlation? (There isn t one with negative correlation!)
6 4. Think of the non-iris data you graphed earlier. Were there any pairs of variables with positive, negative, or no correlation? Go back and calculate the correlation for these variables. We ll have a contest to see who can find the variables with the highest correlation. 5. Warning: you may be familiar with the turn of phrase, Correlation does not imply causation. It is tempting, knowing that two data sets are correlated, to conclude that one may imply the other. Here s a funny example, courtesy of tylervigen.com: The correlation coefficient for these two data sets is r 0.952! Think of some examples of things that may be correlated, but a causal relationship would be absurd. Discuss with your neighbors! For example, what about a child s reading ability and shoe size? Kaitlin Hill (hillk@umn.edu)
Covariance and Correlation
Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability
More information1. The Basic X-Y Scatter Plot
1. The Basic X-Y Scatter Plot EXCEL offers a wide range of plots; however, this discussion will be restricted to generating XY scatter plots in various formats. The easiest way to begin is to highlight
More informationMath Boot Camp Functions and Algebra
Fall 017 Math Boot Camp Functions and Algebra FUNCTIONS Much of mathematics relies on functions, the pairing (relation) of one object (typically a real number) with another object (typically a real number).
More informationStatistical Concepts. Constructing a Trend Plot
Module 1: Review of Basic Statistical Concepts 1.2 Plotting Data, Measures of Central Tendency and Dispersion, and Correlation Constructing a Trend Plot A trend plot graphs the data against a variable
More informationWeek 3: Simple Linear Regression
Week 3: Simple Linear Regression Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline
More information2. The Sample Mean and the Law of Large Numbers
1 of 11 7/16/2009 6:05 AM Virtual Laboratories > 6. Random Samples > 1 2 3 4 5 6 7 2. The Sample Mean and the Law of Large Numbers The Sample Mean Suppose that we have a basic random experiment and that
More informationMath Section SR MW 1-2:30pm. Bekki George: University of Houston. Sections
Math 3339 Section 21155 - SR 117 - MW 1-2:30pm Bekki George: bekki@math.uh.edu University of Houston Sections 2.1-2.3 Bekki George (UH) Math 3339 Sections 2.1-2.3 1 / 28 Office Hours: Mondays 11am - 12:30pm,
More informationUsing Microsoft Excel
Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.
More informationVariance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.
10/3/011 Functional Connectivity Correlation and Regression Variance VAR = Standard deviation Standard deviation SD = Unbiased SD = 1 10/3/011 Standard error Confidence interval SE = CI = = t value for
More informationBIOSTATISTICS NURS 3324
Simple Linear Regression and Correlation Introduction Previously, our attention has been focused on one variable which we designated by x. Frequently, it is desirable to learn something about the relationship
More informationRelationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height.
Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height. Income is associated with education. Functional relationships between
More informationSection 1.8 Matrices as Linear Transformations
Section.8 Matrices as Linear Transformations Up to this point in the course, we have thought of matrices as stand alone constructions, objects of interest in their own right. We have learned multiple matrix
More informationExperiment 0 ~ Introduction to Statistics and Excel Tutorial. Introduction to Statistics, Error and Measurement
Experiment 0 ~ Introduction to Statistics and Excel Tutorial Many of you already went through the introduction to laboratory practice and excel tutorial in Physics 1011. For that reason, we aren t going
More informationSection 1.4 Circles. Objective #1: Writing the Equation of a Circle in Standard Form.
1 Section 1. Circles Objective #1: Writing the Equation of a Circle in Standard Form. We begin by giving a definition of a circle: Definition: A Circle is the set of all points that are equidistant from
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More informationMeasurement: The Basics
I. Introduction Measurement: The Basics Physics is first and foremost an experimental science, meaning that its accumulated body of knowledge is due to the meticulous experiments performed by teams of
More informationContents. 13. Graphs of Trigonometric Functions 2 Example Example
Contents 13. Graphs of Trigonometric Functions 2 Example 13.19............................... 2 Example 13.22............................... 5 1 Peterson, Technical Mathematics, 3rd edition 2 Example 13.19
More informationUnit 2. Describing Data: Numerical
Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient
More informationIntroduction to Linear Regression
Introduction to Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Introduction to Linear Regression 1 / 46
More informationAn Introduction to Multivariate Methods
Chapter 12 An Introduction to Multivariate Methods Multivariate statistical methods are used to display, analyze, and describe data on two or more features or variables simultaneously. I will discuss multivariate
More informationLecture 16 : Independence, Covariance and Correlation of Discrete Random Variables
Lecture 6 : Independence, Covariance and Correlation of Discrete Random Variables 0/ 3 Definition Two discrete random variables X and Y defined on the same sample space are said to be independent if for
More informationReview of probability and statistics 1 / 31
Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationNaïve Bayes Introduction to Machine Learning. Matt Gormley Lecture 18 Oct. 31, 2018
10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Naïve Bayes Matt Gormley Lecture 18 Oct. 31, 2018 1 Reminders Homework 6: PAC Learning
More information7. Be able to prove Rules in Section 7.3, using only the Kolmogorov axioms.
Midterm Review Solutions for MATH 50 Solutions to the proof and example problems are below (in blue). In each of the example problems, the general principle is given in parentheses before the solution.
More informationWISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet
WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.
More informationPRINCIPAL COMPONENTS ANALYSIS
PRINCIPAL COMPONENTS ANALYSIS Iris Data Let s find Principal Components using the iris dataset. This is a well known dataset, often used to demonstrate the effect of clustering algorithms. It contains
More informationAP Statistics. Chapter 6 Scatterplots, Association, and Correlation
AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables
More information1-1. Expressions and Formulas. Lesson 1-1. What You ll Learn. Active Vocabulary
1-1 Expressions and Formulas What You ll Learn Skim the lesson. Write two things you already know about expressions and formulas. 1. Active Vocabulary 2. Review Vocabulary Identify the four grouping symbols
More informationChapter 7: Sampling Distributions
Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution? The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 7 Sampling Distributions 7.1 What is a Sampling
More informationThe Electric Field. So far, we have Coulomb s Law that gives the electric force between two charges:
The Electric Field So far, we have Coulomb s Law that gives the electric force between two charges: Coulomb s Law is written in the language of Newton i.e. Forces, which is OK. However, as quickly as possible,
More informationAssignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran
Assignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran 1. Let A m n be a matrix of real numbers. The matrix AA T has an eigenvector x with eigenvalue b. Then the eigenvector y of A T A
More informationPhysics 212E Spring 2004 Classical and Modern Physics. Computer Exercise #2
Physics 212E Spring 2004 Classical and Modern Physics Chowdary Computer Exercise #2 Launch Mathematica by clicking on the Start menu (lower left hand corner of the screen); from there go up to Science
More informationProbability Distributions
CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas
More informationCreative Data Mining
Creative Data Mining Using ML algorithms in python Artem Chirkin Dr. Daniel Zünd Danielle Griego Lecture 7 0.04.207 /7 What we will cover today Outline Getting started Explore dataset content Inspect visually
More informationAIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)
AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will
More informationBoyle s Law and Charles Law Activity
Boyle s Law and Charles Law Activity Introduction: This simulation helps you to help you fully understand 2 Gas Laws: Boyle s Law and Charles Law. These laws are very simple to understand, but are also
More informationIAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES
IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES VARIABLE Studying the behavior of random variables, and more importantly functions of random variables is essential for both the
More informationIntroduction to Computer Tools and Uncertainties
Experiment 1 Introduction to Computer Tools and Uncertainties 1.1 Objectives To become familiar with the computer programs and utilities that will be used throughout the semester. To become familiar with
More informationLab 2. Electric Fields and Potentials
Physics 2020, Fall 2005 Lab 2 page 1 of 8 Lab 2. Electric Fields and Potentials INTRODUCTION: In class we have learned about electric charges and the electrostatic force that charges exert on each other.
More informationReteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class
-3 Graphing Linear Functions Use intercepts to sketch the graph of the function 3x 6y 1. The x-intercept is where the graph crosses the x-axis. To find the x-intercept, set y 0 and solve for x. 3x 6y 1
More informationLAB 05B: Friction 2 times PURPOSE BACKGROUND MATERIALS PRELIMINARY QUESTIONS: (TO DO BEFORE THE LAB!!) Lab 05B: Friction 1/5 Mr.
LAB 05B: Friction 2 times PURPOSE To investigate how friction is related to other variable such as the normal force, weight, and surface coefficients. In this experiment we will determine the static and
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,
More informationChapter 7: Sampling Distributions
Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution? The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 7 Sampling Distributions 7.1 What is a Sampling
More informationCausality in Econometrics (3)
Graphical Causal Models References Causality in Econometrics (3) Alessio Moneta Max Planck Institute of Economics Jena moneta@econ.mpg.de 26 April 2011 GSBC Lecture Friedrich-Schiller-Universität Jena
More informationIntroduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time.
Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time. Simplifying the calculation of variance Notice that we can rewrite the calculation of a
More informationThis module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.
WISE ANOVA and Regression Lab Introduction to the WISE Correlation/Regression and ANOVA Applet This module focuses on the logic of ANOVA with special attention given to variance components and the relationship
More informationFree-Body Diagrams: Introduction
Free-Body Diagrams: Introduction Learning Goal: To learn to draw free-body diagrams for various real-life situations. Imagine that you are given a description of a real-life situation and are asked to
More informationSTA 2201/442 Assignment 2
STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution
More informationUnsupervised Learning. k-means Algorithm
Unsupervised Learning Supervised Learning: Learn to predict y from x from examples of (x, y). Performance is measured by error rate. Unsupervised Learning: Learn a representation from exs. of x. Learn
More information22 Approximations - the method of least squares (1)
22 Approximations - the method of least squares () Suppose that for some y, the equation Ax = y has no solutions It may happpen that this is an important problem and we can t just forget about it If we
More information371 Lab Rybolt Data Analysis Assignment Name
Data Analysis Assignment 1 371 Lab Rybolt Data Analysis Assignment Name You wake up one morning and feel you may have a fever. You have an oral thermometer marked in Celsius degrees and find your temperature
More informationPrelab: Complete the prelab section BEFORE class Purpose:
Lab: Projectile Motion Prelab: Complete the prelab section BEFORE class Purpose: What is the Relationship between and for the situation of a cannon ball shot off a with an angle of from the horizontal.
More informationVariance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18
Variance reduction p. 1/18 Variance reduction Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Variance reduction p. 2/18 Example Use simulation to compute I = 1 0 e x dx We
More informationMean, Mode, Median and Range. I know how to calculate the mean, mode, median and range.
Starter 1) The small box has a volume of 24cm. The boxes are similar. Find the new volume of the box. 2 3cm 9cm 2 2) What is the turning point of the graph y = (x - 9) + 4 Today we are revising... How
More informationJointly Distributed Variables
Jointly Distributed Variables Sec 2.6, 9.1 & 9.2 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 7-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationClassification Methods II: Linear and Quadratic Discrimminant Analysis
Classification Methods II: Linear and Quadratic Discrimminant Analysis Rebecca C. Steorts, Duke University STA 325, Chapter 4 ISL Agenda Linear Discrimminant Analysis (LDA) Classification Recall that linear
More informationEXPERIMENT 2 Reaction Time Objectives Theory
EXPERIMENT Reaction Time Objectives to make a series of measurements of your reaction time to make a histogram, or distribution curve, of your measured reaction times to calculate the "average" or mean
More informationEcon 371 Problem Set #1 Answer Sheet
Econ 371 Problem Set #1 Answer Sheet 2.1 In this question, you are asked to consider the random variable Y, which denotes the number of heads that occur when two coins are tossed. a. The first part of
More informationBusiness Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationChapter 5. Piece of Wisdom #2: A statistician drowned crossing a stream with an average depth of 6 inches. (Anonymous)
Chapter 5 Deviating from the Average In This Chapter What variation is all about Variance and standard deviation Excel worksheet functions that calculate variation Workarounds for missing worksheet functions
More informationPolitical Science 6000: Beginnings and Mini Math Boot Camp
Political Science 6000: Beginnings and Mini Math Boot Camp January 20, 2010 First things first Syllabus This is the most important course you will take. 1. You need to understand these concepts in order
More informationShifting Reactions B
Shifting Reactions B Name Lab Section Log on to the Internet. Type the following address into the location-input line of your browser: http://introchem.chem.okstate.edu/dcicla/ergbn.htm This will load
More informationMaking an H-R diagram Earth & Sky
Making an H-R diagram Earth & Sky Name: Introduction Astronomers have discovered relationships between the surface temperatures and luminosities (brightnesses) of stars. These relationships are often presented
More informationMath 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and
Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section 2.1.1 and 8.1-8.2.6 Overview Scatterplots Explanatory and Response Variables Describing Association The Regression Equation
More informationData Mining and Analysis
978--5-766- - Data Mining and Analysis: Fundamental Concepts and Algorithms CHAPTER Data Mining and Analysis Data mining is the process of discovering insightful, interesting, and novel patterns, as well
More informationMath Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class
Matter & Motion Winter 2017 18 Name: Math Lab 10: Differential Equations and Direction Fields Complete before class Wed. Feb. 28; Due noon Thu. Mar. 1 in class Goals: 1. Gain exposure to terminology and
More informationAP Statistics. Chapter 9 Re-Expressing data: Get it Straight
AP Statistics Chapter 9 Re-Expressing data: Get it Straight Objectives: Re-expression of data Ladder of powers Straight to the Point We cannot use a linear model unless the relationship between the two
More informationSection 20: Arrow Diagrams on the Integers
Section 0: Arrow Diagrams on the Integers Most of the material we have discussed so far concerns the idea and representations of functions. A function is a relationship between a set of inputs (the leave
More informationPHY 111L Activity 2 Introduction to Kinematics
PHY 111L Activity 2 Introduction to Kinematics Name: Section: ID #: Date: Lab Partners: TA initials: Objectives 1. Introduce the relationship between position, velocity, and acceleration 2. Investigate
More informationTerminology for Statistical Data
Terminology for Statistical Data variables - features - attributes observations - cases (consist of multiple values) In a standard data matrix, variables or features correspond to columns observations
More informationData Analysis and Statistical Methods Statistics 651
y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent
More informationGalactic Rotation Activity*
Galactic Rotation Activity* Neutral hydrogen atoms (H I) consist of a single proton and a single electron. The electron and proton can spin in the same direction (parallel) or in the opposite direction
More informationSection Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc.
Section 13.7 Linear Correlation and Regression What You Will Learn Linear Correlation Scatter Diagram Linear Regression Least Squares Line 13.7-2 Linear Correlation Linear correlation is used to determine
More informationExpectations and Variance
4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance
More informationQuantitative Understanding in Biology Short Course Session 9 Principal Components Analysis
Quantitative Understanding in Biology Short Course Session 9 Principal Components Analysis Jinhyun Ju Jason Banfelder Luce Skrabanek June 21st, 218 1 Preface For the last session in this course, we ll
More informationChapter 27 Summary Inferences for Regression
Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test
More informationStudent Exploration: Cell Types
3. Names: Date: Student Exploration: Cell Types Prior Knowledge Questions (Do these BEFORE using the Gizmo.) 1. What are you supposed to be learning in this activity? 2. Why do you think learning this
More informationAlgebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)
Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be
More informationExpectations and Variance
4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance
More informationMLE/MAP + Naïve Bayes
10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University MLE/MAP + Naïve Bayes Matt Gormley Lecture 19 March 20, 2018 1 Midterm Exam Reminders
More informationStatistical Analysis of Data
Statistical Analysis of Data Goals and Introduction In this experiment, we will work to calculate, understand, and evaluate statistical measures of a set of data. In most cases, when taking data, there
More informationMA 1128: Lecture 08 03/02/2018. Linear Equations from Graphs And Linear Inequalities
MA 1128: Lecture 08 03/02/2018 Linear Equations from Graphs And Linear Inequalities Linear Equations from Graphs Given a line, we would like to be able to come up with an equation for it. I ll go over
More informationHarmonic Motion. Mass on a Spring. Physics 231: General Physics I Lab 6 Mar. 11, Goals:
Physics 231: General Physics I Lab 6 Mar. 11, 2004 Names: Harmonic Motion Goals: 1. To learn about the basic characteristics of periodic motion period, frequency, and amplitude 2. To study what affects
More informationBiostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras
Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics
More informationChapter 2: simple regression model
Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.
More informationChapter 12 - Part I: Correlation Analysis
ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,
More informationStandard Error & Sample Means
Math Objectives Students will recognize that samples have smaller variability than the population. Students will recognize that the variability in samples is a function of sample size, n, and that the
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationLEC 4: Discriminant Analysis for Classification
LEC 4: Discriminant Analysis for Classification Dr. Guangliang Chen February 25, 2016 Outline Last time: FDA (dimensionality reduction) Today: QDA/LDA (classification) Naive Bayes classifiers Matlab/Python
More informationPhysics 201 Lab 2 Air Drag Simulation
Physics 201 Lab 2 Air Drag Simulation Jan 28, 2013 Equipment Initial Set Up Type the data from Table 1 into the appropriate cells. By preceding the content of the cell with an equal sign (as in cell A6)
More informationREVIEW 8/2/2017 陈芳华东师大英语系
REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p
More informationFundamentals of Measurement and Error Analysis
Lab 1 Fundamentals of Measurement and Error Analysis 1.1 Overview This first laboratory exercise introduces key concepts and statistical and plotting tools that are used throughout the entire sequence
More informationProbability. Hosung Sohn
Probability Hosung Sohn Department of Public Administration and International Affairs Maxwell School of Citizenship and Public Affairs Syracuse University Lecture Slide 4-3 (October 8, 2015) 1/ 43 Table
More informationUniversity of Cambridge Engineering Part IIB Module 4F10: Statistical Pattern Processing Handout 2: Multivariate Gaussians
Engineering Part IIB: Module F Statistical Pattern Processing University of Cambridge Engineering Part IIB Module F: Statistical Pattern Processing Handout : Multivariate Gaussians. Generative Model Decision
More informationData Mining and Analysis: Fundamental Concepts and Algorithms
Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA Department
More informationUsing Scientific Measurements
Section 3 Main Ideas Accuracy is different from precision. Significant figures are those measured precisely, plus one estimated digit. Scientific notation is used to express very large or very small numbers.
More informationCausal Inference. Prediction and causation are very different. Typical questions are:
Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting
More informationPurpose: Materials: WARNING! Section: Partner 2: Partner 1:
Partner 1: Partner 2: Section: PLEASE NOTE: You will need this particular lab report later in the semester again for the homework of the Rolling Motion Experiment. When you get back this graded report,
More informationStatistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe!
Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! http://www.xkcd.com/552/ Significant Concepts We structure the presentation and processing of data to
More information