STAT 4385 Topic 01: Introduction & Review

Size: px
Start display at page:

Download "STAT 4385 Topic 01: Introduction & Review"

Transcription

1 STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso Spring, 2016

2 Outline Welcome What is Regression Analysis? Basics Variable Types Expectation and Variance Basic Probability Rules Probability Distributions Binomial Distribution Normal Distribution Sampling Distributions Statistical Inference Estimation Example: Confidence Interval for µ Hypothesis Testing Example: Hypothesis Testing on µ

3 Welcome Message and Reminders Welcome Message and Reminders Self-introduction Class Website: Review syllabus Statistical Computing R: Questions and concerns

4 What is Regression Analysis? Regression Analysis in General Regression analysis refers to a set of statistical methods and procedures that are designed to model the functional association or relationships between one (or several) variables (often called response, target, or dependent variables) and another groups of variables (predictors, independent variables). Examples: The sale price of a house vs. selected physical characteristics (e.g., square footage, listed price, location, etc.) cigarette consumption vs. age, education, income, and price of cigas

5 What is Regression Analysis? The Set-Up The data {(y i, x i1,..., x ip ) : i = 1,..., n; j = 1,..., p} consist of n i.i.d. copies of variables (Y ; X 1,..., X p ). The variable Y is called the dependent variable, response, target, endpoint, or outcome variable, depending on the application setting. Variables Xj s (for j = 1,..., p) are called the independent variables, or predictors, inputs, attributes, or features. Functional Model General model (assuming Y is continuous) where ε is an error term. Y = f (X 1, X 2,..., X p ) + ε, Linear model Y = β0 + β1x1 + + βpxp + ε, where {β 0, β 1,..., β p } are regression coefficients or parameters.

6 What is Regression Analysis? Example: The Real Estate Data Suppose a property appraiser wants to model the relationship between the sale price (Y ) of a residential property in a mid-size city and the following three predictors: 1. X 1 Appraised land value (in dollars) 2. X 2 Appraised improvements (in dollars) 3. X 3 Area (square feet)

7 What is Regression Analysis? Example: The Data Sale Price Land Improvements Area Property (y) Value (x 1 ) Value (x 2 ) x

8 What is Regression Analysis? The Data Layout in General In general, the data available consist of {(y i, x i1,..., x ip ) : i = 1,..., n}, where n is the sample size. ID Y X 1 X 2 X p 1 y 1 x 11 x 12 X 1p 2 y 2 x 21 x 22 X 2p n y n x n1 x n2 X np

9 What is Regression Analysis? Purposes of Regression We want a functional form that models the relationship between Y and X j s. Two practical purposes of regression: Predict Y via X j s for prediction or forecasting purposes, e.g., stock market data Study the relationship between response and predictors, e.g., clinical trial data

10 What is Regression Analysis? Types of Regression Several different ways of categorizing regression models exist. Approach: parametric vs. non-parametric Functional form: linear vs. nonlinear Number of Y s: univariate regression vs. multivariate regression Number of X s: simple vs. multiple regression Measurement types of Y : logistic regression, log-linear regression, survival analysis, etc. Dependence structures: longitudinal data analysis, time series, spatial statistics, etc. In the course, attention is confined to single continuous Y, one or multiple X of mixed types, linear models. We will study simple/multiple linear regression models.

11 What is Regression Analysis? Steps in Regression Analysis Data Collection: statement of problem or research, selection of potentially relevant variables; question Exploratory Data Analysis (EDA): numerical measures and graphical tools for describing and summarizing data and associations; Model Specification: model form, model assumptions Model Fitting: estimation of parameters involved in the model and statistical inference; Model Selection: Model Diagnostics: detection; variable selection techniques; model assumption check and outlier Model Validation and Deployment: Using the final model to answer the initial scientific question.

12 Basics Variable Types Types of Variables

13 Basics Expectation and Variance Expectation and Variance: Simple Facts If X and Y are random variables and a and b are constants, then (i) E(X + Y ) = E(X ) + E(Y ) (ii) E(aX ) = a E(X ) (iii) var(x + Y ) = var(x ) + var(y ) + 2cov(X, Y ) (iv) var(ax ) = a 2 var(x ) (v) cov(ax, by ) = ab cov(x, Y ) (vi) cov(x, X ) = var(x )

14 Basics Expectation and Variance Expectation and Variance of Linear Combinations If X 1, X 2,..., X n are n random variables and a 1,..., a n are constants, then ( n ) n E a i X i = a i E(X i ) i=1 ( n ) var a i X i i=1 = i=1 n ai 2 var(x i ) + i=1 n i i =1 a i a i cov(x i, X i ). Example: Given X 1,, X n IID (µ, σ 2 ), let X = n i=1 X i/n denote their average. Verify that E( X ) = µ and var( X ) = σ 2 /n.

15 Basics Basic Probability Rules Basic Probability Rules (i) Pr( ) = 0 (ii) For any event A, 0 Pr(A) 1 (iii) If A B then Pr(A) Pr(b) (iv) Pr(A B) = Pr(A) + Pr(B) Pr(A B) (v) If A and B are mutually disjoint, i.e., A B =, then Pr(A B) = Pr(A) + Pr(B). (vi) For any event A, let A c or Ā denote its complement. Then Pr(A c ) = 1 Pr(A)

16 Basics Basic Probability Rules Basic Probability Rules (vii) Given events B 1, B 2,..., B n mutually exclusive and n i=1 B i = Ω (i.e., {B i } n i=1 form a partition of the probability space), then, for any event A, Pr(A) = n Pr(A B i ) i=1 (viii) For any events {A, B, C}, it follows that Pr(A B C) = Pr(A) + Pr(B) + Pr(C) Pr(A B) Pr(A C) Pr(B C) + Pr(A B C)

17 Basics Basic Probability Rules Bayes Theorem Conditional Probability: Pr(A B) = Pr(A B)/ Pr(B). For any events A and B, Pr(B A) = = Pr(A B) Pr(B) Pr(A) Pr(A B) Pr(B) Pr(A B) Pr(B) + Pr(A B c ) Pr(B c ) Two events A and B are independent if Pr(A B) = Pr(A) Pr(B) or, equivalently, Pr(A B) = Pr(A). Given that {B i } n i=1 forms a partition of Ω, then Pr(B i A) = P(A B i ) P(B i ) n i=1 Pr(A B i) Pr(B i )

18 Probability Distributions Probability Distributions Discrete Bernoulli Trial Binomial distribution Poisson distribution and etc. Continuous Uniform Normal and Multivariate Normal Sampling Distributions: Distribution of sample statistics, e.g., χ 2 (ν), t(ν), and F (ν 1, ν 2 )

19 Probability Distributions Binomial Distribution Binomial Distribution A Typical Example: tossing a (fair) coin for 100 times and record the number of heads obtained. Definition An experiment consists of n independent and identical trials, each trial having two possible outcomes success or failure with Pr(getting a success ) = p. Let X denote the total number of successes obtained. Then X is said to follow Binomial(n, p).

20 Probability Distributions Binomial Distribution Binomial Distribution: Facts Given X Binomial(n, p), it follows that E(X ) = np and var(x ) = np(1 p) Possible values of X are 0, 1, 2,..., n Probability distribution function ( ) n Pr(X = k) = p k (1 p) n k with k for k = 0, 1,..., n. ( ) n = k n! k!(n k)! X /n can be viewed as the sample average of n IID Bernoulli trials so that the central limit theorem (CLT) applies.

21 Probability Distributions Binomial Distribution An Example Consider an experiment of rolling a fair six-sided die 20 times. The probability p of rolling a six on any roll is 1/6. the count X of sixes has a Binomial(20, 1/6) distribution. The mean of this distribution is 20/6 = 3.33, and the variance is 20 1/6 5/6 = 100/36 = The mean of the proportion of sixes in the 20 rolls (X /20) is equal to p = 1/6 = 0.167, and the variance of the proportion is equal to (1/6 5/6)/20 = Find the probability of obtaining at most 2 sixes.

22 Probability Distributions Normal Distribution Normal Distribution A random variable X is said to follow N(µ, σ 2 ) if it has a density function as f X (x) = 1 { σ 2π exp 1 2 (x µ) 2 σ 2 }. Normal density curve is symmetrical, centered about its mean µ, with its spread determined by its standard deviation σ.

23 Probability Distributions Normal Distribution Facts Given that X N(µ, σ 2 ), it follows that a + bx N(a + bµ, b 2 σ 2 ) In general, a linear combination of normal variables also follows a normal distribution. Standard Normal Distribution Z N(0, 1). We have X µ σ N(0, 1) and µ + σ Z N(µ, σ 2 ).

24 Probability Distributions Normal Distribution Sample Average Given a random sample {X 1,..., X n } taken from a population P with mean µ and variance σ 2. Let X = n i=1 X i/n denote the sample average. If P is normal, then X N(µ, σ 2 /n) exactly no matter how small n is. (Central Limit Theorem) When n is large, one has, by CLT, X N(µ, σ 2 /n) approximately no matter what distribution P has.

25 Probability Distributions Normal Distribution Exercises on Normal Distributions Know how to find normal probabilities using Tables Standardization Use probability rules Normal approximation to binomial distribution is an application of CLT. Given X Binomial(n, p), X N{np, np(1 p)} approximately when n is large.

26 Probability Distributions Normal Distribution Example on Normal Distributions and Sampling Distribution Suppose that the weights of milk bottles is normally distributed with a mean of 1.1 lbs and a standard deviation (σ)=0.20. (note that normality is obviously an assumption). Let X denote the weight of a randomly selected milk bottle. What is the distribution of X? Solution: X {µ = 1.1, σ 2 = }. What is the probability that a randomly selected milk bottle will be greater than 0.99 lbs? Solution: Pr(X > 0.99) = Pr ( Z > ) = Pr(Z > 0.55) 0.20 = 1 Pr(Z 0.55) = =

27 Probability Distributions Normal Distribution Consider a random sample of 5 milk bottles. Let X denote their average weight. What is the distribution of X and why? Solution: X {µ = 1.1, σ 2 /n = /5 = 0.008}. The sample average from a normal population is always normally distributed, no matter how small the sample size is. What is the probability that the mean/average weight of a random sample of 5 bottles will be greater than 0.99 lbs? Solution: Pr( X > 0.99) = ( ) Pr Z > 0.20/ = Pr(Z > 1.23) 5 = 1 Pr(Z 1.23) = =

28 Probability Distributions Normal Distribution Multivariate Normal Distribution Definition A random vector X = (X 1,..., X n ) T R n is said to have a multivariate normal (or Gaussian) distribution when mean µ = (µ i ) R n and covariance matrix Σ = (σ ii ) 0 (meaning positive semidefinite), denote it as X N (µ, Σ), if its probability density function is given by density X1 X2 { } f (X; µ, Σ) = (2π) n/2 Σ 1/2 exp (x µ)t Σ 1 (x µ) 2.

29 Probability Distributions Normal Distribution Bivariate Normal Distribution Illustration

30 Probability Distributions Normal Distribution Properties of Multivariate Normal Distributions Given X N (µ, Σ), E(X) = µ and cov(x) = Σ Each component X i N (µ i, σ ii ) marginally. AX N (Aµ, AΣA T ) for constant matrix A of appropriate dimension. This implies that linear combinations of any components of X are normally distributed. For multivariate normal random variables, zero covariance (or correlation) implies independence.

31 Probability Distributions Sampling Distributions Sampling Distribution Sampling Distribution is the probability distribution of a sample statistic. A statistic is a numerical summary of sample data such as a sample proportion or sample mean. The sample statistic is used to estimate or infer about parameter. With random sampling, the sampling distribution provides probabilities for all the possible values of the statistic. The sampling distribution provides the key for telling us how close a sample statistic falls to the corresponding unknown parameter. Its standard deviation is called the standard error.

32 Probability Distributions Sampling Distributions Sampling Distribution of the Sample Mean X

33 Probability Distributions Sampling Distributions χ 2 (ν) Distribution Definition Given i.i.d. standard normal variables {Z 1,..., Z ν }, χ 2 = ν i=1 Z i 2 follows a (central) chi-squared distribution with ν degrees of freedom. Facts: Given X χ 2 (ν), X Gamma(ν/2, 1/2) E(X ) = ν and var(x ) = 2ν.

34 Probability Distributions Sampling Distributions t(ν) Distribution Definition Given that Z N(0, 1) and χ 2 χ 2 (ν) are independent, let t = Z follows a χ 2 /ν (central) t distribution with ν degrees of freedom. Facts: Given t t(ν) with ν > 0, E(t) = 0 for ν > 1 and var(t) = ν/(ν 2) > 1 for ν > 2 t density is also bell-shaped yet with more spread than N(0, 1). As ν, t approximates N(0, 1).

35 Probability Distributions Sampling Distributions F (ν 1, ν 2 ) Distribution Definition Given that χ 2 1 χ2 (ν 1 ) and χ 2 2 χ2 (ν 2 ) are independent, let F = χ2 1 /ν 1 χ 2 2 /ν 2 follows a (central) F distribution with numerator df ν 1 and denominator df ν 2. Facts: Given F F (ν 1, ν 2 ), it follows that t 2 (ν) = F (1, ν) F (ν 2, ν 1 ) = 1/F (ν 1, ν 2 )

36 Probability Distributions Sampling Distributions Relevant R Functions R has built-in functions for computing the density (d), probability (p), and quantile (q) for common distributions, as well as for simulating random data (r). Use R Help facilities to find out the details, e.g., help(rchisq). Example: pnorm(q, mean = 0, sd = 1, lower.tail = TRUE) qf(p, df1, df2, ncp, lower.tail = TRUE)

37 Statistical Inference Statistical Inference Statistical Inference is the process of drawing conclusions about the population based on analysis of sampled data. Two forms of inferences Estimation: point estimation and interval estimation. Hypothesis Testing Types of Inferences Parametric, nonparametric, semiparametric Inferences Bayesian vs. Frequentist Inferences

38 Statistical Inference Estimation Statistical Inference: Estimation Given a parameter θ, want to find the best estimate or best guess ˆθ based on sample data. Criteria for defining best include unbiasedness, minimum variance, minimum mean squared error,... The probability distribution of ˆθ is needed for reliability assessment. Confidence Intervals (CI): looking for an interval (L, U) such as Pr(L < θ < U) = (1 α) 100%.

39 Statistical Inference Estimation One-Sample Inference on Mean µ Suppose that a random sample {X 1, X 2,..., X n } of size n is taken from a population that has mean µ and variance σ 2. Want to infer about µ. Point Estimate: the sample average X = n i=1 X i/n. E( X ) = µ and var( X ) = σ 2 /n. Sampling Distribution When the original population is normal, X N(µ, σ 2 /n) exactly no matter how small or large the sample size n is. When n is large, X N(µ, σ 2 /n) approximately no matter what distribution the original population has.

40 Statistical Inference Estimation Confidence Interval for µ To remove the unknown nuisance parameter σ 2, note ( X µ)/ } σ 2 /n N(0, 1) (n 1)ˆσ 2 /σ 2 χ 2 independent = (n 1) t = X µ ˆσ 2 /n t(n 1). (1 α) 100% Confidence Interval for µ X ± t (n 1) ˆσ 1 α/2, n where t (n 1) 1 α/2 denotes the (1 α/2) 100% percentile of t distribution with DF (n 1).

41 Statistical Inference Estimation Example: Confidence Interval for µ We randomly select and measure the contents of 15 bottles of cough syrup. The results (in fluid ounces) are shown below: Construct a 95% CI for the mean content of cough syrup. Solution: ± /15, which leads to (4.2305, ).

42 Statistical Inference Hypothesis Testing Statistical Inference: Hypothesis Testing In statistics, a hypothesis is a statement about the population. The null hypothesis H0 The alternative (research) hypothesis H a Hypothesis testing is a statistical procedure used to makes decisions on the validity of a hypothesis based on analysis of sample data.

43 Statistical Inference Hypothesis Testing Concepts in Hypothesis Testing True State (Population) Decision (Sample) True H 0 False H 0 Fail to reject H 0 Correct (1 α) Type II error (β) Reject H 0 Type I error (α) Correct (1 β) Which of Type I or II error is more severe? The size is the probability of making type I error. The significance level α is the maximum size tolerable, i.e., α = max Pr (reject H 0 H 0 ). The power is the probability of rejecting the null H 0 if H 0 is false in reality, i.e., power = Pr (reject H 0 H a ).

44 Statistical Inference Hypothesis Testing Steps in Hypothesis Testing 1. State the null and alternative hypotheses: H 0 and H a ; 2. Compute the value of the observed test statistics T ; The value of T should be sensitive to whether data support H 0 or H a ; The probability distributions of T under H0 and H a are available. 3. Find the decision/rejection region; Critical values, e.g., reject H 0 if T is greater than some threshold. Reject H0 whenever P-value < α 4. Make conclusions and interpret within application context.

45 Statistical Inference Hypothesis Testing The P-Value The p-value is the probability of obtaining a value that is as extreme as or more unusual than the actually observed test statistics value, assuming that the null hypothesis H 0 is true. Reject H 0 whenever the p-value is smaller than the significance level α. Common Misconceptions The p-value is NOT the probability that the null hypothesis is true; NOR is it the probability that H a is false; NOR the probability of falsely rejecting the null hypothesis. A small p-value is evidence against the null hypothesis while a large p-value means little or no evidence against H 0. Note that little or no evidence against the null hypothesis is not the same as a lot of evidence for the null hypothesis.

46 Statistical Inference Hypothesis Testing Estimation vs. Hypothesis Testing The two types of inferences are equivalent analytically One may make a decision in hypothesis testing problems by looking at the appropriate CI; on the other hand, CI can be derived by inverting the hypothesis testing procedure. Nevertheless, CIs are sometimes preferable to hypothesis testing as a set of confidence intervals tells the range with which the parameter is likely to fall while the hypothesis testing only tells you whether the parameter is likely to be the same as a pre-specified or hypothesized value.

47 Statistical Inference Hypothesis Testing One-Sample Inference: Hypothesis Testing on µ Hypothesis The Null H0 : µ = µ 0 H a : µ µ 0 (two-sided); The Alternative H a : µ > µ 0 (upper-sided); H a : µ < µ 0 (lower-sided); where µ 0 is the hypothesized value. Test Statistic: t obs = X µ 0 ˆσ 2 /n t(n 1) only when H 0 : µ = µ 0 is true.

48 Statistical Inference Hypothesis Testing Rejection Rule: Reject H 0 at the significance level α if tobs (n 1) > t 1 α/2 (two-sided); t obs > t (n 1) 1 α (upper-sided); t obs > t α (n 1) = t (n 1) 1 α (lower-sided). Compute the associated P-Value 2 Pr ( t (n 1) > ) t obs (two-sided); P-Value = Pr ( t (n 1) ) > t obs (upper-sided); Pr ( t (n 1) ) < t obs (lower-sided).

49 Statistical Inference Hypothesis Testing Equivalence of Significance Testing and CI Consider the two-sided test of H 0 : µ = µ 0 vs. H a : µ µ 0. We reject H 0 at α when t obs > t (n 1) 1 α/2 > t (n 1) X µ 0 ˆσ/ n 1 α/2 µ 0 > X + t (n 1) 1 α/2 ˆσ/ n µ 0 < X t (n 1) 1 α/2 ˆσ/ n, Namely, we reject H 0 when µ 0 falls outside of the (1 α) 100% CI for µ. or

50 Statistical Inference Hypothesis Testing Consider the upper-sided test of H 0 : µ = µ 0 vs. H a : µ > µ 0. We reject H 0 when t obs > t (n 1) 1 α X µ 0 ˆσ/ n > t(n 1) 1 α or equivalently, µ 0 < X t (n 1) 1 α ˆσ/ n. Namely, reject H 0 at significance level α when µ 0 is smaller than the lower bound of the (1 2a) 100% CI for µ. Similarly, we reject H 0 at α in testing H 0 : µ = µ 0 vs. H a : µ < µ 0, when µ 0 is greater than the upper bound of the (1 2α) 100% CI for µ.

51 Statistical Inference Hypothesis Testing Example The average sleep time (µ) is supposed to be 8 hours a day. We think college students sleep a different amount, maybe more maybe less. We survey ten students to see how much they sleep. The data are as follows (each cell represents a student): R output from Function t.test() N Mean Std. Deviation SLEEP t df Sig. (2-tailed) SLEEP <.0001

52 Statistical Inference Hypothesis Testing Example (Continued) It can be found that t (9) 0.05 = and t(9) = The sample mean is 5.2. Compared against µ 0 = 8, the difference is 2.8 hours. The difference is significant as the observed t value is so big and the probability of this being chance is so low. Even when the sample size n is large, one can always use the t test as the t distribution with large d.f. gets close to standard normal anyways.

53 Statistical Inference Hypothesis Testing Discussion Thanks! Questions?

Lecture 2: Review of Probability

Lecture 2: Review of Probability Lecture 2: Review of Probability Zheng Tian Contents 1 Random Variables and Probability Distributions 2 1.1 Defining probabilities and random variables..................... 2 1.2 Probability distributions................................

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Review of Basic Probability The fundamentals, random variables, probability distributions Probability mass/density functions

More information

STAT 4385 Topic 03: Simple Linear Regression

STAT 4385 Topic 03: Simple Linear Regression STAT 4385 Topic 03: Simple Linear Regression Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2017 Outline The Set-Up Exploratory Data Analysis

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Advanced Herd Management Probabilities and distributions

Advanced Herd Management Probabilities and distributions Advanced Herd Management Probabilities and distributions Anders Ringgaard Kristensen Slide 1 Outline Probabilities Conditional probabilities Bayes theorem Distributions Discrete Continuous Distribution

More information

Lecture 11. Multivariate Normal theory

Lecture 11. Multivariate Normal theory 10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Quick Tour of Basic Probability Theory and Linear Algebra

Quick Tour of Basic Probability Theory and Linear Algebra Quick Tour of and Linear Algebra Quick Tour of and Linear Algebra CS224w: Social and Information Network Analysis Fall 2011 Quick Tour of and Linear Algebra Quick Tour of and Linear Algebra Outline Definitions

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

Probability Theory and Statistics. Peter Jochumzen

Probability Theory and Statistics. Peter Jochumzen Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................

More information

Stat 704 Data Analysis I Probability Review

Stat 704 Data Analysis I Probability Review 1 / 39 Stat 704 Data Analysis I Probability Review Dr. Yen-Yi Ho Department of Statistics, University of South Carolina A.3 Random Variables 2 / 39 def n: A random variable is defined as a function that

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr.

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr. Topic 2: Probability & Distributions ECO220Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit utoronto.ca November 21, 2017 Dr. Nick

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

ECON Fundamentals of Probability

ECON Fundamentals of Probability ECON 351 - Fundamentals of Probability Maggie Jones 1 / 32 Random Variables A random variable is one that takes on numerical values, i.e. numerical summary of a random outcome e.g., prices, total GDP,

More information

Random Variables and Their Distributions

Random Variables and Their Distributions Chapter 3 Random Variables and Their Distributions A random variable (r.v.) is a function that assigns one and only one numerical value to each simple event in an experiment. We will denote r.vs by capital

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions 1999 Prentice-Hall, Inc. Chap. 4-1 Chapter Topics Basic Probability Concepts: Sample

More information

Recitation 2: Probability

Recitation 2: Probability Recitation 2: Probability Colin White, Kenny Marino January 23, 2018 Outline Facts about sets Definitions and facts about probability Random Variables and Joint Distributions Characteristics of distributions

More information

Advanced topics from statistics

Advanced topics from statistics Advanced topics from statistics Anders Ringgaard Kristensen Advanced Herd Management Slide 1 Outline Covariance and correlation Random vectors and multivariate distributions The multinomial distribution

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Probability Theory for Machine Learning. Chris Cremer September 2015

Probability Theory for Machine Learning. Chris Cremer September 2015 Probability Theory for Machine Learning Chris Cremer September 2015 Outline Motivation Probability Definitions and Rules Probability Distributions MLE for Gaussian Parameter Estimation MLE and Least Squares

More information

STA301- Statistics and Probability Solved Subjective From Final term Papers. STA301- Statistics and Probability Final Term Examination - Spring 2012

STA301- Statistics and Probability Solved Subjective From Final term Papers. STA301- Statistics and Probability Final Term Examination - Spring 2012 STA30- Statistics and Probability Solved Subjective From Final term Papers Feb 6,03 MC004085 Moaaz.pk@gmail.com Mc004085@gmail.com PSMD0 STA30- Statistics and Probability Final Term Examination - Spring

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Probability and Distributions

Probability and Distributions Probability and Distributions What is a statistical model? A statistical model is a set of assumptions by which the hypothetical population distribution of data is inferred. It is typically postulated

More information

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

STAT2201. Analysis of Engineering & Scientific Data. Unit 3 STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random

More information

1 Presessional Probability

1 Presessional Probability 1 Presessional Probability Probability theory is essential for the development of mathematical models in finance, because of the randomness nature of price fluctuations in the markets. This presessional

More information

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution

More information

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Question 1. Suppose you want to estimate the percentage of

More information

Sampling Distribution: Week 6

Sampling Distribution: Week 6 Sampling Distribution: Week 6 Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu February 27, 2015 Kwonsang Lee STAT111 February 27, 2015 1 / 16 Sampling Distribution: Sample Mean If X 1,

More information

Review of Basic Probability Theory

Review of Basic Probability Theory Review of Basic Probability Theory James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 35 Review of Basic Probability Theory

More information

BASICS OF PROBABILITY

BASICS OF PROBABILITY October 10, 2018 BASICS OF PROBABILITY Randomness, sample space and probability Probability is concerned with random experiments. That is, an experiment, the outcome of which cannot be predicted with certainty,

More information

Statistics for Managers Using Microsoft Excel (3 rd Edition)

Statistics for Managers Using Microsoft Excel (3 rd Edition) Statistics for Managers Using Microsoft Excel (3 rd Edition) Chapter 4 Basic Probability and Discrete Probability Distributions 2002 Prentice-Hall, Inc. Chap 4-1 Chapter Topics Basic probability concepts

More information

Statistical Hypothesis Testing

Statistical Hypothesis Testing Statistical Hypothesis Testing Dr. Phillip YAM 2012/2013 Spring Semester Reference: Chapter 7 of Tests of Statistical Hypotheses by Hogg and Tanis. Section 7.1 Tests about Proportions A statistical hypothesis

More information

Stat 5101 Notes: Brand Name Distributions

Stat 5101 Notes: Brand Name Distributions Stat 5101 Notes: Brand Name Distributions Charles J. Geyer September 5, 2012 Contents 1 Discrete Uniform Distribution 2 2 General Discrete Uniform Distribution 2 3 Uniform Distribution 3 4 General Uniform

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

1 Probability theory. 2 Random variables and probability theory.

1 Probability theory. 2 Random variables and probability theory. Probability theory Here we summarize some of the probability theory we need. If this is totally unfamiliar to you, you should look at one of the sources given in the readings. In essence, for the major

More information

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0. () () a. X is a binomial distribution with n = 000, p = /6 b. The expected value, variance, and standard deviation of X is: E(X) = np = 000 = 000 6 var(x) = np( p) = 000 5 6 666 stdev(x) = np( p) = 000

More information

Notes on the Multivariate Normal and Related Topics

Notes on the Multivariate Normal and Related Topics Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Single Maths B: Introduction to Probability

Single Maths B: Introduction to Probability Single Maths B: Introduction to Probability Overview Lecturer Email Office Homework Webpage Dr Jonathan Cumming j.a.cumming@durham.ac.uk CM233 None! http://maths.dur.ac.uk/stats/people/jac/singleb/ 1 Introduction

More information

Department of Large Animal Sciences. Outline. Slide 2. Department of Large Animal Sciences. Slide 4. Department of Large Animal Sciences

Department of Large Animal Sciences. Outline. Slide 2. Department of Large Animal Sciences. Slide 4. Department of Large Animal Sciences Outline Advanced topics from statistics Anders Ringgaard Kristensen Covariance and correlation Random vectors and multivariate distributions The multinomial distribution The multivariate normal distribution

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Algorithms for Uncertainty Quantification Tobias Neckel, Ionuț-Gabriel Farcaș Lehrstuhl Informatik V Summer Semester 2017 Lecture 2: Repetition of probability theory and statistics Example: coin flip Example

More information

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Business Statistics. Lecture 3: Random Variables and the Normal Distribution

Business Statistics. Lecture 3: Random Variables and the Normal Distribution Business Statistics Lecture 3: Random Variables and the Normal Distribution 1 Goals for this Lecture A little bit of probability Random variables The normal distribution 2 Probability vs. Statistics Probability:

More information

Bivariate distributions

Bivariate distributions Bivariate distributions 3 th October 017 lecture based on Hogg Tanis Zimmerman: Probability and Statistical Inference (9th ed.) Bivariate Distributions of the Discrete Type The Correlation Coefficient

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

MATH 3510: PROBABILITY AND STATS June 15, 2011 MIDTERM EXAM

MATH 3510: PROBABILITY AND STATS June 15, 2011 MIDTERM EXAM MATH 3510: PROBABILITY AND STATS June 15, 2011 MIDTERM EXAM YOUR NAME: KEY: Answers in Blue Show all your work. Answers out of the blue and without any supporting work may receive no credit even if they

More information

Deep Learning for Computer Vision

Deep Learning for Computer Vision Deep Learning for Computer Vision Lecture 3: Probability, Bayes Theorem, and Bayes Classification Peter Belhumeur Computer Science Columbia University Probability Should you play this game? Game: A fair

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

Bandits, Experts, and Games

Bandits, Experts, and Games Bandits, Experts, and Games CMSC 858G Fall 2016 University of Maryland Intro to Probability* Alex Slivkins Microsoft Research NYC * Many of the slides adopted from Ron Jin and Mohammad Hajiaghayi Outline

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces. Probability Theory To start out the course, we need to know something about statistics and probability Introduction to Probability Theory L645 Advanced NLP Autumn 2009 This is only an introduction; for

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!!  (preferred!)!! Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive

More information

CME 106: Review Probability theory

CME 106: Review Probability theory : Probability theory Sven Schmit April 3, 2015 1 Overview In the first half of the course, we covered topics from probability theory. The difference between statistics and probability theory is the following:

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

Topic 3: The Expectation of a Random Variable

Topic 3: The Expectation of a Random Variable Topic 3: The Expectation of a Random Variable Course 003, 2017 Page 0 Expectation of a discrete random variable Definition (Expectation of a discrete r.v.): The expected value (also called the expectation

More information

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment.

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment. Course information: Instructor: Tim Hanson, Leconte 219C, phone 777-3859. Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment. Text: Applied Linear Statistical Models (5th Edition),

More information

Review of probability and statistics 1 / 31

Review of probability and statistics 1 / 31 Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<= A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

An introduction to biostatistics: part 1

An introduction to biostatistics: part 1 An introduction to biostatistics: part 1 Cavan Reilly September 6, 2017 Table of contents Introduction to data analysis Uncertainty Probability Conditional probability Random variables Discrete random

More information

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math

More information

Brief Review of Probability

Brief Review of Probability Maura Department of Economics and Finance Università Tor Vergata Outline 1 Distribution Functions Quantiles and Modes of a Distribution 2 Example 3 Example 4 Distributions Outline Distribution Functions

More information

Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing

Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing ECO22Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit

More information

Gov 2000: 6. Hypothesis Testing

Gov 2000: 6. Hypothesis Testing Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

Overview. CSE 21 Day 5. Image/Coimage. Monotonic Lists. Functions Probabilistic analysis

Overview. CSE 21 Day 5. Image/Coimage. Monotonic Lists. Functions Probabilistic analysis Day 5 Functions/Probability Overview Functions Probabilistic analysis Neil Rhodes UC San Diego Image/Coimage The image of f is the set of values f actually takes on (a subset of the codomain) The inverse

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

Week 12-13: Discrete Probability

Week 12-13: Discrete Probability Week 12-13: Discrete Probability November 21, 2018 1 Probability Space There are many problems about chances or possibilities, called probability in mathematics. When we roll two dice there are possible

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information