REVIEW: Midterm Exam. Spring 2012

Similar documents
Glossary for the Triola Statistics Series

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Learning Objectives for Stat 225

AP Statistics Cumulative AP Exam Study Guide

Prentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

STAT 200 Chapter 1 Looking at Data - Distributions

Exam 1 Review (Notes 1-8)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

M & M Project. Think! Crunch those numbers! Answer!

P8130: Biostatistical Methods I

STATISTICS 141 Final Review

Practical Statistics for the Analytical Scientist Table of Contents

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Chapter 2: Tools for Exploring Univariate Data

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

Chapter 2 Solutions Page 15 of 28

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

Section 3. Measures of Variation

A is one of the categories into which qualitative data can be classified.

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number:

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

CIVL 7012/8012. Collection and Analysis of Information

Example 2. Given the data below, complete the chart:

Math Review Sheet, Fall 2008

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Sociology 6Z03 Review II

Ch. 1: Data and Distributions

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Random Variable. Random Variables. Chapter 5 Slides. Maurice Geraghty Inferential Statistics and Probability a Holistic Approach

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

Sociology 6Z03 Review I

Sets and Set notation. Algebra 2 Unit 8 Notes

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

Introduction to Statistics

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Statistical Methods. by Robert W. Lindeman WPI, Dept. of Computer Science

Descriptive Data Summarization

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n

TOPIC: Descriptive Statistics Single Variable

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Macomb Community College Department of Mathematics. Review for the Math 1340 Final Exam

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS

3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability

FREQUENCY DISTRIBUTIONS AND PERCENTILES

Exam details. Final Review Session. Things to Review

Lecture 1: Descriptive Statistics

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS

Test 1 Review. Review. Cathy Poliak, Ph.D. Office in Fleming 11c (Department Reveiw of Mathematics University of Houston Exam 1)

The Union and Intersection for Different Configurations of Two Events Mutually Exclusive vs Independency of Events

Chapter 3. Data Description

Describing distributions with numbers

Business Statistics. Lecture 10: Course Review

Statistics I Chapter 2: Univariate data analysis

Counting principles, including permutations and combinations.

Introduction to Statistics

University of Jordan Fall 2009/2010 Department of Mathematics

ECLT 5810 Data Preprocessing. Prof. Wai Lam

Chapter 1. Looking at Data

Describing distributions with numbers

Elementary Statistics

Lecture 2 and Lecture 3

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

MgtOp 215 Chapter 3 Dr. Ahn

Descriptive Statistics

dates given in your syllabus.

Statistics in medicine

Statistics I Chapter 2: Univariate data analysis

Stat Lecture Slides Exploring Numerical Data. Yibi Huang Department of Statistics University of Chicago

Micro Syllabus for Statistics (B.Sc. CSIT) program

Descriptive Statistics C H A P T E R 5 P P

are the objects described by a set of data. They may be people, animals or things.

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 3.1-1

Math 221, REVIEW, Instructor: Susan Sun Nunamaker

20 Hypothesis Testing, Part I

Index I-1. in one variable, solution set of, 474 solving by factoring, 473 cubic function definition, 394 graphs of, 394 x-intercepts on, 474

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

Review of Statistics 101

1. Exploratory Data Analysis

AP Final Review II Exploring Data (20% 30%)

Continuous random variables

Introduction to Basic Statistics Version 2

Descriptive Univariate Statistics and Bivariate Correlation

Exam: 4 hour multiple choice. Agenda. Course Introduction to Statistics. Lecture 1: Introduction to Statistics. Per Bruun Brockhoff

= Stat1600 Solution to Midterm #1 Form D

STATISTICS/MATH /1760 SHANNON MYERS

Stat1600 Solution to Midterm #1 Form A

Math 1040 Sample Final Examination. Problem Points Score Total 200

A SHORT INTRODUCTION TO PROBABILITY

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

Transcription:

REVIEW: Midterm Exam Spring 2012

Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample

Types of Data Parameter (Describing a characteristic of the Population) Statistic (Describing a characteristic of the Sample) -QUALITATIVE DATA (Categorical or Attribute Data) -QUANTITATIVE DATA: - Discrete - Continuous Levels of Measurement: - Nominal - Ordinal - Interval - Ratio

Design of Experiments An observational study (don t attempt to modify the subjects) An experiment (treatment group vs. control group) Types of Observational Studies: Cross-sectional Retrospective (or case-control) Prospective (or longitudinal or cohort)

Problems Confounding (confusion of variables effects) How to solve this problem?: Blinding (placebo effect, single-blind, double-blind) Blocking Randomization: Completely randomized design Randomized block design

Sampling strategies Random sample Simple random sample (assumed throughout the book) Systematic sampling Convenience sampling Stratified sampling Cluster sampling Sampling Error :difference between sampling result and the true population result Nonsampling error: Sample data incorrectly collected

Important characteristics of data Center Variation Distribution Outliers Time

Frequency distribution Counts of data values individually or by groups of intervals Other forms: Relative frequency distribution (divide each class frequency by the total of all frequencies) Cumulative frequency distribution (cumulative totals) Histogram: Graphical representation of the frequency distribution

Other graphs Relative frequency histogram Frequency polygon Dotplots Steam-Leaf plots Pareto chart Pie Charts Scatter Diagrams Time series graphs

Examples: Histogram and Scatter plot

Measures of center Sample Mean: = Median: Middle value Mode: Most frequent value Bimodal Multimodal No mode Midrange=

Skewed distributions

Measures of Variation Range= (Maximum value-minimum value) Sample standard deviation: Variation from the mean = ( ) 1 Population standard deviation: = ( ) Sample variance: = ( ) 1 Population Variance: = ( )

Measures of Variation (Cont.) Sample Coefficient of Variation: =. 100% Population coefficient of variation: =. 100% Range Rule of Thumb 4 Minimum usual value: (mean)-2 x (standard deviation) Maximum usual value: (mean)+2 x (standard deviation)

Rule of data with Bell-Shaped distribution About 68% of all values fall within 1 standard deviation of the mean About 95% of all values fall within 2 standard deviations of the mean About 99.7% of all values fall within 3 standard deviations of the mean

Z Scores Sample Population = = Ordinary values: -2 z score 2 Unusual value: z score < -2 or z score> 2

Quartiles and Percentiles Quartiles: Separate a data set into four parts Q1 (First): Separates bottom 25% of the sorted values from the top 75% Q2 (Second): Same as the median Q3 (Third): Separates bottom 75% of the sorted values from the top 25% Percentiles: Separate the data into 100 parts (P 1, P 2,, P 99 ) Percentile value of x=. 100 Intercuartile range= Q3-Q1

Boxplots

Probability Definitions: An event A simple event The Sample Space Notation P: Probability A,B and C: specific events P(A): Probability of event A occurring

Definitions of Probability Frequency approximation: P(A)= Classical Approach: P(A)= Subjective Probability = LAW OF LARGE NUMBERS: A procedure is repeated many times. Relative frequency probability tends to the actual probability

Properties of probability Probability of an impossible event is 0 Probability of an event that is certain is 1 For any event A, 0 P(A) 1 P(Complement of event A)=P( ) = 1 ( ) Addition Rule: P(A or B)=P(in a single trial, event A occurs or event B occurs or they both occur)= P(A)+P(B)-P(A and B) Or P(A B)= P(A)+P(B)-P(A B) Events A and B are disjoint if P(A B)=0

Multiplication Rule P(A and B)=P(event A occurs in the first trial and event B occurs in a second trial) =. Independent events: P(B A)=P(B) If A and B are independent: =. ( ) Conditional probability: = ( ) ( )

Bayes Theorem =. ( ). [. ]

Probability distributions Definitions: Random Variable (x): Numerical value given to an outcome of a procedure. Example: Number Mountain lions seen at UCSC campus last year Probability distribution (P(x)): Gives the probability to each value of the random variable. Types of random variables: Discrete Continuous

Requirements of a Probability distribution = 1(Discrete case) 0 P(x) 1 Expected value of a discrete random variable = [. ] Discrete Distributions: Binomial Poisson

Binomial distribution Requirements: Fixed number of trials Trials are independent Each trial can be a success or a failure Probabilities remain constant Random variable: x=number of successes among n trials =!.!!. (You can also use the Binomial Table) n= number of trials p=probability of success in one trial q=probability of failure in one trial (q=1-p)

Mean,Variance and Standard deviation of the Binomial distribution Mean: = Variance: = Standard deviation: Maximum usual value: + 2 Minimum usual value: 2

Poisson distribution Requirements: Random variable x is the number of occurrences of an event over some interval The occurrences must be random The occurrences must be independent =. The Poisson distribution only depends on (the mean of the process)!

Mean, Variance and Standard deviation of the Poisson distribution Mean: Variance: Standard deviation: Maximum usual value: + 2 Minimum usual value: 2

Continuous distributions Uniform distribution Normal distribution Density curve: Graph of a continuous distribution Properties: Area below the curve is equal to 1 All points in the curve are greater or equal than zero

Uniform and Normal distributions

Sampling distributions Variation of the value of a statistics from sample to sample: Sampling variability Sampling distribution of the sample mean Sampling distribution of the sample proportion CENTRAL LIMIT THEOREM: The random variable x has a distribution (normal or not) with mean and standard deviation The distribution of the sample means will approach to a normal distribution as the sample size increases.

Mean and standard deviation of the sample mean Mean: = Standard deviation: =

Normal approximation to the Binomial If np 5 and nq 5 a Binomial random variable x can be approximated with a Normal distribution with mean and standard deviation: Mean: = Standard deviation:

Confidence Interval for the Population Proportion (p) p=population proportion = = sample proportion of successes = 1- = sample proportion of failures

Procedure to build a CI of confidence level (1-1) Check the normal approximation to the Binomial distribution (np 5 and nq 5 ) 2) Get the critical value / 3) Evaluate the margin of error: = /. 4) Confidence Interval: < < + ± (, + ) 5) Interpret results

Sample size for estimating proportion p is given: = [ / ] is not given: ( is assumed = 0.5) = [ / ] 0.25

Finding (point estimate) and E from the Confidence Interval Point estimate: = ( ) Margin of Error: E= ( )

Confidence Interval for the Population Mean ( Check Requirements: Sample is a simple random sample Population standard deviation is known Population is normally distributed or n>30 Procedure 1) Check normality requirements 2) Get the critical value / 3) Evaluate the margin of error: = /. 4) Confidence Interval: < < + ± (, + ) 5) Interpret results

Sample size for estimating Mean =. Values of, and E are given.

Confidence Interval for the Population Mean ( In this case we use the Student t distribution with n-1 degrees of freedom Check Requirements: Sample is a simple random sample Population standard deviation is estimated by s (sample standard dev.) Population is normally distributed or n>30 Procedure 1) Check normality requirements 2) Get the critical value / with n-1 degrees of freedom 3) Evaluate the margin of error: = /. 4) Confidence Interval: < < + ± (, + ) 5) Interpret results

Finding point estimate and E from Confidence Interval Point estimate of : + ( ) = 2 Margin of Error E= ( )