Statistics Independent (X) you can choose and manipulate. Usually on x-axis
- Hubert Richardson
- 5 years ago
Transcription
Statistics-6000

Variable: a characteristic that can take on different values with respect to persons, time, and place. The types of variables are as follows:
o Independent (X): the variable you can choose and manipulate. Usually plotted on the x-axis.
o Dependent (Y): what you measure in the experiment and what is affected during the experiment. Usually plotted on the y-axis.
o Intermediate: a variable in a causal pathway that causes variation in the dependent variable and is itself caused to vary by the independent variable.
o Confounder: an extraneous variable in a statistical model that correlates (positively or negatively) with both the dependent variable and the independent variable. The methodologies of scientific studies therefore need to account for these variables, either through true experimental designs, in which case one achieves control, or through statistical means (internal validity).

Discrete variable: a whole-number, countable variable. Includes the ordinal (ranking) type and the nominal (classificatory, categorical) type (qualitative variable).
Continuous or measurable variable: a variable with no gaps between its possible values. Values have decimal points and units (quantitative variable).

Why statistics in general: collection of data, summarization and analysis of the data set, evaluation, conducting research, and finally drawing conclusions (testing hypotheses).

Specific goals of statistics: defining a normal range (μ and σ), correlation study (relationship), regression study (prediction), association (Chi-Square test) and agreement testing (Cronbach's alpha and Cohen's kappa), testing hypotheses (z, t, F), and quality control (L G Chart).

Sample (n): a small random group of individuals or observations chosen for study from the population. A sample is a part of the population.
Random sample: a sample selected so that every member of the population has an equal chance of being included.
Sampling unit: a part of the population, such as an individual, household, school, section, or village.
Sampling frame: a complete list of the sampling units in the population.

Why we need a sample study:
o Less time
o Less personnel
o Less resources
o Less money
o For in-depth study

Sample size (n): the number of individuals or observations under study (n ≥ 30 is treated as a large sample in the tests later in these notes).

Sampling methods:
o Simple random sampling: each unit has an equal probability of being included in the sample (lottery method, or by using tables of random numbers). Used when the study elements of the population are homogeneous and N is small.
o Stratified sampling: the study elements of the population are heterogeneous and N is larger. The population is divided into strata. Precision (1/SE) of the estimate will be high (SE will be less).
o Systematic sampling: N is very large. K = N/n is the sampling interval. One number X is chosen randomly from 1 to K, and the units X + 0K, X + 1K, X + 2K, X + 3K, ..., X + (n-1)K are included in the sample. Precision of the estimate will be less.
o Cluster sampling: N is large and it is not possible to get a complete listing of the population units. Precision of the estimate will be less.
o Multi-stage sampling: N is very large. Sampling is done in stages. Precision of the estimate will be less.
o Quota sampling (sampling of convenience): n is fixed, and this is not a probability sampling method. Units are not randomly selected. Results cannot be generalized and are applicable to that area only. Not a good sampling method.

Population (N): the aggregate of subjects under consideration; the whole group that the sample is meant to represent.
Parameters (μ and σ) describe the population. Statistics ($\bar{x}$ and SD or s) describe the sample.
Statistical methods: descriptive methods and inference methods.
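The systematic sampling rule above (interval K = N/n, random start X between 1 and K, then every K-th unit) can be sketched in a few lines. This is only an illustration: the function name `systematic_sample`, the frame of 100 unit IDs, and the use of integer division for K are assumptions, not part of the notes.

```python
import random

def systematic_sample(units, n, rng=None):
    """Pick n units: random start X in 1..K, then every K-th unit (K = N // n)."""
    rng = rng or random.Random()
    N = len(units)
    K = N // n                      # sampling interval (integer part of N/n)
    if K < 1:
        raise ValueError("sample size n cannot exceed population size N")
    X = rng.randint(1, K)           # random start, 1-based, inclusive of K
    # 1-based positions X + 0K, X + 1K, ..., X + (n-1)K
    return [units[X - 1 + j * K] for j in range(n)]

population = list(range(1, 101))    # hypothetical sampling frame, N = 100
sample = systematic_sample(population, 10, random.Random(0))
print(sample)
```

Only the starting point is random; every later unit is fixed by the interval, which is why the method is quick for long lists.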
Descriptive methods: frequency tables, diagrams, graphs (bar chart, pie chart, pictogram, histogram, frequency polygon and curves, linearity), arithmetic, geometric, or weighted mean, median, mode, range, quartile deviation (IQR), mean deviation, standard deviation (SD), coefficient of variation (CV%), correlation coefficient (r, the Pearson product-moment correlation), and regression analysis used for prediction.

Inference analysis: used to generalize the results obtained from the random sample to the population from which the representative sample was selected. The two main components of the inference method are:
o Estimation of parameters (population values)
o Testing the statistical significance of the hypothesis

Measures of location: mean, mode, and median. Each is a single value that represents the distribution. When these values describe a population they are called parameters. If they describe a sample they are referred to as statistics.

Mean: $\bar{x} = \frac{\sum x}{n}$ (sample) or $\mu = \frac{\sum x}{N}$ (population), that is, the sum of the values divided by their number.

Median: the middlemost value of the arranged data set (continuous distribution). Its value is not affected by extreme values, so the median is preferred to the mean when there are extreme values or when the sample is not normally distributed.

Mode: the most frequent observation in the data or distribution. A distribution may have more than one mode.

There are two types of data: grouped data and ungrouped data (ungrouped data are very rich in detail).

Why do we group the data? Grouping the actual data collected loses the richness of the actual values, but sometimes we need to hide the actual data from the public and from competitors, or to simplify the data when handling a large data set.

$\sum f = n$ or $N$; the total frequency equals the number of observations (sample size).
Number of classes or groups (k) needed to make a histogram: the smallest k such that $2^k \ge n$ (or $N$).
Class interval size $= \frac{\text{Maximum} - \text{Minimum}}{k}$; this is the increment added to form each class.
For grouped data, the arithmetic mean is $\bar{x} = \frac{\sum m f}{\sum f}$, where m is the mid-value of the class interval.
Mid-value: $m = \frac{L_1 + L_2}{2}$, where $L_1$ is the lower limit and $L_2$ the upper limit; these are real limits only.
$\sum (x - \bar{x}) = 0$, always.
Variance for grouped data: $SD^2$ or $\sigma^2 = \frac{\sum f m^2}{\sum f} - \bar{x}^2$.
While computing the arithmetic mean for a given grouped frequency distribution, it is assumed that all values falling in a particular group or class are located at the midpoint of the group.

Grouped median $= L_1 + \frac{L_2 - L_1}{f}\left(\frac{N}{2} - C\right)$, where f is the frequency of the median class and C is the cumulative frequency of the classes before it. The median class is found with the rule of next (the class whose cumulative frequency is the next value at or above N/2).
If the given class limits are score limits, convert them to real limits.
The cumulative frequency of the last group $= N$ or $n$ or $\sum f$.

Grouped mode $= L_1 + (L_2 - L_1)\,\frac{f - f_1}{2f - f_1 - f_2}$, where f is the frequency of the class with maximum frequency, and $f_1$ and $f_2$ are the frequencies of the classes before and after it.

Quartiles and percentiles: values in a continuous distribution showing the proportion or percentage of observations lying below (or up to) the given value.
$Q_i = L_1 + \frac{L_2 - L_1}{f}\left(\frac{i\,N}{4} - C\right)$; $i = 1, 2, 3$ (very similar to the median formula).

Interquartile range (IQR): reflects the variability among the middle 50% of the observations. Better than the range, which uses the extreme values only.
$Q_1$ (25%), $Q_2$ (50%), and $Q_3$ (75%).
$IQR = Q_3 - Q_1$; it covers 75% - 25% = 50% of the data.
$P_{50} = Q_2$ = median of a continuous data distribution.
Real class limits are used with grouped data for the median, mode, quartiles, and percentiles.
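The grouped-data formulas above (mean and variance from mid-values, median and quartiles by interpolating inside a class) can be checked with a small sketch. The frequency table, the function names, and the `(L1, L2, f)` tuple layout are hypothetical choices made for illustration.

```python
def grouped_mean_variance(classes):
    """classes: list of (L1, L2, f) tuples with real class limits, ascending."""
    total_f = sum(f for _, _, f in classes)
    mids = [(lo + hi) / 2 for lo, hi, _ in classes]          # m = (L1 + L2) / 2
    mean = sum(m * f for m, (_, _, f) in zip(mids, classes)) / total_f
    # variance = sum(f * m^2) / sum(f) - mean^2
    var = sum(f * m * m for m, (_, _, f) in zip(mids, classes)) / total_f - mean ** 2
    return mean, var

def grouped_quantile(classes, fraction):
    """L1 + (L2 - L1) / f * (fraction * N - C); fraction = 0.5 gives the median."""
    total_f = sum(f for _, _, f in classes)
    target = fraction * total_f
    cum = 0                                   # C: cumulative frequency so far
    for lo, hi, f in classes:
        if f > 0 and cum + f >= target:       # first class that reaches the target
            return lo + (hi - lo) * (target - cum) / f
        cum += f
    raise ValueError("fraction must be between 0 and 1")

table = [(0, 10, 4), (10, 20, 6), (20, 30, 10)]   # hypothetical data, N = 20
mean, var = grouped_mean_variance(table)
median = grouped_quantile(table, 0.50)
q1, q3 = grouped_quantile(table, 0.25), grouped_quantile(table, 0.75)
print(mean, var, median, q3 - q1)
```

Percentiles follow from the same function with `fraction = i / 100`.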
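$P_i = L_1 + \frac{L_2 - L_1}{f}\left(\frac{i\,N}{100} - C\right)$; $i = 1, 2, 3, \ldots, 99$ (very similar to the median formula).
The rule of next is used to locate the class interval from the cumulative frequency distribution.

Measures of variability: range, IQR, variance, SD, and coefficient of variation.
Variability is the scatter or dispersion of the data around the mean.
Range = largest observation - smallest observation.

Variance of ungrouped data: $\sigma^2 = \frac{\sum (X - \mu)^2}{N}$ or $SD^2 = \frac{\sum (x - \bar{x})^2}{n - 1}$.
Shortcut form: $SD^2 = \frac{\sum x^2 - (\sum x)^2 / n}{n - 1}$.
For grouped data there is no need for $n - 1$: use $\bar{x} = \frac{\sum f m}{\sum f}$ and $\sigma^2 = \frac{\sum f m^2}{\sum f} - \bar{x}^2$.
$\sigma$ or $SD = +\sqrt{\sigma^2}$ or $+\sqrt{SD^2}$; the unit of the SD is the same as that of the observations.
$CV = \frac{SD}{\bar{x}} \times 100$; it has no unit (a unitless quantity).
CV% is used to compare variation between variables of the same sample or of different samples.

An event = an outcome.
Probability of A: the proportion of times the outcome would occur in a very long series of repetitions.
When all outcomes are equally likely, $P(A) = \frac{m}{n}$ with $0 \le m \le n$, where n is the number of exhaustive, mutually exclusive, equally likely outcomes and m is the number of those outcomes in which A occurs.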
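As a quick check of the ungrouped formulas above, the sketch below computes the range, both variances, the SD, and CV% with Python's standard `statistics` module, and confirms that the shortcut form agrees with the definition. The five data values are made up for illustration.

```python
import statistics

data = [4, 8, 6, 5, 7]                      # hypothetical observations
n = len(data)

mean = statistics.mean(data)
data_range = max(data) - min(data)          # largest - smallest
pop_var = statistics.pvariance(data)        # sum((x - mu)^2) / N
samp_var = statistics.variance(data)        # sum((x - xbar)^2) / (n - 1)
sd = statistics.stdev(data)                 # positive square root of samp_var
cv_percent = sd / mean * 100                # unitless

# Shortcut form: (sum(x^2) - (sum(x))^2 / n) / (n - 1)
shortcut = (sum(x * x for x in data) - sum(data) ** 2 / n) / (n - 1)

print(mean, data_range, pop_var, samp_var, round(cv_percent, 2))
```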
Independent events: two events are independent if the presence or absence of one does not alter the chances of the other being present, or if the occurrence of one does not alter the chance of occurrence of the other (this means they can occur together).
Mutually exclusive events: events that cannot both occur together or be present at the same time. There is no overlap between the outcomes. Example: a coin flip gives a head or a tail.

Additive rule (mutually exclusive events): the probability of occurrence of two or more mutually exclusive events is the sum of the probabilities of each outcome.
$P(A \text{ or } B) = P(A) + P(B)$, e.g. throwing a die for an odd number (1, 3, or 5 are mutually exclusive events).

Multiplicative rule (independent events): the probability of simultaneous occurrence of events A and B in a series of independent trials (i.e. the chance of one outcome occurring is not affected by knowledge of whether or not the other occurred) is the product of their probabilities.
$P(A \text{ and } B) = P(A) \times P(B)$ for independent events.

General additive rule: if the two events are not mutually exclusive, the probability that either event A or B occurs is
$P(A \text{ or } B \text{ or both}) = P(A) + P(B) - P(A \text{ and } B)$.

Discrete probability distribution (DPD): the sum of the $p(x)$ values is 1, the probability of each outcome is between 0 and 1, and the outcomes are mutually exclusive.
$\mu = \sum x_i\,p(x_i)$ and $\sigma^2 = \sum (x_i - \mu)^2\,p(x_i)$ for a discrete probability distribution.

Conditional probability: $P(A \mid B) = \frac{P(A \cap B)}{P(B)}$.
Joint probability: $P(A \cap B) = P(A) \times P(B)$ for independent events (the multiplicative rule).

Binomial distribution: has only two outcomes, one or zero. It is a discrete distribution.
$p(x) = \binom{n}{x} p^x q^{\,n-x}$, $0 \le x \le n$; $\binom{n}{x}$ is called the binomial coefficient.
$\binom{n}{0} = 1$, $\binom{n}{n} = 1$, $0! = 1$, and $p + q = 1$. p is the parameter and n is the degree of the binomial distribution. n and p are fixed, the trials are independent, and only two outcomes are possible.
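The probability rules above can be verified on a fair six-sided die, using exact fractions so no rounding hides a mistake. The events chosen here ("odd" and "greater than 4") and the helper `prob` are illustrative assumptions.

```python
from fractions import Fraction

faces = range(1, 7)
p = {x: Fraction(1, 6) for x in faces}       # discrete distribution of a fair die

def prob(event):
    """P(event) = sum of p(x) over the faces where the event holds."""
    return sum(p[x] for x in faces if event(x))

def odd(x):
    return x % 2 == 1

def gt4(x):
    return x > 4

# Additive rule, mutually exclusive events: P(1 or 2) = P(1) + P(2)
p_one_or_two = p[1] + p[2]

# General additive rule, overlapping events (5 is both odd and > 4)
p_union = prob(odd) + prob(gt4) - prob(lambda x: odd(x) and gt4(x))

# Multiplicative rule, independent trials: two sixes on two throws
p_two_sixes = p[6] * p[6]

# Mean and variance of the discrete distribution
mu = sum(x * p[x] for x in faces)
var = sum((x - mu) ** 2 * p[x] for x in faces)

print(p_one_or_two, p_union, p_two_sixes, mu, var)
```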
The binomial distribution applies when the population is dichotomized, that is, divided into two classes only. p is the probability of success and q is the probability of failure; $p + q = 1$.
The mean of the binomial distribution (expected value) $= \sum x\,p(x) = np$.
The variance of the binomial distribution, $V(x)$ or $\sigma^2 = npq$; if $npq \ge 10$ we can use the normal distribution to approximate the binomial.
In the questions:
o "At least 10" means $P(10 \le x \le n)$.
o "At most 10" means $P(0 \le x \le 10)$.
o "At least one will return" means $1 - p(x = 0)$ in the binomial distribution.

The Poisson distribution: a discrete distribution in which the trials are independent, p is very small, n is very large, and the events are very rare.
$P(x) = \frac{e^{-\lambda}\,\lambda^x}{x!}$; $x = 0, 1, 2, \ldots$
$\lambda$ (the average) $= np$ is the parameter (mean = variance); $e \approx 2.718$.

Normal distribution: applies to continuous distributions with a large number of observations. The curve is bell-shaped and symmetrical about the mean, mean = mode = median, the total area under the curve is 1 square unit, and the curve approximates the histogram (frequency polygon).
The mean of all possible sample means is equal to the population mean; therefore the sample mean is called an unbiased estimator of the population mean.
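A short sketch of the binomial and Poisson formulas above, using only the standard `math` module. The values n = 20 and p = 0.1, and the rare-event case n = 1000 and p = 0.002, are hypothetical numbers picked to show the "at least one" shortcut and how closely the Poisson form tracks the binomial when p is small and n is large.

```python
from math import comb, exp, factorial

def binomial_pmf(x, n, p):
    """p(x) = C(n, x) * p^x * q^(n - x), with q = 1 - p."""
    return comb(n, x) * p ** x * (1 - p) ** (n - x)

def poisson_pmf(x, lam):
    """P(x) = e^(-lambda) * lambda^x / x!"""
    return exp(-lam) * lam ** x / factorial(x)

n, p = 20, 0.1                               # hypothetical trial count and success probability
mean, variance = n * p, n * p * (1 - p)      # np and npq

p_at_least_one = 1 - binomial_pmf(0, n, p)                       # 1 - p(x = 0)
p_at_most_two = sum(binomial_pmf(x, n, p) for x in range(0, 3))  # P(0 <= x <= 2)

# Rare events: compare binomial(n = 1000, p = 0.002) with Poisson(lambda = np = 2)
exact = binomial_pmf(3, 1000, 0.002)
approx = poisson_pmf(3, 1000 * 0.002)

print(mean, variance, p_at_least_one, p_at_most_two, exact, approx)
```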
Empirical rule for a bell-shaped curve (approximate areas under the normal curve):
o $\mu \pm 1\,SD$ = about 68%
o $\mu \pm 2\,SD$ = about 95%
o $\mu \pm 3\,SD$ = about 99.7%
The degree of flatness or peakedness of the curve is determined by the value of $\sigma$ or SD.

Standard normal distribution (Z): $\mu = 0$, $\sigma^2 = 1$, $\sigma = 1$.
$Z = \frac{X - \mu}{\sigma}$; $\lambda$ is the area under the curve after the transformation, and $Z(\lambda)$ is the corresponding point on the horizontal axis.

Estimation of sample size for discrete data (a proportion): $n = \frac{Z^2\,p\,q}{L^2}$, with $Z = 1.96$ (95% CI), $2.58$ (99% CI), or $3.29$ (99.9% CI).
L is the permissible error on either side of the estimate (2L is the width of the interval).
If the permissible error on either side of the estimate is given in %, L is calculated as $\frac{\#}{100} \times p$ (do a pilot study to estimate p).
The population proportion of the characteristic is expected to lie in the interval $(p - L,\ p + L)$.
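The areas and multipliers quoted above can be reproduced with `statistics.NormalDist` from the Python standard library, and the same Z values feed the sample-size rule n = Z²pq/L². The pilot proportion p = 0.30 and permissible error L = 0.05 are hypothetical, and rounding the result up to the next whole unit is an assumption of this sketch.

```python
from math import ceil
from statistics import NormalDist

std_normal = NormalDist(mu=0, sigma=1)

def area_within(k):
    """Area under the standard normal curve between -k and +k SD."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

def z_for_confidence(level):
    """Two-sided Z multiplier, e.g. level = 0.95 gives about 1.96."""
    return std_normal.inv_cdf(0.5 + level / 2)

def sample_size_proportion(p, L, level=0.95):
    """n = Z^2 * p * q / L^2, rounded up."""
    z = z_for_confidence(level)
    return ceil(z * z * p * (1 - p) / (L * L))

for k in (1, 2, 3):
    print(k, round(area_within(k), 4))
for level in (0.95, 0.99, 0.999):
    print(level, round(z_for_confidence(level), 2))

n_needed = sample_size_proportion(p=0.30, L=0.05)   # hypothetical pilot values
print(n_needed)
```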
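Estimation of sample size for continuous data (a mean): $n = \frac{Z^2\,SD^2}{d^2}$, with $Z = 1.96$ (95% CI), $2.58$ (99% CI), or $3.29$ (99.9% CI).
If the permissible error on either side of the estimate is given in %, d is calculated as $\frac{\#}{100} \times \bar{x}$.

95% confidence interval for a mean: $\bar{x} \pm 1.96\,SE(\bar{x})$, with $SE(\bar{x}) = \frac{SD}{\sqrt{n}}$.
95% confidence interval for a proportion: $p \pm 1.96\,SE(p)$, with $SE(p) = \sqrt{\frac{p\,q}{n}}$.
For a proportion, $SD^2 = p\,q$ and $V(p) = \frac{p\,q}{n}$. The prevalence rate counts old and new cases together.
Since $SE(\bar{x}) = \frac{SD}{\sqrt{n}}$, it follows that $SE(p) = \sqrt{\frac{p\,q}{n}}$ for the prevalence rate of the population.

SD: the average amount of deviation of the different sample values from the mean value.
SE: the average amount of deviation of the different means (of different samples) from the population mean.
Average mean deviation $= \frac{\sum |x - \bar{x}|}{n}$.
Positive skew of the curve: mean > median, and the curve is skewed to the right (positive).

Geometric mean $= \sqrt[n]{x_1 \times x_2 \times \cdots \times x_n}$ (the n-th root of the product of all n values); for an average rate of change, $\sqrt[n]{\frac{\text{value at end}}{\text{value at beginning}}} - 1$.
Weighted mean $= \frac{w_1 x_1 + w_2 x_2 + \cdots}{w_1 + w_2 + \cdots}$.

An experiment: the observation of some activity or the act of taking some measurement (e.g. having 3 children from 3 pregnancies).
An outcome: a particular result of an experiment. All the sequences (BBB, BBG, ...) give 8 outcomes.
An event: a collection (subset) of one or more outcomes, e.g. Boy-Girl-Boy.

From A, B, C, if we want groups of 2:
Combinations: $\binom{n}{r} = \frac{n!}{(n-r)!\,r!}$; this is used in binomial probability. AB, BC, AC = 3.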
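The standard-error and confidence-interval formulas above translate directly into code. The summary figures used here (a mean of 120 with SD 15 from n = 100, a prevalence of 0.20 from n = 400, and a permissible error d = 3) are invented for illustration, as are the function names.

```python
from math import ceil, sqrt

def ci_mean(xbar, sd, n, z=1.96):
    """xbar +/- z * SE, with SE = SD / sqrt(n)."""
    se = sd / sqrt(n)
    return xbar - z * se, xbar + z * se

def ci_proportion(p, n, z=1.96):
    """p +/- z * SE, with SE = sqrt(p * q / n)."""
    se = sqrt(p * (1 - p) / n)
    return p - z * se, p + z * se

def sample_size_mean(sd, d, z=1.96):
    """n = Z^2 * SD^2 / d^2, rounded up to a whole unit."""
    return ceil(z * z * sd * sd / (d * d))

low_m, high_m = ci_mean(xbar=120, sd=15, n=100)       # hypothetical summary data
low_p, high_p = ci_proportion(p=0.20, n=400)          # hypothetical prevalence
n_needed = sample_size_mean(sd=15, d=3)

print(round(low_m, 2), round(high_m, 2))
print(round(low_p, 4), round(high_p, 4))
print(n_needed)
```

Swapping `z` for 2.58 or 3.29 gives the 99% and 99.9% intervals listed in the notes.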
Permutations: $^{n}P_r = \frac{n!}{(n-r)!}$; AB, AC, BA, BC, CA, CB = 6.

Simple random sample: each unit or item has an equal chance of being selected.
Sampling error = a sample statistic - the population parameter.

Testing of significance with the t-distribution:
o We reject the null hypothesis when P < 0.05.
o We do not reject (accept) the null hypothesis when P > 0.05.
The P-value is compared with the significance level $\alpha$ (5%, 1%, or 0.1%), which is the rejection (tail) area.

Sampling distribution of the mean: the mean of all possible sample means is equal to the population mean; therefore the sample mean is called an unbiased estimator of the population mean. By the central limit theorem, the distribution of the sample mean is approximately normal for large n.
$V(\bar{X}) = \frac{N - n}{N - 1} \times \frac{\sigma^2}{n}$ if the population is finite.
$V(\bar{X}) = \frac{\sigma^2}{n} = (SE)^2$ if the population is infinite (unlimited).

Chi-Square test: $\chi^2 = \sum \frac{(O - E)^2}{E}$; df = (number of columns - 1) × (number of rows - 1).
If the calculated value is greater than the tabular value, there is an association.
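The Chi-Square statistic above can be computed from a table of observed counts, with each expected count taken as (row total × column total) / grand total, which is the usual rule for a test of association. The 2×2 table below is hypothetical, and 3.841 is the standard tabular value for df = 1 at the 5% level.

```python
def chi_square(table):
    """Return (chi-square statistic, degrees of freedom) for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (observed - expected) ** 2 / expected
    df = (len(table[0]) - 1) * (len(table) - 1)   # (columns - 1) * (rows - 1)
    return stat, df

observed = [[30, 20],    # hypothetical counts: exposed, with / without outcome
            [20, 30]]    # unexposed, with / without outcome
stat, df = chi_square(observed)
TABLE_VALUE_5PCT_DF1 = 3.841
print(stat, df, stat > TABLE_VALUE_5PCT_DF1)
```

Here every expected count is 25, so the statistic is 4 × (5² / 25) = 4.0, which exceeds 3.841 and points to an association.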
One-tailed t-test: $H_0: \mu = \mu_0$ and $H_1: \mu > \mu_0$ or $H_1: \mu < \mu_0$.
P-value: presuming $H_0$ is true, the likelihood of chance variation yielding a t-statistic more extreme than the observed value on either side of 0 (since the $H_1$ direction is both high and low) is .11.
Conclusion: since the P-value > .05, we do not reject $H_0$.
Two-tailed t-test: $H_0: \mu = \mu_0$ and $H_1: \mu \ne \mu_0$.
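One-sample test: comparison of a sample mean with the population mean.
Degrees of freedom = $n - 1$ for the t-test, which is based on the distribution of differences.
If the calculated value of t is greater than the table value, we reject the null hypothesis.
$H_0: \mu = \mu_0 = \#$ (no difference; they are the same and equal). Rejecting $H_0$ when it is actually true is a Type I error.
$H_1: \mu \ne \mu_0$ or $H_1: \mu > \mu_0$ or $H_1: \mu < \mu_0$.
$Z = \frac{\bar{x} - \mu_0}{SE(\bar{x})}$; here $n \ge 30$, where SD is assumed equal to $\sigma$.
$t = \frac{\bar{x} - \mu_0}{SE(\bar{x})}$; here $n < 30$, where $SD \ne \sigma$, even if the population (N) is normally distributed.

Unpaired two-sample test: comparison of two independent sample means.
$H_0: \mu_1 = \mu_2$ ($\mu_1 - \mu_2 = 0$): the samples come from the same population.
$z = \frac{\bar{x}_1 - \bar{x}_2}{SE(\bar{x}_1 - \bar{x}_2)}$; $n \ge 30$.
$SE(\bar{x}_1 - \bar{x}_2) = \sqrt{\frac{SD_1^2}{n_1} + \frac{SD_2^2}{n_2}}$; $n \ge 30$.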
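$t = \frac{\bar{x}_1 - \bar{x}_2}{SE(\bar{x}_1 - \bar{x}_2)}$; $n < 30$; Student's t-distribution.
$SE(\bar{x}_1 - \bar{x}_2) = s\,\sqrt{\frac{1}{n_1} + \frac{1}{n_2}}$; $n < 30$.
$s = \sqrt{\frac{(n_1 - 1)\,SD_1^2 + (n_2 - 1)\,SD_2^2}{n_1 + n_2 - 2}}$; $n < 30$.
Degrees of freedom $= (n_1 - 1) + (n_2 - 1) = n_1 + n_2 - 2$.

Paired sample test: comparison of the means of two correlated samples. The same subjects are in both groups. Under $H_0$ the mean difference of the values is zero.
$H_0: \mu_d = 0$ (the mean of the differences in the population is zero).
$\bar{d} = \frac{\sum d_i}{n}$ and $SD_d = \sqrt{\frac{\sum (d_i - \bar{d})^2}{n - 1}}$.
Degrees of freedom $= n - 1$.
$t = \frac{\bar{d}}{SE(\bar{d})}$, with $SE(\bar{d}) = \frac{SD_d}{\sqrt{n}}$.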
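The small-sample formulas above (pooled SD for two independent samples, and the mean of the differences for paired samples) can be sketched with the standard library. The data values and function names are hypothetical; the resulting t still has to be compared with a t-table value for the stated degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev, variance

def pooled_t(sample1, sample2):
    """Unpaired t with pooled SD; returns (t, degrees of freedom)."""
    n1, n2 = len(sample1), len(sample2)
    pooled_var = ((n1 - 1) * variance(sample1) + (n2 - 1) * variance(sample2)) / (n1 + n2 - 2)
    se = sqrt(pooled_var) * sqrt(1 / n1 + 1 / n2)
    return (mean(sample1) - mean(sample2)) / se, n1 + n2 - 2

def paired_t(before, after):
    """Paired t on the differences d = after - before; returns (t, df)."""
    d = [a - b for b, a in zip(before, after)]
    n = len(d)
    se = stdev(d) / sqrt(n)          # SE = SD_d / sqrt(n)
    return mean(d) / se, n - 1

t_unpaired, df_unpaired = pooled_t([5, 7, 9], [2, 4, 6])        # hypothetical groups
t_paired, df_paired = paired_t([10, 12, 9, 11, 13],             # hypothetical before
                               [12, 14, 10, 13, 16])            # and after values
print(round(t_unpaired, 3), df_unpaired)
print(round(t_paired, 3), df_paired)
```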
If the P-value is low (less than or equal to $\alpha$), the null ($H_0$) must go (be rejected).

Inference for proportions: $H_0: P = P_0$.
$Z = \frac{p - P_0}{SE(p)}$, with $SE(p) = \sqrt{\frac{P_0\,Q_0}{n}}$ and $p = \frac{m}{n}$, where m is the number of cases, so p is the sample prevalence.
$Q_0 = 1 - P_0$ (remember that $P_0$ is the population proportion); p is calculated from the sample of size n.

Two-sample test of proportions: $H_0: P_1 = P_2$ ($P_1 - P_2 = 0$).
$z = \frac{p_A - p_B}{SE(p_A - p_B)}$, for any sample size n.
$p = \frac{r_1 + r_2}{n_1 + n_2}$ is the weighted average proportion, where $r_1$ and $r_2$ are the numbers of cases in the two samples.
$SE(p_A - p_B) = \sqrt{p\,q\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}$, with $q = 1 - p$.

Correlation of (X, Y): df $= n - 2$.
$t = r\,\sqrt{\frac{n - 2}{1 - r^2}}$
If the calculated t-value is greater than the table t-value, X and Y are significantly related to each other.
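The two z-tests for proportions above can be written as a pair of small functions. The counts used (60 cases out of 100 against a hypothesised 0.5, and 40/100 against 30/100) are made-up numbers, and the function names are illustrative.

```python
from math import sqrt

def one_sample_z(m, n, p0):
    """Z = (p - P0) / sqrt(P0 * Q0 / n), with p = m / n."""
    p = m / n
    se = sqrt(p0 * (1 - p0) / n)
    return (p - p0) / se

def two_sample_z(r1, n1, r2, n2):
    """z = (pA - pB) / sqrt(p * q * (1/n1 + 1/n2)), p = pooled proportion."""
    p_a, p_b = r1 / n1, r2 / n2
    p = (r1 + r2) / (n1 + n2)                 # weighted average of the two samples
    se = sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return (p_a - p_b) / se

z_one = one_sample_z(m=60, n=100, p0=0.5)          # hypothetical counts
z_two = two_sample_z(r1=40, n1=100, r2=30, n2=100)
print(round(z_one, 3), round(z_two, 3))
# Compare |z| with 1.96 for a two-tailed test at the 5% level.
```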
Regression: $Y = a + bX$, where a is the y-intercept and b is the slope.
Percentage of total variation in Y explained by X $= 100 \times r^2$.
$t = r\,\sqrt{\frac{n - 2}{1 - r^2}}$; if t (calculated) > t (table), the variables (X, Y) are related to each other.
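To tie the regression and correlation formulas together, the sketch below fits Y = a + bX by least squares, then reports r, the percentage of variation explained (100 × r²), and the t statistic with n - 2 degrees of freedom. Least squares is the usual way to obtain a and b, but the notes do not spell it out, so treat that step, the data, and the names as assumptions.

```python
from math import sqrt
from statistics import mean

def fit_line(xs, ys):
    """Least-squares intercept a, slope b, and Pearson r for paired data."""
    xbar, ybar = mean(xs), mean(ys)
    sxx = sum((x - xbar) ** 2 for x in xs)
    syy = sum((y - ybar) ** 2 for y in ys)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    b = sxy / sxx                      # slope
    a = ybar - b * xbar                # y-intercept
    r = sxy / sqrt(sxx * syy)          # correlation coefficient
    return a, b, r

def correlation_t(r, n):
    """t = r * sqrt((n - 2) / (1 - r^2)), df = n - 2."""
    return r * sqrt((n - 2) / (1 - r * r))

xs = [1, 2, 3, 4, 5]                   # hypothetical X values
ys = [2, 4, 5, 4, 5]                   # hypothetical Y values
a, b, r = fit_line(xs, ys)
explained_percent = 100 * r * r
t = correlation_t(r, len(xs))
print(round(a, 2), round(b, 2), round(r, 4), round(explained_percent, 1), round(t, 3))
```

With these five points the t statistic is about 2.12 on 3 degrees of freedom, below the two-tailed 5% table value of 3.182, so the relationship would not be called significant.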
Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationChapter 2 Descriptive Statistics
Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationDescribing the Relation between Two Variables
Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of
More informationImportant Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.
Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio
More informationChapter 1 (Definitions)
FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationFormulas and Tables for Gerstman
Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationCHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics
CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Itroductio to Statistics Misleadig Iformatio: Surveys ad advertisig claims ca be biased by urepresetative samples, biased questios, iappropriate
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationFinal Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationInferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.
Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike
More informationRegression, Inference, and Model Building
Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship
More informationClass 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10
More informationCorrelation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph
Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical
More informationStat 139 Homework 7 Solutions, Fall 2015
Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More informationLecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.
Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationEconomics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls
Ecoomics 250 Assigmet 1 Suggested Aswers 1. We have the followig data set o the legths (i miutes) of a sample of log-distace phoe calls 1 20 10 20 13 23 3 7 18 7 4 5 15 7 29 10 18 10 10 23 4 12 8 6 (1)
More informationS Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y
1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationStatistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes
Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationMBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS
MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos
More informationAnna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2
Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample
More informationFormulas FROM LECTURE 01 TO 22 W X. d n. fx f. Arslan Latif (mt ) & Mohsin Ali (mc ) Mean: Weighted Mean: Mean Deviation: Ungroup Data
1 Formulas FROM LECTURE 01 TO Mea: fx f Weighted Mea: X w W X i i Wi Mea Deviatio: Ugroup Data d M. D Group Data fi di M. D f d ( X X ) Coefficiet of Mea Deviatio: M. D Co-efficiet of M. D(for mea) Mea
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationData Description. Measure of Central Tendency. Data Description. Chapter x i
Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationINSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43
PAPER NO.: 444, 445 PAGE NO.: Page 1 of 1 INSTRUCTIONS I. You have bee provided with: a) the examiatio paper i two parts (PART A ad PART B), b) a multiple choice aswer sheet (for PART A), c) selected formulae
More informationTables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc.
Table ad Formula for Sulliva, Fudametal of Statitic, e. 008 Pearo Educatio, Ic. CHAPTER Orgaizig ad Summarizig Data Relative frequecy frequecy um of all frequecie Cla midpoit: The um of coecutive lower
More informationSampling Distributions, Z-Tests, Power
Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace
More informationMA238 Assignment 4 Solutions (part a)
(i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative
More informationTopic 10: Introduction to Estimation