Formulas FROM LECTURE 01 TO 22 W X. d n. fx f. Arslan Latif (mt ) & Mohsin Ali (mc ) Mean: Weighted Mean: Mean Deviation: Ungroup Data

Size: px
Start display at page:

Download "Formulas FROM LECTURE 01 TO 22 W X. d n. fx f. Arslan Latif (mt ) & Mohsin Ali (mc ) Mean: Weighted Mean: Mean Deviation: Ungroup Data"

Transcription

1 1 Formulas FROM LECTURE 01 TO Mea: fx f Weighted Mea: X w W X i i Wi Mea Deviatio: Ugroup Data d M. D Group Data fi di M. D f d ( X X ) Coefficiet of Mea Deviatio: M. D Co-efficiet of M. D(for mea) Mea M. D Co-efficiet of M. D(for media) Media Stadard Deviatio: Ugroup Data ( X X ) S. D Group Data f ( X X ) S. D f

2 Shortcut Formula for Ugroup data x x S. D Shortcut Formula for Group Data fx fx S. D f f Co-Efficiet of Stadard Deviatio: S. D X Variace: Ugroup Data Variace ( X X ) Group Data Variace ( ) f X X f Co-Efficiet of Variatio:. C. V S D 100 X Media: + 1 Media item h Media l + c f Harmoic Mea: Ugroup Data X 1 x Group Data f X f x th

3 3 Average Speed: Total Distace Travelled Average Speed Total Time Take Distace Time Speed Distace Speed Time Distace Speed Mode: Time ( fm f 1) X l + h ( fm f 1) + ( fm f ) Empirical Relatio betwee Mea, Media ad Mode: Mode 3Media Mea Rage: m 0 Rage X X Mid-Rage: Mid Rage X 0 + X m Mid Quatile Rage: Mid Rage Q1 + Q3 Coefficiet of Dispersio: Coefficiet of Dispersio X X m m X + X 0 0 Geometric Mea: Ugroup Data log X G. M ati log Group Data f log X G. M ati log f

4 4 Quartile Deviatio: Q3 Q1 Q. D Q Q Co efficiet of Q. D Q + Q th + 1 Q1 item 4 h Q1 l + c f 4 h 3 Q3 l + c f 4 Skewess: SK Mea Mode S. D Pearso s Coefficiet of Skewess 3( Mea Media) SK S. D Bowley s coefficiet of Skewess Q1 + Q3 ( Media) SK Q Q Kurtosis: K Q. D P P Momets: Ugroup Data ( X i X ) m1 0 ( X i X ) m 3 ( X i X ) m3 Group Data m 3 f ( X X ) Momet Ratios: i f ( m3 ) m4 ad b ( m ) ( m ) b 1 3 3

5 5 Equatio of a Straight Lie Y a + bx Stadard Error of Estimate: s yx Y a Y b XY Normal Equatios: Y a + b X XY a X + b X Correlatio: r ( X X )( Y Y ) ( X X ) ( Y Y ) PEARSON S COEFFICIENT OF CORRELATION r XY ( X )( Y ) / [ X ( X ) / ][ Y ( Y ) / ] Liear Correlatio of Coefficiet: r xy ( x)( y) x ( x) y ( y) Regressio Lie of Y o X: xy ( x)( y) byx x ( x) Divisio of Circle: Cell Frequecy 360 Total Frequecy Biomial Coefficiet:( ( C r ) ( a + b) a b r 0 r r r

6 6 1. Cumulative Law: A B B A A B B A. Associative Law: ( A B) C A ( B C) ( A B) C A ( B C) 3. Distributive Law: A (B C) (A B) (A C) A (B C) (A B) (A C) 4. Idempotet Law: A A A A A A 5. Idetity Law: A S S A S A A φ A A φ φ0 6. Complemetatio Law: A A S A A φ ( A) A S φ φ S 7. De Morga s Law: ( ) ( ) A B A B A B A B Rule of Permutatio:! r! ( ) Rule of Combiatio:! r! r! ( )

7 7 Probability Formula: m P(A) No. of Favourable Outcomes P(A) Total No. of Possible Outcomes Mutually Exclusive Evets: If A ad B mutually exclusive evets, P(A B) P(A) + P(B) Complemetatio Law of Probability: P(A) 1 P( A) Additio Law: P(A B) P( A) + P( B) P( A B) Coditioal Probability: P(A/B) P(B/A) P(A B) P(B) P(A B) P(A) Multiplicatio Law: P(A B) P(A)P(B/A) Baye s Theorem: P(A B) 1 P( A ). P( B A ) 1 1 P( A ). P( B A ) + P( A ). P( B A ) 1 1 Computatio of Stadard Deviatio Probability Distributio: X P( X ) XP( X ) S. D.( X ) P( X ) P( X ) S. D.( X ) X P( X ) XP( X ) As we kow, P( X ) 1 Coefficiet of Variatio: 100 µ

8 8 Total Cards 5 Diamod 13 Heart 13 Club 13 Spades 13 Face Cards 1 Quee 4 Jack 4 Kig 4 Jocker 4 FROM LECTURE 3 TO 45 Mathematical Expectatios: E( X ) x f ( x ) i 1 i i E(X) is also called the mea of X ad is usually deoted by the letter µ. If the values are equally likely: 1 E( X ) X i If X is a discrete radom variable ad if a ad b are costats, the: E ( ax + b) ae( X ) + b If µ is a Populatio Mea the: E ( X µ ) E( X ) [ E( X )] Shortcut Formula for the Variace i Mathematical Expectatios: E( X ) [ E( X )] Skewess of Probability: µ 3 β1 3 µ Kurtosis of Probability: 4 µ β1 µ Mea o Probability: E( X ) XP( X )

9 9 If the Parameters i distributio are a ad b the µ a + b Mea of the Populatio Distributio E( X ) xf ( x) µ Mea of the SAMPLIN DISTRIBUTION of X µ E( X ) x f ( x) x µ µ x Variace of Probability: Var( X ) X P( X ) [ XP( X )] Expected value of Variace Var( X ) or E( X µ ) ( x µ ) f ( x) If the Parameters i distributio are a ad b the ( b a) 1 Variace of the Populatio Distributio x f ( x) x f ( x) Variace of the SAMPLIN DISTRIBUTION of X Ugroup Data X X ( X ) Group Data X X f ( X ) X f ( X ) I case of samplig with replacemet: x I case of samplig without replacemet from a fiite populatio: x N N 1 i Fiite Populatio Correctio (fpc) N N 1

10 10 Biomial Distributio: x P( X x) p q x x IN CASE OF NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION f ( x) x x x p q Joit Probability Fuctio: f ( x, y ) kth Momet: m ' r X r i Origi of Radom Variable µ k E( Xk) X k f ( x) Mea of Radom Variable µ k i E( X µ ) k ( X µ ) k f ( x) Chebychev s Iequality: i I the case of a Discrete Probability Distributio 1 P( µ k < X < µ + k ) 1 k E µ µ [( X ) ] E( X ) MARGINAL PROBABILITY: Margial Probability fuctio of X g( x ) f ( x, y ) i i j j 1

11 11 Margial Probability fuctio of Y m h( y ) f ( x, y ) P( Y y ) i i j j i 1 Coditioal Probability Fuctio: Coditioal Probability fuctio for X f ( x y ) P( X x Y y ) i j i j Coditioal Probability fuctio for Y f ( y x ) P( Y y X x ) j i j i HYPER GEOMETRIC PROBABILITY DISTRIBUTION: P( X x) k N k ( x )( x ) N ( ) Where N umber of uits i the populatio, umber of uits i the sample, ad k umber of successes i the populatio. HYPER GEOMETRIC DISTRIBUTION: Mea of Hyper Geometric Distributios k µ N Variace of Hyper Geometric Distributio k N k N N N N 1 Biomial Probability Distributio: Lim p 0 b(x;,p) x 0,1,,..., e.1788 µ p e x! µ x µ UNIFORM DISTRIBUTION: A radom variable X is said to be uiformly distributed if its desity fuctio is defied as

12 1 1 f ( x), a x b b a Poisso Process Formula: e P( X x) ( t) x! λ t x λ P( X x) e µ x µ x! Where λ average umber of occurreces of the outcome of iterest per uit of time, t umber of time-uits uder cosideratio, ad x umber of occurreces of the outcome of iterest i t uits of time. NORMAL DISTRIBUTION: 1 f ( x) e π Where : π or 7 e x µ Momet Ratios i Probability: The momet ratios of the ormal distributio come out to be 0 ad 3 respectively: µ 0 β µ ( ) 4 4 µ 3 β 3 µ ( ) Stadardizatio Formula: Z X µ I case of X X µ x Z x

13 13 Proportio of Successes i the Populatio: X p or p N X p Proportio of Success X Number of successes i the populatio Samplig distributio of p SAMPLING DISTRIBUTION OF DIFFERENCES BETWEEN MEANS: µ µ µ x x 1 ad µ x1 x 1 1 x x 1 ad x x 1 ( x1 x) f ( x1 x) d f ( d) df ( d) x1 + x Z ( X1 X ) ( µ 1 µ ) SAMPLING DISTRIBUTION OF p µ p p f ( p ) Stadard Deviatio of the Samplig Distributio: p q p1 + p p q Modified Formula for the Sample Variace: S ( x x) 1

14 14 Relative Efficiecy: E f Var( T ) Var( T ) 1 Differece betwee the Meas of Two Populatios: s s ( x1 x ) ± zα / CONFIDENCE INTERVAL FOR A POPULATION PROPORTION (P): p ± z α / p (1 p ) Where p Proportio of "success" i hte sample Sample size z α / 1.96 for 95% cofidece.58 for 99% cofidece CONFIDENCE INTERVAL FOR P1-P: p1(1 p1) p(1 p ) ( p1 p) ± zα / + 1 SAMPLE SIZE FOR ESTIMATING POPULATION PROPORTION: p z α / pq ɵ t-distribution: 1 x f ( x) 1 1 v + v vβ, Variace of the t-distributio v v for v > ( v+ 1)/

15 15 F-Distributio: [ ] Γ ( v + v ) / ( v / v ) x < x < 1 Γ( v / ) Γ ( v / ) 1 + v x / v v1 / ( v1 /) ( ), 0 ( v + v )/ f x [ ] 1 1 Mea of the F-distributio: For v > v v Variace of the F-distributio: For v > 4 v ( v + v ) ( ) ( 4) 1 v1 v v Mea Square: Mea Square Sum of Squares Degrees of Freedom Least Sigificat Differece: LSD t α /,( v) ( MSE) r

16 16 Defiitios FROM LECTURE 01 TO Statistics: Statistics is that sciece which eables to draw coclusios about various pheomea oe the basis of real data collected o sample basis. OR Statistics is a sciece of facts ad figures. INFERENTIAL STATISTICS: That brach of Statistics which eables us to draw coclusios or ifereces about various pheomea o the basis of real data collected o sample basis. Data: A well defied collectio of objects is kow as data. Qualitative data: Data that are labels or ames used to idetify a attribute of each elemet. Qualitative data may be oumeric or umeric. Qualitative variable: A variable with qualitative data. Quatitative data: Data that idicate how much or how may of somethig. Quatitative data are always umeric. Variable: A measurable quatity which ca vary from oe idividual or object to aother is called a variable. Nomial Scale: The classificatio or groupig of observatios ito mutually exclusive qualitative categories is said to costitute a omial scale e.g. studets are classified as male ad female. Ordial Scale: It icludes the characteristic of a omial scale ad i additio has the property of orderig or rakig of measuremets e.g. the performace of studets ca be rated as excellet, good or poor. Iterval Scale: A measuremet scale possessig a costat iterval size but ot true zero poit is called a Iterval Scale. Ratio Scale: It is a special kid of a iterval scale i which the scale of measuremet has a true zero poit as its origi.

17 17 Biased Errors: A error is said to be biased whe the observed value is cosistetly ad costatly higher or lower tha the true value. Ubiased Errors or Radom Errors: A error, o the other had, is said to be ubiased whe the deviatios, i.e. the excesses ad defects, from the true value ted to occur equally ofte. Primary Data: The data published or used by a orgaizatio which origially collected them are called primary data thus the primary data are the first had iformatio collected, complied, ad published by a orgaizatio for a certai purpose. Secodary Data: The data published or used by a orgaizatio other tha the oe which origially collected them are kow as secodary data. DIRECT PERSONAL INVESTIGATION: I this method, a ivestigator collects the iformatio persoally from the idividuals cocered. Sice he iterviews the iformats himself, the iformatio collected is geerally cosidered quite accurate ad complete. INDIRECT INVESTIGATION: Sometimes the direct sources do ot exist or the iformats hesitate to respod for some reaso or other. I such a case, third parties or witesses havig iformatio are iterviewed. Populatio: The collectio of all idividuals, items or data uder cosideratio i statistical study is called Populatio. Sample: A sample is a group of uits selected from a larger group (the populatio). By studyig the sample it is hoped to draw valid coclusios about the larger group. OR Sample is that part of the Populatio from which iformatio is collected. SAMPLING FRAME: A samplig frame is a complete list of all the elemets i the populatio. Samplig Error: The samplig error is the differece betwee the sample statistic ad the populatio parameter. No-Samplig Error: Such errors which are ot attributable to samplig but arise i the process of data collectio eve if a complete cout is carried out. Samplig with replacemet: Oce a elemet has bee icluded i the sample, it is retured to the populatio. A previously selected elemet ca be selected agai ad therefore may appear i the sample more tha oce. Samplig without replacemet: Oce a elemet has bee icluded i the sample, it is removed from the populatio ad caot be selected a secod time.

18 18 Stadard error: The stadard deviatio of a poit estimator. AND The degree of scatter of the observed values about the regressio lie measured by what is called stadard deviatio of regressio or stadard error of estimate. Samplig Uit: The uits selected for samplig. A samplig uit may iclude several elemets. NON-RANDOM SAMPLING: Noradom samplig implies that kid of samplig i which the populatio uits are draw ito the sample by usig oe s persoal judgmet. This type of samplig is also kow as purposive samplig. Quota Samplig: Quota samplig is a method of samplig widely used i opiio pollig ad market research. Iterviewers are each give a quota of subjects of specified type to attempt to recruit for example, a iterviewer might be told to go out ad select 0 adult me ad 0 adult wome, 10 teeage girls ad 10 teeage boys so that they could iterview them about their televisio viewig. RANDOM SAMPLING: The theory of statistical samplig rests o the assumptio that the selectio of the sample uits has bee carried out i a radom maer. By radom samplig we mea samplig that has bee doe by adoptig the lottery method. Simple radom samplig: Fiite populatio: a sample selected such that each possible sample of size has the same probability of beig selected. Ifiite populatio: a sample selected such that each elemet comes from the same populatio ad the elemets are selected idepedetly. Pie Chart: Pie Chart cosists of a circle which is divided ito two or more mars i accordace with the umber of distict classes that we have i our data. SIMPLE BAR CHART: A simple bar chart cosists of horizotal or vertical bars of equal width ad legths proportioal to values they represet. MULTIPLE BAR CHARTS: This kid of a chart cosists of a set of grouped bars, the legths of which are proportioate to the values of our variables, ad each of which is shaded or colored differetly i order to aid idetificatio. CLASS BOUNDARIES: The true class limits of a class are kow as its class boudaries. HISTOGRAM: A histogram cosists of a set of adjacet rectagles whose bases are marked off by class boudaries alog the X-axis, ad whose heights are proportioal to the frequecies associated with the respective classes. FREQUENCY POLYGON: A frequecy polygo is obtaied by plottig the class frequecies agaist the mid-poits of the classes, ad coectig the poits so obtaied by straight lie segmets.

19 19 FREQUENCY CURVE: Whe the frequecy polygo is smoothed, we obtai what may be called the frequecy curve. CUMULATIVE FREQUENCY DISTRIBUTION: As i the case of the frequecy distributio of a discrete variable, if we start addig the frequecies of our frequecy table colum-wise, we obtai the colum of cumulative frequecies. AVERAGES (I.E. MEASURES OF CENTRAL TENDENCY): A sigle value which iteded to represet a distributio or a set of data as a whole is called a average. It is more or less a cetral value aroud which the observatios ted to cluster so it is called measure of cetral tedecy. Sice measure of cetral tedecy idicate the locatio of the distributio o X axis so it is also called measure of locatio. The Arithmetic, Geometric ad Harmoic meas Are averages that are mathematical i character, ad give a idicatio of the magitude of the observed values. The Media Idicates the middle positio while the mode provides iformatio about the most frequet value i the distributio or the set of data. THE MODE: The Mode is defied as that value which occurs most frequetly i a set of data i.e. it idicates the most commo result. DOT PLOT: The horizotal axis of a dot plot cotais a scale for the quatitative variable that we wat to represet. The umerical value of each measuremet i the data set is located o the horizotal scale by a dot. GROUPING ERROR: Groupig error refers to the error that is itroduced by the assumptio that all the values fallig i a class are equal to the mid-poit of the class iterval. Ogive: A graph of a cumulative distributio. Dispersio: The variability that exists betwee data set. Rage: The rage is defied as the differece betwee the maximum ad miimum values of a data set. The coefficiet of variatio: The coefficiet of variatio expresses the stadard deviatio as the percetage of the arithmetic mea. Quartiles: Quartiles are those three quatities that divide the distributio ito four equal parts. Quartile Deviatio: The quartile deviatio is defied as half of the differece betwee the first ad third quartiles.

20 0 Quatiles: Collectively the quartiles, the deciles, percetiles ad other values obtaied by equall sub-divisio of the data are called quatiles. Percetiles: Percetiles are those iety ie quatities that divide the distributio ito hudred equall parts Absolute measure of dispersio: A absolute measure of dispersio is oe that measures the dispersio i terms of the same uits or i the square of uits, as the uits of the data. Relative measure of dispersio: Relative measure of dispersio is oe that is expressed i the form of a ratio, co-efficiet of percetage ad is idepedet of the uits of measuremet. COEFFICIENT OF QUARTILE DEVIATION: The Coefficiet of Quartile Deviatio is a pure umber ad is used for COMPARING the variatio i two or more sets of data. Mea Deviatio: The mea deviatio is defied as the arithmetic mea of the deviatios measured either from the mea or from the media, all deviatios beig couted as positive. Variace: Variace is defied as the square of the stadard deviatio. Stadard Deviatio: Stadard Deviatio is defied as the positive square root of the mea of the squared deviatios of the values from their mea. Five-umber summary: A exploratory data aalysis techique that uses the followig five umbers to summarize the data set: smallest value, first quartile, media, third quartile, ad largest value. Momets: Momets are the arithmetic meas of the powers to which the deviatios are raised. Correlatio: Correlatio is a measure of the stregth or the degree of relatioship betwee two radom variables. OR Iterdepedece of two variables is called correlatio. OR Correlatio is a techique which measures the stregth of associatio betwee two variables. RANDOM EXPERIMENT: A experimet which produces differet results eve though it is repeated a large umber of times uder essetially similar coditios is called a Radom Experimet. SAMPLE SPACE: A set cosistig of all possible outcomes that ca result from a radom experimet (real or coceptual), ca be defied as the sample space for the experimet ad is deoted by the letter S. Each possible outcome is a member of the sample space, ad is called a sample poit i that space.

21 1 SIMPLE & COMPOUND EVENTS: A evet that cotais exactly oe sample poit is defied as a simple evet. A compoud evet cotais more tha oe sample poit, ad is produced by the uio of simple evets. MUTUALLY EXCLUSIVE EVENTS: Two evets A ad B of a sigle experimet are said to be mutually exclusive or disjoit if ad oly if they caot both occur at the same time i.e. they have o poits i commo. EXHAUSTIVE EVENTS: Evets are said to be collectively exhaustive, whe the uio of mutually exclusive evets is equal to the etire sample space S. EQUALLY LIKELY EVENTS: Two evets A ad B are said to be equally likely, whe oe evet is as likely to occur as the other. I other words, each evet should occur i equal umber i repeated trials. Skewess: Skewess is the lack of symmetry i a distributio aroud some cetral value (mea, media or mode).it is thus the degree of a symmetry. Kurtosis: Kurtosis is the degree of peakess of a distributio usually take relative to a ormal distributio. MOMENTS: A momet desigates the power to which deviatios are raised before averagig them. Probability: Probability is defied as the ratio of favorable cases over equally likely cases. SUBJECTIVE OR PERSONALISTIC PROBABILITY: Probability i this sese is purely subjective, ad is based o whatever evidece is available to the idividual. It has a disadvatage that two or more persos faced with the same evidece may arrive at differet probabilities. Coditioal Probability: The probability of the occurrece of a evet A whe it is kow that some other evet B has already occurred is called the coditioal probability. MULTIPLICATION LAW: The probability that two evets A ad B will both occur is equal to the probability that oe of the evets will occur multiplied by the coditioal probability that the other evet will occur give that the first evet has already occurred. INDEPENDENT EVENTS: Two evets A ad B i the same sample space S, are defied to be idepedet (or statistically idepedet) if the probability that oe evet occurs, is ot affected by whether the other evet has or has ot occurred, Baye s theorem: A method used to compute posterior probabilities. RANDOM VARIABLE: Such a umerical quatity whose value is determied by the outcome of a radom experimet is called a radom variable.

22 Discrete radom variable: A radom variable that may assume either a fiite umber of values or a ifiite sequece of values. Set: A set is ay well-defied collectio or list of distict objects, e.g. a group of studets, the books i a library, the itegers betwee 1 ad 100, all huma beigs o the earth, etc. SUBSET: A set that cosists of some elemets of aother set, is called a subset of that set. For example, if B is a subset of A, the every member of set B is also a member of set A. If B is a subset of A, we write: (B A) PROPER SUBSET: If a set B cotais some but ot all of the elemets of aother set A, while A cotais each elemet of B, the the set B is defied to be a proper subset of A. Uiversal Set: The origial set of which all the sets we talk about, are subsets, is called the uiversal set (or the space) ad is geerally deoted by S or Ω. VENN DIAGRAM: A diagram that is uderstood to represet sets by circular regios, parts of circular regios or their complemets with respect to a rectagle represetig the space S is called a Ve diagram, amed after the Eglish logicia Joh Ve ( ). The Ve diagrams are used to represet sets ad subsets i a pictorial way ad to verify the relatioship amog sets ad subsets. UNION OF SETS: The uio or sum of two sets A ad B, deoted by (A B), ad read as A uio B, meas the set of all elemets that belog to at least oe of the sets A ad B, that is (A B) INTERSECTION OF SETS: The itersectio of two sets A ad B, deoted by A B, ad read as A itersectio B, meas that the set of all elemets that belog to both A ad B. OPERATIONS ON SETS: The basic operatios are uio, itersectio, differece ad complemetatio. SET DIFFERENCE: The differece of two sets A ad B, deoted by A B or by A (A B), is the set of all elemets of A which do ot belog to B. DISJOINT SETS: Two sets A ad B are defied to be disjoit or mutually exclusive or o-overlappig whe they have o elemets i commo, i.e. whe their itersectio is a empty set i.e. A B φ Cojoit Sets: Two sets A ad B are said to be cojoit whe they have at least oe elemet i commo. COMPLEMENTATION: The particular differece S A, that is, the set of all those elemets of S which do ot belog to A, is called the complemet of A ad is deoted by A or by Ac.

23 3 POWER SET: The class of ALL subsets of a set A is called the Power Set of A ad is deoted by P(A). For example, if A {H, T}, the P(A) {φ, {H}, {T}, {H, T}}. CARTESIAN PRODUCT OF SETS: The Cartesia product of sets A ad B, deoted by A B, (read as A cross B ), is a set that cotais all ordered pairs (x, y), where x belogs to A ad y belogs to B. This set is also called the Cartesia set of A ad B set of A ad B, amed after the Frech mathematicia Ree Descartes ( ). TREE DIAGRAM: The tree is costructed from the left to the right. A tree diagram is a useful device for eumeratig all the possible outcomes of two or more sequetial evets. PERMUTATION: A permutatio is ay ordered subset from a set of distict objects. OR A arragemet of all or some of a set of objects i a defiite order is called permutatio. COMBINATION: A combiatio is ay subset of r objects, selected without regard to their order, from a set of distict objects. OCCURRENCE OF AN EVENT: A evet A is said to occur if ad oly if the outcome of the experimet correspods to some elemet of A. PARTITION OF SETS: A partitio of a set S is a sub-divisio of the set ito o-empty subsets that are disjoit ad exhaustive, i.e. their uio is the set S itself. CLASS OF SETS: A set of sets is called a class. For example, i a set of lies, each lie is a set of poits. DISTRIBUTION FUNCTION: The distributio fuctio of a radom variable X, deoted by F(x), is defied by F(x) P(X < x). FROM LECTURE 3 TO 45 Chebychev s Iequality: If X is a radom variable havig mea µ ad variace > 0, ad k is ay positive costat, the the probability that a value of X falls withi k stadard deviatios of the mea is at least. CONTINUOUS RANDOM VARIABLE: A radom variable X is defied to be cotiuous if it ca assume every possible value i a iterval [a, b], a < b, where a ad b may be ad + respectively. JOINT DISTRIBUTIONS: The distributio of two or more radom variables which are observed simultaeously whe a experimet is performed is called their JOINT distributio.

24 4 BIVARIATE PROBABILITY FUNCTION: The joit or bivariate probability distributio cosistig of all pairs of values (xi, yj). MARGINAL PROBABILITY FUNCTIONS: The poit to be uderstood here is that, from the joit probability fuctio for (X, Y), we ca obtai the INDIVIDUAL probability fuctio of X ad Y. Such idividual probability fuctios are called MARGINAL probability fuctios. CONDITIONAL PROBABILITY FUNCTION: Let X ad Y be two discrete r.v. s with joit probability fuctio f(x, y). The the coditioal probability fuctio for X give Y y, deoted as f(x y). INDEPENDENCE: Two discrete r.v. s X ad Y are said to be statistically idepedet, if ad oly if, for all possible pairs of values (xi, yj) the joit probability fuctio f(x, y) ca be expressed as the product of the two margial probability fuctios. COVARIANCE OF TWO RANDOM VARIABLES: The covariace of two r.v. s X ad Y is a umerical measure of the extet to which their values ted to icrease or decrease together. It is deoted by XY or Cov (X, Y), ad is defied as the expected value of the product. BINOMIAL DISTRIBUTION: The biomial distributio is a very importat discrete probability distributio. It was discovered by James Beroulli about the year PROPERTIES OF A BINOMIAL EXPERIMENT: Every trial results i a success or a failure. The successive trials are idepedet. The probability of success, p, remais costat from trial to trial. The umber of trials,, is fixed i advaced. Biomial probability distributio: A probability distributio showig the probability of x successes i trials of a biomial experimet. Biomial probability fuctio: The fuctio used to compute probabilities i a biomial experimet. Biomial experimet: A probability experimet havig the followig four properties: cosists of idetical trials, two outcomes (success ad failure) are possible o each trial, probability of success does ot chage from trial to trail, ad the trials are idepedet. PROPERTIES OF A BINOMIAL EXPERIMENT: Every item selected will either be defective (i.e. success) or ot defective (i.e. failure) Every item draw is idepedet of every other item The probability of obtaiig a defective item i.e. 7% is the same (costat) for all items. (This probability figure is accordig to relative frequecy defiitio of probability. Hyper Geometric Probability fuctio: The fuctio used to compute the probability of x successes i trials whe the trials are depedet.

25 5 PROPERTIES OF HYPERGEOMETRIC EXPERIMENT: The outcomes of each trial may be classified ito oe of two categories, success ad failure. The probability of success chages o each trial. The successive trials are ot idepedet. The experimet is repeated a fixed umber of times. Hyper Geometric Distributio: There are may experimets i which the coditio of idepedece is violated ad the probability of success does ot remai costat for all trials. Such experimets are called hyper geometric experimets. PROPERTIES OF THE HYPERGEOMETRIC DISTRIBUTION: If N becomes idefiitely large, the hyper geometric probability distributio teds to the BINOMIAL probability distributio. The above property will be best uderstood with referece to the followig importat poits: There are two ways of drawig a sample from a populatio, samplig with replacemet, ad samplig without replacemet. Also, a sample ca be draw from either a fiite populatio or a ifiite populatio. Poisso Probability Distributio: A probability distributio showig the probability of x occurreces of a evet over a specified iterval of time or space. Poisso Probability Fuctio: The fuctio used to compute Poisso probabilities. PROPERTIES OF POISSON DISTRIBUTION: It is a limitig approximatio to the biomial distributio, whe p, the probability of success is very small but, the umber of trials is so large that the product p µ is of a moderate size; a distributio i its ow right by cosiderig a POISSON PROCESS where evets occur radomly over a specified iterval of time or space or legth. POISSON PROCESS: It may be defied as a physical process govered at least i part by some radom mechaism. NORMAL DISTRIBUTION: A cotiuous radom variable is said to be ormally distributed with mea μ ad stadard deviatio if its probability desity fuctio is give by (Formula of Normal Distributio) THE STANDARD NORMAL DISTRIBUTION: A ormal distributio whose mea is zero ad whose stadard deviatio is 1 is kow as the stadard ormal distributio. THE PROCESS OF STANDARDIZATION: I other words, the stadardizatio formula coverts our ormal distributio to the oe whose mea is 0 ad whose stadard deviatio is equal to 1. SAMPLING DISTRIBUTION: The probability distributio of ay statistic (such as the mea, the stadard deviatio, the proportio of successes i a sample, etc.) is kow as its samplig distributio.

26 6 CENTRAL LIMIT THEOREM: The theorem states that: If a variable X from a populatio has mea μ ad fiite variace, the the samplig distributio of the sample mea X approaches a ormal distributio with mea μ ad variace / as the sample size approaches ifiity. Samplig Distributio: A probability distributio cosistig of all possible values of a sample statistic. SAMPLING DISTRIBUTION OF THE SAMPLE PROPORTION: I this regard, the first poit to be oted is that, wheever the elemets of a populatio ca be classified ito two categories, techically called success ad failure, we may be iterested i the proportio of successes i the populatio. POINT ESTIMATION: Poit estimatio of a populatio parameter provides as a estimate a sigle value calculated from the sample that is likely to be close i magitude to the ukow parameter. UNBIASEDNESS: A estimator is defied to be ubiased if the statistic used as a estimator has its expected value equal to the true value of the populatio parameter beig estimated. CONSISTENCY: A estimator θˆ is said to be a cosistet estimator of the parameter θ if, for ay arbitrarily small positive quatity e. EFFICIENCY: A ubiased estimator is defied to be efficiet if the variace of its samplig distributio is smaller tha that of the samplig distributio of ay other ubiased estimator of the same parameter. METHODS OF POINTESTIMATION: The Method of Momets The Method of Least Squares The Method of Maximum Likelihood These methods give estimates which may differ as the methods are based o differet theories of estimatio. THE METHOD OF LEAST SQUARES: The method of Least Squares, which is due to Gauss ( ) ad Markov ( ), is based o the theory of liear estimatio. It is regarded as oe of the importat methods of poit estimatio. METHOD OF MAXIMUM LIKELIHOOD: This method was itroduced i 19 by Sir Roald A. Fisher ( ).The mathematical techique of fidig Maximum Likelihood Estimators is a bit advaced, ad ivolves the cocept of the Likelihood Fuctio. HYPOTHESIS-TESTING: It is a procedure which eables us to decide o the basis of iformatio obtaied from sample data whether to accept or reject a statemet or a assumptio about the value of a populatio parameter. NULL HYPOTHESIS: A ull hypothesis, geerally deoted by the symbol H0, is ay hypothesis which is to be tested for possible rejectio or ullificatio uder the assumptio that it is true.

27 7 ALTERNATIVE HYPOTHESIS: A alterative hypothesis is ay other hypothesis which we are willig to accept whe the ull hypothesis H0 is rejected. It is customarily deoted by H1 or HA. TYPE-I AND TYPE-II ERRORS: O the basis of sample iformatio, we may reject a ull hypothesis H0, whe it is, i fact, true or we may accept a ull hypothesis H0, whe it is actually false. The probability of makig a Type I error is covetioally deoted by α ad that of committig a Type II error is idicated by β. TEST-STATISTIC: A statistic (i.e. a fuctio of the sample data ot cotaiig ay parameters), which provides a basis for testig a ull hypothesis, is called a test-statistic. PROPERTIES OF STUDENT S t-distribution: i) The t-distributio is bell-shaped ad symmetric about the value t 0, ragig from to. ii) The umber of degrees of freedom determies the shape of the t-distributio. PROPERTIES OF F-DISTRIBUTION: 1. The F-distributio is a cotiuous distributio ragig from zero to plus ifiity.. The curve of the F-distributio is positively skewed. ANALYSIS OF VARIANCE (ANOVA): It is a procedure which eables us to test the hypothesis of equality of several populatio meas. EXPERIMENTAL DESIGN: By a experimetal desig, we mea a pla used to collect the data relevat to the problem uder study i such a way as to provide a basis for valid ad objective iferece about the stated problem. The pla usually icludes: The selectio of treatmets, whose effects are to be studied, The specificatio of the experimetal layout, ad The assigmet of treatmets to the experimetal uits. SYSTEMATIC AND RANDOMIZED DESIGNS: I this course, we will be discussig oly the radomized desigs, ad, i this regard, it should be oted that for the radomized desigs, the aalysis of the collected data is carried out through the techique kow as Aalysis of Variace. THE COMPLETELY RANDOMIZED DESIGN (CR DESIGN): A completely radomized (CR) desig, which is the simplest type of the basic desigs, may be defied as a desig i which the treatmets are assiged to experimetal uits completely at radom, i.e. the radomizatio is doe without ay restrictios. THE RANDOMIZED COMPLETE BLOCK DESIGN (RCB DESIGN): A radomized complete block (RCB) desig is the oe i which The experimetal material (which is ot homogeeous overall) is divided ito groups or blocks i such a maer that the experimetal uits withi a particular block are relatively homogeeous. Each block cotais a complete set of treatmets, i.e., it costitutes a replicatio of treatmets. The treatmets are allocated at radom to the experimetal uits withi each block, which meas the radomizatio is restricted. (A ew radomizatio is made for every block.)the object of this type of arragemet is to brig the variability of the experimetal material uder cotrol.

28 8 Least Squares Method: The method used to develop the estimated regressio equatio. It miimizes the sum of squared residuals (the deviatios betwee the observed values of the depedet variable, yi, ad the estimated values of the depedet variable, yi) Level of Sigificace: Level of sigificace of a test is the probability used as a stadard for rejectig ull hypothesis Ho whe Ho is assumed to be true. The level of sigificace acts as a basis for determiig the critical regio of the test. THE LEAST SIGNIFICANT DIFFERENCE (LSD) TEST: Accordig to this procedure, we compute the smallest differece that would be judged sigificat, ad compare the absolute values of all differeces of meas with it. This smallest differece is called the least sigificat differece or LSD. PROPERTIES OF THE CHI-SQUARE DISTRIBUTION: The Chi-Square (χ) distributio has the followig properties: 1. It is a cotiuous distributio ragig from 0 to +.The umber of degrees of freedom determies the shape of the chi-square distributio. (Thus, there is a differet chi-square distributio for each umber of degrees of freedom. As such, it is a whole family of distributios.). The curve of a chi-square distributio is positively skewed. The skewess decreases as ν icreases. ASSUMPTIONS OF THE CHI-SQUARE TEST OF GOODNESS OF FIT: While applyig the chi-square test of goodess of fit, certai requiremets must be satisfied, three of which are as follows: 1. The total umber of observatios (i.e. the sample size) should be at least 50.. The expected umber ei i ay of the categories should ot be less tha 5. (So, whe the expected frequecy ei i ay category is less tha 5, we may combie this category with oe or more of the other categories to get ei 5.) 3. The observatios i the sample or the frequecies of the categories should be idepedet. DEGREES OF FREEDOM: As you will recall, whe discussig the t-distributio, the chi-square distributio, ad the F-distributio, it was coveyed to you that the parameters that exists i the equatios of those distributios are kow as degrees of freedom. P value: The p-value is a property of the data, ad it idicates how improbable the obtaied result really is. LATEST STATISTICAL DEFINITION: Statistics is a sciece of decisio makig for goverig the state affairs. It collects aalyzes, maages, moitors, iterprets, evaluates ad validates iformatio. Statistics is Iformatio Sciece ad Iformatio Sciece is Statistics. It is a applicable sciece as its tools are applied to all scieces icludig humaities ad social scieces.

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

Quick Review of Probability

Quick Review of Probability Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter & Teachig Material.

More information

Quick Review of Probability

Quick Review of Probability Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter 2 & Teachig

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Chapter 1 (Definitions)

Chapter 1 (Definitions) FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Statistics Independent (X) you can choose and manipulate. Usually on x-axis

Statistics Independent (X) you can choose and manipulate. Usually on x-axis Statistics-6000 Variable: are characteristic that ca take o differet values with respect to persos, time, ad place ad types of variables are as follow: Idepedet (X) you ca choose ad maipulate. Usually

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls Ecoomics 250 Assigmet 1 Suggested Aswers 1. We have the followig data set o the legths (i miutes) of a sample of log-distace phoe calls 1 20 10 20 13 23 3 7 18 7 4 5 15 7 29 10 18 10 10 23 4 12 8 6 (1)

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Lecture 5. Random variable and distribution of probability

Lecture 5. Random variable and distribution of probability Itroductio to theory of probability ad statistics Lecture 5. Radom variable ad distributio of probability prof. dr hab.iż. Katarzya Zarzewsa Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za

More information

1036: Probability & Statistics

1036: Probability & Statistics 036: Probability & Statistics Lecture 0 Oe- ad Two-Sample Tests of Hypotheses 0- Statistical Hypotheses Decisio based o experimetal evidece whether Coffee drikig icreases the risk of cacer i humas. A perso

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

What is Probability?

What is Probability? Quatificatio of ucertaity. What is Probability? Mathematical model for thigs that occur radomly. Radom ot haphazard, do t kow what will happe o ay oe experimet, but has a log ru order. The cocept of probability

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution. Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio

More information

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Lecture 4. Random variable and distribution of probability

Lecture 4. Random variable and distribution of probability Itroductio to theory of probability ad statistics Lecture. Radom variable ad distributio of probability dr hab.iż. Katarzya Zarzewsa, prof.agh Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.

Lecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying. Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos

More information

General IxJ Contingency Tables

General IxJ Contingency Tables page1 Geeral x Cotigecy Tables We ow geeralize our previous results from the prospective, retrospective ad cross-sectioal studies ad the Poisso samplig case to x cotigecy tables. For such tables, the test

More information

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics

CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Introduction to Statistics CHAPTER SUMMARIES MAT102 Dr J Lubowsky Page 1 of 13 Chapter 1: Itroductio to Statistics Misleadig Iformatio: Surveys ad advertisig claims ca be biased by urepresetative samples, biased questios, iappropriate

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for Sample questios Suppose that humas ca have oe of three bloodtypes: A, B, O Assume that 40% of the populatio has Type A, 50% has type B, ad 0% has Type O If a perso has type A, the probability that they

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes. Closed book ad otes. No calculators. 120 miutes. Cover page, five pages of exam, ad tables for discrete ad cotiuous distributios. Score X i =1 X i / S X 2 i =1 (X i X ) 2 / ( 1) = [i =1 X i 2 X 2 ] / (

More information

Axioms of Measure Theory

Axioms of Measure Theory MATH 532 Axioms of Measure Theory Dr. Neal, WKU I. The Space Throughout the course, we shall let X deote a geeric o-empty set. I geeral, we shall ot assume that ay algebraic structure exists o X so that

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10

More information

Formulas and Tables for Gerstman

Formulas and Tables for Gerstman Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary

More information

Notation List. For Cambridge International Mathematics Qualifications. For use from 2020

Notation List. For Cambridge International Mathematics Qualifications. For use from 2020 Notatio List For Cambridge Iteratioal Mathematics Qualificatios For use from 2020 Notatio List for Cambridge Iteratioal Mathematics Qualificatios (For use from 2020) Mathematical otatio Eamiatios for CIE

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Sampling, Sampling Distribution and Normality

Sampling, Sampling Distribution and Normality 4/17/11 Tools of Busiess Statistics Samplig, Samplig Distributio ad ormality Preseted by: Mahedra Adhi ugroho, M.Sc Descriptive statistics Collectig, presetig, ad describig data Iferetial statistics Drawig

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { } UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Discrete probability distributions

Discrete probability distributions Discrete probability distributios I the chapter o probability we used the classical method to calculate the probability of various values of a radom variable. I some cases, however, we may be able to develop

More information

Lecture 1 Probability and Statistics

Lecture 1 Probability and Statistics Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark

More information

CS284A: Representations and Algorithms in Molecular Biology

CS284A: Representations and Algorithms in Molecular Biology CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Categorical Data Analysis

Categorical Data Analysis Categorical Data Aalysis Refereces : Ala Agresti, Categorical Data Aalysis, Wiley Itersciece, New Jersey, 2002 Bhattacharya, G.K., Johso, R.A., Statistical Cocepts ad Methods, Wiley,1977 Outlie Categorical

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information