A SHORT INTRODUCTION TO PROBABILITY

Similar documents
Introduction to Probability. Experiments. Sample Space. Event. Basic Requirements for Assigning Probabilities. Experiments

The science of learning from data.

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

University of Jordan Fall 2009/2010 Department of Mathematics

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

Learning Objectives for Stat 225

Introduction to Statistics

Chapter 2: Tools for Exploring Univariate Data

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Binomial and Poisson Probability Distributions

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

REVIEW: Midterm Exam. Spring 2012

Statistics for Economists. Lectures 3 & 4

Conditional Probability Solutions STAT-UB.0103 Statistics for Business Control and Regression Models

Probability and Probability Distributions. Dr. Mohammed Alahmed

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

Introduction to Statistics

Lectures of STA 231: Biostatistics

Statistics. Industry Business Education Physics Chemistry Economics Biology Agriculture Psychology Astronomy, etc. GFP - Sohar University

Math Review Sheet, Fall 2008

Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.)

Lecture 6. Probability events. Definition 1. The sample space, S, of a. probability experiment is the collection of all

Glossary for the Triola Statistics Series

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr.

Contents. Acknowledgments. xix

Quantitative Methods for Decision Making

QUANTITATIVE TECHNIQUES

BIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.

What is Probability? Probability. Sample Spaces and Events. Simple Event

13.1 Categorical Data and the Multinomial Experiment

Probability- describes the pattern of chance outcomes

Tabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles.

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

2.4. Conditional Probability

Random Variables Example:

FREQUENCY DISTRIBUTIONS AND PERCENTILES

Lecture 28 Chi-Square Analysis

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

Binomial random variable

Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. Syllabus at the F.Y. B.Sc. / B.A. In Statistics

104 Business Research Methods - MCQs

Lecture 2: Categorical Variable. A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

8/4/2009. Describing Data with Graphs

C A R I B B E A N E X A M I N A T I O N S C O U N C I L REPORT ON CANDIDATES WORK IN THE CARIBBEAN ADVANCED PROFICIENCY EXAMINATION MAY/JUNE 2009

Psych 230. Psychological Measurement and Statistics

Counting principles, including permutations and combinations.

AP Final Review II Exploring Data (20% 30%)

DSST Principles of Statistics

Chapter 4a Probability Models

MTH001- Elementary Mathematics Solved Final Term Papers For Final Term Exam Preparation

Math 221, REVIEW, Instructor: Susan Sun Nunamaker

Sets and Set notation. Algebra 2 Unit 8 Notes

Conditional Probability

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Example. χ 2 = Continued on the next page. All cells

Probability and Sample space

CIVL 7012/8012. Collection and Analysis of Information

Copyright c 2006 Jason Underdown Some rights reserved. choose notation. n distinct items divided into r distinct groups.

Suppose that you have three coins. Coin A is fair, coin B shows heads with probability 0.6 and coin C shows heads with probability 0.8.

Lecture 41 Sections Mon, Apr 7, 2008

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

16.400/453J Human Factors Engineering. Design of Experiments II

Relate Attributes and Counts

3 PROBABILITY TOPICS

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail}

Lecture (chapter 13): Association between variables measured at the interval-ratio level

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

Analytical Graphing. lets start with the best graph ever made

Analytical Graphing. lets start with the best graph ever made

AP Statistics Ch 6 Probability: The Study of Randomness

3. DISCRETE PROBABILITY DISTRIBUTIONS

Statistics Statistical Process Control & Control Charting

0 0'0 2S ~~ Employment category

Midterm Review Honors ICM Name: Per: Remember to show work to receive credit! Circle your answers! Sets and Probability

MAT2377. Ali Karimnezhad. Version September 9, Ali Karimnezhad

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

DEFINITION: IF AN OUTCOME OF A RANDOM EXPERIMENT IS CONVERTED TO A SINGLE (RANDOM) NUMBER (E.G. THE TOTAL

15: CHI SQUARED TESTS

Chapter2 Description of samples and populations. 2.1 Introduction.

Sampling Populations limited in the scope enumerate

Discrete probability distributions

Answers Part A. P(x = 67) = 0, because x is a continuous random variable. 2. Find the following probabilities:

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326

Micro Syllabus for Statistics (B.Sc. CSIT) program

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

MATH 10 INTRODUCTORY STATISTICS

Probability and Statistics

G.PULLAIAH COLLEGE OF ENGINEERING & TECHNOLOGY DEPARTMENT OF ELECTRONICS & COMMUNICATION ENGINEERING PROBABILITY THEORY & STOCHASTIC PROCESSES

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic

Standard & Conditional Probability

Management Programme. MS-08: Quantitative Analysis for Managerial Applications

UNIT 4 MATHEMATICAL METHODS SAMPLE REFERENCE MATERIALS

2011 Pearson Education, Inc

Transcription:

A Lecture for B.Sc. 2 nd Semester, Statistics (General) A SHORT INTRODUCTION TO PROBABILITY By Dr. Ajit Goswami Dept. of Statistics MDKG College, Dibrugarh 19-Apr-18 1

Terminology The possible outcomes of a stochastic process are called events. (A deterministic process has only one possible outcome.) A stochastic process may have a finite or an infinite number of outcomes. The probability of a particular event is the fraction of outcomes in which the event occurs. The probability of event A is denoted by P(A). 19-Apr-18 2

Terminology Probability values are between 0 (the event never occurs) and 1 (the event always occurs). Events may or may not be mutually exclusive. Events that are not mutually exclusive are called independent events. 19-Apr-18 3

The birth of a son or a daughter are mutually exclusive events. The birth of a daughter and the birth of carrier of the sickle-cell anemia allele are not mutually exclusive (they are independent events). 19-Apr-18 4

Terminology The sum of probabilities of all mutually exclusive events in a process is 1. For example, if there are n possible mutually exclusive outcomes, then n P(i) 1 i 1 19-Apr-18 5

Simple probabilities If A and B are mutually exclusive events, then the probability of either A or B to occur is the union P(A B) P(A) P(B) Example: The probability of a hat being red is ¼, the probability of the hat being green is ¼, and the probability of the hat being black is ½. Then, the probability of a hat being red OR black is ¾. 19-Apr-18 6

Simple probabilities If A and B are independent events, then the probability that both A and B occur is the intersection P(A B) P(A) P(B) 19-Apr-18 7

Conditional probabilities What is the probability of event A to occur given than event B did occur. The conditional probability of A given B is P(A B) P(A B) P(A) Example: The probability that a US president dies in office if he is bearded 0.03/0.14 = 22%. Thus, out of 6 bearded presidents, 22% (or 1.3) are expected to die. In reality, 2 died. (Again, a close enough result.) 19-Apr-18 8

Probability Distribution The probability distribution refers to the frequency with which all possible outcomes occur. There are numerous types of probability distribution. 19-Apr-18 9

P(i) 1 n The uniform distribution A variable is said to be uniformly distributed if the probability of all possible outcomes are equal to one another. Thus, the probability P(i), where i is one of n possible outcomes, is 19-Apr-18 10

The binomial distribution A process that has only two possible outcomes is called a binomial process. In statistics, the two outcomes are frequently denoted as success and failure. The probabilities of a success or a failure are denoted by p and q, respectively. Note that p + q = 1. The binomial distribution gives the probability of exactly k successes in n trials P(k) n k p k 1 p n k 19-Apr-18 11

The binomial distribution The mean and variance of a binomially distributed variable are given by np V npq 19-Apr-18 12

The Poisson distribution Poisson d April Siméon Denis Poisson 1781-1840 19-Apr-18 13

The Poisson distribution When the probability of success is very small, e.g., the probability of a mutation, then p k and (1 p) n k become too small to calculate exactly by the binomial distribution. In such cases, the Poisson distribution becomes useful. Let l be the expected number of successes in a process consisting of n trials, i.e., l = np. The probability of observing k successes is P(k) lk e l The mean and variance of a Poisson distributed variable 19-Apr-18 are given by = l and V = l, respectively. 14 k!

Thank you 19-Apr-18 15

Class Lecture for B.Sc. 1 st Semester (General) Introduction To Statistics There are three kinds of lies: lies, damned lies, and statistics. (B.Disraeli) DR. AJIT GOSWAMI Dept. of Statistics, MDKG College

Why study statistics? 1. Data are everywhere 2. Statistical techniques are used to make many decisions that affect our lives 3. No matter what your career, you will make professional decisions that involve data. An understanding of statistical methods will help you make these decisions efectively

Applications of statistical concepts in the business world Finance correlation and regression, index numbers, time series analysis Marketing hypothesis testing, chi-square tests, nonparametric statistics Personel hypothesis testing, chi-square tests, nonparametric tests Operating management hypothesis testing, estimation, analysis of variance, time series analysis

Statistics The science of collectiong, organizing, presenting, analyzing, and interpreting data to assist in making more effective decisions Statistical analysis used to manipulate summarize, and investigate data, so that useful decision-making information results.

Types of statistics Descriptive statistics Methods of organizing, summarizing, and presenting data in an informative way Inferential statistics The methods used to determine something about a population on the basis of a sample Population The entire set of individuals or objects of interest or the measurements obtained from all individuals or objects of interest Sample A portion, or part, of the population of interest

Inferential Statistics Estimation e.g., Estimate the population mean weight using the sample mean weight Hypothesis testing e.g., Test the claim that the population mean weight is 70 kg Inference is the process of drawing conclusions or making decisions about a population based on sample results

Sampling a sample should have the same characteristics as the population it is representing. Sampling can be: with replacement: a member of the population may be chosen more than once (picking the candy from the bowl) without replacement: a member of the population may be chosen only once (lottery ticket)

Descriptive Statistics Collect data e.g., Survey Present data e.g., Tables and graphs Summarize data e.g., Sample mean = X i n

Statistical data The collection of data that are relevant to the problem being studied is commonly the most difficult, expensive, and timeconsuming part of the entire research project. Statistical data are usually obtained by counting or measuring items. Primary data are collected specifically for the analysis desired Secondary data have already been compiled and are available for statistical analysis A variable is an item of interest that can take on many different numerical values. A constant has a fixed numerical value.

Data Statistical data are usually obtained by counting or measuring items. Most data can be put into the following categories: Qualitative - data are measurements that each fail into one of several categories. (hair color, ethnic groups and other attributes of the population) Quantitative - data are observations that are measured on a numerical scale (distance traveled to college, number of children in a family, etc.)

Qualitative data Qualitative data are generally described by words or letters. They are not as widely used as quantitative data because many numerical techniques do not apply to the qualitative data. For example, it does not make sense to find an average hair color or blood type. Qualitative data can be separated into two subgroups: dichotomic (if it takes the form of a word with two options (gender - male or female) polynomic (if it takes the form of a word with more than two options (education - primary school, secondary school and university).

Quantitative data Quantitative data are always numbers and are the result of counting or measuring attributes of a population. Quantitative data can be separated into two subgroups: discrete (if it is the result of counting (the number of students of a given ethnic group in a class, the number of books on a shelf,...) continuous (if it is the result of measuring (distance traveled, weight of luggage, )

Types of variables Variables Qualitative Quantitative Dichotomic Polynomic Discrete Continuous Gender, marital status Brand of Pc, hair color Children in family, Strokes on a golf hole Amount of income tax paid, weight of a student

Numerical scale of measurement: Nominal consist of categories in each of which the number of respective observations is recorded. The categories are in no logical order and have no particular relationship. The categories are said to be mutually exclusive since an individual, object, or measurement can be included in only one of them. Ordinal contain more information. Consists of distinct categories in which order is implied. Values in one category are larger or smaller than values in other categories (e.g. rating-excelent, good, fair, poor) Interval is a set of numerical measurements in which the distance between numbers is of a known, sonstant size. Ratio consists of numerical measurements where the distance between numbers is of a known, constant size, in addition, there is a nonarbitrary zero point.

Charts and graphs Frequency distributions are good ways to present the essential aspects of data collections in concise and understable terms Pictures are always more effective in displaying large data collections

Histogram Frequently used to graphically present interval and ratio data Is often used for interval and ratio data The adjacent bars indicate that a numerical range is being summarized by indicating the frequencies in arbitrarily chosen classes

Frequency polygon Another common method for graphically presenting interval and ratio data To construct a frequency polygon mark the frequencies on the vertical axis and the values of the variable being measured on the horizontal axis, as with the histogram. If the purpose of presenting is comparation with other distributions, the frequency polygon provides a good summary of the data

Ogive A graph of a cumulative frequency distribution Ogive is used when one wants to determine how many observations lie above or below a certain value in a distribution. First cumulative frequency distribution is constructed Cumulative frequencies are plotted at the upper class limit of each category Ogive can also be constructed for a relative frequency distribution.

Pie Chart The pie chart is an effective way of displaying the percentage breakdown of data by category. Useful if the relative sizes of the data components are to be emphasized Pie charts also provide an effective way of presenting ratioor interval-scaled data after they have been organized into categories

Pie Chart

Bar chart Another common method for graphically presenting nominal and ordinal scaled data One bar is used to represent the frequency for each category The bars are usually positioned vertically with their bases located on the horizontal axis of the graph The bars are separated, and this is why such a graph is frequently used for nominal and ordinal data the separation emphasize the plotting of frequencies for distinct categories

Thank You