BUSINESS STATISTICS AND OPERATIONS RESEARCH-BPH4A Unit : 1-5

Size: px
Start display at page:

Download "BUSINESS STATISTICS AND OPERATIONS RESEARCH-BPH4A Unit : 1-5"

Transcription

1 BUSINESS STATISTICS AND OPERATIONS RESEARCH-BPH4A Unit : 1-5

2 Unit I: Syllabus 1. Introduction 2. Meaning and definition of statistics 3. Collection of statistical data 4. Tabulation of statistical data 5. Presentation of statistical data 6. Graphs and diagram 7. Measures of central tendency 8. Arithmetic mean, median, mode, harmonic mean and geometric mean. 2

3 Introduction 1. Data are everywhere 2. Statistical techniques are used to make many decisions that affect our lives 3. No matter what your career, you will make professional decisions that involve data. An understanding of statistical methods will help you make these decisions efectively Statistics is a science of collectiong, organizing, presenting, analyzing, and interpreting data to assist in making more effective decisions Statistical analysis used to manipulate summarize, and investigate data, so that useful decision-making information results. 3

4 Meaning and Definition of Statistics Definition: The term Statistics has been defined in two senses A. Singular B. Plural sense A. In the Plural Sense: Statistics are numerical statements of facts in any department of enquiry placed in relation to each other. A.L. Bowley The classified facts respecting the condition of the people in a state especially those facts which can be stated in numbers or in tables of numbers or in any tabular or classified arrangement. Webster a systematic collection of numerical facts and in singular sense; it is the science of collecting, classifying and using statistics. 4

5 Meaning Continued By statistics we mean aggregates of facts affected to a marked extent by multiplicity of causes, numerically expressed, enumerated or estimated according to reasonable standard of accuracy, collected in a systematic manner for a predetermined purpose, and placed in relation to each other. Horace Sacrist B. In the Singular Sense: Statistics refers to the body of technique or methodology, which has been developed for the collection, presentation and analysis of quantitative data and for the use of such data in decision making. Ncttor and Washerman. Statistics may rightly be called the science of averages. Bowleg Statistics may be defined as the collection, presentation, analysis, and interpretation of numerical data. Croxton and Cowden 5

6 Sources of Statistical Data Statistical data as we have seen can be either primary or secondary. Primary data are those which are collected for the first time and so are in crude form. But secondary data are those which have already been collected. Primary data are always collected from the source. It is collected either by the investigator himself or through his agents. There are different methods of collecting primary data. The choice to a large extent depends on the preliminaries to data collection some of the commonly used methods are given below. 1. Direct Personal observation: 2. Indirect Oral Interviews : 3. Mailed Questionnaire method: 4. Schedule Method: 5. From Local Agents: 6

7 Methods of collecting Primary Data 1. Direct Personal observation 2. Indirect Oral Interviews 3. Mailed Questionnaire method 4. Schedule Method 5. From Local Agents 7

8 Secondary data The data that have been already collected by and readily available from other sources. Such data are cheaper and more quickly obtainable than the primary data and also may be available when primary data can not be obtained at all. Advantages of Secondary data 1. It is economical. It saves efforts and expenses. 2. It is time saving. 3. It helps to make primary data collection more specific since with the help of secondary data, we are able to make out what are the gaps and deficiencies and what additional information needs to be collected. 4. It helps to improve the understanding of the problem. 5. It provides a basis for comparison for the data that is collected by the researcher. 8

9 Disadvantages of Secondary Data Secondary data is something that seldom fits in the framework of the marketing research factors. Reasons for its non-fitting are:-unit of secondary data collection- Suppose you want information on disposable income, but the data is available on gross income. The information may not be same as we require. Class Boundaries may be different when units are same. 9

10 Types of statistics: Descriptive & Inferential Descriptive statistics Methods of organizing, summarizing, and presenting data in an informative way Inferential statistics The methods used to determine something about a population on the basis of a sample Population The entire set of individuals or objects of interest or the measurements obtained from all individuals or objects of interest Sample A portion, or part, of the population of interest 10

11 Descriptive Statistics Collect data e.g., Survey Present data e.g., Tables and graphs Summarize data e.g., Sample mean = n X i Statistical data are usually obtained by counting or measuring items. Most data can be put into the following categories: Qualitative - data are measurements that each fail into one of several categories. (hair color, ethnic groups and other attributes of the population) quantitative - data are observations that are measured on a numerical scale (distance traveled to college, number of children in a family, etc.) 11

12 Inferential Statistics We have seen that descriptive statistics provide information about our immediate group of data. For example, we could calculate the mean and standard deviation of the exam marks for the 100 students and this could provide valuable information about this group of 100 students. Any group of data like this, which includes all the data you are interested in, is called a population. A population can be small or large, as long as it includes all the data you are interested in. For example, if you were only interested in the exam marks of 100 students, the 100 students would represent your population. Descriptive statistics are applied to populations, and the properties of populations, like the mean or standard deviation, are called parameters as they represent the whole population(i.e., everybody) 12

13 Inferential Statistics - Continued You might be interested in the exam marks of all students in the UK. It is not feasible to measure all exam marks of all students in the whole of the UK so you have to measure a smaller sample of students (e.g., 100 students), which are used to represent the larger population of all UK students. Properties of samples, such as the mean or standard deviation, are not called parameters, but statistics. Inferential statistics are techniques that allow us to use these samples to make generalizations about the populations from which the samples were drawn. It is, therefore, important that the sample accurately represents the population. The process of achieving this is called sampling. Inferential statistics arise out of the fact that sampling naturally incurs sampling error and thus a sample is not expected to perfectly represent the population. The methods of inferential statistics are (1) the estimation of parameter(s) and (2) testing of statistical hypotheses. 13

14 Quantitative data Quantitative data are always numbers and are the result of counting or measuring attributes of a population. Quantitative data can be separated into two subgroups: discrete (if it is the result of counting (the number of students of a given ethnic group in a class, the number of books on a shelf,...) continuous (if it is the result of measuring (distance traveled, weight of luggage, ) Numerical presentation of qualitative data pivot table (qualitative dichotomic statistical attributes) contingency table (qualitative statistical attributes from which at least one of them is polynomic) 14

15 Frequency distributions numerical presentation of quantitative data Frequency distribution shows the frequency, or number of occurences, in each of several categories. Frequency distributions are used to summarize large volumes of data values. When the raw data are measured on a qunatitative scale, either interval or ration, categories or classes must be designed for the data values before a frequency distribution can be formulated. Steps for constructing a frequency distribution 1. Determine the number of classes 2. Determine the size of each class 3. Determine the starting point for the first class 4. Tally the number of values that occur in each class 5. Prepare a table of the distribution using actual counts and/ or percentages (relative frequencies) m = n ( max min ) 15 h = m

16 Diagrammatic Representation Bar Diagrams are those diagrams in which data are presented in the form of bars or rectangles. Types of bar diagrams :- 1. Simple bar diagrams : These are those bar diagrams which are based on a single set of numerical data. The different items or values are represented by different bars. 2. Multiple bar diagrams : These are those bar diagrams which show two or more sets of data simultaneously. E.g. Multiple and Stacked bar diagram. 16

17 Pie Chart The pie chart is an effective way of displaying the percentage breakdown of data by category. Useful if the relative sizes of the data components are to be emphasized Pie charts also provide an effective way of presenting ratio- or interval-scaled data after they have been organized into categories Source of borrowing Amounts of loan USA 1800 USSR 1200 UK 800 Japan 600 Germany

18 Histogram & Frequency Polygon Frequently used to graphically present interval and ratio data Is often used for interval and ratio data The adjacent bars indicate that a numerical range is being summarized by indicating the frequencies in arbitrarily chosen classes Histogram frquency polygon Another common method for graphically presenting interval and ratio data To construct a frequency polygon mark the frequencies on the vertical axis and the values of the variable being measured on the horizontal axis, as with the histogram. If the purpose of presenting is comparation with other distributions, the frequency polygon provides a good summary of the data

19 Objectives of Data Tabulation 1. To carry out investigations 2. To do comparisons 3. To locate omissions and errors in the data 4. To use space economically 5. To study the trends 6. To simplify data 7. To use it as future references Parts of an Ideal Table Table number Title Stubs or Row designations Column headings or Captions: Body of the table: Unit of measurement Source Footnotes and references 19

20 Rules of Tabulation The following general rules should be observed while tabulating statistical data. 1. The table should suit the size of the paper and, therefore, the width of the column should be decided before hand. 2. Number of columns and rows should neither be too large nor too small. 3. As far as possible figures should be approximated before tabulation. This would reduce unnecessary details. 4. Items should be arranged either in alphabetical, chronological or geographical order or according to size. 5. The sub-total and total of the items of the table must be written. 20

21 Rules of Tabulation 6. Percentages are given in the tables if necessary. 7. Ditto marks should not be used in a table because sometimes it create confusion. 8. Table should be simple and attractive. 9. A table should be logical, well-balanced in length and breadth and the comparable columns should be placed side by side. 10. Light/heavy/thick or double rulings may be used to distinguish sub columns, main columns and totals. 11. For large data more than one table may be used. 21

22 Methods of Tabulation 1. Simple tabulation Simple tabulation is when the data are tabulated to one characteristic. 2. Two way table Double tabulation is when two characteristics of data are tabulated. 3. Complex Table Complex tabulation of data that includes more than two characteristics. Parts of an Ideal Table It is useful in condensing the collected data. It presents the numerical figures in an attractive form. A table with a title and a number can be easily identified and used for the required purpose. Relationship between different parts can be easily known. 22

23 Parts of an Ideal Table It is useful in condensing the collected data. It presents the numerical figures in an attractive form. A table with a title and a number can be easily identified and used for the required purpose. Relationship between different parts can be easily known. 23

24 OGIVE Wages No. of Workers A graph of a cumulative frequency distribution Ogive is used when one wants to determine how many observations lie above or below a certain value in a distribution. First cumulative frequency distribution is constructed Cumulative frequencies are plotted at the upper class limit of each category Ogive can also be constructed for a relative frequency distribution. 24

25 Mean, Median and Mode i) Mean by direct method where f is the frequency of the class and x is the size or midpoint of the class i) Median iii) Mode 25

26 Harmonic Mean The harmonic mean can be expressed as the reciprocal of the arithmetic mean of the reciprocals of the given set of observations. HM for raw and discrete data are calculated as follows. x 1/x Total HM=(5/0.3417)=

27 Mean, Median and Mode Calculation of Mean, Median and Mode for Continuous Frequency Distribution i) Mean by direct method =59920/390 = Median, Modal Class Md=150+(((390/2)-151) *10/116) = Class x f xf cf f f f L N/2 390/2= Mean median Mode

28 Mode Using graphical Method Frequency Frequency Class Frequency

29 Unit II: Syllabus 1. Measures of variation 2. Mean deviation 3. Standard deviation, 4. Quartile deviation 5. Skewness and kurtosis 6. Lorenz curve 7. Simple correlation 8. Scatter diagram 9. Karl Pearson s correlation 10. Rank correlations regressions. 29

30 Measures of Dispersion Central tendency measures do not reveal the variability present in the data. 1. Dispersion is the scatteredness of the data series around its averages. 2. It is the extent to which values in a distribution differ from the average of the distribution. Importance of dispersion: 1. Determine the reliability of an average 2. Serve as a basis for the control of the variability 3. To compare the variability of two or more series and 4. Facilitate the use of other statistical measures. 30

31 Absolute & Relative Measures The following measures of dispersion are used to study the variation. 1. Absolute Measures of Dispersion 2. Relative Measures of Dispersion Absolute measures of Dispersion are expressed in same units in which original data is presented but these measures cannot be used to compare the variations between the two series. Relative measures are not expressed in units but it is a pure number. It is the ratios of absolute dispersion to an appropriate average such as co-efficient of Standard Deviation or Co-efficient of Mean Deviation. Absolute Measures Range Quartile Deviation Mean Deviation Standard Deviation Relative Measure Co-efficient of Range Co-efficient of Quartile Deviation Co-efficient of mean Deviation Co-efficient of Variation. 31

32 Mean Deviation Average of absolute differences (differences expressed without plus or minus sign) between each value in a set of values, and the average of all values of that set. For example, the average (arithmetic mean or mean) of the set of values 1, 2, 3, 4, and 5 is (15 5) or 3. The difference between this average (3) and the values in the set is 2, 1, 0, -1, and -2; the absolute difference being 2, 1, 0, 1, and 2. The average of these numbers (6 5) is 1.2 which is the mean deviation. Also called mean absolute deviation, it is used as a measure of dispersion where the number of values or quantities is small, otherwise standard deviation is used. 32

33 Standard deviation Standard deviation is a measure of the dispersion of a set of data from its mean. It is calculated as the square root of variance by determining the variation between each data point relative to the mean. If the data points are further from the mean, there is higher deviation within the data set. Example, In finance, standard deviation is a statistical measurement; when applied to the annual rate of return of an investment, it sheds light on the historical volatility of that investment. The greater the standard deviation of a security, the greater the variance between each price and the mean, indicating a larger price range. 33

34 Quartile Deviation Quartile deviation is based on the lower quartile Q1Q1 and the upper quartile Q3Q3. The difference Q3 Q1 is called the inter quartile range. The difference Q3 Q1 divided by 2 is called semi-inter-quartile range or the quartile deviation. The quartile deviation is a slightly better measure of absolute dispersion than the range, but it ignores the observations on the tails. If we take difference samples from a population and calculate their quartile deviations, their values are quite likely to be sufficiently different. The quartile deviation calculated from the sample data does not help us to draw any conclusion (inference) about the quartile deviation in the population. Coefficient of Quartile Deviation A relative measure of dispersion based on the quartile deviation is called the coefficient of quartile deviation. It is defined as: (Q3 Q1)/(Q3+Q1) 34

35 Skewness and Kurtosis Skewness tells you the amount and direction of skew (departure from horizontal symmetry), and kurtosis tells you how tall and sharp the central peak is, relative to a standard bell curve. One application is testing for normality: many statistics inferences require that a distribution be normal or nearly normal. A normal distribution has skewness and excess kurtosis of 0, so if your distribution is close to those values then it is probably close to normal. The other common measure of shape is called the kurtosis. As skewness involves the third moment of the distribution, kurtosis involves the fourth moment. The outliers in a sample, therefore, have even more effect on the kurtosis than they do on the skewness and in a symmetric distribution both tails increase the kurtosis, unlike skewness where they offset each other. BPH4A Business Statistics and Operations 35

36 Kurtosis Continued mean and standard deviation have the same units as the original data, and the variance has the square of those units. However, the kurtosis, like skewness, has no units: it s a pure number, like a z-score. Traditionally, kurtosis has been explained in terms of the central peak. You ll see statements like this one: Higher values indicate a higher, sharper peak; lower values indicate a lower, less distinct peak, increasing kurtosis is associated with the movement of probability mass from the shoulders of a distribution into its center and tails. higher kurtosis means more of the variance is the result of infrequent extreme deviations, as opposed to frequent modestly sized deviations. In other words, it s the tails that mostly account for kurtosis, not the central peak. 36

37 Mean Absolute Deviation (MAD) For a series of n observations the mean deviation about mean is computed by MD For a discrete frequency distribution, the mean deviation about mean and median are computed respectively by = n i= 1 X i n X MD = n i= 1 f X i n i= 1 i f md i and MD = n i= 1 f X i n i= 1 i f i X The Coefficient of MD =MD/ Average 37

38 Standard Deviation It is the positive square root of the average of squares of deviations of the observations from the mean. This is also called as Root Mean squared deviation. SD fd = f For a Series of observation, the Standard Deviation can be obtained by the following formula. fd f 2 SD 1 = (x i x) 1 n n i=

39 Lorenz curve The Lorenz curve is a graphical representation of income inequality or wealth inequality developed by American economist Max Lorenz in The graph plots percentiles of the population according to income or wealth on the horizontal axis. It plots cumulative income or wealth on the vertical axis, so that an x-value of 45 and a y-value of 14.2 would mean that the bottom 45% of the population controls 14.2% of the total income or wealth. The Lorenz curve is often accompanied by a straight diagonal line with a slope of 1, which represents perfect equality in income or wealth distribution; the Lorenz curve lies beneath it, showing the actual distribution. The area between the straight line and the curved line, expressed as a ratio of the area under the straight line, is the Gini coefficient, a measurement of inequality. 39

40 CORRELATION Correlation coefficient is a statistical measure of the degree to which changes to the value of one variable predict change to the value of another. When the fluctuation of one variable reliabljy predicts a similar fluctuation in another variable, there s often a tendency to think that means that the change in one causes the change in the other. Properties of Correlation Coefficient Coefficient of Correlation lies between -1 and +1. Coefficients of Correlation are independent of Change of Origin. Coefficients of Correlation possess the property of symmetry. Coefficient of Correlation is independent of Change of Scale. Co-efficient of correlation measures only linear correlation between X and Y. 40

41 Types of correlation Correlation are of three types: Positive Correlation Negative Correlation No correlation In correlation, when values of one variable increase with the increase in another variable, it is supposed to be a positive correlation. On the other hand, if the values of one variable decrease with the decrease in another variable, then it would be a negative correlation. There might be the case when there is no change in a variable with any change in another variable. In this case, it is defined as no correlation between the two. Methods of studying Correlation Scatter diagram. Karl pearson's coefficient of correlation. Spearman's Rank correlation coefficient. 41

42 Types of correlation Correlation are of three types: Positive Correlation Negative Correlation No correlation In correlation, when values of one variable increase with the increase in another variable, it is supposed to be a positive correlation. On the other hand, if the values of one variable decrease with the decrease in another variable, then it would be a negative correlation. There might be the case when there is no change in a variable with any change in another variable. In this case, it is defined as no correlation between the two. 42

43 Scatter Diagram The scatter diagram graphs pairs of numerical data, with one variable on each axis, to look for a relationship between them. If the variables are correlated, the points will fall along a line or curve. The better the correlation, the tighter the points will hug the line. The following video explains the scatter diagram with an example 43

44 Karl pearson's coefficient of correlation The formula to calculate Karl Pearson s coefficient correlation is as follows: r = n( x 2 ) n ( xy The following video explains the correlation problem. x) ( 2 x)( n( Spearman s Rank correlation co-efficient Rank correlation is any of several statistics that measure an ordinal association the relationship between rankings of different ordinal variables or different rankings of the same variable y y) 2 ) ( y) 2 44

45 Unit III: Syllabus 1. Analysis of Time series 2. Methods of measuring trend 3. Seasonal variations. 4. Consumer Price Index Number 5. Cost of Living index number 6. Statistical Quality Control 7. Sampling Simple Random, Stratified and Systematic 45

46 Components of Time Series Seasonal effect (Seasonal Variation or Seasonal Fluctuations) Many of the time series data exhibits a seasonal variation which is annual period, such as sales and temperature readings. It is easy to understand and can be easily measured or removed from the data to give de-seasonalized data. Seasonal Fluctuations describes any regular variation (fluctuation) with a period of less than one year for example cost of variation types of fruits and vegetables, cloths, unemployment figures, average daily rainfall, increase in sale of tea in winter, increase in sale of ice cream in summer etc., all show seasonal variations. The changes which repeat themselves within a fixed period, are also called seasonal variations, for example, traffic on roads in morning and evening hours, Sales at festivals like EID etc., increase in the number of passengers at weekend etc. Seasonal variations are caused by climate, social customs, religious activities etc. 46

47 Components of Time Series Seasonal effect (Seasonal Variation or Seasonal Fluctuations) Many of the time series data exhibits a seasonal variation which is annual period, such as sales and temperature readings. It is easy to understand and can be easily measured or removed from the data to give de-seasonalized data. Seasonal Fluctuations describes any regular variation (fluctuation) with a period of less than one year for example cost of variation types of fruits and vegetables, cloths, unemployment figures, average daily rainfall, increase in sale of tea in winter, increase in sale of ice cream in summer etc., all show seasonal variations. The changes which repeat themselves within a fixed period, are also called seasonal variations, for example, traffic on roads in morning and evening hours, Sales at festivals like EID etc., increase in the number of passengers at weekend etc. Seasonal variations are caused by climate, social customs, religious activities etc. 47

48 These movements are systematic in nature where the movements are broad, steady, showing slow rise or fall in the same direction. The trend may be linear or non-linear (curvilinear). Some examples of secular trend are: Increase in prices, Increase in pollution, increase in the need of wheat, increase in literacy rate, decrease in deaths due to advances in science. Taking averages over a certain period is a simple way of detecting trend in seasonal data. Change in averages with time is evidence of a trend in the given series, though there are more formal tests for detecting trend in time series. Irregular Variation (Irregular Fluctuations) When trend and cyclical variations are removed from a set of time series data, the residual left, which may or may not be random. Various techniques for analyzing series of this type examine to see if irregular variation may be explained in terms of probability models such 48

49 as moving average or autoregressive models, i.e. we can see if any cyclical variation is still left in the residuals. These variation occur due to sudden causes are called residual variation (irregular variation or accidental or erratic fluctuations) and are unpredictable, for example rise in prices of steel due to strike in the factory, accident due to failure of break, flood, earth quick, war etc. Methods of Measuring Trend Trend can be determined : (i) Free hand curve method ; (ii) moving averages method ; (iii) semiaverages method; and (iv) least-squares method. Each of these methods is described below : (i) Freehand Curve Method : The term freehand is used to any non-mathematical curve in statistical analysis even if it is drawn with the aid of drafting instruments. This is the simplest method of studying trend of a time series. 49

50 Procedure for Free Hand Curve (i) first plot the original data on a graph paper. (ii) The direction of the plotted data is carefully observed. (iii) A smooth line is drawn through the plotted points. While fitting a trend line by the freehand method, an attempt should be made that the fitted curve conforms to these conditions. (i) The curve should be smooth either a straight line or a combination of long gradual curves. (ii) The trend line or curve should be drawn through the graph of the data in such a way that the areas below and above the trend line are equal to each other. (iii) The vertical deviations of the data above the trend line must equal to the deviations below the line. (iv) Sum of the squares of the vertical deviations of the observations from the trend should be minimum. 50

51 Trend by the Method of Semi -averages This method can be used if a straight line trend is to be obtained. Since the location of only two points is necessary to obtain a straight line equation, it is obvious that we may select two representative points and connect them by a straight line. Data are divided into two halves and an average is obtained for each half. Each such average is shown against the mid-point of the half period, we obtain two points on a graph paper. By joining these points, a straight line trend is obtained. The method is to be commended for its simplicity and used to some extent in practical work. This method is also flexible, for it is permissible to select representative periods to determine the two points.unrepresentative years may be ignored. 51

52 The formula for a straight-line trend can most simply be expressed as Y c = a + bx where X represents time variable, Yc is the dependent variable for which trend values are to be calculated and a and b are the constants of the straight tine to be found by the method of least squares. Constant is the Y-intercept. This is the difference between the point of the origin (O) and the point of the trend line and Y-axis intersect. It shows the value of Y when X = 0, constant b indicates the slope which is the change in Y for each unit change in X. Let us assume that we are given observations of Y for n number of years. If we wish to find the values of constants a and b in such a manner that the two conditions laid down above are satisfied by the fitted equation. 52

53 Mathematical reasoning suggests that, to obtain the TM values of constants a and b according to the Principle of Least Squares, we have to solve simultaneously the following two equations. Y = na + b Y...(i) XY = a X + b X 2...(ii) On solving (i) and (ii), we get the values of a and b. Methods of Measuring Seasonal Variations 1. Method of Simple Averages (Weekly, Monthly or Quarterly). 2. Ratio-to-Trend Method. 3. Ratio-to-Moving Average Method. 4. Link Relatives Method. Methods of Simple Average This is the simplest method of obtaining a seasonal index. The following steps are necessary for calculating the index : (i) Average the unadjusted date by years and months or quarters if quarterly data are given. (ii) Find totals of January, February etc. 53

54 (iii) Divide each total by the number of years for which data are given. For example, if we are given monthly data for five years then we shall first obtain total for each month for five years and divide each total by 5 to obtain an average. (iv) Obtain an average of monthly averages by dividing the total of monthly averages by 12. (v) Taking the average of monthly average as 100, compute the percentage of various monthly averages as follows: Seasonal Index for January = (average of January month/overall average)*100. Ratio-to-moving average method The method of monthly totals or monthly averages does not give any consideration to the trend which may be present in the data. The ratio-tomoving-average method is one of the simplest of the commonly used devices for measuring seasonal variation. 54

55 STEPS (i) Arrange the unadjusted data by years and months. (ii) Compute the trend values by the method of moving averages. For this purpose take 12 month moving average followed by a two-month moving average to recentre the trend values. (iii) Express the data for each month as a percentage ratio of the corresponding moving-average trend value. (iv) Arrange these ratios by months and years. (v) Aggregate the ratios for January, February etc. (vi) Find the average ratio for each month. (vii) Adjust the average monthly ratios found in step (vi) so that they will themselves average 100 percent. These adjusted ratios will be the seasonal indices for various months. 55

56 Ratio-to-trend method TM The steps in the calculation of seasonal variation are as follows : (i) Arrange the unadjusted data by years and months. (ii) Compute the trend values for each month with the help of least squares equation. (iii) Express the data for each month as a percentage ratio of the corresponding trend value. (iv) Aggregate the January s ratios, February s ratios, etc., computed previously (v) Find the average ratio for each month. (vi) Adjust the average ratios found in step (v) so that they will themselves average 100 per cent. 56

57 Link Relatives Method The following steps are taken to calculate the seasonal variation indices : (i) Calculate the link relatives of the seasonal figures. (ii) Calculating the average of the link relatives for each season. (iii) Convert these averages into chain relatives on the base of the first season. (iv) Calculate the chain relatives of the first season on the basis of the last season. There will be some difference between the chain relative of the first season and the chain relative calculated by the previous method. This difference will be due to long-term changes. It is therefore necessary to correct these chain relatives. 57

58 (v) For correction, the chain relative of the first season calculated by first method is deducted from the chain relative (of the first season) calculated by the second method. The difference is divided by the number of seasons. The resulting figure multiplied by 1,2,3 (and so on) is deducted respectively from the chain relatives of the 2nd, 3rd, 4th (and so on) seasons. These are correct chain relatives. (vi) Express the corrected chain relatives as percentage of their averages. These provide the required seasonal indices by the method of link relatives. 58

59 Statistical quality control Statistical quality control (SQC): the term used to describe the set of statistical tools used by quality professionals; SQC encompasses three broad categories of: Statistical process control (SPC) 1. Descriptive statistics include the mean, standard deviation, and range. Involve inspecting the output from a process. Quality characteristics are measured and charted Helps identify in-process variations. 2. Acceptance sampling used to randomly inspect a batch of goods to determine acceptance/rejection. Does not help to catch in-process problems. 59

60 Sources of Variation Variation exists in all processes. Variation can be categorized as either: Common or Random causes of variation, or Random causes that we cannot identify Unavoidable, e.g. slight differences in process variables like diameter, weight, service time, temperature Assignable causes of variation Causes can be identified and eliminated: poor employee training, worn tool, machine needing repair 60

61 Important uses of the control chart Most processes do not operate in a state of statistical control. Consequently, the routine and attentive use of control charts will identify assignable causes. If these causes can be eliminated from the process, variability will be reduced and the process will be improved. The control chart only detects assignable causes. Management, operator, and engineering action will be necessary to eliminate the assignable causes. 61

62 Control Charts Control Charts (aka process or QC charts) show sample data plotted on a graph with CL, UCL, and LCL Control chart for variables are used to monitor characteristics that can be measured, e.g. length, weight, diameter, time Control charts for attributes are used to monitor characteristics that have discrete values and can be counted, e.g. % defective, # of flaws in a shirt, etc. 62

63 Sampling Procedures Sampling is a process or technique of choosing a sub-group from a population to participate in the study; it is the process of selecting a number of individuals for a study in such a way that the individuals selected represent the large group from which they were selected. There are two major sampling procedures in research. These include probability and non probability sampling. Probability Sampling Procedures In probability sampling, everyone has an equal chance of being selected. This scheme is one in which every unit in the population has a chance (greater than zero) of being selected in the sample. There are four basic types of sampling procedures associated with probability samples. These include simple random, systematic sampling, stratified and cluster. Simple Random Sampling Procedure Simple random sampling provides the base from which the other more complex sampling methodologies are derived. To conduct a simple random sample, the researcher must first prepare an exhaustive list (sampling frame) of all members of the population of interest. 63

64 SRS - continued From this list, the sample is drawn so that each person or item has an equal chance of being drawn during each selection round. To draw a simple random sample without introducing researcher bias, computerized sampling programs and random number tables are used to impartially select the members of the population to be sampled. Subjects in the population are sampled by a random process, using either a random number generator or a random number table, so that each person remaining in the population has the same probability of being selected for the sample. Systematic Sampling Procedure Systematic sampling procedure often used in place of simple random sampling. In systematic sampling, the researcher selects every nth member after randomly selecting the first through nth element as the starting point. For example, if the researcher decides to sample 20 respondents from a sample of 100, every 5th member of the population will systematically be selected. 64

65 Systematic Sampling A researcher may choose to conduct a systematic sample instead of a simple random sample for several reasons. Firstly, systematic samples tend to be easier to draw and execute. a systematic sample may spread the members selected for measurement more evenly across the entire population than simple random sampling. Therefore, in some cases, systematic sampling may be more representative of the population and more precise. Stratified Sampling Procedure It is the most effective method of sampling when a researcher wants to get a representative sample of a population. It involves categorizing the members of the population into mutually exclusive and collectively exhaustive groups. An independent simple random sample is then drawn from each group. Stratified sampling techniques can provide more precise estimates if the population being surveyed is more heterogeneous than the categorized groups. 65

66 Stratified Sampling - Continued This technique can enable the researcher to determine desired levels of sampling precision for each group, and can provide administrative efficiency. The main advantage of the approach is that it s able to give the most representative sample of a population 66

67 Unit IV: Syllabus 1. Hypothesis test and its Fundamental ideas 2. Large sample Test 3. Small sample test 4. t 5. F 6. Chi Square Test 67

68 Hypothesis test and its Fundamental ideas A population is any large collection of objects or individuals, such as Americans, students, or trees about which information is desired. A parameter is any summary number, like an average or percentage, that describes the entire population. The general idea of hypothesis testing involves: 1. Making an initial assumption. 2. Collecting evidence (data). 3. Based on the available evidence (data), deciding whether to reject or not reject the initial assumption. A sample is a representative group drawn from the population. A statistic is any summary number, like an average or percentage, that describes the sample. Example: Is normal body temperature really 98.6 degrees F? 68

69 Hypothesis testing -continued Example: Consider the population of many, many adults. A researcher hypothesized that the average adult body temperature is lower than the often-advertised 98.6 degrees F. That is, the researcher wants an answer to the question: "Is the average adult body temperature 98.6 degrees? Or is it lower?" To answer his research question, the researcher starts by assuming that the average adult body temperature was 98.6 degrees F. Then, the researcher went out and tried to find evidence that refutes his initial assumption. In doing so, he selects a random sample of 130 adults. The average body temperature of the 130 sampled adults is degrees. Then, the researcher uses the data he collected to make a decision about his initial assumption. 69

70 Hypothesis testing continued It is either likely or unlikely that the researcher would collect the evidence he did given his initial assumption that the average adult body temperature is 98.6 degrees: If it is likely, then the researcher does not reject his initial assumption that the average adult body temperature is 98.6 degrees. There is not enough evidence to do otherwise. If it is unlikely, then: either the researcher's initial assumption is correct and he experienced a very unusual event; or the researcher's initial assumption is incorrect. A null hypothesis is a hypothesis that says there is no statistical significance between the two variables. It is usually the hypothesis a researcher or experimenter will try to disprove or discredit. E.g. in the previous example, average adult body temperature is 98.6 degrees. An alternative hypothesis is one that states there is a statistically significant relationship between two variables. E.g. average adult body temperature is not 98.6 degrees. 70

71 Hypothesis testing continued Let's imagine that μ is the average grade point average of all students in Chennai who s major in mathematics. We can do the following three hypothesis tests about the population mean µ and the steps involved in testing the hypothesis are given below. 1. Make assumptions and meet test requirements. 2. State the H Select the Sampling Distribution and Determine the Critical Region. 4. Calculate the test statistic. 5. Make a Decision and Interpret Results. 71

72 Large Sample Test In this section we describe and demonstrate the procedure for conducting a test of hypotheses about the mean of a population in the case that the sample size n is at least 30. The test statistic has the standard normal distribution. 72

73 Example: Large sample test The critical value is The data do not provide sufficient evidence, at the 1% level of significance, to conclude that the average amount of product dispensed is different from 8.1 ounce. We conclude that the machine does not need to be recalibrated. 73

74 Hypothesis Testing in the Two Sample Case H 0 : μ1 = μ2 The Null asserts there is no significant difference between the populations. H 1 : μ1 μ2 The research hypothesis contradicts the H 0 and asserts there is a significant difference between the populations. If the obtained test statistic falls in the Critical Region then reject the null hypothesis. The difference between the sample means is so large that we can conclude (at α = 0.05) that a difference exists between the populations represented by the samples. 74

75 Small Sample test In the previous section hypotheses testing for population means was described in the case of large samples. The statistical validity of the tests was insured by the Central Limit Theorem, with essentially no assumptions on the distribution of the population. When sample sizes are small, as is often the case in practice, the Central Limit Theorem does not apply. One must then impose stricter assumptions on the population to give statistical validity to the test procedure. One common assumption is that the population from which the sample is taken has a normal probability distribution to begin with. Under such circumstances, If σ is unknown Z has the standard normal distribution. If σ is unknown and is approximated by the sample standard deviation s, then the resulting test statistic follows Student s t-distribution with n 1 degrees of freedom. 75

76 t- test The t-test is used to test hypotheses about means when the population variance is unknown (the usual case). Closely related to z, the unit normal. Developed by Gossett for the quality control of beer. Comes in 3 varieties: Single sample, independent samples, and dependent samples. Single sample t we have only 1 group; want to test against a hypothetical mean. Independent samples t we have 2 means, 2 groups; no relation between groups, e.g., people randomly assigned to a single group. Dependent t we have two means. Either same people in both groups, or people are related, e.g., husband-wife, left hand-right hand, hospital patient and visitor. 76

77 t test continued We use t when the population variance is unknown (the usual case) and sample size is small (n<100, the usual case). If you use a stat package for testing hypotheses about means, you will use t. The t distribution is a short, fat relative of the normal. The shape of t depends on its df. As N becomes infinitely large, t becomes normal. If σ is unknown 77

78 One Sample Mean t Test The one sample t-test is a statistical procedure used to determine whether a sample of observations could have been generated by a process with a specific mean. Suppose you are interested in determining whether an assembly line produces laptop computers that weigh five pounds. To test this hypothesis, you could collect a sample of laptop computers from the assembly line, measure their weights, and compare the sample with a value of five using a one-sample t-test. 78

79 Example: One Sample t test In a sample of 226 Penn State students, the mean cost of textbooks was $344 with a standard deviation of $106. Does the average Penn State student spend more than $300 a semester on textbooks? H0: mean cost of textbooks is $344. H1: mean cost of textbooks is not $344 n=226, sample mean=344, sample standard deviation=106, hypothesized mean = 300. t=6.24 and Reject the hypothesis. 79

80 TESTING THE EQUALITY OF TWO VARIANCES: THE F TEST The test assumption of equal variances that was made in using the t-test The interest is in actually comparing the variance of two populations Assume we repeatedly select a random sample of size n from two normal populations. Consider the distribution of the ratio of two variances: F = s 1 2/s 1 2. The distribution formed in this manner approximates an F distribution with the following degrees of freedom: v 1 = n 1-1 and v 1 = n 1-1 The F table can gives the critical values of the F-distribution which depend upon the degrees of freedom. 80

81 Chi-Square Test It consists of r c contingency table, where r= number of rows, c = number of columns. Our question of interest is Are the two variables independent? This question is set up using the following hypothesis statements: Null Hypothesis: The two categorical variables are independent. Alternative Hypothesis: The two categorical variables are dependent. The chi-square test statistic is calculated by using the formula: 2 = (O E) 2 /E where O represents the observed frequency. E is the expected frequency We will compare the value of the test statistic to the critical (tabulated) value of with degree of freedom = (r - 1) (c - 1), and reject the null hypothesis if 2 > tabulated value. For example, we can check whether education level is independent of gender. 81

82 Unit V: Syllabus 1. Introduction to OR 2. Meaning Scope and Characteristics 3. Models in OR 4. LPP and its Formulation 5. Graphical Method 6. Simplex Method 7. Big M Method 8. Merits and Demerits 9. Transportation Model 10. Transportation and Assignment Problem 82

83 Operations Research Operations The activities carried out in an organization. Research The process of observation and testing characterized by the scientific method. Situation, problem statement, model construction, validation, experimentation, candidate solutions. Operations Research is a quantitative approach to decision making based on the scientific method of problem solving. Operations Research is an Art and Science It had its early roots in World War II and is flourishing in business and industry with the aid of computer Primary applications areas of Operations Research include forecasting, production scheduling, inventory control, capital budgeting, and transportation. 83

84 Example 1: A Minimization Problem LP Formulation Min z = 5x 1 + 2x 2 s.t. 2x 1 + 5x 2 > 10 4x 1 - x 2 > 12 x 1 + x 2 > 4 x 1, x 2 > 0 84

85 Example 1: Graphical Solution Graph the Constraints Constraint 1: When x 1 = 0, then x 2 = 2; when x 2 = 0, then x 1 = 5. Connect (5,0) and (0,2). The ">" side is above this line. Constraint 2: When x 2 = 0, then x 1 = 3. But setting x 1 to 0 will yield x 2 = -12, which is not on the graph. Thus, to get a second point on this line, set x 1 to any number larger than 3 and solve for x 2 : when x 1 = 5, then x 2 = 8. Connect (3,0) and (5,8). The ">" side is to the right. Constraint 3: When x 1 = 0, then x 2 = 4; when x 2 = 0, then x 1 = 4. Connect (4,0) and (0,4). The ">" side is above this line. 85

86 Example 1: Graphical Solution Constraints Graphed 5 4 x 2 Feasible Region 4x 1 - x 2 > 12 x 1 + x 2 > (16/5,4/5 ) (10/3, 2/3) (5,0) x 1 2x 1 + 5x 2 > 10 86

87 The Simplex Method The graphical method is useful only for problems involving two decision variables and relatively few problem constraints. What happens when we need more decision variables and more problem constraints? We use an algebraic method called the simplex method, which was developed by George B. DANTZIG ( ) in 1947 while on assignment with the U.S. Department of the air force. Surplus and Slack Variable A mathematical representation of surplus resources. In real life problems, it s unlikely that all resources will be used completely, so there usually are unused resources. Slack variables represent the unused resources between the left-hand side and right-hand side of each inequality. 87

MCM3C BUSINESS MATHEMATICS AND STATISTICS Unit : 1-5

MCM3C BUSINESS MATHEMATICS AND STATISTICS Unit : 1-5 MCM3C BUSINESS MATHEMATICS AND STATISTICS Unit : 1-5 Unit I: Syllabus 1. Introduction 2. meaning and definition of statistics 3. collection of statistical data 4. tabulation of statistical data 5. presentation

More information

Class 11 Maths Chapter 15. Statistics

Class 11 Maths Chapter 15. Statistics 1 P a g e Class 11 Maths Chapter 15. Statistics Statistics is the Science of collection, organization, presentation, analysis and interpretation of the numerical data. Useful Terms 1. Limit of the Class

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

A SHORT INTRODUCTION TO PROBABILITY

A SHORT INTRODUCTION TO PROBABILITY A Lecture for B.Sc. 2 nd Semester, Statistics (General) A SHORT INTRODUCTION TO PROBABILITY By Dr. Ajit Goswami Dept. of Statistics MDKG College, Dibrugarh 19-Apr-18 1 Terminology The possible outcomes

More information

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability STA301- Statistics and Probability Solved MCQS From Midterm Papers March 19,2012 MC100401285 Moaaz.pk@gmail.com Mc100401285@gmail.com PSMD01 MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

More information

Dr. G.R. Damodaran college of Science Coimbatore GRD School of Commerce and International Business II B Com(CS) ( ) Semester IV

Dr. G.R. Damodaran college of Science Coimbatore GRD School of Commerce and International Business II B Com(CS) ( ) Semester IV Dr. G.R. Damodaran college of Science Coimbatore 641 014 GRD School of Commerce and International Business II B Com(CS) (2016-2019) Semester IV Allied : Business Statistics - 405D Multiple Choice Question

More information

Overview of Dispersion. Standard. Deviation

Overview of Dispersion. Standard. Deviation 15.30 STATISTICS UNIT II: DISPERSION After reading this chapter, students will be able to understand: LEARNING OBJECTIVES To understand different measures of Dispersion i.e Range, Quartile Deviation, Mean

More information

UNIT 3 CONCEPT OF DISPERSION

UNIT 3 CONCEPT OF DISPERSION UNIT 3 CONCEPT OF DISPERSION Structure 3.0 Introduction 3.1 Objectives 3.2 Concept of Dispersion 3.2.1 Functions of Dispersion 3.2.2 Measures of Dispersion 3.2.3 Meaning of Dispersion 3.2.4 Absolute Dispersion

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

STATISTICS ( CODE NO. 08 ) PAPER I PART - I STATISTICS ( CODE NO. 08 ) PAPER I PART - I 1. Descriptive Statistics Types of data - Concepts of a Statistical population and sample from a population ; qualitative and quantitative data ; nominal and

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics.

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Study Session 1 1. Random Variable A random variable is a variable that assumes numerical

More information

Diploma Part 2. Quantitative Methods. Examiner s Suggested Answers

Diploma Part 2. Quantitative Methods. Examiner s Suggested Answers Diploma Part Quantitative Methods Examiner s Suggested Answers Question 1 (a) The standard normal distribution has a symmetrical and bell-shaped graph with a mean of zero and a standard deviation equal

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E

FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E FCE 3900 EDUCATIONAL RESEARCH LECTURE 8 P O P U L A T I O N A N D S A M P L I N G T E C H N I Q U E OBJECTIVE COURSE Understand the concept of population and sampling in the research. Identify the type

More information

CENTRAL TENDENCY (1 st Semester) Presented By Dr. Porinita Dutta Department of Statistics

CENTRAL TENDENCY (1 st Semester) Presented By Dr. Porinita Dutta Department of Statistics CENTRAL TENDENCY (1 st Semester) Presented By Dr. Porinita Dutta Department of Statistics OUTLINES Descriptive Statistics Introduction of central tendency Classification Characteristics Different measures

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

PSYCHOLOGICAL STATISTICS

PSYCHOLOGICAL STATISTICS PSYCHOLOGICAL STATISTICS B Sc. Counselling Psychology 011 Admission onwards II SEMESTER COMPLIMETARY COURSE UIVERSITY OF CALICUT SCHOOL OF DISTACE EDUCATIO CALICUT UIVERSITY.P.O., MALAPPURAM, KERALA, IDIA

More information

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Introduction to Statistics

Introduction to Statistics Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

104 Business Research Methods - MCQs

104 Business Research Methods - MCQs 104 Business Research Methods - MCQs 1) Process of obtaining a numerical description of the extent to which a person or object possesses some characteristics a) Measurement b) Scaling c) Questionnaire

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III) Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.1.1 Simple Interest 0.2 Business Applications (III) 0.2.1 Expenses Involved in Buying a Car 0.2.2 Expenses Involved

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n GLOSSARY A absolute cell referencing A spreadsheet feature that blocks automatic adjustment of cell references when formulas are moved or copied. References preceded by a dollar sign $A$1, for example

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

MEASURES OF CENTRAL TENDENCY

MEASURES OF CENTRAL TENDENCY MAT001-Statistics for Engineers MEASURES OF CENTRAL TENDENCY DESCRIPTIVE STATISTICAL MEASURES Graphical representation summarizes information in the data. In addition to the diagrammatic and graphic representations

More information

Application of Statistical Analysis in Population and Sampling Population

Application of Statistical Analysis in Population and Sampling Population Quest Journals Journal of Electronics and Communication Engineering Research Volume 2 ~ Issue 9 (2015) pp: 01-05 ISSN(Online) : 2321-5941 www.questjournals.org Research Paper Application of Statistical

More information

CHARACTERISTICS OF STATISTICS:-

CHARACTERISTICS OF STATISTICS:- Model Answers to Paper Code AS 2404 Master of Business Administration (First Semester) Examination, 2014 Paper : Second QUANTITATIVE METHODS Note: Attempt both the sections as directed. Section A Note:

More information

Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.)

Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.) Biostatistics Presentation of data DR. AMEER KADHIM HUSSEIN M.B.CH.B.FICMS (COM.) PRESENTATION OF DATA 1. Mathematical presentation (measures of central tendency and measures of dispersion). 2. Tabular

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful

More information

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V

KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION. Unit : I - V KDF2C QUANTITATIVE TECHNIQUES FOR BUSINESSDECISION Unit : I - V Unit I: Syllabus Probability and its types Theorems on Probability Law Decision Theory Decision Environment Decision Process Decision tree

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

Higher Secondary - First year STATISTICS Practical Book

Higher Secondary - First year STATISTICS Practical Book Higher Secondary - First year STATISTICS Practical Book th_statistics_practicals.indd 07-09-08 8:00:9 Introduction Statistical tools are important for us in daily life. They are used in the analysis of

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Part 7: Glossary Overview

Part 7: Glossary Overview Part 7: Glossary Overview In this Part This Part covers the following topic Topic See Page 7-1-1 Introduction This section provides an alphabetical list of all the terms used in a STEPS surveillance with

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers

Diploma Part 2. Quantitative Methods. Examiners Suggested Answers Diploma Part 2 Quantitative Methods Examiners Suggested Answers Q1 (a) A frequency distribution is a table or graph (i.e. a histogram) that shows the total number of measurements that fall in each of a

More information

Chapter 8 - Forecasting

Chapter 8 - Forecasting Chapter 8 - Forecasting Operations Management by R. Dan Reid & Nada R. Sanders 4th Edition Wiley 2010 Wiley 2010 1 Learning Objectives Identify Principles of Forecasting Explain the steps in the forecasting

More information

Statistics. Industry Business Education Physics Chemistry Economics Biology Agriculture Psychology Astronomy, etc. GFP - Sohar University

Statistics. Industry Business Education Physics Chemistry Economics Biology Agriculture Psychology Astronomy, etc. GFP - Sohar University Statistics اإلحصاء تعاريف 3-1 Definitions Statistics is a branch of Mathematics that deals collecting, analyzing, summarizing, and presenting data to help in the decision-making process. Statistics is

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. Syllabus at the F.Y. B.Sc. / B.A. In Statistics

Dr. Babasaheb Ambedkar Marathwada University, Aurangabad. Syllabus at the F.Y. B.Sc. / B.A. In Statistics Dr. Babasaheb Ambedkar Marathwada University, Aurangabad Syllabus at the F.Y. B.Sc. / B.A. In Statistics With effect from the academic year 2009-2010 Class Semester Title of Paper Paper Per week Total

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

Keystone Exams: Algebra

Keystone Exams: Algebra KeystoneExams:Algebra TheKeystoneGlossaryincludestermsanddefinitionsassociatedwiththeKeystoneAssessmentAnchorsand Eligible Content. The terms and definitions included in the glossary are intended to assist

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

Measures of Central Tendency

Measures of Central Tendency Measures of Central Tendency Summary Measures Summary Measures Central Tendency Mean Median Mode Quartile Range Variance Variation Coefficient of Variation Standard Deviation Measures of Central Tendency

More information

The aim of this section is to introduce the numerical, graphical and listing facilities of the graphic display calculator (GDC).

The aim of this section is to introduce the numerical, graphical and listing facilities of the graphic display calculator (GDC). Syllabus content Topic 1 Introduction to the graphic display calculator The aim of this section is to introduce the numerical, graphical and listing facilities of the graphic display calculator (GDC).

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

Sampling Populations limited in the scope enumerate

Sampling Populations limited in the scope enumerate Sampling Populations Typically, when we collect data, we are somewhat limited in the scope of what information we can reasonably collect Ideally, we would enumerate each and every member of a population

More information

Lectures of STA 231: Biostatistics

Lectures of STA 231: Biostatistics Lectures of STA 231: Biostatistics Second Semester Academic Year 2016/2017 Text Book Biostatistics: Basic Concepts and Methodology for the Health Sciences (10 th Edition, 2014) By Wayne W. Daniel Prepared

More information

Chapter 7 Forecasting Demand

Chapter 7 Forecasting Demand Chapter 7 Forecasting Demand Aims of the Chapter After reading this chapter you should be able to do the following: discuss the role of forecasting in inventory management; review different approaches

More information

CHAPTER 4: DATASETS AND CRITERIA FOR ALGORITHM EVALUATION

CHAPTER 4: DATASETS AND CRITERIA FOR ALGORITHM EVALUATION CHAPTER 4: DATASETS AND CRITERIA FOR ALGORITHM EVALUATION 4.1 Overview This chapter contains the description about the data that is used in this research. In this research time series data is used. A time

More information

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS LEARNING OBJECTIVES: After studying this chapter, a student should understand: notation used in statistics; how to represent variables in a mathematical form

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

2. Linear Programming Problem

2. Linear Programming Problem . Linear Programming Problem. Introduction to Linear Programming Problem (LPP). When to apply LPP or Requirement for a LPP.3 General form of LPP. Assumptions in LPP. Applications of Linear Programming.6

More information

Descriptive Statistics

Descriptive Statistics Contents 36 Descriptive Statistics 36.1 Describing Data 2 36.2 Exploring Data 26 Learning outcomes In the first Section of this Workbook you will learn how to describe data sets and represent them numerically

More information

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH)

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 003 MOCK EXAMINATIONS IOP 01-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) Time: hours READ THE INSTRUCTIONS BELOW VERY CAREFULLY. Do not open this question paper until

More information

Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells

Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. H.G. Wells 1 Statistics is a science which deals with collection, tabulation, presentation,

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Inferential Statistics

Inferential Statistics Inferential Statistics Part 1 Sampling Distributions, Point Estimates & Confidence Intervals Inferential statistics are used to draw inferences (make conclusions/judgements) about a population from a sample.

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

CORRELATION ANALYSIS. Dr. Anulawathie Menike Dept. of Economics

CORRELATION ANALYSIS. Dr. Anulawathie Menike Dept. of Economics CORRELATION ANALYSIS Dr. Anulawathie Menike Dept. of Economics 1 What is Correlation The correlation is one of the most common and most useful statistics. It is a term used to describe the relationship

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Forecasting Chapter 3

Forecasting Chapter 3 Forecasting Chapter 3 Introduction Current factors and conditions Past experience in a similar situation 2 Accounting. New product/process cost estimates, profit projections, cash management. Finance.

More information

Chapter 1:Descriptive statistics

Chapter 1:Descriptive statistics Slide 1.1 Chapter 1:Descriptive statistics Descriptive statistics summarises a mass of information. We may use graphical and/or numerical methods Examples of the former are the bar chart and XY chart,

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326 page 35 8 Statistics are around us both seen and in ways that affect our lives without us knowing it. We have seen data organized into charts in magazines, books and newspapers. That s descriptive statistics!

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information