Predicting Retention Rates from Placement Exam Scores

Size: px
Start display at page:

Download "Predicting Retention Rates from Placement Exam Scores"

Transcription

1 Predicting Retention Rates from Placement Exam Scores Dr. Michael S. Pilant, Dept. of Mathematics, Texas A&M University Dr. Robert Hall, Dept. of Ed. Psychology, Texas A&M University Amy Austin, Senior Lecturer, Dept. of Mathematics, Texas A&M University Marcia Drost, Senior Lecturer (retired), Dept. of Mathematics, Texas A&M University March 18, 2012 Abstract During the past 5 years, over 15,000 students have taken a math placement exam at Texas A&M University to determine their readiness for Calculus. By tracking the performance of those students who subsequently enrolled in Calculus, a very accurate model has been developed which predicts the probability of success (retention rate) in Calculus based on precalculus math placement scores. 1 Introduction The goal of this paper is to be able to accurately compute the retention rate R(s) where s is the score on the math placement exam (MPE), and 0 s S max. The retention rate R(S) is defined as the percentage of students in a group Ω who pass the course, and whose MPE scores exceed S. We begin by considering P (S), where P (S) is the number of students with scores s S who pass the course. This function is guaranteed to be monotone increasing, and typically has the shape of an S-curve, as shown in Figure 1. This figure was obtained, using historical data from thousands of students. By historical we mean data for students who have already taken the Math Placement Exam and taken Calculus I and received a grade. 1

2 Figure 1: Data vs Logistic Model We model it as a two parameter logistic (2-PL) function of the following form P (S) N P = e a P b P S (1) where N P is the total number who pass (N P = P (S max )), and the parameters a P, b P define the two parameter logistic function. Similarly, let F (S) denote the number of students with scores s S who fail the course. This function has the same general shape, and can also be modeled by a two parameter logistic function F (S) N F = e a F b F S (2) where N F is the total number who fail (N F = F (S max )), and the parameters a F, b F define the two parameter logistic function. Clearly N(S) = P (S) + F (S) where N(S) is the total number of students with MPE scores s S. We approximately the probability density functions by differentiating the cumulative density functions

3 p(s) = P (S) N P = b P e ap bp S (1 + e a P b P S ) 2 (3) The function p(s) has a maximum at s = a P /b P and a maximum height of b P 4. The mean for this distribution is µ P = a P /b P and the standard deviation is σ P = Likewise, π b P 3 f(s) = F (S) N F = b F e af bf S (1 + e a F b F S ) 2 (4) The function f(s) has a maximum at s = a F /b F and a maximum height of b F 4. The mean for this distribution is µ F = a F /b F and the standard deviation is σ F = We can solve these equations for a, b in terms if µ, σ by π b F 3 b P = π σ P 3 (5) a P = b P µ P (6) b F = π σ F 3 (7) a F = b F µ F (8)

4 2 Fall 2010 Data set The actual numbers from Fall 2010 are given in the table below s p(s) f(s) P (s) F (s) N(s) Table 1: Fall 2010 pass/fail data 3 Comparison with Normal Distributions The cumulative density functions are of the form: 1 Π(S) = 1 + e a bs (9) The probability density functions are given by π(s) = Π (s) = be a bs (1 + e a bs ) 2 (10)

5 Although the optimal choice of model parameters for logistical regression is determined by maximal likelihood, we approximate this by equating the mean and standard deviation of the logistic function to that of the sample population µ = a b σ = π b 3 In terms of µ, σ we may write (10) as π(s) = π 3σ e π(s µ)/( 3σ) (1 + e π(s µ)/( 3σ ) 2 (11) The equivalent normal distribution (with mean µ and standard deviation σ) is given by ν(s) = 1 2πσ e (s µ)2 /(2σ 2 ) These differ slightly at the maximum probability s = µ, as shown in Figure 2 below: π(µ) = π 4 3σ = σ 1 ν(µ) = 1 2πσ = σ 1 Figure 2: Probability Density Functions The logistic distribution has the same first and second moment as the normal distribution. The logistic function is not symmetric, so the third moment differs from zero. The fourth moment

6 defines the excess kurtosis, which is 1.2 for the logistic and zero for the normal distribution. This indicates that the logistic distribution has a sharper peak, and longer flatter tails, which is the case. This appears to make it more suitable for distribution of test scores which have flatter tails than those given by a normal distribution. The cumulative density functions are almost indistinguishable, with the slope of the logistic cumulative density function slightly steeper at the median. The primary advantage of the logistic function is that it has an explicit cumulative density function, as opposed to the normal distribution s error function. Figure 3: Cumulative Density Functions We now have a two-parameter family of functions which describe the probability of passing given a particular placement score. The distributions are defined in terms of the parameters a, b or equivalently µ, σ. To complete the mathematical model, we examine how the parameters of the passing and failing distributions are related, through the statistics of the MPE. 4 MPE statistical parameters The math placement exam consists of 33 multiple choice questions, and was written by Amy Austin and Marcia Drost - senior lecturers with extensive experience in both precalculus and calculus and Texas A&M University. Each question is chosen from a test bank of 15 questions, and the stems are randomly permuted. The MPE exam is timed (90 minutes), un proctored, and calculators are not allowed. The test may be taken any time after a student is admitted to Texas A&M, and can be repeated after 30 days have

7 passed. Beginning in Fall 2011, a score of 22 or higher out of 33 is required to enroll in Calculus I. Historically, each question has several parameters associated with it, given by the ratio of correct responses to the total number of responses, for each of the two groups of passing and failing students: µ k = #correct N (12) µ k,p = #correct N p (13) µ k,f = #correct N f (14) The individual test item means µ are an indication of the overall difficulty of the question, and the difference of the means measures the ability of a particular question to discriminate between passing and failing students. δ k = µ k,p µ k,f (15) for each question. The historical difference in means, for a given set of questions, is therefore = k δ k = µ P µ F (16) The MPE test mean is related through µ = N F N µ F + N P N µ P = (1 R)µ F + Rµ P (17) where R is the historical probability of being in the passing group (that is, the overall retention rate). We can solve (16) and (17) to obtain µ F = µ R (18) µ P = µ + (1 R) (19) This allows us to estimate the scores on the MPE exam from the overall mean, and the historical passing rates for Calculus.

8 The standard deviations are related through a weighted average We can define a parameter κ by the following σ 2 = Rσ 2 P + (1 R)σ 2 F + R(1 R) 2 (20) σ F σ P = 1 + κ (21) (kappa measures the relative size of the passing and failing standard deviations.) (18) and (19) can be solved to obtain σ σ P = 2 R(1 R) 2 R + (1 R)(1 + κ) 2 (22) σ F = (1 + κ) σ P (23) Given a set of N students, and calculating their MPE scores based on a set of k 33 questions, we now have enough information to calculate the probability and cumulative density functions. 5 Computational Algorithm 1. Compute the overall historical passing rate for Calculus for previous semesters. Passing is defined as achieving a grade of A, B, or C. This number R is quite stable given the large number of students taking Calculus each year. It is slowly increasing, but we can use the previous years retention rate as a good estimate of R. Also, compute the mean of the MPE µ and the standard deviation σ, and the difference between the means of those who pass and those who fail, = µ P µ F. 2. Choose a sample of N students that have taken the MPE exam and are enrolled in (or have taken) Calculus. We now have the parameters {N, µ, σ, R, } available. 3. The total number of students expected to pass is therefore given by N P = RN and the total number expected to fail is given by N F = (1 R)N

9 4. The mean MPE scores for the two groups (pass, fail) is given by µ P = µ + (1 R) and µ F = µ R 5. Given the heuristic constant κ = 0.05, we can calculate the standard deviations by σ σ P = 2 R(1 R) 2 R + (1 R)(1 + κ) 2 and σ F = (1 + κ) σ P 6. The Logistic parameters are then estimated by (a) b F = π/( 3σ F ), a F = b F.µ F (b) b P = π/( 3σ P ), a P = b P.µ P 7. The Logistic CDFs are therefore given by (a) P (S) = (b) F (S) = N P 1+e a P b P S N F 1+e a F b F S We now have enough information to estimate the following 1. False Negatives: Number of individuals who pass with scores s S P (S) 2. Positives: Number of individuals who pass with scores s > S N P P (S) 3. Negatives: Number of people who fail with scores s S F (S) 4. False Positives: Number of people who fail with scores s > S N F F (S) and compare it to actual data. For example, one can compute the value of the cutoff score S C such that the number of false positives equals the number of false negatives

10 6 Retention Rate vs MPE scores We can now compute the theoretical retention rate function R(S) defined as the percentage of students with scores s S who pass. R(S) = N P P (S) N (P (S) + F (S)) We can compare it to historical data, as shown in the table below. We can also compute a cutoff score S C defined by the requirement that R(S C ) = Correlation between MPE scores and grades If one calculates the Pearson correlation between the MPE scores and the pass/fail grades (pass=1, fail=0) for the Fall 2010 data set, we get the following R 2 =... This is not very good, indicating that only a third of the variance of the outcome is explained by the MPE scores. This is the reason why we went to an odds or probability based outcome measure. 8 Summary Based on a large sample of students who have taken the Math Placement Exam (MPE), and who have subsequently enrolled in Calculus, we have developed a model that estimates the likelihood that a student will pass Calculus, given a particular score on the MPE. Because of the binary nature of the result (pass=1, fail=0) we can only estimate the probability that a student with a particular score will succeed in calculus. However, due to the large sample size, the logistical models predictions are quite accurate. The existence of a highly accurate parameterized model relating probability of success to a small number of parameters allows one to apply various optimization techniques, perturbation analysis and other mathematical tools to help understand the relationship between the individual MPE questions, the MPE exam, and probability of success.

COURSE SYLLABUS Part I Course Title: MATH College Algebra Credit Hours: 4, (4 Lecture 0 Lab G) OTM-TMM001

COURSE SYLLABUS Part I Course Title: MATH College Algebra Credit Hours: 4, (4 Lecture 0 Lab G) OTM-TMM001 COURSE SYLLABUS Part I Course Title: MATH 1340 - College Algebra Credit Hours: 4, (4 Lecture 0 Lab G) OTM-TMM001 Course Description: College Algebra in conjunction with MATH 1350, Pre-Calculus, provides

More information

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values.

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values. Measures of Central Tendency Statistics 1) Mean: The of all data values Mean= x = x 1+x 2 +x 3 + +x n n i.e: Add up all values and divide by the total number of values. 2) Mode: Most data value 3) Median:

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Homework 7. Name: ID# Section

Homework 7. Name: ID# Section Homework 7 Name: ID# Section 1 Find the probabilities for each of the following using the standard normal distribution. 1. P(0 < z < 1.69) 2. P(-1.57 < z < 0) 3. P(z > 1.16) 4. P(z < -1.77) 5. P(-2.46

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation. CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.

More information

Math 416 Lecture 3. The average or mean or expected value of x 1, x 2, x 3,..., x n is

Math 416 Lecture 3. The average or mean or expected value of x 1, x 2, x 3,..., x n is Math 416 Lecture 3 Expected values The average or mean or expected value of x 1, x 2, x 3,..., x n is x 1 x 2... x n n x 1 1 n x 2 1 n... x n 1 n 1 n x i p x i where p x i 1 n is the probability of x i

More information

Midterm Exam 1 Solution

Midterm Exam 1 Solution EECS 126 Probability and Random Processes University of California, Berkeley: Fall 2015 Kannan Ramchandran September 22, 2015 Midterm Exam 1 Solution Last name First name SID Name of student on your left:

More information

Name: Exam 2 Solutions. March 13, 2017

Name: Exam 2 Solutions. March 13, 2017 Department of Mathematics University of Notre Dame Math 00 Finite Math Spring 07 Name: Instructors: Conant/Galvin Exam Solutions March, 07 This exam is in two parts on pages and contains problems worth

More information

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Assessing the relation between language comprehension and performance in general chemistry. Appendices Assessing the relation between language comprehension and performance in general chemistry Daniel T. Pyburn a, Samuel Pazicni* a, Victor A. Benassi b, and Elizabeth E. Tappin c a Department of Chemistry,

More information

Chapter 1: Revie of Calculus and Probability

Chapter 1: Revie of Calculus and Probability Chapter 1: Revie of Calculus and Probability Refer to Text Book: Operations Research: Applications and Algorithms By Wayne L. Winston,Ch. 12 Operations Research: An Introduction By Hamdi Taha, Ch. 12 OR441-Dr.Khalid

More information

AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam.

AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam. AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam. Name: Directions: The questions or incomplete statements below are each followed by

More information

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators.

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. IE 230 Seat # Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. Score Exam #3a, Spring 2002 Schmeiser Closed book and notes. 60 minutes. 1. True or false. (for each,

More information

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space?

(a) Fill in the missing probabilities in the table. (b) Calculate P(F G). (c) Calculate P(E c ). (d) Is this a uniform sample space? Math 166 Exam 1 Review Sections L.1-L.2, 1.1-1.7 Note: This review is more heavily weighted on the new material this week: Sections 1.5-1.7. For more practice problems on previous material, take a look

More information

Math May 13, Final Exam

Math May 13, Final Exam Math 447 - May 13, 2013 - Final Exam Name: Read these instructions carefully: The points assigned are not meant to be a guide to the difficulty of the problems. If the question is multiple choice, there

More information

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.

More information

Lecture # 31. Questions of Marks 3. Question: Solution:

Lecture # 31. Questions of Marks 3. Question: Solution: Lecture # 31 Given XY = 400, X = 5, Y = 4, S = 4, S = 3, n = 15. Compute the coefficient of correlation between XX and YY. r =0.55 X Y Determine whether two variables XX and YY are correlated or uncorrelated

More information

COMPSCI 240: Reasoning Under Uncertainty

COMPSCI 240: Reasoning Under Uncertainty COMPSCI 240: Reasoning Under Uncertainty Nic Herndon and Andrew Lan University of Massachusetts at Amherst Spring 2019 Lecture 7: Counting (cont.) Summary of Counting Problems Structure Permutation k-permutation

More information

STAT100 Elementary Statistics and Probability

STAT100 Elementary Statistics and Probability STAT100 Elementary Statistics and Probability Exam, Sample Test, Summer 014 Solution Show all work clearly and in order, and circle your final answers. Justify your answers algebraically whenever possible.

More information

MAT 271E Probability and Statistics

MAT 271E Probability and Statistics MAT 71E Probability and Statistics Spring 013 Instructor : Class Meets : Office Hours : Textbook : Supp. Text : İlker Bayram EEB 1103 ibayram@itu.edu.tr 13.30 1.30, Wednesday EEB 5303 10.00 1.00, Wednesday

More information

Name: Exam: In-term Two Page: 1 of 8 Date: 12/07/2018. University of Texas at Austin, Department of Mathematics M358K - Applied Statistics TRUE/FALSE

Name: Exam: In-term Two Page: 1 of 8 Date: 12/07/2018. University of Texas at Austin, Department of Mathematics M358K - Applied Statistics TRUE/FALSE Exam: In-term Two Page: 1 of 8 Date: 12/07/2018 Name: TRUE/FALSE 1.1 TRUE FALSE University of Texas at Austin, Department of Mathematics M358K - Applied Statistics MULTIPLE CHOICE 1.2 TRUE FALSE 1.3 TRUE

More information

2016 SFUSD Math Validation Test

2016 SFUSD Math Validation Test 2016 SFUSD Math Validation Test The SFUSD course sequence for math provides a focused, coherent, and rigorous learning experience that balances mathematical content knowledge with mathematical practices.

More information

The Empirical Rule, z-scores, and the Rare Event Approach

The Empirical Rule, z-scores, and the Rare Event Approach Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing

More information

The Normal Distribution Review

The Normal Distribution Review The Normal Distribution Review A. Properties of the Normal Distribution Many continuous variables have distributions that are bell-shaped and are called approximately normally distributed variables. The

More information

PRACTICE PROBLEMS FOR EXAM 1

PRACTICE PROBLEMS FOR EXAM 1 PRACTICE PROBLEMS FOR EXAM 1 Math 3160Q Spring 01 Professor Hohn Below is a list of practice questions for Exam 1. Any quiz, homework, or example problem has a chance of being on the exam. For more practice,

More information

Mathematical statistics

Mathematical statistics October 18 th, 2018 Lecture 16: Midterm review Countdown to mid-term exam: 7 days Week 1 Chapter 1: Probability review Week 2 Week 4 Week 7 Chapter 6: Statistics Chapter 7: Point Estimation Chapter 8:

More information

EXAM 3 Math 1342 Elementary Statistics 6-7

EXAM 3 Math 1342 Elementary Statistics 6-7 EXAM 3 Math 1342 Elementary Statistics 6-7 Name Date ********************************************************************************************************************************************** MULTIPLE

More information

Intermediate Algebra

Intermediate Algebra Intermediate Algebra COURSE OUTLINE FOR MATH 0312 (REVISED JULY 29, 2015) Catalog Description: Topics include factoring techniques, radicals, algebraic fractions, absolute values, complex numbers, graphing

More information

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

Basic IRT Concepts, Models, and Assumptions

Basic IRT Concepts, Models, and Assumptions Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Part A. BASIC INFORMATION: (Please print) 1. Name: 2. Please mark all courses you have taken in high school or college :

Part A. BASIC INFORMATION: (Please print) 1. Name: 2. Please mark all courses you have taken in high school or college : THE PLACEMENT TEST FOR PRECALCULUS/CALCULUS I - Fall, 1 Please read the following carefully: Your score on this test (in Part B), your math SAT or ACT score, and the information you are asked to provide

More information

Cumulative Frequency & Frequency Density

Cumulative Frequency & Frequency Density For more awesome GCSE and A level resources, visit us at www.savemyexams.co.uk Frequency & Frequency Density Diagrams Question Paper 2 Level IGCSE Subject Maths (0580) Exam Board Cambridge International

More information

Measures of Central Tendency. For Ungrouped Data

Measures of Central Tendency. For Ungrouped Data Measures of Central Tendency For Ungrouped Data Recall: What are the three ways on how we can define central tendency or center of the distribution? We can also say: (1) the point on which a distribution

More information

COWLEY COLLEGE & Area Vocational Technical School

COWLEY COLLEGE & Area Vocational Technical School COWLEY COLLEGE & Area Vocational Technical School COURSE PROCEDURE FOR COLLEGE ALGEBRA WITH REVIEW MTH 4421 5 Credit Hours Student Level: This course is open to students on the college level in the freshman

More information

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!!  (preferred!)!! Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Continuous Probability Distributions Learning Objectives 1. Understand the difference between how probabilities are computed for discrete and continuous random variables. 2. Know how to compute probability

More information

AP STATISTICS: Summer Math Packet

AP STATISTICS: Summer Math Packet Name AP STATISTICS: Summer Math Packet DIRECTIONS: Complete all problems on this packet. Packet due by the end of the first week of classes. Attach additional paper if needed. Calculator may be used. 1.

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Preliminary Statistics. Lecture 3: Probability Models and Distributions

Preliminary Statistics. Lecture 3: Probability Models and Distributions Preliminary Statistics Lecture 3: Probability Models and Distributions Rory Macqueen (rm43@soas.ac.uk), September 2015 Outline Revision of Lecture 2 Probability Density Functions Cumulative Distribution

More information

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data Norm-Referenced vs. Criterion- Referenced Instruments Instrumentation (cont.) October 1, 2007 Note: Measurement Plan Due Next Week All derived scores give meaning to individual scores by comparing them

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1

Additional practice with these ideas can be found in the problems for Tintle Section P.1.1 Psych 10 / Stats 60, Practice Problem Set 3 (Week 3 Material) Part 1: Decide if each variable below is quantitative, ordinal, or categorical. If the variable is categorical, also decide whether or not

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge Business Statistics Midterm Exam Fall 5 Russell Name Do not turn over this page until you are told to do so. You will have hour and 3 minutes to complete the exam. There are a total of points divided into

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed) Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Psychology 405: Psychometric Theory Validity

Psychology 405: Psychometric Theory Validity Preliminaries Psychology 405: Psychometric Theory Validity William Revelle Department of Psychology Northwestern University Evanston, Illinois USA May, 2016 1/17 Preliminaries Outline Preliminaries 2/17

More information

Mean/Modal Class Grouped Data

Mean/Modal Class Grouped Data For more awesome GCSE and A level resources, visit us at www.savemyexams.co.uk Mean/Modal Class Grouped Data Question Paper 1 Level IGCSE Subject Maths (58) Exam Board Cambridge International Examinations

More information

Elementary Probability and Statistics Midterm I April 22, 2010

Elementary Probability and Statistics Midterm I April 22, 2010 Elementary Probability and Statistics Midterm I April 22, 2010 Instructor: Bjørn Kjos-Hanssen Student name and ID number Disclaimer: It is essential to write legibly and show your work. If your work is

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Closed book and notes. 120 minutes. Cover page, five pages of exam. No calculators.

Closed book and notes. 120 minutes. Cover page, five pages of exam. No calculators. IE 230 Seat # Closed book and notes. 120 minutes. Cover page, five pages of exam. No calculators. Score Final Exam, Spring 2005 (May 2) Schmeiser Closed book and notes. 120 minutes. Consider an experiment

More information

Calculus with Algebra and Trigonometry II Lecture 21 Probability applications

Calculus with Algebra and Trigonometry II Lecture 21 Probability applications Calculus with Algebra and Trigonometry II Lecture 21 Probability applications Apr 16, 215 Calculus with Algebra and Trigonometry II Lecture 21Probability Apr applications 16, 215 1 / 1 Histograms The distribution

More information

Algebra 1. Statistics and the Number System Day 3

Algebra 1. Statistics and the Number System Day 3 Algebra 1 Statistics and the Number System Day 3 MAFS.912. N-RN.1.2 Which expression is equivalent to 5 m A. m 1 5 B. m 5 C. m 1 5 D. m 5 A MAFS.912. N-RN.1.2 Which expression is equivalent to 5 3 g A.

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

P (A B) P ((B C) A) P (B A) = P (B A) + P (C A) P (A) = P (B A) + P (C A) = Q(A) + Q(B).

P (A B) P ((B C) A) P (B A) = P (B A) + P (C A) P (A) = P (B A) + P (C A) = Q(A) + Q(B). Lectures 7-8 jacques@ucsdedu 41 Conditional Probability Let (Ω, F, P ) be a probability space Suppose that we have prior information which leads us to conclude that an event A F occurs Based on this information,

More information

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Business Statistics QM 220 Chapter 12 Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,

More information

Question Points Score Total: 76

Question Points Score Total: 76 Math 447 Test 2 March 17, Spring 216 No books, no notes, only SOA-approved calculators. true/false or fill-in-the-blank question. You must show work, unless the question is a Name: Question Points Score

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

6.1 Normal Distribution

6.1 Normal Distribution GOALS: 1. Understand properties of: a) Density Curves b) Normal Curves c) Standard Normal Curve 2. Relate area under the curve to proportions of the population represented by the curve. Study Ch. 6.1,

More information

Lecture #11: Classification & Logistic Regression

Lecture #11: Classification & Logistic Regression Lecture #11: Classification & Logistic Regression CS 109A, STAT 121A, AC 209A: Data Science Weiwei Pan, Pavlos Protopapas, Kevin Rader Fall 2016 Harvard University 1 Announcements Midterm: will be graded

More information

Advanced Quantitative Data Analysis

Advanced Quantitative Data Analysis Chapter 24 Advanced Quantitative Data Analysis Daniel Muijs Doing Regression Analysis in SPSS When we want to do regression analysis in SPSS, we have to go through the following steps: 1 As usual, we choose

More information

Introduction to Signal Detection and Classification. Phani Chavali

Introduction to Signal Detection and Classification. Phani Chavali Introduction to Signal Detection and Classification Phani Chavali Outline Detection Problem Performance Measures Receiver Operating Characteristics (ROC) F-Test - Test Linear Discriminant Analysis (LDA)

More information

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. Data 8 Fall 2017 Foundations of Data Science Final INSTRUCTIONS You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. The exam is closed

More information

(a) Find the value of x. (4) Write down the standard deviation. (2) (Total 6 marks)

(a) Find the value of x. (4) Write down the standard deviation. (2) (Total 6 marks) 1. The following frequency distribution of marks has mean 4.5. Mark 1 2 3 4 5 6 7 Frequency 2 4 6 9 x 9 4 Find the value of x. (4) Write down the standard deviation. (Total 6 marks) 2. The following table

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Get started [Hawkes Learning] with this system. Common final exam, independently administered, group graded, grades reported.

Get started [Hawkes Learning] with this system. Common final exam, independently administered, group graded, grades reported. Course Information Math 095 Elementary Algebra Placement No placement necessary Course Description Learning Outcomes Elementary algebraic topics for students whose mathematical background or placement

More information

However, reliability analysis is not limited to calculation of the probability of failure.

However, reliability analysis is not limited to calculation of the probability of failure. Probabilistic Analysis probabilistic analysis methods, including the first and second-order reliability methods, Monte Carlo simulation, Importance sampling, Latin Hypercube sampling, and stochastic expansions

More information

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012 University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

Math P (A 1 ) =.5, P (A 2 ) =.6, P (A 1 A 2 ) =.9r

Math P (A 1 ) =.5, P (A 2 ) =.6, P (A 1 A 2 ) =.9r Math 3070 1. Treibergs σιι First Midterm Exam Name: SAMPLE January 31, 2000 (1. Compute the sample mean x and sample standard deviation s for the January mean temperatures (in F for Seattle from 1900 to

More information

School of Mathematics and Science MATH 163 College Algebra Section: EFA

School of Mathematics and Science MATH 163 College Algebra Section: EFA CCBC Essex School of Mathematics and Science MATH 163 College Algebra Section: EFA CLASSROOM LOCATION: L311 SEMESTER: Spring 2012 CLASS MEETING DAYS: MWF CLASS MEETING TIMES:12:20-1:15 INSTRUCTOR: D. TRUSZKOWSKI

More information

Development of a Pre-General Chemistry Course at Valdosta State University

Development of a Pre-General Chemistry Course at Valdosta State University Georgia Southern University Digital Commons@Georgia Southern Interdisciplinary STEM Teaching & Learning Conference Mar 7th, 9:55 AM - 10:15 AM Development of a Pre-General Chemistry Course at Valdosta

More information

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for BUSINESS STATISTICS - BMGT 571 Chapters 1 through 6 Professor Ahmadi, Ph.D. Department of Management Revised May 005 Glossary of Terms: Statistics Chapter 1 Data Data Set Elements Variable

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Get to know the Departments of Chemistry and Biochemistry (Two separate departments as of summer 2018)

Get to know the Departments of Chemistry and Biochemistry (Two separate departments as of summer 2018) Get to know the Departments of Chemistry and Biochemistry (Two separate departments as of summer 2018) Where are we? Chemistry: Main Campus- Cristol & Ekeley Biochemistry: East Campus - JSCBB Susan Hendrickson,

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B Choose the best answer. Do not write letters in the margin or communicate with other students in any way; if you do you will receive a

More information

Topic 2 Part 1 [195 marks]

Topic 2 Part 1 [195 marks] Topic 2 Part 1 [195 marks] The distribution of rainfall in a town over 80 days is displayed on the following box-and-whisker diagram. 1a. Write down the median rainfall. 1b. Write down the minimum rainfall.

More information

Math 1120 Solutions to Review for the Third Exam Spring 2011

Math 1120 Solutions to Review for the Third Exam Spring 2011 Math 20 Solutions to Review for the Third Exam Spring 20. Calculate the mean, median and mode for each of the following data sets. (a) 4, 8, 3, 3, 5, 3, 5, 7,, 9, 0, 2 First, order the set: 0,, 2, 3, 3,

More information

Computations - Show all your work. (30 pts)

Computations - Show all your work. (30 pts) Math 1012 Final Name: Computations - Show all your work. (30 pts) 1. Fractions. a. 1 7 + 1 5 b. 12 5 5 9 c. 6 8 2 16 d. 1 6 + 2 5 + 3 4 2.a Powers of ten. i. 10 3 10 2 ii. 10 2 10 6 iii. 10 0 iv. (10 5

More information

Math 430 Exam 2, Fall 2008

Math 430 Exam 2, Fall 2008 Do not distribute. IIT Dept. Applied Mathematics, February 16, 2009 1 PRINT Last name: Signature: First name: Student ID: Math 430 Exam 2, Fall 2008 These theorems may be cited at any time during the test

More information

Math 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential.

Math 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential. Math 8A Lecture 6 Friday May 7 th Epectation Recall the three main probability density functions so far () Uniform () Eponential (3) Power Law e, ( ), Math 8A Lecture 6 Friday May 7 th Epectation Eample

More information

2. This problem is very easy if you remember the exponential cdf; i.e., 0, y 0 F Y (y) = = ln(0.5) = ln = ln 2 = φ 0.5 = β ln 2.

2. This problem is very easy if you remember the exponential cdf; i.e., 0, y 0 F Y (y) = = ln(0.5) = ln = ln 2 = φ 0.5 = β ln 2. FALL 8 FINAL EXAM SOLUTIONS. Use the basic counting rule. There are n = number of was to choose 5 white balls from 7 = n = number of was to choose ellow ball from 5 = ( ) 7 5 ( ) 5. there are n n = ( )(

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

AP Calculus AB Course Syllabus

AP Calculus AB Course Syllabus AP Calculus AB Course Syllabus Grant Community High School Mr. Rous Textbook Finney, Ross L., Franklin D. Demana, Bert K. Waits, and Daniel Kennedy. Calculus Graphical, Numerical, Algebraic, Fourth Addition,

More information

9/19/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

9/19/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 The aspect of the data we want to describe/measure is relative position z scores tell us how many standard deviations above or below

More information

CMPSCI 240: Reasoning Under Uncertainty First Midterm Exam

CMPSCI 240: Reasoning Under Uncertainty First Midterm Exam CMPSCI 240: Reasoning Under Uncertainty First Midterm Exam February 13, 2013. Name: ID: Instructions: Answer the questions directly on the exam pages. Show all your work for each question. Giving more

More information

Math 115 Practice for Exam 2

Math 115 Practice for Exam 2 Math 115 Practice for Exam Generated October 30, 017 Name: SOLUTIONS Instructor: Section Number: 1. This exam has 5 questions. Note that the problems are not of equal difficulty, so you may want to skip

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

AP Calculus AB - Course Outline

AP Calculus AB - Course Outline By successfully completing this course, you will be able to: a. Work with functions represented in a variety of ways and understand the connections among these representations. b. Understand the meaning

More information

MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm)

MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm) Name: MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm) Instructions: The total score is 200 points. There are ten problems. Point values per problem are shown besides the questions.

More information