Lecture 1 January 18

Similar documents
Lecture 4 January 23

1 Impact Evaluation: Randomized Controlled Trial (RCT)

CompSci Understanding Data: Theory and Applications

Analysis of Variance and Co-variance. By Manza Ramesh

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Lecture 1: Introduction and probability review

Fundamental Probability and Statistics

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Evaluation Module: Care for the Chronically ill Person Spring 2010 Responsible for the evaluation process: Project manager, Mette Bro Jansen

Introduction to Statistical Inference

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Rerandomization to Balance Covariates

review session gov 2000 gov 2000 () review session 1 / 38

7.6. Solve Problems Involving Exponential Growth and Decay. Tools

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

University of Jordan Fall 2009/2010 Department of Mathematics

Introduction to Econometrics

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Difference Between Pair Differences v. 2 Samples

Kunjamma Father s younger brother s wife in Malayalam. For the teacher: Chittappan Father s younger brother in Malayalam.

Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10

(Refer Slide Time: 3:48)

Lecture 1: Probability Fundamentals

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Statistical Inference with Regression Analysis

Bernoulli and Binomial Distributions. Notes. Bernoulli Trials. Bernoulli/Binomial Random Variables Bernoulli and Binomial Distributions.

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

[ z = 1.48 ; accept H 0 ]

Dynamics in Social Networks and Causality

Chapter 11. Correlation and Regression

14.74 Lecture 10: The returns to human capital: education

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Table Probabilities and Independence

EMERGING MARKETS - Lecture 2: Methodology refresher

CH. 2 POPULATION AND HEALTH

Psych 10 / Stats 60, Practice Problem Set 5 (Week 5 Material) Part 1: Power (and building blocks of power)

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Two-sample inference: Continuous Data

An Introduction to Mplus and Path Analysis

ECON Introductory Econometrics. Lecture 17: Experiments

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

Ohio s State Tests ANSWER KEY & SCORING GUIDELINES GRADE 6 SOCIAL STUDIES PART 1

Event A: at least one tail observed A:

Chapter 10 Relationships Among Variables

Introduction to bivariate analysis

Investigating Climate Change

Causal Inference from Experimental Data

Lectures of STA 231: Biostatistics

Introduction to bivariate analysis

STAT 285: Fall Semester Final Examination Solutions

An Introduction to Path Analysis

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

Predicting the Treatment Status

14.30 Introduction to Statistical Methods in Economics Spring 2009

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

5.2 Fisher information and the Cramer-Rao bound

Quantitative Economics for the Evaluation of the European Policy

CHAPTER 10 Comparing Two Populations or Groups

Targeted Maximum Likelihood Estimation in Safety Analysis

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

ECON1310 Quantitative Economic and Business Analysis A

Path Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis

Counterfactual Model for Learning Systems

Mediation for the 21st Century

Simple Regression Model. January 24, 2011

Notes 6: Multivariate regression ECO 231W - Undergraduate Econometrics

Potential Outcomes and Causal Inference I

ECO375 Tutorial 4 Wooldridge: Chapter 6 and 7

Lecture 9 February 8

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Causal Inference with Big Data Sets

AP Human Geography Chapter 2: Population

SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION.

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:

Lecture: Difference-in-Difference (DID)

Quick Lab. The Structure of Seeds

BIOSTATISTICAL METHODS

Binomial and Poisson Probability Distributions

SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN

Written Exam (2 hours)

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Inferences Based on Two Samples

The key is that there are two disjoint populations, and everyone in the market is on either one side or the other

Quantitative Understanding in Biology 1.7 Bayesian Methods

Lab #12: Exam 3 Review Key

Correlation and simple linear regression S5

Crash Course in Statistics for Neuroscience Center Zurich University of Zurich

10.1. Comparing Two Proportions. Section 10.1

UNIVERSITY OF TORONTO Faculty of Arts and Science

Chapter 20 Comparing Groups

Mathematical statistics

Chapter 7: Sampling Distributions

Inferences About Two Proportions

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

Transcription:

STAT 263/363: Experimental Design Winter 2016/17 Lecture 1 January 18 Lecturer: Art B. Owen Scribe: Julie Zhu Overview Experiments are powerful because you can conclude causality from the results. In most classes, we deal with data analysis and given data. In this class, we will be responsible for the design of data. We are getting the data ourselves - so what should the data look like? If the data is not good enough, it is our responsibility to think about what should be in the data. However, Today s lecture will cover the following topics: 1. Neyman Rubin Causal Model (Holland Reference) 2. History of Experimental Design Other topics we might cover in the future: 1. Bandits 2. Computer Experiments 3. Sparse Regression 1.1 Neyman Ruben Causal Model In experimental design, we want to study the effects of causes, not the causes of effects. Example: We use experiments to see if watering a plant will cause growth. We don t use experiments to determine what caused the plant s growth (Why did the plant grow? Water? Sun? Fertilizers?) Treatment T Control C Population of Items (Units u U) Each unit gets s(u) a treatment where s {T, C}. In principal, any unit could get T or C. Example Gender by birth can t be determined using a causal model. Everyone has a gender by birth. We can t assign genders to others. Example Smoking can have a causal model, but there are other ethical barriers. Can you make people who don t smoke smoke? Where do we draw the line in that case? 1-1

Outcome Variable Y t (u) if s(u) = T Y c (u) if s(u) = C The effect of the treatment on unit u is Y t (u) Y c (u). Fundamental Problem of Causal Inference Note that the effect of the treatment on unit u is Y t (u) Y c (u), but we can t apply both the treatment and effect to the same unit u. We can only see Y t (u) or Y c (u). Y s(u) (u) is what we observe: u s(u) 1 C 2 C 3 T 4 C 5 T The average causal effect is 1 U u U Y t(u) = Y c (u). 1.2 Random Treatments We can understand Y t Y c without observing it through making a few assumptions. Temporal Stability Sometimes we know the outcome of the control Y c (u) is the same before and after. Then, Y t (u) Y c (u) is causal. Unit Stability Y t (u 1 ) = Y t (u 2 ) Y c (u 1 ) = Y c (u 2 ) Then, we see that Y t (u 1 ) Y c (u 2 ) is causal effect. Conditional expectation of treatment outcomes can differ from the actual expectation of response varible. E(Y s(u) s = T ) = E(Y T s = T ) vs. E(Y t ) E(Y s(u) s = C) = E(Y C s = C) vs. E(Y c ) Example In the UK, they experiment and test if milk could help malnourished children. The study 1-2

was to give 2/3 pint of milk to school children. 10,000 students got no milk 5,000 students got whole milk 5,000 students got pasteurized milk After distributing milk to schools in the UK, the researchers measured the students height and weight in February and then in June. There were several confounding factors in the experiment: 1. Children were weighed with their clothes - students had heavier clothing in the winter 2. Teachers would mess with the list and give milk to students they though were not well-off and less nourished 3. Students who were less well-off also had less clothing in the winter Thus, the researchers turned to height. They found that the height metric is irrelevant to clothes and weight. However, the choosing of less nourished students to receive the milk might have still messed with the results. Goal of an Experiment Choose a treatment s(u) independent of Y T (u) and Y C (u). For example, toss a coin to assign treatment and control groups to the population units. Using the coin tossing method, it is possible that we don t get any controls. It might be better to use a simple random sample (SRS) here. Then we can obtain unbiased average estimates as given by: 1 n T s(u)=t Y T (u) 1 n C s(u)=c Y C(u) Y t Y c for each unit could have a high variance, so we take the average. We can double check causal inference by splitting the population up into groups and then analyzing the results. 1.3 History of Experimental Design 1920-1930s R.A Fisher in Agriculture Weather would determine the success of crop yields. Fisher wanted to test to see how he could yield more wheat from his harvests. Advantages included having to wait a year for each experiment, small signal (gain), and large noise in data. 1950s Medicine gave rise to the popular placebo vs. control experiment design. These experiments 1-3

were still noisy. 1950-1960s George Box invited the 2-level experiment. He had data with 20 binary factors. He began factorial experiments in order to test several features at once (since he can t do 20 separate experiments; it was too costly). 1970s Monte Carlo Simulation 1950s Computer Experiments 1950s Use of A/B experiments in E-commerce resulted from the introduction of big data. We see that it s a similar 2-level experiment to what Box figured out, but the labels of T or C are now known. A/A experiments sample in the null and create 2 groups with the same treatment. This is usually to compare to the A/B experiment results to make those more impressive. 1.4 Historical Controls Example There is a new diabetes treatment given 50 people and everyone gets the treatment. The results are then compared to last years data (compared to data for clinic, for the hospital, state, country, etc...) In this example, there are a lot of variables and there are also more confounding factors. It s possible that the entire demographic of the individuals receiving treatment this year and last year are different. It is possible that particular sexes, ages, socio-economic status contribute or detract from recover and study results. Positive Negative Experiment 1 7 Historical 16 5 Usually, a double blind experiment is a good standard (the ones administering the test and the researchers don t actually know who is getting what treatment). Different Medical Treatment Experiments In the cases where we are testing a new drug, we also have to consider the case where the patient didn t not take the medicine. newline Note that testing only population A to C is different than testing population (A + C) vs. (B + D). 1-4

Took Didn t Take Treatment A B Control C D The FDA general standard is to test the latter because the general public might not actually take the medicine. By choosing to take the medication, those people are already different from those who didn t. 1.5 Randomization Inference George Box - When performing analysis, forget you ever did the randomization Don Ruben - When performing analysis, calculate in the fact that you did randomization In this case, if we have two treatments A and B: n 1 got A n 2 got B Note that if we assume A and B have identical distributions (not just the same means and variances), then we see that (Ȳ A Ȳ B ) has the same distribution as ȲA ȲB). Note that assuming the same distribution is a strong assumption. 1-5