Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments. Statistics 101. Gary Larson Duke University
|
|
- Della Barnett
- 6 years ago
- Views:
Transcription
1 Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments Statistics 101 Gary Larson Duke University June 29, 2015
2 Sta 101 Welcome to Stat 101! 1 Welcome to Stat 101! Introduction to Inference Populations and Samples Sampling from a population Sampling bias Observational studies and experiments Cereal breakfast Observations and variables Principles of experimental design Recap 2 Syllabus & policies Logistics Details Support Policies Tips 3 To do
3 Today s agenda Relevant reading from OpenIntro Stat (2nd ed.): Sections 1.2, 1.3, 1.4, 1.5. Other: Intro to inference Populations, and sampling from them Sampling bias Observational studies & experimental studies Types of data Principles of experimental design Course overview (see course website ) Outside survey Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
4 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
5 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
6 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Steps for statistical inference / scientific inquiry 1 Identify a hypothesis or research question Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
7 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Steps for statistical inference / scientific inquiry 1 Identify a hypothesis or research question 2 Collect relevant data Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
8 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Steps for statistical inference / scientific inquiry 1 Identify a hypothesis or research question 2 Collect relevant data 3 Analyze the data Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
9 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Steps for statistical inference / scientific inquiry 1 Identify a hypothesis or research question 2 Collect relevant data 3 Analyze the data 4 Form a conclusion Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
10 Introduction to Inference So...what is statistics? Statistics is the art and science of learning from data. Data are a set of measurements taken on a set of individual units Steps for statistical inference / scientific inquiry 1 Identify a hypothesis or research question 2 Collect relevant data 3 Analyze the data 4 Form a conclusion 5 Communicate the results 6 Present your data Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
11 Introduction to Inference Step 1: Identify a Hypothesis or Research Question A well-formed hypothesis will clearly identify a population and associated parameters of interest. Population: group of individuals or subjects exhibiting some characteristic. We would like to make inferences about this population. Parameters: True values of characteristics in the population we want to study. We make inferences about the population by using data to learn about the parameter(s). Let s make these definitions more concrete via examples... Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
12 Introduction to Inference Step 1: Identify a Hypothesis or Research Question A well-formed hypothesis will clearly identify a population and associated parameters of interest. Population: group of individuals or subjects exhibiting some characteristic. We would like to make inferences about this population. Parameters: True values of characteristics in the population we want to study. We make inferences about the population by using data to learn about the parameter(s). Let s make these definitions more concrete via examples... How many names given to newborn babies in 2012 in the United States begin with the letter j? Population? Parameter? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
13 Introduction to Inference Step 2: Collect the data Each year the Social Security Administration collects and releases data on the how many babies are given a certain name. They released these data for years 1880 to 2013 for each gender. We often store and present such data in data sets, comprised of variables measured on individual cases (a.k.a. records). Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
14 Introduction to Inference Data Sets data set or data matrix variable type price weight 1 small midsize observation midsize Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
15 Introduction to Inference Baby Names Data Set Besides looking at the frequency of first initials, what else could we learn from this data set? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
16 Introduction to Inference Visualize the Data: Rank Table Top Baby Names in 2012 Rank Male Female 1 Jacob Sophia 2 Mason Emma 3 Ethan Isabella 4 Noah Olivia 5 William Ava 6 Liam Emily 7 Michael Abigail 8 Jayden Mia 9 Alexander Madison 10 Aiden Elizabeth oact/ babynames Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
17 Introduction to Inference Visualize the Data: Rank Table Top Baby Names in 2013 Rank Male Female 1 Noah Sophia 2 Liam Emma 3 Jacob Olivia 4 Mason Isabella 5 William Ava 6 Ethan Mia 7 Michael Emily 8 Alexander Abigail 9 Jayden Madison 10 Daniel Elizabeth oact/ babynames Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
18 Introduction to Inference Visualize the Data: Time Dependencies How has the popularity of a name changed over time? voyager#prefix=&sw=both& exact=false Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
19 Introduction to Inference Visualize the data: time dependencies Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
20 Introduction to Inference What about the first initials? 1 Obtain data from SS website: name, gender, frequency. d <- read.csv("yob2012.txt") Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
21 Introduction to Inference What about the first initials? 1 Obtain data from SS website: name, gender, frequency. d <- read.csv("yob2012.txt") 2 Use an R function (substring) to extract the initial of the name. d$initial = substring(d[,1],1,1) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
22 Introduction to Inference What about the first initials? 1 Obtain data from SS website: name, gender, frequency. d <- read.csv("yob2012.txt") 2 Use an R function (substring) to extract the initial of the name. d$initial = substring(d[,1],1,1) 3 Make a barplot of the initials, by gender if desired. barplot(table(d$initial)) barplot(table(d$initial[d$gender == "M"])) barplot(table(d$initial[d$gender == "F"])) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
23 Introduction to Inference Initials All names in A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
24 Introduction to Inference Initials All names in 201 (M) A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Initials All names in 2012 (F) A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
25 Introduction to Inference Step 4: Form a conclusion In 2012, newborn babies in the US were given 3,000 unique names that began with the letter j based on the data from the Social Security database The list of babies from the Social Security data set is a sample, a group of individuals taken from the entire population. The number of individuals in the sample is usually denoted with the letter n. A statistic is any function of the data collected in the sample (e.g., mean, median, etc). So, the count of the babies in the Social Security data set for 2012 who have first initial j is a statistic. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
26 Populations and Samples Data Collection Be aware that there exist bad samples. There are three kinds of lies: lies, damned lies, and statistics. If poor sampling techniques are utilized, then the observed statistics will not be applicable to the true population of interest. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
27 Populations and Samples Data Collection Be aware that there exist bad samples. There are three kinds of lies: lies, damned lies, and statistics. If poor sampling techniques are utilized, then the observed statistics will not be applicable to the true population of interest. Example: data collection: Raise your hand if you have been on an airplane in the past two years. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
28 Populations and Samples Data Collection Be aware that there exist bad samples. There are three kinds of lies: lies, damned lies, and statistics. If poor sampling techniques are utilized, then the observed statistics will not be applicable to the true population of interest. Example: data collection: Raise your hand if you have been on an airplane in the past two years. What does this tell us about how many year olds have ridden an airplane in the past two years? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
29 Sampling from a population Census Wouldn t it be better to just include everyone and sample the entire population, i.e. conduct a census? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
30 Sampling from a population Census Wouldn t it be better to just include everyone and sample the entire population, i.e. conduct a census? Some individuals are hard to locate or hard to measure. And these difficult-to-find people may have certain characteristics that distinguish them from the rest of the population. Even if you could take a census, a population is always changing, so it s never possible to get a perfect measure. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
31 Sampling from a population Census Wouldn t it be better to just include everyone and sample the entire population, i.e. conduct a census? Some individuals are hard to locate or hard to measure. And these difficult-to-find people may have certain characteristics that distinguish them from the rest of the population. Even if you could take a census, a population is always changing, so it s never possible to get a perfect measure. templates/ story/ story.php?storyid= Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
32 Sampling from a population Exploratory analysis to inference Sampling is natural. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
33 Sampling from a population Exploratory analysis to inference Sampling is natural. Think about sampling something you are cooking - you taste (examine) a small part of what you re cooking to get an idea about the dish as a whole. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
34 Sampling from a population Exploratory analysis to inference Sampling is natural. Think about sampling something you are cooking - you taste (examine) a small part of what you re cooking to get an idea about the dish as a whole. When you taste a spoonful of chili and decide the single spoonful you tasted isn t spicy enough, that s exploratory analysis. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
35 Sampling from a population Exploratory analysis to inference Sampling is natural. Think about sampling something you are cooking - you taste (examine) a small part of what you re cooking to get an idea about the dish as a whole. When you taste a spoonful of chili and decide the single spoonful you tasted isn t spicy enough, that s exploratory analysis. If you generalize and conclude that your entire chili is not spicy enough either, that s an inference. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
36 Sampling from a population Exploratory analysis to inference Sampling is natural. Think about sampling something you are cooking - you taste (examine) a small part of what you re cooking to get an idea about the dish as a whole. When you taste a spoonful of chili and decide the single spoonful you tasted isn t spicy enough, that s exploratory analysis. If you generalize and conclude that your entire chili is not spicy enough either, that s an inference. For your inference to be valid, the sample (the spoonful you tasted) needs to be representative of the population (entire pot). If all the spices are sitting at the bottom of the pot, what you tasted is probably not representative of the whole pot. If all the spices are well-mixed before you taste, your spoonful will more likely be representative of the whole pot. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
37 Sampling bias Landon vs. FDR A historical example of a biased sample yielding misleading results: In 1936, Landon was the Republican presidential candidate opposing the re-election of FDR. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
38 Sampling bias The Literary Digest poll The sample: The Literary Digest polled about 10 million Americans, and got responses from about 2.4 million. The inference: Landon would likely be the overwhelming winner and FDR would get only 43% of the votes. The reality: Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
39 Sampling bias The Literary Digest poll The sample: The Literary Digest polled about 10 million Americans, and got responses from about 2.4 million. The inference: Landon would likely be the overwhelming winner and FDR would get only 43% of the votes. The reality: FDR won, with 62% of the votes. The magazine was completely discredited because of the poll, and was soon discontinued. How could they have been so wrong? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
40 Sampling bias The Literary Digest Poll - what went wrong The magazine had surveyed its own readers, registered automobile owners, and registered telephone users. These groups had incomes well above the national average of the day (remember, this is Great Depression era) which resulted in lists of voters far more likely to support Republicans than a truly typical voter of the time, i.e. the sample was not representative of the American population at the time, so it was impossible to accurately generalize the results from the sample to the population. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
41 Sampling bias Large samples are preferable, but... The Literary Digest election poll was based on a sample size of 2.4 million, which is huge, but since the sample was biased, the sample did not yield an accurate prediction. Back to the chili analogy: If the chili is not well stirred, it doesn t matter how large a spoon you have, it will still not taste right (the sample won t be representative of the population). If the chili is well stirred, a small spoon will suffice to test the chili. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
42 Sampling bias A few sources of bias Non-response: If only a (non-random) fraction of the randomly sampled people choose to respond to a survey, the sample may no longer be representative of the population. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
43 Sampling bias A few sources of bias Non-response: If only a (non-random) fraction of the randomly sampled people choose to respond to a survey, the sample may no longer be representative of the population. Voluntary response: Occurs when the sample consists of people who volunteer to respond because they have strong opinions on the issue, and hence is not representative of the population. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
44 Sampling bias A few sources of bias Non-response: If only a (non-random) fraction of the randomly sampled people choose to respond to a survey, the sample may no longer be representative of the population. Voluntary response: Occurs when the sample consists of people who volunteer to respond because they have strong opinions on the issue, and hence is not representative of the population. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
45 Sampling bias A few sources of bias Non-response: If only a (non-random) fraction of the randomly sampled people choose to respond to a survey, the sample may no longer be representative of the population. Voluntary response: Occurs when the sample consists of people who volunteer to respond because they have strong opinions on the issue, and hence is not representative of the population. edition.com, Aug 29, 2013 Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
46 Sampling bias A few sources of bias Non-response: If only a (non-random) fraction of the randomly sampled people choose to respond to a survey, the sample may no longer be representative of the population. Voluntary response: Occurs when the sample consists of people who volunteer to respond because they have strong opinions on the issue, and hence is not representative of the population. edition.com, Aug 29, 2013 Convenience sample: Individuals who are easily accessible are more likely to be included in the sample. What type of bias do reviews on Amazon.com have? What about reviews on RateMyProfessor.com? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
47 Sampling bias Participation question A school district is considering whether it will no longer allow high school students to park at school after two recent accidents where students were severely injured. As a first step, they survey parents by mail, asking them whether or not the parents would object to this policy change. Of 6,000 surveys that go out, 1,200 are returned. Of these 1,200 surveys that were completed, 960 agreed with the policy change and 240 disagreed. Which of the following statements are true? I. Some of the mailings may have never reached the parents. II. The school district has strong support from parents to move forward with the policy approval. III. It is possible that majority of the parents of high school students disagree with the policy change. IV. The survey results are unlikely to be biased because all parents were mailed a survey. (a) Only I (b) I and II (c) I and III (d) III and IV (e) Only IV Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
48 Sampling bias Participation question A school district is considering whether it will no longer allow high school students to park at school after two recent accidents where students were severely injured. As a first step, they survey parents by mail, asking them whether or not the parents would object to this policy change. Of 6,000 surveys that go out, 1,200 are returned. Of these 1,200 surveys that were completed, 960 agreed with the policy change and 240 disagreed. Which of the following statements are true? I. Some of the mailings may have never reached the parents. II. The school district has strong support from parents to move forward with the policy approval. III. It is possible that majority of the parents of high school students disagree with the policy change. IV. The survey results are unlikely to be biased because all parents were mailed a survey. (a) Only I (b) I and II (c) I and III (d) III and IV (e) Only IV Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
49 Sampling bias A picture s worth a lot, but... A lot of the time we only have part of the story. BabyCenter: Our data comes from nearly half a million parents who shared their baby s name with us in top-baby-names The Netflix effect Orange is the New Black: Galina, Piper, Nicky, Alex, Gloria House of Cards : Garrett, Claire, Robin, Wright 2 A blizzard of Frozen names (Elsa, Hans, Kristin) Are we comfortable making decisions about these name trends based on this data? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
50 Sampling bias A picture s worth a lot, but... A lot of the time we only have part of the story. BabyCenter: Our data comes from nearly half a million parents who shared their baby s name with us in top-baby-names The Netflix effect Orange is the New Black: Galina, Piper, Nicky, Alex, Gloria House of Cards : Garrett, Claire, Robin, Wright 2 A blizzard of Frozen names (Elsa, Hans, Kristin) Are we comfortable making decisions about these name trends based on this data? The name Elsa soared 29 percent on our list of names for baby girls. Is this sample statistic enough for us to conclude that the population parameter of the percent of newborn girls in the United States who are named Elsa has increased from 2013 to 2014? top-baby-names-2014 Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
51 Observational studies and experiments Correlation vs. causation 1 Is there an increase in the popularity of the number of baby girls named Elsa from 2013 to 2014? 2 Has the popularity in Frozen caused an increase in the number of baby girls that were named Elsa? Causal Effect 3 Is the popularity in Frozen related to the increase in the number of baby girls that were named Elsa? Correlation, or relationship We collect our data differently depending on the type of relationship (causal or correlation) that we are interested in. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
52 Observational studies and experiments Observational vs. experimental studies An experimental study (a.k.a. experiment ) is a controlled study in which the researchers impose treatments upon the subjects. Subjects are assigned to control and treatment groups using random assignment. Experiments are the preferred method of data collection we can conclude that the treatments caused the response of the study (i.e. results can be attributed as causal) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
53 Observational studies and experiments Blocking We would like to design an experiment to investigate if energy gels makes you run faster: Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
54 Observational studies and experiments Blocking We would like to design an experiment to investigate if energy gels makes you run faster: Treatment: energy gel Control: no energy gel Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
55 Observational studies and experiments Blocking We would like to design an experiment to investigate if energy gels makes you run faster: Treatment: energy gel Control: no energy gel It is suspected that energy gels might affect pro and amateur athletes differently, therefore we block for pro status: Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
56 Observational studies and experiments Blocking We would like to design an experiment to investigate if energy gels makes you run faster: Treatment: energy gel Control: no energy gel It is suspected that energy gels might affect pro and amateur athletes differently, therefore we block for pro status: Divide the sample to pro and amateur Randomly assign pro athletes to treatment and control groups Randomly assign amateur athletes to treatment and control groups Pro/amateur status is equally represented in the resulting treatment and control groups Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
57 Observational studies and experiments Blocking We would like to design an experiment to investigate if energy gels makes you run faster: Treatment: energy gel Control: no energy gel It is suspected that energy gels might affect pro and amateur athletes differently, therefore we block for pro status: Divide the sample to pro and amateur Randomly assign pro athletes to treatment and control groups Randomly assign amateur athletes to treatment and control groups Pro/amateur status is equally represented in the resulting treatment and control groups Why is this important? Can you think of other variables to block for? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
58 Observational studies and experiments Observational vs. experimental studies An experimental study (a.k.a. experiment ) is a controlled study in which the researchers impose treatments upon the subjects. Subjects are assigned to control and treatment groups using random assignment. Experiments are the preferred method of data collection we can conclude that the treatments caused the response of the study (i.e. results can be attributed as causal) Experiments are not always feasible or ethical (e.g. effect of second-hand smoke on newborns) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
59 Observational studies and experiments Observational vs. experimental studies An experimental study (a.k.a. experiment ) is a controlled study in which the researchers impose treatments upon the subjects. Subjects are assigned to control and treatment groups using random assignment. Experiments are the preferred method of data collection we can conclude that the treatments caused the response of the study (i.e. results can be attributed as causal) Experiments are not always feasible or ethical (e.g. effect of second-hand smoke on newborns) An observational study is a study in which the researchers did not assign the subjects to treatments. Observational studies retain the notion of treatment and control groups. Observational studies still require the researcher to clearly define a research question. This requires identification of the response variable that they will measure on each subject in the study. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
60 Cereal breakfast Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
61 Cereal breakfast What type of study is this, observational study or an experiment? Girls who regularly ate breakfast, particularly one that includes cereal, were slimmer than those who skipped the morning meal, according to a study that tracked nearly 2,400 girls for 10 years. [...] As part of the survey, the girls were asked once a year what they had eaten during the previous three days. What is the conclusion of the study? Who sponsored the study? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
62 Cereal breakfast What type of study is this, observational study or an experiment? Girls who regularly ate breakfast, particularly one that includes cereal, were slimmer than those who skipped the morning meal, according to a study that tracked nearly 2,400 girls for 10 years. [...] As part of the survey, the girls were asked once a year what they had eaten during the previous three days. This is an observational study since the researchers merely observed the behavior of the girls (subjects) as opposed to imposing treatments on them. What is the conclusion of the study? Who sponsored the study? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
63 Cereal breakfast What type of study is this, observational study or an experiment? Girls who regularly ate breakfast, particularly one that includes cereal, were slimmer than those who skipped the morning meal, according to a study that tracked nearly 2,400 girls for 10 years. [...] As part of the survey, the girls were asked once a year what they had eaten during the previous three days. This is an observational study since the researchers merely observed the behavior of the girls (subjects) as opposed to imposing treatments on them. What is the conclusion of the study? There is an association between girls eating breakfast and being slimmer. Who sponsored the study? Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
64 Cereal breakfast What type of study is this, observational study or an experiment? Girls who regularly ate breakfast, particularly one that includes cereal, were slimmer than those who skipped the morning meal, according to a study that tracked nearly 2,400 girls for 10 years. [...] As part of the survey, the girls were asked once a year what they had eaten during the previous three days. This is an observational study since the researchers merely observed the behavior of the girls (subjects) as opposed to imposing treatments on them. What is the conclusion of the study? There is an association between girls eating breakfast and being slimmer. Who sponsored the study? General Mills. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
65 Cereal breakfast 3 possible explanations: Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
66 Cereal breakfast 3 possible explanations: 1 Eating breakfast causes girls to be thinner. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
67 Cereal breakfast 3 possible explanations: 1 Eating breakfast causes girls to be thinner. 2 Being thin causes girls to eat breakfast. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
68 Cereal breakfast 3 possible explanations: 1 Eating breakfast causes girls to be thinner. 2 Being thin causes girls to eat breakfast. 3 A third variable is responsible for both. What could it be? Any extraneous variables that affect both the explanatory and the response variable and that make it seem like there is a relationship between the two are called confounding variables. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
69 Cereal breakfast Observational studies and experiments (Recap) Observational study: Researchers collect data in a way that does not directly interfere with how the data arise, i.e. they merely observe, and can only establish an association between the explanatory and response variables. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
70 Cereal breakfast Observational studies and experiments (Recap) Observational study: Researchers collect data in a way that does not directly interfere with how the data arise, i.e. they merely observe, and can only establish an association between the explanatory and response variables. Experiment: Researchers randomly assign subjects to various treatments in order to establish causal connections between the explanatory and response variables. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
71 Cereal breakfast Observational studies and experiments (Recap) Observational study: Researchers collect data in a way that does not directly interfere with how the data arise, i.e. they merely observe, and can only establish an association between the explanatory and response variables. Experiment: Researchers randomly assign subjects to various treatments in order to establish causal connections between the explanatory and response variables. If you re going to walk away with one thing from this class: correlation does not imply causation. xkcd.com/ 552/ Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
72 Cereal breakfast Random assignment vs. random sampling ideal experiment Random sampling No random sampling Random assignment Causal conclusion, generalized to the whole population. Causal conclusion, only for the sample. No random assignment No causal conclusion, correlation statement generalized to the whole population. No causal conclusion, correlation statement only for the sample. most observational studies Generalizability No generalizability most experiments Causation Correlation bad observational studies Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
73 Observations and variables Types of variables all variables numerical categorical continuous discrete regular categorical ordinal measured counted unordered categories ordered categories Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
74 Observations and variables Types of variables (cont.) type: small, midsize or large. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
75 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
76 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
77 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
78 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
79 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
80 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
81 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD (categorical) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
82 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD (categorical) passengers: passenger capacity Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
83 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD (categorical) passengers: passenger capacity (numerical, discrete) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
84 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD (categorical) passengers: passenger capacity (numerical, discrete) weight: car weight in pounds Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
85 Observations and variables Types of variables (cont.) type: small, midsize or large. (categorical, ordinal) price: average price in $1000 s (numerical, continuous) mpgcity: city mileage per gallon (numerical, continuous) drivetrain: front, rear, 4WD (categorical) passengers: passenger capacity (numerical, discrete) weight: car weight in pounds (numerical, continuous) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
86 Principles of experimental design Participation question A study is designed to test the effect of light level and noise level on exam performance of students. The researcher also believes that light and noise levels might have different effects on males and females, so wants to make sure both genders are represented equally under different conditions. Which of the below is correct? (a) There are 3 explanatory variables (light, noise, gender) and 1 response variable (exam performance) (b) There are 2 explanatory variables (light and noise), 1 blocking variable (gender), and 1 response variable (exam performance) (c) There is 1 explanatory variable (gender) and 3 response variables (light, noise, exam performance) (d) There are 2 blocking variables (light and noise), 1 explanatory variable (gender), and 1 response variable (exam performance) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
87 Principles of experimental design Participation question A study is designed to test the effect of light level and noise level on exam performance of students. The researcher also believes that light and noise levels might have different effects on males and females, so wants to make sure both genders are represented equally under different conditions. Which of the below is correct? (a) There are 3 explanatory variables (light, noise, gender) and 1 response variable (exam performance) (b) There are 2 explanatory variables (light and noise), 1 blocking variable (gender), and 1 response variable (exam performance) (c) There is 1 explanatory variable (gender) and 3 response variables (light, noise, exam performance) (d) There are 2 blocking variables (light and noise), 1 explanatory variable (gender), and 1 response variable (exam performance) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
88 Principles of experimental design Difference between blocking and explanatory variables Factors are conditions we can impose on the experimental units. Blocking variables are characteristics that the experimental units come with, that we would like to control for. Blocking is like stratifying, except: Blocking used in experimental settings when randomly assigning the sample to treatment / control. Stratifying used when sampling. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
89 Principles of experimental design Principles of experimental design Reading: Open Intro Chapter 1.5: Experiments 1 Control: Compare treatment of interest to a control group. 2 Randomize: Randomly assign subjects to treatments. 3 Replicate: Within a study, replicate by collecting a sufficiently large sample. Or replicate the entire study. 4 Block: If there are variables that are known or suspected to affect the response variable, first group subjects into blocks based on these variables, and then within each block, randomize cases to treatment/control groups. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
90 Principles of experimental design More experimental design terminology... Placebo: fake treatment, often used as the control group for medical studies Placebo effect: experimental units showing improvement simply because they believe they are receiving a special treatment Blinding: when experimental units do not know whether they are in the control or treatment group Double-blind: when both the experimental units and the researchers do not know who is in the control and who is in the treatment group Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
91 Recap Participation question What is the main difference between observational studies and experiments? (a) Experiments take place in a lab while observational studies do not need to. (b) In an observational study we only look at what happened in the past. (c) Most experiments use random assignment while observational studies do not. (d) Observational studies are completely useless since no causal inference can be made based on their findings. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
92 Recap Participation question What is the main difference between observational studies and experiments? (a) Experiments take place in a lab while observational studies do not need to. (b) In an observational study we only look at what happened in the past. (c) Most experiments use random assignment while observational studies do not. (d) Observational studies are completely useless since no causal inference can be made based on their findings. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
93 Recap More... Want more baby name analysis? Freakonomics podcast: How Much Does Your Name Matter? freakonomics.com/ 2013/ 04/ 08/ how-much-does-your-name-matter-a-new-freakonomics-radio-podcast/ Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
94 Sta 101 Syllabus & policies 1 Welcome to Stat 101! Introduction to Inference Populations and Samples Sampling from a population Sampling bias Observational studies and experiments Cereal breakfast Observations and variables Principles of experimental design Recap 2 Syllabus & policies Logistics Details Support Policies Tips 3 To do
95 Syllabus & policies Logistics General Info Instructor: Lecture: Lab: Gary Larson Weekdays 11:00 AM - 12:15 PM Link 088 (Classroom 4) Tu, Th 1:30 PM - 3 PM Link 072 (Classroom 6) Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
96 Syllabus & policies Logistics Office Hours These are subject to change. Check the website for the most current. Mon 12:45-2:00 PM Tue 3:15-4:30 PM Wed 2:15-3:30 PM Thu 3:15-4:30 PM Fri 12:45-2:00 PM If asking about a problem set or lab, you must have tried it out before coming to Office Hours! Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
97 Syllabus & policies Logistics Required materials Textbook Calculator OpenIntro Statistics Diez, Barr, Çetinkaya-Rundel ISBN: Available at link above (free) or in print from Amazon (for about $10) (Optional) You might need a four function calculator that can do square roots for this class. No limitation on the type of calculator you can use. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
98 Syllabus & policies Logistics Webpage stat.duke.edu/ gjl7/ courses/ summer15/ sta / Also accessible via: Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
99 Syllabus & policies Details Course structure Seven learning units. Set of learning objectives and required and suggested readings, videos, etc. for each unit. Prior to beginning the unit, complete the readings and familiarize yourselves with the learning objectives. Class time: split between lecture, discussion/application. Computing labs. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
100 Syllabus & policies Details Class Slides will be posted on the course webpage (under schedule) on the day of the course. Discussion of concepts as well as hands on activities and exercises to complement them. Attend class to keep up with the pace and not fall behind + to contribute to application activities completed in teams. You are responsible for all the material covered in all components of the course, not just the class. Please ask questions in class, office-hours or by if you are struggling (or just curious), do not wait until just before an exam - it will be too late! Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
101 Syllabus & policies Details Participation questions: attendance and participation Objective: Make you an active participant and help me pace the class. On new material being discussed in class that day. Credit for participation, regardless of whether you have the correct answer. Up to two unexcused late arrivals or absences will not affect your participation grade. While I might sometimes call on you during the class discussion, it is your responsibility to be an active participant without being called on. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
102 Syllabus & policies Details Problem sets and labs Problem sets: Objective: Help you develop a more in-depth understanding of the material and help you prepare for exams. Individual: collaborate but don t copy! submit in class, show all work. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
103 Syllabus & policies Details Problem sets and labs Problem sets: Objective: Help you develop a more in-depth understanding of the material and help you prepare for exams. Individual: collaborate but don t copy! submit in class, show all work. Labs: Objective: Give you hands on experience with data analysis using statistical software. In partners turn in lab report on Sakai by the following day at 5 PM. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
104 Syllabus & policies Details Problem sets and labs Problem sets: Objective: Help you develop a more in-depth understanding of the material and help you prepare for exams. Individual: collaborate but don t copy! submit in class, show all work. Labs: Objective: Give you hands on experience with data analysis using statistical software. In partners turn in lab report on Sakai by the following day at 5 PM. Lowest score dropped for both. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
105 Syllabus & policies Details Exams Midterm: TENTATIVE Monday, July 20, in class Final: Saturday, August 8th (9:00 AM - 12:00 PM) (Cumulative) Exam dates cannot be changed. No make-up exams will be given. If you cannot take the exams on these dates you should drop this class. You must bring a calculator to the exams (no cell phones, etc.) and you are also allowed to bring one sheet of notes ( cheat sheet ). This sheet must be no larger than and must be prepared by you (no photocopies). You may use both sides of the sheet. Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
106 Syllabus & policies Details Grading In Class Participation/Activities: 5% Quizzes: 5% Problem sets: 20% Labs: 15% Midterm: 25% Final: 30% Sta 101 (G. Larson Duke University) U1 - L1: Data coll., obs. studies, experiments June 29, / 57
Announcements. Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments. Statistics 101
Announcements Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments Statistics 101 Mine Çetinkaya-Rundel Duke University I m still waiting on a couple more Gmail
More informationHave you... Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments. Readiness assessment
Have you... Unit 1: Introduction to data Lecture 1: Data collection, observational studies, and experiments Statistics 101 Mine Çetinkaya-Rundel January 15, 2013 been placed into a team? successfully logged
More informationAnnouncements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias
Recap Announcements Lecture 5: Statistics 101 Mine Çetinkaya-Rundel September 13, 2011 HW1 due TA hours Thursday - Sunday 4pm - 9pm at Old Chem 211A If you added the class last week please make sure to
More informationCourse Staff. Textbook
Course Staff CS311H: Discrete Mathematics Intro and Propositional Logic Instructor: Işıl Dillig Instructor: Prof. Işıl Dillig TAs: Jacob Van Geffen, Varun Adiga, Akshay Gupta Class meets every Monday,
More informationSampling. Benjamin Graham
Sampling Benjamin Graham Schedule This Week: Sampling and External Validity How many kids? Fertility rate in the US. could be interesting as an independent or a dependent variable. How many children did
More informationWelcome to Physics 161 Elements of Physics Fall 2018, Sept 4. Wim Kloet
Welcome to Physics 161 Elements of Physics Fall 2018, Sept 4 Wim Kloet 1 Lecture 1 TOPICS Administration - course web page - contact details Course materials - text book - iclicker - syllabus Course Components
More informationLECTURE 15: SIMPLE LINEAR REGRESSION I
David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).
More informationChapter Goals. To introduce you to data collection
Chapter Goals To introduce you to data collection You will learn to think critically about the data collected or presented learn various methods for selecting a sample Formulate Theories Interpret Results/Make
More information1.1 Administrative Stuff
601.433 / 601.633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Introduction, Karatsuba/Strassen Date: 9/4/18 1.1 Administrative Stuff Welcome to Algorithms! In this class you will learn the
More informationAS 101: The Solar System (Spring 2017) Course Syllabus
AS 101: The Solar System (Spring 2017) Course Syllabus Instructor: Professor Wen Li Office: CAS 501 Phone: 617-353-7439 Email: wenli77@bu.edu Office hours: Mondays 3:30 5:00 pm, Wednesdays 3:30 5:00 pm,
More informationStat 406: Algorithms for classification and prediction. Lecture 1: Introduction. Kevin Murphy. Mon 7 January,
1 Stat 406: Algorithms for classification and prediction Lecture 1: Introduction Kevin Murphy Mon 7 January, 2008 1 1 Slides last updated on January 7, 2008 Outline 2 Administrivia Some basic definitions.
More informationGeneral Info. Grading
Syllabus & Policies General Info Lecture 1: Introduction, Set Theory, and Boolean Algebra Classroom: Perkins 2-072 Time: Mon - Fri, 2:00-3:15 pm Wed, 3:30-4:30 pm Sta 111 Colin Rundel May 13, 2014 Professor:
More informationappstats8.notebook October 11, 2016
Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus
More informationOffice Hours: Dr. Kruse: Tue, 14:30-15:30 & Fri, 10:30-11:30 in ABB 142 (Chemistry Help Centre) TA's: tutorial time
Chem 2P03 & ChemBio 2P03 Course Outline - Fall 2016 Applications of Physical Chemistry Prof. P. Kruse, ABB-263, x23480, pkruse@mcmaster.ca http://www.chemistry.mcmaster.ca/kruse/ version 16 August 2016
More informationEcon 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias
Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.
More informationSYLLABUS SEFS 540 / ESRM 490 B Optimization Techniques for Natural Resources Spring 2017
SYLLABUS SEFS 540 / ESRM 490 B Optimization Techniques for Natural Resources Spring 2017 Lectures: Winkenwerder Hall 107, 4:50-5:50pm, MW Labs: Mary Gates Hall 030, 1:30-2:50pm, Th Course Web Site: http://faculty.washington.edu/toths/course.shtml
More informationST 371 (IX): Theories of Sampling Distributions
ST 371 (IX): Theories of Sampling Distributions 1 Sample, Population, Parameter and Statistic The major use of inferential statistics is to use information from a sample to infer characteristics about
More informationLinear Algebra. Instructor: Justin Ryan
Linear Algebra Instructor: Justin Ryan ryan@math.wichita.edu Department of Mathematics, Statistics, and Physics Wichita State University Wichita, Kansas Summer 2014 DRAFT 3 June 2014 Preface These lecture
More informationGEOG 105 THE DIGITAL EARTH Spring 2017
GEOG 105 THE DIGITAL EARTH Spring 2017 Instructor: Dr. Julie Cidell E-mail: jcidell@illinois.edu Office: CAB 251 (244-4665) Office hours: W 1:30-3:30 or by appt. TA: Donald Planey TA E-mail: planey1@illinois.edu
More informationGeography 1103: Spatial Thinking
Geography 1103: Spatial Thinking Lecture: T\TH 8:00-9:15 am (McEniry 401) Lab: Wed 2:00-4:30 pm (McEniry 420) Instructor: Dr. Elizabeth C. Delmelle Email: edelmell@uncc.edu Office: McEniry 419 Phone: 704-687-5932
More informationSan Jose State University Department of Mechanical and Aerospace Engineering ME 230, Advanced Mechanical Engineering Analysis, Fall 2015
San Jose State University Department of Mechanical and Aerospace Engineering ME 230, Advanced Mechanical Engineering Analysis, Fall 2015 Instructor: Office Location: Younes Shabany TBD Telephone: (408)
More informationAstronomy Course Syllabus
Astronomy Course Syllabus Course: ASTR& 100 Title: Survey of Astronomy Section: DE Term: 2017 Spring Days: Online Time: Online Location: Online Instructor: Julie Masura Phone None E-mail: Canvas intranet
More informationPh 1a Fall General Information
Ph 1a Fall 2017 General Information Lecturer Jonas Zmuidzinas 306 Cahill, Ext. 6229, jonas@caltech.edu Lectures are on Wednesdays and Fridays, 11:00-11:55 am, in 201 E. Bridge. Course Administrator Meagan
More informationSolar Open House Toolkit
A Solar Open House is an informal meet and greet at a solar homeowner s home. It is an opportunity for homeowners who are considering going solar to see solar energy at work, ask questions about the process
More informationGeological Foundations of Environmental Sciences
Geological Foundations of Environmental Sciences David C. Elbert Office: Olin Hall 228 Department of Earth and Planetary Sciences Johns Hopkins University 3400 N. Charles St. Baltimore, MD 21218 Phone:
More informationASTRONOMY 10 De Anza College
ASTRONOMY De Anza College Section 1 M - F, 7:0-8: am De Anza Planetarium (PLT) ek Cichanski Office: S-1a Office hours: Mon thru Fri :0am-:pm, other times by appt. Office phone: (408) 84-84 Email: cichanskimarek@fhda.edu
More informationAnnouncements. Lecture 18: Simple Linear Regression. Poverty vs. HS graduate rate
Announcements Announcements Lecture : Simple Linear Regression Statistics 1 Mine Çetinkaya-Rundel March 29, 2 Midterm 2 - same regrade request policy: On a separate sheet write up your request, describing
More informationUnit5: Inferenceforcategoricaldata. 4. MT2 Review. Sta Fall Duke University, Department of Statistical Science
Unit5: Inferenceforcategoricaldata 4. MT2 Review Sta 101 - Fall 2015 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_f15 Outline 1. Housekeeping
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And
More informationAnnouncements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22
Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging
More informationLearning From Data Lecture 14 Three Learning Principles
Learning From Data Lecture 14 Three Learning Principles Occam s Razor Sampling Bias Data Snooping M. Magdon-Ismail CSCI 4100/6100 recap: Validation and Cross Validation Validation Cross Validation D (N)
More informationMath 410 Linear Algebra Summer Session American River College
Course Information Instructor: Kristin Lui Email: luik@arc.losrios.edu Office Hours: By appointment Location: Liberal Arts 163 ARC Main Campus Meet Times: Tuesday/Thursday 6:30 pm 9:40 pm Dates: June 16,
More informationAS The Astronomical Universe. Prof. Merav Opher - Fall 2013
SYLLABUS AS 102 - The Astronomical Universe Prof. Merav Opher - Fall 2013 Course Catalog Summary: The birth and death of stars; red giants, white dwarfs, black holes; our galaxy, the Milky Way, and other
More informationChemistry 21 Organic Chemistry I
Chemistry 21 Organic Chemistry I Summer, 2012 Section 1092 Dr. J. F. Harwig Office: Sci 274 Voice Mail: 310-434-4747 Email: Harwig_John@smc.edu Homepage: http://homepage.smc.edu/harwig_john/ PREREQUISITES
More informationHunan University. CHEM32: Organic Chemistry
Academic Inquiries: Hunan University Email: iss@hnu.edu.cn Hunan University CHEM32: Organic Chemistry Professor: To be announced Total contact hours: 54 hours Credit: 4 Course Description Topics covered
More informationMATH 345 Differential Equations
MATH 345 Differential Equations Spring 2018 Instructor: Time: Dr. Manuela Girotti; office: Weber 223C email: manuela.girotti@colostate.edu Mon-Tue-Wed-Fri 1:00pm-1:50pm Location: Engineering E 206 Office
More informationChapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,
More informationCHEM 25: Organic Chemistry I https://moodle.drew.edu (2009FA-CHEM )
CHEM 25: Organic Chemistry I https://moodle.drew.edu (2009FA-CHEM-25-001) Instructor Jane M. Liu, Ph.D. Hall of Sciences S212 jliu3@drew.edu (973-408-3303) Class 9:30-10:35am M, W, F (S244) Laboratory
More informationMATH 10 INTRODUCTORY STATISTICS
MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And also sent out via email. 30% weekly homework.
More informationGEOL 0086 Environmental Geology
GEOL 0086 Environmental Geology Instructor: Dr. Carrie Davis Todd E-mail address: Phone: Office: Krebs 244 Office hours: Monday 3 4 pm; Tuesday 1 2 pm; Wednesday 9 10 am; or by appointment Textbook: Geology
More informationWelcome. to Physics 2135.
Welcome to Physics 2135. PHYSICS 2135 Engineering Physics II Dr. S. Thomas Vojta Instructor in charge Office: 204 Physics, Phone: 341-4793 vojtat@mst.edu www.mst.edu/~vojtat Office hours: Mon+ Wed 11am-12pm
More informationLabs: Chelsea Ackroyd Office Location: FMAB 005 Office Hours: M 08:45 11:45 AM or by appointment
Introduction to Geographic Information Systems (GEOG 3140 and 314) & GIS Fundamentals (GEOG 6139) Fall 2017 Section 002 M 01:25 02:45 PM Section 003 M 03:00 4:20 PM Section 004 W 01:25 02:45 PM M Lib 1150
More informationASTR 4 Solar System Astronom y
Lectures: M thru F, :30-:0 am Location: De Anza Planetarium ASTR 4 Solar System Astronom y Marek Cichanski Office: S-a cichanskimarek@fhda.edu (40) 4-4 Office Hours: M thru F, :30 0:0 am In This Syllabus
More informationAnnouncements. Final Review: Units 1-7
Announcements Announcements Final : Units 1-7 Statistics 104 Mine Çetinkaya-Rundel June 24, 2013 Final on Wed: cheat sheet (one sheet, front and back) and calculator Must have webcam + audio on at all
More informationLectures of STA 231: Biostatistics
Lectures of STA 231: Biostatistics Second Semester Academic Year 2016/2017 Text Book Biostatistics: Basic Concepts and Methodology for the Health Sciences (10 th Edition, 2014) By Wayne W. Daniel Prepared
More informationDraft Proof - Do not copy, post, or distribute
1 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1. Distinguish between descriptive and inferential statistics. Introduction to Statistics 2. Explain how samples and populations,
More informationPhysics 2BL: Experiments in Mechanics and Electricity Summer Session I, 2012
Physics BL: Experiments in Mechanics and Electricity Summer Session I, 01 Instructor: E-mail: Office: Office Hours: Phone: Tera (Bell) Austrum tbell@physics.ucsd.edu 164 Mayer Hall Addition TuTh 6-7 pm
More informationwhere Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.
Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter
More informationNow we will define some common sampling plans and discuss their strengths and limitations.
Now we will define some common sampling plans and discuss their strengths and limitations. 1 For volunteer samples individuals are self selected. Participants decide to include themselves in the study.
More informationIntroduction. Outline. EE3321 Electromagnetic Field Theory. Welcome! About this class Rules and syllabus Let s get started! 9/19/2018. E t.
D v B 0 B E t D H J t D B E H Introduction EE3321 Electromagnetic Field Theory Outline Welcome! About this class Rules and syllabus Let s get started! Slide 2 1 Welcome Course Information http://emlab.utep.edu/ee3321emf.htm
More informationChemistry 110 General Chemistry, Course Lecture MWF 8:30 am 9:50 am Room NSM C221 Laboratory M or W 1:00 pm 3:50 pm Room NSM B340
Course Syllabus Fall 2009 CSU Dominguez Hills Chemistry 110 General Chemistry, Course 40318 Lecture MWF 8:30 am 9:50 am Room NSM C221 Laboratory M or W 1:00 pm 3:50 pm Room NSM B340 Instructor: Dr. Kenneth
More informationStatics - TAM 210 & TAM 211. Spring 2018
Statics - TAM 210 & TAM 211 Spring 2018 Course distribution Required TAM 210 TAM 211 Aerospace Engineering 31 1 Agricultural & Biological 12 3 Bioengineering 2 6 Civil Engineering 41 Engineering Mechanics
More informationGeneral Chemistry I Chemistry 101B Fall 2002 Department of Chemistry Colgate University
Instructor: Germaine Gogel Office Hours: General Chemistry I Chemistry 101B Fall 2002 Department of Chemistry Colgate University Lecture: 9:20 10:10 on Monday, Wednesday, and Friday Classroom: Wynn 111
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationMath 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011
Math 31 Lesson Plan Day 2: Sets; Binary Operations Elizabeth Gillaspy September 23, 2011 Supplies needed: 30 worksheets. Scratch paper? Sign in sheet Goals for myself: Tell them what you re going to tell
More informationAdditional practice with these ideas can be found in the problems for Tintle Section P.1.1
Psych 10 / Stats 60, Practice Problem Set 3 (Week 3 Material) Part 1: Decide if each variable below is quantitative, ordinal, or categorical. If the variable is categorical, also decide whether or not
More informationPhysics Fundamentals of Astronomy
Physics 1303.010 Fundamentals of Astronomy Course Information Meeting Place & Time ASU Planetarium (VIN P-02) TR 09:30-10:45 AM Spring 2018 Instructor Dr. Kenneth Carrell Office: VIN 119 Phone: (325) 942-2136
More informationPHY131H1F Class 5. Clicker Question
PHY131H1F Class 5 Today, Chapter 2, Sections 2.5 to 2.7 Freefall Acceleration due to gravity Motion on an inclined plane Differentiating velocity to get acceleration Integrating acceleration to get velocity
More informationFall 2013 Northwestern University STAT Section 23 Introduction to Statistics. Final Exam. Name:
Fall 2013 Northwestern University STAT 202-0 Section 23 Introduction to Statistics Final Exam Name: December 13, 2013 1. Please do not leave blank for any question. 2. There are 15 questions, each question
More informationLecture 1 Intro to Spatial and Temporal Data
Lecture 1 Intro to Spatial and Temporal Data Dennis Sun Stanford University Stats 253 June 22, 2015 1 What is Spatial and Temporal Data? 2 Trend Modeling 3 Omitted Variables 4 Overview of this Class 1
More informationStat 20 Midterm 1 Review
Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets
More informationWeather and Climate AOSC 200. Textbook: Weather: A Consice Introduction by: Gregory Hakim and Jérôme Patou. Tim s Info
Weather and Climate AOSC 200 Tim Canty (tcanty@umd.edu) Hannah Daley (hmdaley@umd.edu) Justin Hicks (jhicks14@umd.edu) Shaun Howe (showe@umd.edu) Lindsey Rodio (larodio@umd.edu) Agniv Sengupta(agnivs@umd.edu)
More informationToday s Menu. Administrativia Two Problems Cutting a Pizza Lighting Rooms
Welcome! L01 Today s Menu Administrativia Two Problems Cutting a Pizza Lighting Rooms Administrativia Course page: https://www.cs.duke.edu/courses/spring13/compsci230/ Who we are: Instructor: TA: UTAs:
More informationMath 147 Lecture Notes: Lecture 12
Math 147 Lecture Notes: Lecture 12 Walter Carlip February, 2018 All generalizations are false, including this one.. Samuel Clemens (aka Mark Twain) (1835-1910) Figures don t lie, but liars do figure. Samuel
More informationMorpheus: Neo, sooner or later you re going to realize, just as I did, that there s a difference between knowing the path, and walking the path.
Morpheus: Neo, sooner or later you re going to realize, just as I did, that there s a difference between knowing the path, and walking the path. Why CS 53? Making linear algebra more concrete. Making it
More informationThe Clickers. PHY131 Summer 2011 Class 2 Notes 5/19/11. Your First Clicker Question!
PHY131H1F Summer Introduction to Physics I Class 2 Today: Error Analysis Significant figures Constant Velocity Motion Constant Acceleration Motion Freefall Motion on an inclined plane Status Light When
More information*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning
STATISTICS 100 EXAM 3 Spring 2016 PRINT NAME (Last name) (First name) *NETID CIRCLE SECTION: Laska MWF L1 Laska Tues/Thurs L2 Robin Tu Write answers in appropriate blanks. When no blanks are provided CIRCLE
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationGEO 448 Plate Tectonics Fall 2014 Syllabus
GEO 448 Plate Tectonics Fall 2014 Syllabus TH 4:00-8:10pm, Gillet Hall 324 Plate Tectonics as a unifying theory: the driving mechanisms of crustal deformation. Evidence supporting sea-floor spreading and
More informationAS102 -The Astronomical Universe. The boring details. AS102 - Major Topics. Day Labs - Rooms B4 & 606. Where are we (earth, sun) in the universe?
AS102 - Major Topics Where are we (earth, sun) in the universe? What are stars? AS102 -The Astronomical Universe Instructor: Professor Tereasa Brainerd TAs: Ren Cashman & Katie Garcia How are stars born,
More informationImportant Dates. Non-instructional days. No classes. College offices closed.
Instructor: Dr. Alexander Krantsberg Email: akrantsberg@nvcc.edu Phone: 703-845-6548 Office: Bisdorf, Room AA 352 Class Time: Tuesdays and Thursdays 7:30 PM - 9:20 PM. Classroom: Bisdorf / AA 467 Office
More informationMachine Learning. Instructor: Pranjal Awasthi
Machine Learning Instructor: Pranjal Awasthi Course Info Requested an SPN and emailed me Wait for Carol Difrancesco to give them out. Not registered and need SPN Email me after class No promises It s a
More information0703C101 General Chemistry I(With Lab)
Summer 2019 0703C101 General Chemistry I(With Lab) Instructor: Stefan Kautsch Time: Monday through Friday (June 17, 2019 - July 19, 2019) Office Hours: 2 hours (according to the teaching schedule) Contact
More informationNicole Dalzell. July 2, 2014
UNIT 1: INTRODUCTION TO DATA LECTURE 3: EDA (CONT.) AND INTRODUCTION TO STATISTICAL INFERENCE VIA SIMULATION STATISTICS 101 Nicole Dalzell July 2, 2014 Teams and Announcements Team1 = Houdan Sai Cui Huanqi
More informationNicole Dalzell. July 3, 2014
UNIT 2: PROBABILITY AND DISTRIBUTIONS LECTURE 1: PROBABILITY AND CONDITIONAL PROBABILITY STATISTICS 101 Nicole Dalzell July 3, 2014 Announcements No team activities today Labs: Individual Write-Ups Statistics
More informationPractice Questions for Exam 1
Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon
More informationOccupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement
Occupy movement - Duke edition Lecture 14: Large sample inference for proportions Statistics 101 Mine Çetinkaya-Rundel October 20, 2011 On Tuesday we asked you about how closely you re following the news
More informationStatistics 301: Probability and Statistics Introduction to Statistics Module
Statistics 301: Probability and Statistics Introduction to Statistics Module 1 2018 Introduction to Statistics Statistics is a science, not a branch of mathematics, but uses mathematical models as essential
More informationWEST LOS ANGELES COLLEGE Introduction to General Chemistry CHEMISTRY 60 SYLLABUS; 5 units
WEST LOS ANGELES COLLEGE Introduction to General Chemistry CHEMISTRY 60 SYLLABUS; 5 units SPRING 2014; Section 3492 Instructor Information Instructor E-mail Lecture Room MSA 005 Laboratory Session Room
More informationGEOG 100E Introduction to Geography (5 credits)
Bellevue College GEOG 100E Introduction to Geography (5 credits) Fall, 2010 5:30 7:40 pm T, Th Room D273 Instructor: Charles Chip Dodd Phone: 425 564-2499 E-mail: cdodd@shoreline.edu Office Hours: 4:30
More informationModule 03 Lecture 14 Inferential Statistics ANOVA and TOI
Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module
More informationTimes/Room Friday 9:00 pm 3:00 pm Room B225 (lecture and laboratory) Course Semester Credit Total Course hours (lecture & lab)
HOUSTON COMMUNITY COLLEGE NORTHWEST (ALIEF CAMPUS) COURSE OUTLINE FOR CHEM 1411 GENERAL CHEMISTRY I Class Number 13095 / Spring 2017 Discipline/Program Course Level Course Title Chemistry First Year (Freshman)
More informationSection 2.1 ~ Data Types and Levels of Measurement. Introduction to Probability and Statistics Spring 2017
Section 2.1 ~ Data Types and Levels of Measurement Introduction to Probability and Statistics Spring 2017 Objective To be able to classify data as qualitative or quantitative, to identify quantitative
More informationWhy should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4.
I. Probability basics (Sections 4.1 and 4.2) Flip a fair (probability of HEADS is 1/2) coin ten times. What is the probability of getting exactly 5 HEADS? What is the probability of getting exactly 10
More informationCHEMISTRY 210 SYLLABUS Spring 2007 General Chemistry II
CHEMISTRY 210 SYLLABUS Spring 2007 General Chemistry II Dr. Craig P. Jasperse Office: Hagen 407J email: jasperse@mnstate.edu Office Hours: Mon 10:3011:30, 3:305:30 Telephone: 4772230 Tues 9:3010:30, 2:305:30
More informationGEOLOGY 10 De Anza College
GEOLOGY 0 De Anza College Marek Cichanski Office: S-a CLASS WEBSITE: TEXTBOOKS & MATERIALS Sections 0,0,0 Lectures: M - Th, 0:0-:0am Labs: T, W, Th, :0-:0pm Office hours: M thru Th :0am-:0pm; Friday 9:0am-0:0am
More informationSTAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables
STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences Random Variables Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University
More informationGEOLOGY 10 De Anza College
GEOLOGY 0 De Anza College Marek Cichanski Office: S-5a CLASS WEBSITE: TEXTBOOKS & MATERIALS Sections 0,02,0 Lectures: M - Th, 0:0-:20am Labs: T, W, Th, :0-4:20pm Office hours: M thru Th :0am-2:20pm; other
More informationPre-AP Chemistry Dr. Brighid Corcoran, Ph.D. Lake Dallas High School Classroom Policies and Procedures
Pre-AP Chemistry 2016-2017 Dr. Brighid Corcoran, Ph.D. Lake Dallas High School Classroom Policies and Procedures The following topics were covered with your child during class: Attendance/Make-Up Procedure
More information- a value calculated or derived from the data.
Descriptive statistics: Note: I'm assuming you know some basics. If you don't, please read chapter 1 on your own. It's pretty easy material, and it gives you a good background as to why we need statistics.
More informationChemistry 883 Computational Quantum Chemistry
Chemistry 883 Computational Quantum Chemistry Instructor Contact Information Professor Benjamin G. Levine levine@chemistry.msu.edu 215 Chemistry Building 517-353-1113 Office Hours Tuesday 9:00-11:00 am
More informationUnless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Spring 2015 Instructor: Martin Farnham Unless provided with information to the contrary,
More informationGEOG 508 GEOGRAPHIC INFORMATION SYSTEMS I KANSAS STATE UNIVERSITY DEPARTMENT OF GEOGRAPHY FALL SEMESTER, 2002
GEOG 508 GEOGRAPHIC INFORMATION SYSTEMS I KANSAS STATE UNIVERSITY DEPARTMENT OF GEOGRAPHY FALL SEMESTER, 2002 Course Reference #: 13210 Meeting Time: TU 2:05pm - 3:20 pm Meeting Place: Ackert 221 Remote
More informationAstro 115: Introduction to Astronomy. About Me. On your survey paper, take 3 minutes to answer the following:
1/25/17 Astro 115: Introduction to Astronomy About Me My office is in Science 356 Best way to contact me is email jfielder@sfsu.edu put Astro 115-03 (for TTh) or Astro 115-05 (for MWF) in the subject line
More informationdownload instant at
Answers to Odd-Numbered Exercises Chapter One: An Overview of Regression Analysis 1-3. (a) Positive, (b) negative, (c) positive, (d) negative, (e) ambiguous, (f) negative. 1-5. (a) The coefficients in
More informationPHYSICAL GEOLOGY SYLLABUS GLY 1010 Physical Geology Fall
PHYSICAL GEOLOGY SYLLABUS GLY 1010 Physical Geology Fall - 2011 Instructor s Name: Marianne O Neal Caldwell Telephone Number: (813) 253-7251 Email Address: Please use the email in our online classroom.
More informationUniversity of Alaska Fairbanks Course Syllabus Chemistry 103: Basic General Chemistry
University of Alaska Fairbanks Course Syllabus Chemistry 103: Basic General Chemistry Course: Basic General Chemistry (CHEM 103); 4 credits Prerequisites: Placement in ENGL 111 and DEVM 105 or, alternatively,
More informationPhysics 211 (Spring 2016) (4 Credit Hours) General Physics II. Syllabus available on BlackBoard under Course information
Physics 211 (Spring 2016) (4 Credit Hours) General Physics II Syllabus available on BlackBoard http://webcourses.niu.edu/ under Course information Name: Prof. Omar Chmaissem (sha-my-sim) (You can call
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationSYLLABUS 8/28/2017. SYLLABUS: Readings and Reference. SYLLABUS: Objectives. SYLLABUS: Grading. GEOG 247 Cultural Geography
SYLLABUS GEOG 27 Cultural Geography Prof. Anthony Grande Hunter College-CUNY AFG 2017 Lecture design, content and presentation AFG 0617. Individual images and illustrations may be subject to prior copyright.
More information