Statistics in Practice Forensic Science

Size: px
Start display at page:

Download "Statistics in Practice Forensic Science"

Transcription

1 Statistics in Practice Forensic Science Dr. David Lucy Lancaster University Statistics in Practice p.1/36

2 Forensic Science Criminal evidence becoming increasingly scientific. Greater use of trace evidence (paint/glass/fibres). DNA revolution. The rise of DNA was coincident with a greater awareness on the part of courts that observations are subject to uncertainty. Statistics in Practice p.2/36

3 Forensic Science Greater realisation that uncertainty is important has lead to: Trace evidence (glass/paint/fibres) being treated statistically. More evidence types: common observations - shoe types - facial features - all being treated with some form of statistical method. observation of co-incidence of treatment in cases where carers are suspected of harming their charges. Statistics in Practice p.3/36

4 Forensic Consultancies Typically: Police. Customs and Excise. Criminal Defence lawyers. Start with an approach by one of the above. Usually concludes with the submission of a statistical report. Rarely concludes with a court appearance. Statistics in Practice p.4/36

5 Case work - Tinley The facts were: Evening of 9th April 2004 Andrew Tinley was assaulted by his partner Sally Rose. He picked up a champagne bottle and struck her on the head. Rose died at the scene of the incident. Statistics in Practice p.5/36

6 Case work - Tinley Under interrogation: Tinley said he had struck Rose twice with the bottle. The pathologist could find no evidence for two strikes. A double blow is more incriminating than a single - obviously of interest to the court The question was: what is the probability of administering two blows with a champagne bottle and leaving only a single wound? Statistics in Practice p.6/36

7 Case work - Tinley r 1 r 3 r 2 If: 1. r 1 is the radius head. 2. r 2 is the radius of the area of the wound. 3. r 3 is the radius of the area of the implement. Statistics in Practice p.7/36

8 Case work - Tinley Pr = 2π(r 2 r 3 ) 2 2π(r 1 r 3 ) 2 Given the values for r in the pathologists report gives a probability of about 8%. Only a guide for the court. Could spend a lot of time working out a more exact value. Limits of knowledge given by Tinley s account. Statistics in Practice p.8/36

9 Firearm rifling patterns Two firearms offences committed: The first incident was the shooting of a man in an Ulster town in June The second was a shooting of a man in the same town in March Both incidents featured a 0.32 calibre revolver with a 5-right rifling pattern. What is the evidential value of the match? Statistics in Practice p.9/36

10 5-right 0.32 calibre Statistics in Practice p.10/36

11 Firearm rifling patterns A suspect had been located. That suspect was been found to possess a firearm with a five-right rifling pattern. Start with two propositions 1. H p is that two firearms offences employed the same weapon, and that weapon is that found in the possession of the suspect. 2. H d is that the firearms used in the two offences were different weapons to that of the suspect. Statistics in Practice p.11/36

12 Likelihood ratios A standard measure of evidential value is a likelihood ratio Let: E the firearm used was of calibre 0.32 and rifling pattern 5 right. Then: LR = Pr(E H p,i) Pr(E H d,i) where I a firearm has been used in the commission of the offences. Statistics in Practice p.12/36

13 Numerator The numerator is: Pr(E H p,i). What is the probability of observing 5-right, 0.32 calibre, were the firearm used that of the suspect. The suspect has only one firearm. Some other individual may have used the suspect s firearm - should be from defence case if so. The probability is quite high 1 Statistics in Practice p.13/36

14 Denominator The denominator is: Pr(E H d,i). What is the probability of observing 5-right, 0.32 calibre, were the firearm used some other firearm other than that of the suspect. This is proportional to the frequency of 5-right, 0.32 calibre, firearms from the population of illegally held firearms. Need for data - supplied by investigators. Statistics in Practice p.14/36

15 Denominator From data supplied of observations from a suitable sample of weapons recovered There are 716 illegal firearms known to the firearms intelligence branch 4 were revolvers with right handed rifling of 5 grooves, and were of 0.32 calibre. The likelihood ratio is 716/4 = 179 Statistics in Practice p.15/36

16 Likelihood ratio The likelihood ratio is 716/4 = 179 can be interpreted: The observation of 0.32 calibre, and 5-right rifling, is 179 more likely were the weapon used in these offences that of the suspect rather than any other weapon from the population of illicit firearms. This does not take into account the fact that there were two scenes - should it be Should I have done so? - some disquiet about using the higher figure. Statistics in Practice p.16/36

17 How incriminating Evett et al. (2000) give the following table: likelihood ratio verbal equivalent 1 < LR 10 limited support for H p 10 < LR 100 moderate support for H p 100 < LR 1000 moderately strong support for H p 1000 < LR strong support for H p < LR very strong support for H p 179 is in the middle of this range - thus implying moderately strong support for H p. Statistics in Practice p.17/36

18 Case work The comparison problem is the archetypal forensic problem: where a fragment of an item found to be associated with a suspect, compared to an item known to be associated with an offence, found to match in some sense. What is the evidential value of that match. Statistics in Practice p.18/36

19 Glass Surprisingly common material of forensic interest Criminal activity often includes glass breakage. Shards scattered over area, including offenders. To what extent do the observations from the fragments of glass found upon a suspect suggest that some, or all, of those fragments came from the crimescene glass? Statistics in Practice p.19/36

20 Glass Statistics in Practice p.20/36

21 Glass Statistics in Practice p.21/36

22 Glass With glass the observables tend to be: Refractive index. Major, minor and trace element measurements. Isotopic measurements - not attempted yet - should be possible. Statistics in Practice p.22/36

23 Continuous variables For continuous variables there are a few problems: Within source variation. Leads to some conceptual confusion - no such thing as an absolute match. Repeated observations of the same object are unlikely to give exactly the same observations - similarity not as simple as for shoe types. Statistics in Practice p.23/36

24 Continuous variables Statistics in Practice p.24/36

25 Traditional evaluation Traditional evidence evaluation methods include: 1. nσ rules - where if the means between the fragments for all elements are within nσ of each other then a match is declared - frighteningly bad. 2. multiple t testing - variables are t tested between fragments - poor. 3. eyeball - scatterplots of pairwise elements are examined by eye. The really hardened scientist examines principle component plots. Each and every one of these is unfounded, and none answers the question. Statistics in Practice p.25/36

26 Proximity based measures These only take into account the proximity of one object to another. Proximity is not the same, and cannot be easily equated to identity. Does not calculate a weight of evidence within a propositional framework. It can: be used to generate the best guess from limited number of entities, more tenuously be used to exclude. Statistics in Practice p.26/36

27 The fallacy of proximity An easily conceived illustration of this: A friend collects coloured marbles - stores them in bags. each bag has marbles of only one colour. there may be more than one bag containing marbles of each colour. A marble has dropped out of a bag - the marble is red - you select a bag and sample a marble - that marble is red. To what extent does the observation that both marbles are red support the notion that the marble came from that particular bag? Statistics in Practice p.27/36

28 The fallacy of proximity Statistics in Practice p.28/36

29 The fallacy of proximity? Statistics in Practice p.29/36

30 The fallacy of proximity Similarity of observation on its own: Gives no idea as to identity. Dissimilarity of observation: Can be used to reject in cases where observation is unambiguous. Does not apply in any logical manner where observations are continuous. Need knowledge of population to make any legitimate probabilistic inference about identity of source. Statistics in Practice p.30/36

31 Lindley s formulation X sample of m measurements from crimescene, Y sample of n measurements of suspect properties. X N(θ 1,σ 2 /m), Y N(θ 2,σ 2 /n). Under Under H p : θ 1 = θ 2 = θ, under H d : θ 1 θ 2. LR = Pr(X θ)pr(y θ)pr(θ)dθ Pr(X θ1 )Pr(θ 1 )dθ 1 Pr(Y θ2 )Pr(θ 2 )dθ 2 Statistics in Practice p.31/36

32 Lindley s formulation LR = σ 1σ 2 aσσ 3 exp { (X Y)2 τ 2 } a 2 σ 2 (σ1 2 +σ2 2 }{{ ) } exp { (W µ)2 ) 2σ (Z µ)2 (σ1 2 +σ2 2 ) } 2σ 2 1 σ2 2 }{{} proximity rarity where: X mean of observations from object 1 σ1 2 = τ2 +σ 2 /m Y mean of observations from object 2 σ2 2 = τ2 +σ 2 /n m number of observations from object 1 σ3 2 = τ2 +σ 2 /(m+n) n number of observations from object 2 Z = (σ2 2X +σ2 1 Y)/(σ2 1 +σ2 2 ) σ 2 within object variance W = (mx +ny)/(m+n) µ population mean a 2 = 1/m+1/n τ 2 population variance Statistics in Practice p.32/36

33 Generalisations Lindley s formulation of the comparison problem has now: extended to the multivariate case, for high multivariate spaces DAGs used, non-normal between item means using 1. kernel density estimates 2. log-concave estimation with structural zeros This is, at the moment, in constant development Statistics in Practice p.33/36

34 Future work There are a number of outstanding problems: 1. isotopic systems tend to be compositional - what about multiple isotopic systems? 2. dimensionality - curse of - lots of dimensions - few observations. Comparison problems, at the moment, are developing quickly Statistics in Practice p.34/36

35 Professional forensic statistics Intellectually forensic statistics is of great interest: Most have distinct features. Many revolve around comparison problems. Academia one of the best occupations for those involved in this sort of work: Quite often your conclusions favour the side who has not employed you. You can become quite unpopular with both sides. Statistics in Practice p.35/36

36 Professional forensic statistics A career in forensic statistics: has variety of challenging problems. Is in a growing field, unfortunately the main employer, FSS, is being shut down. Commercial forensic science providers are unlikely to employ statisticians. But: Is difficult to operate in, particularly independently. Really need institutional support. Statistics in Practice p.36/36

arxiv: v1 [stat.ap] 27 Mar 2015

arxiv: v1 [stat.ap] 27 Mar 2015 Submitted to the Annals of Applied Statistics A NOTE ON THE SPECIFIC SOURCE IDENTIFICATION PROBLEM IN FORENSIC SCIENCE IN THE PRESENCE OF UNCERTAINTY ABOUT THE BACKGROUND POPULATION By Danica M. Ommen,

More information

The Forensic Laboratory. Forensic Science

The Forensic Laboratory. Forensic Science The Forensic Laboratory Forensic Science 1 Introduction The forensic laboratory is where the essence of forensic science takes place, with one objective - to deduce all of what is possible from evidence.

More information

A Two-Level Model for Evidence Evaluation

A Two-Level Model for Evidence Evaluation Aitken, CGG. Zadora, G & Lucy, D. (2007) A Two-Level Model for Evidence Evaluation. Journal of Forensic Sciences. 52(2); 412-419. A Two-Level Model for Evidence Evaluation Colin G.G. Aitken, 1 Ph.D.; Grzegorz

More information

Bayesian Inference. STA 121: Regression Analysis Artin Armagan

Bayesian Inference. STA 121: Regression Analysis Artin Armagan Bayesian Inference STA 121: Regression Analysis Artin Armagan Bayes Rule...s! Reverend Thomas Bayes Posterior Prior p(θ y) = p(y θ)p(θ)/p(y) Likelihood - Sampling Distribution Normalizing Constant: p(y

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Forensic Science Chapter 3 Notes

Forensic Science Chapter 3 Notes Key Vocabulary: Class Characteristics Comparison Identification Individual Characteristics Product Rule Learning Objectives: Forensic Science Chapter 3 Notes 1. Review the common types of physical evidence

More information

Reporting LRs - principles

Reporting LRs - principles Simone Gittelson Ian Evett Reporting LRs - principles Tacha Hicks-Champod Franco Taroni DNA: http://www.formation-continueunil-epfl.ch/en/essentials-dnainterpretation 1 Steven Myers Tim Kalafut Gittelson

More information

P Q (P Q) (P Q) (P Q) (P % Q) T T T T T T T F F T F F F T F T T T F F F F T T

P Q (P Q) (P Q) (P Q) (P % Q) T T T T T T T F F T F F F T F T T T F F F F T T Logic and Reasoning Final Exam Practice Fall 2017 Name Section Number The final examination is worth 100 points. 1. (10 points) What is an argument? Explain what is meant when one says that logic is the

More information

The Island Problem Revisited

The Island Problem Revisited The Island Problem Revisited Halvor Mehlum 1 Department of Economics, University of Oslo, Norway E-mail: halvormehlum@econuiono March 10, 2009 1 While carrying out this research, the author has been associated

More information

Two-stage approach for the inference of the source of high-dimension and complex chemical data in forensic science

Two-stage approach for the inference of the source of high-dimension and complex chemical data in forensic science Two-stage approach for the inference of the source of high-dimension and complex chemical data in forensic science arxiv:1804.01049v2 [stat.me] 12 Apr 2018 M.A. Ausdemore, M.Sc. and C. Neumann, Ph.D. and

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Experimental Design and Graphical Analysis of Data

Experimental Design and Graphical Analysis of Data Experimental Design and Graphical Analysis of Data A. Designing a controlled experiment When scientists set up experiments they often attempt to determine how a given variable affects another variable.

More information

Introduction to Hugin.

Introduction to Hugin. Introduction to Hugin. Hugin is a commercial software package which allows the user to represent Bayesian networks using a relatively straightforward set of graphical tools. Hugin is also available in

More information

Journal of Forensic Sciences. Information-Theoretical Assessment of the Performance of Likelihood Ratio Computation Methods

Journal of Forensic Sciences. Information-Theoretical Assessment of the Performance of Likelihood Ratio Computation Methods Information-Theoretical Assessment of the Performance of Likelihood Ratio Computation Methods Journal: Manuscript ID: JOFS--.R Manuscript Type: Paper Date Submitted by the Author: n/a Complete List of

More information

When 'neutral' evidence still has probative value (with implications from the Barry George Case).

When 'neutral' evidence still has probative value (with implications from the Barry George Case). When 'neutral' evidence still has probative value (with implications from the Barry George Case). Fenton, N; Berger, D; Lagnado, D; Neil, M; Hsu, A For additional information about this publication click

More information

Introduction to Machine Learning. Lecture 2

Introduction to Machine Learning. Lecture 2 Introduction to Machine Learning Lecturer: Eran Halperin Lecture 2 Fall Semester Scribe: Yishay Mansour Some of the material was not presented in class (and is marked with a side line) and is given for

More information

System Identification, Lecture 4

System Identification, Lecture 4 System Identification, Lecture 4 Kristiaan Pelckmans (IT/UU, 2338) Course code: 1RT880, Report code: 61800 - Spring 2016 F, FRI Uppsala University, Information Technology 13 April 2016 SI-2016 K. Pelckmans

More information

Forensic Genetics Module. Evaluating DNA evidence

Forensic Genetics Module. Evaluating DNA evidence Summer Institute in Statistical Genetics University of Washington, Seattle Forensic Genetics Module Evaluating DNA evidence David Balding Professor of Statistical Genetics University of Melbourne and UCL

More information

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet. Stat 535 C - Statistical Computing & Monte Carlo Methods Arnaud Doucet Email: arnaud@cs.ubc.ca 1 CS students: don t forget to re-register in CS-535D. Even if you just audit this course, please do register.

More information

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1 Parameter Estimation William H. Jefferys University of Texas at Austin bill@bayesrules.net Parameter Estimation 7/26/05 1 Elements of Inference Inference problems contain two indispensable elements: Data

More information

Law and Science. Developments in the law confronting the new challenges of scientific evidence. 108 HARV. L. REV. 1482, 1484 (1995).

Law and Science. Developments in the law confronting the new challenges of scientific evidence. 108 HARV. L. REV. 1482, 1484 (1995). Law and Science Law is a normative pursuit that seeks to define how public and private relations should function In contrast science is a descriptive pursuit which does not define how the universe should

More information

Chapter 4. Bayesian inference. 4.1 Estimation. Point estimates. Interval estimates

Chapter 4. Bayesian inference. 4.1 Estimation. Point estimates. Interval estimates Chapter 4 Bayesian inference The posterior distribution π(θ x) summarises all our information about θ to date. However, sometimes it is helpful to reduce this distribution to a few key summary measures.

More information

BetaZi SCIENCE AN OVERVIEW OF PHYSIO-STATISTICS FOR PRODUCTION FORECASTING. EVOLVE, ALWAYS That s geologic. betazi.com. geologic.

BetaZi SCIENCE AN OVERVIEW OF PHYSIO-STATISTICS FOR PRODUCTION FORECASTING. EVOLVE, ALWAYS That s geologic. betazi.com. geologic. BetaZi SCIENCE AN OVERVIEW OF PHYSIO-STATISTICS FOR PRODUCTION FORECASTING EVOLVE, ALWAYS That s geologic betazi.com geologic.com Introduction Predictive Analytics is the science of using past facts to

More information

CHEMICAL AND EXPLOSIVE TERRORISM

CHEMICAL AND EXPLOSIVE TERRORISM CHEMICAL AND EXPLOSIVE TERRORISM THE THREAT OF CHEMICAL AND EXPLOSIVE TERRORISM The use of chemical and explosive materials by criminals and terrorist groups poses a significant threat today and all regions

More information

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence A Rejoinder to Mackintosh and some Remarks on the Concept of General Intelligence Moritz Heene Department of Psychology, Ludwig Maximilian University, Munich, Germany. 1 Abstract In 2000 Nicholas J. Mackintosh

More information

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over Point estimation Suppose we are interested in the value of a parameter θ, for example the unknown bias of a coin. We have already seen how one may use the Bayesian method to reason about θ; namely, we

More information

System Identification, Lecture 4

System Identification, Lecture 4 System Identification, Lecture 4 Kristiaan Pelckmans (IT/UU, 2338) Course code: 1RT880, Report code: 61800 - Spring 2012 F, FRI Uppsala University, Information Technology 30 Januari 2012 SI-2012 K. Pelckmans

More information

MACHINE LEARNING INTRODUCTION: STRING CLASSIFICATION

MACHINE LEARNING INTRODUCTION: STRING CLASSIFICATION MACHINE LEARNING INTRODUCTION: STRING CLASSIFICATION THOMAS MAILUND Machine learning means different things to different people, and there is no general agreed upon core set of algorithms that must be

More information

Belton Police Department

Belton Police Department Belton Police Department Uniform Crime Report for June 2013 RETURN-A - MONTHLY RETURN OF OFFENSES KNOWN TO THE POLICE 1 Classification of offenses 1 Criminal Homicide A Murder/nonnegligent homicide 2 Offenses

More information

Introduction 1. STA442/2101 Fall See last slide for copyright information. 1 / 33

Introduction 1. STA442/2101 Fall See last slide for copyright information. 1 / 33 Introduction 1 STA442/2101 Fall 2016 1 See last slide for copyright information. 1 / 33 Background Reading Optional Chapter 1 of Linear models with R Chapter 1 of Davison s Statistical models: Data, and

More information

Role of Nuclear Forensics in Preventing N- Terrorism

Role of Nuclear Forensics in Preventing N- Terrorism Role of Nuclear Forensics in Preventing N- Terrorism March - April 2011 By Dr.Rukmani Krishnamurthy & Nisha Menon (About the authors: Dr. Rukmani Krishnamurthy is Technical Advisor, Institute of Forensic

More information

AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-12 NATIONAL SECURITY COMPLEX

AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-12 NATIONAL SECURITY COMPLEX AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-1 NATIONAL SECURITY COMPLEX J. A. Mullens, J. K. Mattingly, L. G. Chiang, R. B. Oberer, J. T. Mihalczo ABSTRACT This paper describes a template matching

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Energy Transformations IDS 101

Energy Transformations IDS 101 Energy Transformations IDS 101 It is difficult to design experiments that reveal what something is. As a result, scientists often define things in terms of what something does, what something did, or what

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

Expectation of geometric distribution

Expectation of geometric distribution Expectation of geometric distribution What is the probability that X is finite? Can now compute E(X): Σ k=1f X (k) = Σ k=1(1 p) k 1 p = pσ j=0(1 p) j = p 1 1 (1 p) = 1 E(X) = Σ k=1k (1 p) k 1 p = p [ Σ

More information

Applied Bayesian Statistics

Applied Bayesian Statistics Applied Bayesian Statistics David Spiegelhalter Statistical Laboratory, Cambridge with thanks to: Nicky Best Dave Lunn Andrew Thomas Course: Lent Term 2010 Course Summary (provisional) Jan 18 Lect 1: Probability

More information

A Introduction to Matrix Algebra and the Multivariate Normal Distribution

A Introduction to Matrix Algebra and the Multivariate Normal Distribution A Introduction to Matrix Algebra and the Multivariate Normal Distribution PRE 905: Multivariate Analysis Spring 2014 Lecture 6 PRE 905: Lecture 7 Matrix Algebra and the MVN Distribution Today s Class An

More information

Likelihood and Fairness in Multidimensional Item Response Theory

Likelihood and Fairness in Multidimensional Item Response Theory Likelihood and Fairness in Multidimensional Item Response Theory or What I Thought About On My Holidays Giles Hooker and Matthew Finkelman Cornell University, February 27, 2008 Item Response Theory Educational

More information

DD Advanced Machine Learning

DD Advanced Machine Learning Modelling Carl Henrik {chek}@csc.kth.se Royal Institute of Technology November 4, 2015 Who do I think you are? Mathematically competent linear algebra multivariate calculus Ok programmers Able to extend

More information

Experiment 2 Random Error and Basic Statistics

Experiment 2 Random Error and Basic Statistics PHY191 Experiment 2: Random Error and Basic Statistics 7/12/2011 Page 1 Experiment 2 Random Error and Basic Statistics Homework 2: turn in the second week of the experiment. This is a difficult homework

More information

Applied Bayesian Statistics

Applied Bayesian Statistics Applied Bayesian Statistics Course Summary (provisional) 10 lectures and 6 practicals David Spiegelhalter Statistical Laboratory, Cambridge Lecture 1: Probability and Bayes theorem for discrete observables

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Principles of Statistical Inference Recap of statistical models Statistical inference (frequentist) Parametric vs. semiparametric

More information

CSC321 Lecture 18: Learning Probabilistic Models

CSC321 Lecture 18: Learning Probabilistic Models CSC321 Lecture 18: Learning Probabilistic Models Roger Grosse Roger Grosse CSC321 Lecture 18: Learning Probabilistic Models 1 / 25 Overview So far in this course: mainly supervised learning Language modeling

More information

Experiment 2 Random Error and Basic Statistics

Experiment 2 Random Error and Basic Statistics PHY9 Experiment 2: Random Error and Basic Statistics 8/5/2006 Page Experiment 2 Random Error and Basic Statistics Homework 2: Turn in at start of experiment. Readings: Taylor chapter 4: introduction, sections

More information

Introduction to Bayesian Statistics

Introduction to Bayesian Statistics School of Computing & Communication, UTS January, 207 Random variables Pre-university: A number is just a fixed value. When we talk about probabilities: When X is a continuous random variable, it has a

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

Discrete Distributions

Discrete Distributions Discrete Distributions STA 281 Fall 2011 1 Introduction Previously we defined a random variable to be an experiment with numerical outcomes. Often different random variables are related in that they have

More information

ITWG - A Platform for International Cooperation in Nuclear Forensics

ITWG - A Platform for International Cooperation in Nuclear Forensics ITWG - A Platform for International Cooperation in Nuclear Forensics David K. Smith, Klaus Mayer, Tamas Biro, Bernard Chartier, Bruno Jouniaux, Paul Thompson, Carey Larsson, Michael Kristo, and Richard

More information

Part III: Unstructured Data. Lecture timetable. Analysis of data. Data Retrieval: III.1 Unstructured data and data retrieval

Part III: Unstructured Data. Lecture timetable. Analysis of data. Data Retrieval: III.1 Unstructured data and data retrieval Inf1-DA 2010 20 III: 28 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis

More information

Bayesian Learning (II)

Bayesian Learning (II) Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Bayesian Learning (II) Niels Landwehr Overview Probabilities, expected values, variance Basic concepts of Bayesian learning MAP

More information

Bayesian Computation

Bayesian Computation Bayesian Computation CAS Centennial Celebration and Annual Meeting New York, NY November 10, 2014 Brian M. Hartman, PhD ASA Assistant Professor of Actuarial Science University of Connecticut CAS Antitrust

More information

LECTURE NOTES by DR. J.S.V.R. KRISHNA PRASAD

LECTURE NOTES by DR. J.S.V.R. KRISHNA PRASAD .0 Introduction: The theory of probability has its origin in the games of chance related to gambling such as tossing of a coin, throwing of a die, drawing cards from a pack of cards etc. Jerame Cardon,

More information

PMR Learning as Inference

PMR Learning as Inference Outline PMR Learning as Inference Probabilistic Modelling and Reasoning Amos Storkey Modelling 2 The Exponential Family 3 Bayesian Sets School of Informatics, University of Edinburgh Amos Storkey PMR Learning

More information

Confidence Intervals and Hypothesis Tests

Confidence Intervals and Hypothesis Tests Confidence Intervals and Hypothesis Tests STA 281 Fall 2011 1 Background The central limit theorem provides a very powerful tool for determining the distribution of sample means for large sample sizes.

More information

Overview. Probabilistic Interpretation of Linear Regression Maximum Likelihood Estimation Bayesian Estimation MAP Estimation

Overview. Probabilistic Interpretation of Linear Regression Maximum Likelihood Estimation Bayesian Estimation MAP Estimation Overview Probabilistic Interpretation of Linear Regression Maximum Likelihood Estimation Bayesian Estimation MAP Estimation Probabilistic Interpretation: Linear Regression Assume output y is generated

More information

1 Using standard errors when comparing estimated values

1 Using standard errors when comparing estimated values MLPR Assignment Part : General comments Below are comments on some recurring issues I came across when marking the second part of the assignment, which I thought it would help to explain in more detail

More information

Regression M&M 2.3 and 10. Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables

Regression M&M 2.3 and 10. Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables MALES FEMALES Age. Tot. %-ile; weight,g Tot. %-ile; weight,g wk N. 0th 50th 90th No.

More information

Uni- and Bivariate Power

Uni- and Bivariate Power Uni- and Bivariate Power Copyright 2002, 2014, J. Toby Mordkoff Note that the relationship between risk and power is unidirectional. Power depends on risk, but risk is completely independent of power.

More information

10.3 Random variables and their expectation 301

10.3 Random variables and their expectation 301 10.3 Random variables and their expectation 301 9. For simplicity, assume that the probabilities of the birth of a boy and of a girl are the same (which is not quite so in reality). For a certain family,

More information

Expectation of geometric distribution. Variance and Standard Deviation. Variance: Examples

Expectation of geometric distribution. Variance and Standard Deviation. Variance: Examples Expectation of geometric distribution Variance and Standard Deviation What is the probability that X is finite? Can now compute E(X): Σ k=f X (k) = Σ k=( p) k p = pσ j=0( p) j = p ( p) = E(X) = Σ k=k (

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

COS 424: Interacting with Data. Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007

COS 424: Interacting with Data. Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007 COS 424: Interacting ith Data Lecturer: Rob Schapire Lecture #15 Scribe: Haipeng Zheng April 5, 2007 Recapitulation of Last Lecture In linear regression, e need to avoid adding too much richness to the

More information

Model comparison. Christopher A. Sims Princeton University October 18, 2016

Model comparison. Christopher A. Sims Princeton University October 18, 2016 ECO 513 Fall 2008 Model comparison Christopher A. Sims Princeton University sims@princeton.edu October 18, 2016 c 2016 by Christopher A. Sims. This document may be reproduced for educational and research

More information

AQA GCSE Physics. 60 minutes. 60 marks. Q1 to Q4 to be worked through with tutor. Q5 to Q8 to be worked through independently.

AQA GCSE Physics. 60 minutes. 60 marks. Q1 to Q4 to be worked through with tutor. Q5 to Q8 to be worked through independently. AQA GCSE Physics Electricity 4.2.5: Static Electricity & Electric Fields Name: Class: Date: Time: 60 minutes Marks: 60 marks Comments: Q to Q4 to be worked through with tutor. Q5 to Q8 to be worked through

More information

Bayesian Inference (A Rough Guide)

Bayesian Inference (A Rough Guide) Sigmedia, Electronic Engineering Dept., Trinity College, Dublin. 1 Bayesian Inference (A Rough Guide) Anil C. Kokaram anil.kokaram@tcd.ie Electrical and Electronic Engineering Dept., University of Dublin,

More information

Hydraulics Prof Dr Arup Kumar Sarma Department of Civil Engineering Indian Institute of Technology, Guwahati

Hydraulics Prof Dr Arup Kumar Sarma Department of Civil Engineering Indian Institute of Technology, Guwahati Hydraulics Prof Dr Arup Kumar Sarma Department of Civil Engineering Indian Institute of Technology, Guwahati Module No # 08 Pipe Flow Lecture No # 04 Pipe Network Analysis Friends, today we will be starting

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing

More information

(Refer Slide Time: 02:20)

(Refer Slide Time: 02:20) Discrete Mathematical Structures Dr. Kamala Krithivasan Department of Computer Science and Engineering Indian Institute of Technology, Madras Lecture - 5 Logical Inference In the last class we saw about

More information

Principal Components Analysis

Principal Components Analysis Principal Components Analysis Lecture 9 August 2, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #9-8/2/2011 Slide 1 of 54 Today s Lecture Principal Components Analysis

More information

Robotics. Lecture 4: Probabilistic Robotics. See course website for up to date information.

Robotics. Lecture 4: Probabilistic Robotics. See course website   for up to date information. Robotics Lecture 4: Probabilistic Robotics See course website http://www.doc.ic.ac.uk/~ajd/robotics/ for up to date information. Andrew Davison Department of Computing Imperial College London Review: Sensors

More information

MATH2206 Prob Stat/20.Jan Weekly Review 1-2

MATH2206 Prob Stat/20.Jan Weekly Review 1-2 MATH2206 Prob Stat/20.Jan.2017 Weekly Review 1-2 This week I explained the idea behind the formula of the well-known statistic standard deviation so that it is clear now why it is a measure of dispersion

More information

Assessing Risks. Tom Carter. tom/sfi-csss. Complex Systems

Assessing Risks. Tom Carter.   tom/sfi-csss. Complex Systems Assessing Risks Tom Carter http://cogs.csustan.edu/ tom/sfi-csss Complex Systems July, 2002 Our general topics: Assessing risks Using Bayes Theorem The doomsday argument References 2 The quotes Science

More information

Poisson regression: Further topics

Poisson regression: Further topics Poisson regression: Further topics April 21 Overdispersion One of the defining characteristics of Poisson regression is its lack of a scale parameter: E(Y ) = Var(Y ), and no parameter is available to

More information

10. Exchangeability and hierarchical models Objective. Recommended reading

10. Exchangeability and hierarchical models Objective. Recommended reading 10. Exchangeability and hierarchical models Objective Introduce exchangeability and its relation to Bayesian hierarchical models. Show how to fit such models using fully and empirical Bayesian methods.

More information

Euclidean Geometry. The Elements of Mathematics

Euclidean Geometry. The Elements of Mathematics Euclidean Geometry The Elements of Mathematics Euclid, We Hardly Knew Ye Born around 300 BCE in Alexandria, Egypt We really know almost nothing else about his personal life Taught students in mathematics

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

X = X X n, + X 2

X = X X n, + X 2 CS 70 Discrete Mathematics for CS Fall 2003 Wagner Lecture 22 Variance Question: At each time step, I flip a fair coin. If it comes up Heads, I walk one step to the right; if it comes up Tails, I walk

More information

Nonlinear Regression

Nonlinear Regression Nonlinear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Nonlinear Regression 1 / 36 Nonlinear Regression 1 Introduction

More information

Topic 17: Simple Hypotheses

Topic 17: Simple Hypotheses Topic 17: November, 2011 1 Overview and Terminology Statistical hypothesis testing is designed to address the question: Do the data provide sufficient evidence to conclude that we must depart from our

More information

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size. Descriptive Statistics: One of the most important things we can do is to describe our data. Some of this can be done graphically (you should be familiar with histograms, boxplots, scatter plots and so

More information

Survey of JPEG compression history analysis

Survey of JPEG compression history analysis Survey of JPEG compression history analysis Andrew B. Lewis Computer Laboratory Topics in security forensic signal analysis References Neelamani et al.: JPEG compression history estimation for color images,

More information

Basic of Probability Theory for Ph.D. students in Education, Social Sciences and Business (Shing On LEUNG and Hui Ping WU) (May 2015)

Basic of Probability Theory for Ph.D. students in Education, Social Sciences and Business (Shing On LEUNG and Hui Ping WU) (May 2015) Basic of Probability Theory for Ph.D. students in Education, Social Sciences and Business (Shing On LEUNG and Hui Ping WU) (May 2015) This is a series of 3 talks respectively on: A. Probability Theory

More information

Physics Dec The Maxwell Velocity Distribution

Physics Dec The Maxwell Velocity Distribution Physics 301 7-Dec-2005 29-1 The Maxwell Velocity Distribution The beginning of chapter 14 covers some things we ve already discussed. Way back in lecture 6, we calculated the pressure for an ideal gas

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-2 STATS 202: Data mining and analysis Sergio Bacallado September 19, 2018 1 / 23 Announcements Starting next week, Julia Fukuyama

More information

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio Estimation of reliability parameters from Experimental data (Parte 2) This lecture Life test (t 1,t 2,...,t n ) Estimate θ of f T t θ For example: λ of f T (t)= λe - λt Classical approach (frequentist

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Chapter 10. Hypothesis Testing (I)

Chapter 10. Hypothesis Testing (I) Chapter 10. Hypothesis Testing (I) Hypothesis Testing, together with statistical estimation, are the two most frequently used statistical inference methods. It addresses a different type of practical problems

More information

MC3: Econometric Theory and Methods. Course Notes 4

MC3: Econometric Theory and Methods. Course Notes 4 University College London Department of Economics M.Sc. in Economics MC3: Econometric Theory and Methods Course Notes 4 Notes on maximum likelihood methods Andrew Chesher 25/0/2005 Course Notes 4, Andrew

More information

Conjugate Priors: Beta and Normal Spring 2018

Conjugate Priors: Beta and Normal Spring 2018 Conjugate Priors: Beta and Normal 18.05 Spring 2018 Review: Continuous priors, discrete data Bent coin: unknown probability θ of heads. Prior f (θ) = 2θ on [0,1]. Data: heads on one toss. Question: Find

More information

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary ECE 830 Spring 207 Instructor: R. Willett Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics Executive summary In the last lecture we saw that the likelihood

More information

Hybrid Logic and Uncertain Logic

Hybrid Logic and Uncertain Logic Journal of Uncertain Systems Vol.3, No.2, pp.83-94, 2009 Online at: www.jus.org.uk Hybrid Logic and Uncertain Logic Xiang Li, Baoding Liu Department of Mathematical Sciences, Tsinghua University, Beijing,

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 .0.0 5 5 1.0 7 5 X2 X2 7 1.5 1.0 0.5 3 1 2 Hierarchical clustering

More information

Theory Change and Bayesian Statistical Inference

Theory Change and Bayesian Statistical Inference Theory Change and Bayesian Statistical Inference Jan-Willem Romeyn Department of Philosophy, University of Groningen j.w.romeyn@philos.rug.nl Abstract This paper addresses the problem that Bayesian statistical

More information

Chapter 2. Assertions. An Introduction to Separation Logic c 2011 John C. Reynolds February 3, 2011

Chapter 2. Assertions. An Introduction to Separation Logic c 2011 John C. Reynolds February 3, 2011 Chapter 2 An Introduction to Separation Logic c 2011 John C. Reynolds February 3, 2011 Assertions In this chapter, we give a more detailed exposition of the assertions of separation logic: their meaning,

More information

Bayesian Updating: Odds Class 12, Jeremy Orloff and Jonathan Bloom

Bayesian Updating: Odds Class 12, Jeremy Orloff and Jonathan Bloom Learning Goals Bayesian Updating: Odds Class 2, 8.05 Jeremy Orloff and Jonathan Bloom. Be able to convert between odds and probability. 2. Be able to update prior odds to posterior odds using Bayes factors.

More information

Chapter 5: Preferences

Chapter 5: Preferences Chapter 5: Preferences 5.1: Introduction In chapters 3 and 4 we considered a particular type of preferences in which all the indifference curves are parallel to each other and in which each indifference

More information

Statistical Inference: Uses, Abuses, and Misconceptions

Statistical Inference: Uses, Abuses, and Misconceptions Statistical Inference: Uses, Abuses, and Misconceptions Michael W. Trosset Indiana Statistical Consulting Center Department of Statistics ISCC is part of IU s Department of Statistics, chaired by Stanley

More information