Spatial Modeling of grub density of the may beetle (forest cockchafer: Melolontha hippocastani) in the Hessischen Ried area

Size: px
Start display at page:

Download "Spatial Modeling of grub density of the may beetle (forest cockchafer: Melolontha hippocastani) in the Hessischen Ried area"

Transcription

1 Spatial Modeling of grub density of the may beetle (forest cockchafer: Melolontha hippocastani) in the Hessischen Ried area Matthias Schmidt und Rainer Hurling

2 Motivation Since several decades the forests in the Hessisch Ried area are subject to high abiotic and biotic stress factors. Subsequently in some areas the stands show an extensive decay. Temperature veg = 16,6 C, Precipitation veg = 371 mm, Index of aridity = 38.4 Extensive drawdown of ground water, fragmentation of forests, emission of pollutants (e.g. kerosin by the nearby Frankfurt airport, nitrogen emission) Outbreaks of may beetle, gipsy moth, bark beetle

3 Motivation

4 Motivation Biology: The may beetle lives usually 3 years as grub below ground (including moulting), at the end of summer of the third year it cocoons and the adult beetle hatches out from the ground in May of the 4 years. An extensively mature grub takes place, followed by copulation and death of the males, several egg depositions in the ground and death of the females. There are several tribes in the Rhine valley that follow their own cycle. Spatial prognosis of grub density (E 3 ) to support decision making in practical forestry concerning regeneration planning (planting, harrowing), (airborne) operations of forest protection applying chemical and biological pesticides against the adult beetle. Tests of hypothesis: Effect of distance to ground water table? Effect of soil type? Effect of forest stand type?..on grub density. Quantifying the spatial pattern of grub density. Method: Generalized additive regression models (mgcv, gamlss)

5 Data base: grub sampling design

6 Data base: grub inventory 2009 Grubs E 3 /m² >10 Samples= % Min. 1 st Quartil Median Mean 3 rd Quartil Max. Distance to GWT 2007 [m] Clay thickness [%] Number of samples with massive clay layer = 55

7 Data base: grub inventory 2009 The grub densities are count data Count data usually show certain properties: integer values, positive data range, significant skewness to the right, high proportion of zeros Standard regression models are not appropriate Distribution assumption: Poisson, zero-inflated Poisson, negative binomial

8 Generalized additive models g( i )= f 1 (DWT i )+f 2 (CTH i )+f 3 (east i,north i ) [1] GD i Grub density at sample plot i [n/m²] GD i x i ~ Poisson( i ) with E(y i x i ) = i and Var (GD i x i ) = i ; GD i = 0, 1, 2,. DWT i : Regionalized distance to GWT in October 2007 at sample plot i [m] CTH i : Regionalized clay thickness at sample plot i [%] east i, north i : Easting and northing of sample plot i using the Gauß-Krüger-System referring to the 3. meridian f 1, f 2 : 1-dimensional smoothing functions (penalized thin plate basis regressions splines) f 3 : 2-dimensional smoothing function (penalized thin plate basis regressions spline) Distribution assumptions: Poisson, zero inflated Poisson, negative binomial library(mgcv) Wood library(gamlss) Rigby & Stasinopoulos library(aer) Kleiber and Zeileis

9 Poisson distribution Model AIC Dispersion Parameter g( i )= f 1 (DWT i )+f 2 (CTH i ) [2.0] *** g( i )= f 1 (DWT i )+f 2 (CTH i )+f 3 (east i,north i ), edf for f 3 (east i,north i )= [2.1] *** edf for f 3 (east i,north i )= [2.2] *** zero-inflated Poisson distribution Modell AIC Mixture Parameter g 1 ( i )= f 11 (DWT i )+f 21 (CTH i ) [3.0] g 2 ( i ) = g 1 ( i )= f 11 (DWT i )+f 21 (CTH i )+f 31 (east i,north i ) edf for f 31 (east i,north i ) = [3.1] g 2 ( i )= f 12 (DWT i ) negative binomial distribution Model AIC Dispersion Parameter g 1 ( i )= f 11 (DWT i )+f 21 (CTH i ) [4.0] g 2 1/ ) = g 1 ( i )= f 11 (DWT i )+f 21 (CTH i )+f 31 (east i,north i ), edf for f 3 (east i,north i )= [4.1] g 2 1/ ) = edf for f 3 (east i,north i )= [4.2] g 2 1/ ) = edf for f 3 (east i,north i )= [4.3] g 2 1/ ) = 1.734

10 Validation / quantile residuals Assumption: the observations y 1,,y n are independently distributed with probability density function (pdf) f(y i, i, ). F(y i, i, ) denotes the corresponding (cumulative) distribution function (cdf). the quantile residual r i corresponding to observation y i is defined as: r i =F - F( ; ˆ, ˆ ) y i i with F and F - denoting the distribution and quantile function of the standard normal distribution. r i = ~N(0,1), if f(y i, i, ) is indeed the correct model for the observations Dunn and Smyth, 1996

11 Validation quantile residuals Poisson distribution negativ binomial distribution zipoisson distribution negativ binomial distribution

12 Validation quantile residuals Draw a random sample from the fitted negative binomial model of 10 times the original sample size applying the original predictors and fitting a Poisson and a negative binomial model to the data. negativ binomial distribution Poisson distribution

13 Results model effects

14 Model predictions: Effect of distance to GWT

15 Effect of the spatial trend function

16 Conclusions Generalized additive regression models (~negative binomial distribution) are a well suited approach for modeling count data of may beetle grubs. The most important model effect on grub density results from geographic location. The causal predictors distance to GWT and clay thickness show significant effects on grub density also. If distance to GWT exceeds more than 4 m and clay thickness is less than 40% no effects are observed. For validation purposes quantile residuals should be used. In 2013 the next grub inventory will take place on the same grid and the model will be parameterized again. The potential changes in the spatial pattern of grub density over time and the effects of applied pesticides will be tested. An additional inventory of pest-damages should be used to quantify the relation between E 3 /m³ and corresponding damages.

17 Chemical and biological operations of forest protection both require a good information base! Busch (1865)

18 Simulation der Effekte unterschiedlicher Grundwasserbewirtschaftungspläne auf die Engerlingsdichte

19 Simulation der Effekte unterschiedlicher Grundwasserbewirtschaftungspläne auf die Engerlingsdichte E 3 /m²

20 Model predictions: Effect of clay thickness

21 Poisson distribution g i ) = i = x i = + 1 x i1 + + k x ik or i = exp( i ) = exp(x i ) = exp( ) exp( 1 x i1 ) exp( k x ik ) mit g(.) : Verknüpfungsfunktion: natürlicher Logarithmus und y i x i ~ Poisson( i ) mit E(y i x i ) = i and Var (y i x i ) = i ; y i = 0, 1, 2,. Wahrscheinlichkeitsdichtefunktion der Poissonverteilung: f(y; ) = y e - y=0, 1, 2,.. y!

22 zero inflated Poisson distribution g 1 i ) = i = x i = + 11 x i1 + + k1 x ik mit g 1 (.) : Verknüpfungsfunktion: natürlicher Logarithmus g 2 ( i ) = i = x i = + 12 x i1 + + k2 x ik und g 2 (.) : Verknüpfungsfunktion: logistisch und y i x i ~ ZIP( i, i ) mit E(y i x i ) = (1- ) i und Var(y i x i ) = i (1- ) (1+ i ) [3a] [3b] y i = 0, 1, 2,. Wahrscheinlichkeitsdichtefunktion der Zero-inflated Poissonverteilung: - e -, y = 0 f(y;, ) = y - e y! - y, y = 1,2,.

23 negativ binomial distribution g 1 i ) = i = x i = + 11 x i1 + + k1 x ik mit g 1 (.) : Verknüpfungsfunktion: natürlicher Logarithmus und y i x i ~ Poisson( i ) with i ~ Gamma(shape=, scale=1/ ) = a Gamma(Mittelwert=1, Varianz=1/ ) und y i x i ~ NegBin( i, ) mit E(y i x i )= i und Var(y i x i ) = i i ; y i = 0, 1, 2,. Wahrscheinlichkeitsdichtefunktion der negativ Binomialverteilung: f(y,, ) = ( y) y ( ) y! ( ) y, mit Gammafunktion. y = 0, 1, 2,. g 2 1/ ) = i = x i = + 12 x i1 + + k2 x ik mit g 2 (.) : Verknüpfungdfunktion: natürlicher Logarithmus b

Matthias Schmidt * and Rainer Hurling

Matthias Schmidt * and Rainer Hurling Schmidt and Hurling Forest Ecosystems 2014, 1:19 RESEARCH ARTICLE Open Access A spatially-explicit count data regression for modeling the density of forest cockchafer (Melolontha hippocastani) larvae in

More information

Poisson population distribution X P(

Poisson population distribution X P( Chapter 8 Poisson population distribution P( ) ~ 8.1 Definition of a Poisson distribution, ~ P( ) If the random variable has a Poisson population distribution, i.e., P( ) probability function is given

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

MODELING COUNT DATA Joseph M. Hilbe

MODELING COUNT DATA Joseph M. Hilbe MODELING COUNT DATA Joseph M. Hilbe Arizona State University Count models are a subset of discrete response regression models. Count data are distributed as non-negative integers, are intrinsically heteroskedastic,

More information

School on Modelling Tools and Capacity Building in Climate and Public Health April 2013

School on Modelling Tools and Capacity Building in Climate and Public Health April 2013 2453-14 School on Modelling Tools and Capacity Building in Climate and Public Health 15-26 April 2013 Expectile and Quantile Regression and Other Extensions KAZEMBE Lawrence University of Malawi Chancellor

More information

Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University

Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University Probability theory and statistical analysis: a review Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University Concepts assumed known Histograms, mean, median, spread, quantiles Probability,

More information

A Few Special Distributions and Their Properties

A Few Special Distributions and Their Properties A Few Special Distributions and Their Properties Econ 690 Purdue University Justin L. Tobias (Purdue) Distributional Catalog 1 / 20 Special Distributions and Their Associated Properties 1 Uniform Distribution

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Generalized Linear Models for Count, Skewed, and If and How Much Outcomes

Generalized Linear Models for Count, Skewed, and If and How Much Outcomes Generalized Linear Models for Count, Skewed, and If and How Much Outcomes Today s Class: Review of 3 parts of a generalized model Models for discrete count or continuous skewed outcomes Models for two-part

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Assumptions of Linear Model Homoskedasticity Model variance No error in X variables Errors in variables No missing data Missing data model Normally distributed error Error in

More information

ZERO INFLATED POISSON REGRESSION

ZERO INFLATED POISSON REGRESSION STAT 6500 ZERO INFLATED POISSON REGRESSION FINAL PROJECT DEC 6 th, 2013 SUN JEON DEPARTMENT OF SOCIOLOGY UTAH STATE UNIVERSITY POISSON REGRESSION REVIEW INTRODUCING - ZERO-INFLATED POISSON REGRESSION SAS

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

Test Problems for Probability Theory ,

Test Problems for Probability Theory , 1 Test Problems for Probability Theory 01-06-16, 010-1-14 1. Write down the following probability density functions and compute their moment generating functions. (a) Binomial distribution with mean 30

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 2016 MODULE 1 : Probability distributions Time allowed: Three hours Candidates should answer FIVE questions. All questions carry equal marks.

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

11. Generalized Linear Models: An Introduction

11. Generalized Linear Models: An Introduction Sociology 740 John Fox Lecture Notes 11. Generalized Linear Models: An Introduction Copyright 2014 by John Fox Generalized Linear Models: An Introduction 1 1. Introduction I A synthesis due to Nelder and

More information

Edexcel GCE Statistics 2

Edexcel GCE Statistics 2 Edexcel GCE Statistics Continuous Random Variables. Edited by: K V Kumaran kumarmaths.weebly.com 1 kumarmaths.weebly.com kumarmaths.weebly.com 3 kumarmaths.weebly.com 4 kumarmaths.weebly.com 5 kumarmaths.weebly.com

More information

PhysicsAndMathsTutor.com

PhysicsAndMathsTutor.com PhysicsAndMathsTutor.com June 2005 6. A continuous random variable X has probability density function f(x) where 3 k(4 x x ), 0 x 2, f( x) = 0, otherwise, where k is a positive integer. 1 (a) Show that

More information

Lecture-19: Modeling Count Data II

Lecture-19: Modeling Count Data II Lecture-19: Modeling Count Data II 1 In Today s Class Recap of Count data models Truncated count data models Zero-inflated models Panel count data models R-implementation 2 Count Data In many a phenomena

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Generalized Linear Models: An Introduction

Generalized Linear Models: An Introduction Applied Statistics With R Generalized Linear Models: An Introduction John Fox WU Wien May/June 2006 2006 by John Fox Generalized Linear Models: An Introduction 1 A synthesis due to Nelder and Wedderburn,

More information

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Cinzia Viroli 1 joint with E. Bonafede 1, S. Robin 2 & F. Picard 3 1 Department of Statistical Sciences, University

More information

Lecture 7. Testing for Poisson cdf - Poisson regression - Random points in space 1

Lecture 7. Testing for Poisson cdf - Poisson regression - Random points in space 1 Lecture 7. Testing for Poisson cdf - Poisson regression - Random points in space 1 Igor Rychlik Chalmers Department of Mathematical Sciences Probability, Statistics and Risk, MVE300 Chalmers April 2010

More information

On the Dependency of Soccer Scores - A Sparse Bivariate Poisson Model for the UEFA EURO 2016

On the Dependency of Soccer Scores - A Sparse Bivariate Poisson Model for the UEFA EURO 2016 On the Dependency of Soccer Scores - A Sparse Bivariate Poisson Model for the UEFA EURO 2016 A. Groll & A. Mayr & T. Kneib & G. Schauberger Department of Statistics, Georg-August-University Göttingen MathSport

More information

C4-304 STATISTICS OF LIGHTNING OCCURRENCE AND LIGHTNING CURRENT S PARAMETERS OBTAINED THROUGH LIGHTNING LOCATION SYSTEMS

C4-304 STATISTICS OF LIGHTNING OCCURRENCE AND LIGHTNING CURRENT S PARAMETERS OBTAINED THROUGH LIGHTNING LOCATION SYSTEMS 2, rue d'artois, F-75008 Paris http://www.cigre.org C4-304 Session 2004 CIGRÉ STATISTICS OF LIGHTNING OCCURRENCE AND LIGHTNING CURRENT S PARAMETERS OBTAINED THROUGH LIGHTNING LOCATION SYSTEMS baran@el.poweng.pub.ro

More information

Generalised linear models. Response variable can take a number of different formats

Generalised linear models. Response variable can take a number of different formats Generalised linear models Response variable can take a number of different formats Structure Limitations of linear models and GLM theory GLM for count data GLM for presence \ absence data GLM for proportion

More information

A CASE STUDY IN HANDLING OVER-DISPERSION IN NEMATODE COUNT DATA SCOTT EDWIN DOUGLAS KREIDER. B.S., The College of William & Mary, 2008 A THESIS

A CASE STUDY IN HANDLING OVER-DISPERSION IN NEMATODE COUNT DATA SCOTT EDWIN DOUGLAS KREIDER. B.S., The College of William & Mary, 2008 A THESIS A CASE STUDY IN HANDLING OVER-DISPERSION IN NEMATODE COUNT DATA by SCOTT EDWIN DOUGLAS KREIDER B.S., The College of William & Mary, 2008 A THESIS submitted in partial fulfillment of the requirements for

More information

Application of Copulas as a New Geostatistical Tool

Application of Copulas as a New Geostatistical Tool Application of Copulas as a New Geostatistical Tool Presented by Jing Li Supervisors András Bardossy, Sjoerd Van der zee, Insa Neuweiler Universität Stuttgart Institut für Wasserbau Lehrstuhl für Hydrologie

More information

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago Mohammed Research in Pharmacoepidemiology (RIPE) @ National School of Pharmacy, University of Otago What is zero inflation? Suppose you want to study hippos and the effect of habitat variables on their

More information

What is insect forecasting, and why do it

What is insect forecasting, and why do it Insect Forecasting Programs: Objectives, and How to Properly Interpret the Data John Gavloski, Extension Entomologist, Manitoba Agriculture, Food and Rural Initiatives Carman, MB R0G 0J0 Email: jgavloski@gov.mb.ca

More information

Chapter 4a Probability Models

Chapter 4a Probability Models Chapter 4a Probability Models 4a.2 Probability models for a variable with a finite number of values 297 4a.1 Introduction Chapters 2 and 3 are concerned with data description (descriptive statistics) where

More information

gamboostlss: boosting generalized additive models for location, scale and shape

gamboostlss: boosting generalized additive models for location, scale and shape gamboostlss: boosting generalized additive models for location, scale and shape Benjamin Hofner Joint work with Andreas Mayr, Nora Fenske, Thomas Kneib and Matthias Schmid Department of Medical Informatics,

More information

Analysis of Count Data A Business Perspective. George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013

Analysis of Count Data A Business Perspective. George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013 Analysis of Count Data A Business Perspective George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013 Overview Count data Methods Conclusions 2 Count data Count data Anything with

More information

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution

1 Uniform Distribution. 2 Gamma Distribution. 3 Inverse Gamma Distribution. 4 Multivariate Normal Distribution. 5 Multivariate Student-t Distribution A Few Special Distributions Their Properties Econ 675 Iowa State University November 1 006 Justin L Tobias (ISU Distributional Catalog November 1 006 1 / 0 Special Distributions Their Associated Properties

More information

PAPER 206 APPLIED STATISTICS

PAPER 206 APPLIED STATISTICS MATHEMATICAL TRIPOS Part III Thursday, 1 June, 2017 9:00 am to 12:00 pm PAPER 206 APPLIED STATISTICS Attempt no more than FOUR questions. There are SIX questions in total. The questions carry equal weight.

More information

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape

LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape LASSO-Type Penalization in the Framework of Generalized Additive Models for Location, Scale and Shape Nikolaus Umlauf https://eeecon.uibk.ac.at/~umlauf/ Overview Joint work with Andreas Groll, Julien Hambuckers

More information

3. DISCRETE PROBABILITY DISTRIBUTIONS

3. DISCRETE PROBABILITY DISTRIBUTIONS 1 3. DISCRETE PROBABILITY DISTRIBUTIONS Probability distributions may be discrete or continuous. This week we examine two discrete distributions commonly used in biology: the binomial and Poisson distributions.

More information

Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples.

Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples. Regression models Generalized linear models in R Dr Peter K Dunn http://www.usq.edu.au Department of Mathematics and Computing University of Southern Queensland ASC, July 00 The usual linear regression

More information

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Thursday, August 30, 2018 Work all problems. 60 points are needed to pass at the Masters Level and 75

More information

Homework 5: Answer Key. Plausible Model: E(y) = µt. The expected number of arrests arrests equals a constant times the number who attend the game.

Homework 5: Answer Key. Plausible Model: E(y) = µt. The expected number of arrests arrests equals a constant times the number who attend the game. EdPsych/Psych/Soc 589 C.J. Anderson Homework 5: Answer Key 1. Probelm 3.18 (page 96 of Agresti). (a) Y assume Poisson random variable. Plausible Model: E(y) = µt. The expected number of arrests arrests

More information

Ch. 4 - Population Ecology

Ch. 4 - Population Ecology Ch. 4 - Population Ecology Ecosystem all of the living organisms and nonliving components of the environment in an area together with their physical environment How are the following things related? mice,

More information

Georgia Performance Standards for Urban Watch Restoration Field Trips

Georgia Performance Standards for Urban Watch Restoration Field Trips Georgia Performance Standards for Field Trips 6 th grade S6E3. Students will recognize the significant role of water in earth processes. a. Explain that a large portion of the Earth s surface is water,

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2 STAT 4 Exam I Continuous RVs Fall 7 Practice. Suppose a random variable X has the following probability density function: f ( x ) = sin x, < x < π, zero otherwise. a) Find P ( X < 4 π ). b) Find µ = E

More information

Generalized Multilevel Models for Non-Normal Outcomes

Generalized Multilevel Models for Non-Normal Outcomes Generalized Multilevel Models for Non-Normal Outcomes Topics: 3 parts of a generalized (multilevel) model Models for binary, proportion, and categorical outcomes Complications for generalized multilevel

More information

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION FOR SAMPLE OF RAW DATA (E.G. 4, 1, 7, 5, 11, 6, 9, 7, 11, 5, 4, 7) BE ABLE TO COMPUTE MEAN G / STANDARD DEVIATION MEDIAN AND QUARTILES Σ ( Σ) / 1 GROUPED DATA E.G. AGE FREQ. 0-9 53 10-19 4...... 80-89

More information

Lecture 8 Insect ecology and balance of life

Lecture 8 Insect ecology and balance of life Lecture 8 Insect ecology and balance of life Ecology: The term ecology is derived from the Greek term oikos meaning house combined with logy meaning the science of or the study of. Thus literally ecology

More information

STAT5044: Regression and Anova

STAT5044: Regression and Anova STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation

More information

CE 3710: Uncertainty Analysis in Engineering

CE 3710: Uncertainty Analysis in Engineering FINAL EXAM Monday, December 14, 10:15 am 12:15 pm, Chem Sci 101 Open book and open notes. Exam will be cumulative, but emphasis will be on material covered since Exam II Learning Expectations for Final

More information

11. Generalized Linear Models: An Introduction

11. Generalized Linear Models: An Introduction Sociolog 740 John Fox Lecture Notes 11. Generalized Linear Models: An Introduction Generalized Linear Models: An Introduction 1 1. Introduction I A snthesis due to Nelder and Wedderburn, generalized linear

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /13/2016 1/33

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /13/2016 1/33 BIO5312 Biostatistics Lecture 03: Discrete and Continuous Probability Distributions Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 9/13/2016 1/33 Introduction In this lecture,

More information

Varieties of Count Data

Varieties of Count Data CHAPTER 1 Varieties of Count Data SOME POINTS OF DISCUSSION What are counts? What are count data? What is a linear statistical model? What is the relationship between a probability distribution function

More information

A strategy for modelling count data which may have extra zeros

A strategy for modelling count data which may have extra zeros A strategy for modelling count data which may have extra zeros Alan Welsh Centre for Mathematics and its Applications Australian National University The Data Response is the number of Leadbeater s possum

More information

Distribution Fitting (Censored Data)

Distribution Fitting (Censored Data) Distribution Fitting (Censored Data) Summary... 1 Data Input... 2 Analysis Summary... 3 Analysis Options... 4 Goodness-of-Fit Tests... 6 Frequency Histogram... 8 Comparison of Alternative Distributions...

More information

Generalized Additive Models

Generalized Additive Models Generalized Additive Models Overview What is a GAM? What is smoothing? How do GAMs work? Fitting GAMs using dsm What is a GAM? "gam" 1. Collective noun used to refer to a group of whales, or rarely also

More information

Unit 2: Ecology. Big Idea...

Unit 2: Ecology. Big Idea... Name: Block: Unit 2: Ecology Big Idea... The natural world is defined by organisms and life processes which conform to principles regarding conservation and transformation of matter and energy. Knowledge

More information

Modeling Overdispersion

Modeling Overdispersion James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 Introduction 2 Introduction In this lecture we discuss the problem of overdispersion in

More information

Population Ecology NRM

Population Ecology NRM Population Ecology NRM What do we need? MAKING DECISIONS Consensus working through views until agreement among all CONSENSUS Informed analyze options through respectful discussion INFORMED DECISION Majority

More information

THE BIOLOGY OF PHENACOCCUS AZALEAE KUWANA, A PEST OF BUNGE PRICKLY ASH (ZANTHOXYLUM BUNGEANUM MAXIM) FOREST IN NORTHERN CHINA.

THE BIOLOGY OF PHENACOCCUS AZALEAE KUWANA, A PEST OF BUNGE PRICKLY ASH (ZANTHOXYLUM BUNGEANUM MAXIM) FOREST IN NORTHERN CHINA. Entomologica, Bari, 33, (1999): 377-382 XIE YINGPING 1 ; ZHAO JINLONG 2 ; GUO YONGPING 2 ; LI YUNFENG 2 ; ZHANG HAIJUN 2 ; GUO YUQING 2 1 Department of Biology, Nankai University, Tianjin 300071, China

More information

a table or a graph or an equation.

a table or a graph or an equation. Topic (8) POPULATION DISTRIBUTIONS 8-1 So far: Topic (8) POPULATION DISTRIBUTIONS We ve seen some ways to summarize a set of data, including numerical summaries. We ve heard a little about how to sample

More information

Network Simulation Chapter 5: Traffic Modeling. Chapter Overview

Network Simulation Chapter 5: Traffic Modeling. Chapter Overview Network Simulation Chapter 5: Traffic Modeling Prof. Dr. Jürgen Jasperneite 1 Chapter Overview 1. Basic Simulation Modeling 2. OPNET IT Guru - A Tool for Discrete Event Simulation 3. Review of Basic Probabilities

More information

Page 2. (b) (i) 2.6 to 2.7 = 2 marks; Incorrect answer but evidence of a numerator of OR or denominator of 9014 = 1 mark; 2

Page 2. (b) (i) 2.6 to 2.7 = 2 marks; Incorrect answer but evidence of a numerator of OR or denominator of 9014 = 1 mark; 2 M.(a). Females are (generally) longer / larger / bigger / up to 5(mm) / males are (generally) shorter / smaller / up to 00(mm); Ignore: tall Accept: females have a larger / 90 modal / peak / most common

More information

Poisson regression 1/15

Poisson regression 1/15 Poisson regression 1/15 2/15 Counts data Examples of counts data: Number of hospitalizations over a period of time Number of passengers in a bus station Blood cells number in a blood sample Number of typos

More information

Nonparametric inference in hidden Markov and related models

Nonparametric inference in hidden Markov and related models Nonparametric inference in hidden Markov and related models Roland Langrock, Bielefeld University Roland Langrock Bielefeld University 1 / 47 Introduction and motivation Roland Langrock Bielefeld University

More information

Multivariate negative binomial models for insurance claim counts

Multivariate negative binomial models for insurance claim counts Multivariate negative binomial models for insurance claim counts Peng Shi (Northern Illinois University) and Emiliano A. Valdez (University of Connecticut) 9 November 0, Montréal, Quebec Université de

More information

A Hierarchical Perspective on Lee-Carter Models

A Hierarchical Perspective on Lee-Carter Models A Hierarchical Perspective on Lee-Carter Models Paul Eilers Leiden University Medical Centre L-C Workshop, Edinburgh 24 The vantage point Previous presentation: Iain Currie led you upward From Glen Gumbel

More information

Math Review Sheet, Fall 2008

Math Review Sheet, Fall 2008 1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the

More information

Temperature. (1) directly controls metabolic rates of ectotherms (invertebrates, fish) Individual species

Temperature. (1) directly controls metabolic rates of ectotherms (invertebrates, fish) Individual species Temperature (1) directly controls metabolic rates of ectotherms (invertebrates, fish) Individual species (2) controls concentrations (3) is relatively predictable over and can provide a basis for species.

More information

mgcv: GAMs in R Simon Wood Mathematical Sciences, University of Bath, U.K.

mgcv: GAMs in R Simon Wood Mathematical Sciences, University of Bath, U.K. mgcv: GAMs in R Simon Wood Mathematical Sciences, University of Bath, U.K. mgcv, gamm4 mgcv is a package supplied with R for generalized additive modelling, including generalized additive mixed models.

More information

6 2 Insects and plants

6 2 Insects and plants 6 2 Insects and plants Insect DIY 1. Find plant habitat 2. Find plant 3. Accept plant 4. Eat survive, reproduce Plant characteristics Shape structure Mechanical defenses trichomes Chemical defenses sap,

More information

Lecture 13: More on Binary Data

Lecture 13: More on Binary Data Lecture 1: More on Binary Data Link functions for Binomial models Link η = g(π) π = g 1 (η) identity π η logarithmic log π e η logistic log ( π 1 π probit Φ 1 (π) Φ(η) log-log log( log π) exp( e η ) complementary

More information

Madhya Pradesh Bhoj Open University. Bhopal M.sc Zoology Final Year

Madhya Pradesh Bhoj Open University. Bhopal M.sc Zoology Final Year Subject : Comparative Anatomy of Vertebrates Q.1 Describe the inter-relationship of Uro chords and cephalochordates and their relationship with other deuterostomes. Q.2 Describe origin, evolution and general

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

1.1 Draw one line from each adaptation of the fennec fox to the advantage of the. To reflect the sun s rays

1.1 Draw one line from each adaptation of the fennec fox to the advantage of the. To reflect the sun s rays 4-7 Ecology Trilogy.0 Figure shows a fennec fox. Figure Fennec foxes live in the desert. Drew Avery Creative Commons 2.0. Draw one line from each adaptation of the fennec fox to the advantage of the adaptation.

More information

Truck prices - linear model? Truck prices - log transform of the response variable. Interpreting models with log transformation

Truck prices - linear model? Truck prices - log transform of the response variable. Interpreting models with log transformation Background Regression so far... Lecture 23 - Sta 111 Colin Rundel June 17, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical or categorical

More information

Log-linear Models for Contingency Tables

Log-linear Models for Contingency Tables Log-linear Models for Contingency Tables Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Log-linear Models for Two-way Contingency Tables Example: Business Administration Majors and Gender A

More information

Brief Review of Probability

Brief Review of Probability Maura Department of Economics and Finance Università Tor Vergata Outline 1 Distribution Functions Quantiles and Modes of a Distribution 2 Example 3 Example 4 Distributions Outline Distribution Functions

More information

A flexible regression approach using GAMLSS in R. Bob Rigby and Mikis Stasinopoulos

A flexible regression approach using GAMLSS in R. Bob Rigby and Mikis Stasinopoulos A flexible regression approach using GAMLSS in R. Bob Rigby and Mikis Stasinopoulos May 27, 2010 2 Preface This book is designed for a short course in GAMLSS given at the University of Athens. : The copyright

More information

Design Flood Estimation in Ungauged Catchments: Quantile Regression Technique And Probabilistic Rational Method Compared

Design Flood Estimation in Ungauged Catchments: Quantile Regression Technique And Probabilistic Rational Method Compared Design Flood Estimation in Ungauged Catchments: Quantile Regression Technique And Probabilistic Rational Method Compared N Rijal and A Rahman School of Engineering and Industrial Design, University of

More information

STAT100 Elementary Statistics and Probability

STAT100 Elementary Statistics and Probability STAT100 Elementary Statistics and Probability Exam, Monday, August 11, 014 Solution Show all work clearly and in order, and circle your final answers. Justify your answers algebraically whenever possible.

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

PENALIZING YOUR MODELS

PENALIZING YOUR MODELS PENALIZING YOUR MODELS AN OVERVIEW OF THE GENERALIZED REGRESSION PLATFORM Michael Crotty & Clay Barker Research Statisticians JMP Division, SAS Institute Copyr i g ht 2012, SAS Ins titut e Inc. All rights

More information

Solutions. Some of the problems that might be encountered in collecting data on check-in times are:

Solutions. Some of the problems that might be encountered in collecting data on check-in times are: Solutions Chapter 7 E7.1 Some of the problems that might be encountered in collecting data on check-in times are: Need to collect separate data for each airline (time and cost). Need to collect data for

More information

Trends in Forest Soil Acidity: A GAM Based Approach with Application on Swedish Forest Soil Inventory Data

Trends in Forest Soil Acidity: A GAM Based Approach with Application on Swedish Forest Soil Inventory Data Trends in Forest Soil Acidity: A GAM Based Approach with Application on Swedish Forest Soil Inventory Data By Staffan Betnér Department of Statistics Uppsala University Supervisor: Shaobo Jin 2018 Abstract

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Assumptions of Linear Model Homoskedasticity Model variance No error in X variables Errors in variables No missing data Missing data model Normally distributed error GLM Error

More information

Stat 587: Key points and formulae Week 15

Stat 587: Key points and formulae Week 15 Odds ratios to compare two proportions: Difference, p 1 p 2, has issues when applied to many populations Vit. C: P[cold Placebo] = 0.82, P[cold Vit. C] = 0.74, Estimated diff. is 8% What if a year or place

More information

Linear Regression With Special Variables

Linear Regression With Special Variables Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

Gaussian copula regression using R

Gaussian copula regression using R University of Warwick August 17, 2011 Gaussian copula regression using R Guido Masarotto University of Padua Cristiano Varin Ca Foscari University, Venice 1/ 21 Framework Gaussian copula marginal regression

More information

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction

More information

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016 Approach to Field Research Data Generation and Field Logistics Part 1 Lecture 3 AEC 460 Road Map How we do ecology Part 1 Recap Types of data Sampling abundance and density methods Part 2 Sampling design

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Regression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

Regression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102 Background Regression so far... Lecture 21 - Sta102 / BME102 Colin Rundel November 18, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical

More information

Identification of hotspots of rat abundance and their effect on human risk of leptospirosis in a Brazilian slum community

Identification of hotspots of rat abundance and their effect on human risk of leptospirosis in a Brazilian slum community Identification of hotspots of rat abundance and their effect on human risk of leptospirosis in a Brazilian slum community Poppy Miller 1 Kate Hacker 2 Peter Diggle 1 Mike Begon 3 James Childs 2 Albert

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

Statistics: A review. Why statistics?

Statistics: A review. Why statistics? Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval

More information