This report details analyses and methodologies used to examine and visualize the spatial and nonspatial

Similar documents
Exploratory Spatial Data Analysis (ESDA)

Applying Health Outcome Data to Improve Health Equity

ESRI 2008 Health GIS Conference

Modeling Spatial Relationships using Regression Analysis

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Modeling Spatial Relationships Using Regression Analysis

GIS Spatial Statistics for Public Opinion Survey Response Rates

In matrix algebra notation, a linear model is written as

Measuring community health outcomes: New approaches for public health services research

Using Spatial Statistics Social Service Applications Public Safety and Public Health

Medical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Dr Arulsivanathan Naidoo Statistics South Africa 18 October 2017

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

Spatial Disparities in the Distribution of Parks and Green Spaces in the United States

Spatial Variation in Hospitalizations for Cardiometabolic Ambulatory Care Sensitive Conditions Across Canada

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

Tracey Farrigan Research Geographer USDA-Economic Research Service

Defining Statistically Significant Spatial Clusters of a Target Population using a Patient-Centered Approach within a GIS

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

The Church Demographic Specialists

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2

Application of Indirect Race/ Ethnicity Data in Quality Metric Analyses

GIS Analysis: Spatial Statistics for Public Health: Lauren M. Scott, PhD; Mark V. Janikas, PhD

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL

MAKING PLANNING LOCAL

JOINT STRATEGIC NEEDS ASSESSMENT (JSNA) Key findings from the Leicestershire JSNA and Charnwood summary

Big-Geo-Data EHR Infrastructure Development for On-Demand Analytics

Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

Mapping For Wellness: Then and Now

Comparison of spatial methods for measuring road accident hotspots : a case study of London

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania

Social Vulnerability Index. Susan L. Cutter Department of Geography, University of South Carolina

Mapping Communities of Opportunity in New Orleans

Community Inclusion in Colorado

OBESITY AND LOCATION IN MARION COUNTY, INDIANA MIDWEST STUDENT SUMMIT, APRIL Samantha Snyder, Purdue University

Using Spatial Statistics and Geostatistical Analyst as Educational Tools

GeoHealth Applications Platform ESRI Health GIS Conference 2013

Objectives Define spatial statistics Introduce you to some of the core spatial statistics tools available in ArcGIS 9.3 Present a variety of example a

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Do the Causes of Poverty Vary by Neighborhood Type?

Applications of GIS in Health Research. West Nile virus

Demographic Data in ArcGIS. Harry J. Moore IV

Regression Analysis of 911 call frequency in Portland, OR Urban Areas in Relation to Call Center Vicinity Elyse Maurer March 13, 2015

Effective Use of Geographic Maps

Jun Tu. Department of Geography and Anthropology Kennesaw State University

Administrative Data Research Facility Linked HMDA and ACS Database

Summary of OLS Results - Model Variables

Geospatial Analysis of Job-Housing Mismatch Using ArcGIS and Python

GIS and Health Geography. What is epidemiology?

ADDRESSING TITLE VI AND ENVIRONMENTAL JUSTICE IN LONG-RANGE TRANSPORTATION PLANS

Driving Forces of Houston s Burglary Hotspots During Hurricane Rita

Socioeconomic determinants of geographic disparities in campylobacteriosis risk: a comparison of global and local modeling approaches

Purpose Study conducted to determine the needs of the health care workforce related to GIS use, incorporation and training.

Assessing Social Vulnerability to Biophysical Hazards. Dr. Jasmine Waddell

Egypt Public DSS. the right of access to information. Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS]

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran

Where to Invest Affordable Housing Dollars in Polk County?: A Spatial Analysis of Opportunity Areas

Keywords: Air Quality, Environmental Justice, Vehicle Emissions, Public Health, Monitoring Network

BROOKINGS May

Texas A&M University

APPENDIX C-3 Equitable Target Areas (ETA) Technical Analysis Methodology

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

Integrating GIS into Food Access Analysis

Creating a Geographic Health Information System to Analyze Spatial and Social Patterns of Emergency Department Usage in Olmsted County, Minnesota, USA

Introduction. China demographic trends

Social Epidemiology and Spatial Epidemiology: An Empirical Comparison of Perspectives

Environmental Disparity. KRISTIN M. OSIECKI BS, University of Illinois at Urbana, 1989 MS, University of Illinois at Chicago, 2011 DISSERTATION

Mapping Your Educational Research: Putting Spatial Concepts into Practice with GIS. Mark Hogrebe Washington University in St.

Maggie M. Kovach. Department of Geography University of North Carolina at Chapel Hill

Long Island Breast Cancer Study and the GIS-H (Health)

DEVELOPING DECISION SUPPORT TOOLS FOR THE IMPLEMENTATION OF BICYCLE AND PEDESTRIAN SAFETY STRATEGIES

TUESDAYS AT APA PLANNING AND HEALTH. SAGAR SHAH, PhD AMERICAN PLANNING ASSOCIATION SEPTEMBER 2017 DISCUSSING THE ROLE OF FACTORS INFLUENCING HEALTH

Descriptive Statistics

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1

Mapping and Health Equity Advocacy

Community Health Needs Assessment through Spatial Regression Modeling

Spatial Analysis 1. Introduction

Integration for Informed Decision Making

An Assessment of People, Place and Business on Syracuse s Near Northside

Gis Based Analysis of Supply and Forecasting Piped Water Demand in Nairobi

Nature of Spatial Data. Outline. Spatial Is Special

A FOSS Web Tool for Spatial Regression Techniques and its Application to Explore Bike Sharing Usage Patterns

LEHMAN COLLEGE OF THE CITY UNIVERSITY OF NEW YORK. 1. Type of Change: Change in Degree Requirements

Urban GIS for Health Metrics

Telling Stories with Numbers Secondary data collection, presentation, and interpretation

Using Geospatial Methods with Other Health and Environmental Data to Identify Populations

Multidimensional Poverty in Colombia: Identifying Regional Disparities using GIS and Population Census Data (2005)

Demographic Data. How to get it and how to use it (with caution) By Amber Keller

Extended Follow-Up and Spatial Analysis of the American Cancer Society Study Linking Particulate Air Pollution and Mortality

RELATIONAL DATA MODELING TO ENHANCE GIS-BASED VISUAL INFORMATION SYSTEMS

Welcome. C o n n e c t i n g

Inclusion of Non-Street Addresses in Cancer Cluster Analysis

GIS in Weather and Society

The Cost of Transportation : Spatial Analysis of US Fuel Prices

Lecture 4. Spatial Statistics

Transcription:

Analysis Summary: Acute Myocardial Infarction and Social Determinants of Health Acute Myocardial Infarction Study Summary March 2014 Project Summary :: Purpose This report details analyses and methodologies used to examine and visualize the spatial and nonspatial data relationships between Acute Myocardial Infarction (AMI) hospitalization rates, selected Social Determinants of Health (SDoH), and additional heart attack risk and behavioral factors (diabetes hospitalizations and adult cigarette smoking). This project investigated the premise that data analyses incorporating geo-spatial statistical methods can provide additional insight into understanding the distribution of data and interactions between data variables. This data analysis was completed in 2013 as a pilot project funded by the Environmental Public Health Tracking (EPHT) grant in which the Colorado Department of Public Health and Environment (CDPHE) has participated in, beginning in 2007. :: :: Goals Obtain and organize a consensus census-tract level database of Social Determinants of Health (SDoH) variables and additional risk factor variables of interest to examine community-level variation in the prevalence of Acute Myocardial Infarction hospitalizations Use selected geospatial statistical tools and spatial analysis methods to describe the socio-demographic characteristics of populations and risk factors in relation to the rate of Acute Myocardial Infarction hospitalizations at the census tract geography. Document the selected methods, tools, results, and maps in report format. :: Cardiovascular Disease and Acute Myocardial Infarction Cardiovascular disease including heart disease, stroke, and other vascular diseases is the leading cause of death in the United States. Each year, nearly 800,000 people die from cardiovascular disease, accounting for one in every three deaths. An estimated 67 million American adults have high blood pressure and 71 million American adults have high levels of low-density lipoprotein (LDL) cholesterol, the two leading risk factors for heart disease and stroke. About one of every six healthcare dollars in the United States is spent on treating cardiovascular disease. Annual US cardiovascular disease costs exceed $192.1 billion in direct medical expenses and $312.6 billion when indirect expenses are included. Lifestyle and genetics continue to be primary risk factors for cardiovascular disease. A heart attack, also called acute myocardial infarction, occurs when part of the heart muscle gets damaged or dies because it isn t getting enough blood. Heart attack hospitalization data can be used as an indicator of the cardiovascular disease burden, and to interpolate where disparities in cardiovascular disease exist, geographically. The 2011 Colorado statewide age-adjusted rate of heart attack hospitalizations was 21.3 per 10,000 persons. This report examined how population-level SDoH characteristics (e.g. poverty, race, educational attainment), diabetes hospitalization rates, and adult smoking prevalence influence the rate of heart attack hospitalizations, and provides additional insight into where these characteristics or risk factors may be more associated. Acute Myocardial Infarction Study Summary March 2014 page 1

:: Figure 1 Distribution of AMI Hospitalizations in Colorado Using census tract as our geographical unit, the map above (Figure 1) displays 5-Year 2007-2011 age-adjusted hospitalization rates for Acute Myocardial Infarction (Data Source: Colorado Hospital Association), and illustrates the variation in heart attack hospitalizations across the state. The age-adjusted rate is grouped into four categories or quartiles based on its value and displayed on the map -- each quartile containing roughly the same number of census tracts. Census tracts in Colorado range in size from an entire county in rural areas to small neighborhoods in urban areas. Health outcomes and/or hospitalization data sets in Colorado are often aggregated to a county in order to display the data on a map or identify disparities across the state. This data analysis considers data at the census tract geography which can illustrate community-level patterns in the data. The above map depicts which census tracts rates have confidence intervals that are statistically higher than the state average by drawing a diagonal line across those census tract. Higher rates in census tracts that do not contain the diagonal line should be interpreted with caution as they may have been calculated using a small denominator and could be considered unstable. It should be noted that the hospitalization data used in this analysis was assigned a location based on the billing address of those patients admitted to participating hospitals with ICD-9 codes matching Acute Myocardial Infarction. Billing addresses were then geocoded to assign an accurate census tract ID. All of the addresses inside a census tract were then counted in order to calculate an age-adjusted hospitalization rate for each individual census tract. Acute Myocardial Infarction Study Summary March 2014 page 2

Data Included in the Analysis :: Social Determinants of Health Based on a literature review of research connecting specific socio-demographic and economic indicators to Acute Myocardial Infarction incidence, and through discussions with the CDPHE Chronic Disease prevention program, a core group of SDoH data variables were selected to include in this data analysis (see Table 2 below). Data for the 7 selected socio-demographic indicators were obtained directly from the 2010 U.S. Census and 2007-2011 American Community Survey. These data were organized and grouped using 2010 Census Tract boundaries. :: Diabetes Hospitalization Data The 2007-2011 age-adjusted Diabetes hospitalization rate (Data Source: Colorado Hospital Association) was also included in this data analysis as an additional risk factor for understanding the distribution of heart attack (AMI) hospitalizations. The ageadjusted Diabetes hospitalization rate was calculated for each individual census tract based on patient billing address. The hypothesis that this data closely models the geographic distribution and magnitude of heart attack hospitalization data was also examined as a component of this data analyses. :: Smoking Prevalence Also included in this data analysis, as an additional risk factor for understanding the distribution of heart attack (AMI) hospitalizations, was county-level adult smoking prevalence data, assigned to each census tract based on the county estimate. This 4-Year prevalence data represents the percent of adults aged 18+ years who currently smoke cigarettes and was assembled from the 2007-2010 Colorado Behavioral Risk Factor Surveillance System (BRFSS) dataset. The inclusion of this data into the analysis assumed that every census tract inside of a county has the same adult smoking prevalence, whereas the other data included in this analysis represents an estimate specifically calculated for samples collected inside the census tract. Variables included in the Data Analysis: Health Outcome 01 Age-Adjusted Acute Myocardial Infarction Hospitalization Rate Social Determinants of Health 02 Percent of families living at or below poverty 03 Non-White, Percent of population 04 Percent of the population age 25 and over without a high school diploma 05 Percent of the population age 16 and over in the civilian labor force, Unemployed 06 Percent of the population living in urban areas 07 Percent of occupied housing units, With no vehicle available 08 Percent of occupied housing units, Renter occupied Diabetes Hospitalization Data 09 Age-Adjusted Diabetes Hospitalization Rates Smoking Prevalence 10 Percent of population, Smokes cigarettes (County-Level) Table 1 Acute Myocardial Infarction Study Summary March 2014 page 3

Methodology and Results :: Analysis Overview A data analysis framework was developed to understand the relationships and geographic relationships between the selected data variables (Social Determinants of Health, Diabetes Hospitalization Rate, and Adult Smoking Prevalence) and the age-adjusted AMI hospitalization rate. Specifically, the ArcGIS (ArcMap) 10.1 Spatial Statistics Toolset were used as data analysis tools. There were two goals to developing and applying this framework: 1) Describe the relationships between the outcome variable (AMI hospitalization rates) and independent variables (selected socio-demographic characteristics, diabetes hospitalization rates, and smoking prevalence estimates) and to describe and visualize local variation in these relationships between individual census tracts. 2) Document and communicate the applied methodologies and results from this analysis into a report including process flowcharts, maps, tables, and data visualizations. :: Spatial Analysis Steps The spatial analysis framework follows the five steps outlined below in Table 2: Spatial Data Analysis Steps Steps Exploratory Spatial Data Analysis Use What Does this Tell Us? Description Visualize patterns and understand the spatial distribution of all of the data variables included in the model (Moran s I, Cluster Analysis, Scatterplots) Independently, all of the socio-demographic, diabetes hospitalization rate, and smoking prevallence data exhibit clustering of similar values by census tract. Ordinary Least Squares Regression (OLS) Use What Does this Tell Us? This linear regression model (global) is used to assess the relationships between variables included in the model and to calculate coefficents representing the effect of each variable on the dependent variable. The results from the OLS Regression indicate that all of the variables included in the model are associated with the AMI hospitalization rate (p <0.05), except for these 3 variables: poverty status, vehicle access (mobility), and renter/owner occupancy. Coefficients representing the effect of each variable on the dependent variable were calculated. Identification of Hotspots Use The Getis-Ord Gi* statistic is used to understand where, and to what degree, clustering of high and low values are occurring for each variable in the model. Geographically Weighted Regression (GWR) What Does this Tell Us? Use What Does this Tell Us? There are significant hotspots of AMI hospitalization rates in the metropolitan Denver area, and northward along the front range into Larimer and Weld Counties. After identifying hotspots, the socio-demographic characteristics of hotspots vs. non-hotspots can be compared. The GWR model (local) describes relationships between the variables in the model and the AMI hospitalization rate, but unlike OLS, the GWR model fits a weighted regression equation to every census tract. GWR assess correlations between variables but also accounts for the spatial relationship between neighboring census tracts. Local coefficients were calculated for each census tract to describe the relationship between each variable and AMI hospitalization rate, after controlling for other variables. These coefficients were mapped to visualize the magnitude and geographic relationship between the age-adjusted AMI hospitalization rate, socio-demographics, diabetes hospitalizations, and adult smoking prevalence. Interpretation and Conclusion Using geo-statistical methods in conjunction with traditional statistics can provide Table specific 4 and informative results beyond traditional methods for visualizing and understanding the relationships between AMI hospitalization rates, selected SDoH data, diabetes hospitalization rates, and adult smoking prevalence. Table 2 Acute Myocardial Infarction Study Summary March 2014 page 4

:: Exploratory Spatial Data Scatterplot Scatterplot Percent of Population age 25+ without a High School Diploma Percent Unemployed Age Adjusted Rate of AMI Hospitalizations Map of Families Living Below Poverty Map of Unemployment Age Adjusted Rate of AMI Hospitalizations Figure 2 Scatterplots are a way to visualize potential associations between data variables. The two scatterplots shown above in Figure 2 depict the relationship between two SDoH variables and the age-adjusted AMI hospitalization rate (for each census tract). A choropleth map for each independent variable was also created to show the distribution of the socio-demographic indicators, diabetes hospitalization rates, and estimated smoking prevalence. :: Ordinary Least Squares Regression OLS Resuts: AMI Hospitalizations and Socio-Demographic Indicators, Diabetes Hospitalizations and Smoking Prevalence Variable Coefficient Std Error Probability Robust Probability Intercept (01) Poverty (02) Minority Race/Ethnicity (03) Educational Attainment (04) Employment (05) Urban/Rural (06) Transportation/Mobility (07) Renter/Owner (08) Diabetes (09) Smoking 40.327113-0.858387-0.950749 1.295646 2.718463 0.315704-0.161578 0.013981 0.072360 1.804358 14.933872 0.440762 0.249807 0.495434 1.132399 0.092514 0.621517 0.200061 0.002875 0.757153 0.007023 0.051705 0.000159 0.009025 0.016507 0.000681 0.794934 0.944283 0.000000 0.017311 0.000301 0.100664 0.003223 0.038229 0.027794 0.005336 0.889576 0.958678 0.000060 0.003374 Akaike s Information Criterion (AICc): 14122.937381 Adjusted R-Squared: 0.455042 Koenker (BP) Statistic: 322.446172 (p>0.00000) Jarque-Bera Statistic: 70097.347024 (p>0.00000) Table 3 The Ordinary Least Squares regression (Table 3) included the Age-adjusted AMI Hospitalization Rates as the dependent variable, and Poverty (01), Minority Race (02), Educational Attainment (03), Employment (04), Urban/Rural (05), Transportation/Mobility (06), Renter vs. Owner (07), Age-Adjusted Diabetes Hospitalizations (08), and Adult Smoking Prevalence (09) as the independent variables. Based on the OLS model, the percentage of the population that is of minority race/ethnicity (02), percentage of population (25 years and over) without a high school degree (03), percentage of the population unemployed age 16+ in the civilian labor force (04), percentage of the population Acute Myocardial Infarction Study Summary March 2014 page 5

living in urban areas (05), age-adjusted diabetes hospitalization rates (08), and estimated county smoking prevalence (09) are all significantly associated with age-adjusted AMI hospitalization rates. This model has an adjusted R² of 0.455, which tells us that it accounts for 45.50% of the variability in the age-adjusted AMI hospitalization rate. The Koenker and Jarque-Bera Statistic were significant (p<0.0000) indicating the normal probability should be used, and that the residuals are not normally distributed, detecting spatial dependency in the OLS model. Based on these characteristics, the Geographically Weighted Regression (GWR) model may help explain the spatial dependency occurring in the OLS model. The coefficients resulting from the OLS regression model can be used directly in reporting on the burden of this health outcome. A 1.0% increase in the percentage of the population of minority race/ethnicity was associated with a 1.0 per 100,000 persons decrease in the AMI hospitalization rate. A 1.0% increase in the percentage of population (25 years and over) without a high school degree is associated with a 1.3 per 100,000 persons increase in the AMI hospitalization rate. A 1.0% increase in the percentage of population unemployed (age16+ in the civilian labor force) is associated with a 2.7 per 100,000 persons increase in the AMI hospitalization rate. Finally, a 1.0% increase in estimated adult smoking prevalence is associated with a 1.8 per 100,000 persons increase in the AMI hospitalization rate. These coefficients represent effects for all of the census tracts in Colorado, and are not specific to one indiviudal census tract or community. After interpreting the OLS regression output, a Moran s I Test for Spatial Autocorrelation was performed on the residuals from the OLS model, in order to detect if the residuals are clustered. Clustering of the residuals was detected (Figure 3), which indicates that moving to a Geographic Weighted Regression (GWR) may be beneficial in analyzing the data. Test for Spatial Autocorrelation Ordinary Least Squares Regression: Moran s I 0.076571, z-score 6.92, p<0.0000 A positive value for Moran s I indicates that a feature has neighboring features with similarly high or low attribute values; this feature is part of a cluster. :: Getis-Ord Gi* Statistic Cluster Analysis Figure 3 Cluster analysis using the Getis-Ord Gi* statistic in ArcGIS 10.1 was able to identify census tracts with statistically significant clusters of higher and lower AMI hospitalization rates. The resultant z-scores and p-values define where features with either high or low values cluster spatially. This tool takes into account each census tract within the context of neighboring census tracts. A census tract with a high AMI hospitalization rate may not be a statistically significant hot spot. To be a statistically significant hot spot, a census tract assigned a high AMI hospitalization rate must also be surrounded by neighboring census tracts with high values. Acute Myocardial Infarction Study Summary March 2014 page 6

Significant Clusters of High and Low AMI Rates in Colorado (Census Tract) Figure 4 The map above (Figure 4) displays the result of the Hotspot Analysis Rendering tool (Getis-Ord Gi* Statistic). Given the location of census tracts with their age-adjusted AMI hospitalization rates, the tool calculated the Getis-Ord Gi* statistic which was used to identify statistically significant hot spots and cold spots. The map displays in red the census tracts that are identified as hot spots (Z-Scores of 1.65 or higher). No cold spots were identified. A Moran s I test for Spatial Autocorrelation can also be performed on the AMI hospitalization rates, and the results are shown at right in Figure 5. Given the location of census tracts and their age-adjusted AMI hospitalization rates, this statistic found the pattern of age-adjusted AMI hospitalization rates across Colorado to be clustered. Figure 5 Acute Myocardial Infarction Study Summary March 2014 page 7

Comparison of Variables in AMI Hot Spots vs. Non-Hot Spot (Census Tracts) Variables Statistically Significant Hot Spots of AMI Rates Not a Statistically Significant Hot Spots of AMI Rates Not Included in Study Area Crude Values No. of Census Tracts 24 1,171 54 AMI Hospitalizations 882 37,692 39 Population (2010) 96,463 4,826,072 106,651 Percent by Census Tract Families Below Poverty 21.6 12.9 12.9 Minority Population 42.6 29.2 26.8 25+ without a High School Degree 19.7 10.6 10.1 Unemployment 5.7 5.2 4.4 Urban Population 98.4 84.5 45.1 Transportation/Mobility 8.3 5.6 3 Renter/Owner 42.6 33.3 42.1 Adult Smoking Prevalence 18.0 17.0 16.0 Diabetes Hosp. Rate 3866.1 1401.9 566.4 Table 4 ::SDoH, Diabetes Hospitalization Rate, and Adult Smoking Prevalence of Hot-Spots One way of examining the characteristics of those who live in areas of high AMI hospitalization rates, is to spatially define the AMI hotspots which are census tracts that age-adjusted AMI hospitalization rates that are statistically higher than the state average AND have a Getis-Ord Gi z-score in the 95th percentile. Based on the AMI hospitalization data used in this analysis, here were 24 census tracts in Colorado (out of 1249) that were identified as hotspots. Within these hotspot census tracts, trends in the SDoH, diabetes hosp. rate, and adult smoking prevalence variables can be identified and compared to census tracts that are outside of defined hotspots. Table 4 above compares the socio-demographic characteristics of hotspots to non-hotspots. Comparing these characteristics helps us to contrast and understand the disparities in socio-demographics, diabetes hospitalization rates, and adult smoking prevalence within communities known to have higher AMI hospitalizaton rates. It is evident that the hotspot population is defined by lower socio-demographic characteristics, higher diabetes hospitalization rates, and higher adult smoking prevalence. Acute Myocardial Infarction Study Summary March 2014 page 8

:: Geographically Weighted Regression Geographically Weighted Regression (GWR) is a linear regression model used to develop local coefficients. GWR creates local statistics that generally reduce spatial autocorrelation, increase model fit, increase R2,, and allow parameter estimates to vary geographically. Whereas the OLS model assigned one coefficient for each variable representing its effect on the AMI hospitalization rate across the state, the GWR model assigned local coefficients for each census tract vary based on the weighted values of neighboring census tracts. GWR coefficients are estimated based on a weighting scheme that considers neighboring observations more of an influence than those further away. Similar to a least squares regression, the parameter estimate represented a one unit change in the independent variable that will lead to a corresponding change in the dependent variable. Only those variables that were identified as statistically significant from the OLS Regression model output are examined in the Geographic Weighted Regression model. Geographic Weighted Regression Results: Bandwidth: Up to 743 Neighbors Variable Coefficient Range (02) % of the Population that is of Minority Race/Ethnicity (03) % of the Population Without a High School Degree (25+) (04) % of the Population Unemployed (16+ in Civ. Labor Force) (05) % of the Population that Lives in Urban Areas (08) Age-Adjusted Rate of Diabetes Hospitalizations (09) % of Adults who Currently Smoke Cigarettes -2.507-0.158-1.438-1.786-2.721-5.237-0.163-0.682 0.054-0.10-2.13-9.65 Akaike s Information Criterion (AICc): 14079.76 Adjusted R-Squared: 0.48 *AICc values indicates improved model fit versus OLS Model Spatial Autocorrelation Results: Geographically Weighted Regression: Ordinary Least Squares Regression: Moran s I 0.014375, z-score 2.09, p<0.037066 Moran s I 0.076571, z-score 6.92, p<0.0000 Table 5 The GWR results, shown above in Table 5, indicate that the location of the census tract in Colorado also affects the magnitude association of the variables with age-adjusted AMI hospitalization rates. The GWR model indicated an improved model fit versus the OLS Model as the Adjusted R-Square value improved from 0.45 to 0.48. Although these models both indicate significant clustering of residuals, there is still a reduction in spatial clustering compared to the OLS model. This clustering in the residuals from the GWR indicates that there may be variables not included in the model that account for the spatial autocorrelation in the residuals. GWR coefficients indicated a protective factor for some variables that are known not to be protective of cardiovascular disease and AMI hospitalizations. This is observed above in the coefficients for adult smoking prevalence. Based on what is known about cigarette use as a risk factor for cardiovascular disease, it is unlikely that smoking is protective. As observed in many GWR models, one variable can exhibit such a strong association with the outcome, that other variables in the model will appear to have an effect opposite of what is expected. This is especially the case when the variable with a weaker association may have little presence in a census tract where the variable with a stronger Acute Myocardial Infarction Study Summary March 2014 page 9

:: Mapping the Coefficients from the GWR The maps below depict the range of coefficients for each independent variable included in the GWR model, showing the association between independent variables and the age-adjusted AMI hospitalization rates, adjusted for the other independent variables in the GWR model. Areas of darker color represent a greater association between the variable and AMI hospitalization rate. These maps can be used in targeting specific risk factors for AMI depending on location of a census tract within the state of Colorado. For every one unit change in the variables listed in Table 5, there is a corresponding change in the AMI hospitalization rate, as indicated by the coefficient value. 02 Minority Race/Ethnicity The census tracts or areas with darker color indicate where the minority race/ethnicity percent of population value has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (educational attainment, unemployment, urban/rural, diabetes hospitalizations, and estimated smoking prevalence). 03 Educational Attainment The census tracts or areas with darker color indicate where the percent of the population with a high school degree value has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (minority/ race, unemployment, urban/rural, diabetes hospitalizations, and estimated smoking prevalence). 04 Employment The census tracts or areas with darker color indicate where the percent of the population unemployed value has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (minority/race, educational attainment, urban/rural, diabetes hospitalizations, and estimated smoking prevalence). 02 03 04 Acute Myocardial Infarction Study Summary March 2014 page 10

05 Urban/Rural The census tracts or areas with darker color indicate where the percent of the population living in an urban area value has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (minority/ race, educational attainment, unemployment, diabetes hospitalizations, and estimated smoking prevalence). 08 Diabetes Hospitalizations The census tracts or areas with darker color indicate where the age-adjusted diabetes hospitalization rate has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (minority/race, educational attainment, unemployment, urban/rural, and estimated smoking prevalence). 09 Smoking The census tracts or areas with darker color indicate where the estimated percent of adults who smoke value has a greater association with the age-adjusted AMI hospitalization rate, after adjusting for the other 5 independent variables (minority/ race, educational attainment, unemployment, urban/rural, and diabetes hospitalizations). 08 Acute Myocardial Infarction Study Summary March 2014 page 11

Conclusion The results of this analysis indicate that race, educational attainment, unemployment, and urban/rural population are all significantly associated with the age-adjusted AMI hospitalization rate, along with diabetes hospitalization rate and adult smoking prevalance. Based on the results of the GWR and associated spatial analyses, these results vary in influence across Colorado. The utility of analyzing age-adjusted AMI hospitalization rates at the sub-county geography, using these geospatial statistical methods, is important in that it provides program professionals within our health department information on community-level disparities, and it increased awareness and visualization that not all SDoH and other risk factors (diabetes hospitalizations and smoking) are distributed evenly across the landscape of higher and lower AMI hospitalization rates in Colorado. Furthermore, this analysis and map products helps to increase the visualization and utility of Colorado Hospitalization Data in that a model for tying together health outcome with socio-demographic and risk factor data using geospatial statistical methods tools now exists. These results also indicate spatial clustering of age-adjusted AMI hospitalization rates as well as clustering of variables known to be risk factors for AMI. This data analyses and mapping project demonstrated the ability to analyze and visualize SDoH and health outcome data using accessible methodologies and tools through mainstream GIS software that many individuals, private sector, non-profit, and government agencies have access to. Geospatial methodologies in conjunction with Ordinary Least Squares (OLS) models can provide specific and informative results beyond non-spatial methods for social determinants of health data (SDoH). These results can indicate spatial clustering of health outcomes as well as clustering of variables known to effect health outcomes. This geographic information can provide substantial insight on the magnitude SDoH can have on health outcomes and the way they can interact across geography. When data and data relationships are visualized at the subcounty level, the results can provide a much more compelling story about a health outcome and the populations that are affected. This is partly because finer resolution geography (census tract) can better capture the true variation in SDoH, risk factor, or health outcome. Displaying data at the census tract level captures communities and populations in a way that allows people to engage more with the data because viewers are better able to identify trends within communities and neighborhoods. Maps allow individuals to instantly connect with the distribution, magnitude, and scale of a problem in a way that is more difficult with traditional data dissemination methods. These types of analyses begin to incorporate what all major health organizations have acknowledged for years: that social determinants of health are significant predictors of health outcomes. This project begins to address the top tier constructs in most health equity models in trying to understanding a health outcome by examining the influence of socio-demographics and/or risk factors. These types of data analyses can confirm the idea that where we live and our social/economic status can significantly influence our health behaviors and overall wellbeing, and that our healthy lifestyles and behaviors are linked to much broader ideas related to our environment, culture and social characteristics. Finally, the use of spatial analysis methods can help resolve many of the statistical issues that arise with data exhibiting spatial dependency. The use of GWR allows for spatial variation in the parameter estimates of the model, therefore reducing the spatial dependency in the model. In allowing these results to vary by census tract, we can get more accurate estimates of the variables that account for regional differences and variation in the population. Acute Myocardial Infarction Study Summary March 2014 page 12

Acute Myocardial Infarction Study Summary March 2014 page 13