Administrative Data Research Facility Linked HMDA and ACS Database

Similar documents
The Church Demographic Specialists

AFFH-T User Guide September 2017 AFFH-T User Guide U.S. Department of Housing and Urban Development

Demographic Data in ArcGIS. Harry J. Moore IV

Medical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta

Speakers: Jeff Price, Federal Transit Administration Linda Young, Center for Neighborhood Technology Sofia Becker, Center for Neighborhood Technology

INSIDE. Metric Descriptions by Topic Area. Data Sources and Methodology by Topic Area. Technical Appendix

Working with Census 2000 Data from MassGIS

Residential Demographic Multipliers

Demographic Data. How to get it and how to use it (with caution) By Amber Keller

Using American Factfinder

Understanding China Census Data with GIS By Shuming Bao and Susan Haynie China Data Center, University of Michigan

Spatial Organization of Data and Data Extraction from Maptitude

ACS Multiyear PUMS Estimates and Usage: User-Created PUMS Files. Nate Ramsey U.S. Department of Education

This report details analyses and methodologies used to examine and visualize the spatial and nonspatial

The econ Planning Suite: CPD Maps and the Con Plan in IDIS for Consortia Grantees Session 1

PALS: Neighborhood Identification, City of Frederick, Maryland. David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu

User Guide. Affirmatively Furthering Fair Housing Data and Mapping Tool. U.S. Department of Housing and Urban Development

Applying Health Outcome Data to Improve Health Equity

Mapping the Route Identifying and Consulting with HS2 s Impacted Populations

Spatiotemporal Analysis of Commuting Patterns in Southern California Using ACS PUMS, CTPP and LODES

A User s Guide to the Federal Statistical Research Data Centers

Census Transportation Planning Products (CTPP)

Jun Tu. Department of Geography and Anthropology Kennesaw State University

Guide to the Data-Driven Planning Toolkit in CPD Maps

Massachusetts Institute of Technology Department of Urban Studies and Planning

What are we like? Population characteristics from UK censuses. Justin Hayes & Richard Wiseman UK Data Service Census Support

Orien Ori t en a t tion a Webi W nar: ebi CPD Maps 2

The World Bank Community Development Program Support Project-Phase III (P144637)

Spatial and Socioeconomic Analysis of Commuting Patterns in Southern California Using LODES, CTPP, and ACS PUMS

APPENDIX C-3 Equitable Target Areas (ETA) Technical Analysis Methodology

Where to Invest Affordable Housing Dollars in Polk County?: A Spatial Analysis of Opportunity Areas

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

How is Your Health? Using SAS Macros, ODS Graphics, and GIS Mapping to Monitor Neighborhood and Small-Area Health Outcomes

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use

Geospatial Analysis of Job-Housing Mismatch Using ArcGIS and Python

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2

Utilizing Data from American FactFinder with TIGER/Line Shapefiles in ArcGIS

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

Understanding and accessing 2011 census aggregate data

Spring. The City of Pittsburgh and Allegheny County. Allegheny Valley Bank of Pittsburgh. NorthWest Savings Bank. First Commonwealth

Follow this and additional works at: Part of the Urban Studies Commons

GIS-Based Analysis of the Commuting Behavior and the Relationship between Commuting and Urban Form

Regression with a Binary Dependent Variable (SW Ch. 9)

Online Robustness Appendix to Endogenous Gentrification and Housing Price Dynamics

BROOKINGS May

Technical Documentation Demostats april 2018

GIS Spatial Statistics for Public Opinion Survey Response Rates

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

The Cost of Transportation : Spatial Analysis of US Fuel Prices

Spatiotemporal Analysis of Commuting Patterns: Using ArcGIS and Big Data

A Comprehensive Method for Identifying Optimal Areas for Supermarket Development. TRF Policy Solutions April 28, 2011

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

Data Matrix User Guide

An Introduction to China and US Map Library. Shuming Bao Spatial Data Center & China Data Center University of Michigan

Life, Physical, and Social Science Occupations in Allegheny County

Tracey Farrigan Research Geographer USDA-Economic Research Service

Public Disclosure Copy. Implementation Status & Results Report KH - Livelihood Enhancement and Association of the Poor Project (LEAP) (P153591)

Globally Estimating the Population Characteristics of Small Geographic Areas. Tom Fitzwater

Urban Revival in America

Basic Training Battlemind to Home Symposium. Sept

FHWA Planning Data Resources: Census Data Planning Products (CTPP) HEPGIS Interactive Mapping Portal

2010 Census Data Release and Current Geographic Programs. Michaellyn Garcia Geographer Seattle Regional Census Center

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

CONSTRUCTING THE POVERTY AND OPPORTUNITIES/PUBLIC SERVICES MAPS INFORMATION MANAGEMENT. Background: Brazil Without Extreme Poverty Plan

Land cover modeling in the Travis County

Exercise on Using Census Data UCSB, July 2006

Using GIS to Explore the Relationship between Socioeconomic Status and Demographic Variables and Crime in Pittsburgh, Pennsylvania

DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL

The World Bank TOGO Community Development and Safety Nets Project (P127200)

emerge Network: CERC Survey Survey Sampling Data Preparation

VIDEO: The World In A Box: Geographic Information Systems

emerge Network: CERC Survey Survey Sampling Data Preparation

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Mapping Communities of Opportunity in New Orleans

Classification Exercise UCSB July 2006

Developing Built Environment Indicators for Urban Oregon. Dan Rubado, MPH EPHT Epidemiologist Oregon Public Health Division

Public Disclosure Copy

Analysis of Bank Branches in the Greater Los Angeles Region

The World Bank Earthquake Housing Reconstruction Project (P155969)

2014 Planning Database (PDB)

Gridded Population of the World Version 4 (GPWv4)

Department of Statistics Malaysia

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

Kentucky HFA Performance Data Reporting- Borrower Characteristics

The World Bank Community Development Program Support Project-Phase III (P144637)

Tuesday 6:30 9:30 (First/Last classes) Home Phone: SYLLABUS. I. Focus of Course

The World Bank Community Development Program Support Project-Phase III (P144637)

MIDWEST INFRASTRUCTURE THE FEDERAL RESERVE BANK JUNE Understanding isolation and change in urban neighborhoods: A research symposium

KING FAHD UNIVERSITY OF PETROLEUM & MINERALS

ESRI 2008 Health GIS Conference

Long Island Breast Cancer Study and the GIS-H (Health)

Compact guides GISCO. Geographic information system of the Commission

Integrated Postsecondary Education Data System

GIS ADMINISTRATOR / WEB DEVELOPER EVANSVILLE-VANDERBURGH COUNTY AREA PLAN COMMISSION

Public Disclosure Copy

PAMSIMAS Support Trust Fund (P116236)

How GIS can be used for improvement of literacy and CE programmes

Egypt Public DSS. the right of access to information. Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS]

Mapping Social Vulnerability and Lyme Disease in Vermont

Transcription:

University of Pennsylvania ScholarlyCommons 2017 ADRF Network Research Conference Presentations ADRF Network Research Conference Presentations 11-2017 Administrative Data Research Facility Linked HMDA and ACS Database Jun Zhu Urban Institute Follow this and additional works at: https://repository.upenn.edu/ admindata_conferences_presentations_2017 Zhu, Jun, "Administrative Data Research Facility Linked HMDA and ACS Database" (2017). 2017 ADRF Network Research Conference Presentations. 8. https://repository.upenn.edu/admindata_conferences_presentations_2017/8 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/admindata_conferences_presentations_2017/8 For more information, please contact repository@pobox.upenn.edu.

Administrative Data Research Facility Linked HMDA and ACS Database This presentation is available at ScholarlyCommons: https://repository.upenn.edu/admindata_conferences_presentations_2017/8

Administrative Data Research Facility Linked HMDA and ACS Database Jun Zhu Housing Finance Policy Center Urban Institute

Background Challenges for research: Publicly available government data is messy and hard to aggregate up and link to geographic levels Researchers need to download data and use the installed statistical software to analyze the data Our solution: Create a relational database across geographies by crosswalking data Databases can either be downloaded on the initial web interface or spun up into the spark social science platform for analysis.

Product Details LINKED DATASETS Most of the variables from American Community Survey (ACS) and Home Mortgage Disclosure Act (HMDA) linked by geographies Data Dictionary to define these variables SPARK SOCIAL SCIENCE PLATFORM Push button launch of platform from website Platform includes tutorials, access to linked data and manual WEB INTERFACE Basic, open-source website offering download of data/data dictionary and a way to launch the analytics platform Critical information about the project, including guidance on how to cite linked data

Product Details Linked Datasets For download Public Web Interface Project information, links to publications, links to resources. Data Dictionary For download Spark Platform Button launches platform Tutorials Data

Sloan Website https://adrf.urban.org/ Data can also be analyzed in the Spark for Social Science platform

Spark

Methodology We have linked the two datasets on geographic level. Census Tract 2000 to Census Tract 2010 Census Tract (HMDA) PUMA (ACS) PUMA 2000 to Census Tract 2010 Census Tract 2000 to PUMA 2010 PUMA 2000 to PUMA 2010 Census Tract 2010 to Zipcode 2010 PUMA 2010 to CBSA 2013 PUMA 2010 to County 2014 If the direct crosswalk is not available from the website, we did some manipulations to create a new crosswalk

Geographic Levels Available From 2001 There are 6 databases at different geographic levels.

Data structure HMDA Variables ACS Housing Variables ACS Household Variables ACS Person Variables Agency Supervisory/reg ulatory agency of institution Loan purpose Purchase, refinance, home improvement Loan Amount Mortgage Status If there is a mortgage owned free and clear Bedrooms Number of bedrooms in the house Monthly Mortgage Payment Number of families Number of families in the household Multiple generations Number of generations in the household Household Income Sex Whether the respondent is male or female Grade Attained Grade or level of recent schooling Transportation Time (minutes) In the dataset, we have included both the raw variables from HMDA and ACS, as well as some variables which we constructed

Variable Types Categorical Variables Numerical Variables Categorical- Categorical Categorical- Numerical Variable with codes Mean Percentiles (P10, P25, P50, P75, P90) Standard deviation Variable with codes Mean

Data Dictionary We have a detailed data dictionary available which covers all the data types, variables, and definitions

Sample Use Case What is the black homeownership rate in Atlanta? Original Way: (1) go to IPUMS. (2) find the race/ethnicity variable and ownership variables. (3) find the geographic level variables.

Answer This Question using IPUMS Pick the years. Search for the crosswalk (PUMA 2000 to PUMA 2010) Calculate the total households and homeowners using crosswalk

Or with the ADRF 3 Simple Steps: Use the CBSA database Search for CBSA (12060) for Atlanta Find the cross tab variable: race1_hh_b_ownership_1 and single categorical variable: race1_hh_b Year HH_Black Owner_Black Black homeownership rate 2005 532,991 271,159 50.9% 2006 546,690 288,162 52.7% 2007 567,361 309,523 54.6% 2008 572,848 308,427 53.8% 2009 566,727 298,689 52.7% 2010 602,219 311,240 51.7% 2011 610,884 305,327 50.0% 2012 619,959 299,822 48.4% 2013 631,399 296,475 47.0% 2014 643,395 305,015 47.4% 2015 660,484 303,386 45.9%

Developing a Housing Affordability Index A New housing affordability index How many renters earn as much income as an owner who just purchased a 1-4 unit home in the same area? PA= Likelihood that a renter s income falls in a specific income level A QA= Likelihood that a renter s income is enough to get a mortgage and purchase a home, given his/her specific income level A Renter income: ACS; Borrower income: HMDA

Affordability Index Inputs Using ADRF Income Level P A Q A P A *Q A 1 2.07 0.00 0.00 2 9.15 0.00 0.00 3 8.91 0.78 0.07 4 13.83 2.13 0.29 5 9.51 7.56 0.72 6 8.43 16.86 1.42 16 6.33 87.40 5.53 17 0.00 89.53 0.00 18 2.56 91.47 2.34 19 0.00 93.22 0.00 20 0.00 93.99 0.00 21 0.00 95.54 0.00 22 0.00 100.0 0.00

Affordability Index Using ADRF

Using the database for empirical studies Different geographic level variables can be easily added in the regressions to control the local market conditions. Unemployment rate by race at county level Percentage of minority population at zipcode level Average mortgage borrower income at census tract level Average loan amount at CBSA level

Future Work We hope that the process of doing these crosswalks is helpful to researchers in all fields In the next phase of our project, we plan to include more datasets We are planning to update the most recent 2016 HMDA and ACS data.