Simple Spatial Growth Models The Origins of Scaling in Size Distributions

Similar documents
Integrated Modelling of the Spatial Impacts of Climate Change In Greater London

Session 4: Lecture 4: Cellular Automata. One and Two D D Automata and the Beginnings of ABM. Michael Batty

Extending the Focus of Urban Modelling Tools

Infrastructure to Explore Geographic Systems through Models and Maps

A Land Use Transport Model for Greater London:

Cities, AI, Design, & the Future Can Artificial Intelligence Improve Design Intelligence?

City Models to Explore the Impact of New Urban Infrastructures

Scaling in Biology. How do properties of living systems change as their size is varied?

Lecture 7: Cellular Automata Modelling: Principles of Cell Space Simulation

Lecture 7: Cellular Automata Modelling: Principles of Cell Space Simulation

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

How big is the Milky Way? Introduction. by Toby O'Neil. How big is the Milky Way? about Plus support Plus subscribe to Plus terms of use

Power Laws & Rich Get Richer

COSMIC: COmplexity in Spatial dynamic

Experiment 1: The Same or Not The Same?

Lecture: Sampling and Standard Error LECTURE 8 1

Uncertainty. Michael Peters December 27, 2013

Ordinary Least Squares Linear Regression

MA 3280 Lecture 05 - Generalized Echelon Form and Free Variables. Friday, January 31, 2014.

Land Use Transportation Interaction Models

Complex Systems Methods 11. Power laws an indicator of complexity?

Statistical methods research done as science rather than math: Estimates on the boundary in random regressions

Lecture 11: Extrema. Nathan Pflueger. 2 October 2013

Getting Started with Communications Engineering

Multiple Regression Theory 2006 Samuel L. Baker

Where do pseudo-random generators come from?

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Predator - Prey Model Trajectories and the nonlinear conservation law

Data Collection: What Is Sampling?

22 Approximations - the method of least squares (1)

Population growth dynamics in cities, countries and communication systems

appstats27.notebook April 06, 2017

Scaling invariant distributions of firms exit in OECD countries

Part I Electrostatics. 1: Charge and Coulomb s Law July 6, 2008

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

Lecture 09 Combined Effect of Strain, Strain Rate and Temperature

ECNU Writing Workshop, October 28 th, Urban Models: The Decline of Theory, The Rise of Big Data and the Fragmentation of Applications

Land Use Transportation Interaction Models

1 A Science of Cities

Chapter 27 Summary Inferences for Regression

The Fundamentals of Heavy Tails Properties, Emergence, & Identification. Jayakrishnan Nair, Adam Wierman, Bert Zwart

Approximations - the method of least squares (1)

Relativistic Boats: an explanation of special relativity. Brianna Thorpe, Dr. Michael Dugger

Lecture 2: Modelling Histories: Types and Styles:

Lecture 1: Probability Fundamentals

A Science of Cities. Models of Urban Form and Function. Michael

James H. Steiger. Department of Psychology and Human Development Vanderbilt University. Introduction Factors Influencing Power

Lecture 5. 1 Review (Pairwise Independence and Derandomization)

Introduction to Special Relativity

Common ontinuous random variables

CS341 info session is on Thu 3/1 5pm in Gates415. CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Map reading made easy

Diffusion and cellular-level simulation. CS/CME/BioE/Biophys/BMI 279 Nov. 7 and 9, 2017 Ron Dror

A Series Transformations

6.080 / Great Ideas in Theoretical Computer Science Spring 2008

O.K. But what if the chicken didn t have access to a teleporter.

Map reading made easy

3 Non-linearities and Dummy Variables

TheFourierTransformAndItsApplications-Lecture28

Chapter 1. Linear Equations

Math (P)Review Part I:

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Gravity Pre-Lab 1. Why do you need an inclined plane to measure the effects due to gravity?

The fitting of q-laws as replacement for power-law fit: A social science tutorial. Douglas R. White 2006

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: NP-Completeness I Date: 11/13/18

Assignment 7 Solutions

MITOCW ocw f99-lec30_300k

Lecture 15: Exploding and Vanishing Gradients

Maps and differential equations

Math 132. Population Growth: Raleigh and Wake County

Chapter 1 Introduction

Map reading made easy

Model Fitting. Jean Yves Le Boudec

Chemical Applications of Symmetry and Group Theory Prof. Manabendra Chandra Department of Chemistry Indian Institute of Technology, Kanpur.

Our first case consists of those sequences, which are obtained by adding a constant number d to obtain subsequent elements:

Sin, Cos and All That

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi

Simple Linear Regression

The Growth of Functions. A Practical Introduction with as Little Theory as possible

Class 16. Prof J. Kenney October 31, Relativity

Dynamics and Time Integration. Computer Graphics CMU /15-662, Fall 2016

Extreme Value Theory.

Partner s Name: EXPERIMENT MOTION PLOTS & FREE FALL ACCELERATION

MA 3260 Lecture 10 - Boolean Algebras (cont.) Friday, October 19, 2018.

The Generalized Roy Model and Treatment Effects

1 Probabilities. 1.1 Basics 1 PROBABILITIES

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

ARE211, Fall2012. Contents. 2. Linear Algebra (cont) Vector Spaces Spanning, Dimension, Basis Matrices and Rank 8

Lab Exercise 03: Gauss Law

SocViz: Visualization of Facebook Data

Randomness at the root of things 2: Poisson sequences

POL 681 Lecture Notes: Statistical Interactions

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

N H 2 2 NH 3 and 2 NH 3 N H 2

Lecture 22: Quantum computational complexity

Implicit Differentiation Applying Implicit Differentiation Applying Implicit Differentiation Page [1 of 5]

ARE202A, Fall 2005 CONTENTS. 1. Graphical Overview of Optimization Theory (cont) Separating Hyperplanes 1

Business Statistics. Lecture 9: Simple Regression

Confidence intervals

Prelab 7: Sunspots and Solar Rotation

Transcription:

Lectures on Spatial Complexity 17 th 28 th October 2011 Lecture 3: 21 st October 2011 Simple Spatial Growth Models The Origins of Scaling in Size Distributions Michael Batty m.batty@ucl.ac.uk @jmichaelbatty http://www.complexcity.info/ http://www.spatialcomplexity.info/ Outline of Lecture 3 Scaling: Size Distributions: Frequencies, Power Laws and Ranks Simple Models: Generators, Gibrat s Model of Proportionate Effect City Size Distributions Observed: The US and the UK The Idea of the Rank Clock The US, UK, and World Urban Systems Visualizing Size Distributions 1

Scaling: Size Distributions: Frequencies, Power Laws and Ranks Most distributions of objects in social systems where scarce resources and competition are intrinsic to their structure do not follow the usual Gaussian or normal distribution. They follow a power law Frequency of Cities The Short, Fat or Heavy Tail The Long Tail Size of Cities 2

We can transform the frequency to the rank size distribution Cumulative Frequency Distribution Counter Cumulative Frequency Distribution Frequency Distribution Of course such manipulations change the semantics of long, short, fat and heavy tails The mathematics of these distributions is comparatively simple and we assume that the frequency follows a power law: First f ( P) KP And then we have cumulative F ( P) f ( P) dp ~ KP 1 And then we have the counter cumulative r( P) F( P ) P f ( P) dp ~ KP 1 We can simplify the counter cumulative to show that this too follows a power law: then 3

r( P) KP KP 1 1 1 1 1 ( KP ) r( P) P( r) Z r( P) 1 1 1 1 Z r This is called the rank size distribution if 2 then 1 and P( r) Z r 1 more generally the rank P( r) Z r size rule is written as In fact this is clearly an oversimplification and usually the power law only holds in the upper tail. A vast array of evidence suggests that rank size distributions are more likely to be lognormal This one But a lot of work on rank size is still fitting the power law to this upper tail 4

I could say an awful lot more about power laws, their limitations and their advantages they are scaling meaning that they imply all kinds of things about how systems grow and develop which are related to self similarity and fractals. In some senses, their apparent stability is a signature of selforganization and I will return to this. But the key issue here is how we fit them and it is easy to see that if we transform them to a log log scale, then we get straight lines that make them easy to fit. Log population or Log P Log rank or Log r Log population or Log P Log rank or Log r The Strict Rank Size Relation P r log P r P r 1 1 log P K r 1 1 logr The first popular demonstration of this relation was by Zipf in papers published in the 1930s and 1940s. The variable Rank Size Relation which is the more usual is defined as P r K r log log K1 log r P r 5

There are clearly different types of systems such as P 1 log P Growth or decline: pure scaling The number of cities is expanding or contracting and all populations expand or contract log r There are clearly different types of systems such as log P P 1 log r The number of cities is expanding or contracting and top populations are fixed. The number of cities is fixed and all populations are expanding or contracting mixed scaling: Cities expanding or contracting, populations expanding or contracting 6

Simple Models: Generators, Gibrat s Model of Proportionate Effect Now before we continue to look at the evidence for such scaling, then we need to consider how such ranking relations come about. Why is it that the number of large objects is small and the number of small objects is large? On an intuitive level, then this is because of competition but why is the relationships so regular why is it not more discontinuous or irregular with changes in slope? To an extent, this is because of the repeated and regular action of competition we will assume some of this is random and some of structured according to proportionate effects. But before we develop this model, let us look at a simple way of generating city sizes. Now if we generate a new small city at each time period and we let the cities grow exponentially say, then we only ever have one small city, one next big city and so on. We don t have a rank size hierarchy and our frequency distribution is flat we have one city of each size. We thus need a mechanism that grows more small cities than big and one way of doing this is to assume that each time a cities spawn new ones. A simple model is based on the idea that we start with a single city that then spawns another city at the next time period. So we start with 1 and then we have 2. At the next time period, the original city spawns a new city and the first new city spawns another, so we have 4 cities in all. Then with 4 cities, we spawn 4 more at the next time period making 8. 7

8 leads to 16, then to 32, and then 64 and so in the progression that can be modeled as numberofcities 2 t Where t is time, so the sequence goes as 2 0 =1 which is the first given city, 2 1 =2 the first spawned city at time t=2, 2 2 =4, 2 3 =8, 2 4 =16, 2 5 =32 and so on up through time. Now at each stage the city that already exists is not only spawning but it is also growing, and if it grows exponentially or nonlinearly, then it if we add all the cities together we will get classic exponential growth. What we hope is that this process leads to a rank size rule or power law. In fact it doesn t as we will now show and thus was have to relax our model. At each time, the cities grow as follows. In time 1, the start cities grows according to the rate and the new city is fixed as 1 unit. If the growth rate is say 1.5 (very large), then the city grows as 1.5*1. In the second time period, this start city grows as (1.5 2 )*1, in the next as (1.5 3 )*1 and so on. By the time we get to the 10 th time period, this start city is as large as (1.5 10 ) = 985, by 25 th period, it is 33,537,598, which is fact is more than exponential growth. It is super or hyper exponential. We have worked this out for this simple model and what we get is as follows: the frequency is exponential not power and thus this model gives massive weight to the biggest units. Moreover the model does not allow for cities to go up or down in size and there are no deaths in the model. 8

Population Population Population 20000 10000 0 0 5 10 15 20 25 30 Rank 1E+14 1E+07 1 0 5 10 15 20 25 30 Rank 1E+14 1E+07 1 1 10 100 Rank We will show total population of the start city, then all cities, up to time 25 and then the population rank relation, then the log pop rank, then the log pop log rank which shows that the relation is strictly an exponential not power law. Clearly we can derived this from the process that we followed if we were to write the model in terms of its formula but we won t do this here. A much weaker but more acceptable model is not based on spawning cities which is rather an unrealistic hypothesis but assuming some sort of urban soup where cities rise and fall quite independently. Imagine a landscape where at every point a city might grow (or decline if it is already established). The best way to think of this to begin with is if to assume every point is a tiny city of 1 unit size a uniform distribution. Or we could assume that they are all randomly distributed in terms of size. The uniform hypothesis is the best. If we assume that the growth rate of a city is proportionate to its size but that this rate is randomly chosen, then a city of P population at time t becomes P t+1 at time t+1 from P random( ) In fact we might write this more clearly as t 1 P t t 1 ( t P ) P t 9

Where is the growth rate and t is the random effect. What happens to this process if we continue is that it is increasingly unlikely that a city gets big. The chances of the growth rate being large at every time period for any single city, decreases in time. In the same way, the chances of a city becoming very small are increasingly small but if we assume that cities cannot become negative if they approach a population of zero, then this means that they either disappear or we do not keep them at a minimum level of 1 unit say. In short, what this model is, is a random walk with a barrier or threshold at a small number. The process is artificially constrained from the negative quadrant and thus competition is limited at the bottom of the pile. Negative values have no meaning as in the real world of cities In fact I will show you a little movie of this process I don t have the program handy but the movie basically makes the point that if you run this for long enough then what happens is that a rank size distribution a power law of sorts emerges. Let me show you. 10

City Size Distributions Observed: The US and the UK Ok let us move onto some empirical evidence this has been a long digression and we need to take stock Parameter/Statistic 1970 1980 1990 2000 R Square 0.979 0.972 0.973 0.969 Intercept 16.790 16.891 17.090 17.360 Zipf Exponent 0.986 0.982 0.995 1.014 I have begun to imply something about the dynamics. When we plot the log log distribution for different periods of time, there is remarkable stability quite remarkable. Here again is the US system from Zipf with a reworking to 2000 This remarkable stability, however is only skindeep as we will demonstrate here 7 6.5 6 5.5 5 4.5 4 3.5 0 0.5 1 1.5 2 Year 1790 1800 1810 1820 1830 1840 1850 1860 1870 1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 r- square d 0.975 0.968 0.989 0.983 0.990 0.991 0.989 0.994 0.992 0.992 0.992 0.994 0.991 0.995 0.995 0.994 0.990 0.985 0.980 0.986 0.987 0.988 expon ent 0.876 0.869 0.909 0.904 0.899 0.894 0.917 0.990 0.978 0.983 0.951 0.946 0.912 0.908 0.903 0.907 0.900 0.838 0.808 0.769 0.744 0.737 11

So if there is stability of this kind at the aggregate level and such volatility at the micro level, how do we understand it. The first thing is to visualise it and the rest of this talk is about such visualisation There are two devices we have developed: first the Rank Space i.e. movement in terms of the position of cities on the original rank size or Zipf plot, and secondly a Rank Clock as we will show But there is also first the rank shift that we have seen. This is merely plotting a distribution of rank size for a particular year using the ranks at a previous or later year. Here is the US from 1940 to 2000 with switches in rank order when we plot population of the 1901 cities with their 1991 ranks. 20 16 12 8 4 0 0 3 6 9 Now for the UK urban system from 1901 to 1991 but the other way around 12

-1 0 0.5 1 1.5 2 2.5 3-1.5 Log of Population Shares -2-2.5-3 -3.5 1991 1991 Population based on 1901 Ranks 1901-4 -4.5 Log of Rank The Idea of the Rank Clock (and Other Visual Mnemonics) New York Houston, TX Richmond, VA Log (Population) From George Kingsley Zipf (1949) Human Behavior and the Principle of Least Effort (Addison Wesley, Cambridge, MA) Log (Rank) 13

The idea of the rank clock is to discard population because rank is its equivalent and just plot changes in rank with respect to time We can do this in rank space but it is not as effective as in polar coordinate space Here is an example of the possible trajectories of cities on the clock and we will stick with our US cities example for a bit longer, showing various possible plots and animations These clocks are useful for small numbers of cities but as we get more and more cities they become confused and we need to animate them in various ways as we will show below. The Idea of a Rank Clock rank is from number 1 at centre to 100 at edge and time goes in years in the usual clockwise direction 14

The US, UK, and World Urban Systems The morphology of the clock should tell us something i.e. the increase in cities, the volatility of ranks and so on. The rudimentary software for this in on our web site at http://www.casa.ucl.ac.uk/software/rank.asp And here is an animation of the US clock.. 15

10000000 Log City Size 1000000 New York City Houston 100000 Boston Philadelphia Charleston Los Angeles 10000 Baltimore Richmond VA Norfolk VA Chicago 1000 1 Log Rank 10 100 Another Example: The UK from 1901 to 2001 16

Very Long Term Dynamics: From 430 BCE From the Chandler data set, for the top 50 cities we have 17

Distances 18

Visualizing Size Distributions We are currently producing web and desktop animations of these ideas and to conclude I will show and point to two areas of work that we are developing. First we have a rank clock visualiser web page which is at http://orca.casa.ucl.ac.uk/~ollie/rankclocks/ And we can link the dynamics of rank size to population distributions which are mapped spatially. Demo populations in 33 Greater London Boroughs from 1801 19

Demo High buildings in London from 1950 to 2015 Demo Tube Exit Volumes from 2003 to 2009 20

Demo Fortune 500 from 1955 to 2010 for a sample of fast rising companies Here is the typical user interface in Processing 21

Couple of Key references on this work but I will post more in the bibliography and refer you to material other than my own if you click on these it takes you to my Science of Cities blog where the t papers sit http://www.complexcity.info/files/2011/06/batty-complexity-2010.pdf http://www.complexcity.info/files/2011/06/batty-nature-2006.pdf The blog will have more and more references as the course continues Questions www.complexity.info www.spatialcomplexity.info 22