Vertical transmission of culture and the distribution of family names

Similar documents
arxiv:nlin/ v1 [nlin.ao] 25 Sep 2000

Synchronizationinanarray of globally coupled maps with delayed interactions

At the Boundary between Biological and Cultural Evolution: The Origin of Surname Distributions

arxiv:nlin/ v1 [nlin.cd] 25 Apr 2001

Ordering periodic spatial structures by non-equilibrium uctuations

Surfing genes. On the fate of neutral mutations in a spreading population

Ensemble equivalence for non-extensive thermostatistics

Potentials of Unbalanced Complex Kinetics Observed in Market Time Series

Simple Model of Evolution. Reed College FAX (505) Abstract. of evolution. The model consists of a two-dimensional world with a

Supplementary information for: Frequency stabilization in nonlinear micro-mechanical oscillators

Scaling invariant distributions of firms exit in OECD countries

Stochastic renormalization group in percolation: I. uctuations and crossover

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

adap-org/ Jan 1994

Ridge analysis of mixture response surfaces

Anti-pathogen genes and the replacement of diseasevectoring mosquito populations: a model-based evaluation of hybrid strategies

Controllingthe spikingactivity in excitable membranes via poisoning

Modelling high-frequency economic time series

Deterministic scale-free networks

Power-law distributions resulting from nite resources

The critical behaviour of the long-range Potts chain from the largest cluster probability distribution

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

An explicit dynamic model of segregation

From time series to superstatistics

General idea. Firms can use competition between agents for. We mainly focus on incentives. 1 incentive and. 2 selection purposes 3 / 101

Class Copy! Return to teacher at the end of class! Mendel's Genetics

Visuomotor tracking on a computer screen an experimental paradigm to study the dynamics of motor control

Cultural Dissemination using a Quantum Model

Aspects of risk assessment in power-law distributed natural hazards

Spreadingof a mercury droplet on thin gold lms

1 An Introduction to Stochastic Population Models

Dynamical Embodiments of Computation in Cognitive Processes James P. Crutcheld Physics Department, University of California, Berkeley, CA a

CARIBBEAN POPULATION AND DEVELOPMENT TRENDS AND INTERRELATIONS: A ASSESSMENT VOLUME 1

CASE STUDY: EXTINCTION OF FAMILY NAMES

Real space renormalization group and totalitarian paradox of majority rule voting

Percolation model for the existence of a mitochondrial Eve

ECONOMETRIC MODELS. The concept of Data Generating Process (DGP) and its relationships with the analysis of specication.

How robust are the predictions of the W-F Model?

Taylor series based nite dierence approximations of higher-degree derivatives

The Cultural Landscape: Introduction to Human Geography Chapter 1 Thinking Geographically Chapter 2 Population

Complexity analysis of job-shop scheduling with deteriorating jobs

Stochastic dominance with imprecise information

Assessing the covariance function in geostatistics

One-dimensional thermoluminescence kinetics

4. Identify one bird that would most likely compete for food with the large tree finch. Support your answer. [1]

A model of capillary equilibrium for the centrifuge technique

Effect of Non Gaussian Noises on the Stochastic Resonance-Like Phenomenon in Gated Traps. Abstract

Exponential Growth and Decay. Lesson #1 of Unit 7. Differential Equations (Textbook 3.8)

Conductivity dependence of the polarization impedance spectra of platinum black electrodes in contact with aqueous NaCl electrolyte solutions

Agent Models and Demographic Research. Robert D. Mare December 7, 2007

The e!ects of the concentration of a polymer dispersant on apparent viscosity and sedimentation behavior of dense slurries

THE CONTROL OF CHAOS: THEORY AND APPLICATIONS

arxiv: v1 [physics.soc-ph] 3 Dec 2009

ESTIMATING SCALING EXPONENTS IN AUDITORY-NERVE SPIKE TRAINS USING FRACTAL MODELS INCORPORATING REFRACTORINESS

Uncertainty quantification of world population growth: A self-similar PDF model

SchroK dinger equation with imaginary potential

CENTRAL OR POLARIZED PATTERNS IN COLLECTIVE ACTIONS

Effects of transport memory and nonlinear damping in a generalized Fisher s equation

arxiv:cond-mat/ v1 11 Jan 2001

PHYSICAL REVIEW LETTERS

Uncertainty in radon measurements with CR39 detector due to unknown deposition of Po

550 XU Hai-Bo, WANG Guang-Rui, and CHEN Shi-Gang Vol. 37 the denition of the domain. The map is a generalization of the standard map for which (J) = J

NGSS Example Bundles. Page 1 of 13

Cellular Automata Models for Diffusion of Innovations

Phase Transition of Finance Chia Cheng Chang

Evolutionary games of condensates in coupled birth-death processes

Akinori Sekiguchi and Yoshihiko Nakamura. Dept. of Mechano-Informatics, University of Tokyo Hongo, Bunkyo-Ku, Tokyo , Japan

INTERNATIONAL CENTRE FOR THEORETICAL PHYSICS

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 18 Feb 2004 Diego Garlaschelli a,b and Maria I. Loffredo b,c

6 Introduction to Population Genetics

Evolving network with different edges

HS.LS1.B: Growth and Development of Organisms

High-conductance states in a mean-eld cortical network model

These outputs can be written in a more convenient form: with y(i) = Hc m (i) n(i) y(i) = (y(i); ; y K (i)) T ; c m (i) = (c m (i); ; c m K(i)) T and n

Limiting distribution for subcritical controlled branching processes with random control function

Dynamics that trigger/inhibit cluster formation in a one-dimensional granular gas

(2) The drawings show stages in the evolution of the human skeleton.

EVOLUTION: BIOLOGY S UNIFYING THEME

Persistence in Random Bond Ising Models of a Socio-Econo Dynamics in High Dimensions. Abstract

1/f noise: a pedagogical review.

6 Introduction to Population Genetics

Signal and noise in financial correlation matrices

A Summary of the Theory of Evolution

OIB GEOGRAPHY SYLLABUS. Theme

2 Nils Andersson and Kostas D. Kokkotas Moreover, the w-mode spectra are qualitatively similar for axial and polar perturbations (for a description of

Competition amongst scientists for publication status: Toward a model of scientific publication and citation distributions

UNIVERSITY OF NOTTINGHAM. Discussion Papers in Economics CONSISTENT FIRM CHOICE AND THE THEORY OF SUPPLY

31/10/2012. Human Evolution. Cytochrome c DNA tree

Theoretical Computer Science. Completing a combinatorial proof of the rigidity of Sturmian words generated by morphisms

Introduction to course: BSCI 462 of BIOL 708 R

Small-world structure of earthquake network

Science Unit Learning Summary

Answers to Problem Set Number 04 for MIT (Spring 2008)

Lagrangian Description for Particle Interpretations of Quantum Mechanics Single-Particle Case

Levels of Ecological Organization. Biotic and Abiotic Factors. Studying Ecology. Chapter 4 Population Ecology

Chapter 4 Population Ecology

Unpublished Opening Lecture for the Course on the Theory of Relativity in Argentina, 1925

AAG CENTER FOR GLOBAL GEOGRAPHY EDUCATION Internationalizing the Teaching and Learning of Geography

THE problem of phase noise and its influence on oscillators

Calculation of maximum entropy densities with application to income distribution

Transcription:

Physica A 295 (2001) 1 8 www.elsevier.com/locate/physa Vertical transmission of culture and the distribution of family names Damian H. Zanette a;, Susanna C. Manrubia b a Consejo Nacional de Investigaciones Cientcas y Tecnicas, Centro Atomico Bariloche and Instituto Balseiro, 8400 Bariloche, Ro Negro, Argentina b Max Planck Institute of Colloids and Interfaces, Theory Division, D-14424 Potsdam, Germany Abstract A stochastic model for the evolution of a growing population is proposed, in order to explain empirical power-law distributions in the frequency of family names as a function of the family size. Preliminary results show that the predicted exponents are in good agreement with real data. The evolution of family-name distributions is discussed in the frame of vertical transmission of cultural features. c 2001 Elsevier Science B.V. All rights reserved. PACS: 87.23.Ge; 05.40. a Keywords: Social dynamics; Random processes; Power-law distributions 1. Introduction The fascinating complexity of social phenomena is increasingly attracting the attention of physicists. We nd in the techniques of Statistical Physics an ideal tool for the study of models of such phenomena, where complex macroscopic behaviour emerges spontaneously as the consequence of relatively simple microscopic dynamical rules. During the last decade, in fact, much work along those lines has been devoted to the study of statistical properties of dynamical processes in economics [1,2]. Other key social processes such as the dynamics of cultural features have received relatively less attention, in spite of the fact that empirical data call for the kind of approach already employed with economical systems. Consider, for instance, the size distribution of large religious groups, shown in Fig. 1. A well dened power-law decay, spanning Corresponding author. Tel.: +54-2944-445173; fax: +54-2944-445299. E-mail addresses: zanette@cab.cnea.gov.ar (D.H. Zanette), manrubia@mpikg-golm.mpg.de (S.C. Manrubia). 0378-4371/01/$ - see front matter c 2001 Elsevier Science B.V. All rights reserved. PII: S 0378-4371(01)00046-2

2 D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 Fig. 1. Frequency of religious groups as a function of the number of adherents, in arbitrary units (source: www.adherents.com). The straight line has slope 5 3 1:67. more than two orders of magnitude, is apparent. These power-law distributions are indeed a main clue to complexity in real and model systems [2]. The spatiotemporal dynamics of culture is driven by geographical dissemination of cultural features and by their transmission from old to new generations. Axelrod [3] has proposed a simple model of culture dissemination that captures its basic mechanisms. Cultural features can spread by interaction between individuals, but some preexistent cultural agreement is necessary for such interaction to take place. These mechanisms are able to explain the maintenance of a certain level of cultural diversity. Meanwhile, vertical culture transmission along the genealogical line, from ancestors to their descendents is governed by the inuence of cultural features in the formation of couples, and by the inuence of each parent s features in determining those of the ospring [4]. Cavalli-Sforza and coworkers have modeled and studied dierent situations of vertical culture transmission, with special emphasis on the eect of stochastic external agents [5]. An extreme case of vertical transmission of a cultural feature, which can be used as a benchmark for models of culture dynamics, is that of family names. An individual s family name is (in most cases, at least) inherited from the father and, therefore, its possible inuence in the formation of the parents couple is irrelevant to its transmission. Moreover, creation and mutation of family names are strongly restricted to specic historical periods and places. Most of the time, such changes are extremely rare. The history of family names is, in fact, quite complex [6]. In Europe, for instance, dierent groups of family names (patronymic-like, toponymic-like, etc.) originated at dierent times typically, during the Middle Ages and mutations became important particularly during the large migration waves within Europe and towards the Americas. New family names appeared also as a consequence of immigration. In spite of this eventful history, current distributions of family names exhibit striking regularities.

D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 3 Fig. 2. Frequency of family names as a function of the family size, in arbitrary units. The United States data is extrapolated from a sample taken during the 1990 census (source: www.census.gov). The Berlin data corresponds to family names beginning by A, taken from the 1996 phonebook. Family sizes in the Berlin data are multiplied by a factor 10 2 for convenience in displaying. The straight lines have slope 2. Fig. 2 shows family-name frequencies as a function of the family size i.e., of the number of individuals bearing a given family name for the United States and a part of Berlin, in recent times. Both data show a well dened power-law dependence, with an exponent close to 2. Analogous data have recently been reported for Japanese family names [7], which exhibit power-law distributions with smaller exponents ( 1:75). In this paper, we consider a model for a growing population where each individual can inherit cultural features from its parents. In particular, we analyze the case of transmission of the family name, and study its distribution as a function of the family size. The parameters relevant to the model are the relative birth rate and the mortality, which control the population growth, and the creation rate of family names. Our preliminary results show that the model satisfactorily reproduces the power laws observed in real data, for wide ranges of the parameters. 2. The model We introduce in the following a variation of the mechanism proposed by Simon [8] to explain the occurrence of power laws in the frequency distribution of words and city sizes (Zipf s law [9]), among other instances. In our model, evolution proceeds by discrete steps. At a given step s, the P(s) individuals in the population are divided into groups the families. Within each group, all the individuals share the same family name. At each step, two mechanisms act. (i) A new individual is introduced in the population, representing a birth event. With probability the newborn is assigned a new family name, not previously present in the population. With the complementary probability, 1, a preexistent individual is chosen at random to become the newborn s

4 D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 father, and its family name is given to the newborn. Thus, a specic family name is assigned with a probability proportional to the corresponding family size. (ii) An individual is chosen at random from the whole population and, with probability, it is eliminated. This represents a death event. Note that if the dead was the only individual with its family name, this specic family name disappears from the population. The evolution of the population is controlled by the parameter which, as we show below, is a direct measure of the mortality rate. The distribution of family names varies due to the eect of family-name creation and mutation, measured by, and of mortality. Since during the evolution the total population P(s) changes, the time interval t(s) to be associated with each evolution step should also change, as t(s) = 1=P(s). The frequency, whose value is in principle arbitrary, xes time units. The variation of the population at each step is, on average, P(s)=1. Consequently, the macroscopic equation for the time evolution of the population reads dp dt P t = (1 )P : (1) Identifying with the birth rate per individual and unit time, the product is the corresponding mortality rate. In average, thus, the population grows exponentially in time. Note that, since an individual s family name is here supposed to be inherited from the father, the model describes the evolution of the male population only. However, the same mechanism can be reinterpreted assuming that the family name is transmitted with the same probability by either parent. In this case, the model encompasses the whole population and no sex distinction occurs. The real situation is in fact intermediate between these two limiting cases. We also stress that in the present model individuals are ageless, in the sense that neither the probability of becoming father of a newborn nor the death probability depend on the individual s age. As a consequence, the probability p(m) that an individual has m children during its whole life is exponential, p(m)=(1 + ) m 1. This is to be compared with the Poissonian probability of real, age-structured populations [10]. Below, we consider a class of initial conditions where the population is divided into N 0 families, with i 0 individuals in each family. We denote such an initial condition as (N 0 ;i 0 ). The corresponding initial population is P(0) = N 0 i 0. 2.1. Simon s model: =0 Neglecting mortality (that is with =0), our system reduces to the model introduced by Simon to explain Zipf s law [8]. In this case, the evolution of the population is deterministic, P(s) = P(0) + s, since exactly one individual is added to the population at each step. Under these conditions, it is possible to write an evolution equation for the average number of families n i (s) with exactly i individuals at step s. We have n i (s +1)=n i (s)+ 1 P(s) [(i 1)n i 1(s) in i (s)] ; (2)

D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 5 Fig. 3. Family-name frequency as a function of the family size, given by Eqs. (2) and (3) with =10 3, at three evolution stages: (a) s =3 10 3, (b) s =10 4, (c) s =3 10 4. For convenience in displaying, the distributions have been normalized. The numbers in brackets give the initial condition for each case (see text). The dotted straight lines have slope 2. for i 1, and n 1 (s +1)=n 1 (s)+ 1 P(s) n 1(s) : (3) Simon has shown that, under fairly general conditions, these equations predict a long-time distribution with a power-law decay n i i 1 1=(1 ) (4) for moderately large values of i (1 i N 0 + s). This power-law distribution is to be ascribed to the stochastic multiplicative nature of family growth, which involves a growth probability proportional to the family size. In the limit 0 the exponent in Eq. (4) equals 2. Note that this limit is relevant to our problem, since the probability of creation or mutation of a family name per individual is expected to be very small. The exponent, in fact, agrees with the empirical data presented in Fig. 2. We point out that transient eects strongly depend on initial conditions. Fig. 3 shows the (normalized) distribution n i (s) calculated from Eqs. (2) and (3) at several evolution stages, for dierent initial conditions and =10 3. For intermediate values of i the development of the power-law decay with exponent close to 2 is apparent in all cases. However, the behaviour of the distribution for larger values of i varies noticeably with the initial condition. Eqs. (2) and (3) imply that the total number of family names in the population, given by N(s)= i n i(s), grows in average as N (s)=n 0 + s. As a function of time, thus, the number of family names increases exponentially, as N(t) =N 0 exp(t), as expected for a population without mortality where family names are created at rate.

6 D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 In contrast, in real populations at present times, the number of family names is known to decrease [6]. 2.2. Eects of mortality: 0 With 0, the growth of the total population P(s) uctuates stochastically, depending on the occurrence of death events at each evolution step. Consequently, a formulation for the average evolution of n i (s) in terms of a deterministic equation of the form of Eqs. (2) and (3) turns out to be inconsistent. These equation can however be adapted in a way suitable for numerical calculation to the case where the population growth is not deterministic, in the following form. First, for a given value of P(s) at step s, the functions in the right-hand side of Eqs. (2) and (3) are applied to n i (s) to obtain intermediate values n i(s). Then, with probability, we calculate 1 n i (s +1)= P(s)+1 [(i +1)n i+1(s) in i(s)] (5) for all i =1; 2;:::. Since in this case both birth and death events have taken place, P(s +1)=P(s). With the complementary probability, 1, weputn i (s +1)=n i(s) for all i, and P(s +1)=P(s)+1. Heuristic arguments not reproduced here indicate that, under the conditions used to derive Eq. (4), the above algorithm should give rise to distributions with a well dened power-law decay for moderately large family sizes, of the form n i i 1 (1 )=(1 ) : (6) Quite remarkably, in the relevant limit 0 the exponent becomes independent of, and reduces again to 2. For suciently low, thus, mortality is not expected to aect the power-law exponent which, as we have seen, is in agreement with empirical data. This has been veried through numerical calculation of n i (s) with the above algorithm, as illustrated in Fig. 4 for the initial condition (1; 1) and three values of. The algorithm combining Eqs. (2), (3), and (5) mixes the deterministic average evolution of n i (s) with the stochastic variation of the population, due to random death events. This combination involves, thus, a statistical approximation which must be tested by means of numerical simulations of the fully stochastic model. Results of such simulations, averaged over 10 4 realizations for each value of, are shown as dots in Fig. 4. We nd very good agreement between both methods. As for the number of dierent family names, N (t), we have found that, for moderate values of and at suciently long times, it increases exponentially. As expected, the growth rate depends on both and. There is however an initial transient during which the evolution is not exponential and, in fact, N(t) can temporarily decrease. Decay of the number of family names for long times seems to be restricted to very high death probability, 1. Note that these are precisely the values expected for in modern developed societies, where birth and death rates are practically identical.

D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 7 Fig. 4. Normalized family-name frequency as a function of the family size calculated from Eqs. (2), (3) and (5) at s =3 10 4, for =10 3 and three values of (a) =0:3, (b) =0:6, (c) =0:9. Dots stand for the results of numerical simulations of the model, averaged over 10 4 realizations. The dotted straight line has slope 2. 3. Discussion The present variant of Simon s model provides a plausible description of a growing population, as far as the assumption of age-independent fertility and mortality is admitted. The numerical resolution of averaged evolution equations and numerical simulations show that our model successfully reproduces the exponent of power-law distributions observed in the frequency of family names as a function of the family size. Specically, the exponent close to 2 found in empirical data for family names from the United States and Berlin is reproduced in the limit of very small creation and mutation rates and a wide variety of mortality rates. For other creation and mutation rates, the predicted exponents are, in absolute value, larger than above [cf. Eqs. (4) and (6)]. This contrasts with the exponents found for modern Japanese family names, close to 1:75 [7]. We argue that this is an eect of transients which, in this case, are still acting. In fact, most Japanese family names are relatively recent, as they appeared some 120 years ago [7]. Curve (c) in Fig. 4, for instance, shows clearly that transient distributions could be assigned smaller spurious power-law exponents. Note, however, that a detailed evaluation of transient eects requires a careful identication of initial conditions which, as a result of the complex history of family names, could be a hard task in any real situation. A quantitative comparison of the predictions of the present model with real data not presented at this preliminary level will require considering populations of several million individuals (cf. Fig. 2). Since extensive numerical simulations of systems of such sizes could become computationally too expensive, it will be useful to analyze in

8 D.H. Zanette, S.C. Manrubia / Physica A 295 (2001) 1 8 detail the scaling properties of our model. In particular, the attention will focus on the dependence of the duration of transients, both in the frequency and in the total number of family names, on the initial population and its distribution in families, as well as on the probabilities and. Considering long-term variations of these probabilities is also in close connection with the comparison of our results with empirical data. In fact, for a modern developed population, in Europe for instance, we can distinguish at least two well dierentiated stages. When most European family names appeared, some centuries ago, the total population was increasing more or less steadily. This stage, thus, corresponds to relatively large values of and moderate values of. In modern times, on the contrary, new family names appear at an extremely low rate in fact, their total number decreases [6] and the total European population is practically constant, so that 0 and 1. The adaptation of the present model to the study of the evolution of other cultural features requires the addition of two main new ingredients. First, a new parameter must be introduced to dene the probability that a given cultural feature is inherited from either parent [4,5]. Second, it is necessary to specify the eect of that feature in the formation of the parents couple, and the mechanism by which couples are eectively formed. This latter process has been classically proposed as an optimization problem [11]. In the frame of our system, it would require a much more realistic approach if any connection with actual populations is to be established. Acknowledgements We thank G. Abramson for his critical reading of the manuscript. References [1] B. Mandelbrot, Fractals and Scaling in Finance, Springer, New York, 1997. [2] J.P. Bouchaud, M. Potters, Theory of Financial Risks: From Statistical Physics to Risk Management, Cambridge University Press, Cambridge, 2000. [3] R. Axelrod, The Complexity of Cooperation, Princeton University Press, Princeton, 1997. [4] L.L. Cavalli-Sforza, M.W. Feldman, K.H. Chen, S.M. Dornbusch, Science 218 (1982) 19 27. [5] L.L. Cavalli-Sforza, M.W. Feldman, Cultural Transmission and Evolution: A Quantitative Approach, Princeton University Press, Princeton, 1981. [6] J.-M. Legay, M. Vernay, Pour la Science 255 (1999) 58 65. [7] S. Miyazima, Y. Lee, T. Nagamine, H. Miyajima, Physica A 278 (2000) 282 288. [8] H.A. Simon, Models of Man, Wiley, New York, 1957. [9] G.K. Zipf, Human Behavior and the Principle of Least Eort, Addison-Wesley, Cambridge, 1949. [10] A.K. Dewdney, Sci. Am. 254 (1986) 12 16. [11] M. Dzierzawa, M.-J. Omero, Physica A 287 (2000) 321 333.