Diversity and Team Science

Size: px
Start display at page:

Download "Diversity and Team Science"

Transcription

1 Diversity and Team Science Scott E Page Santa Fe Institute University of Michigan

2 Outline

3 A Great Big Complex World Diversity and Prediction Diversity Prediction Theorem Model Diversity Theorem Categorical Diversity Theorem Big Science

4 A Great Big (Complex) World

5

6

7

8 Authors Per Paper: Computer Science

9 Identity and Cognitive Diversity

10 Authors Per Paper: Computer Science Courtesy: Jacob Foster

11 ``Few if any funding programs support research on the effectiveness of science teams and larger groups.

12 Conclusions 1&2 #1: Team composition matters. Diversity is critical #2: Team composition systematic

13 Prediction

14

15 All Models Are Wrong - George Box

16 Hence, our truth is the intersection of independent lies. - Richard Levins

17

18

19

20 1197

21 Average = 1,

22 Diversity Prediction Theorem

23 Diversity Prediction Theorem Crowd Error = Average Error - Diversity

24 Crowd Error = Average Error - Diversity

25 Crowd Error = Average Error Diversity 0.6 = 2,

26 ctual Number 103 lass Prediction Diversity Prediction Theorem Crowd = Average Diversity 1086 = Number Finnish Athletes

27 ctual Number 103 lass Prediction 65.6 Diversity Prediction Theorem Crowd = Average Diversity 1401 = Number Finnish Athletes

28 Actual Number 319 Class Prediction Latvian Cars Per 1000 Diversity Prediction Theorem Crowd = Average Diversity 0.1 = 202, ,323.5

29 Actual Number 319 Class Prediction 369 Latvian Cars Per 1000 Diversity Prediction Theorem Crowd = Average Diversity 2542 = 89,743 87,201

30 Christina Romer

31

32 Model Diversity Theorem

33 N Models Distribution across models: (P 1, P 2, P 3 P n ) P j = probability someone uses model j F = probability two individuals have the same model

34 Probability two people use the same model. Match(P) = (P 1 ) 2 + (P 2 ) 2 + (P n ) 2

35 Diversity Index: Δ = 1/Match(P) Δ = Effective number of parties (political science) firms (economics) species (ecology)

36 Diversity Index Δ = (p p p 32 ) -1 (1/3) 2 + (1/3) 2 + (1/3) 2 = 3(1/9) = 1/3 Δ = 3 (1/2) 2 + (1/4) 2 + (1/4) 2 = 6/16 Δ = 2.66

37 Model Diversity Theorem Claim: N independent predictive models with a diversity index of Δ and an average variance of V, have an expected squared error equal to: V/Δ Economo, Hong, and Page (2015)

38 Size DIVERSITY (Δ) Matters

39 Goncola Abecasis

40

41 Categorical Diversity Theorem

42 Partition the set of possible instances into categories Make a prediction for each category

43 Computer Science PAC Learning: Valient Robust Classification: Provost and Foster Ensemble Learning: Desarthy and Sheela

44 Categorical Predictive Models Partition the set of possible instances into categories Make a prediction for each category

45 Chloride in Water: mg/l A B C D

46 Chloride in Water: mg/l A B C D

47 Variation in Data: 20,000 20,000 = ( ) 2 +( ) 2 +( ) 2 +( ) 2

48 Variation in Data: 20,000 20,000 = ( ) 2 +( ) 2 +( ) 2 +( ) 2 Residual Variation: 12,000 20,000 = ( ) 2 +( ) 2 +( ) 2 +( ) 2

49 Variation in Data: 20,000 20,000 = ( ) 2 +( ) 2 +( ) 2 +( ) 2 Residual Variation: 11,000 20,000 = ( ) 2 +( ) 2 +( ) 2 +( ) 2 R 2 : 0.45

50 Total Variation (20,000)

51 Chloride in Water: mg/l A B C D

52 Categorization Loss: 10,000 Mean of Category 1: 250 Categorization Loss 1 : = ( ) 2 +( ) 2 Mean of Category 2: 150 Categorization Loss 2 : = ( ) 2 +( ) 2

53 Possible Variation Explained 10,000 Categorization Categorization Error Loss 10,000

54 Chloride in Water: mg/l A B C D

55 Prediction Error: 1000= Prediction Category 1: 240 Actual Value: 250 Prediction Error: 200 = ( ) 2 +( ) 2 Prediction Category 2: 170 Actual Value: 150 Prediction Error: 800 = ( ) 2 +( ) 2

56 Variation Categorization Explained Categorization Error Loss 9,000 10,000 Prediction Error 1000

57 Categorical Diversity Theorem Variation = Explained + Category Loss + Predictive Error

58 Value of Distinct Categories Distinct Categories result in diverse categorization losses and by the Diversity Prediction Theorem lower error.

59

60

61 Six years of data Half million users 17,700 movies Data divided into (training, testing) Testing Data dived into (probe, quiz, test)

62 Singular Value Decomposition Each movie represented by a vector: (p 1,p 2,p 3,p 4 p n ) Each person represented by a vector: (q 1,q 2,q 3,q 4 q n )

63 Robert Bell Christina and David Rom

64 BellKor 50 dimensions 107 models Best Model: 6.8%

65 BellKor 50 dimensions 107 models Best Model: 6.8% Combination of Models: 8.4%

66 BellKor s Pragmatic Chaos Best Model 8.4% Ensemble: 10.1%

67 Enter ``The Ensemble 23 Teams 30 Countries

68 And the Winner is Ensemble 10.06% Bellkor 10.06%

69 But, the Real Winner is Ensemble 10.06% Bellkor 10.06% Combination 10.19%

70 Combine accurate and diverse models to make good predictions.

71 Big Science

72 Leaving Our Silos

73 Medicine Sociology Chemistry Economics

74 Freeman and Huang Citations Impact Factor # Addresses + + # References + + # Past papers + + Homophily - -

75 0.14% 0.12% Team Papers > 100 Cites 0.10% 0.08% 0.06% Team 0.04% 0.02% Solo Solo 0.00% Science & Engineering Social Science

76 Inter-University Collaboration Increases Impact Jones B, Wuchty S, Uzzi B (2008) Multi-University Research Teams: Shifting Impact, Geography, and Stratification in Science. Science 322: 1259

77 Cummings, J. N., Kiesler, S., Zadeh, R., & Balakrishnan, A. (2013). Group heterogeneity increases the risks of large group size: A longitudinal study of productivity in research groups. Psychological Science, 24(6),

78 Cummings, J. N., Kiesler, S., Zadeh, R., & Balakrishnan, A. (2013). Group heterogeneity increases the risks of large group size: A longitudinal study of productivity in research groups. Psychological Science, 24(6),

79 Cummings, J. N., Kiesler, S., Zadeh, R., & Balakrishnan, A. (2013). Group heterogeneity increases the risks of large group size: A longitudinal study of productivity in research groups. Psychological Science, 24(6),

80 Best patents have low proximity Best papers have low proximity

81

82 Atypical Connections Melissa Schilling Variable Odds Ratio Years since PHD 1.14 Prior Cites 2.25 Author Count 0.8 Depth (HHI) 3.29 Atypical Connect ``Recombinant search and breakthrough idea generation: An analysis of high impact papers in the social sciences Melissa A. Schilling and Elad Green Research Policy, 2011, vol. 40, issue 10, pages

83 Q?

Collective Intelligence

Collective Intelligence Collective Intelligence Collective Intelligence Prediction A Tale of Two Models Lu Hong and Scott Page Interpreted and Generated Signals Journal of Economic Theory, 2009 Generated Signals Interpreted

More information

Scott E Page. University of Michigan Santa Fe Institute. Leveraging Diversity

Scott E Page. University of Michigan Santa Fe Institute. Leveraging Diversity Scott E Page University of Michigan Santa Fe Institute Leveraging Diversity Outline Big Challenges Kiddie Pools Cattle and Netflix Problem Solving Poverty Climate Change Energy Health Care Disease/Epidemics

More information

Visualisation of knowledge domains in interdisciplinary research organizations

Visualisation of knowledge domains in interdisciplinary research organizations Neversdorf, September 2010 Visualisation of knowledge domains in interdisciplinary research organizations Ismael Rafols 1,2 1 SPRU -- Science and Technology Policy Research University of Sussex, Brighton,

More information

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication, STATISTICS IN TRANSITION-new series, August 2011 223 STATISTICS IN TRANSITION-new series, August 2011 Vol. 12, No. 1, pp. 223 230 BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition,

More information

Scientometrics as network science The hidden face of a misperceived research field. Sándor Soós, PhD

Scientometrics as network science The hidden face of a misperceived research field. Sándor Soós, PhD Scientometrics as network science The hidden face of a misperceived research field Sándor Soós, PhD soossand@konyvtar.mta.hu Public understanding of scientometrics Three common misperceptions: Scientometrics

More information

Sharing The Ganges: The Politics And Technology Of River Development By Ben Crow READ ONLINE

Sharing The Ganges: The Politics And Technology Of River Development By Ben Crow READ ONLINE Sharing The Ganges: The Politics And Technology Of River Development By Ben Crow READ ONLINE If you are searching for the ebook Sharing the Ganges: The Politics and Technology of River Development by Ben

More information

A spatial literacy initiative for undergraduate education at UCSB

A spatial literacy initiative for undergraduate education at UCSB A spatial literacy initiative for undergraduate education at UCSB Mike Goodchild & Don Janelle Department of Geography / spatial@ucsb University of California, Santa Barbara ThinkSpatial Brown bag forum

More information

Conurbano s Atlas MALENA HOPP M A L H OO.COM. AR

Conurbano s Atlas MALENA HOPP M A L H OO.COM. AR Conurbano s Atlas MALENA HOPP M A L ENAHOPP@YA H OO.COM. AR R ESEARCH GRO UP IN LIVING CONDITIONS IN BUENOS A I R ES CONURBANO. UNIVERSI DAD N ACIONAL DE AV E L L A N E DA Conurbano, an heterogeneous region

More information

Lesson Quizzes (Individual assessments for each lesson) Social Studies Alive! Our Community and Beyond Lesson 1-6

Lesson Quizzes (Individual assessments for each lesson) Social Studies Alive! Our Community and Beyond Lesson 1-6 7.1 Basic Geographic Literacy 7.1.3.A-Identigy how basic geographic tools are used to organize and interpret information about people, places and environment. 7.3 Human Characteristics of Places and Regions

More information

UVA CS 4501: Machine Learning

UVA CS 4501: Machine Learning UVA CS 4501: Machine Learning Lecture 21: Decision Tree / Random Forest / Ensemble Dr. Yanjun Qi University of Virginia Department of Computer Science Where are we? è Five major sections of this course

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Using SVD to Recommend Movies

Using SVD to Recommend Movies Michael Percy University of California, Santa Cruz Last update: December 12, 2009 Last update: December 12, 2009 1 / Outline 1 Introduction 2 Singular Value Decomposition 3 Experiments 4 Conclusion Last

More information

Applications of GIS in Health Research. West Nile virus

Applications of GIS in Health Research. West Nile virus Applications of GIS in Health Research West Nile virus Outline Part 1. Applications of GIS in Health research or spatial epidemiology Disease Mapping Cluster Detection Spatial Exposure Assessment Assessment

More information

A Response to Rodrik s Geography Critique Incomplete and subject to revision as of March 6, 2001 The idea of using instrumental variables from the

A Response to Rodrik s Geography Critique Incomplete and subject to revision as of March 6, 2001 The idea of using instrumental variables from the A Response to Rodrik s Geography Critique Incomplete and subject to revision as of March 6, 2001 The idea of using instrumental variables from the gravity model to isolate the effect of openness on growth

More information

Human Population Dynamics CAPT Embedded Task

Human Population Dynamics CAPT Embedded Task Name Per. Assigned Due Human Population Dynamics CAPT Embedded Task Essential Question: What factors influence population growth? The human population has existed for a little more than 500,000 years.

More information

Overcoming creative disadvantage in rural areas with networked economic development strategies

Overcoming creative disadvantage in rural areas with networked economic development strategies Overcoming creative disadvantage in rural areas with networked economic development strategies Post Doctoral Research Fellow Munk School of Global Affairs University of Toronto Personal and research background

More information

Recommender Systems. Dipanjan Das Language Technologies Institute Carnegie Mellon University. 20 November, 2007

Recommender Systems. Dipanjan Das Language Technologies Institute Carnegie Mellon University. 20 November, 2007 Recommender Systems Dipanjan Das Language Technologies Institute Carnegie Mellon University 20 November, 2007 Today s Outline What are Recommender Systems? Two approaches Content Based Methods Collaborative

More information

Defining KBD and KCities

Defining KBD and KCities Defining KBD and KCities Francisco Javier Carrillo World Capital Institute and Center for Knowledge Systems fjcarrillo@itesm.mx Contents Demarcation and units of analysis Synthetic demarcation Analytic

More information

A Data-Based Assessment of Research-Doctorate Programs in the United States. National Research Council

A Data-Based Assessment of Research-Doctorate Programs in the United States. National Research Council A Data-Based Assessment of Research-Doctorate Programs in the United States National Research Council Initial Analysis for University of California, Davis Brief Report September, 0 Table of Contents About

More information

Binary Principal Component Analysis in the Netflix Collaborative Filtering Task

Binary Principal Component Analysis in the Netflix Collaborative Filtering Task Binary Principal Component Analysis in the Netflix Collaborative Filtering Task László Kozma, Alexander Ilin, Tapani Raiko first.last@tkk.fi Helsinki University of Technology Adaptive Informatics Research

More information

Space Syntax: Architecture and Cities MRes This programme information sheet includes details of the structure and content of the course.

Space Syntax: Architecture and Cities MRes This programme information sheet includes details of the structure and content of the course. Space Syntax: Architecture and Cities MRes 2018-19 This programme information sheet includes details of the structure and content of the course. CONTENTS Overview 3 Structure 4 Content 5 Staff 6 Opportunities

More information

Methodological issues in the development of accessibility measures to services: challenges and possible solutions in the Canadian context

Methodological issues in the development of accessibility measures to services: challenges and possible solutions in the Canadian context Methodological issues in the development of accessibility measures to services: challenges and possible solutions in the Canadian context Alessandro Alasia 1, Frédéric Bédard 2, and Julie Bélanger 1 (1)

More information

Grade 6 Social Studies

Grade 6 Social Studies Grade 6 Social Studies Social Studies Grade(s) 6th Course Overview This course focuses on the five Wisconsin Model Academic standards for Social Studies: Geography, History, Political Science and Citizenship,

More information

7.1 Significance of question: are there laws in S.S.? (Why care?) Possible answers:

7.1 Significance of question: are there laws in S.S.? (Why care?) Possible answers: I. Roberts: There are no laws of the social sciences Social sciences = sciences involving human behaviour (Economics, Psychology, Sociology, Political Science) 7.1 Significance of question: are there laws

More information

Introducing the WISERD Geoportal. WISERD DATA TEAM Dr Robert Berry & Dr Richard Fry, University of Glamorgan Dr Scott Orford, Cardiff University

Introducing the WISERD Geoportal. WISERD DATA TEAM Dr Robert Berry & Dr Richard Fry, University of Glamorgan Dr Scott Orford, Cardiff University Introducing the WISERD Geoportal WISERD DATA TEAM Dr Robert Berry & Dr Richard Fry, University of Glamorgan Dr Scott Orford, Cardiff University WISERD What is WISERD? Wales Institute of Social and Economic

More information

Seymour Centre 2017 Education Program 2071 CURRICULUM LINKS

Seymour Centre 2017 Education Program 2071 CURRICULUM LINKS Suitable for: Stage 5 Stage 6 HSC Subject Links: Seymour Centre 2017 Education Program 2071 CURRICULUM LINKS Science Stage Content Objective Outcomes Stage 5 Earth and Space ES3: People use scientific

More information

The Problem. Sustainability is an abstract concept that cannot be directly measured.

The Problem. Sustainability is an abstract concept that cannot be directly measured. Measurement, Interpretation, and Assessment Applied Ecosystem Services, Inc. (Copyright c 2005 Applied Ecosystem Services, Inc.) The Problem is an abstract concept that cannot be directly measured. There

More information

NATIONAL SCIENCE FOUNDATION GRADUATE RESEARCH FELLOWSHIPS

NATIONAL SCIENCE FOUNDATION GRADUATE RESEARCH FELLOWSHIPS NATIONAL SCIENCE FOUNDATION GRADUATE RESEARCH FELLOWSHIPS Supported Fields of Study GRFP application review panels are organized according to Primary Field of Study, which the applicant must indicate on

More information

Learning representations

Learning representations Learning representations Optimization-Based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_spring16 Carlos Fernandez-Granda 4/11/2016 General problem For a dataset of n signals X := [ x 1 x

More information

PAC Learning Introduction to Machine Learning. Matt Gormley Lecture 14 March 5, 2018

PAC Learning Introduction to Machine Learning. Matt Gormley Lecture 14 March 5, 2018 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University PAC Learning Matt Gormley Lecture 14 March 5, 2018 1 ML Big Picture Learning Paradigms:

More information

+ DEEP. Credentials OLIVIER LA ROCCA

+ DEEP. Credentials OLIVIER LA ROCCA DEEP Credentials OLIVIER LA ROCCA EUROPARTNERS What is Deep? How does it work? Case study AGENDA Contacts Understanding the reality and the nuances of local communities is crucial when it comes to take

More information

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling.

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling. Slide 1 Module 4 Approaches to Sampling Georgia Kayser, PhD Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling Slide 2 Objectives To understand the reasons for sampling populations

More information

Variance Reduction and Ensemble Methods

Variance Reduction and Ensemble Methods Variance Reduction and Ensemble Methods Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Last Time PAC learning Bias/variance tradeoff small hypothesis

More information

What are we like? Population characteristics from UK censuses. Justin Hayes & Richard Wiseman UK Data Service Census Support

What are we like? Population characteristics from UK censuses. Justin Hayes & Richard Wiseman UK Data Service Census Support What are we like? Population characteristics from UK censuses Justin Hayes & Richard Wiseman UK Data Service Census Support Who are we? Richard Wiseman UK Data Service / Jisc Justin Hayes UK Data Service

More information

Eligible Fields of Study

Eligible Fields of Study Eligible Fields of Study NSF Graduate Research Fellowship CHEMISTRY Chemical Catalysis Chemical Measurement and Imaging Chemical Structure, Dynamics, and Mechanism Chemical Synthesis Chemical Theory, Models

More information

ESSENTIAL CONCEPTS AND SKILL SETS OF THE IOWA CORE CURRICULUM

ESSENTIAL CONCEPTS AND SKILL SETS OF THE IOWA CORE CURRICULUM ESSENTIAL CONCEPTS AND SKILL SETS OF THE IOWA CORE CURRICULUM SOCIAL STUDIES PROVIDED BY THE IOWA DEPARTMENT OF EDUCATION INTRODUCTION Social studies is the integrated study of the social sciences and

More information

Agent Based Modeling and Simulation in the Social Sciences. Frank Witmer Computing and Research Services 14 May 2014

Agent Based Modeling and Simulation in the Social Sciences. Frank Witmer Computing and Research Services 14 May 2014 Agent Based Modeling and Simulation in the Social Sciences Frank Witmer Computing and Research Services 14 May 2014 Outline What are agent based models? How do they work? What types of questions are they

More information

Social Exclusion and Digital Disengagement

Social Exclusion and Digital Disengagement Social Exclusion and Digital Disengagement Issues of Policy, Theory and Measurement OxIS Discussion Seminar 2007 (OII) Ellen J. Helsper Bill Dutton Agenda Introduction Bill Dutton Director OII & Principal

More information

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria Community Detection fundamental limits & efficient algorithms Laurent Massoulié, Inria Community Detection From graph of node-to-node interactions, identify groups of similar nodes Example: Graph of US

More information

CSISS Resources for Research and Teaching

CSISS Resources for Research and Teaching CSISS Resources for Research and Teaching Donald G. Janelle Center for Spatially Integrated Social Science University of California, Santa Barbara Montreal 26 July 2003 Workshop on Spatial Analysis for

More information

Regional cooperation: evidence from European cooperative innovation networks

Regional cooperation: evidence from European cooperative innovation networks Regional cooperation: evidence from European cooperative innovation networks Sara Amoroso Alex Coad Nicola Grassano European Commission, JRC, B.3 September 29, 2016 Motivation of the study Regional innovation

More information

Introductory Seminar Ⅰ Humans and Society in the Age of Internationalization. French(Beginner)Ⅱ. French(Basic)Ⅱ. German(Beginner)Ⅱ.

Introductory Seminar Ⅰ Humans and Society in the Age of Internationalization. French(Beginner)Ⅱ. French(Basic)Ⅱ. German(Beginner)Ⅱ. Liberal Arts Education -Compulsory Elective Subjects -Elective Subjects Subject Humankind and Culture SubjectsHumankind Humankind and Society Subjects Nature and the Environment Subjects Liberal Arts Seminars

More information

The Scope and Growth of Spatial Analysis in the Social Sciences

The Scope and Growth of Spatial Analysis in the Social Sciences context. 2 We applied these search terms to six online bibliographic indexes of social science Completed as part of the CSISS literature search initiative on November 18, 2003 The Scope and Growth of Spatial

More information

Bayesian Statistics as an Alternative for Analyzing Data and Testing Hypotheses Benjamin Scheibehenne

Bayesian Statistics as an Alternative for Analyzing Data and Testing Hypotheses Benjamin Scheibehenne Bayesian Statistics as an Alternative for Analyzing Data and Testing Hypotheses Benjamin Scheibehenne http://scheibehenne.de Can Social Norms Increase Towel Reuse? Standard Environmental Message Descriptive

More information

Introduction PCA classic Generative models Beyond and summary. PCA, ICA and beyond

Introduction PCA classic Generative models Beyond and summary. PCA, ICA and beyond PCA, ICA and beyond Summer School on Manifold Learning in Image and Signal Analysis, August 17-21, 2009, Hven Technical University of Denmark (DTU) & University of Copenhagen (KU) August 18, 2009 Motivation

More information

Applying to study geography (or geology) at university. David Richards on behalf of local Universities

Applying to study geography (or geology) at university. David Richards on behalf of local Universities Applying to study geography (or geology) at university David Richards on behalf of local Universities diverse relevant enabling What employers value in new graduates* Working under pressure Oral communication

More information

Outline: Ensemble Learning. Ensemble Learning. The Wisdom of Crowds. The Wisdom of Crowds - Really? Crowd wiser than any individual

Outline: Ensemble Learning. Ensemble Learning. The Wisdom of Crowds. The Wisdom of Crowds - Really? Crowd wiser than any individual Outline: Ensemble Learning We will describe and investigate algorithms to Ensemble Learning Lecture 10, DD2431 Machine Learning A. Maki, J. Sullivan October 2014 train weak classifiers/regressors and how

More information

Click-Through Rate prediction: TOP-5 solution for the Avazu contest

Click-Through Rate prediction: TOP-5 solution for the Avazu contest Click-Through Rate prediction: TOP-5 solution for the Avazu contest Dmitry Efimov Petrovac, Montenegro June 04, 2015 Outline Provided data Likelihood features FTRL-Proximal Batch algorithm Factorization

More information

Background. Adaptive Filters and Machine Learning. Bootstrap. Combining models. Boosting and Bagging. Poltayev Rassulzhan

Background. Adaptive Filters and Machine Learning. Bootstrap. Combining models. Boosting and Bagging. Poltayev Rassulzhan Adaptive Filters and Machine Learning Boosting and Bagging Background Poltayev Rassulzhan rasulzhan@gmail.com Resampling Bootstrap We are using training set and different subsets in order to validate results

More information

Core Courses for Students Who Enrolled Prior to Fall 2018

Core Courses for Students Who Enrolled Prior to Fall 2018 Biostatistics and Applied Data Analysis Students must take one of the following two sequences: Sequence 1 Biostatistics and Data Analysis I (PHP 2507) This course, the first in a year long, two-course

More information

AIS 2016 AIS 2014 AIS 2015

AIS 2016 AIS 2014 AIS 2015 Interne score ASMF journal subject category Economics or Business, Finance on ISI Web of Knowledge TI selected Marketing or OR journal Journal 2013 2014 2015 2016 AVERAGE Advances in Water Resources 1,246

More information

Egypt Public DSS. the right of access to information. Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS]

Egypt Public DSS. the right of access to information. Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS] Egypt Public DSS ì the right of access to information Central Agency for Public Mobilization and Statistics Arab Republic of Egypt Mohamed Ramadan, Ph.D. [R&D Advisor to the president of CAPMAS] Egypt

More information

Conservation Biology and Ecology Option Learning Outcome Indicator Rubric Threshold

Conservation Biology and Ecology Option Learning Outcome Indicator Rubric Threshold Conservation Biology and Ecology Option Learning Outcome Indicator Rubric Threshold Demonstrate effective written and oral communication. WRIT 201 COM 110 or CLS 101US BIOE 455 writing Completion of course

More information

HOUSING VALUES: THE CONTRIBUTION OF PEOPLE -ENVIRONMENT STUDIES FOR HOUSING RESEARCH AND PRACTICE. Roderick J. Lawrence.

HOUSING VALUES: THE CONTRIBUTION OF PEOPLE -ENVIRONMENT STUDIES FOR HOUSING RESEARCH AND PRACTICE. Roderick J. Lawrence. HOUSING VALUES: THE CONTRIBUTION OF PEOPLE -ENVIRONMENT STUDIES FOR HOUSING RESEARCH AND PRACTICE Roderick J. Lawrence roderick.lawrence@unige.ch WH0 S WHO Architectural practice in Sydney IAPS Housing

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

Predictive Discrete Latent Factor Models for large incomplete dyadic data

Predictive Discrete Latent Factor Models for large incomplete dyadic data Predictive Discrete Latent Factor Models for large incomplete dyadic data Deepak Agarwal, Srujana Merugu, Abhishek Agarwal Y! Research MMDS Workshop, Stanford University 6/25/2008 Agenda Motivating applications

More information

Matrix Factorization In Recommender Systems. Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015

Matrix Factorization In Recommender Systems. Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015 Matrix Factorization In Recommender Systems Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015 Table of Contents Background: Recommender Systems (RS) Evolution of Matrix

More information

SOCIAL SCIENCES. WORLD GEOGRAPHY LH Grade(s): 9 Pre-Req: N/A

SOCIAL SCIENCES. WORLD GEOGRAPHY LH Grade(s): 9 Pre-Req: N/A SOCIAL SCIENCES WORLD GEOGRAPHY 21033000 Grade(s): 9 The World Cultural Geography course consists of the following content area strands: American History, World History, Geography, Humanities, Civics and

More information

Chapter 1 Introduction

Chapter 1 Introduction 1 Chapter 1 Introduction Figure 1.1: Westlake Plaza A warm sunny day on a downtown street and plaza, pedestrians pass on the sidewalks, people sit on benches and steps, enjoying a cup of coffee, shoppers

More information

Taught: 2005/6, 2006/7, 2007/8, 2008/9, 2009/10, 2010/11

Taught: 2005/6, 2006/7, 2007/8, 2008/9, 2009/10, 2010/11 Department of Sociology & Anthropology, BGU Course: Social Network Analysis Level and status: 4 credit points M.A. One of the core seminars for the track Organizational Sociology, and an elective seminar

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Center for Orbital Debris Education and Research

Center for Orbital Debris Education and Research Center for Orbital Debris Education and Research SPACE JUNK IS A GLOBAL ISSUE The past 50 years of space exploration and utilization have created an orbiting junkyard of orbital debris. Over 22,000 pieces

More information

STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE)

STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE) STATISTICAL ANALYSIS OF LAW ENFORCEMENT SURVEILLANCE IMPACT ON SAMPLE CONSTRUCTION ZONES IN MISSISSIPPI (Part 1: DESCRIPTIVE) Tulio Sulbaran, Ph.D 1, David Marchman 2 Abstract It is estimated that every

More information

The role of community in urban regeneration: mixed use areas approach in USA

The role of community in urban regeneration: mixed use areas approach in USA The role of community in urban regeneration: mixed use areas approach in USA Outline 1. Thesis 2. Main topics 3. Urban regeneration and community 4. Community involvement methods 5. Policies: EU vs USA

More information

Labour Market Areas in Italy. Sandro Cruciani Istat, Italian National Statistical Institute Directorate for territorial and environmental statistics

Labour Market Areas in Italy. Sandro Cruciani Istat, Italian National Statistical Institute Directorate for territorial and environmental statistics Labour Market Areas in Italy Sandro Cruciani Istat, Italian National Statistical Institute Directorate for territorial and environmental statistics Workshop on Developing European Labour Market Areas Nuremberg,

More information

1 What s Way Out There? The Hubble Ultra Deep Field

1 What s Way Out There? The Hubble Ultra Deep Field ENGAGING IN ASTRONOMICAL INQUIRY 5 1 What s Way Out There? The Hubble Ultra Deep Field Big Idea: The Hubble Space Telescope image Hubble Ultra Deep Field reveals a variety of previously unknown objects

More information

Week 2: MAPS! Intro to Geography Lehman College GEH 101/GEH 501 Fall Source: Keith Miyake. Friday, September 9, 11

Week 2: MAPS! Intro to Geography Lehman College GEH 101/GEH 501 Fall Source:  Keith Miyake. Friday, September 9, 11 1 Week 2: MAPS! Intro to Geography Lehman College GEH 101/GEH 501 Fall 2010 Keith Miyake Source: http://xkcd.com/256/ 2 Term Papers: Intro The purpose of this paper is to help students relate the geographical

More information

Math at Top Speed. Richard A. Tapia. University Professor and Maxfield-Oshman Professor in Engineering, Rice University

Math at Top Speed. Richard A. Tapia. University Professor and Maxfield-Oshman Professor in Engineering, Rice University Math at Top Speed Richard A. Tapia University Professor and Maxfield-Oshman Professor in Engineering, Rice University 2015 RUSMP Fall Networking Conference Today we will talk about mathematics 2 2 Definition

More information

Sampling distributions and the Central Limit. Theorem. 17 October 2016

Sampling distributions and the Central Limit. Theorem. 17 October 2016 distributions and the Johan A. Elkink School of Politics & International Relations University College Dublin 17 October 2016 1 2 3 Outline 1 2 3 (or inductive statistics) concerns drawing conclusions regarding

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Social Geography for Mitigation: Developing Geo-narrative Media to Tell Your Client s Story

Social Geography for Mitigation: Developing Geo-narrative Media to Tell Your Client s Story Social Geography for Mitigation: Developing Geo-narrative Media to Tell Your Client s Story Julie Urbanik, Ph.D. Federal Public Defender Training, KC, MO April 27, 2017 Social Geography? Don t we already

More information

Probability and statistics; Rehearsal for pattern recognition

Probability and statistics; Rehearsal for pattern recognition Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Czech Institute of Informatics, Robotics and Cybernetics 166 36 Prague 6, Jugoslávských

More information

Fast Adaptive Algorithm for Robust Evaluation of Quality of Experience

Fast Adaptive Algorithm for Robust Evaluation of Quality of Experience Fast Adaptive Algorithm for Robust Evaluation of Quality of Experience Qianqian Xu, Ming Yan, Yuan Yao October 2014 1 Motivation Mean Opinion Score vs. Paired Comparisons Crowdsourcing Ranking on Internet

More information

Putting the World in a Box

Putting the World in a Box Putting the World in a Box OBJECTIVES The student will construct models of the particulate level in chemistry and describe the models limitations. The student will use particulate level models to explain

More information

Ensembles. Léon Bottou COS 424 4/8/2010

Ensembles. Léon Bottou COS 424 4/8/2010 Ensembles Léon Bottou COS 424 4/8/2010 Readings T. G. Dietterich (2000) Ensemble Methods in Machine Learning. R. E. Schapire (2003): The Boosting Approach to Machine Learning. Sections 1,2,3,4,6. Léon

More information

Network Analysis and Modeling

Network Analysis and Modeling lecture 0: what are networks and how do we talk about them? 2017 Aaron Clauset 003 052 002 001 Aaron Clauset @aaronclauset Assistant Professor of Computer Science University of Colorado Boulder External

More information

The Normal Distribution. Chapter 6

The Normal Distribution. Chapter 6 + The Normal Distribution Chapter 6 + Applications of the Normal Distribution Section 6-2 + The Standard Normal Distribution and Practical Applications! We can convert any variable that in normally distributed

More information

Innovation in the Measurement of Place: Systematic Social Observation in a Rural Setting

Innovation in the Measurement of Place: Systematic Social Observation in a Rural Setting Innovation in the Measurement of Place: Systematic Social Observation in a Rural Setting Barbara Entwisle Heather Edelblute Brian Frizzelle Phil McDaniel September 21, 2009 Background Hundreds of studies

More information

Large-Margin Thresholded Ensembles for Ordinal Regression

Large-Margin Thresholded Ensembles for Ordinal Regression Large-Margin Thresholded Ensembles for Ordinal Regression Hsuan-Tien Lin (accepted by ALT 06, joint work with Ling Li) Learning Systems Group, Caltech Workshop Talk in MLSS 2006, Taipei, Taiwan, 07/25/2006

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Contemporary Data Collection and Spatial Information Management Techniques to support Good Land Policies

Contemporary Data Collection and Spatial Information Management Techniques to support Good Land Policies Contemporary Data Collection and Spatial Information Management Techniques to support Good Land Policies Ch. Ioannidis Associate Professor FIG Commission 3 Workshop Paris, 25-28 October 2011 Introduction

More information

UC Santa Barbara CSISS Classics

UC Santa Barbara CSISS Classics UC Santa Barbara CSISS Classics Title Georg Simmel, The Sociology of Space. CSISS Classics Permalink https://escholarship.org/uc/item/7s73860q Author Fearon, David Publication Date 2004-01-01 escholarship.org

More information

CountrySTAT REGIONAL BASIC ADMINISTRATOR TRAINING for GCC MEMBER STATES. CountrySTAT STATISTICS COMPONENT

CountrySTAT REGIONAL BASIC ADMINISTRATOR TRAINING for GCC MEMBER STATES. CountrySTAT STATISTICS COMPONENT CountrySTAT REGIONAL BASIC ADMINISTRATOR TRAINING for GCC MEMBER STATES Abu Dhabi, United Arab Emirates (UAE), 26 30 January 2014 CountrySTAT STATISTICS COMPONENT (Concepts, Definitions, Classification,

More information

Spatial Interaction with eigenfunction. estimated with a Poisson estimator. and Poisson spatial interaction modelling

Spatial Interaction with eigenfunction. estimated with a Poisson estimator. and Poisson spatial interaction modelling Appendix A Table A1. Empirical studies analysing the structure of knowledge exchange using spatial interaction modelling Study Dataset Method used Dependent Variable Proximity forms included PONDS et al.

More information

Theory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo!

Theory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo! Theory and Applications of A Repeated Game Playing Algorithm Rob Schapire Princeton University [currently visiting Yahoo! Research] Learning Is (Often) Just a Game some learning problems: learn from training

More information

Micro-Randomized Trials & mhealth. S.A. Murphy NRC 8/2014

Micro-Randomized Trials & mhealth. S.A. Murphy NRC 8/2014 Micro-Randomized Trials & mhealth S.A. Murphy NRC 8/2014 mhealth Goal: Design a Continually Learning Mobile Health Intervention: HeartSteps + Designing a Micro-Randomized Trial 2 Data from wearable devices

More information

Towards international cooperation in the development of Marine Spatial Plans for the North Pacific: economic, social, and environmental dimensions

Towards international cooperation in the development of Marine Spatial Plans for the North Pacific: economic, social, and environmental dimensions Towards international cooperation in the development of Marine Spatial Plans for the North Pacific: economic, social, and environmental dimensions A.V. Sereda, V.B. Lobanov V.I. Il ichev Pacific Oceanological

More information

Information Retrieval

Information Retrieval Introduction to Information CS276: Information and Web Search Christopher Manning and Pandu Nayak Lecture 13: Latent Semantic Indexing Ch. 18 Today s topic Latent Semantic Indexing Term-document matrices

More information

Papers in Evolutionary Economic Geography # Complex Economic Activities Concentrate in Large Cities

Papers in Evolutionary Economic Geography # Complex Economic Activities Concentrate in Large Cities Papers in Evolutionary Economic Geography #18.29 Complex Economic Activities Concentrate in Large Cities Pierre-Alexandre Balland & Cristian Jara-Figueroa & Sergio Petralia & Mathieu Steijn & David Rigby

More information

Lecture 10. Variance and standard deviation

Lecture 10. Variance and standard deviation 18.440: Lecture 10 Variance and standard deviation Scott Sheffield MIT 1 Outline Defining variance Examples Properties Decomposition trick 2 Outline Defining variance Examples Properties Decomposition

More information

Recommender systems, matrix factorization, variable selection and social graph data

Recommender systems, matrix factorization, variable selection and social graph data Recommender systems, matrix factorization, variable selection and social graph data Julien Delporte & Stéphane Canu stephane.canu@litislab.eu StatLearn, april 205, Grenoble Road map Model selection for

More information

Spatial Modeling, Regional Science, Arthur Getis Emeritus, San Diego State University March 1, 2016

Spatial Modeling, Regional Science, Arthur Getis Emeritus, San Diego State University March 1, 2016 Spatial Modeling, Regional Science, and UCSB Arthur Getis Emeritus, San Diego State University March 1, 2016 My Link to UCSB The 1980s at UCSB (summers and sabbatical) Problems within Geography: The Quantitative

More information

Grade 5 Social Studies Curriculum Map Mrs. Ward. Standards. Standards. # Course Outcomes Course Outcome

Grade 5 Social Studies Curriculum Map Mrs. Ward. Standards. Standards. # Course Outcomes Course Outcome Grade 5 Social Studies Curriculum Map 2015 2016 Mrs. Ward # Course Outcomes Course Outcome Standards Standards 1 SWBAT construct timelines, tables, charts, maps and graph to represent and interpret historical

More information

Are knowledge flows all Alike? Evidence from EU regions (preliminary results)

Are knowledge flows all Alike? Evidence from EU regions (preliminary results) Are knowledge flows all Alike? Evidence from EU regions (preliminary results) Francesco Quatraro and Stefano Usai University of Nice Sophia Antipolis, GREDEG-CNRS and BRICK, Collegio Carlo Alberto University

More information

Globally Estimating the Population Characteristics of Small Geographic Areas. Tom Fitzwater

Globally Estimating the Population Characteristics of Small Geographic Areas. Tom Fitzwater Globally Estimating the Population Characteristics of Small Geographic Areas Tom Fitzwater U.S. Census Bureau Population Division What we know 2 Where do people live? Difficult to measure and quantify.

More information

George Mason University February 2009

George Mason University February 2009 Cheryl Druehl George Mason University February 2009 Open Innovation Chesborough's (2003) book Open Innovation identifies open innovation as distinct from closed innovation. Closed takes place entirely

More information

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute

Module 4 Approaches to Sampling. Georgia Kayser, PhD The Water Institute Module 4 Approaches to Sampling Georgia Kayser, PhD 2014 The Water Institute Objectives To understand the reasons for sampling populations To understand the basic questions and issues in selecting a sample.

More information

Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use

Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use Rural Pennsylvania: Where Is It Anyway? A Compendium of the Definitions of Rural and Rationale for Their Use Pennsylvania Office of Rural Health 310 Nursing Sciences Building University Park, PA 16802

More information

Writing Patent Specifications

Writing Patent Specifications Writing Patent Specifications Japan Patent Office Asia-Pacific Industrial Property Center, JIPII 2013 Collaborator: Shoji HADATE, Patent Attorney, Intellectual Property Office NEXPAT CONTENTS Page 1. Patent

More information

!""""""""""""""""""""""""""""""""""""""""""""""""""#$%&'()*+",&-./'&%"0*".1&"2/*"

!#$%&'()*+,&-./'&%0*.1&2/* ENGAGING IN ASTRONOMICAL INQUIRY 89 """"""""""""""""""""""""""""""""""""""""""""""""""#$%&'()*+",&-./'&%"0*".1&"2/*" Big Idea: The Sun has surface features in different wavelengths and those features have

More information