EP219:Data Analysis And Interpretation

Size: px
Start display at page:

Download "EP219:Data Analysis And Interpretation"

Transcription

1 EP219:Data Analysis And Interpretation Report: Week 1 Team Poisson ous 28 July 2017 to 4 August

2 Contents 1 Problem Statement 3 2 Python Program 3 3 Histograms 6 4 Inference 7 5 Team responsibilities 7 6 Website 8 2

3 1 Problem Statement Given the text file Complete TAVG daily.txt that contains the daily temperature data from This file shows the average land temperature fluctuations averaged over the entire earth. Using this data the following analysis has been done. Extract the full data set to a numpy array. To this array add a column which shows the actual temperature on that day. Make a histogram of the temperature distribution for the year Re- peat this for the year Make a histogram of the temperature distribution for the temperature on 1st and 15th day of every month for the year 1912 and Python Program Here is the code for plotting the required histograms. Parts of the code to append extra column to the data imported as a numpy array, has been taken from Prof. Rentala s example code. 1 import pandas a s pd 2 import numpy a s np 3 import m a t p l o t l i b. pyplot as p l t 4 5 #Code to get the number o f commented l i n e s b e f o r e the a c t u a l data s t a r t s 6 D a t a f i l e=open ( Complete TAVG daily. txt ) 7 Ncommentedlines=0 8 while D a t a f i l e. read ( 1 )== % : 9 D a t a f i l e. r e a d l i n e ( ) 10 Ncommentedlines=Ncommentedlines+1 11 D a t a f i l e. c l o s e ( ) #S t o r i n g the data in a dataframe df with no headers and whitespace as d e l i m i t e r s ; the f i r s t Nommentedlines rows are skipped 14 df = pd. r e a d t a b l e ( Complete TAVG daily. txt, delim whitespace= True, header=none, skiprows=ncommentedlines ) #Data i s now s t o r e d in an numpy array 17 rawdatatable=df. v a l u e s 3

4 18 19 #Function i s c r e a t e d that takes any number to number tempfix = lambda t : t #f u n c t i o n i s v e c t o r i z e d, so i t can be a p p l i e d to an array 23 vtempfix = np. v e c t o r i z e ( tempfix ) #Vectorized f u n c t i o n i s a p p l i e d to column corresponding to temperature anamolies 26 newcolumn = vtempfix ( rawdatatable [ :, 5 ] ) #new column i s converted i n t o a 2 dimensional array so that i t can be appended to o r i g i n a l data 29 reshapednewcolumn = newcolumn. reshape ( newcolumn. s i z e, 1 ) #b e t t e r data i s the o r i g i n a l data with an added column o f temperature a x i s=1 i m p l i e s data i s appended columnwise 32 b e t t e r d a t a = np. append ( rawdatatable, reshapednewcolumn, a x i s =1) #cutdata c o n t a i n s only those rows o f b e t t e r d a t a f o r which the year i s i 35 cutdata = lambda i : b e t t e r d a t a [ ( b e t t e r d a t a [:,1]== i ), : ] 36 #A f u n c t i o n i s d e f i n e d to p l o t histograms f o r temperature data 37 def HistTemp ( year, c o l o u r ) : 38 #tempdata c o n t a i n s the temperature data o f each day f o r which the year was year 39 tempdata=cutdata ( year ) [ :, 6 ] #to get the l i m i t s o f histogram we f i n d the minimum and maximum v a l u e s o f temperature 4 i s m u l t i p l i e d and d ivided to get f i r s t bin value to the accuracy o f xmin=i n t (4 min ( tempdata ) ) / xmax=i n t (4 max( tempdata ) ) / #Number o f bins i s c a l c u l a t e d to get the array o f bins 46 Numbins=((xmax xmin ) 4) #binarray i s the array which i s p a s s e s as a parameter to histogram f u n c t i o n 49 binarray=np. l i n s p a c e ( xmin, xmax,num=numbins ) #Histogram i s p l o t t e d by p l t. h i s t f u n c t i o n 52 p l t. h i s t ( tempdata, bins=binarray, f a c e c o l o r=colour, alpha =0.9, rwidth =0.95) 53 4

5 54 #a x i s are l a b e l l e d and t i t l e i s given 55 p l t. t i t l e ( Temperature D i s t r i b u t i o n f o r the Year +s t r ( year ) ) 56 p l t. x l a b e l ( Temperature in C e l s i u s ) 57 p l t. y l a b e l ( Number o f days ) 58 p l t. xlim ( [ xmin, xmax ] ) 59 p l t. x t i c k s ( np. arange ( xmin, xmax, ) ) #p l o t i s d i s p l a y e d 63 p l t. show ( ) #Function i s c a l l e d f o r both the years r e q u i r e d 66 HistTemp (1912, green ) 67 HistTemp (2012, blue ) 68 #Function i s c r e a t e d f o r p r i n t i n g only datas o f two s p e c i f i c dates o f a month, year i s the year, day1 and day2 are the days o f the month r e q u i r e d 69 def HistSpecificTemp ( year, day1, day2, c o l o u r ) : 70 tempdata=cutdata ( year ) 71 r e f d a t a=tempdata [ ( tempdata [:,3]== day1 )+(tempdata [:,3]== day2 ), : ] [ :, 6 ] 72 #to get the l i m i t s o f histogram we f i n d the minimum and maximum v a l u e s o f temperature, 4 i s m u l t i p l i e d and d ivided to get f i r s t bin value to the accuracy o f xmin=i n t (4 min ( r e f d a t a ) ) / xmax=i n t (4 max( r e f d a t a ) ) / #Number o f bins i s c a l c u l a t e d to get the array o f bins 77 Numbins=((xmax xmin ) 4) #binarray i s the array which i s p a s s e s as a parameter to histogram f u n c t i o n 80 binarray=np. l i n s p a c e ( xmin, xmax,num=numbins ) #Histogram i s p l o t t e d by p l t. h i s t f u n c t i o n 83 p l t. h i s t ( refdata, bins=binarray, f a c e c o l o r=colour, alpha =0.9, rwidth =0.95) #a x i s are l a b e l l e d and t i t l e i s given 86 p l t. t i t l e ( Refined Temperature D i s t r i b u t i o n f o r the Year +s t r ( year ) ) 87 p l t. x l a b e l ( Temperature in C e l s i u s ) 88 p l t. y l a b e l ( Number o f days ) 89 p l t. xlim ( [ xmin, xmax ] ) 90 p l t. x t i c k s ( np. arange ( xmin, xmax, ) ) 5

6 91 #p l o t i s d i s p l a y e d 92 p l t. show ( ) #Function i s c a l l e d f o r year 1912 and HistSpecificTemp ( , 1, 15, green ) 96 HistSpecificTemp ( , 1, 15, blue ) #End o f Code 3 Histograms The following are the histograms for the temperature distribution of the year 1912 and 2012 considering the daily temperatures. (a) Year 1912 (b) Year 2012 Figure 1: the temperature distribution considering the daily temperatures Following are the histograms for the temperature distribution of the year 1912 and 2012 considering the temperatures on the 1st and 15th day of every month. 6

7 (a) Year 1912 (b) Year 2012 Figure 2: the temperature distribution considering the temperatures on the 1st and 15th day of every month 4 Inference We expected that the refined temperature histogram, having lesser data points is more likely to miss out the data points that lie on the fringes. Thus we expected the range of the distribution to be smaller. From the Histograms plotted we infer that the spread of the temperature in the Histogram of Temperature distribution for entire year is same as the spread of the Histogram of Temperature distribution for first and the fifteenth day of each month for both years 1912 and Moreover we see the temperature value at which the maxima of the temperature distribution occurs is higher in 2012 than in We infer from this that in fact global warming is a real issue. 5 Team responsibilities Project Leader - Saipriya Satyajit Programmer - Keshav Janyani Web Manager - Ananay Garg Report Writers - Abhisek Sahu and Ashay Telang 7

8 6 Website The link to our website is Team Poisson-ous All the assignments will be uploaded on this site. 8

Numpy. Luis Pedro Coelho. October 22, Programming for Scientists. Luis Pedro Coelho (Programming for Scientists) Numpy October 22, 2012 (1 / 26)

Numpy. Luis Pedro Coelho. October 22, Programming for Scientists. Luis Pedro Coelho (Programming for Scientists) Numpy October 22, 2012 (1 / 26) Numpy Luis Pedro Coelho Programming for Scientists October 22, 2012 Luis Pedro Coelho (Programming for Scientists) Numpy October 22, 2012 (1 / 26) Historical Numeric (1995) Numarray (for large arrays)

More information

CS 237 Fall 2018, Homework 07 Solution

CS 237 Fall 2018, Homework 07 Solution CS 237 Fall 2018, Homework 07 Solution Due date: Thursday November 1st at 11:59 pm (10% off if up to 24 hours late) via Gradescope General Instructions Please complete this notebook by filling in solutions

More information

Solution to running JLA

Solution to running JLA Solution to running JLA Benjamin Audren École Polytechnique Fédérale de Lausanne 08/10/2014 Benjamin Audren (EPFL) CLASS/MP MP runs 08/10/2014 1 / 13 Parameter file data. e x p e r i m e n t s =[ JLA ]

More information

PANDAS FOUNDATIONS. pandas Foundations

PANDAS FOUNDATIONS. pandas Foundations PANDAS FOUNDATIONS pandas Foundations What is pandas? Python library for data analysis High-performance containers for data analysis Data structures with a lot of functionality Meaningful labels Time series

More information

CS 237 Fall 2018, Homework 06 Solution

CS 237 Fall 2018, Homework 06 Solution 0/9/20 hw06.solution CS 237 Fall 20, Homework 06 Solution Due date: Thursday October th at :59 pm (0% off if up to 24 hours late) via Gradescope General Instructions Please complete this notebook by filling

More information

Creative Data Mining

Creative Data Mining Creative Data Mining Using ML algorithms in python Artem Chirkin Dr. Daniel Zünd Danielle Griego Lecture 7 0.04.207 /7 What we will cover today Outline Getting started Explore dataset content Inspect visually

More information

Exercise 5 Release: Due:

Exercise 5 Release: Due: Stochastic Modeling and Simulation Winter 28 Prof. Dr. I. F. Sbalzarini, Dr. Christoph Zechner (MPI-CBG/CSBD TU Dresden, 87 Dresden, Germany Exercise 5 Release: 8..28 Due: 5..28 Question : Variance of

More information

Lectures about Python, useful both for beginners and experts, can be found at (http://scipy-lectures.github.io).

Lectures about Python, useful both for beginners and experts, can be found at  (http://scipy-lectures.github.io). Random Matrix Theory (Sethna, "Entropy, Order Parameters, and Complexity", ex. 1.6, developed with Piet Brouwer) 2016, James Sethna, all rights reserved. This is an ipython notebook. This hints file is

More information

Python Analysis. PHYS 224 September 25/26, 2014

Python Analysis. PHYS 224 September 25/26, 2014 Python Analysis PHYS 224 September 25/26, 2014 Goals Two things to teach in this lecture 1. How to use python to fit data 2. How to interpret what python gives you Some references: http://nbviewer.ipython.org/url/media.usm.maine.edu/~pauln/

More information

Python Analysis. PHYS 224 October 1/2, 2015

Python Analysis. PHYS 224 October 1/2, 2015 Python Analysis PHYS 224 October 1/2, 2015 Goals Two things to teach in this lecture 1. How to use python to fit data 2. How to interpret what python gives you Some references: http://nbviewer.ipython.org/url/media.usm.maine.edu/~pauln/

More information

Introduction to Python

Introduction to Python Introduction to Python Luis Pedro Coelho Institute for Molecular Medicine (Lisbon) Lisbon Machine Learning School II Luis Pedro Coelho (IMM) Introduction to Python Lisbon Machine Learning School II (1

More information

2 Getting Started with Numerical Computations in Python

2 Getting Started with Numerical Computations in Python 1 Documentation and Resources * Download: o Requirements: Python, IPython, Numpy, Scipy, Matplotlib o Windows: google "windows download (Python,IPython,Numpy,Scipy,Matplotlib" o Debian based: sudo apt-get

More information

/Users/jenskremkow/Science/Courses/python-summerschool-berlin/faculty/Day2/examples numpy.py September 2,

/Users/jenskremkow/Science/Courses/python-summerschool-berlin/faculty/Day2/examples numpy.py September 2, /Users/jenskremkow/Science/Courses/python-summerschool-berlin/faculty/Day2/examples numpy.py September 2, 2009 1 Numpy Many of the examples are taken from: http://www.scipy.org/cookbook Building Arrays

More information

Complex Numbers. A complex number z = x + iy can be written in polar coordinates as re i where

Complex Numbers. A complex number z = x + iy can be written in polar coordinates as re i where Lab 20 Complex Numbers Lab Objective: Create visualizations of complex functions. Visually estimate their zeros and poles, and gain intuition about their behavior in the complex plane. Representations

More information

Parametric Model for the LWA-1 Dipole Response as a Function of Frequency

Parametric Model for the LWA-1 Dipole Response as a Function of Frequency Parametric Model for the LWA-1 Dipole Response as a Function of Frequency Jayce Dowell December 20, 2011 LWA Memo #178 Version 2 Contents 1 Introduction 2 2 Methods 2 3 Results 3 4 Application 3 A Document

More information

STAT2201 Assignment 3 Semester 1, 2017 Due 13/4/2017

STAT2201 Assignment 3 Semester 1, 2017 Due 13/4/2017 Class Example 1. Single Sample Descriptive Statistics (a) Summary Statistics and Box-Plots You are working in factory producing hand held bicycle pumps and obtain a sample of 174 bicycle pump weights in

More information

MCP Scrubber Monitoring

MCP Scrubber Monitoring MCP Scrubber Monitoring Hannah Tomio August 18, 2015 1 Introduction The scrubber is a device that bombards microchannel plates (MCP s) with electrons, thus scrubbing them. This is done both to stabilize

More information

Analyzing the Earth Using Remote Sensing

Analyzing the Earth Using Remote Sensing Analyzing the Earth Using Remote Sensing Instructors: Dr. Brian Vant- Hull: Steinman 185, 212-650- 8514 brianvh@ce.ccny.cuny.edu Ms. Hannah Aizenman: NAC 7/311, 212-650- 6295 haizenman@ccny.cuny.edu Dr.

More information

Notater: INF3331. Veronika Heimsbakk December 4, Introduction 3

Notater: INF3331. Veronika Heimsbakk December 4, Introduction 3 Notater: INF3331 Veronika Heimsbakk veronahe@student.matnat.uio.no December 4, 2013 Contents 1 Introduction 3 2 Bash 3 2.1 Variables.............................. 3 2.2 Loops...............................

More information

Exploratory data analysis

Exploratory data analysis Exploratory data analysis November 29, 2017 Dr. Khajonpong Akkarajitsakul Department of Computer Engineering, Faculty of Engineering King Mongkut s University of Technology Thonburi Module III Overview

More information

import pandas as pd d = {"A":[1,2,np.nan], "B":[5,np.nan,np.nan], "C":[1,2,3]} In [9]: Out[8]: {'A': [1, 2, nan], 'B': [5, nan, nan], 'C': [1, 2, 3]}

import pandas as pd d = {A:[1,2,np.nan], B:[5,np.nan,np.nan], C:[1,2,3]} In [9]: Out[8]: {'A': [1, 2, nan], 'B': [5, nan, nan], 'C': [1, 2, 3]} In [4]: import numpy as np In [5]: import pandas as pd In [7]: d = {"":[1,2,np.nan], "B":[5,np.nan,np.nan], "C":[1,2,3]} In [8]: d Out[8]: {'': [1, 2, nan], 'B': [5, nan, nan], 'C': [1, 2, 3]} In [9]:

More information

Learning Deep Broadband Hongjoo LEE

Learning Deep Broadband Hongjoo LEE Learning Deep Broadband Network@HOME Hongjoo LEE Who am I? Machine Learning Engineer Software Engineer Fraud Detection System Software Defect Prediction Email Services (40+ mil. users) High traffic server

More information

Computational Physics HW2

Computational Physics HW2 Computational Physics HW2 Luke Bouma July 27, 2015 1 Plotting experimental data 1.1 Plotting sunspots.txt in Python: The figure above is the output from from numpy import l o a d t x t data = l o a d t

More information

Python & Numpy A tutorial

Python & Numpy A tutorial Python & Numpy A tutorial Devert Alexandre School of Software Engineering of USTC 13 February 2012 Slide 1/38 Table of Contents 1 Why Python & Numpy 2 First steps with Python 3 Fun with lists 4 Quick tour

More information

MATH 250 Homework 4: Due May 4, 2017

MATH 250 Homework 4: Due May 4, 2017 Due May 4, 17 Answer the following questions to the best of your ability. Solutions should be typed. Any plots or graphs should be included with the question (please include the questions in your solutions).

More information

Propensity Score Matching

Propensity Score Matching Propensity Score Matching This notebook illustrates how to do propensity score matching in Python. Original dataset available at: http://biostat.mc.vanderbilt.edu/wiki/main/datasets (http://biostat.mc.vanderbilt.edu/wiki/main/datasets)

More information

Complex Numbers. Visualize complex functions to estimate their zeros and poles.

Complex Numbers. Visualize complex functions to estimate their zeros and poles. Lab 1 Complex Numbers Lab Objective: Visualize complex functions to estimate their zeros and poles. Polar Representation of Complex Numbers Any complex number z = x + iy can be written in polar coordinates

More information

Exponential, Gamma and Normal Distribuions

Exponential, Gamma and Normal Distribuions Exponential, Gamma and Normal Distribuions Sections 5.4, 5.5 & 6.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy Poliak,

More information

Lecture 08: Poisson and More. Lisa Yan July 13, 2018

Lecture 08: Poisson and More. Lisa Yan July 13, 2018 Lecture 08: Poisson and More Lisa Yan July 13, 2018 Announcements PS1: Grades out later today Solutions out after class today PS2 due today PS3 out today (due next Friday 7/20) 2 Midterm announcement Tuesday,

More information

Gradient Descent Methods

Gradient Descent Methods Lab 18 Gradient Descent Methods Lab Objective: Many optimization methods fall under the umbrella of descent algorithms. The idea is to choose an initial guess, identify a direction from this point along

More information

Introduction to Python

Introduction to Python Introduction to Python Luis Pedro Coelho luis@luispedro.org @luispedrocoelho European Molecular Biology Laboratory Lisbon Machine Learning School 2015 Luis Pedro Coelho (@luispedrocoelho) Introduction

More information

Hot spot Analysis with Clustering on NIJ Data Set The Data Set:

Hot spot Analysis with Clustering on NIJ Data Set The Data Set: Hot spot Analysis with Clustering on NIJ Data Set The Data Set: The data set consists of Category, Call groups, final case type, case desc, Date, X-coordinates, Y- coordinates and census-tract. X-coordinates

More information

Lecture 10: Linear Multistep Methods (LMMs)

Lecture 10: Linear Multistep Methods (LMMs) Lecture 10: Linear Multistep Methods (LMMs) 2nd-order Adams-Bashforth Method The approximation for the 2nd-order Adams-Bashforth method is given by equation (10.10) in the lecture note for week 10, as

More information

Tools for Feature Extraction: Exploring essentia

Tools for Feature Extraction: Exploring essentia Tools for Feature Extraction: Exploring essentia MUS-15 Andrea Hanke July 5, 2017 Introduction In the research on Music Information Retrieval, it is attempted to automatically classify a piece of music

More information

Skriptsprachen. Numpy und Scipy. Kai Dührkop. Lehrstuhl fuer Bioinformatik Friedrich-Schiller-Universitaet Jena

Skriptsprachen. Numpy und Scipy. Kai Dührkop. Lehrstuhl fuer Bioinformatik Friedrich-Schiller-Universitaet Jena Skriptsprachen Numpy und Scipy Kai Dührkop Lehrstuhl fuer Bioinformatik Friedrich-Schiller-Universitaet Jena kai.duehrkop@uni-jena.de 24. September 2015 24. September 2015 1 / 37 Numpy Numpy Numerische

More information

DATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON

DATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON DATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON LEAD CONSULTANT, INFOSYS LIMITED SEZ Survey No. 41 (pt) 50 (pt), Singapore Township PO, Ghatkesar Mandal, Hyderabad, Telengana 500088 Word Limit of the

More information

Computational Physics HW3

Computational Physics HW3 Computational Physics HW3 Luke Bouma August 9, 215 1 The Stefan-Boltzmann constant In the angular frequency interval ω to ω + dω, black bodies of unit area radiate a thermal energy per second equal to

More information

Bayesian course - problem set 5 (lecture 6)

Bayesian course - problem set 5 (lecture 6) Bayesian course - problem set 5 (lecture 6) Ben Lambert November 30, 2016 1 Stan entry level: discoveries data The file prob5 discoveries.csv contains data on the numbers of great inventions and scientific

More information

Homework Example Chapter 1 Similar to Problem #14

Homework Example Chapter 1 Similar to Problem #14 Chapter 1 Similar to Problem #14 Given a sample of n = 129 observations of shower-flow-rate, do this: a.) Construct a stem-and-leaf display of the data. b.) What is a typical, or representative flow rate?

More information

Fin System, Inc. Company Report. Temperature Profile Calculators. Team 1 J. C. Stewards, Lead A. B. Williams, Documentation M. D.

Fin System, Inc. Company Report. Temperature Profile Calculators. Team 1 J. C. Stewards, Lead A. B. Williams, Documentation M. D. Fin System, Inc. Company Report Temperature Profile Calculators Team 1 J. C. Stewards, Lead A. B. Williams, Documentation M. D. Daily, Programmer Submitted in Fulfillment of Management Requirements August

More information

The Metropolis Algorithm

The Metropolis Algorithm 16 Metropolis Algorithm Lab Objective: Understand the basic principles of the Metropolis algorithm and apply these ideas to the Ising Model. The Metropolis Algorithm Sampling from a given probability distribution

More information

(a) (i) Use StatCrunch to simulate 1000 random samples of size n = 10 from this population.

(a) (i) Use StatCrunch to simulate 1000 random samples of size n = 10 from this population. Chapter 8 Sampling Distribution Ch 8.1 Distribution of Sample Mean Objective A : Shape, Center, and Spread of the Distributions of A1. Sampling Distributions of Mean A1.1 Sampling Distribution of the Sample

More information

ACCESS Physics Week #2. Three Computer Simulation activities

ACCESS Physics Week #2. Three Computer Simulation activities ACCESS Physics Week #2 Three Computer Simulation activities Activity 1 (p1/5): The Normal (Gaussian) Distribution Generating Gaussian (normally distributed) deviates: Use the norminv(p,mu,sigma) p=cumulative

More information

HOMEWORK #4: LOGISTIC REGRESSION

HOMEWORK #4: LOGISTIC REGRESSION HOMEWORK #4: LOGISTIC REGRESSION Probabilistic Learning: Theory and Algorithms CS 274A, Winter 2019 Due: 11am Monday, February 25th, 2019 Submit scan of plots/written responses to Gradebook; submit your

More information

Name: Date: Period: Activity 5.1.1: Hurricanes

Name: Date: Period: Activity 5.1.1: Hurricanes Name: Date: Period: Activity 5.1.1: Hurricanes Each year tropical storms that form in the Atlantic Ocean are given names. The first named storm starts with A, the second starts with B, and so on. A tropical

More information

Conjugate-Gradient. Learn about the Conjugate-Gradient Algorithm and its Uses. Descent Algorithms and the Conjugate-Gradient Method. Qx = b.

Conjugate-Gradient. Learn about the Conjugate-Gradient Algorithm and its Uses. Descent Algorithms and the Conjugate-Gradient Method. Qx = b. Lab 1 Conjugate-Gradient Lab Objective: Learn about the Conjugate-Gradient Algorithm and its Uses Descent Algorithms and the Conjugate-Gradient Method There are many possibilities for solving a linear

More information

Google Adwords. 8WEB Google Adwords. Capture leads & make sales. Y o u r P a r t n e r s I n O n l i n e S a l e s

Google Adwords. 8WEB Google Adwords. Capture leads & make sales. Y o u r P a r t n e r s I n O n l i n e S a l e s 8WEB Google Adwords Capture leads & make sales 8WEB Google Adwords Y o u r P a r t n e r s I n O n l i n e S a l e s 0407 924 368 8web.com.au PO BOX 8, Callala Beach, NSW 2540 W H A T I S G O O G L E A

More information

Image Processing in Numpy

Image Processing in Numpy Version: January 17, 2017 Computer Vision Laboratory, Linköping University 1 Introduction Image Processing in Numpy Exercises During this exercise, you will become familiar with image processing in Python.

More information

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. Data 8 Fall 2017 Foundations of Data Science Final INSTRUCTIONS You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. The exam is closed

More information

Write a simple 1D DFT code in Python

Write a simple 1D DFT code in Python Write a simple 1D DFT code in Python Ask Hjorth Larsen, asklarsen@gmail.com Keenan Lyon, lyon.keenan@gmail.com September 15, 2018 Overview Our goal is to write our own KohnSham (KS) density functional

More information

ECE 5615/4615 Computer Project

ECE 5615/4615 Computer Project Set #1p Due Friday March 17, 017 ECE 5615/4615 Computer Project The details of this first computer project are described below. This being a form of take-home exam means that each person is to do his/her

More information

NUMERICAL ANALYSIS WEEKLY OVERVIEW

NUMERICAL ANALYSIS WEEKLY OVERVIEW NUMERICAL ANALYSIS WEEKLY OVERVIEW M. AUTH 1. Monday 28 August Students are encouraged to download Anaconda Python. Anaconda is a version of Python that comes with some numerical packages (numpy and matplotlib)

More information

GEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON

GEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON GEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON HEMAN SHAKERI Network Science and Engineering Group (NetSE) Department of Electrical and Computer Engineering Kansas State University Manhattan,

More information

Dengue Forecasting Project

Dengue Forecasting Project Dengue Forecasting Project In areas where dengue is endemic, incidence follows seasonal transmission patterns punctuated every few years by much larger epidemics. Because these epidemics are currently

More information

STATISTICAL THINKING IN PYTHON I. Introduction to summary statistics: The sample mean and median

STATISTICAL THINKING IN PYTHON I. Introduction to summary statistics: The sample mean and median STATISTICAL THINKING IN PYTHON I Introduction to summary statistics: The sample mean and median 2008 US swing state election results Data retrieved from Data.gov (https://www.data.gov/) 2008 US swing state

More information

ntopic Organic Traffic Study

ntopic Organic Traffic Study ntopic Organic Traffic Study 1 Abstract The objective of this study is to determine whether content optimization solely driven by ntopic recommendations impacts organic search traffic from Google. The

More information

Extracting Canadian Climate Data from Environment Canada dataset

Extracting Canadian Climate Data from Environment Canada dataset University of British Columbia Department of Statistics Technical Report #244 May 2009 Extracting Canadian Climate Data from Environment Canada dataset by Reza Hosseini 1 1 University of British Columbia

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

Course Information Course Overview Study Skills Background Material. Introduction. CS 205A: Mathematical Methods for Robotics, Vision, and Graphics

Course Information Course Overview Study Skills Background Material. Introduction. CS 205A: Mathematical Methods for Robotics, Vision, and Graphics Introduction CS 205A: Mathematical Methods for Robotics, Vision, and Graphics Doug James CS 205A: Mathematical Methods Introduction 1 / 16 Instructor Prof. Doug James Office: Gates 363 Telephone: (650)

More information

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

Test 2 - Python Edition

Test 2 - Python Edition 'XNH8QLYHUVLW\ (GPXQG7UDWW-U6FKRRORI(QJLQHHULQJ EGR 10L Spring 2018 Test 2 - Python Edition Shaundra B. Daily & Michael R. Gustafson II Name (please print): NetID (please print): In keeping with the Community

More information

STATISTICAL THINKING IN PYTHON I. Probabilistic logic and statistical inference

STATISTICAL THINKING IN PYTHON I. Probabilistic logic and statistical inference STATISTICAL THINKING IN PYTHON I Probabilistic logic and statistical inference 50 measurements of petal length Statistical Thinking in Python I 50 measurements of petal length Statistical Thinking in Python

More information

Tutorial Three: Loops and Conditionals

Tutorial Three: Loops and Conditionals Tutorial Three: Loops and Conditionals Imad Pasha Chris Agostino February 18, 2015 1 Introduction In lecture Monday we learned that combinations of conditionals and loops could make our code much more

More information

COMP3211 Report - Cart Pole Problem

COMP3211 Report - Cart Pole Problem COMP3211 Report - Cart Pole Problem Team members: CHAU Tsun Man (20265761) CHEUNG Wai Kwan (20272910) KOO Tin Lok (20344775) Task Definition The system is trying to solve the cart pole problem of the OpenAI

More information

Managing Uncertainty

Managing Uncertainty Managing Uncertainty Bayesian Linear Regression and Kalman Filter December 4, 2017 Objectives The goal of this lab is multiple: 1. First it is a reminder of some central elementary notions of Bayesian

More information

Detecting Climate Change through Means and Extremes

Detecting Climate Change through Means and Extremes Detecting Climate Change through Means and Extremes New Mexico Supercomputing Challenge Final Report April 6, 2016 Team 55 Los Alamos Middle School Team Member: Lillian Petersen Teacher: Adam Drew Mentor:

More information

VTU Edusat Programme 16

VTU Edusat Programme 16 VTU Edusat Programme 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and

More information

The Periodic Table By Primo Levi READ ONLINE

The Periodic Table By Primo Levi READ ONLINE The Periodic Table By Primo Levi READ ONLINE Periodic Table, periodic table trends, periodic table quiz, Chemistry education software, whether in the form of chemistry lab simulations, data acquisition

More information

Instrument Cross-Comparisons and Automated Quality Control of Atmospheric Radiation Measurement Data

Instrument Cross-Comparisons and Automated Quality Control of Atmospheric Radiation Measurement Data Instrument Cross-Comparisons and Automated Quality Control of Atmospheric Radiation Measurement Data S. Moore and G. Hughes ATK Mission Research Santa Barbara, California Introduction Within the Atmospheric

More information

SME 864 Mark Urban-Lurain

SME 864 Mark Urban-Lurain SME 864 Mark Urban-Lurain 1 Import data from non-excel sources Probe Software Web sites Other sources Organize data Structure file for analysis Clean values Analyze Summarize Statistics Graph 2 Get files

More information

Frequency and Histograms

Frequency and Histograms Warm Up Lesson Presentation Lesson Quiz Algebra 1 Create stem-and-leaf plots. Objectives Create frequency tables and histograms. Vocabulary stem-and-leaf plot frequency frequency table histogram cumulative

More information

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test Discrete distribution Fitting probability models to frequency data A probability distribution describing a discrete numerical random variable For example,! Number of heads from 10 flips of a coin! Number

More information

HIRES 2017 Syllabus. Instructors:

HIRES 2017 Syllabus. Instructors: HIRES 2017 Syllabus Instructors: Dr. Brian Vant-Hull: Steinman 185, 212-650-8514, brianvh@ce.ccny.cuny.edu Ms. Hannah Aizenman: NAC 7/311, 212-650-6295, haizenman@ccny.cuny.edu Dr. Tarendra Lakhankar:

More information

Computer projects for Mathematical Statistics, MA 486. Some practical hints for doing computer projects with MATLAB:

Computer projects for Mathematical Statistics, MA 486. Some practical hints for doing computer projects with MATLAB: Computer projects for Mathematical Statistics, MA 486. Some practical hints for doing computer projects with MATLAB: You can save your project to a text file (on a floppy disk or CD or on your web page),

More information

STAT2201 Assignment 6 Semester 1, 2017 Due 26/5/2017

STAT2201 Assignment 6 Semester 1, 2017 Due 26/5/2017 Class Example 1. Linear Regression Example The code below uses BrisGCtemp.csv, as appeared in a class example of assignment 3. This file contains temperature observations recorded in Brisbane and the GoldCoast.

More information

Product Quality Disclaimer

Product Quality Disclaimer ENVI-GSOP-EOGD-QD-4-49 Affected Data Sets Various Disclaimer Title L1b disclaimers with an impact on the ATS_AR 2P product L2 spatially averaged data are affected by the following disclaimers against the

More information

Pytorch Tutorial. Xiaoyong Yuan, Xiyao Ma 2018/01

Pytorch Tutorial. Xiaoyong Yuan, Xiyao Ma 2018/01 (Li Lab) National Science Foundation Center for Big Learning (CBL) Department of Electrical and Computer Engineering (ECE) Department of Computer & Information Science & Engineering (CISE) Pytorch Tutorial

More information

f = Xw + b, We can compute the total square error of the function values above, compared to the observed training set values:

f = Xw + b, We can compute the total square error of the function values above, compared to the observed training set values: Linear regression Much of machine learning is about fitting functions to data. That may not sound like an exciting activity that will give us artificial intelligence. However, representing and fitting

More information

/home/thierry/columbia/msongsdb/pyreport_tutorials/tutorial1/tutorial1.py January 23, 20111

/home/thierry/columbia/msongsdb/pyreport_tutorials/tutorial1/tutorial1.py January 23, 20111 /home/thierry/columbia/msongsdb/pyreport_tutorials/tutorial1/tutorial1.py January 23, 20111 27 """ 28 Tutorial for the Million Song Dataset 29 30 by Thierry Bertin - Mahieux ( 2011) Columbia University

More information

Study Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33]

Study Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33] GOALS: 1. Learn the properties of the χ 2 Distribution. 2. Understand how the shape of the χ 2 Distribution changes as the df increases. 3. Be able to find p values. 4. Recognize that χ 2 tests are right

More information

InSchedule File Upload Definition

InSchedule File Upload Definition InSchedule File Upload Definition Jennifer Long Sr Business Analyst PJM Interconnection PJM 2012 I. Purpose This document identifies the file upload process for the InSchedule application. It will discuss

More information

<br /> D. Thiebaut <br />August """Example of DNNRegressor for Housing dataset.""" In [94]:

<br /> D. Thiebaut <br />August Example of DNNRegressor for Housing dataset. In [94]: sklearn Tutorial: Linear Regression on Boston Data This is following the https://github.com/tensorflow/tensorflow/blob/maste

More information

Shootout 2017 Problem Statement

Shootout 2017 Problem Statement Shootout 2017 Problem Statement Understanding Wildfires 1 Background In the fall of 2016, a number of wildfires occurred in the southeastern United States. Spanning over 5 states and several weeks, more

More information

x k+1 = x k + α k p k (13.1)

x k+1 = x k + α k p k (13.1) 13 Gradient Descent Methods Lab Objective: Iterative optimization methods choose a search direction and a step size at each iteration One simple choice for the search direction is the negative gradient,

More information

Data Intensive Computing Handout 11 Spark: Transitive Closure

Data Intensive Computing Handout 11 Spark: Transitive Closure Data Intensive Computing Handout 11 Spark: Transitive Closure from random import Random spark/transitive_closure.py num edges = 3000 num vertices = 500 rand = Random(42) def generate g raph ( ) : edges

More information

OCEAN & SEA ICE SAF CDOP2. OSI-SAF Metop-A IASI Sea Surface Temperature L2P (OSI-208) Validation report. Version 1.4 April 2015

OCEAN & SEA ICE SAF CDOP2. OSI-SAF Metop-A IASI Sea Surface Temperature L2P (OSI-208) Validation report. Version 1.4 April 2015 OCEAN & SEA ICE SAF CDOP2 OSI-SAF Metop-A IASI Sea Surface Temperature L2P (OSI-208) Validation report Version 1.4 April 2015 A. O Carroll and A. Marsouin EUMETSAT, Eumetsat-Allee 1, Darmstadt 64295, Germany

More information

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40

More information

Data Analysis of NYC Cab Services

Data Analysis of NYC Cab Services Data Analysis of NYC Cab Services Abhinandan Dubey Stony Brook University New York, USA Raju Khanal Stony Brook University New York, USA [adubey, rkhanal, lmadiraju]@cs.stonybrook.edu Teja Madiraju Stony

More information

Pre-Algebra Semester 1 Practice Exam B DRAFT

Pre-Algebra Semester 1 Practice Exam B DRAFT . Evaluate x y 5 6 80 when x = 0 and y =.. Which expression is equivalent to? + + + +. In Pre-Algebra class, we follow the order of operations in evaluating expressions. Which operation should a student

More information

Binomial random variable

Binomial random variable Binomial random variable Toss a coin with prob p of Heads n times X: # Heads in n tosses X is a Binomial random variable with parameter n,p. X is Bin(n, p) An X that counts the number of successes in many

More information

Statistical methods in NLP Introduction

Statistical methods in NLP Introduction Statistical methods in NLP Introduction Richard Johansson January 20, 2015 today course matters analysing numerical data with Python basic notions of probability simulating random events in Python overview

More information

HOMEWORK #4: LOGISTIC REGRESSION

HOMEWORK #4: LOGISTIC REGRESSION HOMEWORK #4: LOGISTIC REGRESSION Probabilistic Learning: Theory and Algorithms CS 274A, Winter 2018 Due: Friday, February 23rd, 2018, 11:55 PM Submit code and report via EEE Dropbox You should submit a

More information

SECTION 7: CURVE FITTING. MAE 4020/5020 Numerical Methods with MATLAB

SECTION 7: CURVE FITTING. MAE 4020/5020 Numerical Methods with MATLAB SECTION 7: CURVE FITTING MAE 4020/5020 Numerical Methods with MATLAB 2 Introduction Curve Fitting 3 Often have data,, that is a function of some independent variable,, but the underlying relationship is

More information

What s Cooking? Predicting Cuisines from Recipe Ingredients

What s Cooking? Predicting Cuisines from Recipe Ingredients What s Cooking? Predicting Cuisines from Recipe Ingredients Kevin K. Do Department of Computer Science Duke University Durham, NC 27708 kevin.kydat.do@gmail.com Abstract Kaggle is an online platform for

More information

Data science and engineering for local weather forecasts. Nikhil R Podduturi Data {Scientist, Engineer} November, 2016

Data science and engineering for local weather forecasts. Nikhil R Podduturi Data {Scientist, Engineer} November, 2016 1 Data science and engineering for local weather forecasts Nikhil R Podduturi Data {Scientist, Engineer} November, 2016 Agenda About MeteoGroup Introduction to weather data Problem description Data science

More information

CALIFORNIA INSTITUTE OF TECHNOLOGY Division of Physics, Mathematics, and Astronomy. Ay190 Computational Astrophysics

CALIFORNIA INSTITUTE OF TECHNOLOGY Division of Physics, Mathematics, and Astronomy. Ay190 Computational Astrophysics CALIFORNIA INSTITUTE OF TECHNOLOGY Division of Physics, Mathematics, and Astronomy Ay190 Computational Astrophysics Christian D. Ott, Andrew Benson, and Michael W. Eastwood cott@tapir.caltech.edu, abenson@tapir.caltech.edu,

More information

Mark Scheme (Results) Summer GCE Physics (6PH07) Paper 01 Exploring Physics (Written Alternative)

Mark Scheme (Results) Summer GCE Physics (6PH07) Paper 01 Exploring Physics (Written Alternative) Mark Scheme (Results) Summer 2012 GCE Physics (6PH07) Paper 01 Exploring Physics (Written Alternative) Edexcel and BTEC Qualifications Edexcel and BTEC qualifications come from Pearson, the world s leading

More information

Standards of Learning Content Review Notes. Grade 7 Mathematics 3 rd Nine Weeks,

Standards of Learning Content Review Notes. Grade 7 Mathematics 3 rd Nine Weeks, Standards of Learning Content Review Notes Grade 7 Mathematics 3 rd Nine Weeks, 2016-2017 1 2 Content Review: Standards of Learning in Detail Grade 7 Mathematics: Third Nine Weeks 2016-2017 This resource

More information

NOAA s Climate Normals. Pre-release Webcast presented by NOAA s National Climatic Data Center June 13, 2011

NOAA s Climate Normals. Pre-release Webcast presented by NOAA s National Climatic Data Center June 13, 2011 NOAA s 1981-2010 Climate Normals Pre-release Webcast presented by NOAA s National Climatic Data Center June 13, 2011 Takeaway Messages Most Normals will be available July 1 via FTP NWS Normals to be loaded

More information

Minimum semi-major axis of extrasolar planets in relation to dust sublimation zones

Minimum semi-major axis of extrasolar planets in relation to dust sublimation zones Minimum semi-major axis of extrasolar planets in relation to dust sublimation zones Frane Lunić Supervisor: assoc.prof.dr. Dejan Vinković Split, September 2014 Bachelor Thesis in Physics Department of

More information