EP219:Data Analysis And Interpretation
|
|
- Martina Merritt
- 5 years ago
- Views:
Transcription
1 EP219:Data Analysis And Interpretation Report: Week 1 Team Poisson ous 28 July 2017 to 4 August
2 Contents 1 Problem Statement 3 2 Python Program 3 3 Histograms 6 4 Inference 7 5 Team responsibilities 7 6 Website 8 2
3 1 Problem Statement Given the text file Complete TAVG daily.txt that contains the daily temperature data from This file shows the average land temperature fluctuations averaged over the entire earth. Using this data the following analysis has been done. Extract the full data set to a numpy array. To this array add a column which shows the actual temperature on that day. Make a histogram of the temperature distribution for the year Re- peat this for the year Make a histogram of the temperature distribution for the temperature on 1st and 15th day of every month for the year 1912 and Python Program Here is the code for plotting the required histograms. Parts of the code to append extra column to the data imported as a numpy array, has been taken from Prof. Rentala s example code. 1 import pandas a s pd 2 import numpy a s np 3 import m a t p l o t l i b. pyplot as p l t 4 5 #Code to get the number o f commented l i n e s b e f o r e the a c t u a l data s t a r t s 6 D a t a f i l e=open ( Complete TAVG daily. txt ) 7 Ncommentedlines=0 8 while D a t a f i l e. read ( 1 )== % : 9 D a t a f i l e. r e a d l i n e ( ) 10 Ncommentedlines=Ncommentedlines+1 11 D a t a f i l e. c l o s e ( ) #S t o r i n g the data in a dataframe df with no headers and whitespace as d e l i m i t e r s ; the f i r s t Nommentedlines rows are skipped 14 df = pd. r e a d t a b l e ( Complete TAVG daily. txt, delim whitespace= True, header=none, skiprows=ncommentedlines ) #Data i s now s t o r e d in an numpy array 17 rawdatatable=df. v a l u e s 3
4 18 19 #Function i s c r e a t e d that takes any number to number tempfix = lambda t : t #f u n c t i o n i s v e c t o r i z e d, so i t can be a p p l i e d to an array 23 vtempfix = np. v e c t o r i z e ( tempfix ) #Vectorized f u n c t i o n i s a p p l i e d to column corresponding to temperature anamolies 26 newcolumn = vtempfix ( rawdatatable [ :, 5 ] ) #new column i s converted i n t o a 2 dimensional array so that i t can be appended to o r i g i n a l data 29 reshapednewcolumn = newcolumn. reshape ( newcolumn. s i z e, 1 ) #b e t t e r data i s the o r i g i n a l data with an added column o f temperature a x i s=1 i m p l i e s data i s appended columnwise 32 b e t t e r d a t a = np. append ( rawdatatable, reshapednewcolumn, a x i s =1) #cutdata c o n t a i n s only those rows o f b e t t e r d a t a f o r which the year i s i 35 cutdata = lambda i : b e t t e r d a t a [ ( b e t t e r d a t a [:,1]== i ), : ] 36 #A f u n c t i o n i s d e f i n e d to p l o t histograms f o r temperature data 37 def HistTemp ( year, c o l o u r ) : 38 #tempdata c o n t a i n s the temperature data o f each day f o r which the year was year 39 tempdata=cutdata ( year ) [ :, 6 ] #to get the l i m i t s o f histogram we f i n d the minimum and maximum v a l u e s o f temperature 4 i s m u l t i p l i e d and d ivided to get f i r s t bin value to the accuracy o f xmin=i n t (4 min ( tempdata ) ) / xmax=i n t (4 max( tempdata ) ) / #Number o f bins i s c a l c u l a t e d to get the array o f bins 46 Numbins=((xmax xmin ) 4) #binarray i s the array which i s p a s s e s as a parameter to histogram f u n c t i o n 49 binarray=np. l i n s p a c e ( xmin, xmax,num=numbins ) #Histogram i s p l o t t e d by p l t. h i s t f u n c t i o n 52 p l t. h i s t ( tempdata, bins=binarray, f a c e c o l o r=colour, alpha =0.9, rwidth =0.95) 53 4
5 54 #a x i s are l a b e l l e d and t i t l e i s given 55 p l t. t i t l e ( Temperature D i s t r i b u t i o n f o r the Year +s t r ( year ) ) 56 p l t. x l a b e l ( Temperature in C e l s i u s ) 57 p l t. y l a b e l ( Number o f days ) 58 p l t. xlim ( [ xmin, xmax ] ) 59 p l t. x t i c k s ( np. arange ( xmin, xmax, ) ) #p l o t i s d i s p l a y e d 63 p l t. show ( ) #Function i s c a l l e d f o r both the years r e q u i r e d 66 HistTemp (1912, green ) 67 HistTemp (2012, blue ) 68 #Function i s c r e a t e d f o r p r i n t i n g only datas o f two s p e c i f i c dates o f a month, year i s the year, day1 and day2 are the days o f the month r e q u i r e d 69 def HistSpecificTemp ( year, day1, day2, c o l o u r ) : 70 tempdata=cutdata ( year ) 71 r e f d a t a=tempdata [ ( tempdata [:,3]== day1 )+(tempdata [:,3]== day2 ), : ] [ :, 6 ] 72 #to get the l i m i t s o f histogram we f i n d the minimum and maximum v a l u e s o f temperature, 4 i s m u l t i p l i e d and d ivided to get f i r s t bin value to the accuracy o f xmin=i n t (4 min ( r e f d a t a ) ) / xmax=i n t (4 max( r e f d a t a ) ) / #Number o f bins i s c a l c u l a t e d to get the array o f bins 77 Numbins=((xmax xmin ) 4) #binarray i s the array which i s p a s s e s as a parameter to histogram f u n c t i o n 80 binarray=np. l i n s p a c e ( xmin, xmax,num=numbins ) #Histogram i s p l o t t e d by p l t. h i s t f u n c t i o n 83 p l t. h i s t ( refdata, bins=binarray, f a c e c o l o r=colour, alpha =0.9, rwidth =0.95) #a x i s are l a b e l l e d and t i t l e i s given 86 p l t. t i t l e ( Refined Temperature D i s t r i b u t i o n f o r the Year +s t r ( year ) ) 87 p l t. x l a b e l ( Temperature in C e l s i u s ) 88 p l t. y l a b e l ( Number o f days ) 89 p l t. xlim ( [ xmin, xmax ] ) 90 p l t. x t i c k s ( np. arange ( xmin, xmax, ) ) 5
6 91 #p l o t i s d i s p l a y e d 92 p l t. show ( ) #Function i s c a l l e d f o r year 1912 and HistSpecificTemp ( , 1, 15, green ) 96 HistSpecificTemp ( , 1, 15, blue ) #End o f Code 3 Histograms The following are the histograms for the temperature distribution of the year 1912 and 2012 considering the daily temperatures. (a) Year 1912 (b) Year 2012 Figure 1: the temperature distribution considering the daily temperatures Following are the histograms for the temperature distribution of the year 1912 and 2012 considering the temperatures on the 1st and 15th day of every month. 6
7 (a) Year 1912 (b) Year 2012 Figure 2: the temperature distribution considering the temperatures on the 1st and 15th day of every month 4 Inference We expected that the refined temperature histogram, having lesser data points is more likely to miss out the data points that lie on the fringes. Thus we expected the range of the distribution to be smaller. From the Histograms plotted we infer that the spread of the temperature in the Histogram of Temperature distribution for entire year is same as the spread of the Histogram of Temperature distribution for first and the fifteenth day of each month for both years 1912 and Moreover we see the temperature value at which the maxima of the temperature distribution occurs is higher in 2012 than in We infer from this that in fact global warming is a real issue. 5 Team responsibilities Project Leader - Saipriya Satyajit Programmer - Keshav Janyani Web Manager - Ananay Garg Report Writers - Abhisek Sahu and Ashay Telang 7
8 6 Website The link to our website is Team Poisson-ous All the assignments will be uploaded on this site. 8
Numpy. Luis Pedro Coelho. October 22, Programming for Scientists. Luis Pedro Coelho (Programming for Scientists) Numpy October 22, 2012 (1 / 26)
Numpy Luis Pedro Coelho Programming for Scientists October 22, 2012 Luis Pedro Coelho (Programming for Scientists) Numpy October 22, 2012 (1 / 26) Historical Numeric (1995) Numarray (for large arrays)
More informationCS 237 Fall 2018, Homework 07 Solution
CS 237 Fall 2018, Homework 07 Solution Due date: Thursday November 1st at 11:59 pm (10% off if up to 24 hours late) via Gradescope General Instructions Please complete this notebook by filling in solutions
More informationSolution to running JLA
Solution to running JLA Benjamin Audren École Polytechnique Fédérale de Lausanne 08/10/2014 Benjamin Audren (EPFL) CLASS/MP MP runs 08/10/2014 1 / 13 Parameter file data. e x p e r i m e n t s =[ JLA ]
More informationPANDAS FOUNDATIONS. pandas Foundations
PANDAS FOUNDATIONS pandas Foundations What is pandas? Python library for data analysis High-performance containers for data analysis Data structures with a lot of functionality Meaningful labels Time series
More informationCS 237 Fall 2018, Homework 06 Solution
0/9/20 hw06.solution CS 237 Fall 20, Homework 06 Solution Due date: Thursday October th at :59 pm (0% off if up to 24 hours late) via Gradescope General Instructions Please complete this notebook by filling
More informationCreative Data Mining
Creative Data Mining Using ML algorithms in python Artem Chirkin Dr. Daniel Zünd Danielle Griego Lecture 7 0.04.207 /7 What we will cover today Outline Getting started Explore dataset content Inspect visually
More informationExercise 5 Release: Due:
Stochastic Modeling and Simulation Winter 28 Prof. Dr. I. F. Sbalzarini, Dr. Christoph Zechner (MPI-CBG/CSBD TU Dresden, 87 Dresden, Germany Exercise 5 Release: 8..28 Due: 5..28 Question : Variance of
More informationLectures about Python, useful both for beginners and experts, can be found at (http://scipy-lectures.github.io).
Random Matrix Theory (Sethna, "Entropy, Order Parameters, and Complexity", ex. 1.6, developed with Piet Brouwer) 2016, James Sethna, all rights reserved. This is an ipython notebook. This hints file is
More informationPython Analysis. PHYS 224 September 25/26, 2014
Python Analysis PHYS 224 September 25/26, 2014 Goals Two things to teach in this lecture 1. How to use python to fit data 2. How to interpret what python gives you Some references: http://nbviewer.ipython.org/url/media.usm.maine.edu/~pauln/
More informationPython Analysis. PHYS 224 October 1/2, 2015
Python Analysis PHYS 224 October 1/2, 2015 Goals Two things to teach in this lecture 1. How to use python to fit data 2. How to interpret what python gives you Some references: http://nbviewer.ipython.org/url/media.usm.maine.edu/~pauln/
More informationIntroduction to Python
Introduction to Python Luis Pedro Coelho Institute for Molecular Medicine (Lisbon) Lisbon Machine Learning School II Luis Pedro Coelho (IMM) Introduction to Python Lisbon Machine Learning School II (1
More information2 Getting Started with Numerical Computations in Python
1 Documentation and Resources * Download: o Requirements: Python, IPython, Numpy, Scipy, Matplotlib o Windows: google "windows download (Python,IPython,Numpy,Scipy,Matplotlib" o Debian based: sudo apt-get
More information/Users/jenskremkow/Science/Courses/python-summerschool-berlin/faculty/Day2/examples numpy.py September 2,
/Users/jenskremkow/Science/Courses/python-summerschool-berlin/faculty/Day2/examples numpy.py September 2, 2009 1 Numpy Many of the examples are taken from: http://www.scipy.org/cookbook Building Arrays
More informationComplex Numbers. A complex number z = x + iy can be written in polar coordinates as re i where
Lab 20 Complex Numbers Lab Objective: Create visualizations of complex functions. Visually estimate their zeros and poles, and gain intuition about their behavior in the complex plane. Representations
More informationParametric Model for the LWA-1 Dipole Response as a Function of Frequency
Parametric Model for the LWA-1 Dipole Response as a Function of Frequency Jayce Dowell December 20, 2011 LWA Memo #178 Version 2 Contents 1 Introduction 2 2 Methods 2 3 Results 3 4 Application 3 A Document
More informationSTAT2201 Assignment 3 Semester 1, 2017 Due 13/4/2017
Class Example 1. Single Sample Descriptive Statistics (a) Summary Statistics and Box-Plots You are working in factory producing hand held bicycle pumps and obtain a sample of 174 bicycle pump weights in
More informationMCP Scrubber Monitoring
MCP Scrubber Monitoring Hannah Tomio August 18, 2015 1 Introduction The scrubber is a device that bombards microchannel plates (MCP s) with electrons, thus scrubbing them. This is done both to stabilize
More informationAnalyzing the Earth Using Remote Sensing
Analyzing the Earth Using Remote Sensing Instructors: Dr. Brian Vant- Hull: Steinman 185, 212-650- 8514 brianvh@ce.ccny.cuny.edu Ms. Hannah Aizenman: NAC 7/311, 212-650- 6295 haizenman@ccny.cuny.edu Dr.
More informationNotater: INF3331. Veronika Heimsbakk December 4, Introduction 3
Notater: INF3331 Veronika Heimsbakk veronahe@student.matnat.uio.no December 4, 2013 Contents 1 Introduction 3 2 Bash 3 2.1 Variables.............................. 3 2.2 Loops...............................
More informationExploratory data analysis
Exploratory data analysis November 29, 2017 Dr. Khajonpong Akkarajitsakul Department of Computer Engineering, Faculty of Engineering King Mongkut s University of Technology Thonburi Module III Overview
More informationimport pandas as pd d = {"A":[1,2,np.nan], "B":[5,np.nan,np.nan], "C":[1,2,3]} In [9]: Out[8]: {'A': [1, 2, nan], 'B': [5, nan, nan], 'C': [1, 2, 3]}
In [4]: import numpy as np In [5]: import pandas as pd In [7]: d = {"":[1,2,np.nan], "B":[5,np.nan,np.nan], "C":[1,2,3]} In [8]: d Out[8]: {'': [1, 2, nan], 'B': [5, nan, nan], 'C': [1, 2, 3]} In [9]:
More informationLearning Deep Broadband Hongjoo LEE
Learning Deep Broadband Network@HOME Hongjoo LEE Who am I? Machine Learning Engineer Software Engineer Fraud Detection System Software Defect Prediction Email Services (40+ mil. users) High traffic server
More informationComputational Physics HW2
Computational Physics HW2 Luke Bouma July 27, 2015 1 Plotting experimental data 1.1 Plotting sunspots.txt in Python: The figure above is the output from from numpy import l o a d t x t data = l o a d t
More informationPython & Numpy A tutorial
Python & Numpy A tutorial Devert Alexandre School of Software Engineering of USTC 13 February 2012 Slide 1/38 Table of Contents 1 Why Python & Numpy 2 First steps with Python 3 Fun with lists 4 Quick tour
More informationMATH 250 Homework 4: Due May 4, 2017
Due May 4, 17 Answer the following questions to the best of your ability. Solutions should be typed. Any plots or graphs should be included with the question (please include the questions in your solutions).
More informationPropensity Score Matching
Propensity Score Matching This notebook illustrates how to do propensity score matching in Python. Original dataset available at: http://biostat.mc.vanderbilt.edu/wiki/main/datasets (http://biostat.mc.vanderbilt.edu/wiki/main/datasets)
More informationComplex Numbers. Visualize complex functions to estimate their zeros and poles.
Lab 1 Complex Numbers Lab Objective: Visualize complex functions to estimate their zeros and poles. Polar Representation of Complex Numbers Any complex number z = x + iy can be written in polar coordinates
More informationExponential, Gamma and Normal Distribuions
Exponential, Gamma and Normal Distribuions Sections 5.4, 5.5 & 6.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy Poliak,
More informationLecture 08: Poisson and More. Lisa Yan July 13, 2018
Lecture 08: Poisson and More Lisa Yan July 13, 2018 Announcements PS1: Grades out later today Solutions out after class today PS2 due today PS3 out today (due next Friday 7/20) 2 Midterm announcement Tuesday,
More informationGradient Descent Methods
Lab 18 Gradient Descent Methods Lab Objective: Many optimization methods fall under the umbrella of descent algorithms. The idea is to choose an initial guess, identify a direction from this point along
More informationIntroduction to Python
Introduction to Python Luis Pedro Coelho luis@luispedro.org @luispedrocoelho European Molecular Biology Laboratory Lisbon Machine Learning School 2015 Luis Pedro Coelho (@luispedrocoelho) Introduction
More informationHot spot Analysis with Clustering on NIJ Data Set The Data Set:
Hot spot Analysis with Clustering on NIJ Data Set The Data Set: The data set consists of Category, Call groups, final case type, case desc, Date, X-coordinates, Y- coordinates and census-tract. X-coordinates
More informationLecture 10: Linear Multistep Methods (LMMs)
Lecture 10: Linear Multistep Methods (LMMs) 2nd-order Adams-Bashforth Method The approximation for the 2nd-order Adams-Bashforth method is given by equation (10.10) in the lecture note for week 10, as
More informationTools for Feature Extraction: Exploring essentia
Tools for Feature Extraction: Exploring essentia MUS-15 Andrea Hanke July 5, 2017 Introduction In the research on Music Information Retrieval, it is attempted to automatically classify a piece of music
More informationSkriptsprachen. Numpy und Scipy. Kai Dührkop. Lehrstuhl fuer Bioinformatik Friedrich-Schiller-Universitaet Jena
Skriptsprachen Numpy und Scipy Kai Dührkop Lehrstuhl fuer Bioinformatik Friedrich-Schiller-Universitaet Jena kai.duehrkop@uni-jena.de 24. September 2015 24. September 2015 1 / 37 Numpy Numpy Numerische
More informationDATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON
DATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON LEAD CONSULTANT, INFOSYS LIMITED SEZ Survey No. 41 (pt) 50 (pt), Singapore Township PO, Ghatkesar Mandal, Hyderabad, Telengana 500088 Word Limit of the
More informationComputational Physics HW3
Computational Physics HW3 Luke Bouma August 9, 215 1 The Stefan-Boltzmann constant In the angular frequency interval ω to ω + dω, black bodies of unit area radiate a thermal energy per second equal to
More informationBayesian course - problem set 5 (lecture 6)
Bayesian course - problem set 5 (lecture 6) Ben Lambert November 30, 2016 1 Stan entry level: discoveries data The file prob5 discoveries.csv contains data on the numbers of great inventions and scientific
More informationHomework Example Chapter 1 Similar to Problem #14
Chapter 1 Similar to Problem #14 Given a sample of n = 129 observations of shower-flow-rate, do this: a.) Construct a stem-and-leaf display of the data. b.) What is a typical, or representative flow rate?
More informationFin System, Inc. Company Report. Temperature Profile Calculators. Team 1 J. C. Stewards, Lead A. B. Williams, Documentation M. D.
Fin System, Inc. Company Report Temperature Profile Calculators Team 1 J. C. Stewards, Lead A. B. Williams, Documentation M. D. Daily, Programmer Submitted in Fulfillment of Management Requirements August
More informationThe Metropolis Algorithm
16 Metropolis Algorithm Lab Objective: Understand the basic principles of the Metropolis algorithm and apply these ideas to the Ising Model. The Metropolis Algorithm Sampling from a given probability distribution
More information(a) (i) Use StatCrunch to simulate 1000 random samples of size n = 10 from this population.
Chapter 8 Sampling Distribution Ch 8.1 Distribution of Sample Mean Objective A : Shape, Center, and Spread of the Distributions of A1. Sampling Distributions of Mean A1.1 Sampling Distribution of the Sample
More informationACCESS Physics Week #2. Three Computer Simulation activities
ACCESS Physics Week #2 Three Computer Simulation activities Activity 1 (p1/5): The Normal (Gaussian) Distribution Generating Gaussian (normally distributed) deviates: Use the norminv(p,mu,sigma) p=cumulative
More informationHOMEWORK #4: LOGISTIC REGRESSION
HOMEWORK #4: LOGISTIC REGRESSION Probabilistic Learning: Theory and Algorithms CS 274A, Winter 2019 Due: 11am Monday, February 25th, 2019 Submit scan of plots/written responses to Gradebook; submit your
More informationName: Date: Period: Activity 5.1.1: Hurricanes
Name: Date: Period: Activity 5.1.1: Hurricanes Each year tropical storms that form in the Atlantic Ocean are given names. The first named storm starts with A, the second starts with B, and so on. A tropical
More informationConjugate-Gradient. Learn about the Conjugate-Gradient Algorithm and its Uses. Descent Algorithms and the Conjugate-Gradient Method. Qx = b.
Lab 1 Conjugate-Gradient Lab Objective: Learn about the Conjugate-Gradient Algorithm and its Uses Descent Algorithms and the Conjugate-Gradient Method There are many possibilities for solving a linear
More informationGoogle Adwords. 8WEB Google Adwords. Capture leads & make sales. Y o u r P a r t n e r s I n O n l i n e S a l e s
8WEB Google Adwords Capture leads & make sales 8WEB Google Adwords Y o u r P a r t n e r s I n O n l i n e S a l e s 0407 924 368 8web.com.au PO BOX 8, Callala Beach, NSW 2540 W H A T I S G O O G L E A
More informationImage Processing in Numpy
Version: January 17, 2017 Computer Vision Laboratory, Linköping University 1 Introduction Image Processing in Numpy Exercises During this exercise, you will become familiar with image processing in Python.
More informationYou have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.
Data 8 Fall 2017 Foundations of Data Science Final INSTRUCTIONS You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. The exam is closed
More informationWrite a simple 1D DFT code in Python
Write a simple 1D DFT code in Python Ask Hjorth Larsen, asklarsen@gmail.com Keenan Lyon, lyon.keenan@gmail.com September 15, 2018 Overview Our goal is to write our own KohnSham (KS) density functional
More informationECE 5615/4615 Computer Project
Set #1p Due Friday March 17, 017 ECE 5615/4615 Computer Project The details of this first computer project are described below. This being a form of take-home exam means that each person is to do his/her
More informationNUMERICAL ANALYSIS WEEKLY OVERVIEW
NUMERICAL ANALYSIS WEEKLY OVERVIEW M. AUTH 1. Monday 28 August Students are encouraged to download Anaconda Python. Anaconda is a version of Python that comes with some numerical packages (numpy and matplotlib)
More informationGEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON
GEMF: GENERALIZED EPIDEMIC MODELING FRAMEWORK SOFTWARE IN PYTHON HEMAN SHAKERI Network Science and Engineering Group (NetSE) Department of Electrical and Computer Engineering Kansas State University Manhattan,
More informationDengue Forecasting Project
Dengue Forecasting Project In areas where dengue is endemic, incidence follows seasonal transmission patterns punctuated every few years by much larger epidemics. Because these epidemics are currently
More informationSTATISTICAL THINKING IN PYTHON I. Introduction to summary statistics: The sample mean and median
STATISTICAL THINKING IN PYTHON I Introduction to summary statistics: The sample mean and median 2008 US swing state election results Data retrieved from Data.gov (https://www.data.gov/) 2008 US swing state
More informationntopic Organic Traffic Study
ntopic Organic Traffic Study 1 Abstract The objective of this study is to determine whether content optimization solely driven by ntopic recommendations impacts organic search traffic from Google. The
More informationExtracting Canadian Climate Data from Environment Canada dataset
University of British Columbia Department of Statistics Technical Report #244 May 2009 Extracting Canadian Climate Data from Environment Canada dataset by Reza Hosseini 1 1 University of British Columbia
More informationIntroduction and Overview STAT 421, SP Course Instructor
Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614
More informationCourse Information Course Overview Study Skills Background Material. Introduction. CS 205A: Mathematical Methods for Robotics, Vision, and Graphics
Introduction CS 205A: Mathematical Methods for Robotics, Vision, and Graphics Doug James CS 205A: Mathematical Methods Introduction 1 / 16 Instructor Prof. Doug James Office: Gates 363 Telephone: (650)
More informationLecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis
Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,
More informationTest 2 - Python Edition
'XNH8QLYHUVLW\ (GPXQG7UDWW-U6FKRRORI(QJLQHHULQJ EGR 10L Spring 2018 Test 2 - Python Edition Shaundra B. Daily & Michael R. Gustafson II Name (please print): NetID (please print): In keeping with the Community
More informationSTATISTICAL THINKING IN PYTHON I. Probabilistic logic and statistical inference
STATISTICAL THINKING IN PYTHON I Probabilistic logic and statistical inference 50 measurements of petal length Statistical Thinking in Python I 50 measurements of petal length Statistical Thinking in Python
More informationTutorial Three: Loops and Conditionals
Tutorial Three: Loops and Conditionals Imad Pasha Chris Agostino February 18, 2015 1 Introduction In lecture Monday we learned that combinations of conditionals and loops could make our code much more
More informationCOMP3211 Report - Cart Pole Problem
COMP3211 Report - Cart Pole Problem Team members: CHAU Tsun Man (20265761) CHEUNG Wai Kwan (20272910) KOO Tin Lok (20344775) Task Definition The system is trying to solve the cart pole problem of the OpenAI
More informationManaging Uncertainty
Managing Uncertainty Bayesian Linear Regression and Kalman Filter December 4, 2017 Objectives The goal of this lab is multiple: 1. First it is a reminder of some central elementary notions of Bayesian
More informationDetecting Climate Change through Means and Extremes
Detecting Climate Change through Means and Extremes New Mexico Supercomputing Challenge Final Report April 6, 2016 Team 55 Los Alamos Middle School Team Member: Lillian Petersen Teacher: Adam Drew Mentor:
More informationVTU Edusat Programme 16
VTU Edusat Programme 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and
More informationThe Periodic Table By Primo Levi READ ONLINE
The Periodic Table By Primo Levi READ ONLINE Periodic Table, periodic table trends, periodic table quiz, Chemistry education software, whether in the form of chemistry lab simulations, data acquisition
More informationInstrument Cross-Comparisons and Automated Quality Control of Atmospheric Radiation Measurement Data
Instrument Cross-Comparisons and Automated Quality Control of Atmospheric Radiation Measurement Data S. Moore and G. Hughes ATK Mission Research Santa Barbara, California Introduction Within the Atmospheric
More informationSME 864 Mark Urban-Lurain
SME 864 Mark Urban-Lurain 1 Import data from non-excel sources Probe Software Web sites Other sources Organize data Structure file for analysis Clean values Analyze Summarize Statistics Graph 2 Get files
More informationFrequency and Histograms
Warm Up Lesson Presentation Lesson Quiz Algebra 1 Create stem-and-leaf plots. Objectives Create frequency tables and histograms. Vocabulary stem-and-leaf plot frequency frequency table histogram cumulative
More informationDiscrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test
Discrete distribution Fitting probability models to frequency data A probability distribution describing a discrete numerical random variable For example,! Number of heads from 10 flips of a coin! Number
More informationHIRES 2017 Syllabus. Instructors:
HIRES 2017 Syllabus Instructors: Dr. Brian Vant-Hull: Steinman 185, 212-650-8514, brianvh@ce.ccny.cuny.edu Ms. Hannah Aizenman: NAC 7/311, 212-650-6295, haizenman@ccny.cuny.edu Dr. Tarendra Lakhankar:
More informationComputer projects for Mathematical Statistics, MA 486. Some practical hints for doing computer projects with MATLAB:
Computer projects for Mathematical Statistics, MA 486. Some practical hints for doing computer projects with MATLAB: You can save your project to a text file (on a floppy disk or CD or on your web page),
More informationSTAT2201 Assignment 6 Semester 1, 2017 Due 26/5/2017
Class Example 1. Linear Regression Example The code below uses BrisGCtemp.csv, as appeared in a class example of assignment 3. This file contains temperature observations recorded in Brisbane and the GoldCoast.
More informationProduct Quality Disclaimer
ENVI-GSOP-EOGD-QD-4-49 Affected Data Sets Various Disclaimer Title L1b disclaimers with an impact on the ATS_AR 2P product L2 spatially averaged data are affected by the following disclaimers against the
More informationPytorch Tutorial. Xiaoyong Yuan, Xiyao Ma 2018/01
(Li Lab) National Science Foundation Center for Big Learning (CBL) Department of Electrical and Computer Engineering (ECE) Department of Computer & Information Science & Engineering (CISE) Pytorch Tutorial
More informationf = Xw + b, We can compute the total square error of the function values above, compared to the observed training set values:
Linear regression Much of machine learning is about fitting functions to data. That may not sound like an exciting activity that will give us artificial intelligence. However, representing and fitting
More information/home/thierry/columbia/msongsdb/pyreport_tutorials/tutorial1/tutorial1.py January 23, 20111
/home/thierry/columbia/msongsdb/pyreport_tutorials/tutorial1/tutorial1.py January 23, 20111 27 """ 28 Tutorial for the Million Song Dataset 29 30 by Thierry Bertin - Mahieux ( 2011) Columbia University
More informationStudy Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33]
GOALS: 1. Learn the properties of the χ 2 Distribution. 2. Understand how the shape of the χ 2 Distribution changes as the df increases. 3. Be able to find p values. 4. Recognize that χ 2 tests are right
More informationInSchedule File Upload Definition
InSchedule File Upload Definition Jennifer Long Sr Business Analyst PJM Interconnection PJM 2012 I. Purpose This document identifies the file upload process for the InSchedule application. It will discuss
More information<br /> D. Thiebaut <br />August """Example of DNNRegressor for Housing dataset.""" In [94]:
sklearn Tutorial: Linear Regression on Boston Data This is following the https://github.com/tensorflow/tensorflow/blob/maste
More informationShootout 2017 Problem Statement
Shootout 2017 Problem Statement Understanding Wildfires 1 Background In the fall of 2016, a number of wildfires occurred in the southeastern United States. Spanning over 5 states and several weeks, more
More informationx k+1 = x k + α k p k (13.1)
13 Gradient Descent Methods Lab Objective: Iterative optimization methods choose a search direction and a step size at each iteration One simple choice for the search direction is the negative gradient,
More informationData Intensive Computing Handout 11 Spark: Transitive Closure
Data Intensive Computing Handout 11 Spark: Transitive Closure from random import Random spark/transitive_closure.py num edges = 3000 num vertices = 500 rand = Random(42) def generate g raph ( ) : edges
More informationOCEAN & SEA ICE SAF CDOP2. OSI-SAF Metop-A IASI Sea Surface Temperature L2P (OSI-208) Validation report. Version 1.4 April 2015
OCEAN & SEA ICE SAF CDOP2 OSI-SAF Metop-A IASI Sea Surface Temperature L2P (OSI-208) Validation report Version 1.4 April 2015 A. O Carroll and A. Marsouin EUMETSAT, Eumetsat-Allee 1, Darmstadt 64295, Germany
More informationBINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference
BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40
More informationData Analysis of NYC Cab Services
Data Analysis of NYC Cab Services Abhinandan Dubey Stony Brook University New York, USA Raju Khanal Stony Brook University New York, USA [adubey, rkhanal, lmadiraju]@cs.stonybrook.edu Teja Madiraju Stony
More informationPre-Algebra Semester 1 Practice Exam B DRAFT
. Evaluate x y 5 6 80 when x = 0 and y =.. Which expression is equivalent to? + + + +. In Pre-Algebra class, we follow the order of operations in evaluating expressions. Which operation should a student
More informationBinomial random variable
Binomial random variable Toss a coin with prob p of Heads n times X: # Heads in n tosses X is a Binomial random variable with parameter n,p. X is Bin(n, p) An X that counts the number of successes in many
More informationStatistical methods in NLP Introduction
Statistical methods in NLP Introduction Richard Johansson January 20, 2015 today course matters analysing numerical data with Python basic notions of probability simulating random events in Python overview
More informationHOMEWORK #4: LOGISTIC REGRESSION
HOMEWORK #4: LOGISTIC REGRESSION Probabilistic Learning: Theory and Algorithms CS 274A, Winter 2018 Due: Friday, February 23rd, 2018, 11:55 PM Submit code and report via EEE Dropbox You should submit a
More informationSECTION 7: CURVE FITTING. MAE 4020/5020 Numerical Methods with MATLAB
SECTION 7: CURVE FITTING MAE 4020/5020 Numerical Methods with MATLAB 2 Introduction Curve Fitting 3 Often have data,, that is a function of some independent variable,, but the underlying relationship is
More informationWhat s Cooking? Predicting Cuisines from Recipe Ingredients
What s Cooking? Predicting Cuisines from Recipe Ingredients Kevin K. Do Department of Computer Science Duke University Durham, NC 27708 kevin.kydat.do@gmail.com Abstract Kaggle is an online platform for
More informationData science and engineering for local weather forecasts. Nikhil R Podduturi Data {Scientist, Engineer} November, 2016
1 Data science and engineering for local weather forecasts Nikhil R Podduturi Data {Scientist, Engineer} November, 2016 Agenda About MeteoGroup Introduction to weather data Problem description Data science
More informationCALIFORNIA INSTITUTE OF TECHNOLOGY Division of Physics, Mathematics, and Astronomy. Ay190 Computational Astrophysics
CALIFORNIA INSTITUTE OF TECHNOLOGY Division of Physics, Mathematics, and Astronomy Ay190 Computational Astrophysics Christian D. Ott, Andrew Benson, and Michael W. Eastwood cott@tapir.caltech.edu, abenson@tapir.caltech.edu,
More informationMark Scheme (Results) Summer GCE Physics (6PH07) Paper 01 Exploring Physics (Written Alternative)
Mark Scheme (Results) Summer 2012 GCE Physics (6PH07) Paper 01 Exploring Physics (Written Alternative) Edexcel and BTEC Qualifications Edexcel and BTEC qualifications come from Pearson, the world s leading
More informationStandards of Learning Content Review Notes. Grade 7 Mathematics 3 rd Nine Weeks,
Standards of Learning Content Review Notes Grade 7 Mathematics 3 rd Nine Weeks, 2016-2017 1 2 Content Review: Standards of Learning in Detail Grade 7 Mathematics: Third Nine Weeks 2016-2017 This resource
More informationNOAA s Climate Normals. Pre-release Webcast presented by NOAA s National Climatic Data Center June 13, 2011
NOAA s 1981-2010 Climate Normals Pre-release Webcast presented by NOAA s National Climatic Data Center June 13, 2011 Takeaway Messages Most Normals will be available July 1 via FTP NWS Normals to be loaded
More informationMinimum semi-major axis of extrasolar planets in relation to dust sublimation zones
Minimum semi-major axis of extrasolar planets in relation to dust sublimation zones Frane Lunić Supervisor: assoc.prof.dr. Dejan Vinković Split, September 2014 Bachelor Thesis in Physics Department of
More information