Introduction Snippets Conclusion References. Beamer snippets. October 21, Decock Beamer snippets

Size: px
Start display at page:

Download "Introduction Snippets Conclusion References. Beamer snippets. October 21, Decock Beamer snippets"

Transcription

1 Jérémie DECOCK October 21, 2014

2 Introduction 2

3 Introduction TODO TODO TODO 3

4 4

5 Basic frame Subtitle... 5

6 Citations and references cite, label and ref commands Eq. (1) define the Bellman equation [Bel57] V (x) = max {F (x, a) + βv (T (x, a))} (1) a Γ(x) 6

7 Lists itemize, enumerate and description commands item 1 item item 1 2. item First item 1 Second item 2 Last... 7

8 Colors color environment small footnotesize scriptsize tiny 8

9 Fonts color color environment Red Green Blue 9

10 Centered image includegraphics command 10

11 Subfigures figure, subfigure and includegraphics commands 11

12 Blocks block command Block 1 Blablabla Block 2 Blablabla 12

13 Equations V (x) = max {F (x, a) + βv (T (x, a))} a Γ(x) V (x) = max {F (x, a) + βv (T (x, a))} a Γ(x) V (x) = max {F (x, a) + βv (T (x, a))} (2) a Γ(x) 13

14 Equation array eqnarray command Expectation of N = n E(Z i ) i=1 n β γ c(d) = d i=1 β/2 i αβ γ n 1 = c(d)β dβ/2 i i=1 αβ = z Variance of N = n V (Z i ) (3) i=1 n E(Z i ) (as V (Z i ) E(Z i )) (4) i=1 z 14

15 Matrices a 1,1 a 1,2 a 1,n a 2,1 a 2,2 a 2,n A m,n = a m,1 a m,2 a m,n M = ( x y ) A 1 0 M = B

16 Systems of equation array { n/2 if n is even f (n) = (n + 1)/2 if n is odd 16

17 Mathematical programming with align max z = 4x 1 + 7x 2 s.t. 3x 1 + 5x 2 6 (5) x 1 + 2x 2 8 (6) x 1, x

18 Mathematical programming with alignat Max z = x x 2 s.t. 13x 1 + x x 3 5 x 1 + x x 1 + x 2 = 14 x j 0, j = 1, 2, 3. Max z = x x 2 s.t. 13x 1 + x x 3 5 (7) x 1 + x 3 16 (8) 15x 1 + x 2 = 14 (9) x j 0, j = 1, 2, 3. 18

19 Animations Slide

20 Animations Slide

21 Animations Slide

22 Algorithms algorithmic command Require: S, A, T, R, an MDP γ, the discount factor ɛ, the maximum error allowed in the utility of any state in an iteration Local variables: U, U, vector of utilities for states in S, initially zero δ, the maximum change in the utility of any state in an iteration repeat U U δ 0 for all s S do U [s] R[s] + γ max a s T (s, a, s )U[s ] if U [s] U[s] > δ then δ U [s] U[s] end if end for until δ < ɛ(1 γ)/γ return U 20

23 Verbatim To insert a verbatim paragraph, the frame have to be declared fragile. The title has to be written in frametitle command, not as argument of frame (I don t know why... )..--. o_o :_/ // \ \ ( ) / \ / \ \ )=( / # gcc -o hello hello.c 21

24 Listings I 1 #!/ usr / bin / env python 2 # -*- coding : utf -8 -*- 3 4 # Author : Jéré mie 5 6 def main (): 7 """ Main function """ print " Hello world!" 11 if name == main : 12 main () listings/test.py 22

25 Table tabular command γ = 1 (small noise) γ < 1 (large noise) 1 Proved rate for R-EDA β α 1 2β α Former lower bounds α 1 α 1 R-EDA experimental rates α = 1 β α = 1 2β Rate by active learning α = 1 2 α =

26 Multi-columns columns and column commands Blablabla 24

27 URL JDHP 25

28 Conclusion 26

29 Conclusion TODO TODO TODO 27

30 References I Richard Ernest Bellman, Dynamic programming, Princeton University Press, Princeton, New Jersey, USA,

Latex Review Beamer: Presentations and Posters Citations & References. L A TEXWorkshop. Mai Ha Vu. University of Delaware.

Latex Review Beamer: Presentations and Posters Citations & References. L A TEXWorkshop. Mai Ha Vu. University of Delaware. L A TEXWorkshop Mai Ha Vu University of Delaware February 10, 2016 1 / 22 Table of Contents Latex Review Beamer: Presentations and Posters Citations & References 2 / 22 Getting started 1. L A TEXdistribution

More information

Joy Allen. March 12, 2015

Joy Allen. March 12, 2015 LATEX Newcastle University March 12, 2015 Why use L A TEX? Extremely useful tool for writing scientific papers, presentations and PhD Thesis which are heavily laden with mathematics You get lovely pretty

More information

L A TEXtutorial. The very basics for typesetting research papers. Dr. Robin Roche. February 23, 2018

L A TEXtutorial. The very basics for typesetting research papers. Dr. Robin Roche. February 23, 2018 L A TEXtutorial The very basics for typesetting research papers Dr. Robin Roche February 23, 2018 Dr. Robin Roche LATEXtutorial February 23, 2018 1 / 11 What is L A TEX? L A TEX(pronounced latek ) is a

More information

Managing Uncertainty

Managing Uncertainty Managing Uncertainty Bayesian Linear Regression and Kalman Filter December 4, 2017 Objectives The goal of this lab is multiple: 1. First it is a reminder of some central elementary notions of Bayesian

More information

Decision Theory: Markov Decision Processes

Decision Theory: Markov Decision Processes Decision Theory: Markov Decision Processes CPSC 322 Lecture 33 March 31, 2006 Textbook 12.5 Decision Theory: Markov Decision Processes CPSC 322 Lecture 33, Slide 1 Lecture Overview Recap Rewards and Policies

More information

Decision Theory: Q-Learning

Decision Theory: Q-Learning Decision Theory: Q-Learning CPSC 322 Decision Theory 5 Textbook 12.5 Decision Theory: Q-Learning CPSC 322 Decision Theory 5, Slide 1 Lecture Overview 1 Recap 2 Asynchronous Value Iteration 3 Q-Learning

More information

Title of the Book. A. U. Thor XYZ University

Title of the Book. A. U. Thor XYZ University Title of the Book A. U. Thor XYZ University ii Copyright c1997 by Author Preface Preface Head This is the preface and it is created using a TeX field in a paragraph by itself. When the document is loaded,

More information

Tox Issues with virtualenv GNU Guix guix-tox. Guix-tox. A functional version of tox. Cyril Roelandt January 30, /35

Tox Issues with virtualenv GNU Guix guix-tox. Guix-tox. A functional version of tox. Cyril Roelandt January 30, /35 A functional version of tox cyril@redhat.com January 30, 2016 1/35 OpenStack developer at Red Hat since 2013 developer 2/35 1 2 Only Python packages can be installed Reproducibility One package manager

More information

What is Beamer?! Introduction to Beamer Beamer is a LaTeX class for creating slides for presentations. Commands for Header and the Title Page

What is Beamer?! Introduction to Beamer Beamer is a LaTeX class for creating slides for presentations. Commands for Header and the Title Page Introduction to Beamer Beamer is a LaTeX class for creating slides for presentations Steven G. Wicker Winston Salem, NC wickersg@wfu.edu Updated October, 2011 What is Beamer?! Beamer is a LaTeX class for

More information

How to make a presentation with L A TEX?

How to make a presentation with L A TEX? How to make a presentation with L A TEX? Department of computer science King Saud University November 23, 2015 1 / 49 Contents 1 to L A TEX 2 2 / 49 to L A TEX L A TEX( pronounced /"la:tex/, /"la:tek/,

More information

Notater: INF3331. Veronika Heimsbakk December 4, Introduction 3

Notater: INF3331. Veronika Heimsbakk December 4, Introduction 3 Notater: INF3331 Veronika Heimsbakk veronahe@student.matnat.uio.no December 4, 2013 Contents 1 Introduction 3 2 Bash 3 2.1 Variables.............................. 3 2.2 Loops...............................

More information

CS429: Computer Organization and Architecture

CS429: Computer Organization and Architecture CS429: Computer Organization and Architecture Warren Hunt, Jr. and Bill Young Department of Computer Sciences University of Texas at Austin Last updated: September 3, 2014 at 08:38 CS429 Slideset C: 1

More information

, and rewards and transition matrices as shown below:

, and rewards and transition matrices as shown below: CSE 50a. Assignment 7 Out: Tue Nov Due: Thu Dec Reading: Sutton & Barto, Chapters -. 7. Policy improvement Consider the Markov decision process (MDP) with two states s {0, }, two actions a {0, }, discount

More information

COMP3211 Report - Cart Pole Problem

COMP3211 Report - Cart Pole Problem COMP3211 Report - Cart Pole Problem Team members: CHAU Tsun Man (20265761) CHEUNG Wai Kwan (20272910) KOO Tin Lok (20344775) Task Definition The system is trying to solve the cart pole problem of the OpenAI

More information

BASIC TECHNOLOGY Pre K starts and shuts down computer, monitor, and printer E E D D P P P P P P P P P P

BASIC TECHNOLOGY Pre K starts and shuts down computer, monitor, and printer E E D D P P P P P P P P P P BASIC TECHNOLOGY Pre K 1 2 3 4 5 6 7 8 9 10 11 12 starts and shuts down computer, monitor, and printer P P P P P P practices responsible use and care of technology devices P P P P P P opens and quits an

More information

Due dates are as mentioned above. Checkoff interviews for PS4 and PS5 will happen together between October 24 and 28, 2012.

Due dates are as mentioned above. Checkoff interviews for PS4 and PS5 will happen together between October 24 and 28, 2012. Problem Set 5 Your answers will be graded by actual human beings (at least that's what we believe!), so don't limit your answers to machine-gradable responses. Some of the questions specifically ask for

More information

Introduction to Beamer

Introduction to Beamer Introduction to Beamer Beamer is a LaTeX class for creating slides for presentations Steven Wicker Winston Salem, NC wickersg@wfu.edu June 16, 2009 How to Get Beamer Go to http://latex-beamer.sourceforge.net/

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning Markov decision process & Dynamic programming Evaluative feedback, value function, Bellman equation, optimality, Markov property, Markov decision process, dynamic programming, value

More information

MDP Preliminaries. Nan Jiang. February 10, 2019

MDP Preliminaries. Nan Jiang. February 10, 2019 MDP Preliminaries Nan Jiang February 10, 2019 1 Markov Decision Processes In reinforcement learning, the interactions between the agent and the environment are often described by a Markov Decision Process

More information

ECE661: Homework 6. Ahmed Mohamed October 28, 2014

ECE661: Homework 6. Ahmed Mohamed October 28, 2014 ECE661: Homework 6 Ahmed Mohamed (akaseb@purdue.edu) October 28, 2014 1 Otsu Segmentation Algorithm Given a grayscale image, my implementation of the Otsu algorithm follows these steps: 1. Construct a

More information

Python & Numpy A tutorial

Python & Numpy A tutorial Python & Numpy A tutorial Devert Alexandre School of Software Engineering of USTC 13 February 2012 Slide 1/38 Table of Contents 1 Why Python & Numpy 2 First steps with Python 3 Fun with lists 4 Quick tour

More information

Special types of matrices

Special types of matrices Roberto s Notes on Linear Algebra Chapter 4: Matrix Algebra Section 2 Special types of matrices What you need to know already: What a matrix is. The basic terminology and notation used for matrices. What

More information

Lecture 4: Approximate dynamic programming

Lecture 4: Approximate dynamic programming IEOR 800: Reinforcement learning By Shipra Agrawal Lecture 4: Approximate dynamic programming Deep Q Networks discussed in the last lecture are an instance of approximate dynamic programming. These are

More information

Figure 1: Bayes Net. (a) (2 points) List all independence and conditional independence relationships implied by this Bayes net.

Figure 1: Bayes Net. (a) (2 points) List all independence and conditional independence relationships implied by this Bayes net. 1 Bayes Nets Unfortunately during spring due to illness and allergies, Billy is unable to distinguish the cause (X) of his symptoms which could be: coughing (C), sneezing (S), and temperature (T). If he

More information

First and last name of the author. Title of the thesis BACHELOR THESIS. Charles University in Prague Faculty of Mathematics and Physics

First and last name of the author. Title of the thesis BACHELOR THESIS. Charles University in Prague Faculty of Mathematics and Physics Charles University in Prague Faculty of Mathematics and Physics BACHELOR THESIS First and last name of the author Title of the thesis Name of the department or institute Supervisor of the bachelor thesis:

More information

CS260: Machine Learning Algorithms

CS260: Machine Learning Algorithms CS260: Machine Learning Algorithms Lecture 4: Stochastic Gradient Descent Cho-Jui Hsieh UCLA Jan 16, 2019 Large-scale Problems Machine learning: usually minimizing the training loss min w { 1 N min w {

More information

HW5 computer problem: modelling a dye molecule Objectives By the end of this lab you should: Know how to manipulate complex numbers and matrices. Prog

HW5 computer problem: modelling a dye molecule Objectives By the end of this lab you should: Know how to manipulate complex numbers and matrices. Prog HW5 computer problem: modelling a dye molecule Objectives By the end of this lab you should: Know how to manipulate complex numbers and matrices. Programming environment: Spyder. To implement this, do

More information

IE 361 Module 4. Modeling Measurement. Reading: Section 2.1 Statistical Methods for Quality Assurance. ISU and Analytics Iowa LLC

IE 361 Module 4. Modeling Measurement. Reading: Section 2.1 Statistical Methods for Quality Assurance. ISU and Analytics Iowa LLC IE 361 Module 4 Modeling Measurement Reading: Section 2.1 Statistical Methods for Quality Assurance ISU and Analytics Iowa LLC (ISU and Analytics Iowa LLC) IE 361 Module 4 1 / 12 Using Probability to Describe

More information

Internet Monetization

Internet Monetization Internet Monetization March May, 2013 Discrete time Finite A decision process (MDP) is reward process with decisions. It models an environment in which all states are and time is divided into stages. Definition

More information

Material/Chemical Structure and Function

Material/Chemical Structure and Function Due Date: Dec. 14, 2017 Material/Chemical Structure and Function Objective: PStd. 1 Investigate how the structure of a chemical matches its function. Analyze and apply scientific information to issues

More information

Algorithms Exam TIN093 /DIT602

Algorithms Exam TIN093 /DIT602 Algorithms Exam TIN093 /DIT602 Course: Algorithms Course code: TIN 093, TIN 092 (CTH), DIT 602 (GU) Date, time: 21st October 2017, 14:00 18:00 Building: SBM Responsible teacher: Peter Damaschke, Tel. 5405

More information

Introduction to Python and its unit testing framework

Introduction to Python and its unit testing framework Introduction to Python and its unit testing framework Instructions These are self evaluation exercises. If you know no python then work through these exercises and it will prepare yourself for the lab.

More information

Introduction to Reinforcement Learning Part 1: Markov Decision Processes

Introduction to Reinforcement Learning Part 1: Markov Decision Processes Introduction to Reinforcement Learning Part 1: Markov Decision Processes Rowan McAllister Reinforcement Learning Reading Group 8 April 2015 Note I ve created these slides whilst following Algorithms for

More information

Christopher Watkins and Peter Dayan. Noga Zaslavsky. The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015

Christopher Watkins and Peter Dayan. Noga Zaslavsky. The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015 Q-Learning Christopher Watkins and Peter Dayan Noga Zaslavsky The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015 Noga Zaslavsky Q-Learning (Watkins & Dayan, 1992)

More information

Recurrent Problems. ITT9131 Konkreetne Matemaatika. Chapter One The Tower of Hanoi Lines in the Plane The Josephus Problem

Recurrent Problems. ITT9131 Konkreetne Matemaatika. Chapter One The Tower of Hanoi Lines in the Plane The Josephus Problem Recurrent Problems ITT93 Konkreetne Matemaatika Chapter One The Tower of Hanoi Lines in the Plane The Josephus Problem Contents The Tower of Hanoi Lines in the Plane 3 The Josephus Problem 4 Intermezzo:

More information

Chart 3 Data in Array

Chart 3 Data in Array Chart 3 Data in Array 3.1. How to Handle Data in Table Form 3.1.1 Create Matrix and Tables 1. Choose an input mode cell where you like to create a table. Next, choose Insert (I) - Table & Matrix (M) -

More information

1 MDP Value Iteration Algorithm

1 MDP Value Iteration Algorithm CS 0. - Active Learning Problem Set Handed out: 4 Jan 009 Due: 9 Jan 009 MDP Value Iteration Algorithm. Implement the value iteration algorithm given in the lecture. That is, solve Bellman s equation using

More information

Markov Decision Processes

Markov Decision Processes Markov Decision Processes Noel Welsh 11 November 2010 Noel Welsh () Markov Decision Processes 11 November 2010 1 / 30 Annoucements Applicant visitor day seeks robot demonstrators for exciting half hour

More information

STAT 111 Recitation 7

STAT 111 Recitation 7 STAT 111 Recitation 7 Xin Lu Tan xtan@wharton.upenn.edu October 25, 2013 1 / 13 Miscellaneous Please turn in homework 6. Please pick up homework 7 and the graded homework 5. Please check your grade and

More information

Computation Theory Finite Automata

Computation Theory Finite Automata Computation Theory Dept. of Computing ITT Dublin October 14, 2010 Computation Theory I 1 We would like a model that captures the general nature of computation Consider two simple problems: 2 Design a program

More information

Biostatistics Student Seminar An Introduction To L A T E X

Biostatistics Student Seminar An Introduction To L A T E X Biostatistics Student Seminar An Introduction To L A T E X Josh Murray Osvaldo Espin-Garcia (edition) Dalla Lana School of Public Health University of Toronto Introduction There are four stages to creating

More information

Reinforcement Learning. George Konidaris

Reinforcement Learning. George Konidaris Reinforcement Learning George Konidaris gdk@cs.brown.edu Fall 2017 Machine Learning Subfield of AI concerned with learning from data. Broadly, using: Experience To Improve Performance On Some Task (Tom

More information

THE PERIODIC TABLE Element Research Project

THE PERIODIC TABLE Element Research Project NAME PER DUE DATE: Monday -April 12, 2016 MAIL BOX THE PERIODIC TABLE Element Research Project Rough Draft Progress Check - April 1 Rough Draft Complete - April 8 Pictures for poster printed & or poster

More information

Lab 6: Linear Algebra

Lab 6: Linear Algebra 6.1 Introduction Lab 6: Linear Algebra This lab is aimed at demonstrating Python s ability to solve linear algebra problems. At the end of the assignment, you should be able to write code that sets up

More information

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Dynamic Programming Marc Toussaint University of Stuttgart Winter 2018/19 Motivation: So far we focussed on tree search-like solvers for decision problems. There is a second important

More information

CSE 546 Final Exam, Autumn 2013

CSE 546 Final Exam, Autumn 2013 CSE 546 Final Exam, Autumn 0. Personal info: Name: Student ID: E-mail address:. There should be 5 numbered pages in this exam (including this cover sheet).. You can use any material you brought: any book,

More information

Quantum space. Quantum space: designing pages on grids. TUG X 004 Wed 30 Jun Jean-luc Doumont TUG 2010 Wed 30 Jun 2010

Quantum space. Quantum space: designing pages on grids. TUG X 004 Wed 30 Jun Jean-luc Doumont  TUG 2010 Wed 30 Jun 2010 Quantum space Jean-luc Doumont www.principiae.be TUG 2010 Wed 30 Jun 2010 I have found that all ugly things are made by those who strive to make something beautiful, Oscar Wilde I have found that all ugly

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation Guy Lebanon February 19, 2011 Maximum likelihood estimation is the most popular general purpose method for obtaining estimating a distribution from a finite sample. It was

More information

Q-Learning in Continuous State Action Spaces

Q-Learning in Continuous State Action Spaces Q-Learning in Continuous State Action Spaces Alex Irpan alexirpan@berkeley.edu December 5, 2015 Contents 1 Introduction 1 2 Background 1 3 Q-Learning 2 4 Q-Learning In Continuous Spaces 4 5 Experimental

More information

Write a simple 1D DFT code in Python

Write a simple 1D DFT code in Python Write a simple 1D DFT code in Python Ask Hjorth Larsen, asklarsen@gmail.com Keenan Lyon, lyon.keenan@gmail.com September 15, 2018 Overview Our goal is to write our own KohnSham (KS) density functional

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning Inverse Reinforcement Learning LfD, imitation learning/behavior cloning, apprenticeship learning, IRL. Hung Ngo MLR Lab, University of Stuttgart Outline Learning from Demonstrations

More information

Hidden Markov Model Documentation

Hidden Markov Model Documentation Hidden Markov Model Documentation Release 0.3 Rahul Ramesh Aug 22, 2017 Table of Contents 1 Introduction 3 1.1 Installation................................................ 3 1.2 Requirements...............................................

More information

PHYS 114 Exam 1 Answer Key NAME:

PHYS 114 Exam 1 Answer Key NAME: PHYS 4 Exam Answer Key AME: Please answer all of the questions below. Each part of each question is worth points, except question 5, which is worth 0 points.. Explain what the following MatLAB commands

More information

2-5 Algebraic Proof. Warm Up Lesson Presentation Lesson Quiz. Holt McDougal Geometry

2-5 Algebraic Proof. Warm Up Lesson Presentation Lesson Quiz. Holt McDougal Geometry 2-5 Algebraic Proof Warm Up Lesson Presentation Lesson Quiz Geometry Warm Up Solve each equation. 1. 3x + 5 = 17 4. x = 4 2. r 3.5 = 8.7 r = 12.2 3. 4t 7 = 8t + 3 t = 5 2 n = 38 5. 2(y 5) 20 = 0 y = 15

More information

6.02 Fall 2012 Lecture #1

6.02 Fall 2012 Lecture #1 6.02 Fall 2012 Lecture #1 Digital vs. analog communication The birth of modern digital communication Information and entropy Codes, Huffman coding 6.02 Fall 2012 Lecture 1, Slide #1 6.02 Fall 2012 Lecture

More information

Submitted to Econometrica A SAMPLE DOCUMENT 1

Submitted to Econometrica A SAMPLE DOCUMENT 1 Submitted to Econometrica A SAMPLE DOCUMENT 1 First Author 2,3,, Second Author 4 and and Third Author The abstract should summarize the contents of the paper. It should be clear, descriptive, self-explanatory

More information

UC Berkeley Department of Electrical Engineering and Computer Science Department of Statistics. EECS 281A / STAT 241A Statistical Learning Theory

UC Berkeley Department of Electrical Engineering and Computer Science Department of Statistics. EECS 281A / STAT 241A Statistical Learning Theory UC Berkeley Department of Electrical Engineering and Computer Science Department of Statistics EECS 281A / STAT 241A Statistical Learning Theory Solutions to Problem Set 2 Fall 2011 Issued: Wednesday,

More information

PPT PRESENTATION. What works, and what doesn t. Associate Dean USF Graduate School. Revised

PPT PRESENTATION. What works, and what doesn t. Associate Dean USF Graduate School. Revised PPT PRESENTATION TUTORIAL What works, and what doesn t RS Pollenz, PhD Associate Dean USF Graduate School Revised 5.5.10 Acknowledgment (This tutorial was based on a previous document constructed by by

More information

RESEARCH ARTICLE. Linear Magic Rectangles

RESEARCH ARTICLE. Linear Magic Rectangles Linear and Multilinear Algebra Vol 00, No 00, January 01, 1 7 RESEARCH ARTICLE Linear Magic Rectangles John Lorch (Received 00 Month 00x; in final form 00 Month 00x) We introduce a method for producing

More information

Dynamic Programming. Cormen et. al. IV 15

Dynamic Programming. Cormen et. al. IV 15 Dynamic Programming Cormen et. al. IV 5 Dynamic Programming Applications Areas. Bioinformatics. Control theory. Operations research. Some famous dynamic programming algorithms. Unix diff for comparing

More information

Counting self-avoiding paths in the plane with dual variables

Counting self-avoiding paths in the plane with dual variables 1 Counting self-avoiding paths in the plane with dual variables This project grew out of discussions with Persi Diaconis on a method devised by Donald Knuth [1] for counting self-avoiding paths in the

More information

The Binary Self-Dual Codes of Length Up To 32: A Revised Enumeration*

The Binary Self-Dual Codes of Length Up To 32: A Revised Enumeration* The Binary Self-Dual Codes of Length Up To 32: A Revised Enumeration* J. H. Conway Mathematics Department Princeton University Princeton, New Jersey 08540 V. Pless** Mathematics Department University of

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning Ron Parr CompSci 7 Department of Computer Science Duke University With thanks to Kris Hauser for some content RL Highlights Everybody likes to learn from experience Use ML techniques

More information

Lecture 5b: Starting Matlab

Lecture 5b: Starting Matlab Lecture 5b: Starting Matlab James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University August 7, 2013 Outline 1 Resources 2 Starting Matlab 3 Homework

More information

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 http://intelligentoptimization.org/lionbook Roberto Battiti

More information

9A. Iteration with range. Topics: Using for with range Summation Computing Min s Functions and for-loops A Graphics Applications

9A. Iteration with range. Topics: Using for with range Summation Computing Min s Functions and for-loops A Graphics Applications 9A. Iteration with range Topics: Using for with range Summation Computing Min s Functions and for-loops A Graphics Applications Iterating Through a String s = abcd for c in s: print c Output: a b c d In

More information

Selection and Adversary Arguments. COMP 215 Lecture 19

Selection and Adversary Arguments. COMP 215 Lecture 19 Selection and Adversary Arguments COMP 215 Lecture 19 Selection Problems We want to find the k'th largest entry in an unsorted array. Could be the largest, smallest, median, etc. Ideas for an n lg n algorithm?

More information

This question has three parts, each of which can be answered concisely, but be prepared to explain and justify your concise answer.

This question has three parts, each of which can be answered concisely, but be prepared to explain and justify your concise answer. This question has three parts, each of which can be answered concisely, but be prepared to explain and justify your concise answer. 1. Suppose you have a policy and its action-value function, q, then you

More information

CSC321 Lecture 22: Q-Learning

CSC321 Lecture 22: Q-Learning CSC321 Lecture 22: Q-Learning Roger Grosse Roger Grosse CSC321 Lecture 22: Q-Learning 1 / 21 Overview Second of 3 lectures on reinforcement learning Last time: policy gradient (e.g. REINFORCE) Optimize

More information

Leap Frog Solar System

Leap Frog Solar System Leap Frog Solar System David-Alexander Robinson Sch. 08332461 20th November 2011 Contents 1 Introduction & Theory 2 1.1 The Leap Frog Integrator......................... 2 1.2 Class.....................................

More information

Package gtheory. October 30, 2016

Package gtheory. October 30, 2016 Package gtheory October 30, 2016 Version 0.1.2 Date 2016-10-22 Title Apply Generalizability Theory with R Depends lme4 Estimates variance components, generalizability coefficients, universe scores, and

More information

Name: INSERT YOUR NAME HERE. Due to dropbox by 6pm PDT, Wednesday, December 14, 2011

Name: INSERT YOUR NAME HERE. Due to dropbox by 6pm PDT, Wednesday, December 14, 2011 AMath 584 Name: INSERT YOUR NAME HERE Take-home Final UWNetID: INSERT YOUR NETID Due to dropbox by 6pm PDT, Wednesday, December 14, 2011 The main part of the assignment (Problems 1 3) is worth 80 points.

More information

Assignment 3: The Atomic Nature of Matter

Assignment 3: The Atomic Nature of Matter Introduction Assignment 3: The Atomic Nature of Matter Science One CS 2015-2016 Instructor: Mike Gelbart Last Updated Sunday, Jan 24, 2016 around 7:45pm Part One due Saturday, Jan 30, 2016 at 5:00pm Part

More information

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning CSCI-699: Advanced Topics in Deep Learning 01/16/2019 Nitin Kamra Spring 2019 Introduction to Reinforcement Learning 1 What is Reinforcement Learning? So far we have seen unsupervised and supervised learning.

More information

CS181 Midterm 2 Practice Solutions

CS181 Midterm 2 Practice Solutions CS181 Midterm 2 Practice Solutions 1. Convergence of -Means Consider Lloyd s algorithm for finding a -Means clustering of N data, i.e., minimizing the distortion measure objective function J({r n } N n=1,

More information

Introduction to Induction (LAMC, 10/14/07)

Introduction to Induction (LAMC, 10/14/07) Introduction to Induction (LAMC, 10/14/07) Olga Radko October 1, 007 1 Definitions The Method of Mathematical Induction (MMI) is usually stated as one of the axioms of the natural numbers (so-called Peano

More information

Process Robustness Studies

Process Robustness Studies Process Robustness Studies ST 435/535 Background When factors interact, the level of one can sometimes be chosen so that another has no effect on the response. If the second factor is controllable in a

More information

Manual for a computer class in ML

Manual for a computer class in ML Manual for a computer class in ML November 3, 2015 Abstract This document describes a tour of Machine Learning (ML) techniques using tools in MATLAB. We point to the standard implementations, give example

More information

10-701/ Machine Learning - Midterm Exam, Fall 2010

10-701/ Machine Learning - Midterm Exam, Fall 2010 10-701/15-781 Machine Learning - Midterm Exam, Fall 2010 Aarti Singh Carnegie Mellon University 1. Personal info: Name: Andrew account: E-mail address: 2. There should be 15 numbered pages in this exam

More information

Linear algebra and differential equations (Math 54): Lecture 10

Linear algebra and differential equations (Math 54): Lecture 10 Linear algebra and differential equations (Math 54): Lecture 10 Vivek Shende February 24, 2016 Hello and welcome to class! As you may have observed, your usual professor isn t here today. He ll be back

More information

Reinforcement Learning and Deep Reinforcement Learning

Reinforcement Learning and Deep Reinforcement Learning Reinforcement Learning and Deep Reinforcement Learning Ashis Kumer Biswas, Ph.D. ashis.biswas@ucdenver.edu Deep Learning November 5, 2018 1 / 64 Outlines 1 Principles of Reinforcement Learning 2 The Q

More information

Chapter 2 The text above the third display should say Three other examples.

Chapter 2 The text above the third display should say Three other examples. ERRATA Organic Chemistry, 6th Edition, by Marc Loudon Date of this release: October 10, 2018 (Items marked with (*) were corrected in the second printing.) (Items marked with ( ) were corrected in the

More information

Package ShrinkCovMat

Package ShrinkCovMat Type Package Package ShrinkCovMat Title Shrinkage Covariance Matrix Estimators Version 1.2.0 Author July 11, 2017 Maintainer Provides nonparametric Steinian shrinkage estimators

More information

Riemann Sums. Outline. James K. Peterson. September 15, Riemann Sums. Riemann Sums In MatLab

Riemann Sums. Outline. James K. Peterson. September 15, Riemann Sums. Riemann Sums In MatLab Riemann Sums James K. Peterson Department of Biological Sciences and Department of Mathematical Sciences Clemson University September 15, 2013 Outline Riemann Sums Riemann Sums In MatLab Abstract This

More information

Name: Student number:

Name: Student number: UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2018 EXAMINATIONS CSC321H1S Duration 3 hours No Aids Allowed Name: Student number: This is a closed-book test. It is marked out of 35 marks. Please

More information

Finite Time Bounds for Temporal Difference Learning with Function Approximation: Problems with some state-of-the-art results

Finite Time Bounds for Temporal Difference Learning with Function Approximation: Problems with some state-of-the-art results Finite Time Bounds for Temporal Difference Learning with Function Approximation: Problems with some state-of-the-art results Chandrashekar Lakshmi Narayanan Csaba Szepesvári Abstract In all branches of

More information

Activity: Derive a matrix from input-output pairs

Activity: Derive a matrix from input-output pairs Activity: Derive a matrix from input-output pairs [ ] a b The 2 2 matrix A = satisfies the following equations: c d Calculate the entries of the matrix. [ ] [ ] a b 5 c d 10 [ ] [ ] a b 2 c d 1 = = [ ]

More information

Probabilistic Planning. George Konidaris

Probabilistic Planning. George Konidaris Probabilistic Planning George Konidaris gdk@cs.brown.edu Fall 2017 The Planning Problem Finding a sequence of actions to achieve some goal. Plans It s great when a plan just works but the world doesn t

More information

Reinforcement Learning. Yishay Mansour Tel-Aviv University

Reinforcement Learning. Yishay Mansour Tel-Aviv University Reinforcement Learning Yishay Mansour Tel-Aviv University 1 Reinforcement Learning: Course Information Classes: Wednesday Lecture 10-13 Yishay Mansour Recitations:14-15/15-16 Eliya Nachmani Adam Polyak

More information

Biology and Biology Honors Summer Reading Assignment 2018

Biology and Biology Honors Summer Reading Assignment 2018 Biology and Biology Honors Summer Reading Assignment 2018 Directions: You must read TWO (2) articles from the following RFH LibGuide: http://rumsonfairhaven.libguides.com/content.php?pid=710149 While reading

More information

1. (3 pts) In MDPs, the values of states are related by the Bellman equation: U(s) = R(s) + γ max a

1. (3 pts) In MDPs, the values of states are related by the Bellman equation: U(s) = R(s) + γ max a 3 MDP (2 points). (3 pts) In MDPs, the values of states are related by the Bellman equation: U(s) = R(s) + γ max a s P (s s, a)u(s ) where R(s) is the reward associated with being in state s. Suppose now

More information

CSC 344 Algorithms and Complexity. Proof by Mathematical Induction

CSC 344 Algorithms and Complexity. Proof by Mathematical Induction CSC 344 Algorithms and Complexity Lecture #1 Review of Mathematical Induction Proof by Mathematical Induction Many results in mathematics are claimed true for every positive integer. Any of these results

More information

VPython Class 2: Functions, Fields, and the dreaded ˆr

VPython Class 2: Functions, Fields, and the dreaded ˆr Physics 212E Classical and Modern Physics Spring 2016 1. Introduction VPython Class 2: Functions, Fields, and the dreaded ˆr In our study of electric fields, we start with a single point charge source

More information

Outline Python, Numpy, and Matplotlib Making Models with Polynomials Making Models with Monte Carlo Error, Accuracy and Convergence Floating Point Mod

Outline Python, Numpy, and Matplotlib Making Models with Polynomials Making Models with Monte Carlo Error, Accuracy and Convergence Floating Point Mod Outline Python, Numpy, and Matplotlib Making Models with Polynomials Making Models with Monte Carlo Error, Accuracy and Convergence Floating Point Modeling the World with Arrays The World in a Vector What

More information

Adding and Subtracting Terms

Adding and Subtracting Terms Adding and Subtracting Terms 1.6 OBJECTIVES 1.6 1. Identify terms and like terms 2. Combine like terms 3. Add algebraic expressions 4. Subtract algebraic expressions To find the perimeter of (or the distance

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Technische Universität München SS 2017 Lehrstuhl für Informatik V Dr. Tobias Neckel M. Sc. Ionuț Farcaș April 26, 2017 Algorithms for Uncertainty Quantification Tutorial 1: Python overview In this worksheet,

More information

Chapter 10. Finding Independent Features. Pavol Olejar

Chapter 10. Finding Independent Features. Pavol Olejar Chapter 10 Finding Independent Features Pavol Olejar Agenda Introduction Corpus of News Previous Approaches Non-negative Matrix Factorization Break Displaying the Results Using Stock Market Data Questions?

More information

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016 Course 16:198:520: Introduction To Artificial Intelligence Lecture 13 Decision Making Abdeslam Boularias Wednesday, December 7, 2016 1 / 45 Overview We consider probabilistic temporal models where the

More information

Logistic Regression - problem 6.14

Logistic Regression - problem 6.14 Logistic Regression - problem 6.14 Let x 1, x 2,, x m be given values of an input variable x and let Y 1,, Y m be independent binomial random variables whose distributions depend on the corresponding values

More information

For final project discussion every afternoon Mark and I will be available

For final project discussion every afternoon Mark and I will be available Worshop report 1. Daniels report is on website 2. Don t expect to write it based on listening to one project (we had 6 only 2 was sufficient quality) 3. I suggest writing it on one presentation. 4. Include

More information