Dennis Bricker, 2001 Dept of Industrial Engineering The University of Iowa. MDP: Taxi page 1
|
|
- Hilary Atkins
- 5 years ago
- Views:
Transcription
1 Denns Brcker, 2001 Dept of Industrl Engneerng The Unversty of Iow MDP: Tx pge 1
2 A tx serves three djcent towns: A, B, nd C. Ech tme the tx dschrges pssenger, the drver must choose from three possble ctons: (1) "Cruse" the streets lookng for pssenger. (2) Go to the nerest tx stnd (hotel, trn stton, etc.) (3) Wt for rdo cll from the dsptcher wth nstructons (but not possble n town B becuse of dstnce nd poor recepton). MDP: Tx pge 2
3 MDP model: Sttes: {A, B, C} Acton sets: KA = {1,2,3}, KB = {1,2,3}, KC = {1,2} Trnston probblty mtrces Crusng streets Wtng t tx stnd Wtng for dsptch P = P = P = MDP: Tx pge 3
4 Pyoff mtrces (expected proft per pssenger): k R j = expected proft f cton k s selected, nd pssenger wshes to trvel from town to town j Crusng streets Wtng t tx stnd Dsptch cll R = R = R = Snce our model ssumes mnmzton of cost, we use k k k C = PR j j j Note: Ths exmple ws ntroduced by Ron Howrd n hs textbook, Dynmc Progrmmng nd Mrkov Processes, MIT Press (1960), n whch no consderton ws gven to the vrble mount of tme per stge (trp) n the optmzton model. MDP: Tx pge 4
5 Sttes: Actons: stte 1 town A 2 town B 3 town C k cton 1 CRUISE 2 TAXISTAND 3 RADIO CALL Cost Mtrx stte town A town B town C (Rows ~ sttes, Columns ~ ctons) A vlue of 999 bove sgnls n nfesble cton n stte. Note tht the lgorthm ssumes mnmzton, nd so the "cost" s the negtve of the expected pyoffs! MDP: Tx pge 5
6 Acton: CRUISE Acton: TAXISTAND Acton: RADIO CALL Trnston Probbltes f to r o m f to r o m f to r o m MDP: Tx pge 6
7 Let's frst use the crteron: Mxmze verge rewrd per trp LP Tbleu for MDP k: : RHS Mn Note tht one of the "stedystte" equtons (for stte C) ws elmnted becuse of redundncy. MDP: Tx pge 7
8 Mnmze subject to = 1 x = px for ll sttes j j j j x 0 for ll sttes nd ctons A x cx MDP: Tx pge 8
9 Phse One procedure ws used to fnd n ntl bsc fesble soluton Iterton 0 Polcy: (Cost= 8 ) Stte Acton P{} R{} 1) town A 3) RADIO CALL ) town B 1) CRUISE ) town C 2) TAXISTAND MDP: Tx pge 9
10 k: : rhs Mn Intlly the bsc vrbles re { 3 1 2,, } A B C X X X (exctly one per stte). The vlues of these vrbles re the stedystte probbltes of the Mrkov chn correspondng to the polcy (3, 1, 2). The "most negtve" reduced cost s 6 (of vrble tht vrble should enter the bss, replcng element s , ndcted bove.) 2 X B), nd so 1 X B. (The pvot MDP: Tx pge 10
11 Iterton 1 Polcy: (Cost= ) Stte Acton P{} R{} 1) town A 3) RADIO CALL ) town B 2) TAXISTAND ) town C 2) TAXISTAND k: : rhs Mn The next pvot should enter 2 X A nto the bss, replcng 3 X A. MDP: Tx pge 11
12 Iterton 2 Polcy: (Cost= ) Stte Acton P{} R{} 1) town A 2) TAXISTAND ) town B 2) TAXISTAND ) town C 2) TAXISTAND k: : rhs Mn All reduced costs re nonnegtve! MDP: Tx pge 12
13 Optml Polcy Stte Acton P{} R{} 1) town A 2) TAXISTAND ) town B 2) TAXISTAND ) town C 2) TAXISTAND Averge cost/stge = MDP: Tx pge 13
14 MDP: Tx pge 14
15 MDP: Tx pge 15
16 Vlue Iterton Method (Note: objectve: mxmze verge rewrd per pssenger) ( ) f We wnt to compute lm n n n fn = C + pj fn j A j where ( ) ( ) ( ) f Snce lm n should be ndependent of the stte, our n n convergence crteron s to compute fn( ) fn 1 ( ) f ( ) = n n 1 nd termnte when mx ( ) { n } mn { n ( )} f f ε MDP: Tx pge 16
17 Tolernce: 1.00E 6 Mnmzng verge cost/perod terton Mx V Mn V gp (%) E E E E E E E E E E E E E E E E E E E E E E E E 5 ***Converged! wth gp = % Soluton: stte cton Vlue MDP: Tx pge 17
18 Plot of f ( ) f ( ) n n 1 MDP: Tx pge 18
19 Plot of ( ) { fn } { fn ( ) } log 10 mx mn MDP: Tx pge 19
20 Next we wll solve the problem wth the objectve of mxmzng the totl dscounted pyoff. Crteron: Dscounted Totl Cost, wth β = (1.2) -1 = k: : RHS Mn Note: Specfyng ntl condtons to be determnstc, wth town A s ntl stte Mnmze X 0 A, A j C X subject to X = β PX j j j A A MDP: Tx pge 20
21 Note tht X s not probblty n ths model, nd so the equton X = 1 s not ncluded n the LP tbleu. There s one equton for ech stte (not ncludng the objectve row), wth no redundncy s n the verge cost/stge LP model, so the totl number of vrbles, s before, s equl to the number of sttes, nd s before, n bsc fesble soluton there s one bsc vrble per stte. MDP: Tx pge 21
22 Phse One procedure ws used to fnd ntl bsc fesble soluton Iterton 0 Polcy: (Cost= ) Stte Acton X{} V{} 1) town A 1) CRUISE ) town B 1) CRUISE ) town C 3) RADIO CALL k: : rhs Mn ~stte, k~cton MDP: Tx pge 22
23 Iterton 1 Polcy: (Cost= 61.4 ) Stte Acton X{} V{} 1) town A 1) CRUISE ) town B 2) TAXISTAND ) town C 3) RADIO CALL k: : rhs Mn ~stte, k~cton MDP: Tx pge 23
24 Iterton 2 Polcy: (Cost= ) Stte Acton X{} V{} 1) town A 1) CRUISE ) town B 2) TAXISTAND ) town C 2) TAXISTAND k: : rhs Mn ~stte, k~cton MDP: Tx pge 24
25 Iterton 3 Polcy: (Cost= ) Stte Acton X{} V{} 1) town A 2) TAXISTAND ) town B 2) TAXISTAND ) town C 2) TAXISTAND k: : rhs Mn ~stte, k~cton Reduced costs re now nonnegtve! MDP: Tx pge 25
26 Optml Polcy Stte Acton X{} V{} lph{} 1) town A 2) TAXISTAND ) town B 2) TAXISTAND ) town C 2) TAXISTAND Alph s ntl dstrbuton of the stte Dscounted future costs = MDP: Tx pge 26
27 MDP: Tx pge 27
28 MDP: Tx pge 28
29 Vlue Iterton Method Tolernce: 1.00E 6 Mnmzng dscounted future costs terton Mx V Mn V gp (%) E E E E E E E E E E E E E E E E E E E E E E E E E E E 5 ***Converged! wth gp = % MDP: Tx pge 29
30 Soluton: stte cton Vlue MDP: Tx pge 30
31 Solvng gn. Tolernce: 1.00E 12 Reduced the tolernce! terton Mx V Mn V gp (%) E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E E 11 ***Converged! wth gp = 4.794E 11% MDP: Tx pge 31
32 Soluton: stte cton Vlue Note: Polcy s sme s the erler run wth lrger tolernce, but objectve vlue s nerer to true vlue. MDP: Tx pge 32
33 MDP: Tx pge 33
34 Becuse the durton (n mnutes) of stge (trp) wll depend upon the polcy whch we select, the objectve of mxmzng the rewrd per trp s npproprte-- we should nsted mxmze the rewrd per unt tme. Ths requres tht we tret ths s Sem-Mrkov Decson Process (SMDP). MDP: Tx pge 34
35 Suppose we hve the ddtonl dt: Expected tme to obtn pssenger: k W = expected wtng tme (mnutes) n town when cton k s selected Town \ Acton Crusng Tx stnd Dsptch cll A B C Expected trvel tme between towns: T j= expected trvel tme (mnutes) from town to town j Town \ Town A B C A B C MDP: Tx pge 35
36 Expected trvel tme (mnutes) of trp = k PT j j = j Cruse Tx-stnd Rdo cll town A town B town C v E τ = expected totl durton of trp (wtng + trvelng) k k k when n town f cton s selected,.e., E τ = W + PT j j = Cruse Tx-stnd Rdo cll town A town B town C j MDP: Tx pge 36
37 Averge rewrd per mnute for the optml polcy (2,2,2) found by the MDP: cx verge rewrd/trp $ $ /mn. vx = verge durton of trp = mn. = MDP: Tx pge 37
38 If we tret ths s Sem-Mrkov Decson Process (SMDP), then we cn fnd the polcy whch mxmzes our rewrd per mnute by solvng the LP: Mnmze subject to = 1 u = pu for ll sttes j j j j u 0 for ll sttes nd ctons A vu cu MDP: Tx pge 38
39 k: : RHS Mn Note tht ths tbleu dffers from tht of the LP for MDP only n the lst row! MDP: Tx pge 39
40 Optml Polcy: Stte Acton U{} 1) town A 1) CRUISE ) town B 2) TAXISTAND ) town C 2) TAXISTAND Averge cost/unt tme = The optml polcy (1,2,2) of the SMDP s dfferent n town A, nd the verge rewrd per mnute s slghtly lrger ($ ) thn tht ($ ) of the erler polcy (2,2,2). In relty, of course, the nfnte horzon s much more problemtc ssumpton n ths prtculr problem!) MDP: Tx pge 40
Partially Observable Systems. 1 Partially Observable Markov Decision Process (POMDP) Formalism
CS294-40 Lernng for Rootcs nd Control Lecture 10-9/30/2008 Lecturer: Peter Aeel Prtlly Oservle Systems Scre: Dvd Nchum Lecture outlne POMDP formlsm Pont-sed vlue terton Glol methods: polytree, enumerton,
More informationRemember: Project Proposals are due April 11.
Bonformtcs ecture Notes Announcements Remember: Project Proposls re due Aprl. Clss 22 Aprl 4, 2002 A. Hdden Mrov Models. Defntons Emple - Consder the emple we tled bout n clss lst tme wth the cons. However,
More informationM/G/1/GD/ / System. ! Pollaczek-Khinchin (PK) Equation. ! Steady-state probabilities. ! Finding L, W q, W. ! π 0 = 1 ρ
M/G//GD/ / System! Pollcze-Khnchn (PK) Equton L q 2 2 λ σ s 2( + ρ ρ! Stedy-stte probbltes! π 0 ρ! Fndng L, q, ) 2 2 M/M/R/GD/K/K System! Drw the trnston dgrm! Derve the stedy-stte probbltes:! Fnd L,L
More informationVariable time amplitude amplification and quantum algorithms for linear algebra. Andris Ambainis University of Latvia
Vrble tme mpltude mplfcton nd quntum lgorthms for lner lgebr Andrs Ambns Unversty of Ltv Tlk outlne. ew verson of mpltude mplfcton;. Quntum lgorthm for testng f A s sngulr; 3. Quntum lgorthm for solvng
More informationUNIVERSITY OF IOANNINA DEPARTMENT OF ECONOMICS. M.Sc. in Economics MICROECONOMIC THEORY I. Problem Set II
Mcroeconomc Theory I UNIVERSITY OF IOANNINA DEPARTMENT OF ECONOMICS MSc n Economcs MICROECONOMIC THEORY I Techng: A Lptns (Note: The number of ndctes exercse s dffculty level) ()True or flse? If V( y )
More informationDefinition of Tracking
Trckng Defnton of Trckng Trckng: Generte some conclusons bout the moton of the scene, objects, or the cmer, gven sequence of mges. Knowng ths moton, predct where thngs re gong to project n the net mge,
More informationChapter Newton-Raphson Method of Solving a Nonlinear Equation
Chpter.4 Newton-Rphson Method of Solvng Nonlner Equton After redng ths chpter, you should be ble to:. derve the Newton-Rphson method formul,. develop the lgorthm of the Newton-Rphson method,. use the Newton-Rphson
More informationChapter Newton-Raphson Method of Solving a Nonlinear Equation
Chpter 0.04 Newton-Rphson Method o Solvng Nonlner Equton Ater redng ths chpter, you should be ble to:. derve the Newton-Rphson method ormul,. develop the lgorthm o the Newton-Rphson method,. use the Newton-Rphson
More informationRank One Update And the Google Matrix by Al Bernstein Signal Science, LLC
Introducton Rnk One Updte And the Google Mtrx y Al Bernsten Sgnl Scence, LLC www.sgnlscence.net here re two dfferent wys to perform mtrx multplctons. he frst uses dot product formulton nd the second uses
More informationv v at 1 2 d vit at v v 2a d
SPH3UW Unt. Accelerton n One Denon Pge o 9 Note Phyc Inventory Accelerton the rte o chnge o velocty. Averge ccelerton, ve the chnge n velocty dvded by the te ntervl, v v v ve. t t v dv Intntneou ccelerton
More informationThe Schur-Cohn Algorithm
Modelng, Estmton nd Otml Flterng n Sgnl Processng Mohmed Njm Coyrght 8, ISTE Ltd. Aendx F The Schur-Cohn Algorthm In ths endx, our m s to resent the Schur-Cohn lgorthm [] whch s often used s crteron for
More informationIn this Chapter. Chap. 3 Markov chains and hidden Markov models. Probabilistic Models. Example: CpG Islands
In ths Chpter Chp. 3 Mrov chns nd hdden Mrov models Bontellgence bortory School of Computer Sc. & Eng. Seoul Ntonl Unversty Seoul 5-74, Kore The probblstc model for sequence nlyss HMM (hdden Mrov model)
More informationModel Fitting and Robust Regression Methods
Dertment o Comuter Engneerng Unverst o Clorn t Snt Cruz Model Fttng nd Robust Regresson Methods CMPE 64: Imge Anlss nd Comuter Vson H o Fttng lnes nd ellses to mge dt Dertment o Comuter Engneerng Unverst
More informationDCDM BUSINESS SCHOOL NUMERICAL METHODS (COS 233-8) Solutions to Assignment 3. x f(x)
DCDM BUSINESS SCHOOL NUMEICAL METHODS (COS -8) Solutons to Assgnment Queston Consder the followng dt: 5 f() 8 7 5 () Set up dfference tble through fourth dfferences. (b) Wht s the mnmum degree tht n nterpoltng
More informationSolution of Tutorial 5 Drive dynamics & control
ELEC463 Unversty of New South Wles School of Electrcl Engneerng & elecommunctons ELEC463 Electrc Drve Systems Queston Motor Soluton of utorl 5 Drve dynmcs & control 500 rev/mn = 5.3 rd/s 750 rted 4.3 Nm
More informationAn Introduction to Support Vector Machines
An Introducton to Support Vector Mchnes Wht s good Decson Boundry? Consder two-clss, lnerly seprble clssfcton problem Clss How to fnd the lne (or hyperplne n n-dmensons, n>)? Any de? Clss Per Lug Mrtell
More information4. Eccentric axial loading, cross-section core
. Eccentrc xl lodng, cross-secton core Introducton We re strtng to consder more generl cse when the xl force nd bxl bendng ct smultneousl n the cross-secton of the br. B vrtue of Snt-Vennt s prncple we
More informationDynamic Power Management in a Mobile Multimedia System with Guaranteed Quality-of-Service
Dynmc Power Mngement n Moble Multmed System wth Gurnteed Qulty-of-Servce Qnru Qu, Qng Wu, nd Mssoud Pedrm Dept. of Electrcl Engneerng-Systems Unversty of Southern Clforn Los Angeles CA 90089 Outlne! Introducton
More informationψ ij has the eigenvalue
Moller Plesset Perturbton Theory In Moller-Plesset (MP) perturbton theory one tes the unperturbed Hmltonn for n tom or molecule s the sum of the one prtcle Foc opertors H F() where the egenfunctons of
More informationWe will see what is meant by standard form very shortly
THEOREM: For fesible liner progrm in its stndrd form, the optimum vlue of the objective over its nonempty fesible region is () either unbounded or (b) is chievble t lest t one extreme point of the fesible
More informationCS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 9
CS434/541: Pttern Recognton Prof. Olg Veksler Lecture 9 Announcements Fnl project proposl due Nov. 1 1-2 prgrph descrpton Lte Penlt: s 1 pont off for ech d lte Assgnment 3 due November 10 Dt for fnl project
More informationApplied Statistics Qualifier Examination
Appled Sttstcs Qulfer Exmnton Qul_june_8 Fll 8 Instructons: () The exmnton contns 4 Questons. You re to nswer 3 out of 4 of them. () You my use ny books nd clss notes tht you mght fnd helpful n solvng
More informationUsing Predictions in Online Optimization: Looking Forward with an Eye on the Past
Usng Predctons n Onlne Optmzton: Lookng Forwrd wth n Eye on the Pst Nngjun Chen Jont work wth Joshu Comden, Zhenhu Lu, Anshul Gndh, nd Adm Wermn 1 Predctons re crucl for decson mkng 2 Predctons re crucl
More informationLinear and Nonlinear Optimization
Lner nd Nonlner Optmzton Ynyu Ye Deprtment of Mngement Scence nd Engneerng Stnford Unversty Stnford, CA 9430, U.S.A. http://www.stnford.edu/~yyye http://www.stnford.edu/clss/msnde/ Ynyu Ye, Stnford, MS&E
More informationOptimal Resource Allocation and Policy Formulation in Loosely-Coupled Markov Decision Processes
Optml Resource Allocton nd Polcy Formulton n Loosely-Coupled Mrkov Decson Processes Dmtr A. Dolgov nd Edmund H. Durfee Deprtment of Electrcl Engneerng nd Computer Scence Unversty of Mchgn Ann Arbor, MI
More informationImproving Anytime Point-Based Value Iteration Using Principled Point Selections
In In Proceedngs of the Twenteth Interntonl Jont Conference on Artfcl Intellgence (IJCAI-7) Improvng Anytme Pont-Bsed Vlue Iterton Usng Prncpled Pont Selectons Mchel R. Jmes, Mchel E. Smples, nd Dmtr A.
More informationA New Markov Chain Based Acceptance Sampling Policy via the Minimum Angle Method
Irnn Journl of Opertons Reserch Vol. 3, No., 202, pp. 04- A New Mrkov Chn Bsed Acceptnce Smplng Polcy v the Mnmum Angle Method M. S. Fllh Nezhd * We develop n optmzton model bsed on Mrkovn pproch to determne
More informationStatistics and Probability Letters
Sttstcs nd Probblty Letters 79 (2009) 105 111 Contents lsts vlble t ScenceDrect Sttstcs nd Probblty Letters journl homepge: www.elsever.com/locte/stpro Lmtng behvour of movng verge processes under ϕ-mxng
More informationJean Fernand Nguema LAMETA UFR Sciences Economiques Montpellier. Abstract
Stochstc domnnce on optml portfolo wth one rsk less nd two rsky ssets Jen Fernnd Nguem LAMETA UFR Scences Economques Montpeller Abstrct The pper provdes restrctons on the nvestor's utlty functon whch re
More informationPrinciple Component Analysis
Prncple Component Anlyss Jng Go SUNY Bufflo Why Dmensonlty Reducton? We hve too mny dmensons o reson bout or obtn nsghts from o vsulze oo much nose n the dt Need to reduce them to smller set of fctors
More informationWe consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and
MANAGEMENT SCIENCE Vol. 53, No. 2, Februry 2007, pp. 308 322 ssn 0025-1909 essn 1526-5501 07 5302 0308 nforms do 10.1287/mnsc.1060.0614 2007 INFORMS Bs nd Vrnce Approxmton n Vlue Functon Estmtes She Mnnor
More informationarxiv: v2 [cs.lg] 9 Nov 2017
Renforcement Lernng under Model Msmtch Aurko Roy 1, Hun Xu 2, nd Sebstn Pokutt 2 rxv:1706.04711v2 cs.lg 9 Nov 2017 1 Google Eml: urkor@google.com 2 ISyE, Georg Insttute of Technology, Atlnt, GA, USA. Eml:
More informationLecture 4: Piecewise Cubic Interpolation
Lecture notes on Vrtonl nd Approxmte Methods n Appled Mthemtcs - A Perce UBC Lecture 4: Pecewse Cubc Interpolton Compled 6 August 7 In ths lecture we consder pecewse cubc nterpolton n whch cubc polynoml
More information3/6/00. Reading Assignments. Outline. Hidden Markov Models: Explanation and Model Learning
3/6/ Hdden Mrkov Models: Explnton nd Model Lernng Brn C. Wllms 6.4/6.43 Sesson 2 9/3/ courtesy of JPL copyrght Brn Wllms, 2 Brn C. Wllms, copyrght 2 Redng Assgnments AIMA (Russell nd Norvg) Ch 5.-.3, 2.3
More informationSimultaneous estimation of rewards and dynamics from noisy expert demonstrations
Smultneous estmton of rewrds nd dynmcs from nosy expert demonstrtons Mchel Hermn,2, Tobs Gndele, Jo rg Wgner, Felx Schmtt, nd Wolfrm Burgrd2 - Robert Bosch GmbH - 70442 Stuttgrt - Germny 2- Unversty of
More informationHQPD - ALGEBRA I TEST Record your answers on the answer sheet.
HQPD - ALGEBRA I TEST Record your nswers on the nswer sheet. Choose the best nswer for ech. 1. If 7(2d ) = 5, then 14d 21 = 5 is justified by which property? A. ssocitive property B. commuttive property
More informationINTRODUCTION TO COMPLEX NUMBERS
INTRODUCTION TO COMPLEX NUMBERS The numers -4, -3, -, -1, 0, 1,, 3, 4 represent the negtve nd postve rel numers termed ntegers. As one frst lerns n mddle school they cn e thought of s unt dstnce spced
More information6.6 The Marquardt Algorithm
6.6 The Mqudt Algothm lmttons of the gdent nd Tylo expnson methods ecstng the Tylo expnson n tems of ch-sque devtves ecstng the gdent sech nto n tetve mtx fomlsm Mqudt's lgothm utomtclly combnes the gdent
More informationChemical Reaction Engineering
Lecture 20 hemcl Recton Engneerng (RE) s the feld tht studes the rtes nd mechnsms of chemcl rectons nd the desgn of the rectors n whch they tke plce. Lst Lecture Energy Blnce Fundmentls F 0 E 0 F E Q W
More informationGAUSS ELIMINATION. Consider the following system of algebraic linear equations
Numercl Anlyss for Engneers Germn Jordnn Unversty GAUSS ELIMINATION Consder the followng system of lgebrc lner equtons To solve the bove system usng clsscl methods, equton () s subtrcted from equton ()
More information8 Transportation Problem Alpha-Beta
8 rnsportton Proble Alph-Bet Now, we ntroduce n ddtonl lgorth for the Htchcock rnsportton proble, whch ws lredy ntroduced before hs s the Alph-Bet Algorth It copletes the lst of soluton pproches for solvng
More informationSmart Motorways HADECS 3 and what it means for your drivers
Vehcle Rentl Smrt Motorwys HADECS 3 nd wht t mens for your drvers Vehcle Rentl Smrt Motorwys HADECS 3 nd wht t mens for your drvers You my hve seen some news rtcles bout the ntroducton of Hghwys Englnd
More information18.7 Artificial Neural Networks
310 18.7 Artfcl Neurl Networks Neuroscence hs hypotheszed tht mentl ctvty conssts prmrly of electrochemcl ctvty n networks of brn cells clled neurons Ths led McCulloch nd Ptts to devse ther mthemtcl model
More informationComputing a complete histogram of an image in Log(n) steps and minimum expected memory requirements using hypercubes
Computng complete hstogrm of n mge n Log(n) steps nd mnmum expected memory requrements usng hypercubes TAREK M. SOBH School of Engneerng, Unversty of Brdgeport, Connectcut, USA. Abstrct Ths work frst revews
More informationProof that if Voting is Perfect in One Dimension, then the First. Eigenvector Extracted from the Double-Centered Transformed
Proof tht f Votng s Perfect n One Dmenson, then the Frst Egenvector Extrcted from the Doule-Centered Trnsformed Agreement Score Mtrx hs the Sme Rn Orderng s the True Dt Keth T Poole Unversty of Houston
More informationReview of linear algebra. Nuno Vasconcelos UCSD
Revew of lner lgebr Nuno Vsconcelos UCSD Vector spces Defnton: vector spce s set H where ddton nd sclr multplcton re defned nd stsf: ) +( + ) (+ )+ 5) λ H 2) + + H 6) 3) H, + 7) λ(λ ) (λλ ) 4) H, - + 8)
More informationOn-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization
On-lne Renforcement Lernng Usng Incrementl Kernel-Bsed Stochstc Fctorzton André M. S. Brreto School of Computer Scence McGll Unversty Montrel, Cnd msb@cs.mcgll.c Don Precup School of Computer Scence McGll
More informationQuiz: Experimental Physics Lab-I
Mxmum Mrks: 18 Totl tme llowed: 35 mn Quz: Expermentl Physcs Lb-I Nme: Roll no: Attempt ll questons. 1. In n experment, bll of mss 100 g s dropped from heght of 65 cm nto the snd contner, the mpct s clled
More informationA Robust Folk Theorem for the Prisoner s Dilemma
A Robust Folk Theorem for the Prsoner s Dlemm Jeffrey C. Ely Juuso Välmäk December 23, 1999 Abstrct We prove the folk theorem for the Prsoner s dlemm usng strteges tht re robust to prvte montorng. From
More informationFormulated Algorithm for Computing Dominant Eigenvalue. and the Corresponding Eigenvector
Int. J. Contemp. Mth. Scences Vol. 8 23 no. 9 899-9 HIKARI Ltd www.m-hkr.com http://dx.do.org/.2988/jcms.23.3674 Formulted Algorthm for Computng Domnnt Egenlue nd the Correspondng Egenector Igob Dod Knu
More informationJens Siebel (University of Applied Sciences Kaiserslautern) An Interactive Introduction to Complex Numbers
Jens Sebel (Unversty of Appled Scences Kserslutern) An Interctve Introducton to Complex Numbers 1. Introducton We know tht some polynoml equtons do not hve ny solutons on R/. Exmple 1.1: Solve x + 1= for
More informationName: SID: Discussion Session:
Nme: SID: Dscusson Sesson: hemcl Engneerng hermodynmcs -- Fll 008 uesdy, Octoer, 008 Merm I - 70 mnutes 00 onts otl losed Book nd Notes (5 ponts). onsder n del gs wth constnt het cpctes. Indcte whether
More informationBi-level models for OD matrix estimation
TNK084 Trffc Theory seres Vol.4, number. My 2008 B-level models for OD mtrx estmton Hn Zhng, Quyng Meng Abstrct- Ths pper ntroduces two types of O/D mtrx estmton model: ME2 nd Grdent. ME2 s mxmum-entropy
More informationLeast squares. Václav Hlaváč. Czech Technical University in Prague
Lest squres Václv Hlváč Czech echncl Unversty n Prgue hlvc@fel.cvut.cz http://cmp.felk.cvut.cz/~hlvc Courtesy: Fred Pghn nd J.P. Lews, SIGGRAPH 2007 Course; Outlne 2 Lner regresson Geometry of lest-squres
More informationKatholieke Universiteit Leuven Department of Computer Science
Updte Rules for Weghted Non-negtve FH*G Fctorzton Peter Peers Phlp Dutré Report CW 440, Aprl 006 Ktholeke Unverstet Leuven Deprtment of Computer Scence Celestjnenln 00A B-3001 Heverlee (Belgum) Updte Rules
More informationThe Number of Rows which Equal Certain Row
Interntonl Journl of Algebr, Vol 5, 011, no 30, 1481-1488 he Number of Rows whch Equl Certn Row Ahmd Hbl Deprtment of mthemtcs Fcult of Scences Dmscus unverst Dmscus, Sr hblhmd1@gmlcom Abstrct Let be X
More informationLecture 5 Single factor design and analysis
Lectue 5 Sngle fcto desgn nd nlss Completel ndomzed desgn (CRD Completel ndomzed desgn In the desgn of expements, completel ndomzed desgns e fo studng the effects of one pm fcto wthout the need to tke
More information4.5 JACOBI ITERATION FOR FINDING EIGENVALUES OF A REAL SYMMETRIC MATRIX. be a real symmetric matrix. ; (where we choose θ π for.
4.5 JACOBI ITERATION FOR FINDING EIGENVALUES OF A REAL SYMMETRIC MATRIX Some reliminries: Let A be rel symmetric mtrix. Let Cos θ ; (where we choose θ π for Cos θ 4 purposes of convergence of the scheme)
More informationMath 8 Winter 2015 Applications of Integration
Mth 8 Winter 205 Applictions of Integrtion Here re few importnt pplictions of integrtion. The pplictions you my see on n exm in this course include only the Net Chnge Theorem (which is relly just the Fundmentl
More informationOnline Appendix to. Mandating Behavioral Conformity in Social Groups with Conformist Members
Onlne Appendx to Mndtng Behvorl Conformty n Socl Groups wth Conformst Members Peter Grzl Andrze Bnk (Correspondng uthor) Deprtment of Economcs, The Wllms School, Wshngton nd Lee Unversty, Lexngton, 4450
More informationChemical Reaction Engineering
Lecture 20 hemcl Recton Engneerng (RE) s the feld tht studes the rtes nd mechnsms of chemcl rectons nd the desgn of the rectors n whch they tke plce. Lst Lecture Energy Blnce Fundmentls F E F E + Q! 0
More informationn α j x j = 0 j=1 has a nontrivial solution. Here A is the n k matrix whose jth column is the vector for all t j=0
MODULE 2 Topcs: Lnear ndependence, bass and dmenson We have seen that f n a set of vectors one vector s a lnear combnaton of the remanng vectors n the set then the span of the set s unchanged f that vector
More informationDynamic Power Management in a Mobile Multimedia System with Guaranteed Quality-of-Service
Dynmc Power Mngement n Moble Multmed System wth Gurnteed Qulty-of-Servce Abstrct In ths pper we ddress the problem of dynmc power mngement n dstrbuted multmed system wth requred qulty of servce (QoS).
More informationTwo Coefficients of the Dyson Product
Two Coeffcents of the Dyson Product rxv:07.460v mth.co 7 Nov 007 Lun Lv, Guoce Xn, nd Yue Zhou 3,,3 Center for Combntorcs, LPMC TJKLC Nnk Unversty, Tnjn 30007, P.R. Chn lvlun@cfc.nnk.edu.cn gn@nnk.edu.cn
More informationTrigonometry. Trigonometry. Solutions. Curriculum Ready ACMMG: 223, 224, 245.
Trgonometry Trgonometry Solutons Currulum Redy CMMG:, 4, 4 www.mthlets.om Trgonometry Solutons Bss Pge questons. Identfy f the followng trngles re rght ngled or not. Trngles,, d, e re rght ngled ndted
More informationOnline Learning Algorithms for Stochastic Water-Filling
Onlne Lernng Algorthms for Stochstc Wter-Fllng Y G nd Bhskr Krshnmchr Mng Hseh Deprtment of Electrcl Engneerng Unversty of Southern Clforn Los Angeles, CA 90089, USA Eml: {yg, bkrshn}@usc.edu Abstrct Wter-fllng
More information1 Online Learning and Regret Minimization
2.997 Decision-Mking in Lrge-Scle Systems My 10 MIT, Spring 2004 Hndout #29 Lecture Note 24 1 Online Lerning nd Regret Minimiztion In this lecture, we consider the problem of sequentil decision mking in
More information1. Extend QR downwards to meet the x-axis at U(6, 0). y
In the digrm, two stright lines re to be drwn through so tht the lines divide the figure OPQRST into pieces of equl re Find the sum of the slopes of the lines R(6, ) S(, ) T(, 0) Determine ll liner functions
More informationLOCAL FRACTIONAL LAPLACE SERIES EXPANSION METHOD FOR DIFFUSION EQUATION ARISING IN FRACTAL HEAT TRANSFER
Yn, S.-P.: Locl Frctonl Lplce Seres Expnson Method for Dffuson THERMAL SCIENCE, Yer 25, Vol. 9, Suppl., pp. S3-S35 S3 LOCAL FRACTIONAL LAPLACE SERIES EXPANSION METHOD FOR DIFFUSION EQUATION ARISING IN
More informationCan the Phase I problem be unfeasible or unbounded? -No
Cn the Phse I problem be unfesible or unbounded? -No Phse I: min 1X AX + IX = b with b 0 X 1, X 0 By mnipulting constrints nd dding/subtrcting slck/surplus vribles, we cn get b 0 A fesible solution with
More informationAltitude Estimation for 3-D Tracking with Two 2-D Radars
th Interntonl Conference on Informton Fuson Chcgo Illnos USA July -8 Alttude Estmton for -D Trckng wth Two -D Rdrs Yothn Rkvongth Jfeng Ru Sv Svnnthn nd Soontorn Orntr Deprtment of Electrcl Engneerng Unversty
More informationElectrochemical Thermodynamics. Interfaces and Energy Conversion
CHE465/865, 2006-3, Lecture 6, 18 th Sep., 2006 Electrochemcl Thermodynmcs Interfces nd Energy Converson Where does the energy contrbuton F zϕ dn come from? Frst lw of thermodynmcs (conservton of energy):
More informationSection 6: Area, Volume, and Average Value
Chpter The Integrl Applied Clculus Section 6: Are, Volume, nd Averge Vlue Are We hve lredy used integrls to find the re etween the grph of function nd the horizontl xis. Integrls cn lso e used to find
More informationCISE 301: Numerical Methods Lecture 5, Topic 4 Least Squares, Curve Fitting
CISE 3: umercl Methods Lecture 5 Topc 4 Lest Squres Curve Fttng Dr. Amr Khouh Term Red Chpter 7 of the tetoo c Khouh CISE3_Topc4_Lest Squre Motvton Gven set of epermentl dt 3 5. 5.9 6.3 The reltonshp etween
More informationMachine Learning Support Vector Machines SVM
Mchne Lernng Support Vector Mchnes SVM Lesson 6 Dt Clssfcton problem rnng set:, D,,, : nput dt smple {,, K}: clss or lbel of nput rget: Construct functon f : X Y f, D Predcton of clss for n unknon nput
More informationSuggested solutions for the exam in SF2863 Systems Engineering. June 12,
Suggested solutons for the exam n SF2863 Systems Engneerng. June 12, 2012 14.00 19.00 Examner: Per Enqvst, phone: 790 62 98 1. We can thnk of the farm as a Jackson network. The strawberry feld s modelled
More informationExpectation and Variance
Expecttion nd Vrince : sum of two die rolls P(= P(= = 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 P(=2) = 1/36 P(=3) = 1/18 P(=4) = 1/12 P(=5) = 1/9 P(=7) = 1/6 P(=13) =? 2 1/36 3 1/18 4 1/12 5 1/9 6 5/36 7 1/6
More informationMORE FUNCTION GRAPHING; OPTIMIZATION. (Last edited October 28, 2013 at 11:09pm.)
MORE FUNCTION GRAPHING; OPTIMIZATION FRI, OCT 25, 203 (Lst edited October 28, 203 t :09pm.) Exercise. Let n be n rbitrry positive integer. Give n exmple of function with exctly n verticl symptotes. Give
More information1 Probability Density Functions
Lis Yn CS 9 Continuous Distributions Lecture Notes #9 July 6, 28 Bsed on chpter by Chris Piech So fr, ll rndom vribles we hve seen hve been discrete. In ll the cses we hve seen in CS 9, this ment tht our
More informationLecture 2: Prelude to the big shrink
Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson
More informationReinforcement Learning
Reinforcement Lerning Tom Mitchell, Mchine Lerning, chpter 13 Outline Introduction Comprison with inductive lerning Mrkov Decision Processes: the model Optiml policy: The tsk Q Lerning: Q function Algorithm
More informationThe Regulated and Riemann Integrals
Chpter 1 The Regulted nd Riemnn Integrls 1.1 Introduction We will consider severl different pproches to defining the definite integrl f(x) dx of function f(x). These definitions will ll ssign the sme vlue
More informationLecture 3 ( ) (translated and slightly adapted from lecture notes by Martin Klazar)
Lecture 3 (5.3.2018) (trnslted nd slightly dpted from lecture notes by Mrtin Klzr) Riemnn integrl Now we define precisely the concept of the re, in prticulr, the re of figure U(, b, f) under the grph of
More informationProblem Set 9 - Solutions Due: April 27, 2005
Problem Set - Solutons Due: Aprl 27, 2005. (a) Frst note that spam messages, nvtatons and other e-mal are all ndependent Posson processes, at rates pλ, qλ, and ( p q)λ. The event of the tme T at whch you
More informationSolubilities and Thermodynamic Properties of SO 2 in Ionic
Solubltes nd Therodync Propertes of SO n Ionc Lquds Men Jn, Yucu Hou, b Weze Wu, *, Shuhng Ren nd Shdong Tn, L Xo, nd Zhgng Le Stte Key Lbortory of Checl Resource Engneerng, Beng Unversty of Checl Technology,
More informationAPPROXIMATE INTEGRATION
APPROXIMATE INTEGRATION. Introduction We hve seen tht there re functions whose nti-derivtives cnnot be expressed in closed form. For these resons ny definite integrl involving these integrnds cnnot be
More informationThe 7 th Balkan Conference on Operational Research BACOR 05 Constanta, May 2005, Romania STEADY-STATE SOLUTIONS OF MARKOV CHAINS
The 7 th Blkn Conference on Opertonl Reserch BACOR 5 Constnt My 25 Romn STEADY-STATE SOLUTIONS OF MARKOV CHAINS DIMITAR RADEV Deprtment of Communcton Technque & Technologes Unversty of Rousse Bulgr VLADIMIR
More informationThe solution of transport problems by the method of structural optimization
Scentfc Journls Mrtme Unversty of Szczecn Zeszyty Nukowe Akdem Morsk w Szczecne 2013, 34(106) pp. 59 64 2013, 34(106) s. 59 64 ISSN 1733-8670 The soluton of trnsport problems by the method of structurl
More informationUsing the Econometric Models in Planning the Service of Several Machines at Random Time Intervals. Authors:
Usng the Econometrc Models n Plnnng the Servce of Severl Mchnes t Rndom Tme Intervls. Authors: ) Ion Constntn Dm, Unversty Vlh of Trgovste, Romn ) Mrce Udrescu, Unversty Artfex of Buchrest, Romn Interntonl
More informationESCI 342 Atmospheric Dynamics I Lesson 1 Vectors and Vector Calculus
ESI 34 tmospherc Dnmcs I Lesson 1 Vectors nd Vector lculus Reference: Schum s Outlne Seres: Mthemtcl Hndbook of Formuls nd Tbles Suggested Redng: Mrtn Secton 1 OORDINTE SYSTEMS n orthonorml coordnte sstem
More informationTELCOM 2130 Time Varying Queues. David Tipper Associate Professor Graduate Telecommunications and Networking Program University of Pittsburgh Slides 7
TELOM 3 Tme Vryng Queues Dvd Tpper Assote Professor Grdute Teleommuntons nd Networkng Progrm Unversty of Pttsburgh ldes 7 Tme Vryng Behvor Teletrff typlly hs lrge tme of dy vrtons Men number of lls per
More informationA Reinforcement Learning System with Chaotic Neural Networks-Based Adaptive Hierarchical Memory Structure for Autonomous Robots
Interntonl Conference on Control, Automton nd ystems 008 Oct. 4-7, 008 n COEX, eoul, Kore A Renforcement ernng ystem wth Chotc Neurl Networs-Bsed Adptve Herrchcl Memory tructure for Autonomous Robots Msno
More informationIntroduction To Matrices MCV 4UI Assignment #1
Introduction To Mtrices MCV UI Assignment # INTRODUCTION: A mtrix plurl: mtrices) is rectngulr rry of numbers rrnged in rows nd columns Exmples: ) b) c) [ ] d) Ech number ppering in the rry is sid to be
More informationDIRECT CURRENT CIRCUITS
DRECT CURRENT CUTS ELECTRC POWER Consider the circuit shown in the Figure where bttery is connected to resistor R. A positive chrge dq will gin potentil energy s it moves from point to point b through
More informationSubstitution Matrices and Alignment Statistics. Substitution Matrices
Susttuton Mtrces nd Algnment Sttstcs BMI/CS 776 www.ostt.wsc.edu/~crven/776.html Mrk Crven crven@ostt.wsc.edu Ferur 2002 Susttuton Mtrces two oulr sets of mtrces for roten seuences PAM mtrces [Dhoff et
More information7.2 Volume. A cross section is the shape we get when cutting straight through an object.
7. Volume Let s revew the volume of smple sold, cylnder frst. Cylnder s volume=se re heght. As llustrted n Fgure (). Fgure ( nd (c) re specl cylnders. Fgure () s rght crculr cylnder. Fgure (c) s ox. A
More informationMath Solutions to homework 1
Mth 75 - Solutions to homework Cédric De Groote October 5, 07 Problem, prt : This problem explores the reltionship between norms nd inner products Let X be rel vector spce ) Suppose tht is norm on X tht
More informationProblem Set 3 Solutions
Chemistry 36 Dr Jen M Stndrd Problem Set 3 Solutions 1 Verify for the prticle in one-dimensionl box by explicit integrtion tht the wvefunction ψ ( x) π x is normlized To verify tht ψ ( x) is normlized,
More informationMath 426: Probability Final Exam Practice
Mth 46: Probbility Finl Exm Prctice. Computtionl problems 4. Let T k (n) denote the number of prtitions of the set {,..., n} into k nonempty subsets, where k n. Argue tht T k (n) kt k (n ) + T k (n ) by
More informationTheoretical foundations of Gaussian quadrature
Theoreticl foundtions of Gussin qudrture 1 Inner product vector spce Definition 1. A vector spce (or liner spce) is set V = {u, v, w,...} in which the following two opertions re defined: (A) Addition of
More information