Approximate dynamic programming using model-free Bellman Residual Elimination

Size: px
Start display at page:

Download "Approximate dynamic programming using model-free Bellman Residual Elimination"

Transcription

1 Approxiate dynaic prograing using ode-free Bean Residua Eiination The MIT Facuty has ade this artice openy avaiabe. Pease share how this access benefits you. Your story atters. Citation As Pubished Pubisher Bethke, B., and J.P. How. Approxiate dynaic prograing using ode-free Bean Residua Eiination. Aerican Contro Conference (ACC), Print. &isnuber= Institute of Eectrica and Eectronics Engineers / Aerican Autoatic Contro Counci Version Fina pubished version Accessed Sat Apr 06 07:06:48 EDT 209 Citabe Link Ters of Use Detaied Ters Artice is ade avaiabe in accordance with the pubisher's poicy and ay be subject to US copyright aw. Pease refer to the pubisher's site for ters of use.

2 200 Aerican Contro Conference Marriott Waterfront, Batiore, MD, USA June 30-Juy 02, 200 ThC07.2 Approxiate Dynaic Prograing Using Mode-Free Bean Residua Eiination Brett Bethke and Jonathan P. How Abstract This paper presents an odification to the ethod of Bean Residua Eiination (BRE) [], [2] for approxiate dynaic prograing. Whie prior work on BRE has focused on earning an approxiate poicy for an underying Markov Decision Process (MDP) when the state transition ode of the MDP is known, this work proposes a ode-free variant of BRE that does not require knowedge of the state transition ode. Instead, state trajectories of the syste, generated using siuation and/or observations of the rea syste in operation, are used to buid stochastic approxiations of the quantities needed to carry out the BRE agorith. The resuting agorith can be shown to converge to the poicy produced by the noina, ode-based BRE agorith in the iit of observing an infinite nuber of trajectories. To vaidate the perforance of the approach, we copare ode-based and ode-free BRE against LSPI [3], a we-known approxiate dynaic prograing agorith. Measuring perforance in ters of both coputationa copexity and poicy quaity, we present resuts showing that BRE perfors at east as we as, and soeties significanty better than, LSPI on a standard benchark probe. I. INTRODUCTION Markov Decision Processes (MDPs) are a powerfu and genera fraework for addressing probes invoving sequentia decision aking under uncertainty [4]. Unfortunatey, since MDPs suffer fro the we-known curse of diensionaity, the coputationa copexity of finding an exact soution is typicay prohibitivey arge. Therefore, in ost cases, approxiate dynaic prograing (ADP) techniques ust be epoyed to copute approxiate soutions that can be found in reasonabe tie [5]. Motivated by the success of kerne-based ethods such as support vector achines [6] and Gaussian processes [7] in pattern cassification and regression appications, researchers have begun appying these powerfu techniques in the ADP doain. This work has ed to agoriths such as GPTD [8], an approach which uses tepora differences to earn a Gaussian process representation of the cost-to-go function, and GPDP [9], which is an approxiate vaue iteration schee based on a siiar Gaussian process cost-to-go representation. Another recenty-deveoped approach, known as Bean Residua Eiination (BRE) [], [2], uses kernebased regression to sove a syste of Bean equations over a sa set of sape states. Siiar to the we-studied cass of inear architectures [3], [0], kerne-based cost representations can be interpreted as apping a state of the MDP into a set of features; however, B. Bethke, Ph.D. bbethke@it.edu J. P. How is a Professor of Aeronautics and Astronautics, MIT, jhow@it.edu unike inear architectures, the effective feature vector of a kerne-based representation ay be infinite-diensiona. This property gives kerne ethods a great dea of fexibiity and akes the particuary appropriate in approxiate dynaic prograing, where the structure of the cost function ay not be we understood. The prior work on BRE [], [2] has deonstrated that by taking advantage of this fexibiity, a cass of agoriths can be deveoped that enjoy severa advantageous theoretica properties, incuding the abiity to copute cost-to-go functions whose Bean residuas are identicay zero at the saped states. This property, in turn, iediatey ipies that the BRE agoriths are guaranteed to converge to the optia poicy in the iit of saping the entire state space. In addition to these theoretica properties, we have recenty deonstrated encouraging resuts in appying BRE in order to copute approxiate poicies for arge-scae, uti-agent UAV panning probes []. The BRE work to date has assued that the underying state transition ode of the MDP is avaiabe, and has focused on deveoping a cass of agoriths with the stated theoretica properties that can use this ode inforation efficienty. Of course, in soe circustances, an exact ode ay not be known if the syste in question is not we understood. To cope with these situations, this paper presents a ode-free variation on the BRE agoriths deveoped to date. Instead of reying on knowedge of the state transition ode, the new ode-free approach uses state trajectory data, gathered either by siuation of the syste or by observing the actua syste in operation, to approxiate the quantities needed to carry out BRE. The paper begins by presenting background ateria on ode-based BRE. It then shows how to odify the basic BRE approach when ony state trajectory data, and not a fu syste ode, is avaiabe. Severa proofs deonstrate the correctness of the ode-free BRE approach, estabishing that the ode-free agorith converges to the sae resut as the ode-based agorith as ore and ore trajectory data is used. Finay, resuts of a coparison study between BRE and LSPI, a we-known approxiate dynaic prograing agorith [3], show that both ode-based and ode-free BRE perfor at east as we as, and soeties significanty better than, LSPI on a standard benchark probe. II. BACKGROUND A. Markov Decision Processes This paper considers the genera cass of infinite horizon, discounted, finite state MDPs. The MDP is specified by (S, A, P, g), where S is the state space, A is the action space, /0/$ AACC 446

3 P ij (u) gives the transition probabiity fro state i to state j under action u, and g(i, u, j) gives the cost of oving fro state i to state j under action u. To derive ode-based BRE, we assue that the MDP ode (i.e. the data (S, A, P, g)) is known. Future costs are discounted by a factor 0 < α <. A poicy of the MDP is denoted by µ : S A. Given the MDP specification, the probe is to iniize the cost-to-go function J µ over the set of adissibe poicies Π: [ ] in J µ(i 0 ) = in E α k g(i k, µ(i k )). µ Π µ Π k=0 For notationa convenience, the cost and state transition functions for a fixed poicy µ are defined as g µ i P ij (µ(i))g(i, µ(i), j) () P µ ij P ij (µ(i)), respectivey. The cost-to-go for a fixed poicy µ satisfies the Bean equation [4] J µ (i) = g µ i + α P µ ij J µ(j) i S, (2) which can aso be expressed copacty as J µ = T µ J µ, where T µ is the (fixed-poicy) dynaic prograing operator. B. Bean Residua Eiination Bean Residua Eiination is a approxiate poicy iteration technique that is cosey reated to the cass of Bean residua ethods [2] [4]. Bean residua ethods attept to perfor poicy evauation by iniizing an objective function of the for i es J µ (i) T µ Jµ (i) 2 /2, (3) where J µ is an approxiation to the true cost function J µ and S S is a set of representative sape states. BRE uses a fexibe kerne-based cost approxiation architecture to construct Jµ such that the objective function given by Eq. (3) is identicay zero. We now present the basic derivation of the BRE approach; for ore detais, see [], [2]. To begin, BRE assues the foowing functiona for of the cost function: J µ (i) = Θ, Φ(i). (4) Here,, denotes the inner product in a Reproducing Kerne Hibert Space (RKHS) [5] with the pre-specified kerne k(i, i ) = Φ(i), Φ(i ). Φ(i) is the feature apping of the kerne k(i, i ), and Θ is a weighting eeent to be found by the BRE agorith. The ain insight behind BRE is the observation that the individua Bean residuas BR(i), given by BR(i) J µ (i) T µ Jµ (i), (5) can be siutaneousy forced to zero by foruating a regression probe, which can then be soved using standard kerne-based regression schees such as support vector regression [6] or Gaussian process regression [7]. To construct the regression probe, we substitute the functiona for of the cost, given by Eq. (4), into Eq. (5): BR(i) = Θ, Φ(i) g µ i + α P µ ij Θ, Φ(j). By expoiting inearity of the inner product,, we can express BR(i) as BR(i) = g µ i + Θ, Φ(i) α P µ ij Φ(j) = g µ i where Ψ(i) is defined as + Θ, Ψ(i), Ψ(i) Φ(i) α P µ ijφ(j). (6) By the preceding construction, every cost function J µ is associated with a function W µ Θ, Ψ(i) (7) which is cosey reated to the Bean residuas through the reation W µ (i) = BR(i) + g µ i. Therefore, the condition that the Bean residuas are identicay zero at the sape states S is equivaent to finding a function W µ satisfying W µ (i) = g µ i i S. (8) Eq. (7) ipies that W µ beongs to a RKHS whose kerne is given by K(i, i ) = Ψ(i), Ψ(i ) (9) = Φ(i) α P µ ij Φ(j), Φ(i ) α P µ i jφ(j) (0) K(i, i ) is caed the Bean kerne associated with k(i, i ). Soving the regression probe given by Eq. (8), using the associated Bean kerne and any kerne-based regression technique (such as support vector regression, Gaussian process regression, etc), yieds the weighting eeent Θ. By the representer theore [6, 4.2], Θ can be expressed as a inear cobination of features Ψ of the sape states: Θ = i e S λ i Ψ(i ). Therefore, the output of soving the kerne regression probe is copacty described by the coefficients {λ i } i e S. Once these coefficients are known, Eqs. (4) and (6) can be 447

4 used to copute the approxiate cost function J µ : i i J µ (i) = Θ, Φ(i) = i e S λ i Ψ(i ), Φ(i) = λ i k(i, i) α P µ i jk(j, i). () i S e j... j j... j To suarize, the preceding construction has shown how to perfor poicy evauation for a fixed poicy µ, given a base kerne k(i, i ) and a set of sape states S. The steps to perfor approxiate poicy iteration are as foows: ) Sove the regression probe [Eq. (8)] using the associated Bean kerne K(i, i ) [Eq. (0)] and any kernebased regression technique to find the coefficients {λ i } i S e. 2) Use the coefficients found in step to copute the cost function J µ [Eq. ()]. 3) Perfor a poicy iproveent step based on the cost function found in step 2, and return to step to evauate this new poicy. Under a technica, nondegeneracy condition on the kerne k(i, i ) (which is satisfied by any coony-used kernes incuding radia basis function kernes), it can be proven that the Bean residuas of the resuting cost function J µ are aways identicay zero at the states S, as desired [2]. As a resut, it is straightforward to show that BRE is guaranteed to yied the exact cost J µ in the iit of saping the entire state space ( S = S). Graph Structure of the Associated Bean Kerne: It is usefu to exaine the associated Bean kerne K(i, i ) in soe detai to understand its structure. Eq. (9) shows that K(i, i ) can be viewed as the inner product between the feature appings Ψ(i) and Ψ(i ) of the input states i and i, respectivey. In turn, Eq. (6) shows that Ψ(i) represents a new feature apping that takes into account the oca graph structure of the MDP, since it is a inear cobination of Φ features of both the state i and a of the successor states j that can be reached in a singe step fro i (these are a the states j for which P µ ij are nonzero). Figure is a graphica depiction of the associated Bean kerne. Using the figure, we can interpret the associated Bean kerne as easuring the tota overap or siiarity between i, i, and a iediate (one-stage) successor states of i and i. In this sense, the associated Bean kerne autoaticay accounts for the oca graph structure in the state space. Coputationa Copexity The coputationa copexity of ode-based BRE is doinated by two factors: first, coputing K(i, i ) over every pair of sape states (i, i ) S S (this inforation is often caed the Gra atrix of the kerne), and second, soving the regression probe. As iustrated in Fig. (), coputing K(i, i ) invoves enuerating each successor state of both i and i and evauating the base kerne k(, ) for Fig. : Graph structure of the associated Bean kerne K(i, i ) [Eq. (9)]. The one-stage successor states of i and i are {j,..., j } and {j,..., j }, respectivey. The associated Bean kerne is fored by taking a inear cobination of the base kerne k(, ) appied to a possibe state pairings (shown by the dashed bue ines) where one state in the pair beongs to the set of i and its descendants (this set is shown by the eft shaded rectange), and the other beongs to the set of i and its descendants (right shaded rectange). each pair of successor states. If β is the average branching factor of the MDP, each state wi have β successor states on average, so coputing K(i, i ) for a singe pair of states (i, i ) invoves O(β 2 ) operations. Therefore, if n s S is the nuber of sape states, coputing the fu Gra atrix costs O(β 2 n 2 s) operations. The cost of soving the regression probe ceary depends on the regression technique used, but typica ethods such as Gaussian process regression invove soving a inear syste in the diension of the Gra atrix, which invoves O(n 3 s) operations. Thus, the tota copexity of BRE is of order O(β 2 n 2 s + n 3 s). III. MODEL-FREE BRE The BRE agorith presented in the previous section requires knowedge of the syste dynaics ode, encoded in the probabiities P µ ij, to evauate the poicy µ. These probabiities are used in the cacuation of the cost vaues g µ i [Eq. ()], the associated Bean kerne K(i, i ) [Eq. (0)], and the approxiate cost-to-go J µ (i) [Eq. ()]. In particuar, in order to sove the BRE regression probe given by Eq. (8), it is necessary to copute the vaues g µ i for i S, as we as the Gra atrix K of the associated Bean kerne, defined by K ii = K(i, i ) i, i S. By expoiting the ode inforation, BRE is abe to construct cost-to-go soutions Jµ (i) for which the Bean residuas are exacty zero. However, there ay be situations in which it is not possibe to use syste ode inforation directy. Ceary, one such situation is when an exact syste ode is unknown or unavaiabe. In this case, one typicay assues instead that a generative ode of the syste is avaiabe, which is a back box siuator that can be used to sape state trajectories of the syste under a given poicy. Aternativey, 448

5 one ay coect state trajectory data fro the actua syste in operation and use this data in ieu of an exact ode. To address these situations, we now deveop a variant of the BRE agorith that uses ony siuated trajectory data, obtained fro siuations or fro observing the rea syste, instead of using data fro the true underying syste ode P µ ij. This variant of BRE therefore represents a true, odefree, reinforceent earning agorith. The key idea behind ode-free BRE is to siuate a nuber of trajectories of ength n, starting fro each of the sape states in S, and use this inforation to buid stochastic approxiations to the cost vaues g µ i, kerne Gra atrix K, and cost-to-go J µ (i). We use the notation T iq to denote the th state encountered in the q th trajectory starting fro state i S, where ranges fro 0 to n (the ength of the trajectory), and q ranges fro to (the nuber of trajectories starting fro each state i S). Using the trajectory data T iq, stochastic approxiations of each of the iportant quantities necessary to carry out BRE can be fored. First, exaining Eq. (), notice that the cost vaues g µ i can be expressed as g µ i = P ij (µ(i))g(i, µ(i), j) g(t iq 0, µ(t iq 0 iq ), T ), (2) where the expectation over future states has been repaced by a Monte Caro estiator based on the trajectory data. In the iit of saping an infinite nuber of trajectories ( ), the approxiation given by Eq. (2) converges to the true vaue of g µ i. A siiar approxiation can be constructed for the associated Bean kerne by starting with Eq. (6): Ψ(i) = Φ(i) α P µ ij Φ(j) = Φ(i) αe j [Φ(j)] Φ(i) α Φ(T iq (3) Substituting Eq. (3) into Eq. (0) gives K(i, i ) Φ(i) α Φ(T iq ), Φ(i ) α q = Φ(T i q ). Expanding this expression using inearity of the dot product gives K(i, i ) Φ(i), Φ(i ) α ( ) Φ(T iq ) + Φ(T i q ), Φ(i) + α 2 2 q = Φ(T iq, Φ(T i q. Finay, substituting the definition of the kerne k(i, i ) = Φ(i), Φ(i ) gives K(i, i ) k(i, i ) α ( ) k(t iq, i ) + k(t i q, i) α 2 2 q = k(t iq, T i q ). (4) Again, in the iit of infinite saping, Eq. (4) converges to K(i, i ). Finay, an approxiation to J µ (i) [Eq. ()] is needed: J µ (i) = λ i k(i, i) α P µ i jk(j, i) i es = i e S λ i (k(i, i) αe j [k(j, i)]) i e S λ i ( k(i, i) α k(t i q, i) ). (5) The procedure for carrying out ode-free BRE can now be stated as foows: ) Using the generative ode of the MDP (or data coected fro the rea syste), siuate trajectories of ength n starting fro each of the sape states i S. Store this data in T iq. 2) Sove the regression probe [Eq. (8)] using the stochastic approxiations to g µ i and K(i, i ) (given by Eqs. (2) and (4), respectivey), and any kerne-based regression technique to find the coefficients {λ i } i S e. 3) Use the coefficients found in step 2 to copute a stochastic approxiation to the cost function J µ, given by Eq. (5). 4) Perfor a poicy iproveent step based on the cost function found in step 3, and return to step to evauate this new poicy. Coputationa Copexity The overa copexity of running ode-free is sti doinated by buiding the kerne Gra atrix and soving the regression probe, just as in ode-based BRE. Furtherore, the copexity of soving the regression probe is the sae between both ethods: O(n 3 s). However, in odefree BRE, coputing an eeent of the associated Bean kerne using Eq. (4) now requires O( 2 ) operations (where is the nuber of saped trajectories), instead of O(β 2 ) (where β is the average branching factor of the MDP) as in the ode-based case. In essence, in ode-free BRE, the successor states for each sape state i S (of which there are on average β) are approxiated using data fro siuated trajectories. Thus, coputing the fu Gra atrix requires O( 2 n 2 s) operations, and the tota copexity of ode-free BRE is of order O( 2 n 2 s + n 3 s). 449

6 Correctness of Mode-Free BRE The foowing theore and ea estabish two iportant properties of the ode-free BRE agorith presented in this section. Theore: In the iit of saping an infinite nuber of trajectories (i.e. ), the cost function J µ (i) coputed by ode-free BRE is identica to the cost function coputed by ode-based BRE. Proof: In order to approxiate the quantities g µ i, K(i, i ), and J µ (i) in the absence of ode inforation, ode-free BRE uses the stochastic approxiators given by Eqs. (2), (4), and (5), respectivey. Exaining these equations, note that in each, the state trajectory data T iq is used to for an epirica distribution ˆP µ ij = δ(t iq, j), where δ(i, j) is the Kronecker deta function. This distribution is used as a substitute for the true distribution P µ ij in coputing g µ i, K(i, i ), and J µ (i). Since by assuption the individua trajectories are independent, the rando variabes {T iq q =,..., } are independent and identicay distributed. Therefore, the aw of arge nubers states that the epirica distribution converges to the true distribution in the iit of an infinite nuber of sapes: i ˆP µ ij = P µ ij. Therefore, as, Eqs. (2), (4), and (5) converge to the true vaues g µ i, K(i, i ), and J µ (i). In particuar, the cost function J µ (i) coputed by ode-free BRE converges to the cost function coputed by ode-based BRE as, as desired. Using resuts shown in [], [2], which prove that odebased BRE coputes a cost function J µ (i) whose Bean residuas are exacty zero at the sape states S, we iediatey have the foowing ea: Lea: In the iit, ode-free BRE coputes a cost function J µ (i) whose Bean residuas are exacty zero at the sape states S. Proof: The theore showed that in the iit, ode-free BRE yieds the sae cost function J µ (i) as ode-based BRE. Therefore, appying the resuts fro [], [2], it iediatey foows that the Bean residuas J µ (i) are zero at the sape states. IV. RESULTS A further series of tests were carried out to copare the perforance of the BRE agoriths presented in this paper to LSPI [3], a we-known approxiate dynaic prograing agorith. LSPI uses a inear cobination of basis functions to represent approxiate Q-vaues of the MDP, and earns a weighting of these basis functions using siuated trajectory data. In earning the weights, LSPI can take advantage of syste ode inforation if it is avaiabe, or can be run in a purey ode-free setting if not. Thus, our tests copared the perforance of four different agoriths: ode-based BRE, ode-free BRE, ode-based LSPI, and ode-free LSPI. The LSPI ipeentation used for these tests is the one provided by the authors in [3]. The benchark probe used was the chain-wak probe [3], [7], which has a one-diensiona state space (we used a tota of 50 states in these tests) and two possibe actions ( ove eft and ove right ) in each state. In order to ake the coparisons between BRE and LSPI as siiar as possibe, the sae cost representation was used in both agoriths. More precisey, in LSPI, five radia basis functions (with standard deviation σ = 2), with centers at x =,, 2, 3, 4, were used; whie in BRE, the sae radia basis kerne (with the sae σ) was used and the sape states were taken as S = {,, 2, 3, 4}. This ensures that neither agorith gains an unfair advantage by being provided with a better set of basis functions. The agoriths were copared on two different perforance etrics: quaity of the approxiate poicy produced by each agorith (expressed as a percentage of states in which the approxiate poicy atches the optia poicy), and running tie of the agorith (easured in the tota nuber of eeentary operations required, such as additions and utipications). The poicies produced LSPI and the ode-free variant of BRE depend on the aount of siuated data provided to the agorith, so each of these agoriths was run with different aounts of siuated data to investigate how the aount of data ipacts the quaity of the resuting poicy. Furtherore, since the siuated data is randoy generated according to the generative ode, each agorith was run utipe ties to exaine the effect of this rando siuation noise on the agorith perforance. The resuts of the coparison tests are shown in Fig. 2. Mode-based BRE, shown as the fied bue diaond, finds the optia poicy. Furtherore, its running tie is faster than any other agorith in the test by at east one order of agnitude. In addition, it is the ony agorith that is free fro siuation noise, and it therefore consistenty finds the optia poicy every tie it is run, unike the other agoriths which ay yied different resuts over different runs. Thus, if a syste ode is avaiabe, ode-based BRE has a cear advantage over the other agoriths in the chainwak probe. LSPI aso found the optia poicy in a nuber of the tests, confiring siiar resuts that were reported in [3]. However, the aount of siuation data required to consistenty find a near-optia poicy was arge (between 5,000 and 0,000 sapes), eading to ong run ties. Indeed, for any of the LSPI variants that consistenty found poicies within 0% of optia, a were between two and four orders of agnitude sower than ode-based BRE. As the aount of siuation data is decreased in an attept to reduce the soution tie of LSPI, the quaity of the produced poicies becoes ower on-average and aso ore inconsistent across runs. For siuated data sets of ess than about,000, the poicy quaity approaches (and soeties drops beow) 50% of optia, indicating perforance equivaent to (or worse than) sipy guessing one of the two actions randoy. A- 450

7 Furtherore, experienta coparison of BRE against another we-known approach, LSPI, indicates that whie both approaches find near-optia poicies, both ode-based and ode-free BRE appears to have severa advantages over LSPI in the benchark probe we tested. In particuar, ode-based BRE appears to be abe to efficienty expoit knowedge of the syste ode to find the poicy significanty faster than LSPI. In addition, ode-based BRE is free of siuation noise, eiinating the probe of inconsistent resuts across different runs of the agorith. Even when ode inforation is not avaiabe, ode-free BRE sti finds near-optia poicies ore consistenty and ore quicky than LSPI. Fig. 2: Coparison of BRE vs. LSPI for the chain-wak probe. Nubers after the agorith naes denote the aount of siuation data used to train the agorith. Note that ode-based BRE does not require siuation data. owing LSPI access to the true syste ode does appear to iprove the perforance of the agorith sighty; in Fig. 2, ode-based LSPI generay yieds a higher quaity poicy than ode-free LSPI for a given data set size (athough there are severa exceptions to this). However, the resuts indicate the ode-based BRE is significanty ore efficient than ode-based LSPI at expoiting knowedge of the syste ode to reduce the coputation tie needed to find a good poicy. Finay, exaining the resuts for ode-free BRE, the figure shows that this agorith yieds consistenty good poicies, even when a sa aount of siuation data is used. As the aount of siuation data is reduced, the variabiity of the quaity of poicies produced increases and the average quaity decreases, as woud be expected. However, both of these effects are significanty saer than in the LSPI case. Indeed, the worst poicy produced by ode-free BRE is sti within 2% of optia, whie using ony 50 siuation data. In contrast, LSPI exhibited a uch greater variabiity and uch ower average poicy quaity when the aount of siuation data was reduced. V. CONCLUSION This paper has presented a ode-free variant of the BRE approach to approxiate dynaic prograing. Instead of reying on knowedge of the syste ode, the ode-free variant uses trajectory siuations or data fro the rea syste to buid stochastic approxiations to the cost vaues g µ i and kerne Gra atrix K needed to carry out BRE. It is straightforward to show that in the iit of carrying out an infinite nuber of siuations, this approach reduces to the ode-based BRE approach, and thus enjoys the sae theoretica properties of that approach (incuding the iportant fact that the Bean residuas are identicay zero at the sape states S. ACKNOWLEDGMENTS Research supported by the Boeing Copany, Phanto Works, Seatte; AFOSR grant FA ; and the Hertz Foundation. REFERENCES [] B. Bethke and J. How, Approxiate dynaic prograing using Bean residua eiination and Gaussian process regression, in Proceedings of the Aerican Contro Conference, St. Louis, MO, [2] B. Bethke, J. How, and A. Ozdagar, Approxiate dynaic prograing using support vector regression, in Proceedings of the 2008 IEEE Conference on Decision and Contro, Cancun, Mexico, [3] M. Lagoudakis and R. Parr, Least-squares poicy iteration, Journa of Machine Learning Research, vo. 4, pp , [4] D. Bertsekas, Dynaic Prograing and Optia Contro. Beont, MA: Athena Scientific, [5] D. Bertsekas and J. Tsitsikis, Neuro-Dynaic Prograing. Beont, MA: Athena Scientific, 996. [6] A. Soa and B. Schökopf, A tutoria on support vector regression, Statistics and Coputing, vo. 4, pp , [7] C. Rasussen and C. Wiias, Gaussian Processes for Machine Learning. MIT Press, Cabridge, MA, [8] Y. Enge, Agoriths and representations for reinforceent earning, Ph.D. dissertation, Hebrew University, [9] M. Deisenroth, J. Peters, and C. Rasussen, Approxiate dynaic prograing with Gaussian processes, in Proceedings of the Aerican Contro Conference, [0] J. Si, A. Barto, W. Powe, and D. Wunsch, Learning and Approxiate Dynaic Prograing. NY: IEEE Press, [Onine]. Avaiabe: [] B. Bethke, J. How, and J. Vian, Muti-UAV persistent surveiance with counication constraints and heath anageent, in Proceedings of the AIAA Guidance, Navigation and Contro Conference, Chicago, IL, August [2] P. Schweitzer and A. Seidan, Generaized poynoia approxiation in Markovian decision processes, Journa of atheatica anaysis and appications, vo. 0, pp , 985. [3] L. C. Baird, Residua agoriths: Reinforceent earning with function approxiation. in ICML, 995, pp [4] R. Munos and C. Szepesvári, Finite-tie bounds for fitted vaue iteration, Journa of Machine Learning Research, vo., pp , [5] N. Aronszajn, Theory of reproducing kernes, Transactions of the Aerican Matheatica Society, vo. 68, pp , 950. [6] B. Schökopf and A. Soa, Learning with Kernes: Support Vector Machines, Reguarization, Optiization, and Beyond. MIT Press, Cabridge, MA, [7] D. Koer and R. Parr, Poicy iteration for factored MDPs, in UAI, C. Boutiier and M. Godszidt, Eds. Morgan Kaufann, 2000, pp

A New Method of Transductive SVM-Based Network Intrusion Detection

A New Method of Transductive SVM-Based Network Intrusion Detection A New Method of Transductive SVM-Based Network Intrusion Detection Manfu Yan and Zhifang Liu 2 Departent of Matheatics, Tangshan Teacher s Coege, Tangshan Hebei, China 3005@tstc.edu.cn 2 Network Technoogy

More information

Research on the Nonlinear Governor of Diesel Engine with Variable Structure Control Theory

Research on the Nonlinear Governor of Diesel Engine with Variable Structure Control Theory Research on the Noninear Governor of Diese Engine with Variabe Structure Contro Theory Xiao-Bing Mao schoo of energy and power engineering wuhan university of technoogy Wuhan, China aoxiaobing@.co Kai-Sheng

More information

TRACKING CONTROL FOR WHEELED MOBILE ROBOTS USING NEURAL NETWORK MODEL ALGORITHM CONTROL

TRACKING CONTROL FOR WHEELED MOBILE ROBOTS USING NEURAL NETWORK MODEL ALGORITHM CONTROL Journa of Theoretica and Appied Inforation Technoogy 3 st Deceber. Vo. 46 No. 5 - JATIT & LLS. A rights reserved. ISSN: 99-8645 www.jatit.org E-ISSN: 87-395 TRACKING CONTROL FOR WHEELED OBILE ROBOTS USING

More information

Transforms, Convolutions, and Windows on the Discrete Domain

Transforms, Convolutions, and Windows on the Discrete Domain Chapter 3 Transfors, Convoutions, and Windows on the Discrete Doain 3. Introduction The previous two chapters introduced Fourier transfors of functions of the periodic and nonperiodic types on the continuous

More information

A REGULARIZED GMRES METHOD FOR INVERSE BLACKBODY RADIATION PROBLEM

A REGULARIZED GMRES METHOD FOR INVERSE BLACKBODY RADIATION PROBLEM Wu, J., Ma. Z.: A Reguarized GMRES Method for Inverse Backbody Radiation Probe THERMAL SCIENCE: Year 3, Vo. 7, No. 3, pp. 847-85 847 A REGULARIZED GMRES METHOD FOR INVERSE BLACKBODY RADIATION PROBLEM by

More information

An Approach to Worst-Case Circuit Analysis

An Approach to Worst-Case Circuit Analysis Eena Nicuescu, Dorina-Mioara Purcaru and Mariuscristian Nicuescu An Approach to Worst-Case Circuit Anaysis ELENA NICULESCU*, DORINA-MIOARA PURCARU* and MARIUS- CRISTIAN NICULESCU** Eectronics and Instruentation

More information

Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression

Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression The MIT Faculty has made this article openly available. Please share how this access benefits you. Your

More information

A Simple Framework of Conservative Algorithms for the Coupled Nonlinear Schrödinger Equations with Multiply Components

A Simple Framework of Conservative Algorithms for the Coupled Nonlinear Schrödinger Equations with Multiply Components Coun. Theor. Phys. 61 (2014) 703 709 Vo. 61, o. 6, June 1, 2014 A Sipe Fraework of Conservative Agoriths for the Couped oninear Schrödinger Equations with Mutipy Coponents QIA u ( ), 1,2, SOG Song-He (

More information

Involutions and representations of the finite orthogonal groups

Involutions and representations of the finite orthogonal groups Invoutions and representations of the finite orthogona groups Student: Juio Brau Advisors: Dr. Ryan Vinroot Dr. Kaus Lux Spring 2007 Introduction A inear representation of a group is a way of giving the

More information

SPEECH RECOGNITION USING LPC AND HMM APPLIED FOR CONTROLLING MOVEMENT OF MOBILE ROBOT

SPEECH RECOGNITION USING LPC AND HMM APPLIED FOR CONTROLLING MOVEMENT OF MOBILE ROBOT Seinar asiona Teknoogi Inforasi 200 SPEECH RECOGITIO USIG LPC AD HMM APPLIED FOR COTROLLIG MOVEMET OF MOBILE ROBOT Thiang ) Wanto ) ) Eectrica Engineering Departent Petra Christian university Siwaankerto

More information

Identites and properties for associated Legendre functions

Identites and properties for associated Legendre functions Identites and properties for associated Legendre functions DBW This note is a persona note with a persona history; it arose out off y incapacity to find references on the internet that prove reations that

More information

Factorizations of Invertible Symmetric Matrices over Polynomial Rings with Involution

Factorizations of Invertible Symmetric Matrices over Polynomial Rings with Involution Goba Journa of Pure and Appied Matheatics ISSN 0973-1768 Voue 13 Nuber 10 (017) pp 7073-7080 Research India Pubications http://wwwripubicationco Factorizations of Invertibe Syetric Matrices over Poynoia

More information

AN INVESTIGATION ON SEISMIC ANALYSIS OF SHALLOW TUNEELS IN SOIL MEDIUM

AN INVESTIGATION ON SEISMIC ANALYSIS OF SHALLOW TUNEELS IN SOIL MEDIUM The 4 th October -7, 8, Beijing, China AN INVESTIGATION ON SEISMIC ANALYSIS OF SHALLOW TUNEELS IN SOIL MEDIUM J. Boouri Bazaz and V. Besharat Assistant Professor, Dept. of Civi Engineering, Ferdowsi University,

More information

A complete set of ladder operators for the hydrogen atom

A complete set of ladder operators for the hydrogen atom A copete set of adder operators for the hydrogen ato C. E. Burkhardt St. Louis Counity Coege at Forissant Vaey 3400 Persha Road St. Louis, MO 6335-499 J. J. Leventha Departent of Physics University of

More information

Distances in random graphs with finite mean and infinite variance degrees

Distances in random graphs with finite mean and infinite variance degrees Distances in rando graphs with finite ean and infinite variance degrees Citation for pubished version APA: Hofstad, van der, R. W., Hooghiestra, G., & Znaensi, D. 2007. Distances in rando graphs with finite

More information

Performance Evaluation of Space-Time Block Coding Using a Realistic Mobile Radio Channel Model *

Performance Evaluation of Space-Time Block Coding Using a Realistic Mobile Radio Channel Model * Perforance Evauation of Space-Tie Bock Coding Using a Reaistic obie Radio Channe ode H. Carrasco Espinosa, J.. egado Penín Javier R. Fonoosa epartent of Signa Theory Counications Universitat Poitècnica

More information

On Unbounded Delays in Asynchronous Parallel Fixed-Point Algorithms

On Unbounded Delays in Asynchronous Parallel Fixed-Point Algorithms On Unbounded Deays in Asynchronous Parae Fixed-Point Agoriths Robert Hannah and Wotao Yin Departent of Matheatics, University of Caifornia, Los Angees, CA 90095, USA Septeber 3, 206 Abstract The need for

More information

Multi-events Earthquake Early Warning algorithm

Multi-events Earthquake Early Warning algorithm subitted to Geophys. J. Int. Muti-events Earthquake Eary Warning agorith using a Bayesian approach S. Wu 1, M. Yaada 2, K. Taaribuchi 3 and J.L. Beck 1 1 Caifornia Institute of Technoogy, Pasadena, Caifornia,

More information

AN ANALYTICAL ESTIMATION OF THE CORIOLIS METER'S CHARACTERISTICS BASED ON MODAL SUPERPOSITION. J. Kutin *, I. Bajsić

AN ANALYTICAL ESTIMATION OF THE CORIOLIS METER'S CHARACTERISTICS BASED ON MODAL SUPERPOSITION. J. Kutin *, I. Bajsić Fow Measureent and Instruentation 1 (00) 345 351 doi:10.1016/s0955-5986(0)00006-7 00 Esevier Science Ltd. AN ANALYTICAL ESTIMATION OF THE CORIOLIS METER'S CHARACTERISTICS BASED ON MODAL SUPERPOSITION J.

More information

AN IMPROVED DOA ESTIMATION ALGORITHM FOR ASYNCHRONOUS MULTIPATH CDMA SYSTEM 1

AN IMPROVED DOA ESTIMATION ALGORITHM FOR ASYNCHRONOUS MULTIPATH CDMA SYSTEM 1 Vo.23 No. JOURNA OF EERONIS (INA January 26 AN IMPROVED DOA ESIMAION AGORIM FOR ASYNRONOUS MUIPA DMA SYSEM Yang Wei hen Junshi an Zhenhui (Schoo of Eectronics and Info. Eng. Beijing Jiaotong University

More information

Part B: Many-Particle Angular Momentum Operators.

Part B: Many-Particle Angular Momentum Operators. Part B: Man-Partice Anguar Moentu Operators. The coutation reations deterine the properties of the anguar oentu and spin operators. The are copete anaogous: L, L = i L, etc. L = L ± il ± L = L L L L =

More information

Statistical Learning Theory: A Primer

Statistical Learning Theory: A Primer Internationa Journa of Computer Vision 38(), 9 3, 2000 c 2000 uwer Academic Pubishers. Manufactured in The Netherands. Statistica Learning Theory: A Primer THEODOROS EVGENIOU, MASSIMILIANO PONTIL AND TOMASO

More information

Separation of Variables and a Spherical Shell with Surface Charge

Separation of Variables and a Spherical Shell with Surface Charge Separation of Variabes and a Spherica She with Surface Charge In cass we worked out the eectrostatic potentia due to a spherica she of radius R with a surface charge density σθ = σ cos θ. This cacuation

More information

APPENDIX B. Some special functions in low-frequency seismology. 2l +1 x j l (B.2) j l = j l 1 l +1 x j l (B.3) j l = l x j l j l+1

APPENDIX B. Some special functions in low-frequency seismology. 2l +1 x j l (B.2) j l = j l 1 l +1 x j l (B.3) j l = l x j l j l+1 APPENDIX B Soe specia functions in ow-frequency seisoogy 1. Spherica Besse functions The functions j and y are usefu in constructing ode soutions in hoogeneous spheres (fig B1). They satisfy d 2 j dr 2

More information

A Brief Introduction to Markov Chains and Hidden Markov Models

A Brief Introduction to Markov Chains and Hidden Markov Models A Brief Introduction to Markov Chains and Hidden Markov Modes Aen B MacKenzie Notes for December 1, 3, &8, 2015 Discrete-Time Markov Chains You may reca that when we first introduced random processes,

More information

HOW OFTEN SHOULD YOU CLEAN YOUR ROOM? INTRODUCTION

HOW OFTEN SHOULD YOU CLEAN YOUR ROOM? INTRODUCTION HOW OFTEN SHOULD YOU CLEAN YOUR ROOM? KIMBALL MARTIN AND KRISHNAN SHANKAR ABSTRACT. We introduce and study a cobinatoria optiization probe otivated by the question in the tite. In the sipe case where you

More information

MC-CDMA Systems: a General Framework for Performance Evaluation with Linear Equalization

MC-CDMA Systems: a General Framework for Performance Evaluation with Linear Equalization C-CDA Systes: a Genera Fraework for Perforance Evauation with Linear Equaization Barbara. asini 1, Favio Zabini 1 and Andrea Conti 1, 6 1 IEIIT/CNR, WiLab and University of Boogna ENDIF, University of

More information

Energy Efficient WiFi-based Fingerprinting for Indoor Positioning with Smartphones

Energy Efficient WiFi-based Fingerprinting for Indoor Positioning with Smartphones Energy Efficient WiFi-based Fingerprinting for Indoor Positioning with Sartphones Igor Bisio Fabio avagetto ario archese Andrea Sciarrone Dept of eecounication Eectronic Eectric Engineering and Nava Architectures

More information

Design of a Single Input Fuzzy Logic Controller Based SVC for Dynamic Performance Enhancement of Power Systems

Design of a Single Input Fuzzy Logic Controller Based SVC for Dynamic Performance Enhancement of Power Systems Design of a Singe Input Fuzzy Logic Controer Based SC for Dynaic Perforance Enhanceent of Power Systes DR.D. PADMA SUBRAMANIAN PROFESSOR AND DEAN, DEPARTMENT OF ELECTRICAL AND ELECTRONICS ENGINEERING AMET

More information

Translation to Bundle Operators

Translation to Bundle Operators Syetry, Integrabiity and Geoetry: Methods and Appications Transation to Bunde Operators SIGMA 3 7, 1, 14 pages Thoas P. BRANSON and Doojin HONG Deceased URL: http://www.ath.uiowa.edu/ branson/ Departent

More information

A. Distribution of the test statistic

A. Distribution of the test statistic A. Distribution of the test statistic In the sequentia test, we first compute the test statistic from a mini-batch of size m. If a decision cannot be made with this statistic, we keep increasing the mini-batch

More information

Modified Gaussian Sum Filtering Methods for INS/GPS Integration

Modified Gaussian Sum Filtering Methods for INS/GPS Integration Journa of Goba Positioning Systes (27) Vo.6, No.: 65-73 Modified Gaussian Su Fitering Methods for INS/GPS Integration Yukihiro Kubo, akuya Sato and Sueo Sugioto Departent of Eectrica and Eectronic Engineering,

More information

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network An Agorithm for Pruning Redundant Modues in Min-Max Moduar Network Hui-Cheng Lian and Bao-Liang Lu Department of Computer Science and Engineering, Shanghai Jiao Tong University 1954 Hua Shan Rd., Shanghai

More information

Session : Electrodynamic Tethers

Session : Electrodynamic Tethers Session : Eectrodynaic Tethers Eectrodynaic tethers are ong, thin conductive wires depoyed in space that can be used to generate power by reoving kinetic energy fro their orbita otion, or to produce thrust

More information

Stochastic Automata Networks (SAN) - Modelling. and Evaluation. Paulo Fernandes 1. Brigitte Plateau 2. May 29, 1997

Stochastic Automata Networks (SAN) - Modelling. and Evaluation. Paulo Fernandes 1. Brigitte Plateau 2. May 29, 1997 Stochastic utomata etworks (S) - Modeing and Evauation Pauo Fernandes rigitte Pateau 2 May 29, 997 Institut ationa Poytechnique de Grenobe { IPG Ecoe ationae Superieure d'informatique et de Mathematiques

More information

arxiv: v1 [math.ca] 6 Mar 2017

arxiv: v1 [math.ca] 6 Mar 2017 Indefinite Integras of Spherica Besse Functions MIT-CTP/487 arxiv:703.0648v [math.ca] 6 Mar 07 Joyon K. Boomfied,, Stephen H. P. Face,, and Zander Moss, Center for Theoretica Physics, Laboratory for Nucear

More information

11 - KINETIC THEORY OF GASES Page 1. The constituent particles of the matter like atoms, molecules or ions are in continuous motion.

11 - KINETIC THEORY OF GASES Page 1. The constituent particles of the matter like atoms, molecules or ions are in continuous motion. - KIETIC THEORY OF GASES Page Introduction The constituent partices of the atter ike atos, oecues or ions are in continuous otion. In soids, the partices are very cose and osciate about their ean positions.

More information

Kernel-Based Reinforcement Learning Using Bellman Residual Elimination

Kernel-Based Reinforcement Learning Using Bellman Residual Elimination Journal of Machine Learning Research () Submitted ; Published Kernel-Based Reinforcement Learning Using Bellman Residual Elimination Brett Bethke Department of Aeronautics and Astronautics Massachusetts

More information

Unimodality and Log-Concavity of Polynomials

Unimodality and Log-Concavity of Polynomials Uniodaity and Log-Concavity of Poynoias Jenny Avarez University of Caifornia Santa Barbara Leobardo Rosaes University of Caifornia San Diego Agst 10, 2000 Mige Aadis Nyack Coege New York Abstract A poynoia

More information

https://doi.org/ /epjconf/

https://doi.org/ /epjconf/ HOW TO APPLY THE OPTIMAL ESTIMATION METHOD TO YOUR LIDAR MEASUREMENTS FOR IMPROVED RETRIEVALS OF TEMPERATURE AND COMPOSITION R. J. Sica 1,2,*, A. Haefee 2,1, A. Jaai 1, S. Gamage 1 and G. Farhani 1 1 Department

More information

Cryptanalysis of PKP: A New Approach

Cryptanalysis of PKP: A New Approach Cryptanaysis of PKP: A New Approach Éiane Jaumes and Antoine Joux DCSSI 18, rue du Dr. Zamenhoff F-92131 Issy-es-Mx Cedex France eiane.jaumes@wanadoo.fr Antoine.Joux@ens.fr Abstract. Quite recenty, in

More information

Combinatorially Derived Properties of Young Tableaux

Combinatorially Derived Properties of Young Tableaux Coege of Wiia and Mary W&M Pubish Coege of Wiia & Mary Undergraduate Honors Theses Theses, Dissertations, & Master Projects 3-2014 Cobinatoriay Derived Properties of Young Tabeaux Jaes R. Janopau-Nayor

More information

SIGNALS are inherently defined on the sphere in a variety

SIGNALS are inherently defined on the sphere in a variety An Optia-Diensionaity Saping Schee on the Sphere with Fast Spherica Haronic Transfors Zubair Khaid, Meber, IEEE, Rodney A. Kennedy, Feow, IEEE, and Jason D. McEwen, Meber, IEEE arxiv:43.466v [cs.it] 3

More information

Dual-beard sampling method for fibre length measurements

Dual-beard sampling method for fibre length measurements Indian Journa of Fibre & Textie Research Vo. 39, March 14, pp. 7-78 Dua-beard saping ethod for fibre ength easureents H Y Wu & F M Wang a Coege of Texties, Donghua University, Shanghai, China Received

More information

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES Separation of variabes is a method to sove certain PDEs which have a warped product structure. First, on R n, a inear PDE of order m is

More information

Efficient linear precoding for massive MIMO systems using truncated polynomial expansion

Efficient linear precoding for massive MIMO systems using truncated polynomial expansion Efficient inear precoding for assive MIMO systes using truncated poynoia expansion Axe Müer, Aba Kaoun, Ei Björnson, Mérouane Debbah To cite this version: Axe Müer, Aba Kaoun, Ei Björnson, Mérouane Debbah.

More information

CS229 Lecture notes. Andrew Ng

CS229 Lecture notes. Andrew Ng CS229 Lecture notes Andrew Ng Part IX The EM agorithm In the previous set of notes, we taked about the EM agorithm as appied to fitting a mixture of Gaussians. In this set of notes, we give a broader view

More information

Family of convergent numerical schemes for the incompressible Navier-Stokes equations

Family of convergent numerical schemes for the incompressible Navier-Stokes equations Faiy of convergent nuerica schees for the incopressibe Navier-Stokes equations Robert Eyard, Pierre Feron, Cindy Guichard To cite this version: Robert Eyard, Pierre Feron, Cindy Guichard. Faiy of convergent

More information

Cancelling out systematic uncertainties

Cancelling out systematic uncertainties Mon. Not. R. Astron. Soc. 419, 1040 1050 (2012) doi:10.1111/.1365-2966.2011.19761.x Canceing out systeatic uncertainties Jorge Noreña, 1 Licia Verde, 1 Rau Jienez, 1 Caros Peña-Garay 2 and Cesar Goez 3

More information

Design and System Modeling of a Tri-Axial Microaccelerometer Using Piezoelectric Thin Films

Design and System Modeling of a Tri-Axial Microaccelerometer Using Piezoelectric Thin Films Ferroeectrics, 85:, 69 74, 009 Reprints avaiae directy fro the puisher DOI:.80/00509090888764 URL: http://dx.doi.org/.80/00509090888764 69 Jyh-Cheng Yu et a. 009 Tayor & Francis ISSN: 005-09 print / 56-5

More information

Statistical Learning Theory: a Primer

Statistical Learning Theory: a Primer ??,??, 1 6 (??) c?? Kuwer Academic Pubishers, Boston. Manufactured in The Netherands. Statistica Learning Theory: a Primer THEODOROS EVGENIOU AND MASSIMILIANO PONTIL Center for Bioogica and Computationa

More information

a l b l m m w2 b l b G[ u/ R] C

a l b l m m w2 b l b G[ u/ R] C Degree Probes for ence Graph Graars K. Skodinis y E. Wanke z Abstract The copexity of the bounded degree probe is anayzed for graph anguages generated by ence graph graars. In particuar, the bounded degree

More information

<C 2 2. λ 2 l. λ 1 l 1 < C 1

<C 2 2. λ 2 l. λ 1 l 1 < C 1 Teecommunication Network Contro and Management (EE E694) Prof. A. A. Lazar Notes for the ecture of 7/Feb/95 by Huayan Wang (this document was ast LaT E X-ed on May 9,995) Queueing Primer for Muticass Optima

More information

From Margins to Probabilities in Multiclass Learning Problems

From Margins to Probabilities in Multiclass Learning Problems From Margins to Probabiities in Muticass Learning Probems Andrea Passerini and Massimiiano Ponti 2 and Paoo Frasconi 3 Abstract. We study the probem of muticass cassification within the framework of error

More information

Experimental Measurement of Magnetization of a Rectangular Bar-shaped Permanent Magnet Utilizing MFMIM

Experimental Measurement of Magnetization of a Rectangular Bar-shaped Permanent Magnet Utilizing MFMIM ostafa Kair et a. Experienta easureent of agnetization of Origina Artice Experienta easureent of agnetization of a Rectanguar Bar-shaped Peranent agnet Utiizing FI. Kair,. R. ashayehi, S. Aftai 3,. Khodajou-Choai

More information

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

e-companion ONLY AVAILABLE IN ELECTRONIC FORM OPERATIONS RESEARCH doi 10.1287/opre.1070.0427ec pp. ec1 ec5 e-copanion ONLY AVAILABLE IN ELECTRONIC FORM infors 07 INFORMS Electronic Copanion A Learning Approach for Interactive Marketing to a Custoer

More information

Development of Truss Equations

Development of Truss Equations MANE & CIV Introuction to Finite Eeents Prof. Suvranu De Deveopent of Truss Equations Reaing assignent: Chapter : Sections.-.9 + ecture notes Suar: Stiffness atri of a bar/truss eeent Coorinate transforation

More information

Two view learning: SVM-2K, Theory and Practice

Two view learning: SVM-2K, Theory and Practice Two view earning: SVM-2K, Theory and Practice Jason D.R. Farquhar jdrf99r@ecs.soton.ac.uk Hongying Meng hongying@cs.york.ac.uk David R. Hardoon drh@ecs.soton.ac.uk John Shawe-Tayor jst@ecs.soton.ac.uk

More information

FORECASTING TELECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODELS

FORECASTING TELECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODELS FORECASTING TEECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODES Niesh Subhash naawade a, Mrs. Meenakshi Pawar b a SVERI's Coege of Engineering, Pandharpur. nieshsubhash15@gmai.com

More information

Computional solutions of a family of generalized Procrustes problems. P. Benner, J. Fankhänel

Computional solutions of a family of generalized Procrustes problems. P. Benner, J. Fankhänel Coputiona soutions of a faiy of generaized Procrustes probes P. Benner, J. Fankhäne Preprint 014-6 Preprintreihe der Fakutät für Matheatik ISSN 1614-8835 Contents 1 Introduction 5 The (`p; `q) Procrustes

More information

Non-Parametric Non-Line-of-Sight Identification 1

Non-Parametric Non-Line-of-Sight Identification 1 Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton,

More information

Likelihood-Based Inference for Discretely Observed Birth Death-Shift Processes, with Applications to Evolution of Mobile Genetic Elements

Likelihood-Based Inference for Discretely Observed Birth Death-Shift Processes, with Applications to Evolution of Mobile Genetic Elements Bioetrics 71, 1009 1021 Deceber 2015 DOI: 10.1111/bio.12352 Likeihood-Based Inference for Discretey Observed Birth Death-Shift Processes, with Appications to Evoution of Mobie Genetic Eeents Jason Xu,

More information

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents MARKOV CHAINS AND MARKOV DECISION THEORY ARINDRIMA DATTA Abstract. In this paper, we begin with a forma introduction to probabiity and expain the concept of random variabes and stochastic processes. After

More information

Some Measures for Asymmetry of Distributions

Some Measures for Asymmetry of Distributions Some Measures for Asymmetry of Distributions Georgi N. Boshnakov First version: 31 January 2006 Research Report No. 5, 2006, Probabiity and Statistics Group Schoo of Mathematics, The University of Manchester

More information

A simple reliability block diagram method for safety integrity verification

A simple reliability block diagram method for safety integrity verification Reiabiity Engineering and System Safety 92 (2007) 1267 1273 www.esevier.com/ocate/ress A simpe reiabiity bock diagram method for safety integrity verification Haitao Guo, Xianhui Yang epartment of Automation,

More information

Consistent community detection in multi-relational data through restricted multi-layer stochastic blockmodel

Consistent community detection in multi-relational data through restricted multi-layer stochastic blockmodel Eectronic Journa of Statistics Vo. 10 016 3807 3870 ISS: 1935-754 DOI: 10.114/16-EJS111 Consistent counity detection in uti-reationa data through restricted uti-ayer stochastic bockode Subhadeep Pau and

More information

Nonlinear Control Structure of Grid Connected Modular Multilevel Converters

Nonlinear Control Structure of Grid Connected Modular Multilevel Converters Downoaded fro vbn.aau.dk on: anuar 15, 019 Aaborg Universitet Noninear Contro Structure of Grid Connected Moduar Mutieve Converters Haizadeh, Ain; Noru, Lars; Ahadpour Sha, Aireza Pubished in: Renewabe

More information

MULTI-PERIOD MODEL FOR PART FAMILY/MACHINE CELL FORMATION. Objectives included in the multi-period formulation

MULTI-PERIOD MODEL FOR PART FAMILY/MACHINE CELL FORMATION. Objectives included in the multi-period formulation ationa Institute of Technoogy aicut Department of echanica Engineering ULTI-PERIOD ODEL FOR PART FAILY/AHIE ELL FORATIO Given a set of parts, processing requirements, and avaiabe resources The objective

More information

12.2. Maxima and Minima. Introduction. Prerequisites. Learning Outcomes

12.2. Maxima and Minima. Introduction. Prerequisites. Learning Outcomes Maima and Minima 1. Introduction In this Section we anayse curves in the oca neighbourhood of a stationary point and, from this anaysis, deduce necessary conditions satisfied by oca maima and oca minima.

More information

Development of an Acoustic Instrument for Bubble Size Distribution Measurement

Development of an Acoustic Instrument for Bubble Size Distribution Measurement Deveopent of an Acoustic Instruent for Bubbe Size Distribution Measureent Xiongjun Wu 1* and Georges L. Chahine 1 1 DYNAFLOW, INC. Jessup, Maryand * E-ai: wxj@dynafow-inc.co ABSTRACT Measureent of bubbe

More information

V.B The Cluster Expansion

V.B The Cluster Expansion V.B The Custer Expansion For short range interactions, speciay with a hard core, it is much better to repace the expansion parameter V( q ) by f(q ) = exp ( βv( q )) 1, which is obtained by summing over

More information

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with?

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with? Bayesian Learning A powerfu and growing approach in machine earning We use it in our own decision making a the time You hear a which which coud equay be Thanks or Tanks, which woud you go with? Combine

More information

In-plane shear stiffness of bare steel deck through shell finite element models. G. Bian, B.W. Schafer. June 2017

In-plane shear stiffness of bare steel deck through shell finite element models. G. Bian, B.W. Schafer. June 2017 In-pane shear stiffness of bare stee deck through she finite eement modes G. Bian, B.W. Schafer June 7 COLD-FORMED STEEL RESEARCH CONSORTIUM REPORT SERIES CFSRC R-7- SDII Stee Diaphragm Innovation Initiative

More information

arxiv:math/ v1 [math.pr] 28 Feb 2005

arxiv:math/ v1 [math.pr] 28 Feb 2005 Distances in rando graphs with finite ean and infinite variance degrees arxiv:ath/0502581v1 [ath.pr] 28 Feb 2005 Reco van der Hofstad Gerard Hooghiestra and Ditri Znaensi June 5, 2018 Abstract In this

More information

Multi-Dimensional Hegselmann-Krause Dynamics

Multi-Dimensional Hegselmann-Krause Dynamics Multi-Diensional Hegselann-Krause Dynaics A. Nedić Industrial and Enterprise Systes Engineering Dept. University of Illinois Urbana, IL 680 angelia@illinois.edu B. Touri Coordinated Science Laboratory

More information

Optimality of Inference in Hierarchical Coding for Distributed Object-Based Representations

Optimality of Inference in Hierarchical Coding for Distributed Object-Based Representations Optimaity of Inference in Hierarchica Coding for Distributed Object-Based Representations Simon Brodeur, Jean Rouat NECOTIS, Département génie éectrique et génie informatique, Université de Sherbrooke,

More information

Multilayer Kerceptron

Multilayer Kerceptron Mutiayer Kerceptron Zotán Szabó, András Lőrincz Department of Information Systems, Facuty of Informatics Eötvös Loránd University Pázmány Péter sétány 1/C H-1117, Budapest, Hungary e-mai: szzoi@csetehu,

More information

Asynchronous Control for Coupled Markov Decision Systems

Asynchronous Control for Coupled Markov Decision Systems INFORMATION THEORY WORKSHOP (ITW) 22 Asynchronous Contro for Couped Marov Decision Systems Michae J. Neey University of Southern Caifornia Abstract This paper considers optima contro for a coection of

More information

6.2 Grid Search of Chi-Square Space

6.2 Grid Search of Chi-Square Space 6.2 Grid Search of Chi-Square Space exaple data fro a Gaussian-shaped peak are given and plotted initial coefficient guesses are ade the basic grid search strateg is outlined an actual anual search is

More information

V.B The Cluster Expansion

V.B The Cluster Expansion V.B The Custer Expansion For short range interactions, speciay with a hard core, it is much better to repace the expansion parameter V( q ) by f( q ) = exp ( βv( q )), which is obtained by summing over

More information

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA)

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA) 1 FRST 531 -- Mutivariate Statistics Mutivariate Discriminant Anaysis (MDA) Purpose: 1. To predict which group (Y) an observation beongs to based on the characteristics of p predictor (X) variabes, using

More information

Related Topics Maxwell s equations, electrical eddy field, magnetic field of coils, coil, magnetic flux, induced voltage

Related Topics Maxwell s equations, electrical eddy field, magnetic field of coils, coil, magnetic flux, induced voltage Magnetic induction TEP Reated Topics Maxwe s equations, eectrica eddy fied, magnetic fied of cois, coi, magnetic fux, induced votage Principe A magnetic fied of variabe frequency and varying strength is

More information

Consistent linguistic fuzzy preference relation with multi-granular uncertain linguistic information for solving decision making problems

Consistent linguistic fuzzy preference relation with multi-granular uncertain linguistic information for solving decision making problems Consistent inguistic fuzzy preference reation with muti-granuar uncertain inguistic information for soving decision making probems Siti mnah Binti Mohd Ridzuan, and Daud Mohamad Citation: IP Conference

More information

THE STOCHSTIC CELL TRANSIMMISION MODEL BASED DYNAMIC JOURNEY TIME RELIABILITY ANALYSIS

THE STOCHSTIC CELL TRANSIMMISION MODEL BASED DYNAMIC JOURNEY TIME RELIABILITY ANALYSIS THE STOCHSTIC CELL TRANSIMMISION MODEL BASED DYNAMIC JOURNEY TIME RELIABILITY ANALYSIS Tianu PAN, Agachai SUMALEE 2, and Wiia H.K. La 3 Departent of Civi and Structura Engineering The Hong Kong Poytechnic

More information

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization Recent Researches in Coputer Science Support Vector Machine Classification of Uncertain and Ibalanced data using Robust Optiization RAGHAV PAT, THEODORE B. TRAFALIS, KASH BARKER School of Industrial Engineering

More information

Wave Motion: revision. Professor Guy Wilkinson Trinity Term 2014

Wave Motion: revision. Professor Guy Wilkinson Trinity Term 2014 Wave Motion: revision Professor Gu Wiinson gu.wiinson@phsics.o.a.u Trinit Ter 4 Introduction Two ectures to reind ourseves of what we earned ast ter Wi restrict discussion to the topics on the sabus Wi

More information

International Scientific and Technological Conference EXTREME ROBOTICS October 8-9, 2015, Saint-Petersburg, Russia

International Scientific and Technological Conference EXTREME ROBOTICS October 8-9, 2015, Saint-Petersburg, Russia International Scientific and Technological Conference EXTREME ROBOTICS October 8-9, 215, Saint-Petersburg, Russia LEARNING MOBILE ROBOT BASED ON ADAPTIVE CONTROLLED MARKOV CHAINS V.Ya. Vilisov University

More information

Pattern Recognition and Machine Learning. Learning and Evaluation for Pattern Recognition

Pattern Recognition and Machine Learning. Learning and Evaluation for Pattern Recognition Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2017 Lesson 1 4 October 2017 Outline Learning and Evaluation for Pattern Recognition Notation...2 1. The Pattern Recognition

More information

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair Proceedings of the 6th SEAS International Conference on Siulation, Modelling and Optiization, Lisbon, Portugal, Septeber -4, 006 0 A Siplified Analytical Approach for Efficiency Evaluation of the eaving

More information

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE Proceeding of the ASME 9 International Manufacturing Science and Engineering Conference MSEC9 October 4-7, 9, West Lafayette, Indiana, USA MSEC9-8466 MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL

More information

FUNDAMENTALS OF FLUID MECHANICS Chapter 7 Dimensional Analysis Modeling, and Similitude

FUNDAMENTALS OF FLUID MECHANICS Chapter 7 Dimensional Analysis Modeling, and Similitude FUNAMENTALS OF FLUI MECHANICS Chapter 7 iensiona Anaysis Modeing, and Siiitude Jyh-Cherng Shieh epartent of Bio-Industria Mechatronics Engineering Nationa Taiwan University 1/4/007 1 MAIN TOPICS iensiona

More information

A unified framework for Regularization Networks and Support Vector Machines. Theodoros Evgeniou, Massimiliano Pontil, Tomaso Poggio

A unified framework for Regularization Networks and Support Vector Machines. Theodoros Evgeniou, Massimiliano Pontil, Tomaso Poggio MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES A.I. Memo No. 1654 March23, 1999

More information

Robust and Fast Ellipsoid Fitting from Noisy Image: Computer solution for fitting Tumours on MRI Scanner

Robust and Fast Ellipsoid Fitting from Noisy Image: Computer solution for fitting Tumours on MRI Scanner Internationa Journa of Video&Iage Processing and Network Security IJVIPNS-IJENS Vo:3 No: 7 Robust and Fast Eipsoid Fitting fro Noisy Iage: Coputer soution for fitting Tuours on MRI Scanner A Cherkaoui,

More information

Theory of Generalized k-difference Operator and Its Application in Number Theory

Theory of Generalized k-difference Operator and Its Application in Number Theory Internationa Journa of Mathematica Anaysis Vo. 9, 2015, no. 19, 955-964 HIKARI Ltd, www.m-hiari.com http://dx.doi.org/10.12988/ijma.2015.5389 Theory of Generaized -Difference Operator and Its Appication

More information

Location-Pricing Problem in the Closed-loop Supply Chain Network Design under Uncertainty

Location-Pricing Problem in the Closed-loop Supply Chain Network Design under Uncertainty Location-Pricing Probe in the Cosed-oop Suppy Chain Netork Design under Uncertainty F. Joai, A. Zabihian, R. Tavakkoi-Moghadda, P. Meari Abstract This paper presents a bi-obective credibiity-based fuzzy

More information

Interactive Markov Models of Evolutionary Algorithms

Interactive Markov Models of Evolutionary Algorithms Cleveland State University EngagedScholarship@CSU Electrical Engineering & Coputer Science Faculty Publications Electrical Engineering & Coputer Science Departent 2015 Interactive Markov Models of Evolutionary

More information

April 1980 TR/96. Extrapolation techniques for first order hyperbolic partial differential equations. E.H. Twizell

April 1980 TR/96. Extrapolation techniques for first order hyperbolic partial differential equations. E.H. Twizell TR/96 Apri 980 Extrapoatio techiques for first order hyperboic partia differetia equatios. E.H. Twize W96086 (0) 0. Abstract A uifor grid of step size h is superiposed o the space variabe x i the first

More information

Sharp Time Data Tradeoffs for Linear Inverse Problems

Sharp Time Data Tradeoffs for Linear Inverse Problems Sharp Tie Data Tradeoffs for Linear Inverse Probles Saet Oyak Benjain Recht Mahdi Soltanolkotabi January 016 Abstract In this paper we characterize sharp tie-data tradeoffs for optiization probles used

More information

Effects of Dissipation Energy on Vibrational and Sound Energy Flow

Effects of Dissipation Energy on Vibrational and Sound Energy Flow TECNSCE MECANK, Band [6, cf12(996) 107416 Manuskripreingang: ()2 Januar 1996 Effects of Dissipation Energy on Vibrationa and Sound Energy Fow N Nakagawa, Y Sekiguchi, A igashi, O Mahrenhotz Sound radiating

More information

Phase Diagrams. Chapter 8. Conditions for the Coexistence of Multiple Phases. d S dt V

Phase Diagrams. Chapter 8. Conditions for the Coexistence of Multiple Phases. d S dt V hase Diaras Chapter 8 hase - a for of atter that is unifor with respect to cheica coposition and the physica state of areation (soid, iquid, or aseous phases) icroscopicay and acroscopicay. Conditions

More information