Optimal Control: Applications in Risk Management and Online Markets

Size: px
Start display at page:

Download "Optimal Control: Applications in Risk Management and Online Markets"

Transcription

1 Optimal Control: Applications in Risk Management and Online Markets Albert Cohen Actuarial Sciences Program Department of Mathematics Department of Statistics and Probability Michigan State University East Lansing, MI Joint work with Milan Bradonjić (Bell Labs) and Matt Causley (Kettering U.) Mathematics Department Colloquium Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

2 Outline 1 Stochastic Optimal Control for Online Seller under Reputational Mechanisms Modeling Agent Behavior Stochastic Model and Solution Hamilton-Jacobi-Bellman formulation Market share based pricing mechanism Reduced model Numerical results for the reduced model Numerical results for the full model 2 Conclusions Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

3 Problem Working as an individual agent, in selling an asset or collection of assets within a network like Amazon or ebay, we can: Work a little harder. Manage our reputation or brand within a strategy that takes advantage of our extra capacity to work. Model how our online reputation affects our bottom line, and act in a way that it is optimal (maximize expected profit, for example.) Question: Can we leverage our skills in continuous-time modeling of stochastic optimal control to design such an optimal strategy? As a first approach, we don t consider shipping considerations or other factors available to a buyer, only ranking 1 1 Bradonjic, M., Causley, M., and Cohen, A. Stochastic Optimal Control for Online Seller under Reputational Mechanisms. Risks Dec 4;3(4): Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

4 Online Marketplace In this work, we consider sellers in an online market like Amazon without a bidding mechanism. The setting may include concurrent online auctions on a site like ebay, for a product or service with only a buy-it-now option, under the assumption that buyers can compare reputation among online platforms. In this online enviroment, the buyer and seller have a certain amount of anonymity, which has implications on the fairness of the transaction. One of the only counter measures to ensure fairness is the use of feedback forums, such as Amazon, ebay, and Yahoo!, at online sites. The feedback provided by customers provides a ranking system for the sellers, which in turn rewards fair sellers and penalizes unfair or unreliable sellers. Since it is in the sellers best interest to maximize their profits, it follows that a seller will seek to use reputation as a means to optimize their decision making. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

5 Online Marketplace We propose a simple behavioral model to study the relationship between a seller s wealth and reputation in the online environment with only the buy-it-now option. The wealth and growth of the seller are described by continuous stochastic processes. Note that our model is a continuous approximation of a discrete sales/marketing/processing process. The discrete model of the process may be less tractable but still very interesting and useful. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

6 Online Marketplace Online sellers can be broadly categorized by two types of behavior: those focused on advertising and customer service, which includes following up with past customers; and those who instead focus only on processing existing orders. While the latter behavior will likely lower the seller reputation on average, this immediate increase in wealth may offset the long-term damage from a lower reputation. With this in mind, it is compelling to ask whether there exists an optimal long term strategy, in which a seller attempts to maximize her profit by achieving a fixed reputation. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

7 Online Marketplace This work incorporates the idea that humans do not multitask well, but rather switch between tasks 2, 3. In other words, switching costs do not factor in for an individual agent, but only for large online retailers who have marketing department reorganization costs. Additionally, the model in this work allows all sellers to have a little extra capacity/money for either shipping or advertising (e.g. writing a letter or similar individual campaign), which is rather small and not unlimited Levitin, Daniel J. The organized mind: Thinking straight in the age of information overload. Penguin, Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

8 Agent Description Consider a seller in an online marketplace as a triple (W, R, µ) representing her wealth, reputation, and excess rate, respectively. Here, the excess rate is extra effort that can be allocated to either expedite payment or increase her reputation. In this setting, the reputation R is a positive number reflecting customer satisfaction. As an initial approach, we consider only the effects of reputation on the optimal way to grow her wealth, and leave the more general model with shipping costs for future work. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

9 Agent Description Reputation R is observed in O := (0, ) and the control µ is constrained to U := [ ɛ, ɛ]. Since this process will be stochastic, we introduce a probability space (Ω, F, P) and corresponding reference probability systems ν = ((Ω, {F s }, P), B) where F s F and B is an F s -adapted Brownian motion. The evolution of wealth and reputation is hence modeled by a two-state Markov process, dw (µ) t = (1 µ t )h(r (µ) t ) dt, (1) dr (µ) t = (µ t κr (µ) t ) dt + σ db t, (2) where κ is a proportionality constant which accounts for mean reversion. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

10 Agent Description Also, τ O is the first exit time from O, or if R O for all s 0. We set as an absorbing state for reputation, and enforce that the µ are chosen from the admissible class A ν, A ν := B 1 B 2 B 3 B 1 := {µ F s µ is F s p.m.} B 2 := {µ F s µ. U on [0, )} { [ τ O ] B 3 := µ F s E e ρs (1 µs )h(r (µ) ds Recall that p.m. denotes progressively measurable. 0 s ) } <. (3) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

11 Mink-Seifert Model The major difficulty is in defining the growth rate h(r). However, an explicit mechanism has been proposed by Mink and Seifert 4. There the authors not only propose, but empirically justify a growth rate of h(r) = A + C ( 1 1 ln (e + R) ), (4) where A relates to the inherent value of the object for sale and C is a parameter to be fitted. This is accomplished by obtaining data using an auction robot and then computing a single regression, which gives C = Mink, Moritz, and Stefan Seifert. Reputation on ebay and its impact on sales prices. Group decision and negotiation (GDN) 2006, International Conference, Karlsruhe, Germany Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

12 Mink-Seifert Model It should be noted that in many economic papers on the effect of reputation on bidding and final sales price, the Mink-Seifert model is the first one we found to give an explicit relationship between reputation and price. To the best of our knowledge, their model is among the first to give an explicit relationship between reputation and price. The Mink-Seifert model also suggests a multiple regression formula where other factors, such as shipping costs and whether a buy-it-now price is offered, are considered as well, in which case C = Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

13 Hamilton-Jacobi-Bellman formulation With the stochastic dynamics for reputation above, a growth rate model for reputational effect on sales, and an infinite horizon, we expect that switching would depend only on the current reputational state. We now seek a function V C 2 [0, ) C p [0, ) which is a candidate solution of the optimal control problem [ τ0 ] V (R) = sup sup E e ρs (1 µ s )h(r s (µ) ) ds R 0 = R ν µ A ν 0 ( ) 1 h(r) = A + C 1 ln (e + R) τ 0 := inf {t 0 R t = 0}. (5) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

14 Hamilton-Jacobi-Bellman formulation One approach to finding V would be to solve (5) directly. For example, by the definition of V, it follows that V (0) = 0 [ V ( ) = sup sup E ν µ A ν 0 [ ] = E e ρs (1 + ɛ)h( ) ds 0 ] e ρs (1 µ s )h(r s (µ) ) ds R 0 = = 1 + ɛ ρ h( ). (6) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

15 Hamilton-Jacobi-Bellman formulation However, we shall instead apply the principle of optimality, and in doing so arrive at the following nonlinear Hamilton-Jacobi-Bellman (HJB) boundary value problem (suppressing the explicit dependence of R on µ) [ 0 = max (µ κr) V ɛ µ ɛ R + σ2 2 ] V + (1 µ)h(r) ρv 2 R2 V (0) = 0 V ( ) = 1 + ɛ ρ h( ) h(r) = A + C ( 1 ) 1 ln (e + R) = A + C ln ( 1 + R e 1 + ln ( 1 + R ), e ) (7) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

16 Hamilton-Jacobi-Bellman formulation The previous problem simplifies to ρv = h(r) κr V R + σ2 2 V 2 R 2 + ɛ V R h(r) V (0) = 0 V ( ) = 1 + ɛ (A + C) ρ h(r) = A + C ln ( 1 + R ) e 1 + ln ( 1 + R ). e (8) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

17 Hamilton-Jacobi-Bellman formulation We therefore instead solve the HJB problem, which is justified by the following standard theorem in Fleming-Soner (2006) 5 : Theorem Let V C 2 (O) C b (Ō) be a solution to the boundary value problem (8). Then V is an optimal solution to the optimal control problem (5). Remark We note here that it is sufficient to require polynomial growth on V, not bounded growth. However, as h is bounded above by A + C on O, we can restrict our attention to bounded V. Furthermore, an upper bound (and in fact limit as R ) for V is the value obtained by a seller earning maximum value A + C on her items on the time interval [0, ) while always processing orders. 5 Fleming, Wendell H., and Halil Mete Soner. Controlled Markov processes and viscosity solutions. Vol. 25. Springer Science and Business Media, Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

18 Hamilton-Jacobi-Bellman formulation Proof. By Corollary IV.5.1 in Fleming-Soner (2006), since (i) the drift and volatility are (uniformly) Lipschitz in R and µ, (ii) ρ > 0, and (iii) we impose an additional boundary condition on the solution as R as our infinite horizon is now [0, τ 0 ) and not [0, τ O ), it follows directly that our bounded classical solution V = V and the control [ µ (R) argmax µ [ ɛ,ɛ] (µ κr) V R + σ2 2 leads to an optimal, stationary Markov control µ (R (µ ) s ). 2 ] V + (1 µ)h(r) ρv R2 (9) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

19 Conversion from Reputation to Market Share We observe that under its current definition, the reputation R O. In this section, we employ the mapping Y := f (R) = which results in a normalized market share reputation Y Q := (0, 1). R 1 + R, (10) This is consistent with Mink and Seifert s assumption that the value function h(r) be concave, and bounded. We therefore also simplify (4) accordingly, replacing logarithms with a rational function h(r) = A + C R 1 + R. (11) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

20 Conversion from Reputation to Market Share Consequently, we have dy t = f (R)dR t f (R)dR t dr t and so [ ( dy t = (1 Y ) 2 µ κ Y 1 Y ) dt + σ db t ] ( 2(1 Y )3 )σ 2 dt = [ µ(1 Y ) 2 κy (1 Y ) σ 2 (1 Y ) 3] dt + σ(1 Y ) 2 db t h(y ) = A + CY [ τ0 ] Ṽ (y) = sup sup E e ρs (1 µ s ) h(y s ) ds Y 0 = y. ν µ A ν 0 (12) Note that a function that is bounded for market share Y (0, 1) is also correspondingly bounded for all reputation R (0, ). We therefore study the market share model below, confident that our results will hold for the full reputation model by virtue of the inverse map of (10). Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

21 Conversion from Reputation to Market Share After market share rescaling, the HJB now takes a form which is in fact degenerate at both endpoints y = 0, 1: σ2 2 (1 y)4 V (y) + ρv = h(y) [ κy(1 y) + σ 2 (1 y) 3] V (y) + ɛ (1 y) 2 V (y) h(y) V (0) = 0 V (1) = 1 + ɛ ρ h(1). (13) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

22 Conversion from Reputation to Market Share In theory, a closed form solution for this boundary value problem can be formally constructed piecewise, to the left and right of the special point y, for which h(y ) = (1 y ) 2 V (y ). We shall refer to y as the switching point below, since it is the reputational value at which the seller switches from advertising to processing. In practice, we shall resort to numerical computation to approximate this solution, and in doing so estimate the value y. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

23 Conversion from Reputation to Market Share Theorem If V C 2 (Q) is a monotonically increasing solution to the HJB problem (13), then V = Ṽ. Proof. By Theorem 5.1 in Fleming-Soner (2006), the construction of a solution V on an open interval Q to (13) that resides in C 2 (Q) C p ( Q) is in fact equal to the optimal value function Ṽ. Enforcing the boundary condition on V (1) enforces not only polynomial growth on our monotonically increasing V, but in fact bounded growth. Note that Q = (0, 1) is open, but y = 1 is not reached in finite time from anywhere in Q and the diffusion Y is in fact absorbed for all time in state Y = 1. Hence, the exact value of V (1) imposed. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

24 Reduced Model With the previous mapping, note that the market (reputation) share Y is absorbed at Y = 1, which corresponds to R. Moreover, as in the Stochastic Nerlove evolution, we have that the probability that Y t < 0 for some positive t is greater than 0, and the drift term p 2 ((1 Y ); µ) is a second order polynomial in Y. Based on these observations, we propose a reduced model for Stochastic Reputation Share, expressed by the following stochastic differential equation and associated stochastic optimal control problem ˇV (y) = sup ν sup E µ A ν [ τ0 dy t 1 Y t = µdt + σ db t. 0 ] e ρs (1 µ s )(A + CY s ) ds Y 0 = y (14) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

25 Reduced Model This leads to a corresponding nonlinear HJB boundary value problem: σ2 2 (1 y)2 V (y) + ρv = h(y) + ɛ (1 y)v (y) h(y) V (0) = 0 V (1) = 1 + ɛ ρ h(1). One of the most attractive features of the reduced model (15) is that it has a closed form, piecewise defined analytic solution. (15) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

26 Reduced Model Theorem There exists a solution V C 2 (Q) C b ( Q) to the reduced model (15). Furthermore, V = ˇV, and is given by the piecewise solution { V V l (y), 0 y y, (y) = V r (y), y (16) y 1, where with constants defined below. V l = c 1 (1 y) γl + c2 (1 y) γl + + α l + β l y, (17) V r = c 3 (1 y) γr + + α r + β r y, (18) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

27 Reduced Model Proof. The construction of this piecewise solution is found in the Appendix. The solution constructed is monotonically increasing in y. Enforcing the boundary condition on V (1) uniformly bounds V on Q. The proof of equality V = ˇV then follows from Theorem 5.1 in Fleming-Soner (2006), as in our previous theorem. Remark: Since the reduced model has a known closed form solution, we can measure the error made in constructing numerical solutions, which in turn acts as a benchmark for the code we use to study the full model. It is uncommon to find a closed form solution for most optimal control problems. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

28 Piecewise analytic solution of reduced model We now construct the exact solution to the reduced HJB boundary value problem. The principal difficulty that must be overcome is the appearance of an absolute value, which contains the unknown variable V. We therefore consider a piecewise defined solution V (y), which remains C 2 across the switching point y, defined by the vanishing of the expression in the absolute value (1 y )V (y ) = h(y ). (19) We now separately consider the regions y < y and y > y in the reduced model (8), and define the corresponding linear differential operators L ± as L ± [V ] = ( σ2 d 2 (1 y)2 2 dy 2 ± ɛ(1 y) d ) dy + ρ V. (20) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

29 Piecewise analytic solution of reduced model The boundary value problem can then be decomposed into two smaller problems for V l and V r, where boundary conditions provide one condition for each of V l and V r. The remaining two conditions are provided by enforcing C 2 smoothness of the solution at the switching point (19). Hence we now formulate two (well-posed) boundary value problems. Once solved, enforcing continuity uniquely defines the value of the switching point y : V l (y ) = V r (y ). (21) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

30 Piecewise analytic solution of reduced model L [V l ] = (1 ɛ) h(y) 0 < y < y, (22) V l (0) = 0, (1 y )V l (y ) = h(y ), (23) L + [V r ] = (1 + ɛ) h(y) y < y < 1, (24) V r (1) = 1 + ɛ ρ h(1), (1 y )V r (y ) = h(y ). (25) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

31 Constructing the reduced model solution We shall construct the solutions V l and V r separately first, although the approach for both solutions will be the same. Following standard methods, we first decompose the full solution into a homogeneous and particular solution, where the homogeneous part satisfies L ± [u] = 0. Since the operators L ± are equi-dimensional, we seek solutions of the form u = c(1 y) γ, (26) and the application of the differential operator yields ( ) L ± [u] = c(1 y) γ σ2 2 (γ2 γ) ɛγ + ρ = 0. (27) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

32 Constructing the reduced model solution We set the parenthetical term of this expression to zero in order to solve for the admissible exponents γ (1 γ± l = ɛ σ 2 ± 2 + ɛ ) 2 σ ρ σ 2 (from L ) (28) (1 γ± r = 1 2 ɛ σ 2 ± 2 ɛ ) 2 σ ρ σ 2 (from L + ). (29) Because γ r < 0 for ρ > 0, the homogeneous solution (1 y) γr will be unbounded as y 1, and so we exclude it from further consideration below. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

33 Constructing the reduced model solution Furthermore, since h(y) = A + Cy is a linear function, the particular solution will also be linear. We therefore have a general solution of the form V l = c 1 (1 y) γl + c2 (1 y) γl + + α l + β l y, (30) V r = c 3 (1 y) γr + + α r + β r y. (31) The particular solution is fixed by enforcing that the differential equations (22) and (24) are satisfied, which yields ( ) ( ) ( ) 1 ɛ 1 ɛ 1 ɛ α l = (A + C) C, β l = C, (32) ρ ρ + ɛ ρ + ɛ ( ) ( ) ( ) 1 + ɛ 1 + ɛ 1 + ɛ α r = (A + C) C, β r = C. (33) ρ ρ ɛ ρ ɛ Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

34 Constructing the reduced model solution The homogeneous coefficients c 1 and c 2 are then found by applying the boundary conditions (23), and the resulting linear system ( ) ( ) ( ) 1 1 c1 α γ (1 l y ) γl γ l + (1 y = l ) γl + β l (1 y ) h(y ) is solved by c 1 = α lγ l +(1 y ) γl + + βl (1 y ) h(y ) γ l + (1 y ) γl + γ l (1 y ) γl c 2, (34) c 2 = β l(1 y ) h(y ) + α l γ l (1 y ) γl γ l + (1 y ) γl + γ l (1 y ) γl. (35) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

35 Constructing the reduced model solution The boundary condition (25) at y = 1 is automatically satisfied by the particular solution, and so the final coefficient c 3 is determined by the condition at y = y, c 3 = β r (1 y ) h(y ) γ r + (1 y ) γr +. (36) Finally, having determined the general solutions V l and V r, the solution V is obtained by enforcing continuity. This also fixes the value y, which must now satisfy the nonlinear transcendental equation c 1 (1 y ) γl + c2 (1 y ) γl + + α l + β l y = c 3 (1 y ) γr + + α r + β r y. (37) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

36 Constructing the numerical solution The boundary value problem (8) is discretized using a standard finite difference scheme. Let y k = k/n y, and V k = V (y k ), for k = 0, 1,..., N y. Then we solve a linear system of N y + 1 equations of the form Mv = f, where Mv ρv σ2 2 (1 y)2 V, and f h + ɛ (1 y)v h. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

37 Constructing the numerical solution Since f depends on V, the solution is implicit, and therefore must be obtained by using a fixed point iteration Mv (k+1) = f (k). In our numerical experiments, we let N y = 1000, and set the maximum iteration count at K = 20. Convergence is observed in all tested cases. In Figure 1, several numerical solutions V (y) are shown, for fixed A = 1, C = 0.15, ɛ = 0.02, and various values of ρ = 0.1, 0.2, 0.5, 2.0 and σ = 0.2, 0.5, 1.0, 5.0. It follows that both ρ and σ have a dramatic effect not only the shape of the solution, but also, as is demonstrated in Figure 2, on the switching point y. These numerical results suggest that both the discount rate ρ and the volatility σ of a seller s reputation can dramatically affect her behavior, particularly the critical reputation at which she switches from the processing to advertising mode. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

38 Figure 1: Value Function (Reduced Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: A plot of several numerical solutions V (y) for the reduced boundary value problem (15) are shown with A = 1, C = 0.15, and ɛ = Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

39 Figure 2: Switching Point (Reduced Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: A plot of the quantity (1 y)v (y), where V (y) is a numerical solution of the the reduced boundary value problem (15). The value y is defined as the intersection of these curves with h(y) (the solid line). Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

40 Figure 3: Value Function (Full Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: A plot of several numerical solutions V (y) for the full boundary value problem (13) are shown with A = 1, C = 0.15, ɛ = 0.02, and κ = 0. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

41 Figure 4: Switching Point (Full Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: A plot of the quantity (1 y) 2 V, where V is a numerical solution of the the full boundary value problem (13). The value y is defined as the intersection of these curves with h(y) (black solid line). Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

42 Constructing the numerical solution The same numerical discretization is employed to solve the full Nerlove-Arrow model (13), where we solve a linear system of the form Mv = f, with Mv ρv + [ κy(1 y) + σ 2 (1 y) 3] V (y) σ2 2 (1 y)4 V (y) f h + ɛ (1 y) 2 V h. (38) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

43 Figure 5: Value Function (Full Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: The same as Figure 3, but with κ = 1. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

44 Figure 6: Switching Point (Full Model) (a) ρ = 0.1 (b) ρ = 0.2 (c) ρ = 0.5 (d) ρ = 2.0 Figure: The same as Figure 4, but with κ = 1. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

45 Conclusions In this work, we have proposed a model which describes the behavior of sellers participating in online markets. In doing so, we were able to design an optimal selling strategy, wherein the seller switches their behavior when an optimal market share reputation is reached, which depends on statistical modeling parameters. These modeling parameters were introduced through the wealth-reputation mechanisms by Nerlove and Arrow 6, which have been generalized to stochastic settings by Raman 7, among others. 6 Nerlove, Marc, and Kenneth J. Arrow. Optimal advertising policy under dynamic conditions. Economica (1962): Raman, Kalyan. Boundary value problems in stochastic optimal control of advertising. Automatica 42.8 (2006): Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

46 Conclusions We have additionally modified the empirical model h(r) found in Mink and Seifert, relating the price per unit to the reputation. Rather than viewing reputation R as an unbounded quantity, we instead introduced a market scaled reputation Y = R/(1 + R), where Y [0, 1), which reduces the price per unit to a simple linear model h(y ) = A + CY. We then optimized the stochastic model over the control of the excess rate µ using the Hamilton-Jacobi-Bellman equation, yielding a deterministic equation relating the value per unit to the seller reputation. The resulting boundary value problem is then solved numerically, and we qualitatively see that the value of sellers goods increase monotonically with reputation, but that a unique optimal reputation y = y determines when the seller should switch from advertising to processing. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

47 Conclusions The numerical scheme was validated with a reduced model, which has a closed form piecewise analytic solution, and permits direct determination of y. The numerical results qualitatively validate our modeling assumptions, and provide a framework for studying seller behavior based on seller reputation. Although the used techniques are standard, we believe that the optimal strategy presented both analytically and numerically has implications on the way reputational information can be used to predict the behavior of an individual seller in an online market. This work can naturally be generalized in a variety of ways. For instance, the model could be developed in the case of finite horizon time T, since a more realistic assumption is that a seller strategy depends on both time and the current reputation state. Unlike our current infinite horizon model, the resulting HJB equation will lead to a time-dependent partial differential equation. Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

48 Acknowledgements Milan Bradonjic (Bell Labs) Matt Causley (Kettering U.) John Chadam (Pitt) Nick Costanzino (AIG) Sean Hilden (Bank of America) Goran Peskir (U. of Manchester) Steven Shreve (CMU) Albert Cohen (MSU) Albion Math Department Colloquium Albion College April / 48

Stochastic Optimal Control for Online Seller under Reputational Mechanisms

Stochastic Optimal Control for Online Seller under Reputational Mechanisms Stochastic Optimal Control for Online Seller under Reputational Mechanisms Milan Bradonjić, Matthew Causley, Albert Cohen arxiv:1103.1918v3 [math.oc 31 Dec 2015 January 1, 2016 Abstract In this work we

More information

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Yipeng Yang * Under the supervision of Dr. Michael Taksar Department of Mathematics University of Missouri-Columbia Oct

More information

Oblivious Equilibrium: A Mean Field Approximation for Large-Scale Dynamic Games

Oblivious Equilibrium: A Mean Field Approximation for Large-Scale Dynamic Games Oblivious Equilibrium: A Mean Field Approximation for Large-Scale Dynamic Games Gabriel Y. Weintraub, Lanier Benkard, and Benjamin Van Roy Stanford University {gweintra,lanierb,bvr}@stanford.edu Abstract

More information

Dynamic Pricing for Non-Perishable Products with Demand Learning

Dynamic Pricing for Non-Perishable Products with Demand Learning Dynamic Pricing for Non-Perishable Products with Demand Learning Victor F. Araman Stern School of Business New York University René A. Caldentey DIMACS Workshop on Yield Management and Dynamic Pricing

More information

Viscosity Solutions for Dummies (including Economists)

Viscosity Solutions for Dummies (including Economists) Viscosity Solutions for Dummies (including Economists) Online Appendix to Income and Wealth Distribution in Macroeconomics: A Continuous-Time Approach written by Benjamin Moll August 13, 2017 1 Viscosity

More information

An Introduction to Noncooperative Games

An Introduction to Noncooperative Games An Introduction to Noncooperative Games Alberto Bressan Department of Mathematics, Penn State University Alberto Bressan (Penn State) Noncooperative Games 1 / 29 introduction to non-cooperative games:

More information

Controlled Markov Processes and Viscosity Solutions

Controlled Markov Processes and Viscosity Solutions Controlled Markov Processes and Viscosity Solutions Wendell H. Fleming, H. Mete Soner Controlled Markov Processes and Viscosity Solutions Second Edition Wendell H. Fleming H.M. Soner Div. Applied Mathematics

More information

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011 Department of Probability and Mathematical Statistics Faculty of Mathematics and Physics, Charles University in Prague petrasek@karlin.mff.cuni.cz Seminar in Stochastic Modelling in Economics and Finance

More information

Liquidation in Limit Order Books. LOBs with Controlled Intensity

Liquidation in Limit Order Books. LOBs with Controlled Intensity Limit Order Book Model Power-Law Intensity Law Exponential Decay Order Books Extensions Liquidation in Limit Order Books with Controlled Intensity Erhan and Mike Ludkovski University of Michigan and UCSB

More information

Optimal exit strategies for investment projects. 7th AMaMeF and Swissquote Conference

Optimal exit strategies for investment projects. 7th AMaMeF and Swissquote Conference Optimal exit strategies for investment projects Simone Scotti Université Paris Diderot Laboratoire de Probabilité et Modèles Aléatories Joint work with : Etienne Chevalier, Université d Evry Vathana Ly

More information

MS&E338 Reinforcement Learning Lecture 1 - April 2, Introduction

MS&E338 Reinforcement Learning Lecture 1 - April 2, Introduction MS&E338 Reinforcement Learning Lecture 1 - April 2, 2018 Introduction Lecturer: Ben Van Roy Scribe: Gabriel Maher 1 Reinforcement Learning Introduction In reinforcement learning (RL) we consider an agent

More information

Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control

Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control Erhan Bayraktar University of Michigan joint work with Virginia R. Young, University of Michigan K αρλoβασi,

More information

A Barrier Version of the Russian Option

A Barrier Version of the Russian Option A Barrier Version of the Russian Option L. A. Shepp, A. N. Shiryaev, A. Sulem Rutgers University; shepp@stat.rutgers.edu Steklov Mathematical Institute; shiryaev@mi.ras.ru INRIA- Rocquencourt; agnes.sulem@inria.fr

More information

Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise

Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise Robert J. Elliott 1 Samuel N. Cohen 2 1 Department of Commerce, University of South Australia 2 Mathematical Insitute, University

More information

Optimal Control. Macroeconomics II SMU. Ömer Özak (SMU) Economic Growth Macroeconomics II 1 / 112

Optimal Control. Macroeconomics II SMU. Ömer Özak (SMU) Economic Growth Macroeconomics II 1 / 112 Optimal Control Ömer Özak SMU Macroeconomics II Ömer Özak (SMU) Economic Growth Macroeconomics II 1 / 112 Review of the Theory of Optimal Control Section 1 Review of the Theory of Optimal Control Ömer

More information

Stochastic optimal control with rough paths

Stochastic optimal control with rough paths Stochastic optimal control with rough paths Paul Gassiat TU Berlin Stochastic processes and their statistics in Finance, Okinawa, October 28, 2013 Joint work with Joscha Diehl and Peter Friz Introduction

More information

Utility Maximization in Hidden Regime-Switching Markets with Default Risk

Utility Maximization in Hidden Regime-Switching Markets with Default Risk Utility Maximization in Hidden Regime-Switching Markets with Default Risk José E. Figueroa-López Department of Mathematics and Statistics Washington University in St. Louis figueroa-lopez@wustl.edu pages.wustl.edu/figueroa

More information

Maximum Process Problems in Optimal Control Theory

Maximum Process Problems in Optimal Control Theory J. Appl. Math. Stochastic Anal. Vol. 25, No., 25, (77-88) Research Report No. 423, 2, Dept. Theoret. Statist. Aarhus (2 pp) Maximum Process Problems in Optimal Control Theory GORAN PESKIR 3 Given a standard

More information

Point Process Control

Point Process Control Point Process Control The following note is based on Chapters I, II and VII in Brémaud s book Point Processes and Queues (1981). 1 Basic Definitions Consider some probability space (Ω, F, P). A real-valued

More information

, and rewards and transition matrices as shown below:

, and rewards and transition matrices as shown below: CSE 50a. Assignment 7 Out: Tue Nov Due: Thu Dec Reading: Sutton & Barto, Chapters -. 7. Policy improvement Consider the Markov decision process (MDP) with two states s {0, }, two actions a {0, }, discount

More information

Deceptive Advertising with Rational Buyers

Deceptive Advertising with Rational Buyers Deceptive Advertising with Rational Buyers September 6, 016 ONLINE APPENDIX In this Appendix we present in full additional results and extensions which are only mentioned in the paper. In the exposition

More information

Introduction to Algorithmic Trading Strategies Lecture 3

Introduction to Algorithmic Trading Strategies Lecture 3 Introduction to Algorithmic Trading Strategies Lecture 3 Trend Following Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com References Introduction to Stochastic Calculus with Applications.

More information

The New Palgrave: Separability

The New Palgrave: Separability The New Palgrave: Separability Charles Blackorby Daniel Primont R. Robert Russell 1. Introduction July 29, 2006 Separability, as discussed here, refers to certain restrictions on functional representations

More information

Introduction Optimality and Asset Pricing

Introduction Optimality and Asset Pricing Introduction Optimality and Asset Pricing Andrea Buraschi Imperial College Business School October 2010 The Euler Equation Take an economy where price is given with respect to the numéraire, which is our

More information

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE Cost and Preference in Recommender Systems Junhua Chen, Big Data Research Center, UESTC Email:junmshao@uestc.edu.cn http://staff.uestc.edu.cn/shaojunming Abstract In many recommender systems (RS), user

More information

A new approach for investment performance measurement. 3rd WCMF, Santa Barbara November 2009

A new approach for investment performance measurement. 3rd WCMF, Santa Barbara November 2009 A new approach for investment performance measurement 3rd WCMF, Santa Barbara November 2009 Thaleia Zariphopoulou University of Oxford, Oxford-Man Institute and The University of Texas at Austin 1 Performance

More information

The Mathematics of Continuous Time Contract Theory

The Mathematics of Continuous Time Contract Theory The Mathematics of Continuous Time Contract Theory Ecole Polytechnique, France University of Michigan, April 3, 2018 Outline Introduction to moral hazard 1 Introduction to moral hazard 2 3 General formulation

More information

Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations

Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations Martino Bardi Italo Capuzzo-Dolcetta Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations Birkhauser Boston Basel Berlin Contents Preface Basic notations xi xv Chapter I. Outline

More information

Distributed Optimization. Song Chong EE, KAIST

Distributed Optimization. Song Chong EE, KAIST Distributed Optimization Song Chong EE, KAIST songchong@kaist.edu Dynamic Programming for Path Planning A path-planning problem consists of a weighted directed graph with a set of n nodes N, directed links

More information

The speculator: a case study of the dynamic programming equation of an infinite horizon stochastic problem

The speculator: a case study of the dynamic programming equation of an infinite horizon stochastic problem The speculator: a case study of the dynamic programming equation of an infinite horizon stochastic problem Pavol Brunovský Faculty of Mathematics, Physics and Informatics Department of Applied Mathematics

More information

An Adaptive Algorithm for Selecting Profitable Keywords for Search-Based Advertising Services

An Adaptive Algorithm for Selecting Profitable Keywords for Search-Based Advertising Services An Adaptive Algorithm for Selecting Profitable Keywords for Search-Based Advertising Services Paat Rusmevichientong David P. Williamson July 6, 2007 Abstract Increases in online searches have spurred the

More information

Shadow prices and well-posedness in the problem of optimal investment and consumption with transaction costs

Shadow prices and well-posedness in the problem of optimal investment and consumption with transaction costs Shadow prices and well-posedness in the problem of optimal investment and consumption with transaction costs Mihai Sîrbu, The University of Texas at Austin based on joint work with Jin Hyuk Choi and Gordan

More information

University Of Calgary Department of Mathematics and Statistics

University Of Calgary Department of Mathematics and Statistics University Of Calgary Department of Mathematics and Statistics Hawkes Seminar May 23, 2018 1 / 46 Some Problems in Insurance and Reinsurance Mohamed Badaoui Department of Electrical Engineering National

More information

MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti

MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti 1 MARKOV DECISION PROCESSES (MDP) AND REINFORCEMENT LEARNING (RL) Versione originale delle slide fornita dal Prof. Francesco Lo Presti Historical background 2 Original motivation: animal learning Early

More information

Worst Case Portfolio Optimization and HJB-Systems

Worst Case Portfolio Optimization and HJB-Systems Worst Case Portfolio Optimization and HJB-Systems Ralf Korn and Mogens Steffensen Abstract We formulate a portfolio optimization problem as a game where the investor chooses a portfolio and his opponent,

More information

Controlled Markov Processes and Viscosity Solutions

Controlled Markov Processes and Viscosity Solutions Controlled Markov Processes and Viscosity Solutions Wendell H. Fleming, H. Mete Soner Controlled Markov Processes and Viscosity Solutions Second Edition Wendell H. Fleming H.M. Soner Div. Applied Mathematics

More information

Perturbative Approaches for Robust Intertemporal Optimal Portfolio Selection

Perturbative Approaches for Robust Intertemporal Optimal Portfolio Selection Perturbative Approaches for Robust Intertemporal Optimal Portfolio Selection F. Trojani and P. Vanini ECAS Course, Lugano, October 7-13, 2001 1 Contents Introduction Merton s Model and Perturbative Solution

More information

Thomas Knispel Leibniz Universität Hannover

Thomas Knispel Leibniz Universität Hannover Optimal long term investment under model ambiguity Optimal long term investment under model ambiguity homas Knispel Leibniz Universität Hannover knispel@stochastik.uni-hannover.de AnStAp0 Vienna, July

More information

A comparison of numerical methods for the. Solution of continuous-time DSGE models. Juan Carlos Parra Alvarez

A comparison of numerical methods for the. Solution of continuous-time DSGE models. Juan Carlos Parra Alvarez A comparison of numerical methods for the solution of continuous-time DSGE models Juan Carlos Parra Alvarez Department of Economics and Business, and CREATES Aarhus University, Denmark November 14, 2012

More information

Optimal Control of an Inventory System with Joint Production and Pricing Decisions

Optimal Control of an Inventory System with Joint Production and Pricing Decisions Optimal Control of an Inventory System with Joint Production and Pricing Decisions Ping Cao, Jingui Xie Abstract In this study, we consider a stochastic inventory system in which the objective of the manufacturer

More information

1 Markov decision processes

1 Markov decision processes 2.997 Decision-Making in Large-Scale Systems February 4 MI, Spring 2004 Handout #1 Lecture Note 1 1 Markov decision processes In this class we will study discrete-time stochastic systems. We can describe

More information

University of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming

University of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming University of Warwick, EC9A0 Maths for Economists 1 of 63 University of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming Peter J. Hammond Autumn 2013, revised 2014 University of

More information

B Search and Rest Unemployment Fernando Alvarez and Robert Shimer Additional Appendixes not for Publication

B Search and Rest Unemployment Fernando Alvarez and Robert Shimer Additional Appendixes not for Publication B Search and Rest Unemployment Fernando Alvarez and Robert Shimer Additional Appendixes not for Publication B.1 Derivation Hamilton-Jacobi-Bellman This appendix proves that if v() is given by: v() = R(

More information

ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME

ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME ON THE POLICY IMPROVEMENT ALGORITHM IN CONTINUOUS TIME SAUL D. JACKA AND ALEKSANDAR MIJATOVIĆ Abstract. We develop a general approach to the Policy Improvement Algorithm (PIA) for stochastic control problems

More information

of space-time diffusions

of space-time diffusions Optimal investment for all time horizons and Martin boundary of space-time diffusions Sergey Nadtochiy and Michael Tehranchi October 5, 2012 Abstract This paper is concerned with the axiomatic foundation

More information

Advertising and Promotion in a Marketing Channel

Advertising and Promotion in a Marketing Channel Applied Mathematical Sciences, Vol. 13, 2019, no. 9, 405-413 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2019.9240 Advertising and Promotion in a Marketing Channel Alessandra Buratto Dipartimento

More information

Knapsack Auctions. Sept. 18, 2018

Knapsack Auctions. Sept. 18, 2018 Knapsack Auctions Sept. 18, 2018 aaacehicbvdlssnafj34rpuvdelmsigvskmkoasxbtcuk9ghncfmjpn26gqszibsevojbvwvny4ucevsnx/jpm1cww8mndnnxu69x08ylcqyvo2v1bx1jc3svnl7z3dv3zw47mg4fzi0ccxi0forjixy0lzumdjlbegrz0jxh93kfvebceljfq8mcxejnoa0pbgplxnmwxxs2tu49ho16lagvjl/8hpoua6dckkhrizrtt2zytwtgeaysqtsaqvanvnlbdfoi8ivzkjkvm0lys2qubqzmi07qsqjwim0ih1noyqidlpzqvn4qpuahrhqjys4u393zcischl5ujjfus56ufif109veovmlcepihzpb4upgyqgetowoijgxsaaicyo3hxiiriik51hwydgl568tdqnum3v7bulsvo6ikmejsejqaibxiimuaut0ayypijn8arejcfjxxg3pualk0brcwt+wpj8acrvmze=

More information

Markov Decision Processes

Markov Decision Processes Markov Decision Processes Noel Welsh 11 November 2010 Noel Welsh () Markov Decision Processes 11 November 2010 1 / 30 Annoucements Applicant visitor day seeks robot demonstrators for exciting half hour

More information

Optimal portfolio strategies under partial information with expert opinions

Optimal portfolio strategies under partial information with expert opinions 1 / 35 Optimal portfolio strategies under partial information with expert opinions Ralf Wunderlich Brandenburg University of Technology Cottbus, Germany Joint work with Rüdiger Frey Research Seminar WU

More information

Economics 2010c: Lectures 9-10 Bellman Equation in Continuous Time

Economics 2010c: Lectures 9-10 Bellman Equation in Continuous Time Economics 2010c: Lectures 9-10 Bellman Equation in Continuous Time David Laibson 9/30/2014 Outline Lectures 9-10: 9.1 Continuous-time Bellman Equation 9.2 Application: Merton s Problem 9.3 Application:

More information

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming 1. Endogenous Growth with Human Capital Consider the following endogenous growth model with both physical capital (k (t)) and human capital (h (t)) in continuous time. The representative household solves

More information

Geometry and Optimization of Relative Arbitrage

Geometry and Optimization of Relative Arbitrage Geometry and Optimization of Relative Arbitrage Ting-Kam Leonard Wong joint work with Soumik Pal Department of Mathematics, University of Washington Financial/Actuarial Mathematic Seminar, University of

More information

Lecture 3: Markov Decision Processes

Lecture 3: Markov Decision Processes Lecture 3: Markov Decision Processes Joseph Modayil 1 Markov Processes 2 Markov Reward Processes 3 Markov Decision Processes 4 Extensions to MDPs Markov Processes Introduction Introduction to MDPs Markov

More information

Optimal Consumption, Investment and Insurance Problem in Infinite Time Horizon

Optimal Consumption, Investment and Insurance Problem in Infinite Time Horizon Optimal Consumption, Investment and Insurance Problem in Infinite Time Horizon Bin Zou and Abel Cadenillas Department of Mathematical and Statistical Sciences University of Alberta August 213 Abstract

More information

CS599: Algorithm Design in Strategic Settings Fall 2012 Lecture 7: Prior-Free Multi-Parameter Mechanism Design. Instructor: Shaddin Dughmi

CS599: Algorithm Design in Strategic Settings Fall 2012 Lecture 7: Prior-Free Multi-Parameter Mechanism Design. Instructor: Shaddin Dughmi CS599: Algorithm Design in Strategic Settings Fall 2012 Lecture 7: Prior-Free Multi-Parameter Mechanism Design Instructor: Shaddin Dughmi Outline 1 Multi-Parameter Problems and Examples 2 The VCG Mechanism

More information

LEGAL DISCLAIMER. APG Coin (APG) White Paper (hereinafter 'the White Paper', 'the Document') is presented for informational purposes only

LEGAL DISCLAIMER. APG Coin (APG) White Paper (hereinafter 'the White Paper', 'the Document') is presented for informational purposes only LEGAL DISCLAIMER THIS DOCUMENT DOES NOT GIVE PERSONAL LEGAL OR FINANCIAL ADVICE. YOU ARE STRONGLY ENCOURAGED TO SEEK YOUR OWN PROFESSIONAL LEGAL AND FINANCIAL ADVICE. APG Coin (APG) White Paper (hereinafter

More information

Department of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno ANSWERS TO PRACTICE PROBLEMS 18

Department of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno ANSWERS TO PRACTICE PROBLEMS 18 Department of Economics, University of California, Davis Ecn 00C Micro Theory Professor Giacomo Bonanno ANSWERS TO PRACTICE PROBEMS 8. If price is Number of cars offered for sale Average quality of cars

More information

Prof. Erhan Bayraktar (University of Michigan)

Prof. Erhan Bayraktar (University of Michigan) September 17, 2012 KAP 414 2:15 PM- 3:15 PM Prof. (University of Michigan) Abstract: We consider a zero-sum stochastic differential controller-and-stopper game in which the state process is a controlled

More information

A problem of portfolio/consumption choice in a. liquidity risk model with random trading times

A problem of portfolio/consumption choice in a. liquidity risk model with random trading times A problem of portfolio/consumption choice in a liquidity risk model with random trading times Huyên PHAM Special Semester on Stochastics with Emphasis on Finance, Kick-off workshop, Linz, September 8-12,

More information

Probabilistic Planning. George Konidaris

Probabilistic Planning. George Konidaris Probabilistic Planning George Konidaris gdk@cs.brown.edu Fall 2017 The Planning Problem Finding a sequence of actions to achieve some goal. Plans It s great when a plan just works but the world doesn t

More information

Controlled Diffusions and Hamilton-Jacobi Bellman Equations

Controlled Diffusions and Hamilton-Jacobi Bellman Equations Controlled Diffusions and Hamilton-Jacobi Bellman Equations Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2014 Emo Todorov (UW) AMATH/CSE 579, Winter

More information

On Stopping Times and Impulse Control with Constraint

On Stopping Times and Impulse Control with Constraint On Stopping Times and Impulse Control with Constraint Jose Luis Menaldi Based on joint papers with M. Robin (216, 217) Department of Mathematics Wayne State University Detroit, Michigan 4822, USA (e-mail:

More information

Lecture 4. 1 Examples of Mechanism Design Problems

Lecture 4. 1 Examples of Mechanism Design Problems CSCI699: Topics in Learning and Game Theory Lecture 4 Lecturer: Shaddin Dughmi Scribes: Haifeng Xu,Reem Alfayez 1 Examples of Mechanism Design Problems Example 1: Single Item Auctions. There is a single

More information

1 Bewley Economies with Aggregate Uncertainty

1 Bewley Economies with Aggregate Uncertainty 1 Bewley Economies with Aggregate Uncertainty Sofarwehaveassumedawayaggregatefluctuations (i.e., business cycles) in our description of the incomplete-markets economies with uninsurable idiosyncratic risk

More information

Minimization of ruin probabilities by investment under transaction costs

Minimization of ruin probabilities by investment under transaction costs Minimization of ruin probabilities by investment under transaction costs Stefan Thonhauser DSA, HEC, Université de Lausanne 13 th Scientific Day, Bonn, 3.4.214 Outline Introduction Risk models and controls

More information

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016 Course 16:198:520: Introduction To Artificial Intelligence Lecture 13 Decision Making Abdeslam Boularias Wednesday, December 7, 2016 1 / 45 Overview We consider probabilistic temporal models where the

More information

A Review of Auction Theory: Sequential Auctions and Vickrey Auctions

A Review of Auction Theory: Sequential Auctions and Vickrey Auctions A Review of Auction Theory: and Vickrey Daniel R. 1 1 Department of Economics University of Maryland, College Park. September 2017 / Econ415 . Vickrey s. Vickrey. Example Two goods, one per bidder Suppose

More information

Welfare Maximization with Production Costs: A Primal Dual Approach

Welfare Maximization with Production Costs: A Primal Dual Approach Welfare Maximization with Production Costs: A Primal Dual Approach Zhiyi Huang Anthony Kim The University of Hong Kong Stanford University January 4, 2015 Zhiyi Huang, Anthony Kim Welfare Maximization

More information

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 25, 2012

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 25, 2012 MFM Practitioner Module: Risk & Asset Allocation January 25, 2012 Optimizing Allocations Once we have 1. chosen the markets and an investment horizon 2. modeled the markets 3. agreed on an objective with

More information

Administration. CSCI567 Machine Learning (Fall 2018) Outline. Outline. HW5 is available, due on 11/18. Practice final will also be available soon.

Administration. CSCI567 Machine Learning (Fall 2018) Outline. Outline. HW5 is available, due on 11/18. Practice final will also be available soon. Administration CSCI567 Machine Learning Fall 2018 Prof. Haipeng Luo U of Southern California Nov 7, 2018 HW5 is available, due on 11/18. Practice final will also be available soon. Remaining weeks: 11/14,

More information

Liquidity risk and optimal dividend/investment strategies

Liquidity risk and optimal dividend/investment strategies Liquidity risk and optimal dividend/investment strategies Vathana LY VATH Laboratoire de Mathématiques et Modélisation d Evry ENSIIE and Université d Evry Joint work with E. Chevalier and M. Gaigi ICASQF,

More information

Stochastic Volatility and Correction to the Heat Equation

Stochastic Volatility and Correction to the Heat Equation Stochastic Volatility and Correction to the Heat Equation Jean-Pierre Fouque, George Papanicolaou and Ronnie Sircar Abstract. From a probabilist s point of view the Twentieth Century has been a century

More information

Lecture notes: Rust (1987) Economics : M. Shum 1

Lecture notes: Rust (1987) Economics : M. Shum 1 Economics 180.672: M. Shum 1 Estimate the parameters of a dynamic optimization problem: when to replace engine of a bus? This is a another practical example of an optimal stopping problem, which is easily

More information

A.Piunovskiy. University of Liverpool Fluid Approximation to Controlled Markov. Chains with Local Transitions. A.Piunovskiy.

A.Piunovskiy. University of Liverpool Fluid Approximation to Controlled Markov. Chains with Local Transitions. A.Piunovskiy. University of Liverpool piunov@liv.ac.uk The Markov Decision Process under consideration is defined by the following elements X = {0, 1, 2,...} is the state space; A is the action space (Borel); p(z x,

More information

Optimal Trade Execution with Instantaneous Price Impact and Stochastic Resilience

Optimal Trade Execution with Instantaneous Price Impact and Stochastic Resilience Optimal Trade Execution with Instantaneous Price Impact and Stochastic Resilience Ulrich Horst 1 Humboldt-Universität zu Berlin Department of Mathematics and School of Business and Economics Vienna, Nov.

More information

Uncertainty in Kalman Bucy Filtering

Uncertainty in Kalman Bucy Filtering Uncertainty in Kalman Bucy Filtering Samuel N. Cohen Mathematical Institute University of Oxford Research Supported by: The Oxford Man Institute for Quantitative Finance The Oxford Nie Financial Big Data

More information

Risk-Sensitive Control with HARA Utility

Risk-Sensitive Control with HARA Utility IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 46, NO. 4, APRIL 2001 563 Risk-Sensitive Control with HARA Utility Andrew E. B. Lim Xun Yu Zhou, Senior Member, IEEE Abstract In this paper, a control methodology

More information

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University

More information

Solution of Stochastic Optimal Control Problems and Financial Applications

Solution of Stochastic Optimal Control Problems and Financial Applications Journal of Mathematical Extension Vol. 11, No. 4, (2017), 27-44 ISSN: 1735-8299 URL: http://www.ijmex.com Solution of Stochastic Optimal Control Problems and Financial Applications 2 Mat B. Kafash 1 Faculty

More information

Microeconomic Theory (501b) Problem Set 10. Auctions and Moral Hazard Suggested Solution: Tibor Heumann

Microeconomic Theory (501b) Problem Set 10. Auctions and Moral Hazard Suggested Solution: Tibor Heumann Dirk Bergemann Department of Economics Yale University Microeconomic Theory (50b) Problem Set 0. Auctions and Moral Hazard Suggested Solution: Tibor Heumann 4/5/4 This problem set is due on Tuesday, 4//4..

More information

A MODEL FOR THE LONG-TERM OPTIMAL CAPACITY LEVEL OF AN INVESTMENT PROJECT

A MODEL FOR THE LONG-TERM OPTIMAL CAPACITY LEVEL OF AN INVESTMENT PROJECT A MODEL FOR HE LONG-ERM OPIMAL CAPACIY LEVEL OF AN INVESMEN PROJEC ARNE LØKKA AND MIHAIL ZERVOS Abstract. We consider an investment project that produces a single commodity. he project s operation yields

More information

Markov Decision Processes and Dynamic Programming

Markov Decision Processes and Dynamic Programming Markov Decision Processes and Dynamic Programming A. LAZARIC (SequeL Team @INRIA-Lille) Ecole Centrale - Option DAD SequeL INRIA Lille EC-RL Course In This Lecture A. LAZARIC Markov Decision Processes

More information

Speculation and the Bond Market: An Empirical No-arbitrage Framework

Speculation and the Bond Market: An Empirical No-arbitrage Framework Online Appendix to the paper Speculation and the Bond Market: An Empirical No-arbitrage Framework October 5, 2015 Part I: Maturity specific shocks in affine and equilibrium models This Appendix present

More information

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance

Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance Technical Appendix for: When Promotions Meet Operations: Cross-Selling and Its Effect on Call-Center Performance In this technical appendix we provide proofs for the various results stated in the manuscript

More information

Portfolios that Contain Risky Assets 17: Fortune s Formulas

Portfolios that Contain Risky Assets 17: Fortune s Formulas Portfolios that Contain Risky Assets 7: Fortune s Formulas C. David Levermore University of Maryland, College Park, MD Math 420: Mathematical Modeling April 2, 208 version c 208 Charles David Levermore

More information

Technion - Computer Science Department - Technical Report CS On Centralized Smooth Scheduling

Technion - Computer Science Department - Technical Report CS On Centralized Smooth Scheduling On Centralized Smooth Scheduling Ami Litman January 25, 2005 Abstract Shiri Moran-Schein This paper studies evenly distributed sets of natural numbers and their applications to scheduling in a centralized

More information

Order book modeling and market making under uncertainty.

Order book modeling and market making under uncertainty. Order book modeling and market making under uncertainty. Sidi Mohamed ALY Lund University sidi@maths.lth.se (Joint work with K. Nyström and C. Zhang, Uppsala University) Le Mans, June 29, 2016 1 / 22 Outline

More information

An Empirical Analysis of Consumer Behavior in the Online Auction Marketplace

An Empirical Analysis of Consumer Behavior in the Online Auction Marketplace Clemson University TigerPrints All Theses Theses 12-2014 An Empirical Analysis of Consumer Behavior in the Online Auction Marketplace Calvin Salter Clemson University, csalter@clemson.edu Follow this and

More information

Electronic Market Making and Latency

Electronic Market Making and Latency Electronic Market Making and Latency Xuefeng Gao 1, Yunhan Wang 2 June 15, 2018 Abstract This paper studies the profitability of market making strategies and the impact of latency on electronic market

More information

Game Theory: Spring 2017

Game Theory: Spring 2017 Game Theory: Spring 2017 Ulle Endriss Institute for Logic, Language and Computation University of Amsterdam Ulle Endriss 1 Plan for Today In this second lecture on mechanism design we are going to generalise

More information

Regularity of the Optimal Exercise Boundary of American Options for Jump Diffusions

Regularity of the Optimal Exercise Boundary of American Options for Jump Diffusions Regularity of the Optimal Exercise Boundary of American Options for Jump Diffusions Hao Xing University of Michigan joint work with Erhan Bayraktar, University of Michigan SIAM Conference on Financial

More information

Online Appendix Liking and Following and the Newsvendor: Operations and Marketing Policies under Social Influence

Online Appendix Liking and Following and the Newsvendor: Operations and Marketing Policies under Social Influence Online Appendix Liking and Following and the Newsvendor: Operations and Marketing Policies under Social Influence Ming Hu, Joseph Milner Rotman School of Management, University of Toronto, Toronto, Ontario,

More information

Suggested solutions for the exam in SF2863 Systems Engineering. December 19,

Suggested solutions for the exam in SF2863 Systems Engineering. December 19, Suggested solutions for the exam in SF863 Systems Engineering. December 19, 011 14.00 19.00 Examiner: Per Enqvist, phone: 790 6 98 1. We can think of the support center as a Jackson network. The reception

More information

STOCHASTIC PERRON S METHOD AND VERIFICATION WITHOUT SMOOTHNESS USING VISCOSITY COMPARISON: OBSTACLE PROBLEMS AND DYNKIN GAMES

STOCHASTIC PERRON S METHOD AND VERIFICATION WITHOUT SMOOTHNESS USING VISCOSITY COMPARISON: OBSTACLE PROBLEMS AND DYNKIN GAMES STOCHASTIC PERRON S METHOD AND VERIFICATION WITHOUT SMOOTHNESS USING VISCOSITY COMPARISON: OBSTACLE PROBLEMS AND DYNKIN GAMES ERHAN BAYRAKTAR AND MIHAI SÎRBU Abstract. We adapt the Stochastic Perron s

More information

Memory, Attention and Choice

Memory, Attention and Choice Memory, Attention and Choice Pedro Bordalo, Nicola Gennaioli, Andrei Shleifer BRIC 2015 1 / 19 Introduction Psychologists and Economists have long recognised that not all available information is used

More information

An Uncertain Control Model with Application to. Production-Inventory System

An Uncertain Control Model with Application to. Production-Inventory System An Uncertain Control Model with Application to Production-Inventory System Kai Yao 1, Zhongfeng Qin 2 1 Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China 2 School of Economics

More information

Economics 201b Spring 2010 Solutions to Problem Set 1 John Zhu

Economics 201b Spring 2010 Solutions to Problem Set 1 John Zhu Economics 201b Spring 2010 Solutions to Problem Set 1 John Zhu 1a The following is a Edgeworth box characterization of the Pareto optimal, and the individually rational Pareto optimal, along with some

More information

Dynamic Discrete Choice Structural Models in Empirical IO

Dynamic Discrete Choice Structural Models in Empirical IO Dynamic Discrete Choice Structural Models in Empirical IO Lecture 4: Euler Equations and Finite Dependence in Dynamic Discrete Choice Models Victor Aguirregabiria (University of Toronto) Carlos III, Madrid

More information

Introduction to Reinforcement Learning. CMPT 882 Mar. 18

Introduction to Reinforcement Learning. CMPT 882 Mar. 18 Introduction to Reinforcement Learning CMPT 882 Mar. 18 Outline for the week Basic ideas in RL Value functions and value iteration Policy evaluation and policy improvement Model-free RL Monte-Carlo and

More information

Rational Inattention

Rational Inattention Rational Inattention (prepared for The New Palgrave Dictionary of Economics) Mirko Wiederholt Northwestern University August 2010 Abstract Economists have studied for a long time how decision-makers allocate

More information