Dynamic Iterations for the Solution of Ordinary Differential Equations on Multicore Processors

Size: px
Start display at page:

Download "Dynamic Iterations for the Solution of Ordinary Differential Equations on Multicore Processors"

Transcription

1 Dynamic Ieraions for he Soluion of Ordinary Differenial Equaions on Mulicore Processors Yanan Yu Compuer Science Deparmen Florida Sae Universiy Tallahassee FL 336, USA Ashok Srinivasan Compuer Science Deparmen Florida Sae Universiy Tallahassee FL 336, USA Absrac In he pas few years, here has been a rend of providing increased compuing power hrough greaer number of cores on a chip, raher han hrough higher clock speeds. In order o exploi he available compuing power, applicaions need o be parallelized efficienly. We consider he soluion of Ordinary Differenial Equaions (ODE) on mulicore processors. Convenional parallelizaion sraegies disribue he sae space amongs he processors, and are efficien only when he sae space of he ODE sysem is large. However, users of a deskop sysem wih mulicore processors may wish o solve small ODE sysems. Dynamic ieraions, parallelized along he ime domain, appear promising for such applicaions. However, hey have been of limied usefulness because of heir slow convergence. They also have a high memory requiremen when he number of ime seps is large. We propose a hybrid mehod ha combines convenional sequenial ODE solvers wih dynamic ieraions. We show ha i has beer convergence and also requires less memory. Empirical resuls show a facor of wo o four improvemen in performance over an equivalen convenional solver on a single node. The significance of his paper lies in proposing a new mehod ha can enable small ODE sysems, possibly wih long ime spans, o be solved faser on mulicore processors.. Inroducion Parallel processing is now widely available due o he populariy of mulicore processors. A deskop, for example, may have wo quad-core processors. I may also be equipped wih a GPU having - cores. The Cell processor has eigh cores called SPEs, which are mean o handle he compuaional workload. Two of hese can be combined o form a 6-core shared memory machine. The Sony PS3 uses he same chip and makes six SPE cores available for use. Thus parallel processing is no longer resriced o he HPC communiy. Non-HPC users ofen have a need o solve smaller problems han he HPC communiy does. However, he speed of soluion can sill be imporan. For example, reducing he ime o solve an ODE from a minue o around en seconds is a noiceable difference in soluion ime. Applicaions will need o efficienly exploi he available parallelism, in order o ge improved performance. This can be difficul for small problems, because of he finer granulariy. We will consider he specific problem of solving an Iniial Value Problem involving an ordinary differenial equaion. We provide below a simple descripion of he problem and he convenional approach o solving i. (Readers wih a background in ODEs can skip he nex wo paragraphs.) A firs order ODE is given in he form du/d = f(, u), where u, which may be a vecor, is he sae of a physical sysem, and is ofen ime. Higher order ODEs can be expressed as a firs order ODE sysem using a sandard ransformaion [3]. The iniial sae of he sysem a some ime, u( ), is provided in an Iniial Value Problem. The problem is o deermine he value of u a subsequen values of ime, up o some ime span T. An ODE solver will ieraively deermine hese values by saring wih he known value of u a, and use u( i ) o find u( i+ ). Convenional parallelizaion of his procedure disribues he componens of u amongs differen processors, and all processors updae he values of u assigned o hem, a each ime sep. In order o perform his updae, each processor compues he componens of f for he componens of u assigned o i. They will need some componens of u assigned o oher processors, and hus communicaion is required. In a disribued memory compuing environmen, he communicaion cos is a leas he order of a microsecond using a fas nework such as Infiniband. In he shared memory programming paradigm on symmeric muliprocessor sysems, he hread synchronizaion overhead is he order of a microsecond. Therefore, he compuaion involved on a processor should be large enough ha he parallel overhead does no become a boleneck. When he sae space is small, he communicaion overhead dominaes he compuaion, and parallelizaion is no feasible. Thus any improvemen

2 over he sequenial performance is of much benefi in such siuaions. For small ODE sysems, he soluion ime is significan when he number of ime seps is large. Parallelizaion of he ime domain appears promising hen. Dynamic ieraions are a class of mehods ha are ofen suiable for ime parallelizaion. Here, we sar ou wih an iniial guess for he enire ime domain, and hen ieraively improve i unil i converges. Picard ieraions, which pre-dae modern ODE solvers, are easy o parallelize along he ime domain each processor updaes a differen ime inerval independenly, followed by a parallel prefix compuaion o correc he updae. This mehod is explained furher in. Dynamic ieraions were well sudied in he 98s and 99s. However, hey were no suiable for mos realisic problems due o heir slow convergence. They also have a high memory requiremen, because he soluion a all ime seps in he previous ieraion is needed for he updae in he nex ieraion. In his paper, we propose a hybrid scheme ha combines convenional sequenial ODE solvers wih dynamic ieraions. We show ha he convergence is faser wih his scheme. This scheme also guaranees progress in each ieraion, a leas one ime inerval converges. Thus, in he wors case, he oal compuaion ime is no worse han ha of a sequenial solver, apar from a small parallelizaion overhead. Of course, we hope o improve on a sequenial solver in a ypical case. We show laer ha we obain significan improvemen in performance on some sample ODE sysems, compared wih an equivalen convenional solver. We also compare i wih Picard ieraions. The hybrid mehod converges abou wice as fas as Picard ieraions. The ouline of he res of his paper is as follows. In we explain dynamic ieraions, and Picard ieraions in paricular. We hen describe he new approach in 3 and discuss is correcness and convergence. We demonsrae is effeciveness in 4. We discuss oher ime parallelizaion echniques in 5. We summarize our conclusions and presen fuure work in 6.. Dynamic Ieraions The modern developmen of dynamic ieraions for he soluion of Iniial Value Problems sared wih he work on Waveform Relaxaion for he soluion of large differenialalgebraic equaions in VLSI circui simulaions [7]. Many varians of he original mehod were hen developed in he 98s and 99s. A unified definiion of his class of mehods is presened in [3]. I is also shown here ha Picard ieraions, which pre-dae waveform relaxaion by almos a cenury, can be considered a special case of his class of mehods. We use he noaion of [3] in he descripion below. Consider an iniial value problem, u = f(, u), u() = u, () where u R n, f : R R n R n, and u := du/d. Dynamic Ieraions solve () recursively using he following equaion. g(, u m ) = f(, u m ), () = u. () The recursion is normally sared using he iniial guess u () = u. If g is defined such ha g(u, u) =, hen i can be shown ha if he ieraion converges, hen i converges o he exac soluion, provided () is solved exacly [3]. In pracice, numerical mehods are used o solve (), and he approximaions used affec he resuls, as shown in 4. Differen mehods differ in heir choice of g, which influences he convergence behavior and also he ease of parallelizaion. Picard ieraions choose g(y, z) =. We discuss oher common choices of g in 5. Dynamic ieraions converge under fairly weak condiions [3], and so can be applied in mos pracical siuaions. The Picard mehod can also be parallelized efficienly, as explained below. Picard ieraions recursively solve () as follows. Iniially, u is sored a discree ime poins. In he (m+)h ieraion, f(, u m ) is compued a hose ime poins. Then f is numerically inegraed using some quadraure mehod. This leads o he following expression for deermining, when he inegraion is exac. u() m+ = u + f(s, u m (s)) ds. (3) The above equaion can be rewrien as follows. i+ = um+ i + i+ i f(s, u m (s)) ds. (4) Here, i is a sequence of ime poins. There may be several ime poins a which u m are known, in he ime inerval [ i, i+ ]. The inegral in (4) is evaluaed using a quadraure rule. The above mehod is parallelized easily. Each processor i, i < P is responsible for deermining he values of in he inerval ( i, i+ ], using he values of u m. Each of hem firs evaluaes he inegrals in (4) for heir ime poins using a quadraure mehod, for which hey will need some boundary values of u m available on neighboring processors. Then, a parallel prefix compuaion on he inegral for i s is performed o compue he cumulaive inegrals from o i. Afer his, each processor can independenly compue he values of for which hey are responsible. If he ime inerval ( i, i+ ) conains sufficien ime poins a which u are evaluaed, hen he compuaion cos will be much larger han he prefix cos, and hus he parallelizaion will be efficien. While he parallel overhead is ofen small, he efficiency of parallelizaion of any dynamic ieraion should be compared agains an equivalen convenional ODE solver (ha is, having he same error order as he quadraure), raher han agains he dynamic ieraion on one processor, in order o ge an idea of he effeciveness of dynamic ieraions.

3 Under his crieria, noe ha if he compuaions converge in N ieraions, hen he efficiency canno be greaer han /N. For example, N = 5 will lead o relaive efficiency less han %. Thus, dynamic ieraions are suiable when convenional mehods give poor parallel efficiency. Dynamic ieraions have generally no found widespread use because hey were slow o converge, making hem uncompeiive wih convenional solvers. For example, fig. shows four ieraions being needed for convergence (jus from a visual perspecive) for a simple equaion. However, we believe ha wih he populariy of mulicore processors, i is ime o re-evaluae he usefulness of dynamic ieraions. As menioned earlier, we now have a siuaion where users can solve small problems in parallel. Convenional solvers are no effecive for his, and so any gain in speedup is a benefi, even if he efficiency is low. For example, an efficiency of % on eigh cores gives a speedup of.6 over a convenional solver. 3. A Hybrid Scheme We will presen our mehod as an inuiive improvemen o he Picard mehod o deal wih one of he laer s shorcoming, and hen we will analyze is convergence properies. Apar from slow convergence, anoher problem wih he Picard mehod, of somewha less imporance, is is large memory requiremen. In order o simulae n ime seps, he value of u m needs o be sored for he n ime poins. Usually one divides he oal ime span ino smaller windows, and wais for convergence of he enire window before moving o he nex window. However, he number of ime seps in each window can sill be high. This can affec performance hrough increased cache misses. In he proposed mehod, only he resuls a cerain ime poins i are sored. The ime seps are usually small for sabiliy and accuracy reasons. However, because he resul of he exac soluion a each ime sep is no always required, we need no o sore he resuls a all inermediae ime seps, whereas Picard ieraions need he inermediae values inside each inerval [ i, i+ ]. Of course, he granulariy of he i s in he hybrid mehod should be a leas as fine as he resoluion desired by he user. We firs noe ha he exac soluion of eqn. () is given by he following. u i+ = u i + i+ i f(s, u(s)) ds. (5) If we compare wih he Picard recurrence, eqn. (4), we see ha Picard ieraions replace u wih u m in he inegral, because ha is he laes approximaion o u available o i. In our mehod, we have u m available only a he poins i. In our mehod, processor i will solve he exac ODE eqn. () wih iniial condiion û (i) ( i ) = u m i, up o ime i+. The ha on u indicaes ha his is a soluion o he exac ODE wih a possibly wrong iniial condiion, and he subscrip (i) indicaes ha his problem is solved on processor i. Noe ha û (i) ( i+ ) = û (i) ( i ) + i+ i f(s, û (i) (s)) ds. Thus we ge he following expression. û (i) ( i+ ) u m i = i+ i f(s, û (i) (s)) ds, (6) where we have used he iniial condiion û (i) ( i ) = u m i. We replace he inegral in he Picard ieraions recurrence, eqn. (4) wih he inegral above, and his inegral, in urn, can be replaced by he lef hand side of he expression above o give he following recurrence for he hybrid mehod. i+ := um+ i + û (i) ( i+ ) u m i. (7) The parallelizaion is similar o ha of he Picard mehod, and is shown in Algorihm 3.. The û (i) ( i+ ) are compued by solving an ODE, independenly on each processor. Then a parallel prefix compuaion is performed as in he Picard mehod and he new approximaion i+ is compued. An inuiive way of hinking abou his mehod is as follows. If u m i were accurae, hen û i ( i+ ) would be he exac soluion for ime i+. However, we observe an error in he value of i u m i beween wo successive ieraions a ime i. We add his as a correcion facor o û i ( i+ ). Algorihm 3.: HYBRID-METHOD(u i = u, Number of processors P) repea for each processor i {..P } û m (i) ( i+) Solve ODE do (wih iniial condiion u do m i ) i û m (i) ( i+) u m i G i Parallel prefix on i i+ G i + u unil Convergence We nex show ha if he ieraions converge, hen hey converge o he exac soluion, provided he ODE is solved exacly in each ieraion. Of course, he ODE is acually solved numerically, wih approximaion errors. We evaluae he effec of hese in 4. Assume ha he ieraions converge. A convergence, i = u m i, for i [, n]. From eqn. (7), we ge i+ = û (i) ( i+ ). Processor sars from he exac iniial condiion, and hus = û ( ) is exac. Bu = u m due o convergence, and hus u m is exac. Thus process sared from he exac soluion in deermining û ( ) and hus = û ( ) is exac. We can proceed in his manner, using an inducive argumen o show ha is exac.

4 We now consider he issue of how fas he mehod converges. We firs explain why we inuiively expec i o have beer convergence han Picard ieraions. We hen prove ha i converges in a finie number of seps, if he ODE soluion is exac. We will demonsrae in 4. ha i has good convergence properies in pracice. We saw above ha he difference beween Picard ieraions and he hybrid mehod is in how he erm i+ i f(s, u(s)) ds is esimaed, wihou knowing he rue soluion u. Picard ieraions perform a quadraure using he approximaion u m. The hybrid mehod sars wih a poin on u m, bu uses he exac differenial equaion. Thus, we can expec is soluion o be beer han ha of Picard ieraions, which use he old approximaion hroughou he ime domain. We also show ha he hybrid mehod always progresses. Tha is, in each sep, a leas one addiional ime inerval converges. Noe ha processor sars wih he exac iniial condiion and hus converges afer he firs ieraion. In he second ieraion, he iniial condiion for processor is exac, and so is soluion u is exac. Using an inducive argumen, u i i is exac. Thus he mehod converges in n ieraions if here are n ime inervals. Of course, his would no give any speedup over an equivalen sequenial solver, and we would hope o do beer in pracice. However, i does provide a guaranee ha in he wors case, he performance will no be worse han an equivalen convenional sequenial solver, if he parallelizaion overheads are small. Noe ha he memory consumpion of his solver on each processor is small roughly he same as a sequenial solver. I is independen of he number of ime seps in a ime inerval, unlike dynamic ieraions. We consider hree varians of his mehod. (i) We solve for he enire ime domain by dividing he ime span T ino T/P inervals on P processors. (ii) We use windows of a cerain widh W, wih he number of windows being much larger han P. The processors wai for convergence of he firs P windows (one window per processor) before proceeding wih he nex se of P windows. This ype of windowing has been shown by ohers o be more effecive han solving for he enire ime domain in dynamic ieraions [6]. (iii) We noice ha afer he firs few processors have converged, hey don perform useful work by repeaing he calculaions. So, hese processors will be used for compuing new windows. We call his mehod sliding windows. Noe ha a processor canno be considered as having converged unil i has locally converged and all he processors handling smaller values of ime have also converged. 4. Empirical Evaluaion 4.. Experimenal Seup The experimenal ess were performed on an Inel Xeon cluser. Each node has wo.33 GHz Quad-core CPUs wih 8 GB shared memory. OpenMPI. is used for iner-process communicaions. Boh he sequenial and parallel code are compiled wih opimizaion level O3. All he speedup and relevan resuls are obained on 8 cores on a single node. We use an assembly iming rouine, which accesses he ime-samp couner hrough he rdsc insrucion, for our iming resuls. The resoluion of he imer is.5µs. I is sufficienly good for he iming purpose in his paper, where he iming resuls are a leas he order of a few microseconds. The resuls are presened by solving hree ODE sysems. They are: (i) ODE: u = ω u, wih ω =, u() =, u () = for [, 6.4]; (ii) ODE Airy equaion: u = k u, wih k =, u( 6) =, u () = for [ 6, ]; (iii) ODE3 Duffing equaion: u + δu + (βu 3 + ω u) = γ cos(ω+φ), wih γ =.3, φ =, δ =., β =.5, ω =, u( 6) =, u () = for [ 6, ]. The Airy equaion is commonly seen in physics and asronomy. The Duffing equaion describes he moion of a damped oscillaor. In his paper, a single sep 4h order Runge-Kua Mehod is implemened as he ODE solver used in he hybrid mehod. We use he Simpson s rule o evaluae he quadraure in he Picard mehod. The accuracy of he Simpson s rule is also order 4. All compuaions were in double precision. The exac soluions o each of he hree ODEs are shown in fig., 3, 4. The exac soluion o he firs ODE can be expressed in he simple symbolic form, y = sin(ω). However, here are no closed-form soluions o he second and hird ODEs in he real domain. In order o ge good approximaions o he exac soluions, we used he following procedure. We solved he ODEs using a sequenial ODE solver. We hen solved hem again wih a ime sep size ha was an order of magniude smaller, and deermined he larges difference in values a any given ime poin. We repeaed his wih smaller ime seps, unil his larges difference was he order of. This soluion was hen used as an approximaion o he exac soluion. In 4., we chose he ime sep size such ha he global error is of he order of 6. Therefore, he accuracy in he approximaed exac soluion is sufficien for he comparisons in his paper. In he resuls below, we use he erms olerance and error. The laer refers o he absolue difference beween he exac soluion and he compued soluion. The olerance is a parameer o he ODE solver. When using dynamic ieraions, he soluion is considered o have converged if he absolue difference beween soluions a successive ieraions is smaller han his value. When used wih a sequenial ODE

5 y.8.6 y Figure. Convergence of Picard ieraion for he soluion of u = u, u = Figure 3. ODE: y = k y 4.5 y y Figure. ODE: y = ω y solver, he olerance has he convenional meaning. The error may be larger han he olerance. The speedup resuls compare he ime of dynamic ieraions wih he ime for a sequenial 4h order Runge- Kua Mehod. Speedup is close o linear when we compare wih a sequenial implemenaion of dynamic ieraions. However, in order o realisically evaluae he usefulness of dynamic ieraions, we need o compare hem agains a convenional solver. The oal compuaional work of a sequenial solver is less han ha of dynamic ieraions, and so he speedup resuls are lower han if compared wih a sequenial implemenaion of dynamic ieraions Figure 4. ODE3: y + δy + (βy 3 + ω ) = γ cos(ω + φ) 4.. Experimenal Resuls We firs explain our choice of ime sep size. If he ime sep size is smaller han necessary, hen he granulariy of he compuaion is coarser han necessary, and hus he communicaion overheads are arificially made small, relaive o he compuaional effor in is parallel implemenaion. In order o avoid his, we firs evaluae he accuracy of he sequenial ODE solver for differen ime sep sizes, and choose one ha gives good accuracy. For sabiliy, a sep size of.5 is sufficien. However, a much smaller sep size is usually required for accuracy. Fig. 5 shows he comparison of he errors from ode45 funcion in Malab (wih parameers of absolue error = 6 and relaive error = 6 ) and dlsode funcion in ODEPACK. We need a sep size of 7 for he

6 4 5 4 ODE ODE ODE3 Errors 6 8 RK4 sepsize=e 7 RK4 sepsize=e 6 ode45 (Malab) ODEPACK Figure 5. Error versus ime for differen values of sepsize h using he sequenial solver for ODE. Errors are compared wih ha of ode45(malab) and dlsode(odepack). Speedup 4 3 Wihou Windowing Wih Windowing Sliding Window ODE ODE ODE3 Figure 6. Speedup resul of he hree ODEs using differen varians of he hybrid mehod. The window size is 5 ime seps and he olerance is 6. sequenial ODE solver o have he same order of accuracy as ode45 in Malab when solving ODE. Similar experimens were performed for ODE and ODE3 and heir sep sizes were also deermined as 7. As menioned in 3, he hybrid mehod can have hree differen varians wihou windowing, wih windowing, and wih sliding windows. We wish o compare he effeciveness of each of hese varians. Fig. 6 shows he speedup resuls using he hree varians on he ODEs. The window size is 5 ime seps for boh he windowing mehods. We can see ha sliding windows yields a beer speedup han fixed No. of Ieraions per Window 3 Tol= Tol= Window Size (no. of ime seps) Figure 7. Average number of ieraions using differen window sizes and olerances wih sliding windows. windows. The non-window mehod does no perform well for such small numbers of processors, because i requires more han eigh ieraions, on eigh cores, o converge for he given ime span. Given hese observaions, we will only use resuls from he sliding windows mehod for comparisons wih he sequenial solver as well as wih Picard ieraions, in he following discussions. The hybrid mehod wih sliding windows requires hree MPI funcion calls per ieraion. MPI Scan is used for he parallel prefix summaion, MPI Allreduce is called in esing for convergence, and MPI Isend/MPI Recv is used o ransfer he laes updaed value beween cerain processors. Across 8 cores on he same node, MPI Scan and MPI Isend/MPI Recv ake less han µs and MPI Allreduce akes less han µs. The compuaion ime per sep for he hree sample ODEs are.µs,.µs and.44µs. For speedup resuls shown in he following discussions, he window size is a leas 3 ime seps. In ha case, he granulariy of he compuaion is of he order of a hundred microseconds, and he overall communicaion overhead a mos he order of %. We now discuss he choice of window size. As menioned before, he window size affecs he granulariy of he parallelism. For exac ODE solvers, smaller granulariy indicaes relaively larger communicaion overhead. Thus he speedup can be expeced o decrease wih decrease in window size. In he hybrid mehod, he window size can also affec he number of ieraions as well as he numerical accuracy of he soluion. We experimened wih hree differen window sizes, 5 ime seps, 4 ime seps and 3 ime seps. Fig. 7 compares he average number of ieraions per window for he soluions o converge. As we can see, if we use he same order of error olerance for all he window sizes, he

7 6 5 ODE ODE ODE3 window size= 3 window size= 4 window size= 5 Speedup 4 3 Tol= Tol= Errors 3 5 Window Size (no. of ime seps) Figure 8. Speedup resuls using differen window sizes and olerances wih sliding windows Figure. Errors using differen window sizes in he Picard mehod wih olerance equal o o solve for ODE3. Errors 4 6 window size= 5, olerance= 6 window size= 4, olerance= 6 window size= 3, olerance= window size= 3, olerance= 6 window size= 4, olerance= 6 4 Figure 9. Accuracy of he soluion using differen window sizes and olerances for ODE3. (The black solid line almos overlaps wih he red dashed line.) smaller he window size is, he faser he mehod converges. Fig. 8 compares he speedup when differen window sizes and olerance are used o solve he hree ODEs. However here is a rade-off wih he accuracy of he soluion. Fig. 9 shows how he accuracy is affeced by differen window sizes and local error olerances. The error is abou wo orders larger when he window size is decreased by one order of magniude, when he olerance is 6. By comparing he effec of differen window sizes and olerances on boh speedup and accuracy, we chose he window size of 3 ime seps. Similar conclusions were also drawn for ODE and ODE. The window size also affecs he accuracy and he convergence rae of Picard ieraions. Fig. shows he error versus he window size for ODE3. The sep size used for he quadraure inegraion is he same as ha of he ODE solver for our modified mehod. The solid line uses he same window size of 3 and olerance of. As we can see he accuracy is abou order lower han ha of our mehod. Fig. shows he comparison of he number of ieraions of he Picard mehod o he hybrid mehod for he equivalen parameers of window size, sepsize and olerance. The y-axis is he raio of he number of ieraions required by Picard ieraions versus he number of ieraions required by he hybrid mehod. The convergence rae is slower in Picard ieraions in all he hree cases. However, noe ha his improvemen in convergence rae does no direcly ranslae o a proporional improvemen in speedup, because he Runge-Kua mehod evaluaes he ODE a 5 poins for each ime sep, while he Picard ieraion, using Simpson s rule, only calls he funcion f in equaion () once for each ime sep. 5. Relaed Work We discussed dynamic ieraions using he Picard mehod in. Oher mehods can be derived by a differen choice of g in equaion (). Two imporan cases are he Waveform Jacobi mehod and he Waveform Gauss-Seidel mehod. The definiions of g for hese are provided in [3]. The Waveform Jacobi recurrence is like ha of equaion (3), excep ha in he evaluaion of he j h componen of f, he j h componen of u m is replaced by he j h componen of. This decomposes he ODE wih n variables ino n ODEs where one variable is updaed in each

8 Average number of ieraions per window (Picard ieraion/hybrid mehod) 3 Tol= Tol=e 6 Window size (no. of ime seps) ODE ODE ODE3 Tol= Figure. Raio of he average number of ieraions. Picard mehod vs. he hybrid mehod compared for he same window size and olerance. ODE. In he evaluaion of f, he value of u m (s) is used for all oher variables. The Waveform Gauss-Seidel recurrence is like ha of equaion (3), excep ha in he evaluaion of he j h componen of f, all componens of u m wih index less han or equal o j are replaced by he corresponding componens of. This decomposes he ODE wih n variables ino n ODEs where one variable is updaed in each ODE. The ODE for he firs componen is solved as in Waveform Jacobi. The ODE for he second componen uses he values of he firs componen of (s) evaluaed before, and he values of u m (s) for he hird and higher indexed componens, in each evaluaion of f. This process coninues, wih he ODE for each componen using he values of (s) for componens wih smaller indices, and u m (s) for componens wih larger indices, in each evaluaion of f. Noe ha, despie he use of an ODE solver insead of quadraure by hese mehods, hey are quie differen from he hybrid-scheme. Their basic goal is o decouple he compuaion of each componen of he sysem. They have high memory requiremen as ha of Picard ieraions, because u m needs o be sored for all ime seps. Each ieraion of Waveform Jacobi can be parallelized by having he ODE for each componen solved on a differen processor. This is advanageous for large sysems, especially when differen ime seps can be used for differen componens. However, he communicaion cos can be high, because he componens of u a all ime poins will need o be sen o oher processors. Parallelizing i along he ime domain is difficul in general. Some special cases can, however, be parallelized in ime [3]. A wavefron approach is used o parallelize Waveform Gauss-Seidel [3]. The idea behind his is basically pipelining. Noe ha once he firs ime sep of he firs componen of has been compleed, he firs ime sep of he second componen can be sared, while he second ime sep of he firs componen is simulaneously sared. If we need o perform m ieraions wih d variables for n ime seps, hen he m n d evaluaions of f are performed in parallel in d+m n ime, if d n. This mehod appears o parallelize well. However, noe ha a sequenial ODE solver can solve he same problem wih n evaluaions of f. Thus, even wih parallelizaion, his mehod is no faser han an equivalen convenional solver. The above processes of spliing he ODE ino subcomponens can be generalized by keeping blocks of variables ha are srongly coupled ogeher. The blocks may also overlap [4]. When () is solved exacly, he soluion converges superlinearly [3], [] on a finie ime inerval. However, numerical mehods have o be used o solve i in pracice. Theoreical resuls on convergence for discree version of dynamic ieraions are presened in [], [3]. Esimaes of suiable window sizes for good convergence are provided in [5]. In realisic siuaions, his class of mehods has ofen been found o converge slowly, which is a major limiaion of hese mehods in pracice. Differen sraegies have been proposed o speed up he convergence of dynamic ieraions. Coarse ime sep size for he earlier ieraions, and finer ime seps for laer ieraions, is proposed in []. A wo-sep-size scheme for solving Picard ieraions is proposed in [6]. A fine sep size is used for a fixed number of ieraions in all he sub-sysems o smooh he ieraions. A coarse sep size is hen used o solve he residue of he soluion resriced o he coarse ime-mesh. Muligrid echniques have also been used for acceleraing convergence [8], [7]. Reduced order modeling echniques have been combined wih dynamic ieraions, in order o help propagae he change in one componen of he sysem faser o he res of he sysem in []. A differen approach o ime parallelizaion uses a coarsegrained model o guess he soluion, followed by correcion. The Parareal mehod can be considered an example of his [], [9]. Dynamic daa-driven ime parallelizaion [4], [5], [8] uses resuls from prior, relaed, runs o consruc a coarse-grained model. Convenional parallelizaion of ODEs is hrough disribuion of he sae space, as menioned earlier. The difficuly wih small sysems is ha he compuaion ime per ime sep is small. For example, each sep in our compuaion akes he order of a enh of a microsecond using he. However, his mehod may be useful for oher reasons, such as when widely differen ime seps can be used for differen componens. Bu he order in which variables are solved has o be chosen carefully, o preven he variable wih small imes seps from becoming a boleneck o variables ha depend on i [3].

9 Runge-Kua mehod. This will be even smaller when he sae space is disribued across he cores. On he oher hand, MPI communicaion overhead is of he order of microseconds. One could also use hreads on mulicore processors. However, he hread synchronizaion overhead is hen significan, as menioned earlier. In fac, a parallel OpenMP implemenaion ook almos an order of magniude more ime han a sequenial ODE solver, for he hree ODEs considered in his paper. 6. Conclusions We have shown ha hybrid dynamic ieraions yield significanly beer performance han an equivalen sequenial mehod. This is imporan because convenional parallelizaion is no feasible for small ODEs. The loss in efficiency in he hybrid scheme is primarily due o he number of ieraions required for convergence. The oher parallelizaion overheads are small. Thus, hey provide subsanial benefi when effecive, and are no much worse han he sequenial compuaion when less effecive. We have considered hree varians of his mehod: (i) solving for he enire inerval, (ii) using windows, and (iii) using sliding windows. The former wo have been sudied by ohers for dynamic ieraions (see [5], [], [6] for windowing). We demonsraed hrough he speedup resuls ha he parallel efficiency of he hird is beer. We also showed ha he convergence behavior of he hybrid mehod is beer han Picard ieraions. Furhermore, our mehod has smaller memory requiremens han dynamic ieraions. This can lead o beer cache performance. This paper has demonsraed he poenial of hybrid dynamic ieraions in providing a scalable soluion for small ODE sysems on modern compuing plaforms. We propose he following fuure work o furher esablish his echnique. We are currenly working on compleing he following simple exensions of his paper. (i) We are evaluaing more ODEs. (ii) We are implemening his approach on GPUs (including double precision) and on he Cell. The hybrid mehod s smaller memory requiremen is especially useful on hese plaforms because of he small local sore on he Cell and he small se of regisers and shared memory on he GPU. (iii) We plan o replace he Runge-Kua mehod, which requires muliple evaluaions of f a each ime sep, by a muli-sep mehod, which requires only one funcion evaluaion a each sep. Our plans for he fuure are as follows. (i) We wish o use an adapive ODE solver insead of he fixed ime-sep Runge-Kua mehod ha we have currenly used. This may require load balancing, when he ime seps sizes on differen processors are differen. (ii) Our error olerance condiion is currenly very simple, and jus based on he absolue difference beween differen ieraions. Convenional solvers use error esimaes o deermine he olerance. We need o develop heoreical resuls o enable us o esimae he error, based on he ime seps size, olerance, and window widh. We are also planning a Malab inerface so ha he Malab ode45 solver can be replaced by our solver when a GPU is available. Acknowledgmens This work was funded by NSF gran # DMS 668. We also hank Xin Yuan a Florida Sae Universiy for permiing use of his Linux cluser. A.S. expresses his graiude o Sri Sahya Sai Baba for his help and inspiraion. References [] L. Baffico, S. Bernard, Y. Maday, G. Turinici, and G. Zerah. Parallel-in-ime molecular-dynamics simulaions. Physical Review E (Saisical, Nonlinear, and Sof Maer Physics), 66: ,. [] M. Bjorhus. A noe on he convergence of discreized dynamic ieraions. BIT Numerical Mahemaics, 35:9 96, 995. [3] C. W. Gear. Waveform mehods for space and ime parallelism. Journal of Compuaional and Applied Mahemaics, 38:37 47, 99. [4] R. Jelsch and B. Pohl. Waveform relaxaion wih overlapping spliings. SIAM Journal on Scienific Compuing, 6:4 49, 995. [5] B. Leimkuhler. Esimaing waveform relaxaion convergence. SIAM Journal on Scienific Compuing, 4:87 889, 993. [6] B. Leimkuhler. Timesep acceleraion of waveform relaxaion. SIAM Journal on Numerical Analysis, 35:3 5, 998. [7] E. Lelarasmee, A. E. Ruehli, and A. L. Sangiovanni- Vincenelli. The waveform relaxaion mehod for imedomain analysis of large scale inegraed circuis. IEEE Transacions on Compuer-aided Design of Inegraed Circuis and Sysems, :3 45, 98. [8] Ch. Lubich and A. Osermann. Muli-grid dynamic ieraions for parabolic equaions. BIT Numerical Mahemaics, 7:6 34, 987. [9] Y. Maday and G. Turinici. Parallel in ime algorihms for quanum conrol: Parareal ime discreizaion scheme. Inernaional Journal of Quanum Chemisry, 93:3 38, 3. [] O. Nevanlinna. Remarks on Picard Lindelöf ieraion. BIT Numerical Mahemaics, 9:38 346, 989. [] M. Rahinam and L. R. Pezold. Dynamic ieraion using reduced order models: A mehod for simulaion of large scale modular sysems. SIAM Journal on Numerical Analysis, 4: ,. [] J. Sand and K. Burrage. A Jacobi waveform relaxaion mehod for ODEs. SIAM Journal on Scienific Compuing, :534 55, 998.

10 [3] L. F. Shampine. Numerical Soluion of Ordinary Differenial Equaions. Chapman & Halll, 994. [4] A. Srinivasan and N. Chandra. Laency olerance hrough parallelizaion of ime in scienific applicaions. Parallel Compuing, 3: , 5. [5] A. Srinivasan, Y. Yu, and N. Chandra. Applicaion of reduced order modeling o ime parallelizaion. In Proceedings of HiPC 5, Lecure Noes in Compuer Science, volume 3769, pages 6 7. Springer-Verlag, 5. [6] J. Sun and H. Grosollen. Fas ime-domain simulaion by waveform relaxaion mehods. IEEE Transacions on Circuis and Sysems - I: Fundamenal Theory and Applicaions, 44:66 666, 997. [7] S. Vandewalle and R. Piessens. On dynamic ieraion mehods for solving ime-periodic differenial equaions. SIAM Journal on Numerical Analysis, 3:86 33, 993. [8] Y. Yu, A. Srinivasan, and N. Chandra. Scalable imeparallelizaion of molecular dynamics simulaions in nano mechanics. In Proceedings of he 35 h Inernaional Conference on Parallel Processing (ICPP), pages 9 6. IEEE, 6.

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

d 1 = c 1 b 2 - b 1 c 2 d 2 = c 1 b 3 - b 1 c 3

d 1 = c 1 b 2 - b 1 c 2 d 2 = c 1 b 3 - b 1 c 3 and d = c b - b c c d = c b - b c c This process is coninued unil he nh row has been compleed. The complee array of coefficiens is riangular. Noe ha in developing he array an enire row may be divided or

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

Single and Double Pendulum Models

Single and Double Pendulum Models Single and Double Pendulum Models Mah 596 Projec Summary Spring 2016 Jarod Har 1 Overview Differen ypes of pendulums are used o model many phenomena in various disciplines. In paricular, single and double

More information

Ordinary differential equations. Phys 750 Lecture 7

Ordinary differential equations. Phys 750 Lecture 7 Ordinary differenial equaions Phys 750 Lecure 7 Ordinary Differenial Equaions Mos physical laws are expressed as differenial equaions These come in hree flavours: iniial-value problems boundary-value problems

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 175 CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 10.1 INTRODUCTION Amongs he research work performed, he bes resuls of experimenal work are validaed wih Arificial Neural Nework. From he

More information

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source Muli-scale D acousic full waveform inversion wih high frequency impulsive source Vladimir N Zubov*, Universiy of Calgary, Calgary AB vzubov@ucalgaryca and Michael P Lamoureux, Universiy of Calgary, Calgary

More information

Air Traffic Forecast Empirical Research Based on the MCMC Method

Air Traffic Forecast Empirical Research Based on the MCMC Method Compuer and Informaion Science; Vol. 5, No. 5; 0 ISSN 93-8989 E-ISSN 93-8997 Published by Canadian Cener of Science and Educaion Air Traffic Forecas Empirical Research Based on he MCMC Mehod Jian-bo Wang,

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Waveform Transmission Method, A New Waveform-relaxation Based Algorithm. to Solve Ordinary Differential Equations in Parallel

Waveform Transmission Method, A New Waveform-relaxation Based Algorithm. to Solve Ordinary Differential Equations in Parallel Waveform Transmission Mehod, A New Waveform-relaxaion Based Algorihm o Solve Ordinary Differenial Equaions in Parallel Fei Wei Huazhong Yang Deparmen of Elecronic Engineering, Tsinghua Universiy, Beijing,

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

Robotics I. April 11, The kinematics of a 3R spatial robot is specified by the Denavit-Hartenberg parameters in Tab. 1.

Robotics I. April 11, The kinematics of a 3R spatial robot is specified by the Denavit-Hartenberg parameters in Tab. 1. Roboics I April 11, 017 Exercise 1 he kinemaics of a 3R spaial robo is specified by he Denavi-Harenberg parameers in ab 1 i α i d i a i θ i 1 π/ L 1 0 1 0 0 L 3 0 0 L 3 3 able 1: able of DH parameers of

More information

A Shooting Method for A Node Generation Algorithm

A Shooting Method for A Node Generation Algorithm A Shooing Mehod for A Node Generaion Algorihm Hiroaki Nishikawa W.M.Keck Foundaion Laboraory for Compuaional Fluid Dynamics Deparmen of Aerospace Engineering, Universiy of Michigan, Ann Arbor, Michigan

More information

Ordinary dierential equations

Ordinary dierential equations Chaper 5 Ordinary dierenial equaions Conens 5.1 Iniial value problem........................... 31 5. Forward Euler's mehod......................... 3 5.3 Runge-Kua mehods.......................... 36

More information

Unsteady Flow Problems

Unsteady Flow Problems School of Mechanical Aerospace and Civil Engineering Unseady Flow Problems T. J. Craf George Begg Building, C41 TPFE MSc CFD-1 Reading: J. Ferziger, M. Peric, Compuaional Mehods for Fluid Dynamics H.K.

More information

Christos Papadimitriou & Luca Trevisan November 22, 2016

Christos Papadimitriou & Luca Trevisan November 22, 2016 U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream

More information

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes Represening Periodic Funcions by Fourier Series 3. Inroducion In his Secion we show how a periodic funcion can be expressed as a series of sines and cosines. We begin by obaining some sandard inegrals

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

20. Applications of the Genetic-Drift Model

20. Applications of the Genetic-Drift Model 0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0

More information

On Solving the Perturbed Multi- Revolution Lambert Problem: Applications in Enhanced SSA

On Solving the Perturbed Multi- Revolution Lambert Problem: Applications in Enhanced SSA On Solving he Perurbed Muli- Revoluion Lamber Problem: Applicaions in Enhanced SSA John L. Junkins and Robyn M. Woollands Texas A&M Universiy Presened o Sacie Williams (AFOSR/RT) AFOSR REMOTE SENSING PORTFOLIO

More information

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves Rapid Terminaion Evaluaion for Recursive Subdivision of Bezier Curves Thomas F. Hain School of Compuer and Informaion Sciences, Universiy of Souh Alabama, Mobile, AL, U.S.A. Absrac Bézier curve flaening

More information

Comparison between the Discrete and Continuous Time Models

Comparison between the Discrete and Continuous Time Models Comparison beween e Discree and Coninuous Time Models D. Sulsky June 21, 2012 1 Discree o Coninuous Recall e discree ime model Î = AIS Ŝ = S Î. Tese equaions ell us ow e populaion canges from one day o

More information

Robust estimation based on the first- and third-moment restrictions of the power transformation model

Robust estimation based on the first- and third-moment restrictions of the power transformation model h Inernaional Congress on Modelling and Simulaion, Adelaide, Ausralia, 6 December 3 www.mssanz.org.au/modsim3 Robus esimaion based on he firs- and hird-momen resricions of he power ransformaion Nawaa,

More information

Sequential Importance Resampling (SIR) Particle Filter

Sequential Importance Resampling (SIR) Particle Filter Paricle Filers++ Pieer Abbeel UC Berkeley EECS Many slides adaped from Thrun, Burgard and Fox, Probabilisic Roboics 1. Algorihm paricle_filer( S -1, u, z ): 2. Sequenial Imporance Resampling (SIR) Paricle

More information

Math 333 Problem Set #2 Solution 14 February 2003

Math 333 Problem Set #2 Solution 14 February 2003 Mah 333 Problem Se #2 Soluion 14 February 2003 A1. Solve he iniial value problem dy dx = x2 + e 3x ; 2y 4 y(0) = 1. Soluion: This is separable; we wrie 2y 4 dy = x 2 + e x dx and inegrae o ge The iniial

More information

arxiv:quant-ph/ v1 5 Jul 2004

arxiv:quant-ph/ v1 5 Jul 2004 Numerical Mehods for Sochasic Differenial Equaions Joshua Wilkie Deparmen of Chemisry, Simon Fraser Universiy, Burnaby, Briish Columbia V5A 1S6, Canada Sochasic differenial equaions (sdes) play an imporan

More information

Module 2 F c i k c s la l w a s o s f dif di fusi s o i n

Module 2 F c i k c s la l w a s o s f dif di fusi s o i n Module Fick s laws of diffusion Fick s laws of diffusion and hin film soluion Adolf Fick (1855) proposed: d J α d d d J (mole/m s) flu (m /s) diffusion coefficien and (mole/m 3 ) concenraion of ions, aoms

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

Time Parallel Time Integration

Time Parallel Time Integration Time Inegraion marin.gander@unige.ch Universiy of Geneva SSSTC Symposium, Geneva, Augus 17 Solving Evoluion Problems in Parallel? The Lorenz equaions: d d dy d dz d = σ +σy = z +r y = y bz 4 4 15 1 5 5

More information

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems.

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems. di ernardo, M. (995). A purely adapive conroller o synchronize and conrol chaoic sysems. hps://doi.org/.6/375-96(96)8-x Early version, also known as pre-prin Link o published version (if available):.6/375-96(96)8-x

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Delivering Better Time-of-Day Using Synchronous Ethernet and Yaakov (J) Stein, Alon Geva, Gabriel Zigelboim RAD Data Communications

Delivering Better Time-of-Day Using Synchronous Ethernet and Yaakov (J) Stein, Alon Geva, Gabriel Zigelboim RAD Data Communications Delivering Beer Time-of-Day Using Synchronous Eherne and 1588 Yaakov (J) Sein, Alon Geva, Gabriel Zigelboim RAD Daa Communicaions The problem I wan discuss he use of 1588 (or NTP for ha maer) for ime of

More information

( ) = b n ( t) n " (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2.

( ) = b n ( t) n  (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2. Andrei Tokmakoff, MIT Deparmen of Chemisry, 3/14/007-6.4 PERTURBATION THEORY Given a Hamilonian H = H 0 + V where we know he eigenkes for H 0 : H 0 n = E n n, we can calculae he evoluion of he wavefuncion

More information

Matlab and Python programming: how to get started

Matlab and Python programming: how to get started Malab and Pyhon programming: how o ge sared Equipping readers he skills o wrie programs o explore complex sysems and discover ineresing paerns from big daa is one of he main goals of his book. In his chaper,

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

Accurate Piecewise Linear Continuous Approximations to One-Dimensional Curves: Error Estimates and Algorithms

Accurate Piecewise Linear Continuous Approximations to One-Dimensional Curves: Error Estimates and Algorithms Accurae Piecewise Linear Coninuous Approximaions o One-Dimensional Curves: rror simaes and Algorihms Hiroaki Nishikawa January 998 Absrac Local and global asympoic L error esimaes are derived for piecewise

More information

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion: a Correcion * Tom Coupé Economics Educaion and Research Consorium, Naional Universiy of Kyiv Mohyla Academy Address: Vul Voloska 10, 04070

More information

MATH 128A, SUMMER 2009, FINAL EXAM SOLUTION

MATH 128A, SUMMER 2009, FINAL EXAM SOLUTION MATH 28A, SUMME 2009, FINAL EXAM SOLUTION BENJAMIN JOHNSON () (8 poins) [Lagrange Inerpolaion] (a) (4 poins) Le f be a funcion defined a some real numbers x 0,..., x n. Give a defining equaion for he Lagrange

More information

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis Speaker Adapaion Techniques For Coninuous Speech Using Medium and Small Adapaion Daa Ses Consaninos Boulis Ouline of he Presenaion Inroducion o he speaker adapaion problem Maximum Likelihood Sochasic Transformaions

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

Introduction to Probability and Statistics Slides 4 Chapter 4

Introduction to Probability and Statistics Slides 4 Chapter 4 Inroducion o Probabiliy and Saisics Slides 4 Chaper 4 Ammar M. Sarhan, asarhan@mahsa.dal.ca Deparmen of Mahemaics and Saisics, Dalhousie Universiy Fall Semeser 8 Dr. Ammar Sarhan Chaper 4 Coninuous Random

More information

Sterilization D Values

Sterilization D Values Seriliaion D Values Seriliaion by seam consis of he simple observaion ha baceria die over ime during exposure o hea. They do no all live for a finie period of hea exposure and hen suddenly die a once,

More information

3.1 More on model selection

3.1 More on model selection 3. More on Model selecion 3. Comparing models AIC, BIC, Adjused R squared. 3. Over Fiing problem. 3.3 Sample spliing. 3. More on model selecion crieria Ofen afer model fiing you are lef wih a handful of

More information

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t M ah 5 2 7 Fall 2 0 0 9 L ecure 1 0 O c. 7, 2 0 0 9 Hamilon- J acobi Equaion: Explici Formulas In his lecure we ry o apply he mehod of characerisics o he Hamilon-Jacobi equaion: u + H D u, x = 0 in R n

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

Numerical Dispersion

Numerical Dispersion eview of Linear Numerical Sabiliy Numerical Dispersion n he previous lecure, we considered he linear numerical sabiliy of boh advecion and diffusion erms when approimaed wih several spaial and emporal

More information

Ensamble methods: Bagging and Boosting

Ensamble methods: Bagging and Boosting Lecure 21 Ensamble mehods: Bagging and Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Ensemble mehods Mixure of expers Muliple base models (classifiers, regressors), each covers a differen par

More information

Let us start with a two dimensional case. We consider a vector ( x,

Let us start with a two dimensional case. We consider a vector ( x, Roaion marices We consider now roaion marices in wo and hree dimensions. We sar wih wo dimensions since wo dimensions are easier han hree o undersand, and one dimension is a lile oo simple. However, our

More information

Embedded Systems and Software. A Simple Introduction to Embedded Control Systems (PID Control)

Embedded Systems and Software. A Simple Introduction to Embedded Control Systems (PID Control) Embedded Sysems and Sofware A Simple Inroducion o Embedded Conrol Sysems (PID Conrol) Embedded Sysems and Sofware, ECE:3360. The Universiy of Iowa, 2016 Slide 1 Acknowledgemens The maerial in his lecure

More information

Introduction to Numerical Analysis. In this lesson you will be taken through a pair of techniques that will be used to solve the equations of.

Introduction to Numerical Analysis. In this lesson you will be taken through a pair of techniques that will be used to solve the equations of. Inroducion o Nuerical Analysis oion In his lesson you will be aen hrough a pair of echniques ha will be used o solve he equaions of and v dx d a F d for siuaions in which F is well nown, and he iniial

More information

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,

More information

Chapter 6. Systems of First Order Linear Differential Equations

Chapter 6. Systems of First Order Linear Differential Equations Chaper 6 Sysems of Firs Order Linear Differenial Equaions We will only discuss firs order sysems However higher order sysems may be made ino firs order sysems by a rick shown below We will have a sligh

More information

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates Biol. 356 Lab 8. Moraliy, Recruimen, and Migraion Raes (modified from Cox, 00, General Ecology Lab Manual, McGraw Hill) Las week we esimaed populaion size hrough several mehods. One assumpion of all hese

More information

Advanced Organic Chemistry

Advanced Organic Chemistry Lalic, G. Chem 53A Chemisry 53A Advanced Organic Chemisry Lecure noes 1 Kineics: A racical Approach Simple Kineics Scenarios Fiing Experimenal Daa Using Kineics o Deermine he Mechanism Doughery, D. A.,

More information

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x WEEK-3 Reciaion PHYS 131 Ch. 3: FOC 1, 3, 4, 6, 14. Problems 9, 37, 41 & 71 and Ch. 4: FOC 1, 3, 5, 8. Problems 3, 5 & 16. Feb 8, 018 Ch. 3: FOC 1, 3, 4, 6, 14. 1. (a) The horizonal componen of he projecile

More information

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Version April 30, 2004.Submied o CTU Repors. EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Per Krysl Universiy of California, San Diego La Jolla, California 92093-0085,

More information

4.2 The Fourier Transform

4.2 The Fourier Transform 4.2. THE FOURIER TRANSFORM 57 4.2 The Fourier Transform 4.2.1 Inroducion One way o look a Fourier series is ha i is a ransformaion from he ime domain o he frequency domain. Given a signal f (), finding

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Maintenance Models. Prof. Robert C. Leachman IEOR 130, Methods of Manufacturing Improvement Spring, 2011

Maintenance Models. Prof. Robert C. Leachman IEOR 130, Methods of Manufacturing Improvement Spring, 2011 Mainenance Models Prof Rober C Leachman IEOR 3, Mehods of Manufacuring Improvemen Spring, Inroducion The mainenance of complex equipmen ofen accouns for a large porion of he coss associaed wih ha equipmen

More information

18 Biological models with discrete time

18 Biological models with discrete time 8 Biological models wih discree ime The mos imporan applicaions, however, may be pedagogical. The elegan body of mahemaical heory peraining o linear sysems (Fourier analysis, orhogonal funcions, and so

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities: Mah 4 Eam Review Problems Problem. Calculae he 3rd Taylor polynomial for arcsin a =. Soluion. Le f() = arcsin. For his problem, we use he formula f() + f () + f ()! + f () 3! for he 3rd Taylor polynomial

More information

MATHEMATICAL DESCRIPTION OF THEORETICAL METHODS OF RESERVE ECONOMY OF CONSIGNMENT STORES

MATHEMATICAL DESCRIPTION OF THEORETICAL METHODS OF RESERVE ECONOMY OF CONSIGNMENT STORES MAHEMAICAL DESCIPION OF HEOEICAL MEHODS OF ESEVE ECONOMY OF CONSIGNMEN SOES Péer elek, József Cselényi, György Demeer Universiy of Miskolc, Deparmen of Maerials Handling and Logisics Absrac: Opimizaion

More information

Maple Tools for Differential Equations A. J. Meir

Maple Tools for Differential Equations A. J. Meir Maple Tools for Differenial Equaions A. J. Meir Copyrigh (C) A. J. Meir. All righs reserved. This workshee is for educaional use only. No par of his publicaion may be reproduced or ransmied for profi in

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

Lecture 33: November 29

Lecture 33: November 29 36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure

More information

15. Vector Valued Functions

15. Vector Valued Functions 1. Vecor Valued Funcions Up o his poin, we have presened vecors wih consan componens, for example, 1, and,,4. However, we can allow he componens of a vecor o be funcions of a common variable. For example,

More information

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation: M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno

More information

Predator - Prey Model Trajectories and the nonlinear conservation law

Predator - Prey Model Trajectories and the nonlinear conservation law Predaor - Prey Model Trajecories and he nonlinear conservaion law James K. Peerson Deparmen of Biological Sciences and Deparmen of Mahemaical Sciences Clemson Universiy Ocober 28, 213 Ouline Drawing Trajecories

More information

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Kriging Models Predicing Arazine Concenraions in Surface Waer Draining Agriculural Waersheds Paul L. Mosquin, Jeremy Aldworh, Wenlin Chen Supplemenal Maerial Number

More information

KEY. Math 334 Midterm I Fall 2008 sections 001 and 003 Instructor: Scott Glasgow

KEY. Math 334 Midterm I Fall 2008 sections 001 and 003 Instructor: Scott Glasgow 1 KEY Mah 4 Miderm I Fall 8 secions 1 and Insrucor: Sco Glasgow Please do NOT wrie on his eam. No credi will be given for such work. Raher wrie in a blue book, or on our own paper, preferabl engineering

More information

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion

More information

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

Inventory Control of Perishable Items in a Two-Echelon Supply Chain Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan

More information

Theory of! Partial Differential Equations!

Theory of! Partial Differential Equations! hp://www.nd.edu/~gryggva/cfd-course/! Ouline! Theory o! Parial Dierenial Equaions! Gréar Tryggvason! Spring 011! Basic Properies o PDE!! Quasi-linear Firs Order Equaions! - Characerisics! - Linear and

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

Physics 127b: Statistical Mechanics. Fokker-Planck Equation. Time Evolution

Physics 127b: Statistical Mechanics. Fokker-Planck Equation. Time Evolution Physics 7b: Saisical Mechanics Fokker-Planck Equaion The Langevin equaion approach o he evoluion of he velociy disribuion for he Brownian paricle migh leave you uncomforable. A more formal reamen of his

More information

m = 41 members n = 27 (nonfounders), f = 14 (founders) 8 markers from chromosome 19

m = 41 members n = 27 (nonfounders), f = 14 (founders) 8 markers from chromosome 19 Sequenial Imporance Sampling (SIS) AKA Paricle Filering, Sequenial Impuaion (Kong, Liu, Wong, 994) For many problems, sampling direcly from he arge disribuion is difficul or impossible. One reason possible

More information

A Note on the Equivalence of Fractional Relaxation Equations to Differential Equations with Varying Coefficients

A Note on the Equivalence of Fractional Relaxation Equations to Differential Equations with Varying Coefficients mahemaics Aricle A Noe on he Equivalence of Fracional Relaxaion Equaions o Differenial Equaions wih Varying Coefficiens Francesco Mainardi Deparmen of Physics and Asronomy, Universiy of Bologna, and he

More information

Class Meeting # 10: Introduction to the Wave Equation

Class Meeting # 10: Introduction to the Wave Equation MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion

More information

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits DOI: 0.545/mjis.07.5009 Exponenial Weighed Moving Average (EWMA) Char Under The Assumpion of Moderaeness And Is 3 Conrol Limis KALPESH S TAILOR Assisan Professor, Deparmen of Saisics, M. K. Bhavnagar Universiy,

More information

UCLA Math 135, Winter 2015 Ordinary Differential Equations Review of First-Order Equations from Math 33b

UCLA Math 135, Winter 2015 Ordinary Differential Equations Review of First-Order Equations from Math 33b UCLA Mah 35, Winer 05 Ordinary Differenial Equaions Review of Firs-Order Equaions from Mah 33b 7. Numerical Mehods C. David Levermore Deparmen of Mahemaics Universiy of Maryland January 9, 05 7. Numerical

More information

Particle Swarm Optimization

Particle Swarm Optimization Paricle Swarm Opimizaion Speaker: Jeng-Shyang Pan Deparmen of Elecronic Engineering, Kaohsiung Universiy of Applied Science, Taiwan Email: jspan@cc.kuas.edu.w 7/26/2004 ppso 1 Wha is he Paricle Swarm Opimizaion

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Failure of the work-hamiltonian connection for free energy calculations. Abstract

Failure of the work-hamiltonian connection for free energy calculations. Abstract Failure of he work-hamilonian connecion for free energy calculaions Jose M. G. Vilar 1 and J. Miguel Rubi 1 Compuaional Biology Program, Memorial Sloan-Keering Cancer Cener, 175 York Avenue, New York,

More information

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients Secion 3.5 Nonhomogeneous Equaions; Mehod of Undeermined Coefficiens Key Terms/Ideas: Linear Differenial operaor Nonlinear operaor Second order homogeneous DE Second order nonhomogeneous DE Soluion o homogeneous

More information

Lecture 9: September 25

Lecture 9: September 25 0-725: Opimizaion Fall 202 Lecure 9: Sepember 25 Lecurer: Geoff Gordon/Ryan Tibshirani Scribes: Xuezhi Wang, Subhodeep Moira, Abhimanu Kumar Noe: LaTeX emplae couresy of UC Berkeley EECS dep. Disclaimer:

More information

SPH3U: Projectiles. Recorder: Manager: Speaker:

SPH3U: Projectiles. Recorder: Manager: Speaker: SPH3U: Projeciles Now i s ime o use our new skills o analyze he moion of a golf ball ha was ossed hrough he air. Le s find ou wha is special abou he moion of a projecile. Recorder: Manager: Speaker: 0

More information

Echocardiography Project and Finite Fourier Series

Echocardiography Project and Finite Fourier Series Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information