Statistical Methods for Bridging Experimental Data and Dynamic Models with Biomedical Applications

Size: px
Start display at page:

Download "Statistical Methods for Bridging Experimental Data and Dynamic Models with Biomedical Applications"

Transcription

1 Statistical Methods for Bridging Experimental Data and Dynamic Models with Biomedical Applications Hulin Wu, Ph.D. Dr. D.R. Seth Family Professor & Associate Chair Department of Biostatistics, School of Public Health Professor, School of Biomedical Informatics University of Texas Health Science Center at Houston Pittsburgh, March, 2017 Hulin Wu UTSPH March / 52

2 Outline 1 Introduction 2 Statistical estimation and inference methods for dynamic ODE models Naive Method: LS or MLE principle Local solution and time-varying parameter problems Smoothing-based methods Sparse longitudinal data: mixed-effects ODE models Bayesian methods High-dimensional ODE models: ODE model selection 3 Other dynamic models 4 Ongoing and future Work 5 Conclusions Hulin Wu UTSPH March / 52

3 Statistical Modeling Cultures Leo Breiman (Statistical Science, 2001): Two cultures Data modeling (98% statisticians): What the data look like? e.g., regression models Algorithmic modeling (2% statisticians): No models and for prediction purpose, e.g., neural nets and decision trees A third culture: Mechanistic modeling (<1% statisticians): Build mathematical models based on the mechanisms behind the data How are the data generated? Goal: Understand physics principles or biological mechanisms Hulin Wu UTSPH March / 52

4 Dynamic Systems/Models Many engineering and biological systems can be described by dynamic models: Differential equations: Ordinary differential equations (ODE) simplest Delay differential equations (DDE) Hybrid differential equations (HDE) Partial differential equations (PDE) Stochastic differential equations (SDE) Difference equations and state-space models Stochastic processes models: branching process etc. Agent-based models and cellular automata... Hulin Wu UTSPH March / 52

5 Modeling Goals Forward Problems: θ P θ Easier to do Predictions Simulations Inverse Problems: Y θ Θ More challenging Determine model structures/forms Estimate unknown parameters: θ Hulin Wu UTSPH March / 52

6 A Dynamic System: ODE Model where d dt X(t) = G[X(t), θ], X(0) = X 0 (1) Y (t i ) = H[X(t i ), β] + e(t i ), (2) e(t i ) (0, σ 2 I), i = 1,..., n G( ): linear or nonlinear functions H( ): observation functions (θ, β): unknown parameters e(t i ): measurement error The NLS method: n min {Y (t i ) H[X(t i, θ), β]} T {Y (t i ) H[X(t i, θ), β]}, θ,β,x 0 i=1 where X(t i ) evaluated numerically from Eq (1). Hulin Wu UTSPH March / 52

7 Naive NLS Method: Challenging Problems 1 Identifiability problem 2 Local solutions 3 Time-varying parameters 4 Need to solve the forward problem numerically and many times: Numerical error vs. measurement error 5 Slow convergence and high computational cost 6 Sparse longitudinal data problem 7 Nonlinear optimization 8 High-dimensional parameter space Motivate new statistical methods for dynamic models Hulin Wu UTSPH March / 52

8 Identifiability issues Theoretical identifiability: Mathematical identifiability Practical identifiability: Statistical and numerical identifiability Need to be investigated before the inverse problem How to deal with unidentifiable models? Simplify or revise the model Lump some parameters together Fixed some parameters Bayesian approach: Use priors Hulin Wu UTSPH March / 52

9 Identifiability issues: References Wu, H., Zhu, H., Miao, H., and Perelson, A.S. (2008), Parameter Identifiability and Estimation of HIV/AIDS Dynamic Models, Bulletin of Mathematical Biology, 70(3), Miao, H., Dykes, C., Demeter, L.M., Cavenaugh, J., Park, S.Y., Perelson, A.S., and Wu, H. (2008), Modeling and Estimation of Kinetic Parameters and Replicative Fitness of HIV-1 from Flow-Cytometry-Based Growth Competition Experiments, Bulletin of Mathematical Biology, 70, Miao, H., Dykes, C., Demeter, L., Wu, H. (2009), Differential Equation Modeling of HIV Viral Fitness Experiments: Model Identification, Model Selection, and Multi-Model Inference, Biometrics, 65, Liang, H., Miao, H., and Wu, H. (2010), Estimation of constant and time-varying dynamic parameters of HIV infection in a nonlinear differential equation model, Annals of Applied Statistics, 4, Miao, H., Xia, X., Perelson, A.S., Wu, H. (2011), On Identifiability of Nonlinear ODE Models and Applications in Viral Dynamics, SIAM Review, 53(1): Hulin Wu UTSPH March / 52

10 Naive NLS Method: Local solution and numerical error problems Local solution problem: Global optimization methods: Differential evolution algorithms and genetic algorithms (Storn et al 1997). Mixture of stochastic global optimization method and deterministic methods: scatter search method (Rodriguez-Fernandez et al. 2006) Numerical error problem: Xue, Miao and Wu (Annals of Statistics, 2010): theoretical results on numerical error vs. measurement error Hulin Wu UTSPH March / 52

11 Naive NLS Method: Time-varying parameter problem Xue, Miao and Wu, Annals of Statistics (2010) dx(t) dt = F {t, X(t), θ, η(t)} The spline approach can be used to approximate the time-varying parameter: η(t) = π(t) T α, where π(t) = (B 1 (t),, B N (t)) T is a vector of basis functions. The time-varying coefficient ODE model becomes an ODE model with constant parameters: dx(t) dt = F {t, X(t), θ, π(t) T α} Hulin Wu UTSPH March / 52

12 Smoothing-Based Approaches: ODE Computational Problem Earlier ideas: Hemker (1972) and Varah (1982) Two-stage decoupling approaches: Chen and Wu (JASA 2008, Statistica Sinica 2008) and Liang and Wu (JASA, 2008) Parameter cascading method: Ramsay et al. JRSS-B (2007) and Wang et al. Stat Comput Hulin Wu UTSPH March / 52

13 Smoothing-Based Approaches: Two-Stage Method Chen and Wu (JASA 2008, Statistica Sinica 2008) and Liang and Wu (JASA, 2008): X (t i ) = F [X(t i ), θ] (3) Y (t i ) = X(t i ) + e 1 (t i ), e 1 (t i ) (0, σ 2 I), (4) Step 1: Use a nonparametric smoothing to estimate X(t) and X (t) from model (4). Step 2: Substitute the estimate ˆX(t i ) into model (3) to obtain: ˆX (t i ) = F [ ˆX(t i ), θ] + e 2 (t i ). (5) Then fit the above regression model (5) to estimate θ. F ( ): Linear or nonlinear function Hulin Wu UTSPH March / 52

14 Smoothing-Based Approaches: Two-Step Methods Step 2 decoupled the system of ODEs: Fit the ODE one-by-one Convert ODE models to regression: Standard regression software tools can be used Avoid numerically solving the ODEs Computationally fast and efficient: Easy to deal with high-dimensional ODEs Price to pay: The derivative estimate may not be accurate The decoupled system: Some information lost The coupled" property: destroyed Extension to higher-order numerical discretization-based algorithms: Wu, Xue and Kuman (Biometrics 2012) Hulin Wu UTSPH March / 52

15 Parameter Cascading or Profiling Method Ramsay, Hooker, Campbell, Cao, JRSS-B, 2007 Fitting to data Observations: y(t i ) Nonparametric function: f(t) = φ(t) c Fitting to data: C 1 = n i=1 [y(t i) f(t i )] 2 Fidelity to DE x (t) = g(x β) f (t) = φ (t)c Difference between two sides of DE: Lf(t) = f (t) g(f(t) β) Fidelity to DE: C 2 = [Lf(t)] 2 dt Criterion to estimate c: J(c β) = C 1 + λc 2 Criterion to estimate β: H(β) = n i=1 [y(t i) φ(t i ) ĉ(β)] 2 Hulin Wu UTSPH March / 52

16 Numerical Comparisons: NLS, Profiling and Two-Stage Estimates Ding and Wu, Statistica Sinica, 2014 NLS: Not stable to get the global solution, computationally expensive Profiling: A 3-step iterative algorithm More stable than NLS to get a better solution Computational efficiency: similar to NLS Two-Stage Method: Computationally fast, but not accurate. Hulin Wu UTSPH March / 52

17 Sparse Longitudinal Data Problem: Mixed-Effects Modeling Approaches Deal with sparse data: Borrow information across subjects The MLE principle: Nonlinear Mixed-Effects Modeling (NLME) Treat the ODE solution as a nonlinear regression function Computational challenge: Stochastic Approximation EM (SAEM) Two-stage smoothing-based mixed-effects modeling approaches Fang, Wu and Zhu, Statistica Sinica (2011) Linear ODE: Linear mixed-effects model (LME) Nonlinear ODE: NLME Bayes methods A three-stage hierarchical model: implemented by MCMC Computation: expensive Hulin Wu UTSPH March / 52

18 Mixed-Effects ODE Model: NLME Within-subject variation: d dt X(t) = G[X(t), θ i], X(0) = X i0 (6) Y i (t i ) = H i [X i (t i ), θ i ] + e i (t i ), i = 1,..., n Xi(t i): ODE solution for Subject i. Y i = (y i1(t 1),, y imi (t mi )) T : Data from Subject i ei = (e i(t 1),, e i(t mi )) T N (0, σ 2 I mi ): Measurement error Between-subject variation: θ i = µ + b i, [b i Σ] N (0, Σ) µ: population parameter bi: random effects Estimation and inference: Stochastic Approximation EM (SAEM) Delyon, Lavielle and Moulines (1999), Kuhn and Lavielle (2005) Grenier, Louvet, Vigneaux (2014) Hulin Wu UTSPH March / 52

19 Smoothing-based Two-Stage Mixed-Effects Model Fang, Wu and Zhu, Statistica Sinica (2011): X (t i ) = F [X(t i ), θ] (7) Y (t i ) = X(t i ) + e 1 (t i ), e 1 (t i ) (0, σ 2 I), (8) Step 1: Use a nonparametric smoothing to estimate X(t) and X (t) from model (8). Step 2: Substitute the estimate ˆX(t i ) into model (7) to obtain: ˆX (t i ) = F [ ˆX(t i ), θ] + e 2 (t i ). (9) Convert the model (9) into a LME or NLME if F (x) is linear or nonlinear. Fit the LME or NLME using a standard approach or SAEM method Hulin Wu UTSPH March / 52

20 Bayesian Methods: Borrow Information to Deal with Sparse Data and Identifiability Problems Huang, Liu and Wu, Biometrics (2006): Example A viral dynamic model: describe the population dynamics of HIV and its target cells in plasma d dt T = λ ρt [1 γ(t)]kt V d dt T = [1 γ(t)]kt V δt d dt V = NδT cv (10) T, T, V : target uninfected cells, infected cells, virus γ(t): time-varying antiviral drug efficacy (λ, ρ, k, δ, N, c): unknown parameters to be estimated The equations (10): no closed-form solution Hulin Wu UTSPH March / 52

21 Antiviral Drug Efficacy Model A modified E max (M-M) model for drug efficacy: γ(t) = C(t)A(t) φic 50 (t) + C(t)A(t) = IQ(t)A(t) φ + IQ(t)A(t), 0 γ(t) 1 C(t): the plasma drug concentration A(t): drug adherence measurements IC50: in vitro phenotype drug resistance marker φ: a conversion factor parameter IQ = C(t) IC 50 : the Inhibitory Quotient (IQ) (t) If γ(t) = 1, the drug: 100% effective If γ(t) = 0, the drug: no effect (11) Hulin Wu UTSPH March / 52

22 Drug Susceptibility Model Phenotype marker IC 50 is used to quantify agent-specific drug sensitivity The function: to describe changes overtime in IC 50 IC 50 (t) = { I0 + Ir I0 t r t for 0 < t < t r, I r for t t r, (12) I0 and I r: respective values of IC 50(t) at baseline and time point t r at which drug resistant mutations appear If Ir = I 0, no resistance mutation developed during treatment Hulin Wu UTSPH March / 52

23 A Challenging Problem How to estimate the unknown parameters in the complex dynamic model? Difficulties: Identifiability problem: Too many parameters, (φ, λ, ρ, k, δ, N, C), some of them are not identifiable Data from individuals: sparse, only V (t) measured Nonlinear differential equations model: no closed-form solutions Hulin Wu UTSPH March / 52

24 Viral load data from a clinical trial Real data up to day 112 Time (days) log10(rna) copies/ml log10(50) Hulin Wu UTSPH March / 52

25 Bayesian Modeling A three-stage Bayesian hierarchical model Stage 1. Within-subject variation: y i = f i(θ i) + e i, [e i σ 2, θ i] N (0, σ 2 I mi ) fi(θ i) = (f i1(θ i, t 1),, f imi (θ i, t mi )) T : ODE solutions. yi = (y i1(t 1),, y imi (t mi )) T : Data from Subject i ei = (e i(t 1),, e i(t mi )) T : Measurement error Stage 2. Between-subject variation: θ i = µ + b i, [b i Σ] N (0, Σ) Stage 3. Hyperprior distributions: σ 2 Ga(a, b), µ N (η, Λ), Σ 1 Wi(Ω, ν) Gamma (Ga), Normal (N ) and Wishart (Wi): independent distributions Hyper-parameters a, b, η, Λ, Ω and ν: known Hulin Wu UTSPH March / 52

26 Bayesian Estimation: Implementation Choose prior distributions Informative prior and non-informative prior Rule of thumb: choose non-informative prior distributions for parameters of interest Implement MCMC algorithm Gibbs sampling step: closed form of conditional distributions for σ 2, µ, Σ 1 Metropolis-Hastings step: no closed form of conditional distributions for θ i Run a long chain: the number of iterations, initial burn-in", every fifth simulation samples Obtain posterior distributions (posterior means or credible intervals) based on the final MCMC samples Hulin Wu UTSPH March / 52

27 A Clinical Study: A5055 A study of HIV-1 infected patients failing PI-containing therapies. Two salvage regimens: 44 patients Arm A: IDV 800 mg q12h+rtv 200mg q12h+two NRTIs Arm B: IDV 400 mg q12h+rtv 400mg q12h+two NRTIs Plasma HIV-1 RNA (viral load) measured at days 0, 7, 14, 28, 56, 84, 112, 140 and 168 of follow-up Hulin Wu UTSPH March / 52

28 Clinical Data Results of Population Parameters Parameter PM SD 95% CI φ (1.2143, ) c (2.7139, ) δ (0.3387, ) λ (91.497, ) ρ (0.0905, ) N ( , ) k ( , ) Posterior mean for the population parameter φ is with a SD of and the 95% CI of (1.2143, ) As φ plays a role of transforming the in vitro IC 50 into in vivo IC 50, our estimate shows that there is about 2-fold difference between in vitro IC 50 and in vivo IC 50 Hulin Wu UTSPH March / 52

29 Clinical Data Results of Individual Parameters Patient φ i c i δ i λ i ρ i N i k i e The individual-specific parameter estimates suggest a large inter-subject variation The model provides a good fit to the clinical data Hulin Wu UTSPH March / 52

30 Patient 1 Fitted individual curves, drug efficacy, IC50 and adherence with IQ=c12h/IC50 Patid= 1 Patid= 1 IC IDV RTV Adherence IDV RTV Time (day) Time (day) Patid= 1 Patid= 1 Drug efficacy ec log10(rna) o o o o o o o o Time (day) Time (day) Hulin Wu UTSPH March / 52

31 Patient 2 Patid= 2 Patid= 2 IC IDV RTV Adherence IDV RTV Time (day) Time (day) Patid= 2 Patid= 2 Drug efficacy ec log10(rna) o o o o o o o o Time (day) Time (day) Hulin Wu UTSPH March / 52

32 Patient 3 Patid= 3 Patid= 3 IC IDV RTV Adherence IDV RTV Time (day) Time (day) Patid= 3 Patid= 3 Drug efficacy ec log10(rna) o o o o o o o o o Time (day) Time (day) Hulin Wu UTSPH March / 52

33 Bayesian Methods: Pros & Cons Pros Use prior to solve the identifiability problem Deal with extremely complicated models such as nonlinear differential equation models Borrow information across subjects: Deal with sparse longitudinal data Estimate parameters for both population and individuals Always get reasonable estimates Use posterior distributions: Easy to quantify uncertainty" for inference Cons Computation: complex and expensive Prior: dominate the results Hulin Wu UTSPH March / 52

34 High-Dimensional ODEs Require computationally fast and efficient methods Need to incorporate variable selection approaches: LASSO, SCAD etc. Easy to deal with longitudinal data: Mixed-effects models Two-stage smoothing-based method: good for this purpose Hulin Wu UTSPH March / 52

35 Linear ODEs Time course gene expression data: Dynamic gene regulatory network (GRN) reconstruction (Lu, Liang, Li and Wu, JASA 2011) dx i dt = n j=1 θ ij x j, i = 1,, n, (13) When n is small, standard statistical inference and variable selection methods can be used When n is large, curse-of-dimensionality Hulin Wu UTSPH March / 52

36 High-Dimensional Linear ODE: Identifying Significant Regulations Two-Stage Method (Chen and Wu 2008a, 2008b; Liang and Wu 2008): Obtain mean expression curves and their derivatives ˆM k (t) and ˆM k (t) from Step II. Substitute ˆM k (t) and ˆM k (t) into the ODE model to form a regression model High Dimensional Linear Regression Model y k (t) = p j=1 β kjx j (t) + ε k (t), k = 1,, p; t = t 1, t 2,..., t N y k (t) = ˆM k (t) and x j(t) = ˆM j (t) Hulin Wu UTSPH March / 52

37 High Dimensional Model Selection Two-stage method Decouple the high-dimensional ODEs Convert the ODE model into a simple linear model Computationally fast Stepwise selection and subset selection Bridge selection (Frank and Friedman 1993) Least absolute shrinkage and selection operator (LASSO) (Tibshirani 1996) Smoothly Clipped Absolute Deviation (SCAD)(Fan and Li 2001; Kim, Choi and Oh 2008) Hulin Wu UTSPH March / 52

38 Estimation Refinement: Stochastic Approximation EM (SAEM) Algorithm Mixed-Effects ODE Model for Module k dx ki dt m k = β kij M [kj] (t), i = 1,, n k ; k = 1,..., p, (14) j=1 Longitudinal Measurement Model y ki (t) = x ki (t) + ε ki (t) (15) Random Effects Model β ki = β k + b ki (16) b ki N (0, D k ) Hulin Wu UTSPH March / 52

39 Application: Identification of Dynamic GRN for Yeast Cell Cycle DNA microarrays experiment: 18 equally spaced time points during two cell cycles (Spellman 1998) Step I: 800 significant genes identified Step II: Cluster 800 genes into 41 functional modules Step III: Smoothing Step IV: Linear ODE model identification: SCAD variable selection Step V: Estimation Refinement Step VI: Function Enrichment Analysis Hulin Wu UTSPH March / 52

40 Yeast Cell Cycle Gene Expression Profile Module Module Module Module Module Module Module Module 8* Module 9* Module 10 Module 11 Module Module 13 Module 14* Module 15* Module Hulin Wu UTSPH March / 52

41 Yeast Cell Cycle Gene Expression Profile Module 17 Module 18* Module 19 Module 20* Module 21 Module 22 Module 23 Module Module 25* Module 26 Module 27 Module Module Module 30* Module Module 32* Hulin Wu UTSPH March / 52

42 Yeast Cell Cycle Gene Expression Profile Module Module Module Module 36 Module 37 Module 38* Module 39 Module Module 41* 2 0 Hulin Wu UTSPH March / 52

43 Graph of Yeast Cell Cycle GRN Hulin Wu UTSPH March / 52

44 High-Dimensional Nonlinear/Nonparametric ODEs Generalized ODEs: Miao, Wu and Xue, Journal of the American Statistical Association (2014) Sparse additive ODEs: Wu, Lu, Xue and Liang, Journal of the American Statistical Association (2014) Additive nonlinear ODEs: Xue, Wu, Wu and Wu, a manuscript (2017) Hulin Wu UTSPH March / 52

45 Other Dynamic Models: State-Space Models (SSM) Linear SSM: where X t+1 = F t X t + V t, V t (0, Q t ) (17) Y t = G t X t + W t, W t (0, R t ) (18) V t and W t : independent model noise and measurement noise Standard Kalman filter (Kalman, 1960): the core algorithm for prediction and smoothing of state state vectors Hulin Wu UTSPH March / 52

46 Statistical Methods for State-Space Models Zhu and Wu, JCGS (2007) Liu, Lu, Niu and Wu, Biometrics (2011) Liu, Wu, Zhu, Miao, BMC Bioinformatics (2014) Chen et al. PlusOne (2017), submitted Hulin Wu UTSPH March / 52

47 Extension to SDE and PDE: Possible but Challenging Theoretically difficult Computationally challenging Applications: Not common Hulin Wu UTSPH March / 52

48 Ongoing and Future Research High-dimensional ODEs: How to improve accuracy without sacrificing too much on computing? Extra-high dimensional ODE: 1000 ODEs with 1 million parameters (Wu, Qiu, Yuan and Wu, 2017, submitted). Characteristic analyses of large ODE systems: Controllability and stability analysis with uncertainty in parameter estimation Sun, Hu, Wu, Qiu, Linel, Wu, Infectious Disease Modelling 2016 AI-driven ODE Model Builder Hulin Wu UTSPH March / 52

49 Conclusions Dynamic Models: Practically useful for both understanding associations and predictions Both theoretically and computationally challenging Statistical methods for dynamic models: More work needed Hulin Wu UTSPH March / 52

50 Dr. Hulin Wu s Publications on ODE Models by Topics ODE identifiability 1. Wu, H.*, Zhu, H.+, Miao, H.+, and Perelson, A.S. (2008), Parameter Identifiability and Estimation of HIV/AIDS Dynamic Models, Bulletin of Mathematical Biology, 70(3), Miao, H.+, Dykes, C., Demeter, L., Wu, H.* (2009), Differential Equation Modeling of HIV Viral Fitness Experiments: Model Identification, Model Selection, and Multi-Model Inference, Biometrics, 65, Liang, H., Miao, H., and Wu, H.* (2010), Estimation of constant and time-varying dynamic parameters of HIV infection in a nonlinear differential equation model, Annals of Applied Statistics, 4, Miao, H., Xia, X., Perelson, A.S., Wu, H.* (2011), On Identifiability of Nonlinear ODE Models and Applications in Viral Dynamics, SIAM Review, 53(1): Lee, Y.+ and Wu, H.* (2012), MARS Approach for Global Sensitivity Analysis of Differential Equation Models with Applications to Dynamics of Influenza Infection, Bulletin of Mathematical Biology, 74, Wu, H.*, Miao, H., Xue, H., Topham, D.J., Zand, M. (2015), Quantifying Immune Response to Influenza Virus Infection via Multivariate Nonlinear ODE Models with Partially Observed State Variables and Time-Varying Parameters, Statistics in Biosciences, 7(1): NLS Estimation of ODE parameters 1. Wu, H.*, Huang, Y.+, Dykes, C., Liu, D., Ma, J., Perelson, A.S., Demeter, L. (2006), Modeling and Estimation of Replication Fitness of HIV-1 in Vitro Experiments Using a Growth Competition Assay, Journal of Virology, 80, Xue, H.+, Miao, H., Wu, H.* (2010), Sieve Estimation of Constant and Time-Varying Coefficients in Nonlinear Ordinary Differential Equation Models by Considering Both Numerical Error and Measurement Error, Annals of Statistics, 38(4), Two-stage methods for ODE models 1. Liang, H., Wu, H.*, (2008), Parameter Estimation for Differential Equation Models Using a Framework of Measurement Error in Regression Models, Journal of the American Statistical Association, 103, Chen, J.+ and Wu, H.* (2008), Efficient Local Estimation for Time-varying Coefficients in Deterministic Dynamic Models with Applications to HIV-1 Dynamics, Journal of the American Statistical Association, 103, Fang, Y.+, Wu, H.*, Zhu, L. (2011), A Two-Stage Estimation Method for Random Coefficient Differential Equation Models with Application to Longitudinal HIV Dynamic Data, Statistica Sinica, 21, Lu, T.+, Liang, H., Li, H., Wu, H.* (2011), High Dimensional ODEs Coupled with Mixed- Effects Modeling Techniques for Dynamic Gene Regulatory Network Identification, Journal of the American Statistical Association, 106, Wu, H.*, Xue, H., Kumar A.+ (2012), Numerical Discretization-Based Estimation Methods for Ordinary Differential Equation Models via Penalized Spline Smoothing with Applications in Biomedical Research, Biometrics, 68(2), Ding, A.A. and Wu, H.* (2014), Estimation of ODE Parameters Using Constrained Local Polynomial Regression, Statistica Sinica, 24, Wu, H.*, Miao, H., Xue, H., Topham, D.J., Zand, M. (2015), Quantifying Immune Response to Influenza Virus Infection via Multivariate Nonlinear ODE Models with Partially Observed State Variables and Time-Varying Parameters, Statistics in Biosciences, 7(1):

51 Time-varying parameter estimation in ODE Models 1. Chen, J.+ and Wu, H.* (2008), Efficient Local Estimation for Time-varying Coefficients in Deterministic Dynamic Models with Applications to HIV-1 Dynamics, Journal of the American Statistical Association, 103, Chen, J.+ and Wu, H.* (2008), Estimation of time-varying parameters in deterministic dynamic models, Statistica Sinica, 18, Liang, H., Miao, H., and Wu, H.* (2010), Estimation of constant and time-varying dynamic parameters of HIV infection in a nonlinear differential equation model, Annals of Applied Statistics, 4, Xue, H.+, Miao, H., Wu, H.* (2010), Sieve Estimation of Constant and Time-Varying Coefficients in Nonlinear Ordinary Differential Equation Models by Considering Both Numerical Error and Measurement Error, Annals of Statistics, 38(4), Cao, J., Huang, J.Z., Wu, H. (2012), Penalized Nonlinear Least Squares Estimation of Time-Varying Parameters in Ordinary Differential Equations, Journal of Computational and Graphical Statistics (JCGS), 21(1), Bayesian and mixed-effects ODE modeling approaches for longitudinal data 1. Wu, H.*, Ding, A. and DeGruttola. V. (1998), Estimation of HIV Dynamic Parameters," Statistics in Medicine, 17, Wu, H.* and Ding, A. (1999), Population HIV-1 Dynamics in Vivo: Applicable Models and Inferential Tools for Virological Data from AIDS Clinical Trials," Biometrics, 55, Huang, Y.+ and Wu, H.* (2006), A Bayesian Approach for Estimating Antiviral Efficacy in HIV Dynamic Models, Journal of Applied Statistics, 33, Huang, Y., Liu, D.+ and Wu, H.* (2006), Hierarchical Bayesian Methods for Estimation of Parameters in a Longitudinal HIV Dynamic System, Biometrics, 62, Fang, Y.+, Wu, H.*, Zhu, L. (2011), A Two-Stage Estimation Method for Random Coefficient Differential Equation Models with Application to Longitudinal HIV Dynamic Data, Statistica Sinica, 21, High-dimensional ODE models and model selections 1. Lu, T.+, Liang, H., Li, H., Wu, H.* (2011), High Dimensional ODEs Coupled with Mixed- Effects Modeling Techniques for Dynamic Gene Regulatory Network Identification, Journal of the American Statistical Association, 106, Wu, H.*, Lu, T.+, Xue, H., and Liang, H. (2014), Sparse Additive ODEs for Dynamic Gene Regulatory Network Modeling, Journal of the American Statistical Association, 109:506, Nonlinear/nonparametric ODE models 1. Wu, H.*, Lu, T.+, Xue, H., and Liang, H. (2014), Sparse Additive ODEs for Dynamic Gene Regulatory Network Modeling, Journal of the American Statistical Association, 109:506, Miao, H., Wu, H., and Xue, H. (2014), Generalized Ordinary Differential Equation Models, Journal of the American Statistical Association, 109:508, Statistical methods for state-space models 1. Zhu, H.+ and Wu, H.* (2007), Estimation of Smoothing Time-Varying Parameters in State Space Models, Journal of Computational and Graphical Statistics (JCGS), 16(4), Liu, D.+, Lu, T.+, Niu, X.F., and Wu, H.* (2011), Mixed-Effects State Space Models for Analysis of Longitudinal Dynamic Systems, Biometrics, 67,

52 ODE experimental design 1. Wu, H.* and Ding, A.A. (2002), Design of Viral Dynamic Studies for Efficiently Assessing Anti-HIV Therapies in AIDS Clinical Trials," Biometrical Journal, 2, Huang, Y. and Wu, H.* (2008), Bayesian Experimental Design for Long-Term Longitudinal HIV Dynamic Studies, Journal of Statistical Planning and Inference, 138, Miao, H., Xia, X., Perelson, A.S., Wu, H.* (2011), On Identifiability of Nonlinear ODE Models and Applications in Viral Dynamics, SIAM Review, 53(1): Dynamic model property analysis with uncertainty 1. Sun, X.+, Hu, F.+, Wu, S., Qiu, X., Linel, P.+, Wu, H.* (2016), Controllability and Stability Analysis of Large Transcriptomic Dynamic Systems for Host Response to Influenza Infection in Human, Infectious Disease Modelling, 1(1),

53 Our recent work in nonlinear/high-dimensional ODE models Lu, T., Liang, H., Li, H., Wu, H. (2011), High Dimensional ODEs Coupled with Mixed-Effects Modeling Techniques for Dynamic Gene Regulatory Network Identification, JASA, 106, Wu, H., Xue, H., Kumar A. (2012), Numerical Discretization-Based Estimation Methods for Ordinary Differential Equation Models via Penalized Spline Smoothing with Applications in Biomedical Research, Biometrics, 68(2), Miao, H., Wu, H., and Xue, H. (2014), Generalized Ordinary Differential Equation Models, JASA, 109:508, Wu, H., Lu, T., Xue, H., and Liang, H. (2014), Sparse Additive ODEs for Dynamic Gene Regulatory Network Modeling, JASA, 109:506, Wu, S., Liu, Z.P., Qiu, X., and Wu, H. (2014), Modeling genome-wide dynamic regulatory network in mouse lungs with influenza infection using high-dimensional ordinary differential equations, PLOS ONE, 9(5):e Linel, P., Wu, S., Deng, N., Wu, H. (2014), Dynamic transcriptional signatures and network responses for clinical symptoms in influenza-infected human subjects using systems biology approaches, Journal of PK/PD, 41, Qiu, X. et al. (2015), Diversity in Compartmental Dynamics of Gene Regulatory Networks: The Immune Response in Primary Influenza A Infection in Mice, PLoS ONE, 10(9). Hulin Wu UTSPH March / 52

54 Acknowledgement More than 30 postdocs, students and collaborators Hulin Wu UTSPH March / 52

55 Thank You! Hulin Wu UTSPH March / 52

Mixed-Effects Biological Models: Estimation and Inference

Mixed-Effects Biological Models: Estimation and Inference Mixed-Effects Biological Models: Estimation and Inference Hulin Wu, Ph.D. Dr. D.R. Seth Family Professor & Associate Chair Department of Biostatistics School of Public Health University of Texas Health

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

Stochastic approximation EM algorithm in nonlinear mixed effects model for viral load decrease during anti-hiv treatment

Stochastic approximation EM algorithm in nonlinear mixed effects model for viral load decrease during anti-hiv treatment Stochastic approximation EM algorithm in nonlinear mixed effects model for viral load decrease during anti-hiv treatment Adeline Samson 1, Marc Lavielle and France Mentré 1 1 INSERM E0357, Department of

More information

1Non Linear mixed effects ordinary differential equations models. M. Prague - SISTM - NLME-ODE September 27,

1Non Linear mixed effects ordinary differential equations models. M. Prague - SISTM - NLME-ODE September 27, GDR MaMoVi 2017 Parameter estimation in Models with Random effects based on Ordinary Differential Equations: A bayesian maximum a posteriori approach. Mélanie PRAGUE, Daniel COMMENGES & Rodolphe THIÉBAUT

More information

Analysis Methods for Supersaturated Design: Some Comparisons

Analysis Methods for Supersaturated Design: Some Comparisons Journal of Data Science 1(2003), 249-260 Analysis Methods for Supersaturated Design: Some Comparisons Runze Li 1 and Dennis K. J. Lin 2 The Pennsylvania State University Abstract: Supersaturated designs

More information

Bayesian Inference of Interactions and Associations

Bayesian Inference of Interactions and Associations Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,

More information

A State Space Model Approach for HIV Infection Dynamics

A State Space Model Approach for HIV Infection Dynamics A State Space Model Approach for HIV Infection Dynamics Jiabin Wang a, Hua Liang b, and Rong Chen a Mathematical models have been proposed and developed to model HIV dynamics over the past few decades,

More information

Expression Data Exploration: Association, Patterns, Factors & Regression Modelling

Expression Data Exploration: Association, Patterns, Factors & Regression Modelling Expression Data Exploration: Association, Patterns, Factors & Regression Modelling Exploring gene expression data Scale factors, median chip correlation on gene subsets for crude data quality investigation

More information

A Hierarchical Bayesian Approach for Parameter Estimation in HIV Models

A Hierarchical Bayesian Approach for Parameter Estimation in HIV Models A Hierarchical Bayesian Approach for Parameter Estimation in HIV Models H. T. Banks, Sarah Grove, Shuhua Hu, and Yanyuan Ma Center for Research in Scientific Computation North Carolina State University

More information

Functional Estimation in Systems Defined by Differential Equation using Bayesian Smoothing Methods

Functional Estimation in Systems Defined by Differential Equation using Bayesian Smoothing Methods Université Catholique de Louvain Institut de Statistique, Biostatistique et Sciences Actuarielles Functional Estimation in Systems Defined by Differential Equation using Bayesian Smoothing Methods 19th

More information

ESTIMATION OF ORDINARY DIFFERENTIAL EQUATION PARAMETERS USING CONSTRAINED LOCAL POLYNOMIAL REGRESSION

ESTIMATION OF ORDINARY DIFFERENTIAL EQUATION PARAMETERS USING CONSTRAINED LOCAL POLYNOMIAL REGRESSION Statistica Sinica 24 (2014), 1613-1631 doi:http://dx.doi.org/10.5705/ss.2012.304 ESTIMATION OF ORDINARY DIFFERENTIAL EQUATION PARAMETERS USING CONSTRAINED LOCAL POLYNOMIAL REGRESSION A. Adam Ding and Hulin

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Integrating Mathematical and Statistical Models Recap of mathematical models Models and data Statistical models and sources of

More information

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

Bayesian shrinkage approach in variable selection for mixed

Bayesian shrinkage approach in variable selection for mixed Bayesian shrinkage approach in variable selection for mixed effects s GGI Statistics Conference, Florence, 2015 Bayesian Variable Selection June 22-26, 2015 Outline 1 Introduction 2 3 4 Outline Introduction

More information

Learning in Bayesian Networks

Learning in Bayesian Networks Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks

More information

ON IDENTIFIABILITY OF NONLINEAR ODE MODELS AND APPLICATIONS IN VIRAL DYNAMICS

ON IDENTIFIABILITY OF NONLINEAR ODE MODELS AND APPLICATIONS IN VIRAL DYNAMICS ON IDENTIFIABILITY OF NONLINEAR ODE MODELS AND APPLICATIONS IN VIRAL DYNAMICS HONGYU MIAO, XIAOHUA XIA, ALAN S. PERELSON, AND HULIN WU Abstract. Ordinary differential equations (ODE) are a powerful tool

More information

Unravelling the biochemical reaction kinetics from time-series data

Unravelling the biochemical reaction kinetics from time-series data Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute Email: schnell@indiana.edu WWW: http://www.informatics.indiana.edu/schnell

More information

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models 02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Jointly modeling time-to-event and longitudinal data: a Bayesian approach

Jointly modeling time-to-event and longitudinal data: a Bayesian approach Stat Methods Appl (2014) 23:95 121 DOI 10.1007/s10260-013-0242-7 Jointly modeling time-to-event and longitudinal data: a Bayesian approach Yangxin Huang X. Joan Hu Getachew A. Dagne Accepted: 7 September

More information

Or How to select variables Using Bayesian LASSO

Or How to select variables Using Bayesian LASSO Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO On Bayesian Variable Selection

More information

MEDLINE Clinical Laboratory Sciences Journals

MEDLINE Clinical Laboratory Sciences Journals Source Type Publication Name ISSN Peer-Reviewed Academic Journal Acta Biochimica et Biophysica Sinica 1672-9145 Y Academic Journal Acta Physiologica 1748-1708 Y Academic Journal Aging Cell 1474-9718 Y

More information

State Space and Hidden Markov Models

State Space and Hidden Markov Models State Space and Hidden Markov Models Kunsch H.R. State Space and Hidden Markov Models. ETH- Zurich Zurich; Aliaksandr Hubin Oslo 2014 Contents 1. Introduction 2. Markov Chains 3. Hidden Markov and State

More information

Heterogeneous shedding of influenza by human subjects and. its implications for epidemiology and control

Heterogeneous shedding of influenza by human subjects and. its implications for epidemiology and control 1 2 3 4 5 6 7 8 9 10 11 Heterogeneous shedding of influenza by human subjects and its implications for epidemiology and control Laetitia Canini 1*, Mark EJ Woolhouse 1, Taronna R. Maines 2, Fabrice Carrat

More information

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016 Bayesian Networks: Construction, Inference, Learning and Causal Interpretation Volker Tresp Summer 2016 1 Introduction So far we were mostly concerned with supervised learning: we predicted one or several

More information

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2014

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2014 Bayesian Networks: Construction, Inference, Learning and Causal Interpretation Volker Tresp Summer 2014 1 Introduction So far we were mostly concerned with supervised learning: we predicted one or several

More information

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage Lingrui Gan, Naveen N. Narisetty, Feng Liang Department of Statistics University of Illinois at Urbana-Champaign Problem Statement

More information

Extension of the SAEM algorithm to left-censored data in nonlinear mixed-effects model: application to HIV dynamics model

Extension of the SAEM algorithm to left-censored data in nonlinear mixed-effects model: application to HIV dynamics model Extension of the SAEM algorithm to left-censored data in nonlinear mixed-effects model: application to HIV dynamics model Adeline Samson 1, Marc Lavielle 2, France Mentré 1 1 INSERM U738, Paris, France;

More information

Estimating subgroup specific treatment effects via concave fusion

Estimating subgroup specific treatment effects via concave fusion Estimating subgroup specific treatment effects via concave fusion Jian Huang University of Iowa April 6, 2016 Outline 1 Motivation and the problem 2 The proposed model and approach Concave pairwise fusion

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Inferring Transcriptional Regulatory Networks from High-throughput Data

Inferring Transcriptional Regulatory Networks from High-throughput Data Inferring Transcriptional Regulatory Networks from High-throughput Data Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20

More information

Least Absolute Shrinkage is Equivalent to Quadratic Penalization

Least Absolute Shrinkage is Equivalent to Quadratic Penalization Least Absolute Shrinkage is Equivalent to Quadratic Penalization Yves Grandvalet Heudiasyc, UMR CNRS 6599, Université de Technologie de Compiègne, BP 20.529, 60205 Compiègne Cedex, France Yves.Grandvalet@hds.utc.fr

More information

Approaches to Modeling Menstrual Cycle Function

Approaches to Modeling Menstrual Cycle Function Approaches to Modeling Menstrual Cycle Function Paul S. Albert (albertp@mail.nih.gov) Biostatistics & Bioinformatics Branch Division of Epidemiology, Statistics, and Prevention Research NICHD SPER Student

More information

Discovering molecular pathways from protein interaction and ge

Discovering molecular pathways from protein interaction and ge Discovering molecular pathways from protein interaction and gene expression data 9-4-2008 Aim To have a mechanism for inferring pathways from gene expression and protein interaction data. Motivation Why

More information

An Adaptive Association Test for Microbiome Data

An Adaptive Association Test for Microbiome Data An Adaptive Association Test for Microbiome Data Chong Wu 1, Jun Chen 2, Junghi 1 Kim and Wei Pan 1 1 Division of Biostatistics, School of Public Health, University of Minnesota; 2 Division of Biomedical

More information

Bayesian Statistics for Personalized Medicine. David Yang

Bayesian Statistics for Personalized Medicine. David Yang Bayesian Statistics for Personalized Medicine David Yang Outline Why Bayesian Statistics for Personalized Medicine? A Network-based Bayesian Strategy for Genomic Biomarker Discovery Part One Why Bayesian

More information

Sparse Stochastic Inference for Latent Dirichlet Allocation

Sparse Stochastic Inference for Latent Dirichlet Allocation Sparse Stochastic Inference for Latent Dirichlet Allocation David Mimno 1, Matthew D. Hoffman 2, David M. Blei 1 1 Dept. of Computer Science, Princeton U. 2 Dept. of Statistics, Columbia U. Presentation

More information

Efficient Likelihood-Free Inference

Efficient Likelihood-Free Inference Efficient Likelihood-Free Inference Michael Gutmann http://homepages.inf.ed.ac.uk/mgutmann Institute for Adaptive and Neural Computation School of Informatics, University of Edinburgh 8th November 2017

More information

Spatially Adaptive Smoothing Splines

Spatially Adaptive Smoothing Splines Spatially Adaptive Smoothing Splines Paul Speckman University of Missouri-Columbia speckman@statmissouriedu September 11, 23 Banff 9/7/3 Ordinary Simple Spline Smoothing Observe y i = f(t i ) + ε i, =

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

25 : Graphical induced structured input/output models

25 : Graphical induced structured input/output models 10-708: Probabilistic Graphical Models 10-708, Spring 2016 25 : Graphical induced structured input/output models Lecturer: Eric P. Xing Scribes: Raied Aljadaany, Shi Zong, Chenchen Zhu Disclaimer: A large

More information

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Sahar Z Zangeneh Robert W. Keener Roderick J.A. Little Abstract In Probability proportional

More information

Regularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics

Regularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics Regularization Parameter Selection for a Bayesian Multi-Level Group Lasso Regression Model with Application to Imaging Genomics arxiv:1603.08163v1 [stat.ml] 7 Mar 016 Farouk S. Nathoo, Keelin Greenlaw,

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods

A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods International Academic Institute for Science and Technology International Academic Journal of Science and Engineering Vol. 3, No. 6, 2016, pp. 169-176. ISSN 2454-3896 International Academic Journal of

More information

An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models

An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session CPS023) p.3938 An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models Vitara Pungpapong

More information

Hierarchical expectation propagation for Bayesian aggregation of average data

Hierarchical expectation propagation for Bayesian aggregation of average data Hierarchical expectation propagation for Bayesian aggregation of average data Andrew Gelman, Columbia University Sebastian Weber, Novartis also Bob Carpenter, Daniel Lee, Frédéric Bois, Aki Vehtari, and

More information

Chapter 3. Linear Models for Regression

Chapter 3. Linear Models for Regression Chapter 3. Linear Models for Regression Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 Email: weip@biostat.umn.edu PubH 7475/8475 c Wei Pan Linear

More information

Regularized Regression A Bayesian point of view

Regularized Regression A Bayesian point of view Regularized Regression A Bayesian point of view Vincent MICHEL Director : Gilles Celeux Supervisor : Bertrand Thirion Parietal Team, INRIA Saclay Ile-de-France LRI, Université Paris Sud CEA, DSV, I2BM,

More information

Consistent high-dimensional Bayesian variable selection via penalized credible regions

Consistent high-dimensional Bayesian variable selection via penalized credible regions Consistent high-dimensional Bayesian variable selection via penalized credible regions Howard Bondell bondell@stat.ncsu.edu Joint work with Brian Reich Howard Bondell p. 1 Outline High-Dimensional Variable

More information

Optimal State Estimation for Boolean Dynamical Systems using a Boolean Kalman Smoother

Optimal State Estimation for Boolean Dynamical Systems using a Boolean Kalman Smoother Optimal State Estimation for Boolean Dynamical Systems using a Boolean Kalman Smoother Mahdi Imani and Ulisses Braga-Neto Department of Electrical and Computer Engineering Texas A&M University College

More information

Semi-Penalized Inference with Direct FDR Control

Semi-Penalized Inference with Direct FDR Control Jian Huang University of Iowa April 4, 2016 The problem Consider the linear regression model y = p x jβ j + ε, (1) j=1 where y IR n, x j IR n, ε IR n, and β j is the jth regression coefficient, Here p

More information

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, Ph.D. Computer Science, Kennesaw State University Problems

More information

Identifying Bio-markers for EcoArray

Identifying Bio-markers for EcoArray Identifying Bio-markers for EcoArray Ashish Bhan, Keck Graduate Institute Mustafa Kesir and Mikhail B. Malioutov, Northeastern University February 18, 2010 1 Introduction This problem was presented by

More information

Proteomics and Variable Selection

Proteomics and Variable Selection Proteomics and Variable Selection p. 1/55 Proteomics and Variable Selection Alex Lewin With thanks to Paul Kirk for some graphs Department of Epidemiology and Biostatistics, School of Public Health, Imperial

More information

Mixture models for analysing transcriptome and ChIP-chip data

Mixture models for analysing transcriptome and ChIP-chip data Mixture models for analysing transcriptome and ChIP-chip data Marie-Laure Martin-Magniette French National Institute for agricultural research (INRA) Unit of Applied Mathematics and Informatics at AgroParisTech,

More information

Numerical Solutions of ODEs by Gaussian (Kalman) Filtering

Numerical Solutions of ODEs by Gaussian (Kalman) Filtering Numerical Solutions of ODEs by Gaussian (Kalman) Filtering Hans Kersting joint work with Michael Schober, Philipp Hennig, Tim Sullivan and Han C. Lie SIAM CSE, Atlanta March 1, 2017 Emmy Noether Group

More information

A Probabilistic Framework for solving Inverse Problems. Lambros S. Katafygiotis, Ph.D.

A Probabilistic Framework for solving Inverse Problems. Lambros S. Katafygiotis, Ph.D. A Probabilistic Framework for solving Inverse Problems Lambros S. Katafygiotis, Ph.D. OUTLINE Introduction to basic concepts of Bayesian Statistics Inverse Problems in Civil Engineering Probabilistic Model

More information

Bayesian Data Fusion with Gaussian Process Priors : An Application to Protein Fold Recognition

Bayesian Data Fusion with Gaussian Process Priors : An Application to Protein Fold Recognition Bayesian Data Fusion with Gaussian Process Priors : An Application to Protein Fold Recognition Mar Girolami 1 Department of Computing Science University of Glasgow girolami@dcs.gla.ac.u 1 Introduction

More information

Inferring biomarkers for Mycobacterium avium subsp. paratuberculosis infection and disease progression in cattle using experimental data

Inferring biomarkers for Mycobacterium avium subsp. paratuberculosis infection and disease progression in cattle using experimental data Inferring biomarkers for Mycobacterium avium subsp. paratuberculosis infection and disease progression in cattle using experimental data 1,2,4, Gesham Magombedze, Tinevimbo Shiri, Shigetoshi Eda 4, 5,

More information

Multiple Change-Point Detection and Analysis of Chromosome Copy Number Variations

Multiple Change-Point Detection and Analysis of Chromosome Copy Number Variations Multiple Change-Point Detection and Analysis of Chromosome Copy Number Variations Yale School of Public Health Joint work with Ning Hao, Yue S. Niu presented @Tsinghua University Outline 1 The Problem

More information

Sequential Monte Carlo Methods for Bayesian Model Selection in Positron Emission Tomography

Sequential Monte Carlo Methods for Bayesian Model Selection in Positron Emission Tomography Methods for Bayesian Model Selection in Positron Emission Tomography Yan Zhou John A.D. Aston and Adam M. Johansen 6th January 2014 Y. Zhou J. A. D. Aston and A. M. Johansen Outline Positron emission tomography

More information

Permutation-invariant regularization of large covariance matrices. Liza Levina

Permutation-invariant regularization of large covariance matrices. Liza Levina Liza Levina Permutation-invariant covariance regularization 1/42 Permutation-invariant regularization of large covariance matrices Liza Levina Department of Statistics University of Michigan Joint work

More information

Bayesian Calibration of Simulators with Structured Discretization Uncertainty

Bayesian Calibration of Simulators with Structured Discretization Uncertainty Bayesian Calibration of Simulators with Structured Discretization Uncertainty Oksana A. Chkrebtii Department of Statistics, The Ohio State University Joint work with Matthew T. Pratola (Statistics, The

More information

Disk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model

Disk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model 1 / 23 Disk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model Glen DePalma gdepalma@purdue.edu Bruce A. Craig bacraig@purdue.edu Eastern North

More information

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian Nonparametric Regression for Diabetes Deaths Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,

More information

hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference

hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference CS 229 Project Report (TR# MSB2010) Submitted 12/10/2010 hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference Muhammad Shoaib Sehgal Computer Science

More information

Research Article Mixed-Effects Tobit Joint Models for Longitudinal Data with Skewness, Detection Limits, and Measurement Errors

Research Article Mixed-Effects Tobit Joint Models for Longitudinal Data with Skewness, Detection Limits, and Measurement Errors Journal of Probability and Statistics Volume 2012, Article ID 614102, 19 pages doi:10.1155/2012/614102 Research Article Mixed-Effects Tobit Joint Models for Longitudinal Data with Skewness, Detection Limits,

More information

The University of Auckland Applied Mathematics Bayesian Methods for Inverse Problems : why and how Colin Fox Tiangang Cui, Mike O Sullivan (Auckland),

The University of Auckland Applied Mathematics Bayesian Methods for Inverse Problems : why and how Colin Fox Tiangang Cui, Mike O Sullivan (Auckland), The University of Auckland Applied Mathematics Bayesian Methods for Inverse Problems : why and how Colin Fox Tiangang Cui, Mike O Sullivan (Auckland), Geoff Nicholls (Statistics, Oxford) fox@math.auckland.ac.nz

More information

Inference for partially observed stochastic dynamic systems: A new algorithm, its theory and applications

Inference for partially observed stochastic dynamic systems: A new algorithm, its theory and applications Inference for partially observed stochastic dynamic systems: A new algorithm, its theory and applications Edward Ionides Department of Statistics, University of Michigan ionides@umich.edu Statistics Department

More information

Extension of the SAEM algorithm for nonlinear mixed models with 2 levels of random effects Panhard Xavière

Extension of the SAEM algorithm for nonlinear mixed models with 2 levels of random effects Panhard Xavière Extension of the SAEM algorithm for nonlinear mixed models with 2 levels of random effects Panhard Xavière * Modèles et mé thodes de l'évaluation thérapeutique des maladies chroniques INSERM : U738, Universit

More information

Variable selection and machine learning methods in causal inference

Variable selection and machine learning methods in causal inference Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of

More information

Inferring Transcriptional Regulatory Networks from Gene Expression Data II

Inferring Transcriptional Regulatory Networks from Gene Expression Data II Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday

More information

Basic modeling approaches for biological systems. Mahesh Bule

Basic modeling approaches for biological systems. Mahesh Bule Basic modeling approaches for biological systems Mahesh Bule The hierarchy of life from atoms to living organisms Modeling biological processes often requires accounting for action and feedback involving

More information

Wavelet-Based Nonparametric Modeling of Hierarchical Functions in Colon Carcinogenesis

Wavelet-Based Nonparametric Modeling of Hierarchical Functions in Colon Carcinogenesis Wavelet-Based Nonparametric Modeling of Hierarchical Functions in Colon Carcinogenesis Jeffrey S. Morris University of Texas, MD Anderson Cancer Center Joint wor with Marina Vannucci, Philip J. Brown,

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

High-dimensional Problems in Finance and Economics. Thomas M. Mertens

High-dimensional Problems in Finance and Economics. Thomas M. Mertens High-dimensional Problems in Finance and Economics Thomas M. Mertens NYU Stern Risk Economics Lab April 17, 2012 1 / 78 Motivation Many problems in finance and economics are high dimensional. Dynamic Optimization:

More information

A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables

A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables A New Bayesian Variable Selection Method: The Bayesian Lasso with Pseudo Variables Qi Tang (Joint work with Kam-Wah Tsui and Sijian Wang) Department of Statistics University of Wisconsin-Madison Feb. 8,

More information

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is

More information

Integrated Non-Factorized Variational Inference

Integrated Non-Factorized Variational Inference Integrated Non-Factorized Variational Inference Shaobo Han, Xuejun Liao and Lawrence Carin Duke University February 27, 2014 S. Han et al. Integrated Non-Factorized Variational Inference February 27, 2014

More information

POMP inference via iterated filtering

POMP inference via iterated filtering POMP inference via iterated filtering Edward Ionides University of Michigan, Department of Statistics Lecture 3 at Wharton Statistics Department Thursday 27th April, 2017 Slides are online at http://dept.stat.lsa.umich.edu/~ionides/talks/upenn

More information

Recent Advances in Bayesian Inference Techniques

Recent Advances in Bayesian Inference Techniques Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian

More information

BART: Bayesian additive regression trees

BART: Bayesian additive regression trees BART: Bayesian additive regression trees Hedibert F. Lopes & Paulo Marques Insper Institute of Education and Research São Paulo, Brazil Most of the notes were kindly provided by Rob McCulloch (Arizona

More information

Gaussian Process Approximations of Stochastic Differential Equations

Gaussian Process Approximations of Stochastic Differential Equations Gaussian Process Approximations of Stochastic Differential Equations Cédric Archambeau Centre for Computational Statistics and Machine Learning University College London c.archambeau@cs.ucl.ac.uk CSML

More information

Simultaneous Inference for Multiple Testing and Clustering via Dirichlet Process Mixture Models

Simultaneous Inference for Multiple Testing and Clustering via Dirichlet Process Mixture Models Simultaneous Inference for Multiple Testing and Clustering via Dirichlet Process Mixture Models David B. Dahl Department of Statistics Texas A&M University Marina Vannucci, Michael Newton, & Qianxing Mo

More information

Selection of Variables and Functional Forms in Multivariable Analysis: Current Issues and Future Directions

Selection of Variables and Functional Forms in Multivariable Analysis: Current Issues and Future Directions in Multivariable Analysis: Current Issues and Future Directions Frank E Harrell Jr Department of Biostatistics Vanderbilt University School of Medicine STRATOS Banff Alberta 2016-07-04 Fractional polynomials,

More information

Embedding Supernova Cosmology into a Bayesian Hierarchical Model

Embedding Supernova Cosmology into a Bayesian Hierarchical Model 1 / 41 Embedding Supernova Cosmology into a Bayesian Hierarchical Model Xiyun Jiao Statistic Section Department of Mathematics Imperial College London Joint work with David van Dyk, Roberto Trotta & Hikmatali

More information

Survival Prediction Under Dependent Censoring: A Copula-based Approach

Survival Prediction Under Dependent Censoring: A Copula-based Approach Survival Prediction Under Dependent Censoring: A Copula-based Approach Yi-Hau Chen Institute of Statistical Science, Academia Sinica 2013 AMMS, National Sun Yat-Sen University December 7 2013 Joint work

More information

Design of HIV Dynamic Experiments: A Case Study

Design of HIV Dynamic Experiments: A Case Study Design of HIV Dynamic Experiments: A Case Study Cong Han Department of Biostatistics University of Washington Kathryn Chaloner Department of Biostatistics University of Iowa Nonlinear Mixed-Effects Models

More information

Chapters AP Biology Objectives. Objectives: You should know...

Chapters AP Biology Objectives. Objectives: You should know... Objectives: You should know... Notes 1. Scientific evidence supports the idea that evolution has occurred in all species. 2. Scientific evidence supports the idea that evolution continues to occur. 3.

More information

Geert Geeven. April 14, 2010

Geert Geeven. April 14, 2010 iction of Gene Regulatory Interactions NDNS+ Workshop April 14, 2010 Today s talk - Outline Outline Biological Background Construction of Predictors The main aim of my project is to better understand the

More information

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Jin Kyung Park International Vaccine Institute Min Woo Chae Seoul National University R. Leon Ochiai International

More information

Causal Graphical Models in Systems Genetics

Causal Graphical Models in Systems Genetics 1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation

More information

A novel and efficient algorithm for de novo discovery of mutated driver pathways in cancer

A novel and efficient algorithm for de novo discovery of mutated driver pathways in cancer A novel and efficient algorithm for de novo discovery of mutated driver pathways in cancer Binghui Liu, Chong Wu, Xiaotong Shen, Wei Pan University of Minnesota, Minneapolis, MN 55455 Nov 2017 Introduction

More information

Advances and Applications in Perfect Sampling

Advances and Applications in Perfect Sampling and Applications in Perfect Sampling Ph.D. Dissertation Defense Ulrike Schneider advisor: Jem Corcoran May 8, 2003 Department of Applied Mathematics University of Colorado Outline Introduction (1) MCMC

More information

Statistical Inference

Statistical Inference Statistical Inference Liu Yang Florida State University October 27, 2016 Liu Yang, Libo Wang (Florida State University) Statistical Inference October 27, 2016 1 / 27 Outline The Bayesian Lasso Trevor Park

More information

Approximate Bayesian Computation

Approximate Bayesian Computation Approximate Bayesian Computation Michael Gutmann https://sites.google.com/site/michaelgutmann University of Helsinki and Aalto University 1st December 2015 Content Two parts: 1. The basics of approximate

More information

25 : Graphical induced structured input/output models

25 : Graphical induced structured input/output models 10-708: Probabilistic Graphical Models 10-708, Spring 2013 25 : Graphical induced structured input/output models Lecturer: Eric P. Xing Scribes: Meghana Kshirsagar (mkshirsa), Yiwen Chen (yiwenche) 1 Graph

More information