Package NTW. November 20, 2018

Similar documents
Package noise. July 29, 2016

Package GeneExpressionSignature

Package SEMModComp. R topics documented: February 19, Type Package Title Model Comparisons for SEM Version 1.0 Date Author Roy Levy

Package ChIPtest. July 20, 2016

Package SCEPtERbinary

Package ltsbase. R topics documented: February 20, 2015

Package ShrinkCovMat

Package mirlastic. November 12, 2015

Package BRAIN. October 16, 2018

Package flowmap. October 8, 2018

Package BNN. February 2, 2018

Package depth.plot. December 20, 2015

Package scio. February 20, 2015

Package LBLGXE. R topics documented: July 20, Type Package

Package covrna. R topics documented: September 7, Type Package

Package plw. R topics documented: May 7, Type Package

Package BRAIN. June 30, 2018

Package genphen. March 18, 2019

Package idmtpreg. February 27, 2018

Package effectfusion

Package sodavis. May 13, 2018

Package Blendstat. February 21, 2018

Package BayesNI. February 19, 2015

Package basad. November 20, 2017

Package EMVS. April 24, 2018

Package locfdr. July 15, Index 5

Package matrixnormal

Package rbvs. R topics documented: August 29, Type Package Title Ranking-Based Variable Selection Version 1.0.

Package gtheory. October 30, 2016

Package factorqr. R topics documented: February 19, 2015

Package dama. November 25, 2017

Package RI2by2. October 21, 2016

Package NanoStringDiff

Package survidinri. February 20, 2015

Package symmoments. R topics documented: February 20, Type Package

Package LCAextend. August 29, 2013

Package ICBayes. September 24, 2017

The sbgcop Package. March 9, 2007

Package SK. May 16, 2018

Package msir. R topics documented: April 7, Type Package Version Date Title Model-Based Sliced Inverse Regression

Package sscor. January 28, 2016

Package HDtest. R topics documented: September 13, Type Package

Package mpmcorrelogram

Package ARCensReg. September 11, 2016

Package sbgcop. May 29, 2018

Package intansv. October 14, 2018

Package beam. May 3, 2018

Package flexcwm. R topics documented: July 11, Type Package. Title Flexible Cluster-Weighted Modeling. Version 1.0.

Package bigtime. November 9, 2017

Package gsarima. February 20, 2015

Package varband. November 7, 2016

Package magma. February 15, 2013

Package flexcwm. R topics documented: July 2, Type Package. Title Flexible Cluster-Weighted Modeling. Version 1.1.

Package gma. September 19, 2017

Package MiRKATS. May 30, 2016

Package clogitboost. R topics documented: December 21, 2015

Package unbalhaar. February 20, 2015

Package penalized. February 21, 2018

Package tsvd. March 11, 2015

Package ebal. R topics documented: February 19, Type Package

Package WGDgc. June 3, 2014

Package CovSel. September 25, 2013

Package nardl. R topics documented: May 7, Type Package

Package BSSasymp. R topics documented: September 12, Type Package

Final Review. Yang Feng. Yang Feng (Columbia University) Final Review 1 / 58

Package noncompliance

Package bmem. February 15, 2013

Package icmm. October 12, 2017

Package tsvd. March 24, 2015

Package SSPA. December 9, 2018

Package FDRreg. August 29, 2016

Package RcppSMC. March 18, 2018

Linear Models and Estimation by Least Squares

Package MACAU2. R topics documented: April 8, Type Package. Title MACAU 2.0: Efficient Mixed Model Analysis of Count Data. Version 1.

Package cellscape. October 15, 2018

Package mixture. R topics documented: February 13, 2018

Package PoisNonNor. R topics documented: March 10, Type Package

Package mar. R topics documented: February 20, Title Multivariate AutoRegressive analysis Version Author S. M. Barbosa

Package hierdiversity

Package neat. February 23, 2018

Package sindyr. September 10, 2018

Package WGDgc. October 23, 2015

Package milineage. October 20, 2017

Package NonpModelCheck

Package xdcclarge. R topics documented: July 12, Type Package

Package Rdisop. R topics documented: August 20, Title Decomposition of Isotopic Patterns Version Date

Package ForecastCombinations

Package SHIP. February 19, 2015

Package plspolychaos

Package SPreFuGED. July 29, 2016

Inferring gene regulatory networks

Package abundant. January 1, 2017

Managing Uncertainty

Package rarhsmm. March 20, 2018

Package denseflmm. R topics documented: March 17, 2017

Package CoxRidge. February 27, 2015

Package LPTime. March 3, 2015

The ICS Package. May 4, 2007

Package FindAllRoots

Package CCP. R topics documented: December 17, Type Package. Title Significance Tests for Canonical Correlation Analysis (CCA) Version 0.

Transcription:

Type Package Package NTW November 20, 2018 Title Predict gene network using an Ordinary Differential Equation (ODE) based method Depends R (>= 2.3.0) Imports mvtnorm, stats, utils Version 1.33.0 Date 2011-10-8 Author Wei iao, Yin Jin, Darong Lai, inyi Yang, Yuanhua Liu, Christine Nardini Maintainer Yuanhua Liu <liuyuanhua@picb.ac.cn> This package predicts the gene-gene interaction network and identifies the direct transcriptional targets of the perturbation using an ODE (Ordinary Differential Equation) based method. License GPL-2 biocviews Preprocessing LazyLoad yes git_url https://git.bioconductor.org/packages/ntw git_branch master git_last_commit 239d190 git_last_commit_date 2018-10-30 Date/Publication 2018-11-19 R topics documented: NTW-package........................................ 2 A.estimation.Srow..................................... 3 AP.estimation.Srow..................................... 4 comb.matrix......................................... 5 methods.regression..................................... 6 NTW............................................ 7 P.preestimation....................................... 8 patterns.priora....................................... 9 sos.data........................................... 10 Index 11 1

2 NTW-package NTW-package Gene interaction network and perturbation targets predictions Details This package includes the functions for estimating the gene-gene interaction network (a matrix, named A, with genes as rows and columns) and the associated transcriptional targets of the perturbations (a matrix, named P, with genes as rows and perturbations as columns). These estimations are computed with the NTW algorithm, a gene network inference algorithm based on ODE (ordinary differential equation) method, see reference. In this package, the whole A matrix and P matrix are estimated row by row with the function AP.estimation.Srow, and built together with the function NTW. AP.estimation.Srow can be used independently so that estimation of each row can be performed in parallel, improving computation time. For solving the steady state ODE equations, 3 regression methods are supplied: geo, sse and ml, see details in the the corresponding function help pages. In addition, in order to accelerate the estimation of matrix A, an option is available to make use of some prior information such as gene association (output from other gene netwrok inference algorithms, or from literature) in NTW. The regression methods used in forward or backward mode makes 6 possibilities available for estimating a single row of A matrix. The main functions in this package are listed below, NTW, to estimate the whole matrix A and P (if P is unknown). AP.estimation.Srow, to estimate one single row in A and P. A.estimation.Srow, to estimate one single row in A with P known. backward and forward, to estimate one single row of matrix A with different patterns of using prior gene association information. method.geo, method.sse and method.ml, to estimate one single row of matrix A with different regression methods. comb.matrix, sub-function to create all the combinations for regressor locations. P.preestimation, pre-estimate P matrix according to the gene expression data. Package: NTW Type: Package Version: 0.99.0 Date: 2010-5-11 License: GPL-2 LazyLoad: yes Wei iao, Yin Jin, Darong Lai, inyi Yang, Yuanhua Liu, Christine Nardini Maintainer: Yuanhua Liu <liuyuanhua@picb.ac.cn>

A.estimation.Srow 3 References Applied method for the inference of gene networks: the bifidobacterium case. to be submitted A.estimation.Srow Estimation of a single row in matrix A with the perturbation targets matrix P known Estimating a single row of gene interaction matrix A when the perturbation targets matrix P is given. The single row in A is then regressed according to the equation A=P with one of the three regression methods, geo, sse and ml. A.estimation.Srow(r, cmm.corrected, pred.net,, P.known, topd, restk, cflag, sup.drop, noiselevel) r cmm.corrected pred.net A number indicating the row of A to be estimated. A flag to indicate whether a prior network is applied. A matrix with the same dimensions of A for prior network, which should be specified if cmm.corrected is 1, default is NULL. Gene expression data, a matrix with genes as rows and perturbations as columns. P.known A known P matrix with the same dimensions of. topd restk cflag sup.drop noiselevel A parameter in NTW algorithm for keeping the top topd combinations of nonzero regressors of row r in A, see vignette for details. A vector (length equals to nrow(a)) with each element to indicate the number of non-zero regressors in the corresponding row of A. A flag to tell the regression methods, "geo" for geometric mean method, "sse" for sum of square method and "ml" for maximum likelihood method. An indication to identify the pattern for using the prior gene association information. 1 for "forward" pattern and -1 for "backward" pattern, see vignette for details. Only used in "ml" method, to indicate the noise level in each perturbed experiment. A.row A vector of estimated row r in A. Wei iao, Yin Jin, Darong Lai, inyi Yang,Yuanhua Liu, Christine Nardini

4 AP.estimation.Srow ##single row estimation without prior gene association information, regression is done by "sse"## <-sos.data <-as.matrix() P.known<-matrix(round(runif(nrow()*ncol(), min=0, max=1)), nrow(), ncol()) restk=rep(ncol()-1, nrow()) topd = round(0.6*nrow()) topk = round(0.5*nrow()) result<-a.estimation.srow(r=1,cmm.corrected = 0, pred.net= NULL,,P.known, topd, restk, cflag="sse",sup.drop = -1, noiselevel=0.1) result$a.row ##single row estimation with prior gene association information, regression is done by "geo"### pred.net<-matrix(round(runif(nrow()*nrow(), min=0, max=1)), nrow(), ncol()) result<-a.estimation.srow(r=1,cmm.corrected = 1, pred.net,,p.known,topd, restk, cflag="geo",sup.drop = -1, noiselevel=0.1) result$a.row AP.estimation.Srow Estimation of a single row in gene interaction matrix A and perturbation targets matrix P Estimating a single row of gene interaction matrix A and identifying the perturbations which target the corresponding gene of the row. For perturbation identifications, multiple perturbations are considered for one target (gene). Combinations of perturbations are first assumed fixed. The single row in A are then regressed according to equation A=P with one of the three regression methods, geo,sse and ml. All these combinations will finally be optimized according to the difference between the predicted with the estimated A and P and the experimental values. AP.estimation.Srow(r, cmm.corrected, pred.net,, I, topd, restk, cflag, sup.drop, nump, noiselevel r cmm.corrected pred.net I topd restk A number indicating the row of A and P to be estimated. A flag to indicate whether a prior network is applied. A matrix with the same dimensions of A for the prior network, which should be specified if cmm.corrected is 1, default is NULL. Gene expression data, a matrix with genes as rows and perturbations as columns. The pre-estimated P matrix according to the gene expression data, with the same dimensions of or P. A parameter in NTW algorithm for keeping the top topd combinations of nonzero regressors of row r in A, see vignette for details. A vector (length equals to nrow(a)) with each element to indicate the number of non-zero regressors in the corresponding row of A.

comb.matrix 5 cflag sup.drop nump noiselevel A flag to identify the regression methods, "geo" for geometric mean method, "sse" for sum of square method and "ml" for maximum likelihood method. A flag to show the pattern for using the prior gene association information. 1 for "forward" pattern and -1 for "backward" pattern. A number set to limit the possibilities that one gene will be targeted by perturbations. That is at most nump perturbations can directly perturb one gene. Only used in "ml" method, to indicate the noise level in each perturbed experiment. A.row Estimation of the row r in A. P.index A vector to show which perturbations target the gene corresponding to row r. Wei iao, Yin Jin, Darong Lai, inyi Yang,Yuanhua Liu, Christine Nardini ##single row estimation without prior gene association information, regression is done by "sse"## <-sos.data <-as.matrix() I<-P.preestimation(, topk= round(2*nrow())) restk=rep(ncol()-1, nrow()) topd = round(0.6*nrow()) topk = round(0.5*nrow()) nump = round(0.25*nrow()) result<-ap.estimation.srow(r=1,cmm.corrected = 0, pred.net= NULL,, I,topD, restk, cflag="sse",sup.drop = -1, nump, noiselevel=0.1) result$a.row result$p.index ###single row estimation with prior gene association information, regression is done by "geo"### pred.net<-matrix(round(runif(nrow()*nrow(), min=0, max=1)), nrow(), ncol()) result<-ap.estimation.srow(r=1,cmm.corrected = 1, pred.net,, I,topD, restk, cflag="geo",sup.drop = -1, nump, noiselevel=0.1) result$a.row result$p.index comb.matrix Create all combinations of vectors Create all combinations of vectors especially for matrices. comb.matrix(x, y)

6 methods.regression x y A vector. A vector. A matrix with nrow(x)*nrow(y) rows and ncol(x)+ncol(y) columns. Wei iao, Yin Jin, Darong Lai, inyi Yang, Yuanhua Liu, Christine Nardini ###A matrix with only one row is obtained#### x<-c(1,2,3) y<-c(4,5) comb.matrix(x,y) ###A matrix with 2 rows and 4 columns is obtained#### x<-matrix(x,1,) y<-matrix(y,,1) comb.matrix(x,y) methods.regression Regression methods to estimate a single row in A with fixed perturbations Regression methods to estimate a single row in a gene interaction network (A) with perturbations (P) fixed. These methods differ at the regression criterions: the objective function of geo is the geometric mean, sum of square for sse and maximum likelihood for ml, see vignette for the details. method.geo(index.vars,, pert) method.sse(index.vars,, pert) method.ml(index.vars,, pert, noiselevel) index.vars A vector to select the rows in on which regression will be used. It sets a group of combination of rows in. Gene expression data, a matrix with genes as rows and perturbations as columns. pert A vector of row r in P. noiselevel Indicate the noise level in each perturbed experiment. sol error A vector of regression result The result of the objective function.

NTW 7 Wei iao, Yin Jin, Darong Lai, inyi Yang,Yuanhua Liu, Christine Nardini NTW Estimation of gene interaction matrix A and perturbation targets matrix P This function is used to estimate the whole gene interaction matrix A and the perturbation targets matrix P, row-wise, using the NTW algorithm (see reference), based on ODE method. In this method, the linearized ODE can solved using 3 regression methods: geo, sse and ml. In order to save computation time, and improve results, NTW offers the opportunity to input gene association information output from other algorithms or from the literature. The non-null regressors in the gene association network will help fix the regressors to be estimated in the final matrix A. Two ways are supplied to use the non-zero information, namely forward and backward approaches. In the "backward" pattern, only the non-zero positions in the prior gene association network will be used as regressors in A. While in the "forward" pattern, both these non-zero positions and some other possible positions (depending on restk ) in A are used as regressors. NTW(, restk, topd, topk = NULL, P.known = NULL, cflag, pred.net = NULL, sup.drop = -1, nump = NULL, no restk topd topk Gene expression data, a matrix with genes as rows and perturbations as columns. A vector (length equals to nrow(a)) with each element to indicate the number of non-zero regressors in the corresponding row of A. A parameter in NTW algorithm for keeping the top topd combinations of nonzero regressors of a single row in A, see vignette for details. The number of possible targets of the perturbations, used for pre-estimate the perturbation targets matrix P. P.known A known P matrix with the same dimensions of. cflag pred.net sup.drop nump noiselevel A flag to tell the regression methods, "geo" for geometric mean method, "sse" for sum of square method and "ml" for maximum likelihood method. A matrix with the same dimensions of A for the prior gene association information. Default is NULL. An indication to show the pattern for using the prior gene association information. 1 for "forward" pattern and -1 for "backward" pattern. A number set to limit the possibilities that one gene will be targeted by perturbations. That is at most nump perturbations can directly perturb one gene. Only used in ml method, to indicate the noise level in each perturbed experiment. est.a est.p Estimated gene interaction matrix A, with genes as rows and columns. Estimated perturbation targets matrix P, with genes as rows and perturbations as columns.

8 P.preestimation Wei iao, Yin Jin, Darong Lai, inyi Yang,Yuanhua Liu, Christine Nardini References Applied method for the inference of gene networks: the bifidobacterium case. to be submitted ##NTW testing without prior gene association information, regression is done by "sse"## <-sos.data <-as.matrix() restk=rep(ncol()-1, nrow()) topd = round(0.6*nrow()) topk = round(0.5*nrow()) nump = round(0.25*nrow()) result<-ntw(, restk, topd, topk, P.known=NULL, cflag="sse", pred.net = NULL, sup.drop = -1,numP, noiselevel=0.1) result$est.a result$est.p ##NTW testing with prior gene association information, regression is done by "geo"## pred.net<-matrix(round(runif(nrow()*nrow(), min=0, max=1)), nrow(), nrow()) result<-ntw(, restk, topd, topk, P.known=NULL, cflag="geo", pred.net, sup.drop = -1,numP, noiselevel=0.1) result$est.a result$est.p P.preestimation Pre-estimation of the transcriptional pertubation targets matrix P Pre-estimate the potential transcriptional perturbation targets matrix P according to gene expression data. Those genes with the changes in top topk will be assumed as possible targets of the perturbations. P.preestimation(, topk) topk Gene expression data, a matrix with rows as genes and columns as experiments. The number of possible targets of the perturbations. A matrix with the same structure of or P.

patterns.priora 9 Wei iao, Yin Jin, Darong Lai, inyi Yang, Yuanhua Liu, Christine Nardini <-sos.data <-as.matrix() I<-P.preestimation(, topk= round(0.6*nrow())) I patterns.priora Approaches to use a priori known gene association information for a single row estimation in matrix A Given some information of the gene interaction network, for example the estimated gene association matrix pred.net by other algorithms or from literature, NTW can use this prior information to enhance acurracy. NTW offers two approaches to infer the gene network, i.e. forward and backward, on the base of pred.net. The former computes further edges than the ones in pred.net, while the latter prunes edges. backward(r,, pert, topd, restk, cflag, TA, noiselevel) forward(r,, pert, topd, restk, cflag, TA, noiselevel) r A number to indicate the row of A to be estimated when row r of P is fixed. Gene expression data, a matrix with genes as rows and perturbations as columns. pert Row r in P. topd A parameter in NTW algorithm for keeping the top topd combinations of nonzero regressors of row r in A, see vignette for details. restk The number of non-zero regressors for the estimation of row r in A. cflag A flag to identify the regression methods, "geo" for geometric mean method, "sse" for sum of square method and "ml" for maximum likelihood method. TA A vector including the indexes of non-zero elements in row r of the network containing a priori information, pred.net. noiselevel Only used in "ml" method, to indicate the noise level in each perturbed experiment. A.row A vector of estimated row r in A. Crt The minimum value from the objective function. Wei iao, Yin Jin, Darong Lai, inyi Yang,Yuanhua Liu, Christine Nardini

10 sos.data sos.data SOS pathway perturbation data Format RT-PCR data with 9 genes in SOS pathway of Escherichia coli perturbed. These 9 perturbed genes are observed in the RT-PCR experiments. sos.data is a data frame containing 9 rows for observed genes and 9 columns for perturbations. References T.S. Gardner, D.di Bernardo, D. Lorenz, and J.J. Collins. Inferring genetic networks and identifying compound mode of action via expression profiling. Science, 301(5629): 102-105, 2003. sos.data

Index Topic arith A.estimation.Srow, 3 AP.estimation.Srow, 4 comb.matrix, 5 NTW, 7 P.preestimation, 8 Topic datasets sos.data, 10 Topic multivariate methods.regression, 6 patterns.priora, 9 Topic package NTW-package, 2 A.estimation.Srow, 3 AP.estimation.Srow, 4 backward (patterns.priora), 9 comb.matrix, 5 forward (patterns.priora), 9 method.geo (methods.regression), 6 method.ml (methods.regression), 6 method.sse (methods.regression), 6 methods.regression, 6 NTW, 7 NTW-package, 2 P.preestimation, 8 patterns.priora, 9 sos.data, 10 11