Package msir. R topics documented: April 7, Type Package Version Date Title Model-Based Sliced Inverse Regression

Similar documents
Package CEC. R topics documented: August 29, Title Cross-Entropy Clustering Version Date

Package leiv. R topics documented: February 20, Version Type Package

Package depth.plot. December 20, 2015

Package darts. February 19, 2015

Package NonpModelCheck

Package CovSel. September 25, 2013

Package mpmcorrelogram

Package basad. November 20, 2017

The nltm Package. July 24, 2006

Package Grace. R topics documented: April 9, Type Package

Package flexcwm. R topics documented: July 11, Type Package. Title Flexible Cluster-Weighted Modeling. Version 1.0.

Package sscor. January 28, 2016

Package misctools. November 25, 2016

Package flexcwm. R topics documented: July 2, Type Package. Title Flexible Cluster-Weighted Modeling. Version 1.1.

Package gma. September 19, 2017

Package idmtpreg. February 27, 2018

Package sklarsomega. May 24, 2018

Package clogitboost. R topics documented: December 21, 2015

Package pearson7. June 22, 2016

Package CorrMixed. R topics documented: August 4, Type Package

Package FDRreg. August 29, 2016

Package EDR. September 14, 2016

Package clustergeneration

Package BayesNI. February 19, 2015

Package esaddle. R topics documented: January 9, 2017

Package face. January 19, 2018

Package bayeslm. R topics documented: June 18, Type Package

Package symmoments. R topics documented: February 20, Type Package

Package ExtremeBounds

The panel Package. June 10, Description Functions and datasets for fitting models to Panel data. dclike.panel

Package beam. May 3, 2018

The dr Package. R topics documented: September 1, Version Date 2005/08/31. Title Methods for dimension reduction for regression

Package survidinri. February 20, 2015

Package mixture. R topics documented: February 13, 2018

Package CoxRidge. February 27, 2015

Package jmcm. November 25, 2017

Package AID. R topics documented: November 10, Type Package Title Box-Cox Power Transformation Version 2.3 Date Depends R (>= 2.15.

Package EBMAforecast

Package KRLS. July 10, 2017

Package spcov. R topics documented: February 20, 2015

Package EMVS. April 24, 2018

Package rbvs. R topics documented: August 29, Type Package Title Ranking-Based Variable Selection Version 1.0.

Package sodavis. May 13, 2018

Package plw. R topics documented: May 7, Type Package

Package PolynomF. May 17, 2018

Package abundant. January 1, 2017

Finite Mixture Model Diagnostics Using Resampling Methods

Package effectfusion

Package ICBayes. September 24, 2017

Package elhmc. R topics documented: July 4, Type Package

Package unbalhaar. February 20, 2015

Package rgabriel. February 20, 2015

Package ltsbase. R topics documented: February 20, 2015

Package TSPred. April 5, 2017

Package denseflmm. R topics documented: March 17, 2017

Package GPfit. R topics documented: April 2, Title Gaussian Processes Modeling Version Date

Package TwoStepCLogit

Package threg. August 10, 2015

Package ARCensReg. September 11, 2016

Package horseshoe. November 8, 2016

Package FDRSeg. September 20, 2017

Package rnmf. February 20, 2015

The cts Package. June 23, 2006

Package BNN. February 2, 2018

Package BootWPTOS. R topics documented: June 30, Type Package

Package bpp. December 13, 2016

Package fwsim. October 5, 2018

Package scio. February 20, 2015

Package dhsic. R topics documented: July 27, 2017

Package SpatPCA. R topics documented: February 20, Type Package

Package EMMIXmcfa. October 18, 2017

Package MultisiteMediation

Package mistwosources

Package MAVE. May 20, 2018

Package pfa. July 4, 2016

The evdbayes Package

Package LIStest. February 19, 2015

Package matrixnormal

Package blme. August 29, 2016

Package SpatialNP. June 5, 2018

Package MonoPoly. December 20, 2017

Package multivariance

Package EMMIXmfa. October 18, Index 15

Package LCAextend. August 29, 2013

Package ShrinkCovMat

Package convospat. November 3, 2017

Package OGI. December 20, 2017

Package BGGE. August 10, 2018

Package DiscreteLaplace

Package pendensity. March 13, 2016

Package gk. R topics documented: February 4, Type Package Title g-and-k and g-and-h Distribution Functions Version 0.5.

The ICS Package. May 4, 2007

Package QuantifQuantile

Package lomb. February 20, 2015

Package EMMIX. November 8, 2013

Package netcoh. May 2, 2016

Package bivariate. December 10, 2018

Package RootsExtremaInflections

Package severity. February 20, 2015

Package Select. May 11, 2018

Transcription:

Type Package Version 1.3.1 Date 2016-04-07 Title Model-Based Sliced Inverse Regression Package April 7, 2016 An R package for dimension reduction based on finite Gaussian mixture modeling of inverse regression. Author Maintainer Depends R (>= 3.0), mclust (>= 5.0) Imports stats, utils, graphics, grdevices, rgl License GPL (>= 2) Repository CRAN ByteCompile true LazyLoad yes NeedsCompilation no Date/Publication 2016-04-07 14:06:05 R topics documented: -package........................................ 2 loess.sd........................................... 2............................................. 4.bic........................................... 6.dir........................................... 8.nslices......................................... 9.permutation.test.................................... 9.regularizedSigma................................... 10.slices.......................................... 11 plot............................................ 12 summary......................................... 13 Index 15 1

2 loess.sd -package Model-based Sliced Inverse Regression (MSIR) An R package that implements MSIR, a dimension reduction method based on Gaussian finite mixture models. The basis of the subspace is estimated by modeling the inverse distribution within slice using finite mixtures of Gaussians, with number of components and covariance matrix parameterization selected by BIC or defined by the user. The method provides an extension to sliced inverse regression (SIR) and allows to overcome the main limitation of SIR, i.e., the failure in the presence of regression symmetric relationships, without the need to impose further assumptions. References Scrucca, L. (2011) Model-based SIR for dimension reduction. Computational Statistics & Data Analysis, 55(11), 3010-3026. loess.sd Local Polynomial Regression Fitting with Variability bands Nonparametric estimation of mean function with variability bands. loess.sd(x, y = NULL, nsigma = 1,...) panel.loess(x, y, col = par("col"), bg = NA, pch = par("pch"), cex = 1, col.smooth = "red", span = 2/3, degree = 2, nsigma = 1,...)

loess.sd 3 x a vector of values for the predictor variable x. y a vector of values for the response variable y. nsigma a multiplier for the standard deviation function. col, bg, pch, cex numeric or character codes for the color(s), point type and size of points; see also par. col.smooth span degree color to be used by lines for drawing the smooths. smoothing parameter for loess. the degree of the polynomials to be used, see loess.... further argument passed to the function loess. The function loess.sd computes the loess smooth for the mean function and the mean plus and minus k times the standard deviation function. The function panel.loess can be used to add to a scatterplot matrix panel a smoothing of mean function using loess with variability bands at plus and minus nsigmas times the standard deviation. References Weisberg, S. (2005) Applied Linear Regression, 3rd ed., Wiley, New York, pp. 275-278. loess Examples data(cars) plot(cars, main = "lowess.sd(cars)") lines(l <- loess.sd(cars)) lines(l$x, l$upper, lty=2) lines(l$x, l$lower, lty=2)

4 Model-based Sliced Inverse Regression (MSIR) A dimension reduction method based on Gaussian finite mixture models which provides an extension to sliced inverse regression (SIR). The basis of the subspace is estimated by modeling the inverse distribution within slice using Gaussian finite mixtures with number of components and covariance matrix parameterization selected by BIC or defined by the user. (x, y, nslices =.nslices, slice.function =.slices, modelnames = NULL, G = NULL, cov = c("mle", "regularized"),...) x y nslices a (n p) design matrix containing the predictors data values. a (n 1) vector of data values for the response variable. It can be a numeric vector (regression) but also a factor (classification). In the latter case, the levels of the factor define the slices used. the number of slices used, unless y is a factor. By default the value returned by.nslices. slice.function the slice functions to be used, by default.slices, but the user can provide a different slicing function. modelnames G cov a vector of character strings indicating the Gaussian mixture models to be fitted as described in mclustmodelnames. If a vector of strings is given they are used for all the slices. If a list of vectors is provided then each vector refers to a single slice. an integer vector specifying the numbers of mixture components used in fitting Gaussian mixture models. If a list of vectors is provided then each vector refers to a single slice. The predictors marginal covariance matrix. Possible choices are: "mle" = maximum likelihood estimate "regularized" = regularized covariance matrix (see.regularizedsigma) R matrix = a (p p) user defined covariance matrix... other arguments passed to.compute. Returns an object of class with attributes: call the function call.

5 x y slice.info mixmod loglik f mu sigma M the design matrix. the response vector. output from slicing function. a list of finite mixture model objects as described in mclustmodel. the log-likelihood for the mixture models. a vector of length equal to the total number of mixture components containing the fraction of observations in each fitted component within slices. a matrix of component within slices predictors means. the marginal predictors covariance matrix. the kernel matrix. evalues the eigenvalues from the generalized eigen-decomposition of M. evectors basis std.basis numdir dir the raw eigenvectors from the generalized eigen-decomposition of M ordered according to the eigenvalues. the normalized eigenvectors from the generalized eigen-decomposition of M ordered according to the eigenvalues. standardized basis vectors obtained by multiplying each coefficient of the eigenvectors by the standard deviation of the corresponding predictor. The resulting coefficients are scaled such that all predictors have unit standard deviation. the maximal number of directions estimated. the estimated MSIR directions from mean-centered predictors. References Scrucca, L. (2011) Model-based SIR for dimension reduction. Computational Statistics & Data Analysis, 55(11), 3010-3026. summary., plot., dr Examples # 1-dimensional simple regression n = 200; p = 5 b = as.matrix(c(1,-1,rep(0,p-2))) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = exp(0.5 * x%*%b) + 0.1*rnorm(n) MSIR = (x, y) summary(msir) plot(msir, type = "2Dplot") # 1-dimensional symmetric response curve

6.bic n = 200; p = 5 b = as.matrix(c(1,-1,rep(0,p-2))) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = (0.5 * x%*%b)^2 + 0.1*rnorm(n) MSIR = (x, y) summary(msir) plot(msir, type = "2Dplot") plot(msir, type = "coefficients") # 2-dimensional response curve n = 300; p = 5 b1 = c(1, 1, 1, rep(0, p-3)) b2 = c(1,-1,-1, rep(0, p-3)) b = cbind(b1,b2) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = x %*% b1 + (x %*% b1)^3 + 4*(x %*% b2)^2 + rnorm(n) MSIR = (x, y) summary(msir) plot(msir, which = 1:2) plot(msir, type = "spinplot") plot(msir, which = 1, type = "2Dplot", span = 0.7) plot(msir, which = 2, type = "2Dplot", span = 0.7).bic BIC-type criterion for dimensionality BIC-type criterion for selecting the dimensionality of a dimension reduction subspace..bic(object, type = 1, plot = FALSE) bicdimred(m, x, nslices, type = 1, tol = sqrt(.machine$double.eps)) object plot M x type nslices tol a object if TRUE a plot of the criterion is shown. the kernel matrix. See details below. the predictors data matrix. See details below. See details below. the number of slices. See details below. a tolerance value

.bic 7 Details This BIC-type criterion for the determination of the structural dimension selects d as the maximizer of G(d) = l(d) P enalty(p, d, n) where l(d) is the log-likelihood for dimensions up to d, p is the number of predictors, and n is the sample size. The term P enalty(p, d, n) is the type of penalty to be used: type = 1 P enalty(p, d, n) = (p d) log(n) type = 2 P enalty(p, d, n) = 0.5Cd(2p d + 1) where C = (0.5 log(n) + 0.1n ( 1/3))/2nslices/n type = 3 P enalty(p, d, n) = 0.5Cd(2p d + 1) where C = log(n)nslices/n type = 3 P enalty(p, d, n) = 1/2d log(n) Returns a list with components: evalues l crit d eigenvalues log-likelihood BIC-type criterion selected dimensionality The.bic also assign the above information to the corresponding object. References Zhu, Miao and Peng (2006) "Sliced Inverse Regression for CDR Space Estimation", JASA. Zhu, Zhu (2007) "On kernel method for SAVE", Journal of Multivariate Analysis. Examples # 1-dimensional symmetric response curve n = 200; p = 5 b = as.matrix(c(1,-1,rep(0,p-2))) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = (0.5 * x%*%b)^2 + 0.1*rnorm(n) MSIR = (x, y).bic(msir, plot = TRUE) summary(msir).bic(msir, type = 3, plot = TRUE) summary(msir)

8.dir.dir Model-based Sliced Inverse Regression directions MSIR estimates a set of up to p orthogonal direction vectors each of length p, the first d of which are estimates of the basis of the dimensional reduction subspace..dir(object, numdir = object$numdir) object numdir a object the number of basis vectors to return. The function returns the estimated directions in the original n dimensional space for plotting. References Scrucca, L. (2011) Model-based SIR for dimension reduction. Computational Statistics & Data Analysis, 55(11), 3010-3026. {} Examples # 1-dimensional simple regression n = 200; p = 5 b = as.matrix(c(1,-1,rep(0,p-2))) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = exp(0.5 * x%*%b) + 0.1*rnorm(n) MSIR = (x, y).dir(msir, numdir = 1)

.nslices 9.nslices Default number of slices This function computes a Sturges type number of slices to be used as default in the function..nslices(n, p) n p the number of observations in the sample. the number of predictors in the sample. The function returns a single value, i.e. the number of slices..permutation.test Permutation test for dimensionality Approximates marginal dimension test significance levels by sampling from the permutation distribution..permutation.test(object, npermute = 99, numdir = object$numdir, verbose = TRUE) object npermute numdir verbose a object. number of permutations to compute. maximum value of the dimension to test. if TRUE a textual progress bar is shown during computation.

10.regularizedSigma Details The function approximates significance levels of the marginal dimension tests based on a permutation test. The function returns a list with components: summary npermute a table containing the hypotheses, the test statistics, the permutation p-values. the number of permutations used. Furthermore, it also assigns the above information to the corresponding object. References Scrucca, L. (2011) Model-based SIR for dimension reduction. Computational Statistics & Data Analysis, 55(11), 3010-3026. dr Examples ## Not run: # 1-dimensional simple regression n = 200; p = 5 b = as.matrix(c(1,-1,rep(0,p-2))) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = exp(0.5 * x%*%b) + 0.1*rnorm(n) MSIR = (x, y).permutation.test(msir) summary(msir) ## End(Not run).regularizedsigma Regularized estimate of predictors covariance matrix. This function computes a regularized version of the covariance matrix of the predictors. Among the possible models the one which maximizes BIC is returned.

.slices 11.regularizedSigma(x, inv = FALSE, model = c("xii", "XXI", "XXX")) x inv model the predictors data matrix. if TRUE the inverse of the estimated covariance matrix is returned. available models: XII = diagonal equal variances XXI = diagonal unequal variances XXX = full covariance matrix A (p p) covariance matrix estimate..slices Slice a vector into slices of approximately equal size Function used for slicing a continuous response variable..slices(y, nslices) y nslices a vector of n values the number of slices, no larger than n Returns a list with components: slice.indicator an indicator variable for the slices. nslices the actual number of slices produced. slice.sizes the number of observations in each slice.

12 plot. plot. Plot method for objects. Plots directions and other information from MSIR estimation. ## S3 method for class plot(x, which, type = c("pairs", "2Dplot", "spinplot", "evalues", "coefficients"), span = NULL, std = TRUE, ylab, xlab, restore.par = TRUE,...) x which type span std ylab xlab restore.par a object. a vector of value(s) giving the directions for which the plot should be drawn. the type of plot to be drawn. the span of smoother (only for type = "pairs" "2Dplot"). if TRUE coefficients are standardized (only for type = "coefficients"). a character string for the y-axis label. a character string for the x-axis label. if TRUE the graphical parameters (see par) changed are restored to the previous state. If you want to manipulate the resulting plot you should set restore.par = FALSE.... additional arguments. References Scrucca, L. (2011) Model-based SIR for dimension reduction. Computational Statistics & Data Analysis, 55(11), 3010-3026.

summary. 13 Examples # 2-dimensional response curve n = 300; p = 5 b1 = c(1, 1, 1, rep(0, p-3)) b2 = c(1,-1,-1, rep(0, p-3)) b = cbind(b1,b2) x = matrix(rnorm(n*p), nrow = n, ncol = p) y = x %*% b1 + (x %*% b1)^3 + 4*(x %*% b2)^2 + rnorm(n) MSIR = (x, y) summary(msir) plot(msir) plot(msir, which = 1:2) plot(msir, type = "2Dplot", which = 1, span = 0.7) plot(msir, type = "2Dplot", which = 2, span = 0.7) ## Not run: plot(msir, type = "spinplot") plot(msir, type = "evalues") plot(msir, type = "coefficients") summary. Summary and print methods for objects Summary and print methods for objects. ## S3 method for class summary(object, numdir = object$numdir, std = FALSE, verbose = TRUE,...) ## S3 method for class summary. print(x, digits = max(5, getoption("digits") - 3),... ) object numdir std verbose x digits a object the number of directions to be shown. if TRUE the coefficients basis are scaled such that all predictors have unit standard deviation. if FALSE the coefficients basis are omitted; by default verbose = TRUE. a summary. object. the significant digits to use.... additional arguments.

14 summary.

Index Topic dplot plot., 12 Topic htest.permutation.test, 9 Topic loess loess.sd, 2 Topic multivariate, 4 Topic package -package, 2 Topic regression, 4.bic, 6.dir, 8.permutation.test, 9 plot., 12 bicdimred (.bic), 6 dr, 5, 10 loess, 3 loess.sd, 2 mclustmodel, 5 mclustmodelnames, 4, 2, 4, 7, 9, 11, 12, 14 -package, 2.bic, 6.dir, 8.nslices, 4, 9.permutation.test, 9.regularizedSigma, 4, 10.slices, 4, 11 panel.loess (loess.sd), 2 par, 3, 12 plot., 5, 12 print. (), 4 print.summary. (summary.), 13 summary., 5, 13 15