Package locfdr. July 15, Index 5

Similar documents
The locfdr Package. August 19, hivdata... 1 lfdrsim... 2 locfdr Index 5

Package FDRreg. August 29, 2016

Package aspi. R topics documented: September 20, 2016

Tweedie s Formula and Selection Bias. Bradley Efron Stanford University

Package jmcm. November 25, 2017

Package BNN. February 2, 2018

Package polypoly. R topics documented: May 27, 2017

Package invgamma. May 7, 2017

Package severity. February 20, 2015

Package BayesNI. February 19, 2015

Package beam. May 3, 2018

Package SEMModComp. R topics documented: February 19, Type Package Title Model Comparisons for SEM Version 1.0 Date Author Roy Levy

Package depth.plot. December 20, 2015

Package Tcomp. June 6, 2018

Package gtheory. October 30, 2016

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

Package scpdsi. November 18, 2018

Package MultisiteMediation

Correlation, z-values, and the Accuracy of Large-Scale Estimators. Bradley Efron Stanford University

Package ChIPtest. July 20, 2016

Package sscor. January 28, 2016

Package noncompliance

Package RootsExtremaInflections

Package pearson7. June 22, 2016

Package effectfusion

Package CCP. R topics documented: December 17, Type Package. Title Significance Tests for Canonical Correlation Analysis (CCA) Version 0.

Package SK. May 16, 2018

Package FinCovRegularization

Package bivariate. December 10, 2018

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data

Package leiv. R topics documented: February 20, Version Type Package

Package clogitboost. R topics documented: December 21, 2015

Package goftest. R topics documented: April 3, Type Package

Package unbalhaar. February 20, 2015

Package mlmmm. February 20, 2015

Package varband. November 7, 2016

Package HarmonicRegression

Package ShrinkCovMat

Package RTransferEntropy

Package scio. February 20, 2015

Package gk. R topics documented: February 4, Type Package Title g-and-k and g-and-h Distribution Functions Version 0.5.

Package NonpModelCheck

Package generalhoslem

Package ssanv. June 23, 2015

Package NlinTS. September 10, 2018

Package sindyr. September 10, 2018

Package reinforcedpred

Package MicroMacroMultilevel

Package dhsic. R topics documented: July 27, 2017

Package darts. February 19, 2015

Package DNMF. June 9, 2015

Package hot.deck. January 4, 2016

Package SpatialPack. March 20, 2018

Package sdprisk. December 31, 2016

Package plw. R topics documented: May 7, Type Package

Package RI2by2. October 21, 2016

Package factorqr. R topics documented: February 19, 2015

Package Delaporte. August 13, 2017

Package rbvs. R topics documented: August 29, Type Package Title Ranking-Based Variable Selection Version 1.0.

Package coop. November 14, 2017

Package bhrcr. November 12, 2018

Package pfa. July 4, 2016

Package AID. R topics documented: November 10, Type Package Title Box-Cox Power Transformation Version 2.3 Date Depends R (>= 2.15.

Package supernova. February 23, 2018

Package Perc. R topics documented: December 5, Type Package

Package ltsbase. R topics documented: February 20, 2015

Package penmsm. R topics documented: February 20, Type Package

Package dhga. September 6, 2016

Package bigtime. November 9, 2017

Package vhica. April 5, 2016

Package milineage. October 20, 2017

Package CEC. R topics documented: August 29, Title Cross-Entropy Clustering Version Date

Package plspolychaos

Package rgabriel. February 20, 2015

Package r2glmm. August 5, 2017

Package pssm. March 1, 2017

Package ICGOR. January 13, 2017

Manual: R package HTSmix

Package basad. November 20, 2017

Package metafuse. October 17, 2016

Package MonoPoly. December 20, 2017

Package jmuoutlier. February 17, 2017

Package SpatialNP. June 5, 2018

Package multivariance

Package jaccard. R topics documented: June 14, Type Package

Package robustrao. August 14, 2017

Package funrar. March 5, 2018

Package HIMA. November 8, 2017

Package BNSL. R topics documented: September 18, 2017

Package LIStest. February 19, 2015

Package ARCensReg. September 11, 2016

Package EMVS. April 24, 2018

Package CoxRidge. February 27, 2015

Package covsep. May 6, 2018

Package EnergyOnlineCPM

Package IGG. R topics documented: April 9, 2018

Package lmm. R topics documented: March 19, Version 0.4. Date Title Linear mixed models. Author Joseph L. Schafer

Package blme. August 29, 2016

Package ForwardSearch

Package face. January 19, 2018

Transcription:

Version 1.1-8 Title Computes Local False Discovery Rates Package locfdr July 15, 2015 Maintainer Balasubramanian Narasimhan <naras@stat.stanford.edu> License GPL-2 Imports stats, splines, graphics Computation of local false discovery rates. NeedsCompilation no Author Bradley Efron [aut], Brit Turnbull [aut], Balasubramanian Narasimhan [aut, cre], Korbinian Strimmer [ctb] Repository CRAN Date/Publication 2015-07-15 01:48:23 R topics documented: hivdata............................................ 1 lfdrsim............................................ 2 locfdr............................................ 2 Index 5 hivdata HIV data set The data comprises 7680 z-values, each relating to a two-sample t-test. The test compares gene expression values for 4 HIV patients with values for 4 normal subjects; the t-score T[i] for gene i has been transformed to a normal scale, z[i] = qnorm(pt(t[i], df=6)), so that the z[i] s theoretically would have a standard N(0, 1) distribution under the null hypothesis. The original experiment is described in van t Wout et. al. (2003). 1

2 locfdr data(hivdata) Format A vector containing 7680 z-values References van t Wout, et. al., Cellular gene expression upon human immuno-deficiency virus type 1 infection of CD4+-T-Cell lines, Journal ofvirology 77, 1392-1402. lfdrsim Simulated data set for locfdr A simulated dataset that involves 2000 "genes", each of which has yielded a test statistic "zex", with zex[i] N(mu[i], 1) (independently for i = 1, 2,...2000.) The data comprises 2000 µ i values and 2000 z-values. data(lfdrsim) Format A matrix of 2000 rows and 2 columns containing mu and the z-score values (zex) locfdr Local False Discovery Rate Calculation Compute local false discovery rates, following the definitions and description in references listed below. locfdr(zz, bre = 120, df = 7, pct = 0, pct0 = 1/4, nulltype = 1, type = 0, plot = 1, mult, mlests, main = " ", sw = 0)

locfdr 3 Arguments zz bre df pct pct0 nulltype type plot mult mlests main A vector of summary statistics, one for each case under simultaneous consideration. The calculations assume a large number of cases, say length(zz) exceeding 200. Results may be improved by transforming zz so that its elements are theoretically distributed as N(0, 1) under the null hypothesis. See the locfdr vignette for tips on creating zz. Number of breaks in the discretization of the z-score axis, or a vector of breakpoints fully describing the discretization. If length(zz) is small, such as when the number of cases is less than about 1000, set bre to a number lower than the default of 120. Degrees of freedom for fitting the estimated density f(z). Excluded tail proportions of zz s when fitting f(z). pct=0 includes full range of zz s. pct can also be a 2-vector, describing the fitting range. Proportion of the zz distribution used in fitting the null density f 0(z) by central matching. If a 2-vector, e.g. pct0=c(0.25,0.60), the range [pct0[1], pct0[2]] is used. If a scalar, [pct0, 1-pct0] is used. Type of null hypothesis assumed in estimating f0(z), for use in the fdr calculations. 0 is the theoretical null N(0, 1), 1 is maximum likelihood estimation, 2 is central matching estimation, 3 is a split normal version of 2. Type of fitting used for f; 0 is a natural spline, 1 is a polynomial, in either case with degrees of freedom df [so total degrees of freedom including the intercept is df+1.] Plots desired. 0 gives no plots. 1 gives single plot showing the histogram of zz and fitted densities f and p0 f0. 2 also gives plot of fdr, and the right and left tail area Fdr curves. 3 gives instead the f1 cdf of the estimated fdr curve; plot=4 gives all three plots. Optional scalar multiple (or vector of multiples) of the sample size for calculation of the corresponding hypothetical Efdr value(s). Optional vector of initial values for (delta0, sigma0) in the maximum likelihood iteration. Main heading for the histogram plot when plot>0. sw Determines the type of output desired. 2 gives a list consisting of the last 5 values listed under Value below. 3 gives the square matrix of dimension bre-1 representing the influence function of log(fdr). Any other value of sw returns a list consisting of the first 5 (6 if mult is supplied) values listed below. Details See the locfdr vignette for details and tips. Value fdr the estimated local false discovery rate for each case, using the selected type and nulltype.

4 locfdr fp0 Efdr the estimated parameters delta (mean of f0), sigma (standard deviation of f0), and p0, along with their standard errors. the expected false discovery rate for the non-null cases, a measure of the experiment s power as described in Section 3 of the second reference. Overall Efdr and right and left values are given, both for the specified nulltype and for nulltype 0. If nulltype==0, values are given for nulltypes 1 and 0. cdf1 a 99x2 matrix giving the estimated cdf of fdr under the non-null distribution f1. Large values of the cdf for small fdr values indicate good power; see Section 3 of the second reference. Set plot to 3 or 4 to see the cdf1 plot. mat A matrix of estimates of f(x), f0(x), fdr(x), etc. at the bre 1 midpoints "x" of the break discretization, convenient for comparisons and plotting. Details are in the locfdr vignette. z.2 the interval along the zz-axis outside of which $fdr(z)<0.2$, the locations of the yellow triangles in the histogram plot. If no elements of zz on the left or right satisfy the criterion, the corresponding element of z.2 is NA. call mult pds x f pds. stdev Author(s) the function call. If the argument mult was supplied, vector of the ratios of hypothetical Efdr for the supplied multiples of the sample size to Efdr for the actual sample size. The estimates of p0, delta, and sigma. The bin midpoints. The values of f(z) at the bin midpoints. The derivative of the estimates of p0, delta, and sigma with respect to the bin counts. The delta-method estimates of the standard deviations of the p0, delta, and sigma estimates. Bradley Efron, Brit B. Turnbull, and Balasubramanian Narasimhan References Efron, B. (2004) "Large-scale simultaneous hypothesis testing: the choice of a null hypothesis", Jour Amer Stat Assoc, 99, pp. 96 104 Efron, B. (2006) "Size, Power, and False Discovery Rates" Efron, B. (2007) "Correlation and Large-Scale Simultaneous Significance Testing", Jour Amer Stat Assoc, 102, pp. 93 103 http://statweb.stanford.edu/~ckirby/brad/papers/ Examples ## HIV data example data(hivdata) w <- locfdr(hivdata)

Index Topic datasets hivdata, 1 lfdrsim, 2 Topic htest Topic models hivdata, 1 lfdrsim, 2 5