Downloaded from jhs.mazums.ac.ir at 9: on Monday September 17th 2018 [ DOI: /acadpub.jhs ]

Similar documents
4. Score normalization technical details We now discuss the technical details of the score normalization method.

Introduction to Probability and Statistics

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

7.2 Inference for comparing means of two populations where the samples are independent

On split sample and randomized confidence intervals for binomial proportions

¼ ¼ 6:0. sum of all sample means in ð8þ 25

Radial Basis Function Networks: Algorithms

Hotelling s Two- Sample T 2

CHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression

Estimation of the large covariance matrix with two-step monotone missing data

Hypothesis Test-Confidence Interval connection

The Binomial Approach for Probability of Detection

General Linear Model Introduction, Classes of Linear models and Estimation

MULTIVARIATE SHEWHART QUALITY CONTROL FOR STANDARD DEVIATION

Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

SAS for Bayesian Mediation Analysis

STK4900/ Lecture 7. Program

Lower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Robustness of multiple comparisons against variance heterogeneity Dijkstra, J.B.

An Analysis of Reliable Classifiers through ROC Isometrics

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

STA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2

arxiv:cond-mat/ v2 25 Sep 2002

Finite Mixture EFA in Mplus

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

One-way ANOVA Inference for one-way ANOVA

START Selected Topics in Assurance

arxiv: v3 [physics.data-an] 23 May 2011

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling

Evaluating Process Capability Indices for some Quality Characteristics of a Manufacturing Process

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III

The one-sample t test for a population mean

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules

Completely Randomized Design

Linear diophantine equations for discrete tomography

Plotting the Wilson distribution

arxiv: v1 [physics.data-an] 26 Oct 2012

The Mathematics of Winning Streaks

GENERATING FUZZY RULES FOR PROTEIN CLASSIFICATION E. G. MANSOORI, M. J. ZOLGHADRI, S. D. KATEBI, H. MOHABATKAR, R. BOOSTANI AND M. H.

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation

On the Estimation Of Population Mean Under Systematic Sampling Using Auxiliary Attributes

Objectives. Estimating with confidence Confidence intervals.

3.4 Design Methods for Fractional Delay Allpass Filters

Cryptanalysis of Pseudorandom Generators

One step ahead prediction using Fuzzy Boolean Neural Networks 1

AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES

Monte Carlo Studies. Monte Carlo Studies. Sampling Distribution

On Optimization of Power Coefficient of HAWT

Pretest (Optional) Use as an additional pacing tool to guide instruction. August 21

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES

Adaptive estimation with change detection for streaming data

The Poisson Regression Model

Notes on Instrumental Variables Methods

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION Vol. III Stability Theory - Peter C. Müller

INTRODUCTION. Please write to us at if you have any comments or ideas. We love to hear from you.

An Investigation on the Numerical Ill-conditioning of Hybrid State Estimators

Analysis of M/M/n/K Queue with Multiple Priorities

Outline. Markov Chains and Markov Models. Outline. Markov Chains. Markov Chains Definitions Huizhen Yu

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.

Mathematical Efficiency Modeling of Static Power Converters

Universal Finite Memory Coding of Binary Sequences

Slides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide

Supplementary Materials for Robust Estimation of the False Discovery Rate

Monopolist s mark-up and the elasticity of substitution

Genetic Algorithms, Selection Schemes, and the Varying Eects of Noise. IlliGAL Report No November Department of General Engineering

Convex Optimization methods for Computing Channel Capacity

On Wald-Type Optimal Stopping for Brownian Motion

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS

Soci Data Analysis in Sociological Research. Homework 4 Computer Handout. Chapter 19 Confidence Intervals for Proportions

On the Chvatál-Complexity of Knapsack Problems

Uniform Law on the Unit Sphere of a Banach Space

Spectral Analysis by Stationary Time Series Modeling

Applied Mathematics and Computation

Estimation of Separable Representations in Psychophysical Experiments

Effective conductivity in a lattice model for binary disordered media with complex distributions of grain sizes

On Wrapping of Exponentiated Inverted Weibull Distribution

Asymptotic Properties of the Markov Chain Model method of finding Markov chains Generators of..

Churilova Maria Saint-Petersburg State Polytechnical University Department of Applied Mathematics

Background. GLM with clustered data. The problem. Solutions. A fixed effects approach

PERFORMANCE BASED DESIGN SYSTEM FOR CONCRETE MIXTURE WITH MULTI-OPTIMIZING GENETIC ALGORITHM

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014

Homogeneous and Inhomogeneous Model for Flow and Heat Transfer in Porous Materials as High Temperature Solar Air Receivers

Keywords: pile, liquefaction, lateral spreading, analysis ABSTRACT

Support Vector Machines versus Artificial Neural Networks for wood Dielectric Loss Factor estimation

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model

State Estimation with ARMarkov Models

Estimation of component redundancy in optimal age maintenance

On parameter estimation in deformable models

Transcription:

Iranian journal of health sciences 013; 1(): 56-60 htt://jhs.mazums.ac.ir Original Article Comaring Two Formulas of Samle Size Determination for Prevalence Studies Hamed Tabesh 1 *Azadeh Saki Fatemeh Pourmotahari 3 1-Assistant Professor of Biostatistics, Deartment of Biostatistics and Eidemiology, School of Health, Ahvaz Jundishaur University of Medical Sciences, Ahvaz, Iran -Assistant Professor of Biostatistics, Deartment of Biostatistics and Eidemiology, School of Health, Ahvaz Jundishaur University of Medical Sciences, Ahvaz, Iran 3- MSc Student of Biostatistics, Deartment of Biostatistics and Eidemiology, School of Health, Ahvaz Jundishaur University of Medical Sciences,Ahvaz,Iran Abstract * azadehsaki@yahoo.com Background and urose: Samle size and its determination is one of the most imortant roblems in health researches. Calculating samle size for revalence studies is one of the common questions of samle size toics. Minimum samle size with least comlexity is desirable in order to achieve the basic goal of these studies. This study aims to comare two formulas of samle size calculation for revalence researches and finally, to use the simlest formula to get the most aroriate samle size. Materials and Methods: Samle size for roortions: 0.9, 0.95, 0.99, 0.999 candidates of close to 1 roortions; 10-5, 10-4, 10-3, 10 -, 0.05, 0.1 candidates of close to 0, and roortions 0.3, 0.4, 0.5, 0.6, 0.7 candidates of close to 0.5 were calculated. For comaring n 1, n ; φ = n 1 n, it was comuted by R ackage (.10.1). Results: Comuted samle size by (f ) is lightly greater than samle size comuted by (f 1 ) and maximum value of φ index for comaring the two formulas equals 1. Conclusion: Results show that the calculated samle size by (f 1 ) is similar to what was obtained by (f ), though, according to its interretation and easy comutation,it is suggested for all values of. [Tabesh H. * Saki A. Pourmotahari F. Comaring two Formulas of Samle Size Determination for revalence studies. IJHS 013; 1():56-60] htt://jhs.mazums.ac.ir ey words: Samle Size Calculation, Prevalence Study, Calculation Procedure. 56

1. Introduction If observational and exerimental studies are designed effectively, valuable results would be obtained. Good lanning has many asects such as exact definition of roblem and method and enough samle size due to the goals. Enough samle size for a research is determined based on the tye and object of a study, statistical methods for analyzing and interreting, available data, validity and reliability for the generalized results by two general techniques: confidence interval and Bayesian methods (1). In simle term, samle size estimation means to estimate the minimum number of the samle for a study by using statistical methods based on secific situation, basic information and recision requirement, and under the remise of guaranteeing the reliability of the conclusion.() Calculated samle size should be large enough so that an effort of such magnitude to be of scientific significance & also be statistically significant. It is just as imortant, however, that the study not to be too large, where an effect of little scientific imortance is nevertheless statistically detectable (3). Although, for such an imortant issue, there is not a large amount of ublished literature, there are several aroaches for samle size. For examle, one can secify the desired width of a confidence interval and determine the samle size that achieves that goal or Bayesian aroach can be used where we otimize some utility function. For conducting samle size, statistical inference is based on estimating a arameter or testing a hyothesis which demand one tye of the study. When estimating an infinite oulation arameter such as revalence, incidence and chance of illness recurrence is desirable. Several text books (4-7) recommended using (f 1 ) and (f ). n 1 = Zα (1 ) (f d 1 ) n = Z α sin 1 d ( ) (1 ) (f ) Where n 1, n are the estimated minimum samle size d and d are the desired level of recision and estimated roortion of an attribute resent in the oulation. It is clear that z is the abscissa of the normal curve that cuts off an area α at the fails. (1- α equals the desired confidence level). Simlicity is one of the greatest roerties of each statistical aroach. Denominator of (f 1 ) equals a half width of confidence interval which shows the estimation recision. So it is easy to understand and interret. Also calculating samle size by (f 1 ) is simle and does not have any comlexity. But unlike the denominator of (f ), to the best of our knowledge, there is not any interretation and comuting samle size by (f ) which is not as convenient as comuting by (f 1 ). some literatures recommended using (f 1 ) when is close to 0.5 and (f ) when is close to 0 or 1(6,8-9). In this study, comaring results of using (f 1 ) and (f ) in the same situation was desirable so if ossible, an alternative formula for (f ) could be suggested. IJHS 013; 1(): 57

. Materials and Methods As mentioned before, several literatures recommended using (f ) when crude estimation of revalence, incidence, and success roortion or illness recurrence robability is close to 0 or 1. In this study, to comare the oututs of (f 1 ) and (f ) samle size for roortions: 0.9, 0.95, 0.99, 0.999 as candidates of close to 1 and, 10-5, 10-4, 10-3, 10 -, 0.05, 0.1 as candidates of close to 0 and 0.3, 0.4, 0.5, 0.6, 0.7 as candidates of close to 0.5 considered. Since d < *min {, 1- } where < ¼. So k equal to 10%, 15%, 0%, and 5% have been considered. Whereas softwares for samle size determination such as PASS, UnifyPow and Power and Precision determine samle size based on (f 1 ) so for comaring (f ),ackage R (.10.1) was used and φ index (φ = n 1 n ) was comuted (10-1). As the most common confidence intervals in medical research cases are 95% and 99%, samle size was comuted for these confidence intervals. 3. Results Samle size for different estimated roortions in oulation,, and distinct values of k, was comuted by (f 1 ) and (f ). N 1, n are estimated minimum samle size by (f 1 ) and (f ) resectively. For comaring n 1, n, index φ = n 1 n was calculated. Comuted φfor close to 0, 1 and 0.5 with 0.95 confidence limit are shown in tables (1), () and (3) resectively. The findings of this study show that n 1 and n were very close to each other when estimated roortion of an attribute resent in the oulation was very small. Table1. Calculated φ index by (f 1 ) and (f ) for very small values of when confidence limit is 95% 10-5 10-4 10-3 10-0.05 0.10 10% 1.000000 1.000000 1.000003 1.000034 1.000175 1.000371 15% 1.000000 1.000001 1.000008 1.000076 1.000395 1.000834 0% 1.000000 1.000001 1.000013 1.000135 1.000703 1.001485 5% 1.000000 1.00000 1.00001 1.00011 1.001098 1.0033 Table. Calculated φ index by (f 1 ) and (f ) for very large values of when confidence limit is 95% 0.9 0.95 0.99 0.995 0.999 10% 1.000371 1.000175 1.000034 1.000017 1.000003 15% 1.000834 1.000395 1.000076 1.000076 1.000008 0% 1.001485 1.000703 1.000135 1.000067 1.000013 5% 1.0033 1.001098 1.00011 1.000105 1.00001 IJHS 013; 1(): 58

Table3. Calculated φ index by (f 1 ) and (f ) for mid- values of when confidence limit is 95% When was close to 1, φ index was aroximately equal to 1 which means that n 1 and n were similar. For mid-values of, similar conclusion could be drawn. The results revealed that both formulas, (f 1 ) and (f ), would erform similarly when the estimated roortion of an attribute resent in the oulation had very small, medium and very large values. The comutations were done for 99% confidence limit, and the results were closely similar. 4. Discussion In medical research, it is imortant to determine size sufficiently enough to ensure reliable conclusions. On the other hand, revalence studies are interesting research cases in medical sciences and consequently, adequate samle size for these research cases would be interesting, too. The most common formulas for revalence or incidence studies are (f1) and (f). (5-7). (f1) has simle structure made u of (f) so it has more ublic 0.3 0.4 0.5 0.6 0.7 10% 1.00143 1.0030 1.003351 1.0030 1.00143 15% 1.00331 1.005040 1.007591 1.005040 1.00331 0% 1.005767 1.009018 1.01365 1.009018 1.005767 5% 1.009058 1.01406 1.01557 1.01406 1.009058 interest than (f). But some literatures limited using (f1). They believe that (f1) could be useful when is not to be close to 1 or 0. This study showed that the estimated samle size by (f1) is aroximately similar to the estimated samle size by (f). On the other hand, comuting samle size by (f1) is easy. Therefore, using (f1) is recommended for estimating samle size of revalence or incidence studies for infinite oulation References 1. Lenth RV. Some ractical Guidelines Effective samle size Determination. The American Statistical, 001; 55(3): 187-193.. HU LP, Bao XL, Zhou SG, Guas X, Estimation of samle size and testing ower (art1). J chin Integr Med. 011;9(10): 1070-1074. 3. Sathian B, sreedharan y, Baboo NS, Sharan k, Abhilash ES, Rajesh E. Relevance of samle size Determination in Medical Researche., Neal Journal of Eidemiology, 010;1(1): 4-10. IJHS 013; 1(): 59

4. Daniel WW. Biostatistics: A Foundation for Analysis in the Health Sciences.7th edition. NewYork: John Wiley & Sons. 5. Lwanga sk and Lemeshow S. Samle size determination in Health Studies: A ractical Manual. Genevai World Health Organization.1999. 6. Cochran WG. Samling Techniques, 3 rd edition. NewYork: John Wiley & Sons.1997. 7. Naing L,Winn T and Rusli BN. Samle size calculator for revalence studies. Archives of Orofacial Sciences, 006;1:9-14 8. reamer, H.C, and Thiemann, S. How many subjects & statistical ower Analysis in Research, Newbury ark, CA: Sage Publications. 1987. 9. Desu MM, Raghavao D. samle size methodology. Boston, MA: Academic ress, Inc.1990 10. Hintze, J. PASS 000, aysville, UT: Number Cruncher Statistical System, Software for MS-DOS systems.001 11. O Brien R.G. Unifyow.sas: version 98.08.5, Deartment of Biostatistics and Eidemiology, Cleveland Clinic Foundation, Cleveland, OH.1998. 1. Borenstein, M., Rothstein, H., and Cohen, J. Power and Precision, Biostat, Teaneck, NY: software for MS-DOS systems.1997. IJHS 013; 1(): 60