CHAPTER 38 DIMENSIONAL ANALYSIS - SPURIOUS CORRELATION

Similar documents
Examples of linear systems and explanation of the term linear. is also a solution to this equation.

Lab 5. Simple Pendulum

EQUATIONS FOR DISCHARGE CALCULATION IN COMPOUND CHANNELS HAVING HOMOGENEOUS ROUGHNESS * S. M. HOSSEINI **

MARK SCHEME for the November 2004 question paper 9709 MATHEMATICS 8719 HIGHER MATHEMATICS

PROOF OF POINCARE'S GEOMETRIC THEOREM

Lecture Notes #10. The "paradox" of finite strain and preferred orientation pure vs. simple shear

Lab 4. Friction. Goals. Introduction

STEP Support Programme. STEP 2 Matrices Topic Notes

Lab 3. Newton s Second Law

CHAPTER 12 THE EFFECT OF ROUGHNESS ON THE MASS-TRANSPORT OF PROGRESSIVE GRAVITY WAVES

EFFECT OF CHANNEL BENDS ON TRANSVERSE MIXING

Exponential and. Logarithmic Functions. Exponential Functions. Logarithmic Functions

Maths 212: Homework Solutions

Ordinary Differential Equations (ODEs)

Prediction of bed form height in straight and meandering compound channels

Mathematics Course 111: Algebra I Part I: Algebraic Structures, Sets and Permutations

VERTICAL SCALES AND SHEAR STRESSES IN WAVE BOUNDARY LAYERS OVER MOVABLE BEDS. Peter Nielsen & Paul A Guard

51.1 Direct proportionality

Fluid Mechanics Prof. S.K. Som Department of Mechanical Engineering Indian Institute of Technology, Kharagpur

Guo, James C.Y. (1999). "Critical Flow Section in a Collector Channel," ASCE J. of Hydraulic Engineering, Vol 125, No. 4, April.

Experimental Uncertainty (Error) and Data Analysis

Extreme Mixing Events in Rivers

Algebra II Vocabulary Alphabetical Listing. Absolute Maximum: The highest point over the entire domain of a function or relation.

6x 2 8x + 5 ) = 12x 8

arxiv:math-ph/ v2 16 Feb 2000

Chapter 13 - Inverse Functions

7.5 Partial Fractions and Integration

Dimensional Analysis and Similarity

MSM120 1M1 First year mathematics for civil engineers Revision notes 4

LINEAR SYSTEMS AND MATRICES

Lab 6. Current Balance

Chapter 11. Special Relativity

Experimental Uncertainty (Error) and Data Analysis

MEASUREMENTS AND ERRORS (OR EXPERIMENTAL UNCERTAINTIES)

ELEMENTARY LINEAR ALGEBRA

Achievement Level Descriptors Mathematics Algebra 1

c n (x a) n c 0 c 1 (x a) c 2 (x a) 2...

s and FE X. A. Flow measurement B. properties C. statics D. impulse, and momentum equations E. Pipe and other internal flow 7% of FE Morning Session I

A matrix over a field F is a rectangular array of elements from F. The symbol

Settling-velocity based criteria for incipient sediment motion

1.1 GRAPHS AND LINEAR FUNCTIONS

SCHOOL OF MATHEMATICS MATHEMATICS FOR PART I ENGINEERING. Self-paced Course

Transactions on Engineering Sciences vol 5, 1994 WIT Press, ISSN

SUBGROUPS OF CYCLIC GROUPS. 1. Introduction In a group G, we denote the (cyclic) group of powers of some g G by

Formation Of Hydraulic Jumps On Corrugated Beds

) sm wl t. _.!!... e -pt sinh y t. Vo + mx" + cx' + kx = 0 (26) has a unique tions x(o) solution for t ;?; 0 satisfying given initial condi

EXERCISES IN QUATERNIONS. William Rowan Hamilton. (The Cambridge and Dublin Mathematical Journal, 4 (1849), pp ) Edited by David R.

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

FORMATION OF HYDRAULIC JUMPS ON CORRUGATED BEDS

Sediment Transport in the Middle Odra River. Verification of Ackers-White s Formulae with Reference to Big Flows

ELEMENTARY LINEAR ALGEBRA

ANALYTIC TRANSFORMATIONS OF EVERYWHERE DENSE POINT SETS*

Dimensional and Model Analysis

Advanced Hydraulics Prof. Dr. Suresh A. Kartha Department of Civil Engineering Indian Institute of Technology, Guwahati

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

Similitude and Dimensional Analysis. CE Fluid Mechanics Diogo Bolster

College Algebra Through Problem Solving (2018 Edition)

Chapter 5. Linear Algebra. Sections A linear (algebraic) equation in. unknowns, x 1, x 2,..., x n, is. an equation of the form

Chapter 3. Rings. The basic commutative rings in mathematics are the integers Z, the. Examples

ABE Math Review Package

EXAMPLES (SEDIMENT TRANSPORT) AUTUMN 2018

Math 163: Lecture notes

(x) = lim. dx = f f(x + h) f(x)

Review of Integration Techniques

Chapter 5. Linear Algebra. A linear (algebraic) equation in. unknowns, x 1, x 2,..., x n, is. an equation of the form

ELEMENTARY LINEAR ALGEBRA

KOZENY-CARMAN EQUATION REVISITED. Jack Dvorkin Abstract

Fundamentals of Linear Algebra. Marcel B. Finan Arkansas Tech University c All Rights Reserved

Chapter 7 DIMENSIONAL ANALYSIS AND SIMILITUDE Because so few real flows can be solved exactly by analytical methods alone, the development of fluid

Advanced Heat and Mass Transfer by Amir Faghri, Yuwen Zhang, and John R. Howell

Infinite series, improper integrals, and Taylor series

Lab 9. Rotational Dynamics

2 Lie Groups. Contents

1D heat conduction problems

ELEMENTARY LINEAR ALGEBRA

Cheng Soon Ong & Christian Walder. Canberra February June 2018

UNIT IV DIMENSIONAL AND MODEL ANALYSIS

Mathematics Specialist Units 3 & 4 Program 2018

WUCT121. Discrete Mathematics. Numbers

Bisection and False Position Dr. Marco A. Arocha Aug, 2014

Lab 6. RC Circuits. Switch R 5 V. ower upply. Voltmete. Capacitor. Goals. Introduction

Section 3.7. Rolle s Theorem and the Mean Value Theorem

Differentiation 1. The METRIC Project, Imperial College. Imperial College of Science Technology and Medicine, 1996.

YEAR 13 - Mathematics Pure (C3) Term 1 plan

Part 1. For any A-module, let M[x] denote the set of all polynomials in x with coefficients in M, that is to say expressions of the form

Number Axioms. P. Danziger. A Group is a set S together with a binary operation (*) on S, denoted a b such that for all a, b. a b S.

Soil Damping Ratio: Theoretical Aspect and Measurement

Chapter 2 Non-linear Equations

Automorphism groups of wreath product digraphs

On The Determination of Transmissibility and Storage Coefficients from Pumping Test Data

ULTIMATE BEARING CAPACITY OF ECCENTRICALLY LOADED STRIP FOUNDATION ON SAND REINFORCED WITH GEOGRIDS

Turbulence is a ubiquitous phenomenon in environmental fluid mechanics that dramatically affects flow structure and mixing.

Math 201 Solutions to Assignment 1. 2ydy = x 2 dx. y = C 1 3 x3

The Not-Formula Book for C2 Everything you need to know for Core 2 that won t be in the formula book Examination Board: AQA

High School Algebra I Scope and Sequence by Timothy D. Kanold

IDEAL CLASSES AND SL 2

The complex logarithm, exponential and power functions

(Refer Slide Time: 2:14)

Chemical Reaction Engineering II Prof. A. K. Suresh Department of Chemical Engineering Indian Institute of Technology, Bombay

6x 2 8x + 5 ) = 12x 8. f (x) ) = d (12x 8) = 12

Transcription:

CHAPTER 38 DIMENSIONAL ANALYSIS - SPURIOUS CORRELATION by M S Yalm Professor, Department of Civil Engineering, and J W Kamphuis Associate Professor, Department of Civil Engineering, Queen's University at Kingston, Canada ABSTRACT Dimensional analysis and errors leading to spurious correlation are presented using a number of practical examples INTRODUCTION The theory of dimensions has been subject of considerable controversy and only in the last few decades has it been agreed that dimensional methods can serve as a powerful tool m experimental investigations of physical phenomena Its effectiveness becomes especially noticeable when the number of factors involved is large and the theoretical knowledge is insufficient However, like any other mathematical tool, the theory of dimensions can supply correct and useful results only if it is properly applied One example of improper application is the introduction of an exaggerated and sometimes simply of a nonexistent correlation between the experimental results The purpose of the present paper is to analyse these kinds of spurious correlation OUTLINE OF GENERAL PRINCIPLES For detailed information on the principles of the theory of dimensions the reader is referred to Refs (1), (2), (3), here only those points will be mentioned which have a direct bearing on the present analysis In general, a physical phenomenon corresponding to a specified geometry can be described by a number of independent quantities a l> a 2 a n referred to as the characteristic parameters Accordingly any quantitative property b of the phenomenon under consideration must be given by a functional relation such as b = f(a 1( a 2, a n ) (1) where the quantities a x and b are usually dimensional Since all 625

626 COASTAL ENGINEERING physical relations must be dimensionally homogeneous, using the II theorem of the theory of dimensions, one can express the above relation in a dimensionless form as Y - (XL X 2, X n. k ) (2) Here, k is the number of fundamental units, while Y and X-j (j =1, 2, n-k) are the following dimensionless power products k 6i Y = b i n 1 ai (3) X = a-, i n l a X x (4) The dimensionless quantities Y and X, can be interpreted as the dimensionless versions of b and a respectively The k parameters ax(i = 1 k) which are common^in the expressions of Xj and Y are often called the basic quantities or the "repeaters" Although either system may be used in experimental work, Eq 2 has some definite advantages over Eq 1 and these are described in standard works on the theory of dimensions, e g Refs (1, 2, 3) The most important advantages are - a) Eq 2 is independent of the system of units used, b) the number of variables on the right hand side of the equation has been reduced by k, c) the dimensionless variables (X,) are criteria of similarity VARIATION OF THE DIMENSIONLESS CHARACTERISTICS, SPURIOUS CORRELATION In a correctly designed experimental investigation, the variation of Y is achieved by varying only one X at a time In such cases, one deals with a set of functions of only one variable, as below Y, = $ (Const}, const2, X,,, constjj) - ^i( x j) (5) Accordingly in the following, the consideration of Eq 2 will be replaced by its special case, Eq 5, while the subscript j will be omitted Differentiating Eqs 3 and 4, one arrives at the following relations which represent the most general versions of variation of Y and X-. dy = b n, a, 3l 1=1 fdb + Z, g, da il 1=1 \T X -d (6) k dy = d(ln Y) = d(ln b) +,, 6 d(ln a.) (7) T 1_1 i k dx = a II, a, 1=1 X x '^ h-k (da. da ~a \ l a i _± (81

SPURIOUS CORRELATION 627 k 2% = d(ln X) = d(ln a) + l a x d(ln a^ (9) Three typical methods of variation of Y and X, are discussed below Case I Variation of Y and X is achieved by varying b and a only and by keeping the basic quantities a x constant Substituting da x = 0 into Eqs 6 and 8, one arrives at where fh = c = C dx da dx (10) k n a-l J i C = ±- i 1 = 1 PI (11) and C* = II a T x r = 1 a Hence, the variation of Y with X is proportional to the variation of b with a (and of b with X) and therefore the dimensionless relationship between Y and X is merely a scaled down version of the dimensional relationship between b and a (The scale of the ordmate is C times the scale of the abscissa ) For example, consider the decay of an impulsively generated wave with distance (4) The wave height H (which should be identified with b) may be plotted against the distance, s (to be identified with a) as m Tig la On the other hand, one can plot the dimensionless quantities Y = H ^ x = against each other - Fig lb Since the common parameter h, the water depth (which is to be identified with one of a x ) has not been varied, Figs la and lb are merely scaled down versions of each other For the present example C = 1 and thus the curves m Figs la and lb are geometrically similar From Eqs 7 and 9 it follows that d(ln Y) _ d(ln b) = d(ln b) d(ln X) d(ln a) d(ln X) (12) is also valid, which implies that, in the case under consideration, the variation of Y and X in a logarithmic system is identical to that of b and a (or of b and X) Case II Variation of Y and X is achieved by keeping the parameter a constant and by varying b and one or more of basic quantities a x Substituting da = 0 into Eas 7 and 9 one obtains d(ln Y) = d(ln b) + m (1S) UiJ d(ln X) axlnt)

628 COASTAL ENGINEERING 04 - A B C 03 H 02 0 I - ^ ^fc= 0 K-209 &-K=l 54 A 'K=I 18 K=0 63 a 20 1 i 40 (a) 1 1 60 80 04 03 02 _ K=209»K=I54 ^K=l 18 K=0 63 01 20 J_ 40 (b) 60 80 s/d Fig i Dimensional and Dimensionless Plots

SPURIOUS CORRELATION 629 where k E i=l E 1=1 da. da" E g d(ln a ) 1=1 T E a d(ln a ) 1=1 x (14) I only one of the basic quantities, e expression for m reduces to g a^ is varied, then the m = a l The present case is very common in practice (15) Consider, for instance, the familiar plot of the drag force acting on a sphere m an infinite fluid flow (Fig 2) The dimensional relationship - analogous to Eq 1 - is F = f(u, D, p, y) (16) where F is the drag force, U the velocity of the undisturbed flow, D, the sphere diameter, p the fluid density and y the dynamic viscosity The quantities F, y, U, D and p must be identified with b,a, ai, &2 and a 3 of Eq 1 respectively The dimensionless form - analogous to Eq 2 - is Y = = * (X) (SSe.) (17) Note that Eq 16 contains four variables whereas Eq 17 contains only one, l e the problem has been simplified considerably by introducing the dimensionless form 1 Let us now assume that the variation in flow is achieved by varying only one basic quantity (common parameter) U The variation of U alone will certainly induce the variation of the drag force r and the experimental procedure carried out in this manner will form an example for the present case The experimental relation between Y and X shown in Fig 2 indicates that d(ln Y)., d(lri X) " l is valid On the other hand from the expression for Y and X (Eq it follows that m = -2 Accordingly, Eq 13 gives 17) d(ln b) d(ln X) d(ln Y) d(ln X)

630 COASTAL ENGINEERING 4> Q. if) a> o o CM

SPURIOUS CORRELATION 631 and m terms of finite differences A (In Y) = A (In b) + m A (In X) In Fig 2 the increment A(In X) along the abscissa is shown by the horizontal distance M, the decrement A (In Y) along the ordmate being BE In Fig 2 the decrement BE is shown as the algebraic sum of the increment A(In b) = CB and the decrement ma (In X)= BD = CE, l e as implied by the equation above It follows, that the correlation between Y and X, achieved by varying a-^ (and consequently b) is as legitimate as that achieved by varying a(and b) as in the previous case The difference is that m the case I the variation AE is achieved directly, whereas m the present case it yields itself as the "resultant" of the "component variations" AC and AD Note, however, that the variation of Y with X can characterise the variation of b with a only, as has been shown in the case I It cannot characterise the variation of b with any of the common parameters a n Observe from Fig 2 that the value of Y decreases (along AE with X, whereas the value of b increases (along AC) with a^ = U Case III Variation of Y is achieved without varying b Substituting d(ln b) = 0 into Eqs 7 and 9, one obtains d(ln Y) = m d(ln X) 1 + d(ln a) jjj % d(ln a x ) (18) and if a is kept constant, d(ln Y) d(ln X) = m (19) Eq 19 is simply Eq 13 without the first term which reflects the influence of the variation of b, the dimensional term responsible for the existence of Y Hence a correlation between Y and X, obtained as described above is completely spurious Indeed the correlation implied by the present case can only be due to the variation of the common quantities a x If the parameters a x vary only, then the quantities Y and X also vary, and Y can be plotted against X in the form of a curve, even if b m actual fact does not depend on a at all With reference to Fig 2, the curve of spurious correlation is the line AD which implies the following case of Eq 19 d(ln Y) _, d(ln X) " z and which is due to the variation of the common quality 32 = 11 alone Hence, even if F had not been a function of y at all, by applying the procedure described m the present case, one could still obtain a

632 COASTAL ENGINEERING correlation between Y and X, m the form of the line M, which would, of course, be completely spurious Integrating Eq 19, one obtains Y = CX m (20) which is a straight line in the logarithmic system of co-ordinates and where C is given by It follows that c = ± n a tt - J W a m 1=2 1 a a) any correlation between Y and X obtained without varying b is spurious (unless a special subset of Eq 16 is chosen, where U, D,o and y are not independent, thus violating the basic assumptions), b) m a log-log system of co-ordinates, this spurious correlation forms a straight line, c) the position and slope of the straight line of spurious correlation is, a priori, predictable, it is determined by Eqs 19 and 20 When analysing experimental plots, m case of doubt it is advisable to determine and plot the family of straight lines Sj of spurious correlation (Fig 3) and to check the trend of the experimental points accordingly If the dimensional analysis has been performed improperly and the original quantities are not truly independent, then the variation of a x brings about unexpected variation in a or b and spurious correlation will take place along a curve r (Fig 3) This is also a point to be kept in mind A special case occurs when experimentally Y = <J> (X) = C X (22) and the lines of spurious correlation and genuine correlation coincide Consider, for example, the lengtha =b of dunes forming on the bed of a two dimensional rough turbulent flow with a mobile bed According to (5), this length may be expressed by the following functional relation A = f (D, p, v*, h) (23) where D = a^ is the gram size, p = a2 is the fluid density, v* = a.^ is the shear velocity and h = a is the flow depth The relation above can be expressed in dimensionless form

SPURIOUS CORRELATION 633 In Y InX Fig 3 Lines of Spurious Correlation

634 COASTAL ENGINEERING as follows h Y - - +CX) D (24) If the gram size D = a^ is varied only, then Eqs 20 and 21 give Y = CX (25) which implies that the lines of spurious correlation m a ]og-log system are straight lines with slope m = 1 On the other hand, the experimental results (obtained from experiments where A = b is varied over a large range) indicate that the genuine correlation between Y and X is given by the linear relation Y = 2TT X (26) which also a straight line with the slope m = 1 The lines of spurious correlation and the real relationship can coincide as m example above only if the parameter a^ is a "spurious parameter" This can be deduced from Eqs 3, 4 and 20 and indeed, the relation above which can be expressed as A - 2irh (27) indicates (contrary to the theoretical expectation) that the dune length A actually does not depend on the grain size a^ = D EXAGGERATION OF THE ACTUAL CORRELATION In practical applications, an existing correlation is often exaggerated by plotting common quantities along both co-ordinate axes (6) The explanation is given here in terms of a dimensionless system, however, the same argument is true for dimensional quantities where Z is the product and where Y implies Consider the functional relation From Eq 28 one obtains Z = I J(X) (28) Z = Y X N (29) Y = <.(X) (30) d(ln Z) = d (In Y) + N d(ln X)

SPURIOUS CORRELATION 635 and thus d (In Z) _ d(ln Y) (31) + N W d (In X) d(ln X) This equation indicates how m a log-log system of coordinates the rate of change with X can be increased by plotting the product Z = YXN rather than simply Y versus X For example if d(ln Y)_ =, anmo tan 9 l then the plot Y vs X is a 45 straight line, as shown in Fig 4 a It is assumed that the experimental points forming this straight line are scattered in a "ribbon" of the thickness w a If the product Z = Y xn is used as ordmate, and if for example the value of N is 2, then from Eq 30 d(ln Z) _,? =, d(ln X)! + 2-3 is valid, which implies that the straight line becomes steeper by a factor 3, while the thickness of the scatter-ribbon decieases by a factor w a VI + 1^4=^ "a (Fig 4b) Thus the relative thickness of the scatter ribbon, l e is reduced by factor thickness of scatter-ribbon length of the line 1 + N 2 tan 2 0 5 = 1 + tan-* 0 I z b One can say that the correlation can be improved 2 5 times by plotting Z = YX 2 rather than simply Y against X Such an "improvement" is nothing else but an optical illusion and is not legitimate, for the dimensionless version of the quantity b under investigation, as supplied by the H - theorem, is Y, not YXN These kinds of illegitimate plots are recognisable by the presence of the parameter a (which should be present only in the expression of X) in the power product implying the ordmate SUMMARY The experimenter is free to choose between a dimensional representation, Eq 1, or a dimensionless representation, Eq 2 of the

636 COASTAL ENGINEERING InX In X Fig 4 Exaggerated Correlation

SPURIOUS CORRELATION 637 phenomenon under investigation, The latter is found to have certain obvious advantages Once a decision is made, all the subsequent analysis is expressed in terms of the system chosen For instance if the drmensionless system is chosen, then the results should not be interpreted m terms of the individual dimensional parameters that form the dimensionless variables since the real variables of the experiment are the dimensionless variables The usual cause of spurious correlation is the appearance of common quantities along both co-ordinate axes This is true for both a dimensional and a dimensionless system This must not be confused with the appearance of common dimensional parameters along both co-ordinate axes when dimensionless variables are plotted against each other In addition, when considering dimensionless variables, care must be taken that the relationship between Y and X is not brought about only by variation of one or more of the repeating dimensional parameters common to both ACKNOWLEDGEMENTS The authors wish to thank Dr A Brebner who urged the writing of this paper APPENDIX 1 - REFERENCES Sedov, L I, Similarity and Dimensional Methods m Mechanics, Academic Press, New York, 1959 Langhaar, H L, Dimensional Analysis and Theory of Models, John Wiley and Sons, New York, London, 1962 Birkhoff, G, Hydrodynamics, a Study in Logic, Fact and Similitude, Princeton, Harvard University Press, 1950, Dover, 1955 Kamphuis, J William and Bowermg, Richard "Impulse Waves", 12th Conference on Coastal Engineering, Washington, Sept 1970 ~~ Yalm, M S, "Geometric Properties of Sand Waves" Journal of the Hydraulics Division, ASCE, Vol 90, No HY5, Sept 1964, pp 105-119 Benson, Manuel A, "Spurious Correlation m Hydraulics and Hydrology", Journal of the Hydraulics Division, ASCE, Vol 91, No HY4, July 1965, pp 35-42

638 COASTAL ENGINEERING a l a J APPENDIX II - NOTATION = dimensional basic quantities (repeaters), = non repeating dimensional independent quantities, b = dimensional dependent parameter, C = constant, D = diameter or gram size, F = force, f = dimensional function, H = wave height, h = depth of flow, K = constant, k = number of fundamental units, m = constant (usually slope), N = exponent, s = distance, U = approach velocity, v* = shear velocity, w = scatter band width, h = dimensionless independent variable, Y = dimensionless dependent variable, Z = dimensionless dependent variable, a i = exponent, Bi = exponent, A = dune length, V = dynamic viscosity, n = product, p = density, s = sum, dimensionless function, dimensionless function of a single dimensionless variable