A probabilistic justification for using tf idf term weighting in information retrieval

Size: px
Start display at page:

Download "A probabilistic justification for using tf idf term weighting in information retrieval"

Transcription

1 In J Digi Libr (2000) 3: Springer-Verlag A probabilisic jusificaion for using f idf erm weighing in informaion rerieval Djoerd Hiemsra Cenre for Telemaics and Informaion Technology, Universiy of Twene, The Neherlands; hiemsra@ci.uwene.nl Received: 17 December 1998/Revised: 31 May 1999 Absrac. This paper presens a new probabilisic model of informaion rerieval. The mos imporan modeling assumpion made is ha documens and queries are defined by an ordered sequence of single erms. This assumpion is no made in well-known exising models of informaion rerieval, bu is essenial in he field of saisical naural language processing. Advances already made in saisical naural language processing will be used in his paper o formulae a probabilisic jusificaion for using f idf erm weighing. The paper shows ha he new probabilisic inerpreaion of f idf erm weighing migh lead o beer undersanding of saisical ranking mechanisms, for example by explaining how hey relae o coordinaion level ranking. A pilo experimen on he TREC collecion shows ha he linguisically moivaed weighing algorihm ouperforms he popular BM25 weighing algorihm. Key words: Informaion rerieval heory Saisical informaion rerieval Saisical naural language processing 1 Inroducion There are hree basic processes an informaion rerieval sysem has o suppor: he represenaion of documens, he represenaion of a user reques, and he comparison of hese wo represenaions. In ex rerieval he documens and he user reques are expressed in naural language. Alhough ex rerieval has had by far he mos aenion in he informaion rerieval communiy, so far he success of naural language processing echniques has been limied. Mos of he effor in he field of ex rerieval has been pu in he developmen of saisical rerieval models like he vecor space model (proposed by Salon e al. [16]), he classical probabilisic model (proposed by Roberson and Spark Jones [12]) and more recenly he inference nework model (proposed by Crof and Turle [3]). The applicaion of naural language processing echniques in combinaion wih hese models has solid bu limied impac on he performance of ex rerieval 1 [19]. The research does however provide lile insigh o he quesion how o use naural language processing. Naural language processing modules are usually considered as preprocessing seps, ha is, hey are no included in he model iself. This paper aemps o formulae a model ha capures saisical informaion rerieval and saisical naural language processing ino one unifying framework, an approach ha ohers are also beginning o invesigae [9, 11]. I is he model iself ha explicily defines how documens and queries should be analysed. This seems a raher rivial requiremen, bu we claim ha his is no he general idea behind he exising models for informaion rerieval. The (implici) assumpion made by hese rerieval models is ha some procedure, eiher manual or auomaic, is used o assign index erms o documens. I is he resul of his procedure ha can be refleced by he model, no he procedure iself. This paper is organised as follows. In Sec. 2 he basics of he linguisically moivaed rerieval model are presened. Secion 3 gives a new probabilisic inerpreaion of f idf erm weighing by using esimaion procedures developed in he field of saisical naural language processing. Secion 4 presens a number of experimens, one pilo experimen on he relaively oudaed Cranfield collecion and wo addiional experimens on he TREC ad hoc and TREC-CLIR collecion. Finally, Sec. 5 presens 1 Rerieval performance is usually measured in erms of precision (he fracion of he rerieved documens ha is acually relevan) and recall (he fracion of he relevan documens ha is acually rerieved).

2 132 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval conclusions and plans for fuure work. These plans include he developmen of a model for phrases and he developmen of a model for cross-language informaion rerieval. An early version of his paper was presened a he Second European Conference on Digial Libraries (ECDL) [5]. 2 The basic rerieval model This paper defines a linguisically moivaed model of full ex informaion rerieval. The mos imporan modeling assumpion we make is ha documens and queries are defined by an ordered sequence of words or erms. 2 This assumpion is usually no made in informaion rerieval. In he models menioned in he inroducion, documens and queries are modeled as unordered collecions of erms or conceps. In he field of saisical naural language processing he word order assumpion is essenial for many applicaions, for insance par-of-speech agging, speech recogniion and parsing. By making he ordered sequence of erms assumpion we will be able o use advances already made in saisical naural language processing. In his secion we will define he framework ha will be used in he subsequen secions o give a probabilisic inerpreaion of f idf erm weighing. 2.1 An informal descripion: drawing query erms from a documen Before we describe he new rerieval model mahemaically, his secion gives an informal descripion of he underlying ideas. The main goal of an informaion rerieval sysem is o find hose documens in a documen collecion ha are relevan o a query. A full ex rerieval sysem compares he words in he query wih he words in each documen o rank he documens. Documens ha are likely o be relevan should be ranked a he op and documens ha are unlikely o be relevan should be ranked a he boom of he ranked lis. A mahemaical model of informaion rerieval formally defines how he sysem should perform his ranking, usually based on inuiions or meaphors from some well-undersood branch of mahemaics. For example Salon s vecor space model is based on inuiions from geomery: documens and queries are vecors in a high-dimensional space and documens are ranked by he cosine of he angle ha separaes he documen vecor and he query vecor. The Roberson/Sparck-Jones probabilisic model is based on he inuiion ha a sysem can learn from he disribuion of erms over relevan and non-relevan documens which documens are probably relevan o a query. 2 In he linguisically moivaed model erms and words are equivalen, boh expressions will be used in his paper. A classical index erm ha consiss of more han one word will be called a phrase. We will use probabiliy heory in a differen way here by using a meaphor ha is very similar o he sampling coloured balls from urns examples ha are ofen used in inroducory saisics courses [10]. Insead of drawing balls a random wih replacemen from an urn, we will consider he process of drawing words a random wih replacemen from a documen. Suppose someone selecs one documen in he documen collecion; draws a random, one a a ime, wih replacemen en words from his documen and hands hose en words (he query erms) over o he sysem. The sysem now can make an educaed guess as from which documen he words came from, by calculaing for each documen he probabiliy ha he en words were sampled from i and by ranking he documens accordingly. This meaphor for informaion rerieval was inroduced by Pone and Crof [11]. The inuiion behind i is ha users have a reasonable idea of which erms are likely o occur in documens of ineres and will choose query erms accordingly. The meaphor is a very powerful one as i can be exended in various ways. Because of he sequenial naure of he sampling process, i can be exended o model phrases as done by Miller, Leek and Schwarz [9]. I can be exended o Boolean queries by reaing he sampling process as an AND-query and allowing ha each draw is specified by a disjuncion of more han one erm. For example, he probabiliy of firs drawing he erm informaion and hen drawing eiher he erm rerieval orhe erm filering from a documen can be calculaed by he model inroduced in his paper wihou any addiional modeling assumpions. Furhermore, i can be exended wih addiional saisical processes o model differences beween he vocabulary of he query and he vocabulary of he documens. For insance, for cross-language rerieval, saisical ranslaion can be added o he process of sampling erms from a documen: e.g., firs an English word is sampled from he documen, and hen his word is ranslaed o Duch wih some probabiliy ha can be esimaed from a parallel corpus. Evaluaions of Boolean queries and saisical ranslaion are described in [6, 7]. In his paper we will focus on he basics of he new rerieval model by defining i mahemaically and by saing how i relaes o exising f idf erm weighing algorihms. We will deal wih he mahemaical deails of he exensions of he model in fuure publicaions. 2.2 The sample space We assume ha a collecion consiss of a finie number of exual documens. The documens are wrien in a language ha exiss of a finie number of words or erms. Definiion 1. Le P be a probabiliy funcion on he join sample space Ω D Ω T.LeΩ D be a discree sample space ha conains a finie number of poins d such ha each d refers o an acual documen in he documen collecion. Le D be discree random variable over Ω D.Le

3 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval 133 Ω T be a discree sample space ha conains a finie number of poins such ha each refers o an acual erm ha is used o represen he documens. Le T be a discree random variable over Ω T. In oher words, he random variable D refers o a documen id and he random variable T refers o an index erm. 2.3 Modeling documens and queries Queries will be modeled as compound evens. A compound even is an even ha consiss of wo or more single evens, as when a die is ossed wice or hree cards are drawn one a a ime from a deck [10]. The single evens ha define he compound even are he query erms. In general he probabiliy of a compound even does depend on he order of he single evens. For example a query of lengh n is modeled by an ordered sequence on n single erms T 1,T 2,,T n. Given a documen id D he probabiliy of he ordered sequence will be defined by P (T 1,T 2,,T n D). Mos pracical models for informaion rerieval assume independence beween index erms. Assuming condiional independence of erms given a documen id leads o he following model. P (T 1,T 2,,T n D)= n P (T i D) (1) Noe ha he assumpion of independence beween query erms does no conradic he assumpion ha erms in queries have a paricular order. The independence assumpion merely saes ha every possible order of erms has he same probabiliy. I is made o illusrae ha a simple version of he linguisically moivaed model is very similar o exising informaion rerieval models. 2.4 The maching process Equaion (1) can be used direcly o rank documens given a query T 1,T 2,,T n. I migh however be ineresing o rewrie (1) o a probabiliy measure ha explicily ranks documens given a query: P (D T 1,T 2,,T n ). This measure can be relaed o (1) by applying Bayes rule. P (D T 1,T 2,,T n )=P(D) P (T 1,T 2,,T n D) (2) P (T 1,T 2,,T n ) n = P (D) P (T i D) (3) P (T 1,T 2,,T n ) Equaion (2) is he direc resul of applying Bayes rule. Filling in he independence assumpion of (1) leads o (3). I seems emping o make he assumpion ha erms are also independen if hey are no condiioned on a documen D. This will however lead o an inconsisency of he model (e.g., see Cooper s paper on modeling assumpions for he classical probabilisic rerieval model [2]). Since d P (D = d T 1,T 2,,T n ) = 1 we can scale he formula using a consan C such ha 1 C = d P (D = d T 1,T 2,,T n ). P (D T 1,T 2,,T n )=CP(D) n P (T i D) (4) Equaion (4) defines he ranking formula of he linguisic moivaed probabilisic rerieval model if we assume erm independence. 3 Esimaing he probabiliies The process of probabiliy esimaion defines how probabiliies should be esimaed from he frequency of erms in he acual documen collecion. We will look a he esimaing process by drawing a parallel o saisical naural language processing and corpus linguisics. 3.1 Viewing documens as language samples The general idea is he following. Each documen conains a small sample of naural language. For each documen he rerieval sysem should build a lile saisical language model P (T D) wheret is a single even. Such a language model migh indicae ha he auhor of ha documen used a cerain word 5 ou of 1000 imes; i migh indicae ha he auhor used a cerain synacic consrucion like a phrase 5 ou of 1000 imes; or ulimaely indicae ha he auhor used a cerain logical semanic srucure 5 ou of 1000 imes. One of he main problems in saisical naural language processing and corpus linguisics is he problem of sparse daa. If he sample ha is used o esimae he parameers of a language model is small, hen many possible language evens never ake place in he acual daa. Suppose for example ha an auhor wroe a documen abou informaion rerieval wihou using he words keyword and crocodile. The reason ha he auhor did no menion he word keyword is probably differen from he reason for no menioning he word crocodile. If we were able o ask an exper in he field of informaion rerieval o esimae probabiliies for he erms keyword and crocodile he/she migh for example indicae ha he chance ha he erm keyword occurred is one in a housand erms and he chance ha he erm crocodile occurred is much lower: one in a million. If we however base he probabiliies on he frequency of erms in he acual documen hen he probabiliy esimaes of low frequen and medium frequen erms will be unreliable. A full ex informaion rerieval sysem based on hese frequencies canno make a difference beween words ha were no used by chance, like he word keyword, and words ha were no used because hey are no par of he vocabulary of he

4 134 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval subjec, like he word crocodile. Furhermore here is always a small chance ha compleely off he subjec words occur like he word crocodile in his paper. We believe ha he sparse daa problem is exacly he reason ha i is hard for informaion rerieval sysems o obain high recall values wihou degrading values for precision. Many soluions o he sparse daa problem were proposed in he field of saisical naural language processing ( e.g., see [8] for an overview). We will use he combinaion of esimaors by linear inerpolaion o esimae parameers of he probabiliy measure P (T D). 3.2 Esimaing probabiliies from sparse daa Perhaps he mos sraighforward way o esimae probabiliies from frequency informaion is maximum likelihood esimaion [10]. A maximum likelihood esimae makes he probabiliy of observed evens as high as possible and assigns zero probabiliy o unseen evens. This makes he maximum likelihood esimae unsuiable for direcly esimaing P (T D). One way of removing he zero probabiliies is o mix he maximum likelihood model of P (T D) wih a model ha suffers less from sparseness like he marginal P (T ). I is possible o make a linear combinaion of boh probabiliy esimaes so ha he resul is anoher probabiliy funcion. This mehod is called linear inerpolaion: P li (T D)= P mle (T )+α 2 P mle (T D), (0 <,α 2 < 1 and + α 2 =1) (5) The weighs and α 2 migh be se by hand, in which case we would choose hem in such a way ha P mle (T = ) is smaller han α 2 P mle (T = D) foreach erm. This will give erms ha did no appear in he documen a much smaller probabiliy han erms ha did appear in he documen. In general one wans o find he combinaion of weighs ha works he bes, for example by opimising hem on a es collecion consising of documens, queries and corresponding relevance judgemens. Table 1 liss he frequencies ha are used o esimae he probabiliies of he model. Two frequencies are paricularly imporan, he erm frequency and he documen frequency. The erm frequency of a erm is defined by he number of imes a erm appears in a documen and can be viewed a as local or documen specific informaion. Given a specific documen many erms will have a frequency of zero, so he erm frequency suffers from sparseness. The documen frequency of a erm is defined by he number of documens in which a erm appears and can be viewed as global informaion. (Someimes documen frequency is referred o as collecion frequency.) The documen frequency of a erm will never be zero, because by definiion 1, erms ha do no appear in any documen will no be included in he model. The sparseness problem can be avoided by esimaing P (T D) as a linear combinaion of a probabiliy model based on documen Table 1. Frequency informaion N he number of documens in he collecion f (, d)erm frequency: he number of imes he erm appears in he documen d. df () documen frequency: henumberof documens in which he erm appears. frequency and a probabiliy model based on erm frequency as in (7): P (D = d)= 1 N (6) df ( i ) P (T i = i D = d)= df () + α f ( i,d) 2 f (, d) (7) Noe ha erm frequency and documen frequency are no derived from he same disribuion. Alhough he erm frequency can also be used o compue global informaion of a erm by summing over all possible documens, his informaion will usually no be he same as he documen frequency of a erm, more formally: df () d f (, d). Equaions (4) and (7) define he ranking algorihm. The formula bears some resemblance wih he ranking formula used by Miller, Leek and Schwarz [9]. They showed ha he model can be inerpreed as a wo-sae hidden Markov model in which and α 2 define he sae ransiions. 3.3 Relaion o f idf The use of erm frequency and documen frequency o rank documens was exensively sudied, especially by Salon e al., for he vecor space model [15]. Following consideraions of he erm discriminaion model [17], hey argued ha erms appearing in documens should be weighed proporional o he erm frequency and inversely proporional o he documen frequency. Weighing schemes ha follow his approach are called f idf (erm frequency inverse documen frequency) weighing schemes. The combinaion of f idf weighs and documen lengh normalisaion gave he bes rerieval resuls on several es collecions, bu hey were no able o jusify heir approach by probabiliy heory (which is no a prerequisie for using i in he vecor space model anyway):... The erm discriminaion model has been criicised because i does no exhibi well subsaniaed heoreical properies. This in conras wih he probabilisic model of informaion rerieval... The lack of heoreical jusificaion of f idf weighs did no keep developers of he probabilisic model and he inference nework model from using hem. Roberson e al. [13] jusified he use of erm frequency in he probabilisic model by approximaing a ranking formula ha is

5 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval 135 based on he combinaion of he probabilisic model and he 2-Poisson model. There is however a more plausible probabilisic jusificaion of f idf weighing which can be jusified by he linear inerpolaion esimaor of (7). This can be shown by rewriing. Muliplying he ranking formula defined by (4), (6) and (7) wih values ha are he same for each documen will no affec he final ranking, so we can muliply he ranking formula by df () and as follows: P (D = d T 1 = 1,,T n = n ) n df ( i ) ( df () + α f ( i,d) 2 [by (4),(6) and (7)] f (, d)) n f ( i,d) df () ( + α 2 f (, d) df ( i) ) [ n df () df ( i ) ] n (1 + f ( i,d) df ( i ) 1 α2 df () )[ n 1 f (, d) ] The resuling formula can direcly be inerpreed as a f idf weighing algorihm wih documen lengh normalisaion, because: α 2 df () is consan for any documen d and erm f ( i,d) is he f idf weigh of he erm i in he df ( i ) documen d 1 f (, d) is he inverse lengh of he documen d Any monoonic ransformaion of he documen ranking funcion will produce he same ranking of he documens. Insead of he produc of weighs we could herefore also rank he documens by he sum of logarihmic weighs. n log(1 + f ( i,d) df ( i ) 1 α2 f (, d) df () )[log. r.] Roberson and Sparck-Jones [12] call he resuling formula a presence weighing scheme (as opposed o a presence/absence weighing scheme) because he formula assigns a zero weigh o erms ha are no presen in a documen. Presence weighing schemes can be implemened using he vecor produc formula as inroduced by Salon e al. [15]. The query weighs of he vecor produc formula can be used o accoun for muliple occurrences of he same erm in he query. The resuling vecor produc version of he ranking formula is displayed in Table 2. On firs glance he consan α 2 df ()/ seems o have lile impac on he final ranking. Bu in fac, differen values of and α 2 will lead o differen documen rankings. In Sec. 3.5 we will show some effecs of differen values of and α 2 on he ranking of documens, especially for shor queries. As said above, he weighing algorihm can, by he definiion of Salon e al., be inerpreed as a f idf Table 2. Vecor produc version of weighing algorihm l similariy(q, D)= w qk w dk k=1 w qk = f ( k,q) f ( k,d) w dk =log(1+ df ( k ) α2 f (, d) df () ) weigh wih documen lengh normalisaion. However, he f (, d) (ha is: he documen lengh) in he denominaor of he documen erm weigh in Table 2 is he resul of he requiremen ha probabiliies have o sum up o one and no he resuls of documen lengh normalisaion. Documen lengh normalisaion is assumed by (6). In fac we migh assume ha longer documens are more likely o be relevan by using he prior probabiliy of (8). f (, d) P (D = d)= d f (, d) (8) This resuls in a weighing algorihm ha canno be rewrien ino he vecor produc normal form. I can however be implemened fairly easily by iniialising similariies o log( f (, d)) insead of o zero when processing he query. This version of he weighing algorihm was used in he experimens of Sec A new informal definiion of f idf weighing Equaion (7) gives rise o a new informal definiion of f idf weighing. Giving an informal definiion afer he formal definiion seems a bi useless, bu we believe ha i will help o undersand wha exacly makes f idf weighing successful. The classical definiion of f idf weighing can be formulaed as follows: Definiion 2. The weigh of a erm ha appears in a documen should increase wih he erm frequency of he erm in he documen and decrease wih he documen frequency of he erm. Terms ha do no appear in a documen should all ge he same weighs (zero weighs). An alernaive definiion is based on (7). If we assume ha df ( i )/ df () is much smaller han α 2 f ( i,d)/ f (, d) hen i can be formulaed as follows: Definiion 3. The weigh of a erm ha appears in a documen should increase wih he erm frequency of he erm in he documen. The weigh of a erm ha does no appear in a documen should increase wih he documen frequency of he erm. An example may clarify he implicaions of boh definiions. Suppose he user formulaes he query informaion rerieval and here is no documen in he collecion

6 136 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval in which he erms informaion and rerieval boh appear. Furhermore, suppose ha he erm informaion is much more common, i.e., has a higher documen frequency han he erm rerieval. Now he sysem will rank documens conaining k occurrences of he erm rerieval above documens conaining k occurrences of he erm informaion. The classical explanaion would be as follows: Explanaion 1. The query erm rerieval maches beer wih documens conaining rerieval han he query erm informaion maches wih documens conaining informaion, because rerieval has a higher inverse documen frequency han informaion. The alernaive explanaion would be ha: Explanaion 2. The query erm informaion maches beer wih documens no conaining informaion han he query erm rerieval maches documens no conaining rerieval because informaion has a higher documen frequency han rerieval. There is no a priori reason o prefer one explanaion above he oher. However, he idea of definiion 3, ha global informaion is only used o weigh erms of which here is no local informaion, migh lead o beer undersanding of probabilisic erm weighing in ex rerieval. 3.5 The problem of non-coordinaion level ranking There is a well-known problem wih saisical informaion rerieval sysems ha use f idf weighing: someimes documens conaining n query erms are ranked higher han documens conaining n + 1 query erms. We will call his problem he problem of non-coordinaion level ranking in which he coordinaion level refers o he number of disinc query erms conained in a documen. A coordinaion level ranking procedure will always rank documens conaining n +1 query erms above documens conaining n query erms even if he op documens have lile evidence for he presence of n +1query erms and lower-ranked documens have a lo of evidence for he presence of n erms. According o sudies of user preferences and evaluaions on es collecions he problem of non-coordinaion level ranking becomes paricularly apparen if shor queries are used [14]. In a lo of pracical siuaions shor queries are he rule raher han he excepion, especially in siuaions where here is no or lile user raining like wih Web-based search engines. For some research groups, he imporance of coordinaion level is he reason for developing ranking mehods ha are based on he lexical disance of search erms in documens insead of on documen frequency of erms [1, 4]. However, as poined ou by experimens of Wilkinson e al. [21], some f idf measures (e.g., like he measure proposed by Roberson e al. [13]) are more like coordinaion level ranking han ohers (e.g., like he measure proposed by Salon e al. [15]). Wilkinson e al., showed ha weighing measures ha are more like coordinaion level ranking perform beer on he TREC collecion, especially if shor queries are used. Following he resuls of Wilkinson e al., i migh be useful o invesigae wha exacly makes a weighing measure like coordinaion level ranking. The following example may provide some insigh. Firs of all, suppose we use he following ranking formula which can be derived from he probabiliy ranking funcion in a similar way as is shown in Sec P (D = d T 1 = 1,,T n = n ) n ( α 2 df () + f ( i,d) df ( i ) 1 (9) f (, d)) Now suppose he user eners a small query of only wo erms a and b. As in he previous example a migh be he erm informaion and b migh be he erm rerieval. Furhermore, suppose ha he documen d 1 conains a lo of evidence for erm a and no evidence for he erm b; and ha documen d 2 conains lile evidence of boh erms. I can be shown ha documen d 1 will have a lo of evidence for a and none for b if f (a, d 1 )ishigh,f (b, d 1 )=0 and he lengh of d 1 is shor. Documen d 2 conains lile evidence of a and b if f (a, d 2 )=f (b, d 2 )=1 and if d 2 is a long documen. Now he following equaion defines he requiremen for coordinaion level ranking, ha is, he similariy of documen d 1 o he query abshould be smaller han he similariy of documen d 2 o he query. The lef hand side of he equaion conains he similariy of he query compared o documen d 1 and he righ hand side conains he similariy of he query o documen d 2. We will use shor noaions of he documen lengh and he consan of (9): l(d)= f (, d) and c = /α 2 df (). (c + f (a, d 1) df (a)l(d 1 ) )(c +0) < (c + 1 df (a)l(d 2 ) )(c + 1 df (b)l(d 2 ) ) c 2 + cf(a, d 1) c df (a)l(d 1 ) <c2 + df (a)l(d 2 ) + c df (b)l(d 2 ) 1 + df (a)df (b)l(d 2 ) 2 cf(a, d 1 ) df (a)l(d 1 ) c df (a)l(d 2 ) c df (b)l(d 2 ) < 1 df (a)df (b)l(d 2 ) 2 c<. l(d 1 ) l(d 2 )(f (a, d 1 )df (b)l(d 2 ) df (b)l(d 1 ) df (a)l(d 1 )) (10) Equaion (10) shows ha we can rewrie he requiremen for coordinaion level ranking as a requiremen for he consan c. Ifc is small enough hen he problem of noncoordinaion level ranking will never occur. By changing he value of c he ranking formula can be adaped o

7 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval 137 differen applicaions. If we are developing a Web-based search engine we migh choose a relaively small value for c, bu if we are developing a search engine for evaluaion in TREC [20] we migh choose a higher value of c.forhe Web-based search engine we migh define a collecion specific lower bound of c by keeping rack of he collecion exrema like maximum erm frequency and documen frequency (maxf and maxdf) and maximum and minimum documen lenghs (maxl and minl). If we fill in hese exrema and he definiion c = /α 2 df () in ((10)) hen he lower bound will be defined as follows on he raio beween and α 2. minl df () < α 2 maxl (maxf maxdf maxl maxdf minl minl) (11) Equaion (11) defines a ranking formula ha always produces coordinaion level ranking for queries of wo words. For longer queries he bound will be lower and for queries wih unresriced lengh only = 0 will guaranee coordinaion level ranking. 3.6 A plausible explanaion of non-coordinaion level ranking The argumens in he previous secion showed he following. The smaller he value of he consan c, hemorehe ranking formula will behave like coordinaion level ranking. I is good o noe ha mos f idf measures defined for he exising models of informaion rerieval include consans for which he argumens inroduced above also hold (for insance he +0.5 in he Roberson/Sparck Jones formula [12, 13]). However, he classical definiion of f idf weighing (definiion 2) does no give a plausible explanaion of why and when non-coordinaion level ranking does happen. Using he new definiion 3 and he fac ha c is defined by he raio /α 2 we can give he following explanaion of non-coordinaion level ranking when f idf weighs are used. Explanaion 3. Non-coordinaion level ranking occurs if query erms ha do no appear in a documen are weighed oo high compared o query erms ha do appear in a documen. According o definiion 3 erms ha do no appear in a documen are weighed proporional o he documen frequency. If we choose a relaively high value for he consan hen query erms ha do no appear in a documen will be weighed oo high, possibly causing noncoordinaion level ranking. 4 Experimenal resuls This secion briefly describes he resuls of a number of experimens wih he linguisically moivaed erm weighing algorihm. A pilo experimen uses he relaively oudaed Cranfield collecion as repored earlier in [5]. Addiional experimens using he TREC collecion indicae ha he new weighing algorihm ouperforms he popular Cornell version of BM The Cranfield collecion The Cranfield collecion is a small collecion (1398 documens) wih a relaively large number of queries (255 queries). In he experimen we implemened a linguisically moivaed probabilisic rerieval engine and a sandard vecor engine. Boh engines used he same okenisaion and semming of he words in he documens. As a es collecion we used he Cranfield collecion which was also used exensively in early experimens wih he vecor space model [15]. Table 3 liss he non-inerpolaed average precision averaged over 225 queries of he Cranfield collecion for differen values of and α 2. Table 3. Experimenal resuls on he Cranfield collecion weigh avg. precision =0.05 α 2 = =0.2 α 2 = =0.35 α 2 = =0.5 α 2 = =0.65 α 2 = =0.8 α 2 = =0.95 α 2 = To evaluae how our weighing scheme performs relaive o oher f idf weighing schemes wih documen lengh normalisaion we implemened he vecor space model wih fc.nfx weighing as proposed by Salon and Buckley [15]. The non-inerpolaed average precision averaged over 225 queries of his sysem was on he Cranfield collecion. 3 The linguisically moivaed sysem performs beer for quie a wide range of differen values of and α 2. The bes performance in erms of average precision is approximaely a = Coordinaion level ranking Cranfield has he following collecion exrema: The smalles documen is 18 words long, he longes 354 words. The maximum erm frequency is 28 and he maximum documen frequency 729. Following he argumens of Sec. 3.5 i is possible o calculae a lower bound on he raio beween and α 2 ha will define coordinaion level ranking given a query of lengh 2. This leads 3 Salon and Buckley [15] repor a 3-poin inerpolaed average precision of Our version of heir sysem reaches a 3-poin inerpolaed average precision of which is probably due o he use of a semmer.

8 138 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval o a lower bound of on he raio beween and α 2 which corresponds roughly o = and α 2 = Alhough correc, he lower bound inroduced by (11) is obviously no very useful for idenifying proper values for and α 2. There are several reasons ha migh explain why he sysem performs opimally for much higher values of : 1. Coordinaion level ranking does no lead o good average precision on he Cranfield collecion. 2. The sysem does produce coordinaion level ranking, bu he bound on he raio beween and α 2 is oo low o be of any use. 3. The sysem does produce coordinaion level ranking, bu he bound is no useful because he collecion does no have very small queries (he average query lengh is abou 9.5words). Addiional experimens have o poin ou which reason or reasons acually explain he experimenal resuls he bes. 4.3 The TREC collecion Since he exisence of he TEx Rerieval Conference (TREC), experimens have resuled in weighing algorihms ha perform much beer han fc.nfx weighing. According o Voorhees and Harman [20], odays mos popular weighing algorihm is he Cornell implemenaion of he Okapi BM25 algorihm [13, 18]. Table 4 displays he version of he BM25 weighing algorihm ha was used for comparison. Table 5 liss he evaluaion resuls in erms of average precision of he new weighing algorihm compared o he resuls of he Cornell/BM25 algorihm. We used wo modern es collecions provided by NIST via TREC. One is he ad hoc collecion consising of aricles from he LA Times, he Financial Times, he Federal Regiser and Foreign Broadcas Informaion Service. 4 The oher collecion uses he English documens and English opics 1-24 of he TREC Cross-Language Informaion Rerieval (CLIR) collecion consising of AP Newswire aricles. On boh collecions we used =0.85. Table 4. Cornell version of BM25 weighing algorihm l similariy(q, D)= w qk w dk k=1 w qk = f ( k,q) f ( k,d) log( N df ( k )+0.5 df ( w dk = k )+0.5 ) 2 ( f (,d),d f (,d)/n )+f ( k,d) Boh weighing algorihms perform approximaely equally well on Cranfield, bu on he collecions of more 4 We lef ou he Congressional Records because NIST decided no o include his subcollecion in TREC-7. realisic size he linguisically moivaed weighing algorihm ouperforms he Cornell/BM25 weighing algorihm. The performance gain of 17% on he ad hoc collecion is specacular. Experimens using he ad hoc opics show a significan, hough less specacular, improvemen [7]. Table 5. Performance on hree es collecions average precision collecion BM25 new algorihm % diff. Cranfield TREC CLIR TREC ad-hoc Maybe more imporan han he resuls in erms of average precision is he he fac ha he bes value of he unknown parameer of he linguisically moivaed weighing algorihm is sable across differen es collecions. On all hree collecions he bes performance of he linguisically moivaed weighing algorihm lies a approximaely =0.85. For =0.8 and =0.9 he average precision on he English CLIRcollecion is respecively and and on he ad hoc collecion respecively and Conclusion and fuure plans This paper has presened he linguisically moivaed probabilisic model of informaion rerieval. Using esimaion by linear inerpolaion which is ofen used in he field of saisical naural language processing we have been able o presen a probabilisic inerpreaion of f idf erm weighing. We have shown ha his new inerpreaion leads o beer undersanding of he behaviour of f idf ranking. Experimens wih he TREC collecion show ha a rerieval sysem ha uses he new model ouperforms he same sysem using he Cornell/BM25 weighing algorihm. This paper has no presened he linguisically moivaed model of informaion rerieval in is full srengh. Alhough we claim ha he mos imporan modeling assumpion of he model is ha documens and queries are defined by an ordered sequence of erms, he assumpion is no essenial for he claims made in his paper. In fuure papers we will invesigae wo major informaion rerieval issues ha require naural language processing echniques. The firs issue is he use of phrases in informaion rerieval. The second issue is he problem of cross-language informaion rerieval. Acknowledgemens. The work repored in his paper is funded in par by he Duch Telemaics Insiue projec DRUID. The repored TREC evaluaions were funded in par by he EU projecs Tweny-One (IE 2108) and Pop-Eye (LE 4234). I would like o hank he following people for heir suppor. Franciska de Jong, Paul van der Ve and Wilber Kallenberg for

9 D. Hiemsra: A probabilisicjusificaion for using f idf erm weighing in informaion rerieval 139 general advice, and David Hawking of he Ausralian Naional Universiy for his advice on coordinaion level ranking. Special hanks go o Wessel Kraaij of TNO-TPD for running he TREC ad hoc experimens. References 1. Clarke, C.L.A., Cormack, G.V., Tudhope, E.A.: Relevance ranking for one o hree erm queries. In: Proc. RIAO 97, 1997, pp Cooper, W.S.: Some inconsisencies and misidenified modeling assumpions in probabilisic informaion rerieval. ACM Trans. Informaion Sysems 13: , Crof, W.B., Turle, H.R.: Tex rerieval and inference. In: Jacobs, P. (ed.): Tex-based Inelligen Sysems. Lawrence Erlbaum, 1992, pp Hawking, D., Thislewaie, P.: Relevance weighing using disance beween erm occurrences. Technical Repor TR-CS-96-08, The Ausralian Naional Universiy, 1996 hp://cs.anu.edu.au/echrepors 5. Hiemsra, D.: A linguisically moivaed probabilisic model of informaion rerieval. In: Nicolaou, C., Sephanidis, C. (eds.): Proc. 2nd European Conference on Research and Advanced Technology for Digial Libraries (ECDL-2), 1998, pp Hiemsra, D., de Jong, F.M.G.: Cross-language rerieval in Tweny-One: using one, some or all possible ranslaions? In: Proc. 14h Twene Workshop on Language Technology (TWLT-14), 1998, pp Hiemsra, D., Kraaij, W.: Tweny-One a TREC-7: Ad-hoc and cross-language rack. In: Proc. 7h Tex Rerieval Conference (TREC-7). NIST Special Publicaions, Manning, C., Schüze, H. (eds.): Saisical Naural Language Processing: Theory and Pracice (draf) hp://www. sulry.ars.su.edu.au/manning/courses/sanlp 9. Miller, D.R.H., Leek, T., Schwarz, R.M.: BBN a TREC-7: using Hidden Markov Models for informaion rerieval. In: Proc. 7h Tex Rerieval Conference, TREC-7. NIST Special Publicaions, Mood, A.M., Graybill, F.A. (eds.): Inroducion o he Theory of Saisics. Second ediion. McGraw-Hill, Pone, J.M., Crof, W.B.: A language modeling approach o informaion rerieval. In: Proc. 21s ACM SIGIR Conference on Research and Developmen in Informaion Rerieval (SI- GIR 98), Roberson, S.E., Sparck Jones, K.: Relevance weighing of search erms. J. American Sociey Informaion Science 27: , Roberson, S.E., Walker, S.: Some simple effecive approximaions o he 2-poisson model for probabilisic weighed rerieval. In: Proc. 17h ACM SIGIR Conference on Research and Developmen in Informaion Rerieval (SIGIR 94), 1994, pp Rose, D.E., Sevens, C.: V-win: A lighweigh engine for ineracive use. In: Voorhees, E.M., Harman, D.K. (eds.): Proc. 5h Tex Rerieval Conference TREC-5, NIST Special Publicaion , 1997, pp Salon, G., Buckley, C.: Term-weighing approaches in auomaic ex rerieval. Informaion Processing & Managemen 24(5): , Salon, G., McGill, M.J. (eds.): Inroducion o Modern Informaion Rerieval. McGraw-Hill, Salon, G., Yang, C.S.: On he specificaion of erm values in auomaic indexing. J. Documenaion 29(4): , Singhal, A., Salon, G., Mira, M., Buckley, C.: Documen lengh normalizaion. Technical Repor TR , Cornell Universiy, hp://cs-r.cs.cornell.edu/ 19. Srzalkowski, T., Sparck Jones, K.: NLP rack a TREC-5. In: Voorhees, E.M., Harman, D.K. (eds): Proc. 5h Tex Rerieval Conference TREC-5, NIST Special Publicaion , 1997, pp Voorhees, E.M., Harman, D.K.: Overview of he 6h ex rerieval conference. In: Proc. 6h Tex Rerieval Conference TREC-6, NIST Special Publicaion , 1998, pp Wilkinson, R., Zobel, J., Sacks-Davis, R.: Similariy measures for shor queries. In: Harman, D.K. (ed.): Proc. 4h Tex Rerieval Conference TREC-4, NIST Special Publicaion , 1996, pp

Retrieval Models. Boolean and Vector Space Retrieval Models. Common Preprocessing Steps. Boolean Model. Boolean Retrieval Model

Retrieval Models. Boolean and Vector Space Retrieval Models. Common Preprocessing Steps. Boolean Model. Boolean Retrieval Model 1 Boolean and Vecor Space Rerieval Models Many slides in his secion are adaped from Prof. Joydeep Ghosh (UT ECE) who in urn adaped hem from Prof. Dik Lee (Univ. of Science and Tech, Hong Kong) Rerieval

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes Represening Periodic Funcions by Fourier Series 3. Inroducion In his Secion we show how a periodic funcion can be expressed as a series of sines and cosines. We begin by obaining some sandard inegrals

More information

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion: a Correcion * Tom Coupé Economics Educaion and Research Consorium, Naional Universiy of Kyiv Mohyla Academy Address: Vul Voloska 10, 04070

More information

18 Biological models with discrete time

18 Biological models with discrete time 8 Biological models wih discree ime The mos imporan applicaions, however, may be pedagogical. The elegan body of mahemaical heory peraining o linear sysems (Fourier analysis, orhogonal funcions, and so

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

Georey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract

Georey E. Hinton. University oftoronto.   Technical Report CRG-TR February 22, Abstract Parameer Esimaion for Linear Dynamical Sysems Zoubin Ghahramani Georey E. Hinon Deparmen of Compuer Science Universiy oftorono 6 King's College Road Torono, Canada M5S A4 Email: zoubin@cs.orono.edu Technical

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

Lecture 33: November 29

Lecture 33: November 29 36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models Journal of Saisical and Economeric Mehods, vol.1, no.2, 2012, 65-70 ISSN: 2241-0384 (prin), 2241-0376 (online) Scienpress Ld, 2012 A Specificaion Tes for Linear Dynamic Sochasic General Equilibrium Models

More information

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin ACE 56 Fall 005 Lecure 4: Simple Linear Regression Model: Specificaion and Esimaion by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Simple Regression: Economic and Saisical Model

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

Robust estimation based on the first- and third-moment restrictions of the power transformation model

Robust estimation based on the first- and third-moment restrictions of the power transformation model h Inernaional Congress on Modelling and Simulaion, Adelaide, Ausralia, 6 December 3 www.mssanz.org.au/modsim3 Robus esimaion based on he firs- and hird-momen resricions of he power ransformaion Nawaa,

More information

NCSS Statistical Software. , contains a periodic (cyclic) component. A natural model of the periodic component would be

NCSS Statistical Software. , contains a periodic (cyclic) component. A natural model of the periodic component would be NCSS Saisical Sofware Chaper 468 Specral Analysis Inroducion This program calculaes and displays he periodogram and specrum of a ime series. This is someimes nown as harmonic analysis or he frequency approach

More information

Comparing Means: t-tests for One Sample & Two Related Samples

Comparing Means: t-tests for One Sample & Two Related Samples Comparing Means: -Tess for One Sample & Two Relaed Samples Using he z-tes: Assumpions -Tess for One Sample & Two Relaed Samples The z-es (of a sample mean agains a populaion mean) is based on he assumpion

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI: 0.038/NCLIMATE893 Temporal resoluion and DICE * Supplemenal Informaion Alex L. Maren and Sephen C. Newbold Naional Cener for Environmenal Economics, US Environmenal Proecion

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

Echocardiography Project and Finite Fourier Series

Echocardiography Project and Finite Fourier Series Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

Stability and Bifurcation in a Neural Network Model with Two Delays

Stability and Bifurcation in a Neural Network Model with Two Delays Inernaional Mahemaical Forum, Vol. 6, 11, no. 35, 175-1731 Sabiliy and Bifurcaion in a Neural Nework Model wih Two Delays GuangPing Hu and XiaoLing Li School of Mahemaics and Physics, Nanjing Universiy

More information

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis

Speaker Adaptation Techniques For Continuous Speech Using Medium and Small Adaptation Data Sets. Constantinos Boulis Speaker Adapaion Techniques For Coninuous Speech Using Medium and Small Adapaion Daa Ses Consaninos Boulis Ouline of he Presenaion Inroducion o he speaker adapaion problem Maximum Likelihood Sochasic Transformaions

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

STA 114: Statistics. Notes 2. Statistical Models and the Likelihood Function

STA 114: Statistics. Notes 2. Statistical Models and the Likelihood Function STA 114: Saisics Noes 2. Saisical Models and he Likelihood Funcion Describing Daa & Saisical Models A physicis has a heory ha makes a precise predicion of wha s o be observed in daa. If he daa doesn mach

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates Biol. 356 Lab 8. Moraliy, Recruimen, and Migraion Raes (modified from Cox, 00, General Ecology Lab Manual, McGraw Hill) Las week we esimaed populaion size hrough several mehods. One assumpion of all hese

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

Inequality measures for intersecting Lorenz curves: an alternative weak ordering

Inequality measures for intersecting Lorenz curves: an alternative weak ordering h Inernaional Scienific Conference Financial managemen of Firms and Financial Insiuions Osrava VŠB-TU of Osrava, Faculy of Economics, Deparmen of Finance 7 h 8 h Sepember 25 Absrac Inequaliy measures for

More information

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits DOI: 0.545/mjis.07.5009 Exponenial Weighed Moving Average (EWMA) Char Under The Assumpion of Moderaeness And Is 3 Conrol Limis KALPESH S TAILOR Assisan Professor, Deparmen of Saisics, M. K. Bhavnagar Universiy,

More information

Let us start with a two dimensional case. We consider a vector ( x,

Let us start with a two dimensional case. We consider a vector ( x, Roaion marices We consider now roaion marices in wo and hree dimensions. We sar wih wo dimensions since wo dimensions are easier han hree o undersand, and one dimension is a lile oo simple. However, our

More information

Analyze patterns and relationships. 3. Generate two numerical patterns using AC

Analyze patterns and relationships. 3. Generate two numerical patterns using AC envision ah 2.0 5h Grade ah Curriculum Quarer 1 Quarer 2 Quarer 3 Quarer 4 andards: =ajor =upporing =Addiional Firs 30 Day 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 andards: Operaions and Algebraic Thinking

More information

Introduction D P. r = constant discount rate, g = Gordon Model (1962): constant dividend growth rate.

Introduction D P. r = constant discount rate, g = Gordon Model (1962): constant dividend growth rate. Inroducion Gordon Model (1962): D P = r g r = consan discoun rae, g = consan dividend growh rae. If raional expecaions of fuure discoun raes and dividend growh vary over ime, so should he D/P raio. Since

More information

An Excursion into Set Theory using a Constructivist Approach

An Excursion into Set Theory using a Constructivist Approach An Excursion ino Se Theory using a Consrucivis Approach Miderm Repor Nihil Pail under supervision of Ksenija Simic Fall 2005 Absrac Consrucive logic is an alernaive o he heory of classical logic ha draws

More information

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8) I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Testing for a Single Factor Model in the Multivariate State Space Framework

Testing for a Single Factor Model in the Multivariate State Space Framework esing for a Single Facor Model in he Mulivariae Sae Space Framework Chen C.-Y. M. Chiba and M. Kobayashi Inernaional Graduae School of Social Sciences Yokohama Naional Universiy Japan Faculy of Economics

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities: Mah 4 Eam Review Problems Problem. Calculae he 3rd Taylor polynomial for arcsin a =. Soluion. Le f() = arcsin. For his problem, we use he formula f() + f () + f ()! + f () 3! for he 3rd Taylor polynomial

More information

ACE 564 Spring Lecture 7. Extensions of The Multiple Regression Model: Dummy Independent Variables. by Professor Scott H.

ACE 564 Spring Lecture 7. Extensions of The Multiple Regression Model: Dummy Independent Variables. by Professor Scott H. ACE 564 Spring 2006 Lecure 7 Exensions of The Muliple Regression Model: Dumm Independen Variables b Professor Sco H. Irwin Readings: Griffihs, Hill and Judge. "Dumm Variables and Varing Coefficien Models

More information

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED 0.1 MAXIMUM LIKELIHOOD ESTIMATIO EXPLAIED Maximum likelihood esimaion is a bes-fi saisical mehod for he esimaion of he values of he parameers of a sysem, based on a se of observaions of a random variable

More information

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE

EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Version April 30, 2004.Submied o CTU Repors. EXPLICIT TIME INTEGRATORS FOR NONLINEAR DYNAMICS DERIVED FROM THE MIDPOINT RULE Per Krysl Universiy of California, San Diego La Jolla, California 92093-0085,

More information

Cash Flow Valuation Mode Lin Discrete Time

Cash Flow Valuation Mode Lin Discrete Time IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

DEPARTMENT OF STATISTICS

DEPARTMENT OF STATISTICS A Tes for Mulivariae ARCH Effecs R. Sco Hacker and Abdulnasser Haemi-J 004: DEPARTMENT OF STATISTICS S-0 07 LUND SWEDEN A Tes for Mulivariae ARCH Effecs R. Sco Hacker Jönköping Inernaional Business School

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

PROC NLP Approach for Optimal Exponential Smoothing Srihari Jaganathan, Cognizant Technology Solutions, Newbury Park, CA.

PROC NLP Approach for Optimal Exponential Smoothing Srihari Jaganathan, Cognizant Technology Solutions, Newbury Park, CA. PROC NLP Approach for Opimal Exponenial Smoohing Srihari Jaganahan, Cognizan Technology Soluions, Newbury Park, CA. ABSTRACT Esimaion of smoohing parameers and iniial values are some of he basic requiremens

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Demodulation of Digitally Modulated Signals

Demodulation of Digitally Modulated Signals Addiional maerial for TSKS1 Digial Communicaion and TSKS2 Telecommunicaion Demodulaion of Digially Modulaed Signals Mikael Olofsson Insiuionen för sysemeknik Linköpings universie, 581 83 Linköping November

More information

CHERNOFF DISTANCE AND AFFINITY FOR TRUNCATED DISTRIBUTIONS *

CHERNOFF DISTANCE AND AFFINITY FOR TRUNCATED DISTRIBUTIONS * haper 5 HERNOFF DISTANE AND AFFINITY FOR TRUNATED DISTRIBUTIONS * 5. Inroducion In he case of disribuions ha saisfy he regulariy condiions, he ramer- Rao inequaliy holds and he maximum likelihood esimaor

More information

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should Cambridge Universiy Press 978--36-60033-7 Cambridge Inernaional AS and A Level Mahemaics: Mechanics Coursebook Excerp More Informaion Chaper The moion of projeciles In his chaper he model of free moion

More information

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Matlab and Python programming: how to get started

Matlab and Python programming: how to get started Malab and Pyhon programming: how o ge sared Equipping readers he skills o wrie programs o explore complex sysems and discover ineresing paerns from big daa is one of he main goals of his book. In his chaper,

More information

Probabilistic learning

Probabilistic learning Probabilisic learning Charles Elkan November 8, 2012 Imporan: These lecure noes are based closely on noes wrien by Lawrence Saul. Tex may be copied direcly from his noes, or paraphrased. Also, hese ypese

More information

Problem 1 / 25 Problem 2 / 20 Problem 3 / 10 Problem 4 / 15 Problem 5 / 30 TOTAL / 100

Problem 1 / 25 Problem 2 / 20 Problem 3 / 10 Problem 4 / 15 Problem 5 / 30 TOTAL / 100 eparmen of Applied Economics Johns Hopkins Universiy Economics 602 Macroeconomic Theory and Policy Miderm Exam Suggesed Soluions Professor Sanjay hugh Fall 2008 NAME: The Exam has a oal of five (5) problems

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

Single and Double Pendulum Models

Single and Double Pendulum Models Single and Double Pendulum Models Mah 596 Projec Summary Spring 2016 Jarod Har 1 Overview Differen ypes of pendulums are used o model many phenomena in various disciplines. In paricular, single and double

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H.

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H. ACE 56 Fall 5 Lecure 8: The Simple Linear Regression Model: R, Reporing he Resuls and Predicion by Professor Sco H. Irwin Required Readings: Griffihs, Hill and Judge. "Explaining Variaion in he Dependen

More information

Some Ramsey results for the n-cube

Some Ramsey results for the n-cube Some Ramsey resuls for he n-cube Ron Graham Universiy of California, San Diego Jozsef Solymosi Universiy of Briish Columbia, Vancouver, Canada Absrac In his noe we esablish a Ramsey-ype resul for cerain

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

5.2. The Natural Logarithm. Solution

5.2. The Natural Logarithm. Solution 5.2 The Naural Logarihm The number e is an irraional number, similar in naure o π. Is non-erminaing, non-repeaing value is e 2.718 281 828 59. Like π, e also occurs frequenly in naural phenomena. In fac,

More information

Fishing limits and the Logistic Equation. 1

Fishing limits and the Logistic Equation. 1 Fishing limis and he Logisic Equaion. 1 1. The Logisic Equaion. The logisic equaion is an equaion governing populaion growh for populaions in an environmen wih a limied amoun of resources (for insance,

More information

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes Some common engineering funcions 2.7 Inroducion This secion provides a caalogue of some common funcions ofen used in Science and Engineering. These include polynomials, raional funcions, he modulus funcion

More information

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems.

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems. di ernardo, M. (995). A purely adapive conroller o synchronize and conrol chaoic sysems. hps://doi.org/.6/375-96(96)8-x Early version, also known as pre-prin Link o published version (if available):.6/375-96(96)8-x

More information

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves

Rapid Termination Evaluation for Recursive Subdivision of Bezier Curves Rapid Terminaion Evaluaion for Recursive Subdivision of Bezier Curves Thomas F. Hain School of Compuer and Informaion Sciences, Universiy of Souh Alabama, Mobile, AL, U.S.A. Absrac Bézier curve flaening

More information

= ( ) ) or a system of differential equations with continuous parametrization (T = R

= ( ) ) or a system of differential equations with continuous parametrization (T = R XIII. DIFFERENCE AND DIFFERENTIAL EQUATIONS Ofen funcions, or a sysem of funcion, are paramerized in erms of some variable, usually denoed as and inerpreed as ime. The variable is wrien as a funcion of

More information

Christos Papadimitriou & Luca Trevisan November 22, 2016

Christos Papadimitriou & Luca Trevisan November 22, 2016 U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream

More information

( ) a system of differential equations with continuous parametrization ( T = R + These look like, respectively:

( ) a system of differential equations with continuous parametrization ( T = R + These look like, respectively: XIII. DIFFERENCE AND DIFFERENTIAL EQUATIONS Ofen funcions, or a sysem of funcion, are paramerized in erms of some variable, usually denoed as and inerpreed as ime. The variable is wrien as a funcion of

More information

Class Meeting # 10: Introduction to the Wave Equation

Class Meeting # 10: Introduction to the Wave Equation MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion

More information

Hidden Markov Models

Hidden Markov Models Hidden Markov Models Probabilisic reasoning over ime So far, we ve mosly deal wih episodic environmens Excepions: games wih muliple moves, planning In paricular, he Bayesian neworks we ve seen so far describe

More information

Product Integration. Richard D. Gill. Mathematical Institute, University of Utrecht, Netherlands EURANDOM, Eindhoven, Netherlands August 9, 2001

Product Integration. Richard D. Gill. Mathematical Institute, University of Utrecht, Netherlands EURANDOM, Eindhoven, Netherlands August 9, 2001 Produc Inegraion Richard D. Gill Mahemaical Insiue, Universiy of Urech, Neherlands EURANDOM, Eindhoven, Neherlands Augus 9, 21 Absrac This is a brief survey of produc-inegraion for biosaisicians. 1 Produc-Inegraion

More information

Asymptotic Equipartition Property - Seminar 3, part 1

Asymptotic Equipartition Property - Seminar 3, part 1 Asympoic Equipariion Propery - Seminar 3, par 1 Ocober 22, 2013 Problem 1 (Calculaion of ypical se) To clarify he noion of a ypical se A (n) ε and he smalles se of high probabiliy B (n), we will calculae

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

Ensamble methods: Boosting

Ensamble methods: Boosting Lecure 21 Ensamble mehods: Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Schedule Final exam: April 18: 1:00-2:15pm, in-class Term projecs April 23 & April 25: a 1:00-2:30pm in CS seminar room

More information

Unit Root Time Series. Univariate random walk

Unit Root Time Series. Univariate random walk Uni Roo ime Series Univariae random walk Consider he regression y y where ~ iid N 0, he leas squares esimae of is: ˆ yy y y yy Now wha if = If y y hen le y 0 =0 so ha y j j If ~ iid N 0, hen y ~ N 0, he

More information

20. Applications of the Genetic-Drift Model

20. Applications of the Genetic-Drift Model 0. Applicaions of he Geneic-Drif Model 1) Deermining he probabiliy of forming any paricular combinaion of genoypes in he nex generaion: Example: If he parenal allele frequencies are p 0 = 0.35 and q 0

More information

STRUCTURAL CHANGE IN TIME SERIES OF THE EXCHANGE RATES BETWEEN YEN-DOLLAR AND YEN-EURO IN

STRUCTURAL CHANGE IN TIME SERIES OF THE EXCHANGE RATES BETWEEN YEN-DOLLAR AND YEN-EURO IN Inernaional Journal of Applied Economerics and Quaniaive Sudies. Vol.1-3(004) STRUCTURAL CHANGE IN TIME SERIES OF THE EXCHANGE RATES BETWEEN YEN-DOLLAR AND YEN-EURO IN 001-004 OBARA, Takashi * Absrac The

More information

A Framework for Efficient Document Ranking Using Order and Non Order Based Fitness Function

A Framework for Efficient Document Ranking Using Order and Non Order Based Fitness Function A Framework for Efficien Documen Ranking Using Order and Non Order Based Finess Funcion Hazra Imran, Adii Sharan Absrac One cenral problem of informaion rerieval is o deermine he relevance of documens

More information

Testing the Random Walk Model. i.i.d. ( ) r

Testing the Random Walk Model. i.i.d. ( ) r he random walk heory saes: esing he Random Walk Model µ ε () np = + np + Momen Condiions where where ε ~ i.i.d he idea here is o es direcly he resricions imposed by momen condiions. lnp lnp µ ( lnp lnp

More information

4.1 Other Interpretations of Ridge Regression

4.1 Other Interpretations of Ridge Regression CHAPTER 4 FURTHER RIDGE THEORY 4. Oher Inerpreaions of Ridge Regression In his secion we will presen hree inerpreaions for he use of ridge regression. The firs one is analogous o Hoerl and Kennard reasoning

More information

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi Creep in Viscoelasic Subsances Numerical mehods o calculae he coefficiens of he Prony equaion using creep es daa and Herediary Inegrals Mehod Navnee Saini, Mayank Goyal, Vishal Bansal (23); Term Projec

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

( ) ( ) if t = t. It must satisfy the identity. So, bulkiness of the unit impulse (hyper)function is equal to 1. The defining characteristic is

( ) ( ) if t = t. It must satisfy the identity. So, bulkiness of the unit impulse (hyper)function is equal to 1. The defining characteristic is UNIT IMPULSE RESPONSE, UNIT STEP RESPONSE, STABILITY. Uni impulse funcion (Dirac dela funcion, dela funcion) rigorously defined is no sricly a funcion, bu disribuion (or measure), precise reamen requires

More information

GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT

GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT Inerna J Mah & Mah Sci Vol 4, No 7 000) 48 49 S0670000970 Hindawi Publishing Corp GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT RUMEN L MISHKOV Received

More information

Solutions from Chapter 9.1 and 9.2

Solutions from Chapter 9.1 and 9.2 Soluions from Chaper 9 and 92 Secion 9 Problem # This basically boils down o an exercise in he chain rule from calculus We are looking for soluions of he form: u( x) = f( k x c) where k x R 3 and k is

More information