Axiometrics: Axioms of Information Retrieval Effectiveness Metrics

Size: px
Start display at page:

Download "Axiometrics: Axioms of Information Retrieval Effectiveness Metrics"

Transcription

1 Proceeings of the Secon Australasian Web Conference (AWC 214), Aucklan, New Zealan Axiometrics: Axioms of Information Retrieval Effectiveness Metrics Ey Maalena Stefano Mizzaro Department of Maths an Computer Science University of Uine Uine, Italy Abstract The evaluation of retrieval effectiveness has playe an is playing a central role in Information Retrieval (IR). A specific issue is that there are literally ozens (most likely more than one hunre) IR effectiveness metrics, an counting. In this paper we propose an axiomatic approach to IR effectiveness metrics. We buil on the notions of measure, measurement, an similarity; they allow us to provie a general efinition of IR effectiveness metric. On this basis, we provie a efinition of some common metrics an we then propose an justify some axioms that every effectiveness metric shoul satisfy. We also iscuss some future evelopments. 1 Introuction Effectiveness evaluation is of paramount importance in Information Retrieval (IR). IR has become one of the most evaluation-oriente fiels in computer science since the first IR systems (IRS) were evelope in the late 195 s. Several effectiveness metrics have been propose so far. A survey in 26 (Demartini & Mizzaro 26) counte more than 5 metrics, taking into account only the system oriente effectiveness metrics. In an extene version of the survey (Demartini et al. n..), yet unpublishe, about one hunre metrics are collecte, let alone user-oriente ones or metrics for tasks somehow relate to IR, like filtering, clustering, recommenation, summarization, etc. As state for example in (Robertson 26), there is nothing close to agreement on a common metric that everyone will use. It is a iffuse opinion that ifferent metrics evaluate ifferent aspects of retrieval behavior (Buckley & Voorhees 2, Robertson 26). Each of these metrics has its own avantages but also limitations. Metric choice is neither a simple task, nor it is without consequences: an inaequate metric might mean to waste research efforts improving systems towar a wrong target. However, some researchers simply o not investigate into the suitability of the metric for the problem itself an they seem to choose just the most popular metrics for their experiments. We cannot exclue the temptation for researchers to choose, among all available metrics, those that help corroborating their claims, or even to esign a new metric to this aim. It is not clear what to o when two metrics isagree. Copyright c 214, Australian Computer Society, Inc. This paper appeare at the The Secon Australasian Web Conference (AWC214), Aucklan, New Zealan, January 214. Conferences in Research an Practice in Information Technology (CRPIT), Vol. 155, S. Cranefiel, A. Trotman, J. Yang, E. Reprouction for acaemic, not-for-profit purposes permitte provie this text is inclue. It is clear that a better unerstaning of the formal properties of effectiveness metrics woul help to avoi wasting time in tuning retrieval systems accoring to effectiveness metrics inaequate to specific purposes, an it will also inuce researchers to make explicit an clarify the assumptions behin metrics. This paper proposes an axiomatic approach to effectiveness metrics, presenting some basic axioms that any reasonable metric shoul satisfy an that are formulate in a general way. The paper is structure as follows. Although there has not been much work on formal accounts of IR effectiveness metrics, in Sect. 2 we briefly recall such previous work. To have a common language to state the axioms, we then efine a general framework. In Sect. 3 we rely upon the notions of measurement, measure, an scale of measurement to provie a common efinition of both the output of an IR system an a relevance assessment by a human juge; the notion of similarity between the two is then analyze in Sect. 4. An analysis of the measurement scales use in IR an their implications for similarity is also propose. Measure an similarity allow us to efine the notion of effectiveness metric in Sect. 5. In Sect. 6 some metrics are efine within the framework, to emonstrate its effectiveness power. In Sect. 7, some axioms are state. Conclusions an future work are presente in Sect. 8. This stuy is one of the first steps within a larger project, name Axiometrics, that is a mix of the wors axioms (or axiomatic) an metric an stans for axiomatic approach to IR effectiveness metrics. Axiometrics is one of the research irections iscusse an selecte as interesting at the recent SWIRL meeting ( One companion paper has alreay been publishe on this topic (Busin & Mizzaro 213); although the present work is similar in the first part, we propose a rather ifferent set of axioms an several refinements an improvements: since Sects. 6 an 7 are completely rewritten, about 4% of the paper is novel. 2 Relate Work Although formal approaches have high importance in the IR fiel, they have mainly focusse on the retrieval process rather than on effectiveness metrics themselves (see, e.g., (Fang et al. 24, Fang & Zhai 25)). However, some research specific to effectiveness metrics oes exist, an it is briefly iscusse here. Early attempts have been mae by Swets (Swets 1963), who liste some properties of IR effectiveness metrics, an van Rijsbergen (van Rijsbergen 1979, Ch. 7), who followe an axiomatic approach. In (Bollmann 1984), Bollmann iscusses the risk of obtaining inconsistent evaluations on a ocument collection an 39

2 CRPIT Volume The Web 213 on its subcollections. Two axioms on effectiveness metrics, name the Axiom of monotonicity an the Archimeean axiom, are propose, an their implication is presente as a theorem. These approaches are evelope on the basis of binary relevance (either a ocument is relevant or it is not) an binary retrieval (either a ocument is retrieve or it is not) only. Here we o not make any assumption on the notions of relevance an retrieval (binary, ranke, continuous, etc.). Our approach is meant to be more general. Yao (Yao 1995) focusses on the notion of user preferences to measure the relevance (or usefulness) of ocuments. He aopts a framework where user jugements are escribe as a weak orer. On this basis he then proposes a new effectiveness metric that compares the relative orer of ocuments. The propose metric is prove to be appropriate through an axiomatic approach. More recently, Amigó et al. in (Amigó et al. 29) focus their formal analysis on evaluation metrics for text clustering algorithms fining four basic formal constraints. These constraints shoul be intuitive an coul point out the limitations of each metric. Moreover, it shoul be possible to prove formally which constraint is satisfie by a metric or a metric family, not just empirically. They foun BCube metrics (BCube precision an Bcube recall) to be the only ones satisfying the four propose constraints. Still Amigó et al. in (Amigó et al. 211) iscuss, with a similar approach, a unifie comparative view of metrics for ocument filtering. They obtain that no metric for ocument filtering can satisfy all esirable properties unless a smoothing process is performe. Finally, in an even more recent work (Amigó et al. 213), they start from a set of formal constraints to efine a general metric for ocument organization tasks, that inclue retrieval, clustering, an filtering. 3 Measurement 3.1 Measurement, Measures, an Scales Measurement can be efine as a process aime at etermining a relationship between a physical quantity an a unit of measurement (Wikipeia 212). In 1946, Stevens (Stevens 1946) efine measurement for social sciences as the assignment of numerals to objects or events accoring to some rule. A more recent an wiely aopte efinition of measurement, propose by Michell (Michell 1997), is the numerical estimation an expression of the magnitue of one quantity relative to another. A particularly iscusse issue is how the measurement is expresse. Stevens propose the four stanar measurement scales (Stevens 1946): Nominal, Orinal, Interval, Ratio. This classification has become a traition in various fiels an it provies useful insights (Robertson 26), although it is rather simple, leaves asie some subtleties, an it has been criticize (Velleman & Wilkinson 1993). Inee, ifferent scales an classification have been introuce through time (Michell 1997). A slightly ifferent an quite common classification inclues: Nominal, Orinal, Interval, Log-Interval, Ratio, Absolute. A further one is mae up of ten levels of measurement (Chrisman 1998): (1) Nominal, (2) Grae membership, (3) Orinal, (4) Interval, (5) Log-Interval, (6) Extensive Ratio, (7) Cyclical Ratio, (8) Derive Ratio, (9) Counts, an (1) Absolute. Also terminology is iscusse: for example, Chrisman s position (Chrisman 1998) is that the term level shoul be preferre to scale (in this paper we use scale ). Another iscipline that provies a useful backgroun is Software Measurement (Zuse 1997), where a istinction is mae between measurement an measure: a measurement is the process through which values are assigne to attributes of entities of the real worl; a measure is the result of that process, so it is the assignment of a value to an entity with the goal of characterizing a specifie attribute. In the rest of this paper we refer to measurement an measure with these meanings. 3.2 IR as Relevance Measurement The evaluation process in IR is base on two quantities: (i) an automate evaluation, by an IR system, of the possible relevance of a ocument, an (ii) the user s (or assessor s) estimation of the relevance of a ocument. We propose to use the above concepts to moel these two quantities: given a query, a system tries to measure the relevance of the ocuments to the query, for example to rank the ocuments; given (a escription of) an information nee, an assessor tries to measure the relevance of the ocuments to the nee. We therefore have two kins of relevance measurements (an measures as well): one mae by a system an referre to in the following as system relevance measure(ment), an one mae by a human an referre to in the following as user / assessor / human relevance measure(ment). A notion of measure / measurement common to both quantities (system an assessor) will allow us to efine a notion of similarity among them. In IR, alternative terms to measurement have been an are use, e.g., assessment, jugment, preiction, score, estimate, amount. Some of what follows coul be expresse also without reference to measurement; however, our choice provies a goo groun an allows us to exploit the measurement machinery. Yet on terminological issues, we also note that since in the IR fiel it is common to speak of effectiveness metrics, we stick with this terminology, also because using metrics for the effectiveness metrics avois confusion with the measure(ment) of relevance mae by IRSs an humans. However, let us remark that effectiveness measure woul perhaps be more appropriate because metrics satisfies some peculiar properties that are not require for measures in their general efinitions: not all measures are metrics. Turning to measurement scales, in IR it is common to use two ifferent scales for system an assessor measurements. Moreover, all the items of the traitional scales make sense, as shown by some examples: Nominal: categorizing the ocuments, e.g., as long / short, or on one specific topic among the topics in a set (e.g., Java vs. C++) or from a specific author, etc. A more subtle question is whether relevance classification (into relevant an nonrelevant) is a nominal classification as well: we will come back shortly on this. Orinal: the usual rank of retrieve ocuments by an IRS, but also grae relevance assessments like in the four level relevance scale HRP N ( Highly relevant, Relevant, Partially relevant, an Non-relevant ). Interval: IRSs that try to calibrate their Retrieval Status Values (RSV) may use internal interval scales before proviing the ranke output. Ratio an Absolute. IRSs that try to estimate the amount of relevance in a ocument, as has been suggeste several times (Swets 1963, 4

3 Nominal: Orinal: Interval Ratio / Absolute Unrelate categories Relate Categories: Across scales Within scales < (strict orer) (orer with equality) P. O. (Partial Orer) Ranke Categories Figure 1: Measurement scales in IR Della Mea & Mizzaro 24), or human juges that use magnitue scale estimation (Eisenberg 1988). On a more careful look, however, there are some peculiar features in IR. The usual relevant / nonrelevant binary scale, when use by both IRS an human assessor, can inee be seen as a nominal scale (an this leas to efining some well known metrics like precision an recall). However, the situation is more complex. When a grae relevance scale using more than two values is use, it seems unquestionable that it is not nominal but orinal. This is clear when comparing a grae relevance scale by a human assessor an the usual ranking of ocuments by an IRS. In other terms, in IR, the categories in a nominal scale are often (although not always) relate or, more precisely, ranke. For instance, let us consier a four level relevance scale HRP N, that is recently being use an iscusse quite often. Within this scale, the four categories are naturally ranke, an thus H is more similar to R, less to P an even less to N (an so on), whereas there is no rank among the categories in a classical nominal scale: the HRN P scale is actually orinal. Furthermore, it is quite customary to collapse H, R, P an N into the binary relevance scale R an N as either HR R an PN N or HRP R an N N ( rigi an relaxe mapping in (NTCIR Project 212)). This means that there exists a relationship across the two scales HRPN an RN, an also that the RN scale is orinal itself. However, in the classical binary relevance an binary retrieval case, the RN scale is nominal: the scale of a relevance measurement can be nominal or orinal epening on the scale of the other relevance measurement. Finally, the rank of categories can be partial: if a ocument is on Java, a misclassification into the C++ category is a smaller error than a classification into Sport, but there is not a total ranking of the scale C++, Java, Sport. To make things even more complex, the relationship across two scales may concern ifferent scales: for example, it is possible to compare a ranke output by an IRS (orinal scale) with a relevance jugment by a human assessor using magnitue scale estimation (Eisenberg 1988) (ratio or absolute scale). Thus an IRS an a relevance assessor express measurements of relevance of the ocuments in a set. Each measurement can be expresse in one of the scales in Fig Notation In the following, q represents a query, a ocument, Q a set of queries, an D a set of ocuments. We also use subscripts an primes with the obvious meaning. To istinguish ifferent measurements, we enote by ρ a relevance measurement, by a system relevance measurement, an by an assessor, user or, generally, human relevance measurement. We use Proceeings of the Secon Australasian Web Conference (AWC 214), Aucklan, New Zealan ρ,, an to represent both the measurement process an the measure itself. Given a query q an a ocument, the relevance measurement ρ applie to q an returns the relevance measure ρ(q, ). To represent the relevance measure of many ocuments (those in the set D q ) to a single query q we simply write ρ (q, D q ), that can be efine in set theoretic terms as ρ (q, D q ) = {ρ (q, ) : D q }, an recursively as ρ (q, D q {}) = {ρ (q, )} ρ (q, D q ). The relevance measure of the ocuments D q in the set of ocuments D for the corresponing queries q in Q is ρ (Q, D). It can be efine in set theoretic terms as ρ (Q, D) = {ρ (q, D q ) : ρ (q, ) ρ (q, D q ), q Q, D q D, D q is the subset corresponing to q}, an recursively as ρ (Q {q}, D) = ρ (q, D q ) ρ (Q, D). ρ (q, ) is usually a known value. When it is not known, often it can be compare, for instance by stating that ρ (q, ) < ρ (q, ), i.e., the ocument is more relevant to the query q than the ocument, accoring to the measurement ρ. The scale of a measurement ρ is enote by scale (ρ). Scales are represente by ouble square brackets an. For example, if ρ is on the above mentione four level relevance scale HRP N, then we write scale(ρ) = H, R, P, N ; a rank scale is enote by Rank ; an so on. 4 Similarity By exploiting the previous notions we can frame the notion of similarity between two relevance measurements. We nee a criterion to juge when two relevance measurements, one from the IRS an one from the human assessor, are similar. Ieally, an IRS shoul use the same measurement scale of the human assessor an provie the same measurement of the human assessor. However, IRSs are far from being perfect, an therefore the very same measurement is almost never provie. The aim of an IRS is to provie the measurement that is most similar to the human assessor / user one. Moreover, often the scales are ifferent: scale () can be fixe a priori, e.g., when a test collection provies human relevance assessments, an scale () epens on the retrieval algorithm at han, an ifferent approaches have ifferent scales. Of course, two measurements expresse on two ifferent scales can not be ientical (e.g., a rank can not be ientical to a measurement expresse on a category scale, the usual a-hoc retrieval situation). Summarizing, the IRS shoul provie the measurement that, accoring to the chosen scales, is the most similar to the human one. 4.1 Notation The similarity of two relevance measurements ρ (on a set of ocuments D an a set of queries Q) an ρ (on D an Q ) is enote as sim(ρ(q, D), ρ (Q, D )). If Q = Q then we use the notation sim Q (ρ(d), ρ (D )), or sim Q (ρ(d), ρ (D )), to inicate that Q is common. If D = D then we write sim D (ρ (Q), ρ (Q )). If both Q = Q an D = D we enote the similarity as sim Q,D (ρ, ρ ). We also use the same notation for single ocuments an queries. For example, if both ocument an query 41

4 CRPIT Volume The Web 213 sets contain only one element, i.e., Q = Q = {q} an D = D = {}, we write sim (ρ, ρ ). If there is no ambiguity on the sets of queries an ocuments, then we simply write sim (ρ, ρ ). Following the notation of Sect. 3.3, the similarity between two systems is given by sim (, ), the agreement among juges is sim (, ), while the similarity of a human relevance measurement an a system relevance measurement is sim (, ), that correspons to the effectiveness of an IRS in measuring relevance as the user oes. In the following, for the sake of simplicity, we mainly eal with this last case, sim (, ), but most of the results hol for any pair of relevance measurements. It is important to note that, in this context, it is not always possible express similarity as a number, i.e., it is possible that we can not know, nor even express, the value of sim (, ). More often, similarities can be compare so we can say if sim (, ) < sim (, ) or sim (, ) > sim (, ). For example, let us consier two IRSs that behave in exactly the same way on a set of ocuments D an a set of queries Q, an provie two relevance measurements an. Their similarities with an assessor measure are exactly the same for each ocument. Therefore, sim Q,D (, ) = sim Q,D (, ). If a new ocument / D is available an the two systems have a ifferent behavior on it such that sim Q, (, ) > sim Q, (, ) ( evaluates the ocument in a more similar way to the assessor than ), we can now infer that the similarity on the new whole collection of ocuments D {} is higher for than for : sim (, ) > sim (, Q,D {} Q,D {} ). Note that we i not nee to assign specific values to the various measurements to be able to compare them. Also note that, in general, sim q,d {} (, ) sim q,d (, ) + sim (, ). In the following of this paper we aopt a ifferent approach from (Busin & Mizzaro 213) an we will not use the similarity of sets of queries (Q) an/or ocuments (D): we will nee sim (, ) only, since we will be ealing with sets when working on metrics. 4.2 Similarity an Measurement Scale This generic notion of similarity can be specialize by taking into account the ifferent scales to obtain operational efinitions. The scenario is quite complex. Since is fixe an varies (e.g., by choosing a ifferent retrieval algorithm), similarity is not symmetrical: sim(ρ, ρ ) an sim(ρ, ρ) nee to be efine inepenently. Moreover, each of an can be over nine scales (see Fig. 1): this leas up to 9 9 = 81 cases, see Tab. 1. Actually, some combinations are rule out, because either the combination oes not make sense (e.g., categories can not be both relate an unrelate), or no notion of similarity can be efine (a nominal scale with unrelate categories oes not allow to efine a notion of similarity). However, all the other combinations make sense: although it might seem strange to have a human assessors that ranks the ocuments an an IRS that categorizes them into H, R, P, N, this is not impossible in principle. For space limitations, we only hint at how sim can be efine for some pairs of scales; the escription is at an intuitive level, but it can be formalize. (a) (b) Figure 2: Categories with across-scales relations If an are expresse on ifferent an unrelate category scales, nothing can be sai. For instance, if scale () = A, B (i.e., is a category scale with two categories A an B), an scale () = C, D, an there is no relationship neither across scales (i.e., A an B have nothing to o with neither C nor D) neither within scales (i.e., no relations between A an B, nor between C an D), then sim (, ) can not be efine. For instance, if is a measurement of the topicality of the ocuments an classifies the source of the ocuments (e.g., web page, article, technical report, etc.) then of course an can not be compare. Even if one an only one of scale () an scale () has some within scale relationship, this is not enough to efine any sensible similarity measure (if the two measures share the same categories then a similarity can be efine, but this woul mean that an acrossscales relationship exists). The following postulate simply rules out the unrelate categories scale from further analysis. Postulate 1 (Unrelate categories). Given two measurements, if the scale of at least one of them is an unrelate categories scale, then it is not possible to efine a similarity between the two measurements. There might exist across-scales relationships: some categories of scale () can be relate to some categories of scale (), or even equal (i.e., a particular case of across-scales relationships is scales with the same categories). In this case, similarity can be efine an it epens on the number of correctly an wrongly classifie ocuments: moving one ocument from a wrong category into a correct one (i.e., correctly classifying one more ocument) increases similarity. For instance, let scale() = A, B (i.e., it is expresse on a binary scale), an scale() = A, B, C, D. Let us also assume that it makes sense that ocuments in A shoul also be in A or B, an ocuments in B shoul also be in C or D (see Fig. 2(a)). In such a case, sim (, ) can be efine. Moving items from A to B (or viceversa) or from C to D (or vice-versa) oes not affect sim (, ); similarity varies by moving items from A or B to C or D (or vice-versa), or from A to B (or vice-versa). We can increase sim (, ) by moifying in in such a way that correctly classifies more ocuments. Any other moification (e.g., moving a wrongly classifie item into another wrong category) oes not affect the similarity. Let us consier another ifferent case. Let an be two system relevance measurements such that scale() = scale( ) = Rank an let be a human relevance measurement. If is obtaine from by swapping two ajacent ocuments an moving a more relevant ocument above a less relevant one without affecting anything else (see Fig. 2(b)), then sim(, ) > sim(, ). The notions of more an less relevant are efine if scale() is any orinal, interval, 42

5 Nominal Proceeings of the Secon Australasian Web Conference (AWC 214), Aucklan, New Zealan Nominal Orinal Interval Ratio / Un- Relate < P.O. Ranke Absolute relate Acr. With. Categ. Unrelate Relate Across Within < Orinal P.O. Ranke Categories Interval Ratio / Absolute Table 1: Similarity between relevance measurement scales. Empty cells mean that similarity can be efine. means that the combination oes not make sense. means that no notion of similarity can be efine. or ratio / absolute scale. The move of a less relevant ocument below a more relevant one is equivalent. Other changes can always be obtaine by several ajacent swaps (as it is well known, any permutation is a sequence of swaps). 5 Effectiveness Metric On the basis of the concepts of measurement, measurement scales, an similarity we now turn to moeling the effectiveness metrics itself. An effectiveness metric provies a numerical representation of the similarity between two relevance measurements. A metric is then a function that takes as arguments two measurements an, a set of ocuments D, an a set of queries Q, an provies as output a numeric value (usually in R): metric : D Q R. (1) A metric is efine on the basis of five components: scale(), scale(), a notion of similarity sim, how the values on single ocuments are average over the set D (we enote the corresponing averaging function with avgd), an how these averages are average over the set Q (avgq). We can write metric (scale(), scale(), sim, avgd, avgq). In most cases we o not nee to specify all the components of the effectiveness metric. Also, we o not nee to refer to the metric itself, but rather to the value of the metric in a specific measurement (the context will resolve the ambiguity). For instance we enote with metric((q, D), (Q, D)) the effectiveness metric value obtaine when evaluating with respect to on the set of queries Q an on the set of ocuments D. Usually, Q an D are common to an, so we often write metric Q,D (, ).When not neee, we omit Q an D as in metric(, ), an sometimes in place of sets Q an D we also use single elements q an, as in metric (, ). 6 Some Metrics In this section we efine the sim, avgd an avgq functions for some common metrics, to emonstrate that the framework an notation shoul be general enough to moel most (if not all) effectiveness metrics. Table 2 presents tentative efinitions of some common metrics. The table shoul be unerstanable, but we briefly iscuss some metrics. For instance, for both Precision an Recall, scale() = scale() = R, N. Let Rel = { D () = R} an Ret = { D () = R} be the sets of relevant an retrieve ocuments, respectively. Similarity is (see the table) 1 if a ocument is both retrieve an relevant an otherwise., the avgd functions for Precision an Recall are the arithmetic means over the sets Ret an Rel, respectively, an both the avgq functions are the arithmetic mean over the set Q. For MAP an MAP-like metrics (GMAP, logitap, yaap, etc.) we have two ifferent scales: scale() = R, N an scale() = Rank. Similarity is more complex on a rank (see the formula in the table), since to unerstan the similarity of a ocument we nee to analize also other ocuments in the rank. Similarity can then be use to efine AP values an then MAP is obtaine using as avgq the arithmetic mean of the AP values. GMAP is similar to MAP, the only ifference being on avgq since in GMAP the geometric mean is use. GMAP can also be efine in an equivalent way as the average of logarithms, an logitap efinition is similar. (an yaap shoul be similar as well). The table also inclues MAP@n, i.e., MAP compute averaging only the AP values of the relevant ocuments retrieve in the first n rank positions, an consiering as the AP of ocuments retrieve after rank n (this is the metric use in TREC-like settings). The last row efines ADM (Della Mea & Mizzaro 24). 7 Axioms We now can list some axioms: they efine properties that, ceteris paribus, any effectiveness metric shoul satisfy. Axioms can also be interprete as a set of constraints on a search space. We formalize as axioms the properties of similarity between relevance measurements (Subsection 7.1), we then present some axioms that efine the relationships between similarity an metrics (Subsection 7.2), an we then present metric-specific axioms (Subsection 7.3). 7.1 Similarity The first axioms represent basic constraints on similarity, an metrics are not concerne yet. Axiom 1 (Similarity of ocuments). Let q be a query, an two ocuments, a human relevance measurement an a system relevance measurement such that (q, ) = (q, ) an (q, ) = (q, ). sim (, ) = sim (, ). Axiom 2 (Similarity of queries). Let q an q be two queries, a ocument, a human relevance measurement an a system relevance measurement such that (q, ) = (q, ) an (q, ) = (q, ). sim (, ) = sim (, ). q, 43

6 CRPIT Volume The Web 213 Metric scale() scale() sim(, ) avgd avgq Precision R, N { 1 if () = () otherwise, Pq = 1 Ret Recall Rq = 1 Ret sim 1 (, ) Rel sim 1 (, ) q Q P q q Q R q P@n = 1 n Rel () n sim 1 (, ) q Q q R-Prec R-Precq = 1 R, N Rank 1 if () = R 1 if () = N ( ) = R ( ) > () otherwise. MAP APq = 1 MAP@n APq = 1 Rel () sim 1 (, ) Rel sim 1 (, ) Rel & () n sim 1 (, ) q Q R-Prec q q Q AP q q Q AP q GMAP APq = 1 Rel sim (, ) q Q AP q 1 q Q log AP q logitap APq = 1 Rel sim 1 (, ) q Q AP +ɛ log 1 AP +ɛ ADM [, +1] [, +1] (q, ) (q, ) ADMq = 1 1 D i D sim (, ) i q Q ADM q Table 2: Metrics on the basis of their components as per formula. 1 44

7 Proceeings of the Secon Australasian Web Conference (AWC 214), Aucklan, New Zealan (a) Divergent (b) Divergent (c) Overestimate (a) Overestimate (b) Unerestimate () Unerestimate (c) Divergent () Divergent Figure 3: Two systems having equal similarity to Figure 4: Two systems with ifferent similarity to Axiom 3 (Similarity of two systems). Let q be a query, a ocument, a human relevance measurement an an two system relevance measurements such that Axiom 5 (Systems with ifferent effectiveness). Let q be a query, a ocument, a human relevance measurement an an two system relevance measurements such that (q, ) = (q, ). sim (, ) > sim (, ) (4) sim (, ) > sim (, ). (5) (2) sim (, ) = sim (, ). (3) Let us remark that (3) oes not entail (2). Diagrams like those in Fig. 3 can be helpful to intuitively unerstan the situation: Figs. 3(a) an 3(b) represent the cases in which an respectively overestimate an unerestimate (or vice-versa) by the same amount; then the similarity (represente in the figure by the two arcs on the right) of the two systems is the same, but obviously (2) oes not hol. Conversely, as state by the axiom, when (2) hols then (3) hols as well (Figs. 3(c) an 3()) From Similarity to Metric Different systems an metric (, ) > metric (, ). Remark 2. Conition (4) means that is less wrong than. The combination of (4) an (5) means that the two systems are wrong in the same irection: if overestimates (unerestimates), then overestimates (unerestimates) it even more. Figs. 4(a) an 4(b) show these two cases. Conition (4) rules out the other two situations, shown in Figs. 4(c) an 4(), in which no constraint on the metric can be state for the same reasons mentione in Remark 1. The following axiom sets a constraint on the metric in one of the two last cases of Fig. 3 (Figs. 3(c) an 3()) Axiom 4 (Systems with equal effectiveness). Let q be a query, a ocument, a human relevance measurement an an two system relevance measurements such that We now turn to compare a system measurement for two ocuments an. Let us assume, without loss of generality, that is more relevant than (() > ( )). We can consier two cases: Different ocuments sim (, ) > sim (, ) (see Fig. 5(a)); (q, ) = (q, ). sim (, ) < sim (, ) (Fig. 5(b)). metric (, ) = metric (, ). Remark 1. Note that by using, in this axiom, a conition like (2) an not like (3) the first two cases of Fig. 3 are rule out, an inee in those cases we cannot state any constraint on the metric: a recalloriente metric woul give a higher value to a system overestimating all the ocuments (retrieving all ocuments means that recall is 1), whereas a precisionoriente metric woul o the opposite. In the first case we have a smaller error in the more relevant ocument an a larger error in less relevant ocument. In such a case, no constraint can be state on the metric since, as it is often state, earlier rank positions are more important than later ones. Conversely, the secon case allows to state some axioms. We analyze it an we start by observing that, since the system coul overestimate or unerestimate the ocuments an, the case of Fig. 5(b) can be subivie into four cases: 45

8 CRPIT Volume The Web 213 an (6) an (8) hol (i.e., both an are overestimate), then metric (, ) > metric (, ). The secon axiom concerns the case of Fig. 6(b). Axiom 7 (Unerestimate ocuments). Let q be a query, an two ocuments, a human relevance measurement an a system relevance measurements such that () > ( ), (a) Best top similarity (b) Best bottom similarity () > ( ), sim (, ) < sim (, ), Figure 5: Two ocuments with ifferent similarity an (7) an (9) hol (i.e., both an are unerestimate), then metric (, ) > metric (, ). (c) Divergent () Divergent (a) Both over- (b) Both estimate unerestimate Remark 3. The conition (1) rules out the critical case in which both ocuments an are unerestimate but there is a swap as shown in Fig. 7. In such a case, although the similarity is higher for, no constraint can be impose on the metric, again for the reason of Remark 1 about top rank positions. In Axiom 6, this aitional conition is not necessary, because if both ocuments are overestimate an similarity is higher for the less relevant ocument, then no swap is possible. Figure 6: Four possible cases overestimates both an (see Fig. 6(a)); unerestimates both an (Fig. 6(b)); overestimates (Fig. 6(c)); unerestimates (Fig. 6()). an an unerestimates overestimates Only the first two cases allow to express some constraints on the metric, again for the same reason of Remark 1 (an 2). We analyze the first two cases. Let us start by noting that: if overestimates then sim (( ), ()) < sim ((), ( )); In the following we will nee to write that a metric value is more affecte by a ocument than by another ocument. Formally, we efine: Definition 1. We write that Ametric(,) if an only if sim (( ), ()) > sim ((), ( )); (7) metric (, ) metric (, ) > q,d q,d {} if overestimates then sim ((), ( )) > sim ((), ( )); metric (, ) metric (, ) sim ((), ( )) < sim ((), ( )). (9) We can now state the following two axioms. The first concerns the case of Fig. 6(a). Axiom 6 (Overestimate ocuments). Let q be a query, an two ocument, a human relevance measurement an a system relevance measurements such that () > ( ), q,d (to be rea as affects metric value more than ). sim (, ) < sim (, ) q,d { } (8) an if unerestimates then 46 Figure 7: Critical situation (6) if unerestimates then (1) Analogously, we will write wmetric(,) an we will also v, an with similar meanings. A similar notation hols for queries. Axiom 8 (System relevance). Let q be a query, an two ocuments, a human relevance measurement an a system relevance measurement such that sim (, ) = sim (, ), () > ( ), an () ( ). Ametric(,). (11)

9 This means that if system relevance measures on two ocuments an are equally correct, an system relevance of is higher than system relevance of, then the effectiveness metric shoul be more affecte by than by (provie that is not less relevant than ). As alreay mentione, it is usually state that early rank positions affect a metric value more than later rank positions. This can be erive as a corollary of the previous axiom (that states a more general principle, inepenent of the scales) simply by taking scale() = Rank. A symmetric axiom can also be state on user relevance measurement: a metric shoul weigh more, an be more affecte, by more relevant ocuments. This is perhaps less intuitive than the previous one, but it oes inee seem natural in this framework. Moreover, it is quite easy for an IRS to evaluate a nonrelevant ocument as non-relevant, since the vast majority of ocuments in the atabase are non-relevant. Thus, an IRS stating that a non-relevant ocument is non-relevant is somehow oing an easy job, an shoul not be reware too much for it. On the other han it shoul be reware when correctly ientifying a relevant ocument. This is generalize an formalize as follows. Axiom 9 (User relevance). Let q be a query, an two ocuments, a human relevance measurement an a system relevance measurement such that: sim (, ) = sim (, ), () > ( ), an () ( ). (12) metric(,). Remark 4. Conitions (11) in Axiom 8 an (12) in Axiom 9 an are neee to rule out the case in which the two axioms woul result inconsistent. Finally, the following axiom eals with the last case. Axiom 1 (Same relevance). Let q be a query, an two ocuments, a human relevance measurement an a system relevance measurement such that sim (, ) = sim (, ). If () = ( ) an () = ( ) then 7.3 Metrics metric(,). We now turn to the last set of axioms, that are specifically about metrics. The following axiom formalizes Swets s properties (see Sect. 2). To simplify its formulation we enote by the theoretically worst performance, i.e., the relevance measure that gives the worst possible performance accoring to a given assessor relevance measure. Axiom 11 (Zero an maximum). An effectiveness metric shoul have a true zero in an a maximum value M. The theoretically worst (best) performances shoul give (M) as the metric value. As a normalization convention let M = 1 such that metric, range(metric) = [, 1], metric (, ) = 1, an metric (, ) =. Axiom 12 (Document monotonicity). Let q be a query, D an D two sets of ocuments such that D D =, a human relevance measurement an Proceeings of the Secon Australasian Web Conference (AWC 214), Aucklan, New Zealan an two system relevance measurements such that: 1 an metric (, ) > metric (, q,d (=) q,d ) (13) (>) metric (, ) > metric (, ). (14) q,d (=) q,d (=) metric (, ) > metric (, q,d D (=) q,d D ). (15) (>) A similar axiom hols for queries, as follows. Axiom 13 (Query monotonicity). Let Q an Q be two query sets such that Q Q =, D a ocument set, a human relevance measurement an an two system relevance measurements such that: an metric (, ) > metric (, Q,D (=) Q,D ) (>) metric (, ) > metric Q,D (=) (=) Q,D (, ). metric (, ) > metric (, Q Q,D (=) Q Q,D ). (>) These two last axioms can also be interprete as constraints on the avgd an avgq functions, respectively. 8 Conclusions an Future Work Builing on measure, measurement, an similarity, we have efine a framework that has been teste by using it to efine some common metrics an to propose some axioms on IR effectiveness metrics. Our contribution is fourfol: (i) the proposal of using measurement to moel in a uniform way both system output an human relevance assessment, an the analysis of the ifferent measurement scales use in IR; (ii) the notions of similarity among ifferent measurement scales an the consequent efinition of metric; (iii) the efinitions of some metrics within the framework; an (iv) the axioms themselves. A future irection concerns the measurement scales: we propose a set of scales specific for IR, an this proposal has been aequate for this paper; however, the literature on measurement scales is quite rich, an the IR case shoul perhaps be linke more carefully with it. An obvious future irection is the efinition of some theorems, that we have omitte for space limitations. Again for space limitations we have omitte axioms on iversity, novelty an session metrics, as well as metrics taking into account the notion of ifficulty / ease of query an ocuments. On these issues, something is hinte in (Busin & Mizzaro 213). Strictness of an effectiveness metric is another interesting property: a metric is strict if having a high value of it implies that also the other metrics will have a high value (i.e., being effective accoring to a 1 In this axiom the equal = an less than < signs have obviously to be paire in the appropriate way, row by row. We use this notation for the sake of brevity an to avoi to state three ifferent an very similar axioms. 47

10 CRPIT Volume The Web 213 strict metric means also being effective accoring to other metrics). The set of axioms coul be more structure: some of them coul be more basic an some others coul be erive from the basic ones. It is also possible that more funamental axioms can be foun, for example on the basic notions of measurement an similarity, an that the axioms state in this paper can inee be theorems erive from those more funamental axioms. On similar issues, although we have state our axioms in a general way, axioms (an theorems) for specific IR tasks an scenarios coul probably be erive from the set of general axioms. Of course, it woul be interesting to analyze other existing metrics, to see if they satisfy the axioms. It is also possible that ifferent sets of axioms can be ientifie. In this paper we have shown a possible set, but the question is left open whether there exist other ifferent sets, an if they are equivalent or perhaps even contraictory. Inee the axioms in (Busin & Mizzaro 213) are completely ifferent from those propose here; we leave as future work a etaile comparison of the two sets, but we remark that the combination of this paper an (Busin & Mizzaro 213) emonstrates that the framework is expressive an allows to formally reason on effectiveness metrics. Besies axioms an theorems, it woul be interesting to think of esierata (esirable properties, supporte by common sense, that coul be useful in some scenarios) an empirical properties (those that emerge from ata, i.e., from actual test collections an system comparisons). Those coul cover aspects like robustness or statistical correlation between effectiveness metrics. We believe that our research has also shown that basic features of metrics might be quite ifferent from those usually iscusse in the classical a-hoc retrieval situation (binary / category relevance an ranking retrieval), where we usually speak of early rank positions, rank swaps, etc. Finally, it woul be interesting to implement a software to analyze metrics by specifying some parameters corresponing to the values of specific components. Acknowlegments We thank Julio Gonzalo an Enrique Amigó for long an interesting iscussions, Evangelos Kanoulas an Enrique Alfonseca for helping to frame the Axiometrics research project, Arjen e Vries for suggesting the name Axiometrics, an organizers of (an participants to) SWIRL 212. This work has been partially supporte by a Google Research Awar. References Amigó, E., Gonzalo, J., Artiles, J. & Verejo, F. (29), A comparison of extrinsic clustering evaluation metrics base on formal constraints, Information Retrieval 12(4), Amigó, E., Gonzalo, J. & Verejo, F. (211), A comparison of evaluation metrics for ocument filtering, in CLEF, Vol of LNCS, Springer, pp Amigó, E., Gonzalo, J. & Verejo, F. (213), A general evaluation measure for ocument organization tasks, in G. J. F. Jones, P. Sherian, D. Kelly, M. e Rijke & T. Sakai, es, SIGIR, ACM, pp Bollmann, P. (1984), Two axioms for evaluation measures in information retrieval, in SIGIR 84, British Computer Society, Swinton, UK, pp Buckley, C. & Voorhees, E. M. (2), Evaluating evaluation measure stability, in SIGIR, ACM, New York, NY, USA, pp Busin, L. & Mizzaro, S. (213), Axiometrics: An Axiomatic Approach to Information Retrieval Effectiveness Metrics, in ICTIR 213 Proceeings of the 4th International Conference on the Theory of Information Retrieval. To appear. Chrisman, N. R. (1998), Rethinking levels of measurement for cartography, Cartography an Geographic Information Science 25(4), Della Mea, V. & Mizzaro, S. (24), Measuring retrieval effectiveness: A new proposal an a first experimental valiation, Journal of the American Society for Information Science an Technology 55(6), Demartini, G., Kazai, G. & Mizzaro, S. (n..), A survey an classification of information retrieval effectiveness metrics. Draft. Demartini, G. & Mizzaro, S. (26), A Classification of IR Effectiveness Metrics, in ECIR 26, Vol of LNCS, pp Eisenberg, M. B. (1988), Measuring relevance jugments, Informat. Process. & Management 24(4), Fang, H., Tao, T. & Zhai, C. (24), A formal stuy of information retrieval heuristics, in SIGIR 4, ACM, New York, NY, USA, pp Fang, H. & Zhai, C. (25), An exploration of axiomatic approaches to information retrieval, in SI- GIR 5, pp Michell, J. (1997), Quantitative science an the efinition of measurement in psychology, British Journal of Psychology 88(3), NTCIR Project (212), jp/ntcir/inex-en.html. [Last visit: August 213]. Robertson, S. (26), On GMAP: an other transformations, in CIKM 6, New York, USA, pp Stevens, S. S. (1946), On the theory of scales of measurement, Science 13 (2684), Swets, J. A. (1963), Information retrieval systems, Science 141, van Rijsbergen, C. J. (1979), Information Retrieval, 2n en, Butterworths. Velleman, P. F. & Wilkinson, L. (1993), Nominal, Orinal, Interval, an Ratio Typologies Are Misleaing, The American Statistician 47(1), Wikipeia (212), Measurement Wikipeia, the free encyclopeia, wiki/measurement. [Last visit: August 213]. Yao, Y. Y. (1995), Measuring retrieval effectiveness base on user preference of ocuments, Journal of the American Society for Information Science 46(2), Zuse, H. (1997), A Framework of Software Measurement, Walter e Gruyter. 48

Axiometrics: Axioms of Information Retrieval Effectiveness Metrics

Axiometrics: Axioms of Information Retrieval Effectiveness Metrics Axiometrics: Axioms of Information Retrieval Effectiveness Metrics ABSTRACT Ey Maalena Department of Maths Computer Science University of Uine Uine, Italy ey.maalena@uniu.it There are literally ozens most

More information

The Axiometrics Project

The Axiometrics Project The Axiometrics Project Eddy Maddalena and Stefano Mizzaro Department of Mathematics and Computer Science University of Udine Udine, Italy eddy.maddalena@uniud.it, mizzaro@uniud.it Abstract. The evaluation

More information

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013 Survey Sampling Kosuke Imai Department of Politics, Princeton University February 19, 2013 Survey sampling is one of the most commonly use ata collection methos for social scientists. We begin by escribing

More information

Schrödinger s equation.

Schrödinger s equation. Physics 342 Lecture 5 Schröinger s Equation Lecture 5 Physics 342 Quantum Mechanics I Wenesay, February 3r, 2010 Toay we iscuss Schröinger s equation an show that it supports the basic interpretation of

More information

. Using a multinomial model gives us the following equation for P d. , with respect to same length term sequences.

. Using a multinomial model gives us the following equation for P d. , with respect to same length term sequences. S 63 Lecture 8 2/2/26 Lecturer Lillian Lee Scribes Peter Babinski, Davi Lin Basic Language Moeling Approach I. Special ase of LM-base Approach a. Recap of Formulas an Terms b. Fixing θ? c. About that Multinomial

More information

The Exact Form and General Integrating Factors

The Exact Form and General Integrating Factors 7 The Exact Form an General Integrating Factors In the previous chapters, we ve seen how separable an linear ifferential equations can be solve using methos for converting them to forms that can be easily

More information

6 General properties of an autonomous system of two first order ODE

6 General properties of an autonomous system of two first order ODE 6 General properties of an autonomous system of two first orer ODE Here we embark on stuying the autonomous system of two first orer ifferential equations of the form ẋ 1 = f 1 (, x 2 ), ẋ 2 = f 2 (, x

More information

Vectors in two dimensions

Vectors in two dimensions Vectors in two imensions Until now, we have been working in one imension only The main reason for this is to become familiar with the main physical ieas like Newton s secon law, without the aitional complication

More information

Linear First-Order Equations

Linear First-Order Equations 5 Linear First-Orer Equations Linear first-orer ifferential equations make up another important class of ifferential equations that commonly arise in applications an are relatively easy to solve (in theory)

More information

Optimization of Geometries by Energy Minimization

Optimization of Geometries by Energy Minimization Optimization of Geometries by Energy Minimization by Tracy P. Hamilton Department of Chemistry University of Alabama at Birmingham Birmingham, AL 3594-140 hamilton@uab.eu Copyright Tracy P. Hamilton, 1997.

More information

The total derivative. Chapter Lagrangian and Eulerian approaches

The total derivative. Chapter Lagrangian and Eulerian approaches Chapter 5 The total erivative 51 Lagrangian an Eulerian approaches The representation of a flui through scalar or vector fiels means that each physical quantity uner consieration is escribe as a function

More information

Robustness and Perturbations of Minimal Bases

Robustness and Perturbations of Minimal Bases Robustness an Perturbations of Minimal Bases Paul Van Dooren an Froilán M Dopico December 9, 2016 Abstract Polynomial minimal bases of rational vector subspaces are a classical concept that plays an important

More information

Chapter 6: Energy-Momentum Tensors

Chapter 6: Energy-Momentum Tensors 49 Chapter 6: Energy-Momentum Tensors This chapter outlines the general theory of energy an momentum conservation in terms of energy-momentum tensors, then applies these ieas to the case of Bohm's moel.

More information

Diophantine Approximations: Examining the Farey Process and its Method on Producing Best Approximations

Diophantine Approximations: Examining the Farey Process and its Method on Producing Best Approximations Diophantine Approximations: Examining the Farey Process an its Metho on Proucing Best Approximations Kelly Bowen Introuction When a person hears the phrase irrational number, one oes not think of anything

More information

SYNCHRONOUS SEQUENTIAL CIRCUITS

SYNCHRONOUS SEQUENTIAL CIRCUITS CHAPTER SYNCHRONOUS SEUENTIAL CIRCUITS Registers an counters, two very common synchronous sequential circuits, are introuce in this chapter. Register is a igital circuit for storing information. Contents

More information

Separation of Variables

Separation of Variables Physics 342 Lecture 1 Separation of Variables Lecture 1 Physics 342 Quantum Mechanics I Monay, January 25th, 2010 There are three basic mathematical tools we nee, an then we can begin working on the physical

More information

Construction of the Electronic Radial Wave Functions and Probability Distributions of Hydrogen-like Systems

Construction of the Electronic Radial Wave Functions and Probability Distributions of Hydrogen-like Systems Construction of the Electronic Raial Wave Functions an Probability Distributions of Hyrogen-like Systems Thomas S. Kuntzleman, Department of Chemistry Spring Arbor University, Spring Arbor MI 498 tkuntzle@arbor.eu

More information

Implicit Differentiation

Implicit Differentiation Implicit Differentiation Thus far, the functions we have been concerne with have been efine explicitly. A function is efine explicitly if the output is given irectly in terms of the input. For instance,

More information

2Algebraic ONLINE PAGE PROOFS. foundations

2Algebraic ONLINE PAGE PROOFS. foundations Algebraic founations. Kick off with CAS. Algebraic skills.3 Pascal s triangle an binomial expansions.4 The binomial theorem.5 Sets of real numbers.6 Surs.7 Review . Kick off with CAS Playing lotto Using

More information

inflow outflow Part I. Regular tasks for MAE598/494 Task 1

inflow outflow Part I. Regular tasks for MAE598/494 Task 1 MAE 494/598, Fall 2016 Project #1 (Regular tasks = 20 points) Har copy of report is ue at the start of class on the ue ate. The rules on collaboration will be release separately. Please always follow the

More information

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control

19 Eigenvalues, Eigenvectors, Ordinary Differential Equations, and Control 19 Eigenvalues, Eigenvectors, Orinary Differential Equations, an Control This section introuces eigenvalues an eigenvectors of a matrix, an iscusses the role of the eigenvalues in etermining the behavior

More information

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics This moule is part of the Memobust Hanbook on Methoology of Moern Business Statistics 26 March 2014 Metho: Balance Sampling for Multi-Way Stratification Contents General section... 3 1. Summary... 3 2.

More information

Math 1B, lecture 8: Integration by parts

Math 1B, lecture 8: Integration by parts Math B, lecture 8: Integration by parts Nathan Pflueger 23 September 2 Introuction Integration by parts, similarly to integration by substitution, reverses a well-known technique of ifferentiation an explores

More information

05 The Continuum Limit and the Wave Equation

05 The Continuum Limit and the Wave Equation Utah State University DigitalCommons@USU Founations of Wave Phenomena Physics, Department of 1-1-2004 05 The Continuum Limit an the Wave Equation Charles G. Torre Department of Physics, Utah State University,

More information

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency

Transmission Line Matrix (TLM) network analogues of reversible trapping processes Part B: scaling and consistency Transmission Line Matrix (TLM network analogues of reversible trapping processes Part B: scaling an consistency Donar e Cogan * ANC Eucation, 308-310.A. De Mel Mawatha, Colombo 3, Sri Lanka * onarecogan@gmail.com

More information

What s in an Attribute? Consequences for the Least Common Subsumer

What s in an Attribute? Consequences for the Least Common Subsumer What s in an Attribute? Consequences for the Least Common Subsumer Ralf Küsters LuFG Theoretical Computer Science RWTH Aachen Ahornstraße 55 52074 Aachen Germany kuesters@informatik.rwth-aachen.e Alex

More information

Introduction to the Vlasov-Poisson system

Introduction to the Vlasov-Poisson system Introuction to the Vlasov-Poisson system Simone Calogero 1 The Vlasov equation Consier a particle with mass m > 0. Let x(t) R 3 enote the position of the particle at time t R an v(t) = ẋ(t) = x(t)/t its

More information

Bohr Model of the Hydrogen Atom

Bohr Model of the Hydrogen Atom Class 2 page 1 Bohr Moel of the Hyrogen Atom The Bohr Moel of the hyrogen atom assumes that the atom consists of one electron orbiting a positively charge nucleus. Although it oes NOT o a goo job of escribing

More information

On the enumeration of partitions with summands in arithmetic progression

On the enumeration of partitions with summands in arithmetic progression AUSTRALASIAN JOURNAL OF COMBINATORICS Volume 8 (003), Pages 149 159 On the enumeration of partitions with summans in arithmetic progression M. A. Nyblom C. Evans Department of Mathematics an Statistics

More information

A simple model for the small-strain behaviour of soils

A simple model for the small-strain behaviour of soils A simple moel for the small-strain behaviour of soils José Jorge Naer Department of Structural an Geotechnical ngineering, Polytechnic School, University of São Paulo 05508-900, São Paulo, Brazil, e-mail:

More information

Part I: Web Structure Mining Chapter 1: Information Retrieval and Web Search

Part I: Web Structure Mining Chapter 1: Information Retrieval and Web Search Part I: Web Structure Mining Chapter : Information Retrieval an Web Search The Web Challenges Crawling the Web Inexing an Keywor Search Evaluating Search Quality Similarity Search The Web Challenges Tim

More information

Proof by Mathematical Induction.

Proof by Mathematical Induction. Proof by Mathematical Inuction. Mathematicians have very peculiar characteristics. They like proving things or mathematical statements. Two of the most important techniques of mathematical proof are proof

More information

Table of Common Derivatives By David Abraham

Table of Common Derivatives By David Abraham Prouct an Quotient Rules: Table of Common Derivatives By Davi Abraham [ f ( g( ] = [ f ( ] g( + f ( [ g( ] f ( = g( [ f ( ] g( g( f ( [ g( ] Trigonometric Functions: sin( = cos( cos( = sin( tan( = sec

More information

MATHEMATICAL REPRESENTATION OF REAL SYSTEMS: TWO MODELLING ENVIRONMENTS INVOLVING DIFFERENT LEARNING STRATEGIES C. Fazio, R. M. Sperandeo-Mineo, G.

MATHEMATICAL REPRESENTATION OF REAL SYSTEMS: TWO MODELLING ENVIRONMENTS INVOLVING DIFFERENT LEARNING STRATEGIES C. Fazio, R. M. Sperandeo-Mineo, G. MATHEMATICAL REPRESENTATION OF REAL SYSTEMS: TWO MODELLING ENIRONMENTS INOLING DIFFERENT LEARNING STRATEGIES C. Fazio, R. M. Speraneo-Mineo, G. Tarantino GRIAF (Research Group on Teaching/Learning Physics)

More information

JUST THE MATHS UNIT NUMBER DIFFERENTIATION 2 (Rates of change) A.J.Hobson

JUST THE MATHS UNIT NUMBER DIFFERENTIATION 2 (Rates of change) A.J.Hobson JUST THE MATHS UNIT NUMBER 10.2 DIFFERENTIATION 2 (Rates of change) by A.J.Hobson 10.2.1 Introuction 10.2.2 Average rates of change 10.2.3 Instantaneous rates of change 10.2.4 Derivatives 10.2.5 Exercises

More information

Lower bounds on Locality Sensitive Hashing

Lower bounds on Locality Sensitive Hashing Lower bouns on Locality Sensitive Hashing Rajeev Motwani Assaf Naor Rina Panigrahy Abstract Given a metric space (X, X ), c 1, r > 0, an p, q [0, 1], a istribution over mappings H : X N is calle a (r,

More information

The Press-Schechter mass function

The Press-Schechter mass function The Press-Schechter mass function To state the obvious: It is important to relate our theories to what we can observe. We have looke at linear perturbation theory, an we have consiere a simple moel for

More information

CUSTOMER REVIEW FEATURE EXTRACTION Heng Ren, Jingye Wang, and Tony Wu

CUSTOMER REVIEW FEATURE EXTRACTION Heng Ren, Jingye Wang, and Tony Wu CUSTOMER REVIEW FEATURE EXTRACTION Heng Ren, Jingye Wang, an Tony Wu Abstract Popular proucts often have thousans of reviews that contain far too much information for customers to igest. Our goal for the

More information

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21

'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21 Large amping in a structural material may be either esirable or unesirable, epening on the engineering application at han. For example, amping is a esirable property to the esigner concerne with limiting

More information

A Second Time Dimension, Hidden in Plain Sight

A Second Time Dimension, Hidden in Plain Sight A Secon Time Dimension, Hien in Plain Sight Brett A Collins. In this paper I postulate the existence of a secon time imension, making five imensions, three space imensions an two time imensions. I will

More information

Mark J. Machina CARDINAL PROPERTIES OF "LOCAL UTILITY FUNCTIONS"

Mark J. Machina CARDINAL PROPERTIES OF LOCAL UTILITY FUNCTIONS Mark J. Machina CARDINAL PROPERTIES OF "LOCAL UTILITY FUNCTIONS" This paper outlines the carinal properties of "local utility functions" of the type use by Allen [1985], Chew [1983], Chew an MacCrimmon

More information

Calculus of Variations

Calculus of Variations 16.323 Lecture 5 Calculus of Variations Calculus of Variations Most books cover this material well, but Kirk Chapter 4 oes a particularly nice job. x(t) x* x*+ αδx (1) x*- αδx (1) αδx (1) αδx (1) t f t

More information

Crowdsourced Judgement Elicitation with Endogenous Proficiency

Crowdsourced Judgement Elicitation with Endogenous Proficiency Crowsource Jugement Elicitation with Enogenous Proficiency Anirban Dasgupta Arpita Ghosh Abstract Crowsourcing is now wiely use to replace jugement or evaluation by an expert authority with an aggregate

More information

Euler equations for multiple integrals

Euler equations for multiple integrals Euler equations for multiple integrals January 22, 2013 Contents 1 Reminer of multivariable calculus 2 1.1 Vector ifferentiation......................... 2 1.2 Matrix ifferentiation........................

More information

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions Working Paper 2013:5 Department of Statistics Computing Exact Confience Coefficients of Simultaneous Confience Intervals for Multinomial Proportions an their Functions Shaobo Jin Working Paper 2013:5

More information

The Principle of Least Action

The Principle of Least Action Chapter 7. The Principle of Least Action 7.1 Force Methos vs. Energy Methos We have so far stuie two istinct ways of analyzing physics problems: force methos, basically consisting of the application of

More information

Level Construction of Decision Trees in a Partition-based Framework for Classification

Level Construction of Decision Trees in a Partition-based Framework for Classification Level Construction of Decision Trees in a Partition-base Framework for Classification Y.Y. Yao, Y. Zhao an J.T. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canaa S4S

More information

Lectures - Week 10 Introduction to Ordinary Differential Equations (ODES) First Order Linear ODEs

Lectures - Week 10 Introduction to Ordinary Differential Equations (ODES) First Order Linear ODEs Lectures - Week 10 Introuction to Orinary Differential Equations (ODES) First Orer Linear ODEs When stuying ODEs we are consiering functions of one inepenent variable, e.g., f(x), where x is the inepenent

More information

Some vector algebra and the generalized chain rule Ross Bannister Data Assimilation Research Centre, University of Reading, UK Last updated 10/06/10

Some vector algebra and the generalized chain rule Ross Bannister Data Assimilation Research Centre, University of Reading, UK Last updated 10/06/10 Some vector algebra an the generalize chain rule Ross Bannister Data Assimilation Research Centre University of Reaing UK Last upate 10/06/10 1. Introuction an notation As we shall see in these notes the

More information

Monotonicity for excited random walk in high dimensions

Monotonicity for excited random walk in high dimensions Monotonicity for excite ranom walk in high imensions Remco van er Hofsta Mark Holmes March, 2009 Abstract We prove that the rift θ, β) for excite ranom walk in imension is monotone in the excitement parameter

More information

Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs

Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs Ashish Goel Michael Kapralov Sanjeev Khanna Abstract We consier the well-stuie problem of fining a perfect matching in -regular bipartite

More information

Witt#5: Around the integrality criterion 9.93 [version 1.1 (21 April 2013), not completed, not proofread]

Witt#5: Around the integrality criterion 9.93 [version 1.1 (21 April 2013), not completed, not proofread] Witt vectors. Part 1 Michiel Hazewinkel Sienotes by Darij Grinberg Witt#5: Aroun the integrality criterion 9.93 [version 1.1 21 April 2013, not complete, not proofrea In [1, section 9.93, Hazewinkel states

More information

Calculus in the AP Physics C Course The Derivative

Calculus in the AP Physics C Course The Derivative Limits an Derivatives Calculus in the AP Physics C Course The Derivative In physics, the ieas of the rate change of a quantity (along with the slope of a tangent line) an the area uner a curve are essential.

More information

The Non-abelian Hodge Correspondence for Non-Compact Curves

The Non-abelian Hodge Correspondence for Non-Compact Curves 1 Section 1 Setup The Non-abelian Hoge Corresponence for Non-Compact Curves Chris Elliott May 8, 2011 1 Setup In this talk I will escribe the non-abelian Hoge theory of a non-compact curve. This was worke

More information

VI. Linking and Equating: Getting from A to B Unleashing the full power of Rasch models means identifying, perhaps conceiving an important aspect,

VI. Linking and Equating: Getting from A to B Unleashing the full power of Rasch models means identifying, perhaps conceiving an important aspect, VI. Linking an Equating: Getting from A to B Unleashing the full power of Rasch moels means ientifying, perhaps conceiving an important aspect, efining a useful construct, an calibrating a pool of relevant

More information

A Sketch of Menshikov s Theorem

A Sketch of Menshikov s Theorem A Sketch of Menshikov s Theorem Thomas Bao March 14, 2010 Abstract Let Λ be an infinite, locally finite oriente multi-graph with C Λ finite an strongly connecte, an let p

More information

Thermal runaway during blocking

Thermal runaway during blocking Thermal runaway uring blocking CES_stable CES ICES_stable ICES k 6.5 ma 13 6. 12 5.5 11 5. 1 4.5 9 4. 8 3.5 7 3. 6 2.5 5 2. 4 1.5 3 1. 2.5 1. 6 12 18 24 3 36 s Thermal runaway uring blocking Application

More information

Calculus of Variations

Calculus of Variations Calculus of Variations Lagrangian formalism is the main tool of theoretical classical mechanics. Calculus of Variations is a part of Mathematics which Lagrangian formalism is base on. In this section,

More information

Integration Review. May 11, 2013

Integration Review. May 11, 2013 Integration Review May 11, 2013 Goals: Review the funamental theorem of calculus. Review u-substitution. Review integration by parts. Do lots of integration eamples. 1 Funamental Theorem of Calculus In

More information

3.7 Implicit Differentiation -- A Brief Introduction -- Student Notes

3.7 Implicit Differentiation -- A Brief Introduction -- Student Notes Fin these erivatives of these functions: y.7 Implicit Differentiation -- A Brief Introuction -- Stuent Notes tan y sin tan = sin y e = e = Write the inverses of these functions: y tan y sin How woul we

More information

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012 CS-6 Theory Gems November 8, 0 Lecture Lecturer: Alesaner Mąry Scribes: Alhussein Fawzi, Dorina Thanou Introuction Toay, we will briefly iscuss an important technique in probability theory measure concentration

More information

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors

Math Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors Math 18.02 Notes on ifferentials, the Chain Rule, graients, irectional erivative, an normal vectors Tangent plane an linear approximation We efine the partial erivatives of f( xy, ) as follows: f f( x+

More information

Lower Bounds for the Smoothed Number of Pareto optimal Solutions

Lower Bounds for the Smoothed Number of Pareto optimal Solutions Lower Bouns for the Smoothe Number of Pareto optimal Solutions Tobias Brunsch an Heiko Röglin Department of Computer Science, University of Bonn, Germany brunsch@cs.uni-bonn.e, heiko@roeglin.org Abstract.

More information

PHYS 414 Problem Set 2: Turtles all the way down

PHYS 414 Problem Set 2: Turtles all the way down PHYS 414 Problem Set 2: Turtles all the way own This problem set explores the common structure of ynamical theories in statistical physics as you pass from one length an time scale to another. Brownian

More information

Introduction to variational calculus: Lecture notes 1

Introduction to variational calculus: Lecture notes 1 October 10, 2006 Introuction to variational calculus: Lecture notes 1 Ewin Langmann Mathematical Physics, KTH Physics, AlbaNova, SE-106 91 Stockholm, Sween Abstract I give an informal summary of variational

More information

Math 342 Partial Differential Equations «Viktor Grigoryan

Math 342 Partial Differential Equations «Viktor Grigoryan Math 342 Partial Differential Equations «Viktor Grigoryan 6 Wave equation: solution In this lecture we will solve the wave equation on the entire real line x R. This correspons to a string of infinite

More information

Problem Sheet 2: Eigenvalues and eigenvectors and their use in solving linear ODEs

Problem Sheet 2: Eigenvalues and eigenvectors and their use in solving linear ODEs Problem Sheet 2: Eigenvalues an eigenvectors an their use in solving linear ODEs If you fin any typos/errors in this problem sheet please email jk28@icacuk The material in this problem sheet is not examinable

More information

arxiv:physics/ v2 [physics.ed-ph] 23 Sep 2003

arxiv:physics/ v2 [physics.ed-ph] 23 Sep 2003 Mass reistribution in variable mass systems Célia A. e Sousa an Vítor H. Rorigues Departamento e Física a Universiae e Coimbra, P-3004-516 Coimbra, Portugal arxiv:physics/0211075v2 [physics.e-ph] 23 Sep

More information

18 EVEN MORE CALCULUS

18 EVEN MORE CALCULUS 8 EVEN MORE CALCULUS Chapter 8 Even More Calculus Objectives After stuing this chapter you shoul be able to ifferentiate an integrate basic trigonometric functions; unerstan how to calculate rates of change;

More information

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE

THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE Journal of Soun an Vibration (1996) 191(3), 397 414 THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE E. M. WEINSTEIN Galaxy Scientific Corporation, 2500 English Creek

More information

A check digit system over a group of arbitrary order

A check digit system over a group of arbitrary order 2013 8th International Conference on Communications an Networking in China (CHINACOM) A check igit system over a group of arbitrary orer Yanling Chen Chair of Communication Systems Ruhr University Bochum

More information

How the potentials in different gauges yield the same retarded electric and magnetic fields

How the potentials in different gauges yield the same retarded electric and magnetic fields How the potentials in ifferent gauges yiel the same retare electric an magnetic fiels José A. Heras a Departamento e Física, E. S. F. M., Instituto Politécnico Nacional, México D. F. México an Department

More information

Calculating the effective delayed neutron fraction using Monte Carlo techniques

Calculating the effective delayed neutron fraction using Monte Carlo techniques Calculating the effective elaye neutron fraction using Monte Carlo techniques 1. Introuction Steven C. van er Marck * an Robin Klein Meulekamp NRG, Petten, the Netherlans We present a true Monte Carlo

More information

Computing Derivatives

Computing Derivatives Chapter 2 Computing Derivatives 2.1 Elementary erivative rules Motivating Questions In this section, we strive to unerstan the ieas generate by the following important questions: What are alternate notations

More information

A Note on Exact Solutions to Linear Differential Equations by the Matrix Exponential

A Note on Exact Solutions to Linear Differential Equations by the Matrix Exponential Avances in Applie Mathematics an Mechanics Av. Appl. Math. Mech. Vol. 1 No. 4 pp. 573-580 DOI: 10.4208/aamm.09-m0946 August 2009 A Note on Exact Solutions to Linear Differential Equations by the Matrix

More information

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION The Annals of Statistics 1997, Vol. 25, No. 6, 2313 2327 LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION By Eva Riccomagno, 1 Rainer Schwabe 2 an Henry P. Wynn 1 University of Warwick, Technische

More information

Least-Squares Regression on Sparse Spaces

Least-Squares Regression on Sparse Spaces Least-Squares Regression on Sparse Spaces Yuri Grinberg, Mahi Milani Far, Joelle Pineau School of Computer Science McGill University Montreal, Canaa {ygrinb,mmilan1,jpineau}@cs.mcgill.ca 1 Introuction

More information

Lecture 10 Notes, Electromagnetic Theory II Dr. Christopher S. Baird, faculty.uml.edu/cbaird University of Massachusetts Lowell

Lecture 10 Notes, Electromagnetic Theory II Dr. Christopher S. Baird, faculty.uml.edu/cbaird University of Massachusetts Lowell Lecture 10 Notes, Electromagnetic Theory II Dr. Christopher S. Bair, faculty.uml.eu/cbair University of Massachusetts Lowell 1. Pre-Einstein Relativity - Einstein i not invent the concept of relativity,

More information

Unit #6 - Families of Functions, Taylor Polynomials, l Hopital s Rule

Unit #6 - Families of Functions, Taylor Polynomials, l Hopital s Rule Unit # - Families of Functions, Taylor Polynomials, l Hopital s Rule Some problems an solutions selecte or aapte from Hughes-Hallett Calculus. Critical Points. Consier the function f) = 54 +. b) a) Fin

More information

Some properties of random staircase tableaux

Some properties of random staircase tableaux Some properties of ranom staircase tableaux Sanrine Dasse Hartaut Pawe l Hitczenko Downloae /4/7 to 744940 Reistribution subject to SIAM license or copyright; see http://wwwsiamorg/journals/ojsaphp Abstract

More information

Similarity Measures for Categorical Data A Comparative Study. Technical Report

Similarity Measures for Categorical Data A Comparative Study. Technical Report Similarity Measures for Categorical Data A Comparative Stuy Technical Report Department of Computer Science an Engineering University of Minnesota 4-92 EECS Builing 200 Union Street SE Minneapolis, MN

More information

State observers and recursive filters in classical feedback control theory

State observers and recursive filters in classical feedback control theory State observers an recursive filters in classical feeback control theory State-feeback control example: secon-orer system Consier the riven secon-orer system q q q u x q x q x x x x Here u coul represent

More information

Calculus and optimization

Calculus and optimization Calculus an optimization These notes essentially correspon to mathematical appenix 2 in the text. 1 Functions of a single variable Now that we have e ne functions we turn our attention to calculus. A function

More information

Transreal Limits and Elementary Functions

Transreal Limits and Elementary Functions Transreal Limits an Elementary Functions Tiago S. os Reis, James A. D. W. Anerson Abstract We exten all elementary functions from the real to the transreal omain so that they are efine on ivision by zero.

More information

Summary: Differentiation

Summary: Differentiation Techniques of Differentiation. Inverse Trigonometric functions The basic formulas (available in MF5 are: Summary: Differentiation ( sin ( cos The basic formula can be generalize as follows: Note: ( sin

More information

Lecture 6: Calculus. In Song Kim. September 7, 2011

Lecture 6: Calculus. In Song Kim. September 7, 2011 Lecture 6: Calculus In Song Kim September 7, 20 Introuction to Differential Calculus In our previous lecture we came up with several ways to analyze functions. We saw previously that the slope of a linear

More information

3.2 Differentiability

3.2 Differentiability Section 3 Differentiability 09 3 Differentiability What you will learn about How f (a) Might Fail to Eist Differentiability Implies Local Linearity Numerical Derivatives on a Calculator Differentiability

More information

On the Aloha throughput-fairness tradeoff

On the Aloha throughput-fairness tradeoff On the Aloha throughput-fairness traeoff 1 Nan Xie, Member, IEEE, an Steven Weber, Senior Member, IEEE Abstract arxiv:1605.01557v1 [cs.it] 5 May 2016 A well-known inner boun of the stability region of

More information

Acute sets in Euclidean spaces

Acute sets in Euclidean spaces Acute sets in Eucliean spaces Viktor Harangi April, 011 Abstract A finite set H in R is calle an acute set if any angle etermine by three points of H is acute. We examine the maximal carinality α() of

More information

TMA 4195 Matematisk modellering Exam Tuesday December 16, :00 13:00 Problems and solution with additional comments

TMA 4195 Matematisk modellering Exam Tuesday December 16, :00 13:00 Problems and solution with additional comments Problem F U L W D g m 3 2 s 2 0 0 0 0 2 kg 0 0 0 0 0 0 Table : Dimension matrix TMA 495 Matematisk moellering Exam Tuesay December 6, 2008 09:00 3:00 Problems an solution with aitional comments The necessary

More information

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y

ensembles When working with density operators, we can use this connection to define a generalized Bloch vector: v x Tr x, v y Tr y Ph195a lecture notes, 1/3/01 Density operators for spin- 1 ensembles So far in our iscussion of spin- 1 systems, we have restricte our attention to the case of pure states an Hamiltonian evolution. Toay

More information

FIRST YEAR PHD REPORT

FIRST YEAR PHD REPORT FIRST YEAR PHD REPORT VANDITA PATEL Abstract. We look at some potential links between totally real number fiels an some theta expansions (these being moular forms). The literature relate to moular forms

More information

A. Incorrect! The letter t does not appear in the expression of the given integral

A. Incorrect! The letter t does not appear in the expression of the given integral AP Physics C - Problem Drill 1: The Funamental Theorem of Calculus Question No. 1 of 1 Instruction: (1) Rea the problem statement an answer choices carefully () Work the problems on paper as neee (3) Question

More information

All s Well That Ends Well: Supplementary Proofs

All s Well That Ends Well: Supplementary Proofs All s Well That Ens Well: Guarantee Resolution of Simultaneous Rigi Boy Impact 1:1 All s Well That Ens Well: Supplementary Proofs This ocument complements the paper All s Well That Ens Well: Guarantee

More information

Systems & Control Letters

Systems & Control Letters Systems & ontrol Letters ( ) ontents lists available at ScienceDirect Systems & ontrol Letters journal homepage: www.elsevier.com/locate/sysconle A converse to the eterministic separation principle Jochen

More information

A Course in Machine Learning

A Course in Machine Learning A Course in Machine Learning Hal Daumé III 12 EFFICIENT LEARNING So far, our focus has been on moels of learning an basic algorithms for those moels. We have not place much emphasis on how to learn quickly.

More information

Topic 7: Convergence of Random Variables

Topic 7: Convergence of Random Variables Topic 7: Convergence of Ranom Variables Course 003, 2016 Page 0 The Inference Problem So far, our starting point has been a given probability space (S, F, P). We now look at how to generate information

More information

Situation awareness of power system based on static voltage security region

Situation awareness of power system based on static voltage security region The 6th International Conference on Renewable Power Generation (RPG) 19 20 October 2017 Situation awareness of power system base on static voltage security region Fei Xiao, Zi-Qing Jiang, Qian Ai, Ran

More information

FLUCTUATIONS IN THE NUMBER OF POINTS ON SMOOTH PLANE CURVES OVER FINITE FIELDS. 1. Introduction

FLUCTUATIONS IN THE NUMBER OF POINTS ON SMOOTH PLANE CURVES OVER FINITE FIELDS. 1. Introduction FLUCTUATIONS IN THE NUMBER OF POINTS ON SMOOTH PLANE CURVES OVER FINITE FIELDS ALINA BUCUR, CHANTAL DAVID, BROOKE FEIGON, MATILDE LALÍN 1 Introuction In this note, we stuy the fluctuations in the number

More information

Quantum Mechanics in Three Dimensions

Quantum Mechanics in Three Dimensions Physics 342 Lecture 20 Quantum Mechanics in Three Dimensions Lecture 20 Physics 342 Quantum Mechanics I Monay, March 24th, 2008 We begin our spherical solutions with the simplest possible case zero potential.

More information