Textual Document Indexing and Retrieval via Knowledge Sources and Data Mining


Wesley W. Chu, Zhenyu Liu and Wenlei Mao
Computer Science Department, University of California, Los Angeles
{ww, viliu,

Abstract

We present a knowledge-based query expansion technique to improve document retrieval effectiveness. The general concept terms in a query are substituted by a set of specific concept terms used in the corpus that co-occur with the key query concept. Since the expanded query matches the document index terms much better, experimental results reveal that such query expansion produces better retrieval effectiveness than the unexpanded queries. We have also developed a new phrase-based indexing technique that combines concepts with word stems. Using word stems can compensate for the incompleteness of the knowledge sources. Experimental results reveal that phrase-based indexing produces more accurate document retrieval than using word stems or concepts alone. We also present an implementation that integrates the query expansion and phrase-based indexing in a medical digital library for the retrieval of patient records, laboratory reports and medical literature.

1. Introduction

Efficient document retrieval based on a user query is achieved by indexing. The current technique uses word stems to index a document [1]. Such a technique suffers from the inability to match words in a query with their related words, such as synonyms, hypernyms and hyponyms [2], in the documents. Therefore, there have been recent attempts to index documents based on conceptual terms. However, the knowledge sources are usually incomplete. As a result, past research reveals that although using conceptual terms for document indexing can solve some of the problems, it cannot outperform the word-stem-based model [3,4,5,6].
To remedy the incompleteness of the knowledge sources, we propose a phrase-based indexing model in which we parse a document into phrases based on the conceptual terms in domain-specific knowledge sources, and calculate the similarity between two documents using both the similarity between the concepts and the common word stems in them. Including word stem similarity in document similarity evaluation compensates for the incompleteness of the knowledge sources.

When seeking specific information regarding a particular topic, the user often has to pose a general query with concept terms. For example, not knowing that contact lenses are a treatment option for keratoconus, the user has to request "treatment options for keratoconus" in the query. This results in low retrieval precision since documents are indexed by the specific terms. To remedy this shortcoming, we propose to substitute the general concept terms in the query with the specific terms. The level of relevancy of a specific term in the resulting query is determined by its co-occurrence with the general concept term, which can be mined from the corpus. Based on the query, the knowledge sources can identify the irrelevant conceptual terms and prevent them from being inserted into the query. Since the expanded queries match better with relevant documents, the document retrieval performance is improved.

We shall first present the phrase-based indexing technique and the experimental results that show its performance improvements over word-stem-based and concept-based indexing methods. Next, we shall present the knowledge-based query expansion technique and the performance improvement derived from it. Finally, we present an implementation that integrates the two proposed techniques in a medical digital library for retrieving medical textual records and reports.

2. Phrase-based indexing

To facilitate discussion, we shall use the following sample query in this section: "22 year old with hyperthermia, leukocytosis, increased intracranial pressure, and central herniation. Cerebral edema secondary to infection, diagnosis and treatment." The first part of the query is a brief description of the patient, and the second part is the information need.

2.1 Word stem Indexing

A document is commonly represented as a vector of terms in a vector space model (VSM) [1]. The basis of the vector space corresponds to distinct terms in a document collection. Components of the document vector are the weights of the corresponding terms, which represent their relative importance in the document. In the most naive approach, we could treat a word as a term. Yet morphological variants like "edema" and "edemas" are so closely related that they are usually conflated into a single word stem, e.g., "edem", by stemming [1,7]. Our sample query thus consists of the word stems "hypertherm", "leukocytos", "increas", "intracran", "pressur", etc.

Word stems are usually treated as notational, rather than conceptual, entities. Two word stems are considered unrelated if they are different. For example, the stem of "hyperthermia" and that of "fever" are usually considered unrelated despite their apparent relationship.

In stem-based VSM, word stems constitute the basis of the vector space. The base vectors are orthogonal to each other because different word stems are considered unrelated. The weight $w^s_{\alpha,u}$ of a word stem u in a document α is determined by the number of times u appears in α (known as the term frequency) and the number of documents that contain u (known as the document frequency), following the TF-IDF (term frequency, inverse document frequency) scheme [1]. In essence, the more often u appears in α, the more important u is in α. On the other hand, the more documents u belongs to, the less disambiguating power it has, and thus the less important it is.

Word stems are widely used as index terms. To improve retrieval accuracy, it is natural to replace word stems with concepts [3,4,5,6,8] or multiple-word combinations [9,10]. However, previous research showed not only no improvement but a degradation in retrieval accuracy when concepts were used in document retrieval [3,4,5,6], except when documents were very short [8]. When properly used, multiple-word combinations were shown to improve retrieval effectiveness for some special queries [9,10].
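The stem weighting described in this subsection can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the paper does not give its exact TF-IDF formula, so the common variant tf × log(N / df) is used here, and the function name `tfidf_weights` and the toy pre-stemmed corpus are hypothetical.

```python
import math
from collections import Counter


def tfidf_weights(docs):
    """Return one {stem: weight} dict per document.

    docs is a list of documents, each already reduced to a list of word stems.
    The weight is tf * log(N / df), one common TF-IDF variant (an assumption,
    not necessarily the exact scheme used in the paper).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each stem.
    doc_freq = Counter(stem for doc in docs for stem in set(doc))
    weights = []
    for doc in docs:
        term_freq = Counter(doc)
        weights.append({
            stem: tf * math.log(n_docs / doc_freq[stem])
            for stem, tf in term_freq.items()
        })
    return weights


# Hypothetical, pre-stemmed toy corpus.
corpus = [
    ["cerebr", "edem", "edem", "infect"],
    ["hypertherm", "infect"],
    ["intracran", "pressur", "infect"],
]
weights = tfidf_weights(corpus)
```

In this toy corpus the stem "infect" occurs in every document, so it receives zero weight, while "edem", which occurs twice in a single document, receives the largest weight in the first document.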
For ad hoc queries, however, the improvement in retrieval effectiveness from multiple-word combinations is still questionable.

2.2 Concept-based VSM

Using word stems to represent documents results in the inappropriate fragmentation of concepts such as "increased intracranial pressure" into its component stems "increas", "intracran", and "pressur". Clearly, using concepts instead of single words or word stems as the vector space basis should produce a VSM that better mimics human thought processes, and therefore should result in more accurate retrieval.

However, using concepts is more complex than using word stems. First, concepts are usually represented by multi-word phrases such as "increased intracranial pressure". Second, and more importantly, there exist synonymous and polysemous phrases. Two phrases sharing a concept are synonymous, and phrases that could represent more than one concept are polysemous [2]. For example, "hyperthermia" and "fever" are synonymous because they share the same concept, "an abnormal elevation of the body temperature". At the same time, "hyperthermia" is polysemous because, in addition to the above meaning, it also means "a treatment in which body tissue is exposed to high temperature to damage and kill cancer cells". Synonyms can be identified with the help of a dictionary or a thesaurus. Determining which concept a particular polysemous phrase represents is known as word sense disambiguation (WSD) [11]. Third, some concepts are related to one another. Hypernym and hyponym relations are important conceptual relations. If we say "an x is a (kind of) y", then concept x is a hyponym of concept y, and y is a hypernym of x [2]. "Hyperthermia" is a hyponym of "high body temperature", and "high body temperature" is a hypernym of "hyperthermia".

Concept identifiers are usually used to identify concepts. Using UMLS [13] as a knowledge source, our sample query becomes (15967, ), (23518), and (151740), etc., representing "hyperthermia", "leukocytosis", and "increased intracranial pressure", etc., respectively. In concept-based VSM, the basis of the vector space consists of distinct concepts.
To model the relationship between such concepts as "hyperthermia" and "elevated body temperature", we remove the orthogonality constraint on base vectors. Base vectors for two related concepts form an acute angle. Only when we cannot find any reasonable relation between two concepts do we treat their corresponding vectors as orthogonal. The cosine of the angle between two concept vectors is defined as the conceptual similarity between the corresponding concepts. The conceptual similarity thus ranges from 0 to 1, with 0 indicating unrelated and 1 indicating highly related concepts.

To study the effects of conceptual similarities, we shall compare two cases. In one case, we assume all different concepts are unrelated. Therefore, all base vectors of the vector space are orthogonal to one another. We label this case "O" for orthogonal. In the other case, we derive conceptual similarities from knowledge sources. The resulting base vectors are no longer mutually orthogonal. We label this case "NO" for non-orthogonal.

We derive the weight $w_{\alpha,x_i}$ of the i-th concept $x_i$ in a document α using a slightly modified version of the TF-IDF scheme. Higher weights are assigned to longer phrases that correspond to more specific concepts. For example, if the term frequencies and document frequencies for "increased intracranial pressure" and "hyperthermia" were identical, the former concept would obtain a higher weight than the latter.

2.3 Phrase-based VSM

Conceptual similarities needed in concept-based VSM are derived from knowledge sources. The quality of such a VSM therefore depends heavily on the quality of the knowledge sources. Missing conceptual relations in the knowledge sources could potentially degrade retrieval accuracy. For example, treating "cerebral edema" and "cerebral lesion" as unrelated is potentially harmful. Noticing the common component word "cerebral" in the above phrases, we propose phrase-based VSM to remedy the incompleteness of the knowledge sources.

In phrase-based VSM, a document is represented as a set of phrases. Each phrase may correspond to multiple concepts (due to polysemy) and consist of several word stems. Our sample query now becomes [(15967, ), ("hypertherm")], [(23518), ("leukocytos")] and [(151740), ("increas", "intracran", "pressur")], etc. Following the TF-IDF schemes in stem-based and concept-based VSMs, we can derive the stem weight $w^s_{\alpha,u_{i,k}}$ of the k-th stem $u_{i,k}$ and the concept weight $w_{\alpha,x_{i,m}}$ of the m-th concept $x_{i,m}$ in phrase i of α. Similar to concept-based VSM, we study two cases, O and NO. In case O, different concepts are unrelated, while in case NO, concepts may be related. In both cases, distinct word stems are assumed to be unrelated.

2.4 Document Similarity

The similarity of two documents α and β is the cosine of the angle between their corresponding document vectors $\vec{\alpha}$ and $\vec{\beta}$; that is,

$$ sim(\alpha, \beta) = \cos(\vec{\alpha}, \vec{\beta}) = \frac{\vec{\alpha} \cdot \vec{\beta}}{\sqrt{\vec{\alpha} \cdot \vec{\alpha}}\,\sqrt{\vec{\beta} \cdot \vec{\beta}}} \tag{1} $$

We shall extend the vector dot product $\vec{\alpha} \cdot \vec{\beta}$ and denote the extended dot product (EDP) as $\vec{\alpha} \circ \vec{\beta}$ to represent the cases when the components of the vectors $\vec{\alpha}$ and $\vec{\beta}$ are related. Using the EDP in place of the dot product, we derive document similarity as

$$ sim(\alpha, \beta) = \frac{\vec{\alpha} \circ \vec{\beta}}{\sqrt{\vec{\alpha} \circ \vec{\alpha}}\,\sqrt{\vec{\beta} \circ \vec{\beta}}} \tag{2} $$

EDP Derivation

To derive the EDP in the phrase-based VSM, we first consider concepts without polysemy.
Thus,

$$ \vec{\alpha} \circ \vec{\beta} = \sum_{i,j} S_{i,j} \tag{3} $$

where $S_{i,j}$ is the conceptual contribution of phrase i in α and phrase j in β to the EDP. Assuming that each phrase represents a single concept, we have

$$ S_{i,j} = w_{\alpha,x_i}\, w_{\beta,y_j}\, s(x_i, y_j) \tag{4} $$

where $s(x_i, y_j)$ is the conceptual similarity between the i-th concept $x_i$ in α and the j-th concept $y_j$ in β. In the orthogonal case, s(x, y) reduces to the Kronecker delta function,

$$ \delta(x, y) = \begin{cases} 1 & x = y \\ 0 & x \neq y \end{cases} $$

In the non-orthogonal case, conceptual similarities are derived from knowledge sources.

Conceptual Similarity, s(x, y)

Given a hypernym hierarchy, the conceptual similarity s(x, y) between two concepts x and y depends on both the distance between them in the hierarchy and their generality. When two concepts are farther apart in the hypernym hierarchy, they are less similar: a concept is less similar to its grandparent than to its parent. Thus we define the conceptual similarity to be inversely proportional to the number of hops between x and y, d(x, y).

The generality of a concept x can be derived from the number of all its descendants, D(x). The more descendants x has, the more general it is. A general concept like "disease" has many more descendants than a more specific concept like "hyperthermia". Because the number of descendants grows exponentially as a concept moves up a tree structure, we take the logarithm of the number of descendants in the conceptual similarity calculation. The conceptual similarity is therefore defined to be inversely proportional to the logarithm of the number of descendants of the two concepts.

A final consideration is the boundary case when we reach the leaves of the hypernym tree. Let us assume we have two concepts $x_o$ and $y_o$, where $x_o$ is the only direct hypernym of $y_o$, $y_o$ is the only hyponym of $x_o$, and $y_o$ has no hyponym of its own. Concepts $x_o$ and $y_o$ are so much alike that we define the conceptual similarity between them to be close to 1, say 0.9, to represent such closeness.
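The ingredients just described (the hop count d, the descendant count D, and the 0.9 boundary value) can be combined in a small sketch. The toy hierarchy, the function names, and the treatment of pairs that are not in a hypernym relation (similarity 0) are assumptions made for illustration. The sketch scores only ancestor/descendant pairs, which keeps the logarithm positive.

```python
import math

# Hypothetical toy hypernym tree: child -> parent.
PARENT = {
    "body temperature change": "sign and symptom",
    "hyperthermia": "body temperature change",
    "hypothermia": "body temperature change",
    "pain": "sign and symptom",
    "back pain": "pain",
}


def ancestors(concept):
    """Return the hypernyms of concept, nearest first."""
    chain = []
    while concept in PARENT:
        concept = PARENT[concept]
        chain.append(concept)
    return chain


def descendant_count(concept):
    """D(x): number of concepts that have x among their hypernyms."""
    return sum(1 for c in PARENT if concept in ancestors(c))


def hops(x, y):
    """d(x, y) for a hypernym/hyponym pair, or None if neither is an ancestor."""
    if y in ancestors(x):
        return ancestors(x).index(y) + 1
    if x in ancestors(y):
        return ancestors(y).index(x) + 1
    return None


def conceptual_similarity(x, y, c=0.9):
    """Similarity inversely proportional to d(x, y) and log2(1 + D(x) + D(y))."""
    if x == y:
        return 1.0
    d = hops(x, y)
    if d is None:  # assumption: concepts with no hypernym path stay orthogonal
        return 0.0
    return c / (d * math.log2(1 + descendant_count(x) + descendant_count(y)))
```

In this toy tree, "pain" and "back pain" form the boundary case (a single parent with a single leaf child), so their similarity is 0.9, and "hyperthermia" is more similar to its parent than to its grandparent.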
As a result, the conceptual similarity between concepts x and y is

$$ s(x, y) = \frac{0.9}{d(x, y)\,\log_2\bigl(1 + D(x) + D(y)\bigr)} \tag{5} $$

In order to use (4) in the presence of polysemy, we need to disambiguate senses. To avoid the cost of WSD, we use the most popular concept that a phrase represents as the meaning of the phrase. Alternatively, we derive the conceptual contribution to the similarity between two phrases using an aggregation of (4) over all possible concept pairs, where each pair consists of one concept from each phrase.

The contribution of word stems to the EDP is the sum of the weight products for those word stems common to both phrases,

$$ S^s_{i,j} = \sum_{k,l} w^s_{\alpha,u_{i,k}}\, w^s_{\beta,v_{j,l}}\, \delta(u_{i,k}, v_{j,l}) \tag{6} $$

where $u_{i,k}$ and $v_{j,l}$ are the k-th word stem in phrase i in α and the l-th word stem in phrase j in β, respectively. Given the contributions of concepts and stems, (4) and (6), we select the larger of the two as the contribution of phrase i in α and phrase j in β to the EDP and get

$$ \vec{\alpha} \circ \vec{\beta} = \sum_{i,j} \max\bigl(S_{i,j},\, S^s_{i,j}\bigr) \tag{7} $$

Such selection remedies the incompleteness of the knowledge sources. $\vec{\alpha} \circ \vec{\alpha}$ and $\vec{\beta} \circ \vec{\beta}$ can be derived similarly to (7). The document similarity can then be computed from (2) using these EDPs.

Experimental Results

The Test Collection, OHSUMED

OHSUMED [12] is a large test collection used in many information retrieval system evaluations. The test set consists of a reference collection, a query collection, and a set of relevance judgments. The reference collection is a subset of the MEDLINE database. Each reference contains a title, an optional abstract, a set of MeSH headings, author information, publication type, source, a MEDLINE identifier, and a sequence identifier. The query collection consists of 106 queries. Each query contains a patient description, an information request, and a sequence identifier. The sample query we use in this paper is query 57 in the collection. 14,430 references out of the 348K are judged by human experts to be not relevant, possibly relevant, or definitely relevant to each query. We use the title, the abstract, and the MeSH headings to represent each document, and the patient description and the information request to represent each query.

The Knowledge Source

UMLS [13] is a medical lexical knowledge source and a set of associated lexical programs. The knowledge source consists of the UMLS Metathesaurus, the SPECIALIST lexicon, and the UMLS semantic network.
Of special interest to us is its central vocabulary component, the Metathesaurus. It contains biomedical phrases from more than 60 vocabularies and classifications. The Metathesaurus contains 1.6M phrases representing over 800K concepts. A concept unique identifier (CUI) identifies each concept. UMLS tends to assign a smaller CUI to a more popular sense of a phrase. Therefore, we use the concept with the smallest CUI in the conceptual contribution calculation (4). Our experimental results show that this heuristic produces retrieval accuracy comparable to that produced by the aggregation approach, where we consider all conceptual similarities due to different sense combinations from the phrases.

The Metathesaurus encodes many conceptual relations. We concentrate on hypernym relations. Two relations in UMLS roughly correspond to the hypernym relations: the RB (broader than) and the PAR (parent) relations. For example, "hyperthermia" has a parent concept "body temperature change". We combine the 838K RB and 607K PAR relations into a single hypernym hierarchy. Hypernymy is transitive [14]. For example, "sign and symptom" is a hypernym of "body temperature change" and "body temperature change" is a hypernym of "hyperthermia", so "sign and symptom" is also a hypernym of "hyperthermia". However, the UMLS Metathesaurus encodes only the direct hypernym relations but not the transitive closure. We derive the transitive closure of the hypernym relation and use (5) to calculate the conceptual similarities.

Phrase Detection

Given a set of documents (106 queries and 14K judged documents of OHSUMED), we need to detect any occurrences of a set of phrases (1.3M phrases in UMLS). We adopt the Aho-Corasick algorithm [15] for the set-matching problem to detect phrases.

First, the Aho-Corasick algorithm detects all occurrences of any phrase in a document, but we keep only the longest, most specific phrase. For example, although both "edema" and "cerebral edema" are detected in the sample query, we keep only the latter and ignore the former.
Second, to detect multi-word phrases, we match stems instead of words in a document with UMLS phrases. We use the Lovins stemmer [7] to derive word stems. To avoid conflating different abbreviations into a single stem, we define the stem of a word shorter than four characters to be the original word.

Third, stop-word removal is performed after the multi-word phrase detection. In this way, we correctly detect "secondary to" and "carcinoma" from "cerebral edema secondary to carcinoma". We would incorrectly detect "secondary carcinoma" if the stop words ("to" in this case) were removed before the phrase detection.

Discussion of Results

To calculate retrieval accuracy using precision-recall [1], we combined the possibly relevant and definitely relevant judgments in OHSUMED into a single relevant category. Based on the type of VSM, we calculate the document similarity between each of the 14K documents and each of the 105 queries (one query does not have any relevant document). For a given VSM and a query, we rank the documents from the most to the least similar to the query. When a certain number of documents are retrieved, precision is the percentage of retrieved documents that are relevant, and recall is the percentage of the relevant documents that have been retrieved so far. We evaluate the retrieval accuracy by interpolating the precision values at eleven recall points. The overall effectiveness of the different VSMs is then compared by averaging over the performance of all the 105 queries (Figure 1). The average of the eleven precision values gives an overview of the effectiveness of each VSM.

[Figure 1. Comparison of the average precision-recall over 105 queries. Curves: (Stems, O); (Concepts, O); (Concepts, NO); (Phrases, O); (Phrases, NO).]

1. The baseline, labeled (Stems, O), uses stem-based VSM. Its 11-point average precision is approximately 0.36.
2. Considering the contribution of concepts only, and treating different concepts as unrelated (Concepts, O), we arrive at an 11-point average precision of 0.260, which is a 28% decrease from the baseline.
3. Similar to 2, but taking the concept inter-relationships into consideration (Concepts, NO), we achieve a significant improvement over 2. The average accuracy is similar to that of the baseline.
4. Considering the contributions of both concepts and word stems in a phrase, but treating different concepts as unrelated (Phrases, O), we arrive at an 11-point average precision of 0.375, a 3% improvement over the baseline.
5. Similar to 4, but taking concept inter-relations into consideration (Phrases, NO), we achieve an 11-point average precision of 0.420, which is a significant 16% improvement over the baseline.

Our experimental results reveal that viewing documents as concepts only and treating different concepts as unrelated can cause the retrieval accuracy to deteriorate (case 2). Considering concept inter-relations (case 3) or relating different phrases by their shared word stems (case 4) can both improve retrieval accuracy. The extended dot product combines contributions from the concepts and word stems.
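How the extended dot product combines the two kinds of contribution can be sketched as follows, following (2), (4), (6) and (7) for phrases that each carry a single concept. The data layout (a phrase as a dict with a concept, a concept weight `w`, and stem weights), the function names, and all weights are hypothetical choices made for this illustration.

```python
import math


def kronecker(x, y):
    """Orthogonal case: a concept is related only to itself."""
    return 1.0 if x == y else 0.0


def edp(doc_a, doc_b, concept_sim=kronecker):
    """Extended dot product: sum over phrase pairs of the larger contribution."""
    total = 0.0
    for pa in doc_a:
        for pb in doc_b:
            # Conceptual contribution, as in (4).
            s_concept = pa["w"] * pb["w"] * concept_sim(pa["concept"], pb["concept"])
            # Stem contribution, as in (6): weight products of the shared stems.
            s_stem = sum(w * pb["stems"][u]
                         for u, w in pa["stems"].items() if u in pb["stems"])
            total += max(s_concept, s_stem)  # selection step of (7)
    return total


def similarity(doc_a, doc_b, concept_sim=kronecker):
    """Document similarity (2), with the EDP in place of the dot product."""
    return edp(doc_a, doc_b, concept_sim) / math.sqrt(
        edp(doc_a, doc_a, concept_sim) * edp(doc_b, doc_b, concept_sim))


# Hypothetical single-phrase documents with made-up weights.
query = [{"concept": "cerebral edema", "w": 1.0,
          "stems": {"cerebr": 0.6, "edem": 0.8}}]
doc = [{"concept": "cerebral lesion", "w": 1.0,
        "stems": {"cerebr": 0.6, "les": 0.8}}]

sim_orthogonal = similarity(query, doc)
# A knowledge source that relates the two concepts (similarity 0.5 is made up).
sim_related = similarity(query, doc, lambda x, y: 1.0 if x == y else 0.5)
```

With orthogonal concepts the two phrases would be unrelated on concepts alone, yet the shared stem "cerebr" still contributes to the similarity. When the knowledge source does relate the concepts strongly enough, the conceptual contribution takes over.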
The phrase-based VSM utilizes this extended dot product and yields a significant improvement in retrieval accuracy.

3. Enhance retrieval performance via query expansion

When posing a query, a user usually has a main objective (the key concept, $c_{key}$) in mind and uses additional general supporting conceptual terms, $c_s$, to specify certain aspects of $c_{key}$. For example, when a user asks "Keratoconus, treatment options.", "Keratoconus" (an eye disease) is the key concept whereas "treatment options" is a general supporting concept. Although such a query is easy to form, it does not match well with relevant documents that use specific supporting concepts such as "contact lens" or "keratoplasty". To remedy this problem, we propose to substitute the general supporting concepts with specific concepts that are used in the relevant documents. We need to select the set of specific concepts and determine the weight of each of these concepts. We shall first present the weight determination method, and then compare two concept selection approaches.

For a specific concept $c$, its weight should represent the degree of correlation between $c$ and the key concept term, $c_{key}$. For example, "contact lens" is a treatment option for "Keratoconus" but not for "Back pain", and therefore it should be assigned a larger weight in the expansion of "Keratoconus, treatment options" than in that of "Back pain, treatment options". We shall now present a scalable method for such weight assignment. We first represent concepts as inverted document vectors, and then use the similarity between two inverted document vectors to represent the correlation between the two concepts.

Given a corpus of n documents, the inverted document vector for concept $c_i$, $\theta_i$, is defined as an n-dimensional vector. Each component of the vector is the term frequency of concept $c_i$ in the corresponding document. For example, if a corpus contains documents $D_1$, $D_2$ and $D_3$, and concept $c_i$ occurs three, zero, and two times in these documents respectively, then $\theta_i = \langle 3, 0, 2 \rangle$.
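Inverted document vectors, and the cosine measure used on them to express correlation between concepts, can be sketched as below. The corpus (documents given as lists of already detected concepts) and the function names are hypothetical. Returning 0 for a concept that never occurs is an assumption of the sketch.

```python
import math


def inverted_vector(concept, corpus):
    """Term frequency of concept in each document of the corpus."""
    return [doc.count(concept) for doc in corpus]


def correl(theta_i, theta_j):
    """Cosine of the angle between two inverted document vectors."""
    dot = sum(a * b for a, b in zip(theta_i, theta_j))
    norm_i = math.sqrt(sum(a * a for a in theta_i))
    norm_j = math.sqrt(sum(b * b for b in theta_j))
    if norm_i == 0 or norm_j == 0:
        return 0.0  # assumption: an absent concept correlates with nothing
    return dot / (norm_i * norm_j)


# Hypothetical corpus: each document is a list of detected concepts.
corpus = [
    ["keratoconus", "contact lens", "keratoconus", "keratoconus"],
    ["back pain"],
    ["keratoconus", "contact lens", "keratoconus"],
]
theta_kc = inverted_vector("keratoconus", corpus)
theta_cl = inverted_vector("contact lens", corpus)
theta_bp = inverted_vector("back pain", corpus)
```

Here "keratoconus" and "contact lens" occur in the same documents and are therefore highly correlated, while "keratoconus" and "back pain" never co-occur and have zero correlation. Because term frequencies are non-negative, the measure always lies between 0 and 1.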
We further define the correlation between concepts $c_i$ and $c_j$ as

$$ correl(c_i, c_j) = \cos(\theta_i, \theta_j) = \frac{\theta_i \cdot \theta_j}{\sqrt{\theta_i \cdot \theta_i}\,\sqrt{\theta_j \cdot \theta_j}} $$

The correlation between two concepts ranges from 0 to 1. For example, if $\theta_i = \langle 3, 0, 2 \rangle$, $\theta_j = \langle 6, 0, 4 \rangle$, and $\theta_k = \langle 0, 1, 0 \rangle$, then $correl(c_i, c_j) = 1$ and $correl(c_i, c_k) = 0$. The correlation between all pairs of concepts can be computed offline and stored in a concept correlation table (see Figure 3). For query expansion, the weight assigned to $c_i$ is the correlation between the supporting concept $c_i$ and $c_{key}$.

There are two concept selection approaches: with or without knowledge sources. When no knowledge source is available, all the supporting concept terms that have zero correlation with the key concept, $c_{key}$, can be viewed as irrelevant concepts and filtered out. Let $c_1, c_2, \ldots, c_k$ be all the concepts that have nonzero correlation with $c_{key}$. The expanded query becomes

$$ \langle (c_{key}, 1),\ (c_1, correl(c_1, c_{key})),\ \ldots,\ (c_k, correl(c_k, c_{key})) \rangle $$

In this expanded query vector, the higher the correlation value $correl(c_i, c_{key})$ is, the more emphasis is placed on the vector component of $c_i$. Clearly, the weight assigned to $c_{key}$ is 1 since $correl(c_{key}, c_{key}) = 1$.

A general supporting concept in a query is usually relevant to only one or two aspects of $c_{key}$. This motivates the use of a knowledge source, together with the key concept and the general supporting concept in the query, to select the relevant set of specific supporting concepts. For example, "Keratoconus, treatment options" emphasizes the treatment options for the disease, rather than diagnosis methods or causes of the disease. UMLS [13] indicates only three categories of medical concepts as potential treatments: "Therapeutic and Preventive Procedures", "Medical Devices" and "Pharmacological Substance". Thus only concepts that belong to these three categories will be expanded into the query. By filtering out the irrelevant supporting concepts, the computational complexity is greatly reduced. In addition, the precision in the low recall region increases.

To evaluate the performance improvement of query expansion, we select 28 OHSUMED queries [12] that contain general supporting concepts as the test set.
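The two concept selection approaches described in this section can be sketched as a single function that reads one row of the concept correlation table. Everything concrete here is hypothetical: the function name `expand_query`, the correlation values, and the `cats` dictionary, which stands in for a category lookup in the knowledge source.

```python
def expand_query(key, correlations, categories=None, allowed=None):
    """Build the expanded query as a list of (concept, weight) pairs.

    correlations maps concept -> correl(concept, key).
    categories maps concept -> category name (hypothetical stand-in for a
    knowledge source lookup); allowed is the set of categories to keep.
    """
    categories = categories or {}
    expanded = [(key, 1.0)]  # correl(key, key) = 1
    for concept, weight in sorted(correlations.items(), key=lambda kv: -kv[1]):
        if concept == key or weight == 0:
            continue  # zero correlation: treated as irrelevant
        if allowed is not None and categories.get(concept) not in allowed:
            continue  # outside the categories relevant to the supporting concept
        expanded.append((concept, weight))
    return expanded


# Hypothetical correlation-table row for the key concept "keratoconus".
row = {"contact lens": 0.98, "keratoplasty": 0.75,
       "corneal topography": 0.60, "back pain": 0.0}
cats = {
    "contact lens": "Medical Devices",
    "keratoplasty": "Therapeutic and Preventive Procedures",
    "corneal topography": "Diagnostic Procedure",
}
treatment = {"Therapeutic and Preventive Procedures", "Medical Devices",
             "Pharmacological Substance"}

without_ks = expand_query("keratoconus", row)
with_ks = expand_query("keratoconus", row, cats, treatment)
```

Without the knowledge source, every concept with nonzero correlation is added. With it, the diagnostic concept is dropped because it does not fall into one of the three treatment categories, which is what keeps the expansion small.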
Our experimental results for this query set reveal that the average query expansion size without a knowledge source is 1227 terms per query, while with UMLS the average expansion size is reduced to 82.3 terms per query. This represents more than an order of magnitude reduction in query expansion size. The retrieval performance improvement for the set of expanded OHSUMED queries is shown in Figure 2. We note that the expanded queries performed better than the unexpanded ones (the baseline). The queries derived from knowledge-based expansion perform better than those expanded without a knowledge base in the low recall region, and the performance is reversed in the high recall region. This is because the non-knowledge-based expansion includes many irrelevant specific concepts, which results in low precision in the low recall region. Since relevant documents in the high recall region are not typically covered by the highly correlated supporting concepts in the query, adding more terms by the non-knowledge-based method yields better precision in that region. Aside from the computational savings of the knowledge-based expansion method, the choice of which expansion method to use depends on the application. When the low recall region is of more concern, the knowledge-based method is preferable; if the high recall region is more important, the non-knowledge-based method should be considered.

[Figure 2. Retrieval performance improvements with query expansion. Horizontal axis: recall. Curves: Single Word Stem Baseline; Non-knowledge Based Expansion; Knowledge Based Expansion.]

4. Applications

We shall now present a document retrieval system that integrates the knowledge-based query expansion and phrase-based indexing for a digital medical library. As shown in Figure 3, the system consists of three subsystems: a document indexing engine, a query expansion engine, and a document retrieval engine.
The document indexing engine processes the knowledge source and the corpus offline and generates the data structures necessary for the online query expansion and document retrieval. When the system receives a user query, the query expansion engine expands the query and the document retrieval engine then returns a set of documents relevant to the user.

The medical knowledge source, UMLS, is used in the system for phrase detection, conceptual similarity derivation, and expansion term filtering. The document indexing engine processes UMLS and the document corpus separately. The conceptual similarity calculator derives the conceptual similarities between concepts from UMLS and stores them in the conceptual similarity table. The phrase detector identifies concepts in UMLS from the documents. The inverse document frequency calculator uses the output of the phrase detector to construct an inverse document frequency table. The phrase weight calculator in turn calculates the weights of concepts and word stems in each phrase, and converts the original corpus into a phrase-indexed corpus. The correlation calculator then derives the conceptual correlations from the phrase-indexed corpus and stores them in the concept correlation table.

[Figure 3. A phrase based indexing and query expansion document retrieval system. The diagram shows the query expansion engine, the document indexing engine, and the document retrieval engine, from the original query and the domain-specific knowledge source to the documents returned.]

When the system receives a user query, the phrase detector first parses the query into phrases. The general term detector then determines whether any query expansion is necessary by consulting the general supporting concept list. If no general supporting concepts are detected, the original query is input directly into the document retrieval engine. Otherwise, the query expander replaces general concept terms with the specific ones in the appropriate categories as specified in UMLS. The weights of the expanded concepts are looked up in the concept correlation table.

The document retrieval engine compares the expanded query with the phrase index and returns a set of documents to the user. First, the similarity calculator consults the conceptual similarity table to calculate the phrase-based document similarities. The ranker then returns to the user the documents most similar to the query.

5. Conclusion

We introduced a knowledge-based technique to rewrite a user query containing general conceptual terms into one containing specific terms that are related to the general supporting concept. These specific supporting terms are mined from the corpus.
Experimental results show that retrieval using such expanded queries is more effective than retrieval using the original queries. Because the additional concept terms included in the expanded query are selected from a few categories as indicated by the knowledge source, the average size of the expanded queries in our approach is much smaller (by more than an order of magnitude) than that produced by the full query expansion technique. The expanded queries also yield better retrieval performance in the low recall region, which is of interest to most applications.

We developed a new vector space model that uses phrases to represent documents. Each phrase consists of multiple concepts and words. Similarity between two phrases is jointly determined by the conceptual similarity and their common word stems. Our experimental results reveal that the phrase-based VSM yields a 16% increase in retrieval effectiveness over the stem-based VSM. This improvement arises because multi-word concepts are natural units of information and because using word stems in phrase-based document similarity compensates for the inaccuracy in conceptual similarities derived from incomplete knowledge sources. We also presented an implementation that integrates the above techniques into a digital medical library at UCLA for the retrieval of patient records, laboratory reports and medical literature.

References

[1] G. Salton and M.J. McGill. Introduction to Modern Information Retrieval, 1983.
[2] G.A. Miller, R. Beckwith, C. Fellbaum, D. Gross and K. Miller. Introduction to WordNet: an On-line Lexical Database. In WordNet: an Electronic Lexical Database, 1-19, 1998.
[3] M. Mitra, C. Buckley, A. Singhal and C. Cardie. An Analysis of Statistical and Syntactic Phrases. In Proc. RIAO97, 1997.
[4] R. Richardson and A.F. Smeaton. Using WordNet in a Knowledge-based Approach to Information Retrieval. In Proc. 17th BCS-IRSG, 1995.
[5] M. Sussna. Text Retrieval using Inference in Semantic Metanetworks. PhD Thesis, University of California, San Diego, 1997.
[6] E.M. Voorhees. Using WordNet to Disambiguate Word Sense for Text Retrieval. In Proc. 16th ACM-SIGIR, 1993.
[7] J.B. Lovins. Development of a Stemming Algorithm. In Mechanical Translation and Computational Linguistics, 11(1-2), 11-31, 1968.
[8] A.F. Smeaton and I. Quigley. Experiments on using Semantic Distances Between Words in Image Caption Retrieval. In Proc. 19th ACM-SIGIR, 1996.
[9] D. Johnson, W.W. Chu, J.D. Dionisio, R.K. Taira and H. Kangarloo. Creating and Indexing Teaching Files from Free-text Patient Reports. In AMIA 99, 1999.
[10] J.A. Goldman, W.W. Chu, D.S. Parker and R.M. Goldman. Term Domain Distribution Analysis: A Data Mining Tool for Text Databases. In 2001 IMIA Yearbook of Medical Informatics, 2001.
[11] N. Ide and J. Véronis. Word Sense Disambiguation: the State of the Art. In Computational Linguistics, 24(1), 1-40, 1998.
[12] W. Hersh, C. Buckley, T.J. Leone and D. Hickam. OHSUMED: an Interactive Retrieval Evaluation and New Large Test Collection for Research. In Proc. 22nd ACM-SIGIR Conf., 1994.
[13] National Library of Medicine. UMLS Knowledge Sources, 12th edition, 2001.
[14] J. Lyons. Semantics, 1977.
[15] A.V. Aho and M.J. Corasick. Efficient String Matching: an Aid to Bibliographic Search. In CACM, 18(6), 1975.

A model for measurement of the states in a coupled-dot qubit

A model for measurement of the states in a coupled-dot qubit A model for measurement of the states in a oupled-dot qubit H B Sun and H M Wiseman Centre for Quantum Computer Tehnology Centre for Quantum Dynamis Griffith University Brisbane 4 QLD Australia E-mail:

More information

Lightpath routing for maximum reliability in optical mesh networks

Lightpath routing for maximum reliability in optical mesh networks Vol. 7, No. 5 / May 2008 / JOURNAL OF OPTICAL NETWORKING 449 Lightpath routing for maximum reliability in optial mesh networks Shengli Yuan, 1, * Saket Varma, 2 and Jason P. Jue 2 1 Department of Computer

More information

Millennium Relativity Acceleration Composition. The Relativistic Relationship between Acceleration and Uniform Motion

Millennium Relativity Acceleration Composition. The Relativistic Relationship between Acceleration and Uniform Motion Millennium Relativity Aeleration Composition he Relativisti Relationship between Aeleration and niform Motion Copyright 003 Joseph A. Rybzyk Abstrat he relativisti priniples developed throughout the six

More information

DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS

DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS CHAPTER 4 DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS 4.1 INTRODUCTION Around the world, environmental and ost onsiousness are foring utilities to install

More information

Complexity of Regularization RBF Networks

Complexity of Regularization RBF Networks Complexity of Regularization RBF Networks Mark A Kon Department of Mathematis and Statistis Boston University Boston, MA 02215 mkon@buedu Leszek Plaskota Institute of Applied Mathematis University of Warsaw

More information

Danielle Maddix AA238 Final Project December 9, 2016

Danielle Maddix AA238 Final Project December 9, 2016 Struture and Parameter Learning in Bayesian Networks with Appliations to Prediting Breast Caner Tumor Malignany in a Lower Dimension Feature Spae Danielle Maddix AA238 Final Projet Deember 9, 2016 Abstrat

More information

The Effectiveness of the Linear Hull Effect

The Effectiveness of the Linear Hull Effect The Effetiveness of the Linear Hull Effet S. Murphy Tehnial Report RHUL MA 009 9 6 Otober 009 Department of Mathematis Royal Holloway, University of London Egham, Surrey TW0 0EX, England http://www.rhul.a.uk/mathematis/tehreports

More information

Wavetech, LLC. Ultrafast Pulses and GVD. John O Hara Created: Dec. 6, 2013

Wavetech, LLC. Ultrafast Pulses and GVD. John O Hara Created: Dec. 6, 2013 Ultrafast Pulses and GVD John O Hara Created: De. 6, 3 Introdution This doument overs the basi onepts of group veloity dispersion (GVD) and ultrafast pulse propagation in an optial fiber. Neessarily, it

More information

Coding for Random Projections and Approximate Near Neighbor Search

Coding for Random Projections and Approximate Near Neighbor Search Coding for Random Projetions and Approximate Near Neighbor Searh Ping Li Department of Statistis & Biostatistis Department of Computer Siene Rutgers University Pisataay, NJ 8854, USA pingli@stat.rutgers.edu

More information

Measuring & Inducing Neural Activity Using Extracellular Fields I: Inverse systems approach

Measuring & Inducing Neural Activity Using Extracellular Fields I: Inverse systems approach Measuring & Induing Neural Ativity Using Extraellular Fields I: Inverse systems approah Keith Dillon Department of Eletrial and Computer Engineering University of California San Diego 9500 Gilman Dr. La

More information

The Laws of Acceleration

The Laws of Acceleration The Laws of Aeleration The Relationships between Time, Veloity, and Rate of Aeleration Copyright 2001 Joseph A. Rybzyk Abstrat Presented is a theory in fundamental theoretial physis that establishes the

More information

Maximum Entropy and Exponential Families

Maximum Entropy and Exponential Families Maximum Entropy and Exponential Families April 9, 209 Abstrat The goal of this note is to derive the exponential form of probability distribution from more basi onsiderations, in partiular Entropy. It

More information

Assessing the Performance of a BCI: A Task-Oriented Approach

Assessing the Performance of a BCI: A Task-Oriented Approach Assessing the Performane of a BCI: A Task-Oriented Approah B. Dal Seno, L. Mainardi 2, M. Matteui Department of Eletronis and Information, IIT-Unit, Politenio di Milano, Italy 2 Department of Bioengineering,

More information

Variation Based Online Travel Time Prediction Using Clustered Neural Networks

Variation Based Online Travel Time Prediction Using Clustered Neural Networks Variation Based Online Travel Time Predition Using lustered Neural Networks Jie Yu, Gang-Len hang, H.W. Ho and Yue Liu Abstrat-This paper proposes a variation-based online travel time predition approah

More information

9 Geophysics and Radio-Astronomy: VLBI VeryLongBaseInterferometry

9 Geophysics and Radio-Astronomy: VLBI VeryLongBaseInterferometry 9 Geophysis and Radio-Astronomy: VLBI VeryLongBaseInterferometry VLBI is an interferometry tehnique used in radio astronomy, in whih two or more signals, oming from the same astronomial objet, are reeived

More information

A Spatiotemporal Approach to Passive Sound Source Localization

A Spatiotemporal Approach to Passive Sound Source Localization A Spatiotemporal Approah Passive Sound Soure Loalization Pasi Pertilä, Mikko Parviainen, Teemu Korhonen and Ari Visa Institute of Signal Proessing Tampere University of Tehnology, P.O.Box 553, FIN-330,

More information

An Adaptive Optimization Approach to Active Cancellation of Repeated Transient Vibration Disturbances

An Adaptive Optimization Approach to Active Cancellation of Repeated Transient Vibration Disturbances An aptive Optimization Approah to Ative Canellation of Repeated Transient Vibration Disturbanes David L. Bowen RH Lyon Corp / Aenteh, 33 Moulton St., Cambridge, MA 138, U.S.A., owen@lyonorp.om J. Gregory

More information

Sensitivity Analysis in Markov Networks

Sensitivity Analysis in Markov Networks Sensitivity Analysis in Markov Networks Hei Chan and Adnan Darwihe Computer Siene Department University of California, Los Angeles Los Angeles, CA 90095 {hei,darwihe}@s.ula.edu Abstrat This paper explores

More information

Combined Electric and Magnetic Dipoles for Mesoband Radiation, Part 2

Combined Electric and Magnetic Dipoles for Mesoband Radiation, Part 2 Sensor and Simulation Notes Note 53 3 May 8 Combined Eletri and Magneti Dipoles for Mesoband Radiation, Part Carl E. Baum University of New Mexio Department of Eletrial and Computer Engineering Albuquerque

More information

CMSC 451: Lecture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017

CMSC 451: Lecture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017 CMSC 451: Leture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017 Reading: Chapt 11 of KT and Set 54 of DPV Set Cover: An important lass of optimization problems involves overing a ertain domain,

More information

QCLAS Sensor for Purity Monitoring in Medical Gas Supply Lines

QCLAS Sensor for Purity Monitoring in Medical Gas Supply Lines DOI.56/sensoren6/P3. QLAS Sensor for Purity Monitoring in Medial Gas Supply Lines Henrik Zimmermann, Mathias Wiese, Alessandro Ragnoni neoplas ontrol GmbH, Walther-Rathenau-Str. 49a, 7489 Greifswald, Germany

More information

15.12 Applications of Suffix Trees

15.12 Applications of Suffix Trees 248 Algorithms in Bioinformatis II, SoSe 07, ZBIT, D. Huson, May 14, 2007 15.12 Appliations of Suffix Trees 1. Searhing for exat patterns 2. Minimal unique substrings 3. Maximum unique mathes 4. Maximum

More information

Supplementary Materials

Supplementary Materials Supplementary Materials Neural population partitioning and a onurrent brain-mahine interfae for sequential motor funtion Maryam M. Shanehi, Rollin C. Hu, Marissa Powers, Gregory W. Wornell, Emery N. Brown

More information

The Concept of Mass as Interfering Photons, and the Originating Mechanism of Gravitation D.T. Froedge

The Concept of Mass as Interfering Photons, and the Originating Mechanism of Gravitation D.T. Froedge The Conept of Mass as Interfering Photons, and the Originating Mehanism of Gravitation D.T. Froedge V04 Formerly Auburn University Phys-dtfroedge@glasgow-ky.om Abstrat For most purposes in physis the onept

More information

Likelihood-confidence intervals for quantiles in Extreme Value Distributions

Likelihood-confidence intervals for quantiles in Extreme Value Distributions Likelihood-onfidene intervals for quantiles in Extreme Value Distributions A. Bolívar, E. Díaz-Franés, J. Ortega, and E. Vilhis. Centro de Investigaión en Matemátias; A.P. 42, Guanajuato, Gto. 36; Méxio

More information

Aharonov-Bohm effect. Dan Solomon.

Aharonov-Bohm effect. Dan Solomon. Aharonov-Bohm effet. Dan Solomon. In the figure the magneti field is onfined to a solenoid of radius r 0 and is direted in the z- diretion, out of the paper. The solenoid is surrounded by a barrier that

More information

Determination of the reaction order

Determination of the reaction order 5/7/07 A quote of the wee (or amel of the wee): Apply yourself. Get all the eduation you an, but then... do something. Don't just stand there, mae it happen. Lee Iaoa Physial Chemistry GTM/5 reation order

More information

On the Bit Error Probability of Noisy Channel Networks With Intermediate Node Encoding I. INTRODUCTION

On the Bit Error Probability of Noisy Channel Networks With Intermediate Node Encoding I. INTRODUCTION 5188 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 11, NOVEMBER 2008 [8] A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood estimation from inomplete data via the EM algorithm, J.

More information

Contact Block Reduction Method for Ballistic Quantum Transport with Semi-empirical sp3d5s* Tight Binding band models

Contact Block Reduction Method for Ballistic Quantum Transport with Semi-empirical sp3d5s* Tight Binding band models Purdue University Purdue e-pubs Other Nanotehnology Publiations Birk Nanotehnology Center -2-28 Contat Redution Method for Ballisti Quantum Transport with Semi-empirial sp3d5s* Tight Binding band models

More information

Control Theory association of mathematics and engineering

Control Theory association of mathematics and engineering Control Theory assoiation of mathematis and engineering Wojieh Mitkowski Krzysztof Oprzedkiewiz Department of Automatis AGH Univ. of Siene & Tehnology, Craow, Poland, Abstrat In this paper a methodology

More information

A Heuristic Approach for Design and Calculation of Pressure Distribution over Naca 4 Digit Airfoil

A Heuristic Approach for Design and Calculation of Pressure Distribution over Naca 4 Digit Airfoil IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 PP 11-15 www.iosrjen.org A Heuristi Approah for Design and Calulation of Pressure Distribution over Naa 4 Digit Airfoil G.

More information

The Hanging Chain. John McCuan. January 19, 2006

The Hanging Chain. John McCuan. January 19, 2006 The Hanging Chain John MCuan January 19, 2006 1 Introdution We onsider a hain of length L attahed to two points (a, u a and (b, u b in the plane. It is assumed that the hain hangs in the plane under a

More information

Nonreversibility of Multiple Unicast Networks

Nonreversibility of Multiple Unicast Networks Nonreversibility of Multiple Uniast Networks Randall Dougherty and Kenneth Zeger September 27, 2005 Abstrat We prove that for any finite direted ayli network, there exists a orresponding multiple uniast

More information

Exploring the feasibility of on-site earthquake early warning using close-in records of the 2007 Noto Hanto earthquake

Exploring the feasibility of on-site earthquake early warning using close-in records of the 2007 Noto Hanto earthquake Exploring the feasibility of on-site earthquake early warning using lose-in reords of the 2007 Noto Hanto earthquake Yih-Min Wu 1 and Hiroo Kanamori 2 1. Department of Geosienes, National Taiwan University,

More information

Case I: 2 users In case of 2 users, the probability of error for user 1 was earlier derived to be 2 A1

Case I: 2 users In case of 2 users, the probability of error for user 1 was earlier derived to be 2 A1 MUTLIUSER DETECTION (Letures 9 and 0) 6:33:546 Wireless Communiations Tehnologies Instrutor: Dr. Narayan Mandayam Summary By Shweta Shrivastava (shwetash@winlab.rutgers.edu) bstrat This artile ontinues

More information

A new method of measuring similarity between two neutrosophic soft sets and its application in pattern recognition problems

A new method of measuring similarity between two neutrosophic soft sets and its application in pattern recognition problems Neutrosophi Sets and Systems, Vol. 8, 05 63 A new method of measuring similarity between two neutrosophi soft sets and its appliation in pattern reognition problems Anjan Mukherjee, Sadhan Sarkar, Department

More information

A simple expression for radial distribution functions of pure fluids and mixtures

A simple expression for radial distribution functions of pure fluids and mixtures A simple expression for radial distribution funtions of pure fluids and mixtures Enrio Matteoli a) Istituto di Chimia Quantistia ed Energetia Moleolare, CNR, Via Risorgimento, 35, 56126 Pisa, Italy G.

More information

Computer Science 786S - Statistical Methods in Natural Language Processing and Data Analysis Page 1

Computer Science 786S - Statistical Methods in Natural Language Processing and Data Analysis Page 1 Computer Siene 786S - Statistial Methods in Natural Language Proessing and Data Analysis Page 1 Hypothesis Testing A statistial hypothesis is a statement about the nature of the distribution of a random

More information

EE 321 Project Spring 2018

EE 321 Project Spring 2018 EE 21 Projet Spring 2018 This ourse projet is intended to be an individual effort projet. The student is required to omplete the work individually, without help from anyone else. (The student may, however,

More information

arxiv:gr-qc/ v2 6 Feb 2004

arxiv:gr-qc/ v2 6 Feb 2004 Hubble Red Shift and the Anomalous Aeleration of Pioneer 0 and arxiv:gr-q/0402024v2 6 Feb 2004 Kostadin Trenčevski Faulty of Natural Sienes and Mathematis, P.O.Box 62, 000 Skopje, Maedonia Abstrat It this

More information

The universal model of error of active power measuring channel

The universal model of error of active power measuring channel 7 th Symposium EKO TC 4 3 rd Symposium EKO TC 9 and 5 th WADC Workshop nstrumentation for the CT Era Sept. 8-2 Kosie Slovakia The universal model of error of ative power measuring hannel Boris Stogny Evgeny

More information

Weighted K-Nearest Neighbor Revisited

Weighted K-Nearest Neighbor Revisited Weighted -Nearest Neighbor Revisited M. Biego University of Verona Verona, Italy Email: manuele.biego@univr.it M. Loog Delft University of Tehnology Delft, The Netherlands Email: m.loog@tudelft.nl Abstrat

More information

Normative and descriptive approaches to multiattribute decision making

Normative and descriptive approaches to multiattribute decision making De. 009, Volume 8, No. (Serial No.78) China-USA Business Review, ISSN 57-54, USA Normative and desriptive approahes to multiattribute deision making Milan Terek (Department of Statistis, University of

More information

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. This doument is downloaded from DR-NTU, Nanyang Tehnologial University Library, Singapore. Title Extrating Threshold Coneptual Strutures from Web Douments Author(s) Ciobanu, Gabriel; Horne, Ross; Vaideanu,

More information

Array Design for Superresolution Direction-Finding Algorithms

Array Design for Superresolution Direction-Finding Algorithms Array Design for Superresolution Diretion-Finding Algorithms Naushad Hussein Dowlut BEng, ACGI, AMIEE Athanassios Manikas PhD, DIC, AMIEE, MIEEE Department of Eletrial Eletroni Engineering Imperial College

More information

22.54 Neutron Interactions and Applications (Spring 2004) Chapter 6 (2/24/04) Energy Transfer Kernel F(E E')

22.54 Neutron Interactions and Applications (Spring 2004) Chapter 6 (2/24/04) Energy Transfer Kernel F(E E') 22.54 Neutron Interations and Appliations (Spring 2004) Chapter 6 (2/24/04) Energy Transfer Kernel F(E E') Referenes -- J. R. Lamarsh, Introdution to Nulear Reator Theory (Addison-Wesley, Reading, 1966),

More information

UNCERTAINTY RELATIONS AS A CONSEQUENCE OF THE LORENTZ TRANSFORMATIONS. V. N. Matveev and O. V. Matvejev

UNCERTAINTY RELATIONS AS A CONSEQUENCE OF THE LORENTZ TRANSFORMATIONS. V. N. Matveev and O. V. Matvejev UNCERTAINTY RELATIONS AS A CONSEQUENCE OF THE LORENTZ TRANSFORMATIONS V. N. Matveev and O. V. Matvejev Joint-Stok Company Sinerta Savanoriu pr., 159, Vilnius, LT-315, Lithuania E-mail: matwad@mail.ru Abstrat

More information

Evaluation of effect of blade internal modes on sensitivity of Advanced LIGO

Evaluation of effect of blade internal modes on sensitivity of Advanced LIGO Evaluation of effet of blade internal modes on sensitivity of Advaned LIGO T0074-00-R Norna A Robertson 5 th Otober 00. Introdution The urrent model used to estimate the isolation ahieved by the quadruple

More information

Physical Laws, Absolutes, Relative Absolutes and Relativistic Time Phenomena

Physical Laws, Absolutes, Relative Absolutes and Relativistic Time Phenomena Page 1 of 10 Physial Laws, Absolutes, Relative Absolutes and Relativisti Time Phenomena Antonio Ruggeri modexp@iafria.om Sine in the field of knowledge we deal with absolutes, there are absolute laws that

More information

Modeling Probabilistic Measurement Correlations for Problem Determination in Large-Scale Distributed Systems

Modeling Probabilistic Measurement Correlations for Problem Determination in Large-Scale Distributed Systems 009 9th IEEE International Conferene on Distributed Computing Systems Modeling Probabilisti Measurement Correlations for Problem Determination in Large-Sale Distributed Systems Jing Gao Guofei Jiang Haifeng

More information

Optimization of Statistical Decisions for Age Replacement Problems via a New Pivotal Quantity Averaging Approach

Optimization of Statistical Decisions for Age Replacement Problems via a New Pivotal Quantity Averaging Approach Amerian Journal of heoretial and Applied tatistis 6; 5(-): -8 Published online January 7, 6 (http://www.sienepublishinggroup.om/j/ajtas) doi:.648/j.ajtas.s.65.4 IN: 36-8999 (Print); IN: 36-96 (Online)

More information

Simplified Buckling Analysis of Skeletal Structures

Simplified Buckling Analysis of Skeletal Structures Simplified Bukling Analysis of Skeletal Strutures B.A. Izzuddin 1 ABSRAC A simplified approah is proposed for bukling analysis of skeletal strutures, whih employs a rotational spring analogy for the formulation

More information

Phase Diffuser at the Transmitter for Lasercom Link: Effect of Partially Coherent Beam on the Bit-Error Rate.

Phase Diffuser at the Transmitter for Lasercom Link: Effect of Partially Coherent Beam on the Bit-Error Rate. Phase Diffuser at the Transmitter for Laserom Link: Effet of Partially Coherent Beam on the Bit-Error Rate. O. Korotkova* a, L. C. Andrews** a, R. L. Phillips*** b a Dept. of Mathematis, Univ. of Central

More information

After the completion of this section the student should recall

After the completion of this section the student should recall Chapter I MTH FUNDMENTLS I. Sets, Numbers, Coordinates, Funtions ugust 30, 08 3 I. SETS, NUMERS, COORDINTES, FUNCTIONS Objetives: fter the ompletion of this setion the student should reall - the definition

More information

Einstein s Three Mistakes in Special Relativity Revealed. Copyright Joseph A. Rybczyk

Einstein s Three Mistakes in Special Relativity Revealed. Copyright Joseph A. Rybczyk Einstein s Three Mistakes in Speial Relativity Revealed Copyright Joseph A. Rybzyk Abstrat When the evidene supported priniples of eletromagneti propagation are properly applied, the derived theory is

More information

A Probabilistic Fusion Framework

A Probabilistic Fusion Framework A Probabilisti Fusion Framework ABSTRACT Yael Anava Faulty of IE&M, Tehnion yaelan@tx.tehnion.a.il Oren Kurland Faulty of IE&M, Tehnion kurland@ie.tehnion.a.il There are numerous methods for fusing doument

More information

Design and Development of Three Stages Mixed Sampling Plans for Variable Attribute Variable Quality Characteristics

Design and Development of Three Stages Mixed Sampling Plans for Variable Attribute Variable Quality Characteristics International Journal of Statistis and Systems ISSN 0973-2675 Volume 12, Number 4 (2017), pp. 763-772 Researh India Publiations http://www.ripubliation.om Design and Development of Three Stages Mixed Sampling

More information

Singular Event Detection

Singular Event Detection Singular Event Detetion Rafael S. Garía Eletrial Engineering University of Puerto Rio at Mayagüez Rafael.Garia@ee.uprm.edu Faulty Mentor: S. Shankar Sastry Researh Supervisor: Jonathan Sprinkle Graduate

More information

UPPER-TRUNCATED POWER LAW DISTRIBUTIONS

UPPER-TRUNCATED POWER LAW DISTRIBUTIONS Fratals, Vol. 9, No. (00) 09 World Sientifi Publishing Company UPPER-TRUNCATED POWER LAW DISTRIBUTIONS STEPHEN M. BURROUGHS and SARAH F. TEBBENS College of Marine Siene, University of South Florida, St.

More information

Taste for variety and optimum product diversity in an open economy

Taste for variety and optimum product diversity in an open economy Taste for variety and optimum produt diversity in an open eonomy Javier Coto-Martínez City University Paul Levine University of Surrey Otober 0, 005 María D.C. Garía-Alonso University of Kent Abstrat We

More information

Modeling of Threading Dislocation Density Reduction in Heteroepitaxial Layers

Modeling of Threading Dislocation Density Reduction in Heteroepitaxial Layers A. E. Romanov et al.: Threading Disloation Density Redution in Layers (II) 33 phys. stat. sol. (b) 99, 33 (997) Subjet lassifiation: 6.72.C; 68.55.Ln; S5.; S5.2; S7.; S7.2 Modeling of Threading Disloation

More information

Controller Design Based on Transient Response Criteria. Chapter 12 1

Controller Design Based on Transient Response Criteria. Chapter 12 1 Controller Design Based on Transient Response Criteria Chapter 12 1 Desirable Controller Features 0. Stable 1. Quik responding 2. Adequate disturbane rejetion 3. Insensitive to model, measurement errors

More information

An I-Vector Backend for Speaker Verification

An I-Vector Backend for Speaker Verification An I-Vetor Bakend for Speaker Verifiation Patrik Kenny, 1 Themos Stafylakis, 1 Jahangir Alam, 1 and Marel Kokmann 2 1 CRIM, Canada, {patrik.kenny, themos.stafylakis, jahangir.alam}@rim.a 2 VoieTrust, Canada,

More information

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fischer) January 21,

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fischer) January 21, Sequene Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fisher) January 21, 201511 9 Suffix Trees and Suffix Arrays This leture is based on the following soures, whih are all reommended

More information

FINITE WORD LENGTH EFFECTS IN DSP

FINITE WORD LENGTH EFFECTS IN DSP FINITE WORD LENGTH EFFECTS IN DSP PREPARED BY GUIDED BY Snehal Gor Dr. Srianth T. ABSTRACT We now that omputers store numbers not with infinite preision but rather in some approximation that an be paed

More information

IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE MAJOR STREET AT A TWSC INTERSECTION

IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE MAJOR STREET AT A TWSC INTERSECTION 09-1289 Citation: Brilon, W. (2009): Impedane Effets of Left Turners from the Major Street at A TWSC Intersetion. Transportation Researh Reord Nr. 2130, pp. 2-8 IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE

More information

Feature Selection by Independent Component Analysis and Mutual Information Maximization in EEG Signal Classification

Feature Selection by Independent Component Analysis and Mutual Information Maximization in EEG Signal Classification Feature Seletion by Independent Component Analysis and Mutual Information Maximization in EEG Signal Classifiation Tian Lan, Deniz Erdogmus, Andre Adami, Mihael Pavel BME Department, Oregon Health & Siene

More information

ONLINE APPENDICES for Cost-Effective Quality Assurance in Crowd Labeling

ONLINE APPENDICES for Cost-Effective Quality Assurance in Crowd Labeling ONLINE APPENDICES for Cost-Effetive Quality Assurane in Crowd Labeling Jing Wang Shool of Business and Management Hong Kong University of Siene and Tehnology Clear Water Bay Kowloon Hong Kong jwang@usthk

More information

Planning with Uncertainty in Position: an Optimal Planner

Planning with Uncertainty in Position: an Optimal Planner Planning with Unertainty in Position: an Optimal Planner Juan Pablo Gonzalez Anthony (Tony) Stentz CMU-RI -TR-04-63 The Robotis Institute Carnegie Mellon University Pittsburgh, Pennsylvania 15213 Otober

More information

Mean Activity Coefficients of Peroxodisulfates in Saturated Solutions of the Conversion System 2NH 4. H 2 O at 20 C and 30 C

Mean Activity Coefficients of Peroxodisulfates in Saturated Solutions of the Conversion System 2NH 4. H 2 O at 20 C and 30 C Mean Ativity Coeffiients of Peroxodisulfates in Saturated Solutions of the Conversion System NH 4 Na S O 8 H O at 0 C and 0 C Jan Balej Heřmanova 5, 170 00 Prague 7, Czeh Republi balejan@seznam.z Abstrat:

More information

Particle-wave symmetry in Quantum Mechanics And Special Relativity Theory

Particle-wave symmetry in Quantum Mechanics And Special Relativity Theory Partile-wave symmetry in Quantum Mehanis And Speial Relativity Theory Author one: XiaoLin Li,Chongqing,China,hidebrain@hotmail.om Corresponding author: XiaoLin Li, Chongqing,China,hidebrain@hotmail.om

More information

Multicomponent analysis on polluted waters by means of an electronic tongue

Multicomponent analysis on polluted waters by means of an electronic tongue Sensors and Atuators B 44 (1997) 423 428 Multiomponent analysis on polluted waters by means of an eletroni tongue C. Di Natale a, *, A. Maagnano a, F. Davide a, A. D Amio a, A. Legin b, Y. Vlasov b, A.

More information

Adaptive neuro-fuzzy inference system-based controllers for smart material actuator modelling

Adaptive neuro-fuzzy inference system-based controllers for smart material actuator modelling Adaptive neuro-fuzzy inferene system-based ontrollers for smart material atuator modelling T L Grigorie and R M Botez Éole de Tehnologie Supérieure, Montréal, Quebe, Canada The manusript was reeived on

More information

The gravitational phenomena without the curved spacetime

The gravitational phenomena without the curved spacetime The gravitational phenomena without the urved spaetime Mirosław J. Kubiak Abstrat: In this paper was presented a desription of the gravitational phenomena in the new medium, different than the urved spaetime,

More information

The Second Postulate of Euclid and the Hyperbolic Geometry

The Second Postulate of Euclid and the Hyperbolic Geometry 1 The Seond Postulate of Eulid and the Hyperboli Geometry Yuriy N. Zayko Department of Applied Informatis, Faulty of Publi Administration, Russian Presidential Aademy of National Eonomy and Publi Administration,

More information

Development of Fuzzy Extreme Value Theory. Populations

Development of Fuzzy Extreme Value Theory. Populations Applied Mathematial Sienes, Vol. 6, 0, no. 7, 58 5834 Development of Fuzzy Extreme Value Theory Control Charts Using α -uts for Sewed Populations Rungsarit Intaramo Department of Mathematis, Faulty of

More information

A NETWORK SIMPLEX ALGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM

A NETWORK SIMPLEX ALGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM NETWORK SIMPLEX LGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM Cen Çalışan, Utah Valley University, 800 W. University Parway, Orem, UT 84058, 801-863-6487, en.alisan@uvu.edu BSTRCT The minimum

More information

UTC. Engineering 329. Proportional Controller Design. Speed System. John Beverly. Green Team. John Beverly Keith Skiles John Barker.

UTC. Engineering 329. Proportional Controller Design. Speed System. John Beverly. Green Team. John Beverly Keith Skiles John Barker. UTC Engineering 329 Proportional Controller Design for Speed System By John Beverly Green Team John Beverly Keith Skiles John Barker 24 Mar 2006 Introdution This experiment is intended test the variable

More information

Relativistic Dynamics

Relativistic Dynamics Chapter 7 Relativisti Dynamis 7.1 General Priniples of Dynamis 7.2 Relativisti Ation As stated in Setion A.2, all of dynamis is derived from the priniple of least ation. Thus it is our hore to find a suitable

More information

"Research Note" ANALYSIS AND OPTIMIZATION OF A FISSION CHAMBER DETECTOR USING MCNP4C AND SRIM MONTE CARLO CODES *

Research Note ANALYSIS AND OPTIMIZATION OF A FISSION CHAMBER DETECTOR USING MCNP4C AND SRIM MONTE CARLO CODES * Iranian Journal of Siene & Tehnology, Transation A, Vol. 33, o. A3 Printed in the Islami Republi of Iran, 9 Shiraz University "Researh ote" AALYSIS AD OPTIMIZATIO OF A FISSIO CHAMBER DETECTOR USIG MCP4C

More information

Ranking of Bangladeshi Coals Based on Fuzzy Set Theory

Ranking of Bangladeshi Coals Based on Fuzzy Set Theory Proeedings of the International Conferene on Industrial Engineering and Operations Management Dhaka, Bangladesh, January 9, Ranking of Bangladeshi Coals Based on Fuzzy Set Theory M. Shamsuzzaman, and F.

More information

Average Rate Speed Scaling

Average Rate Speed Scaling Average Rate Speed Saling Nikhil Bansal David P. Bunde Ho-Leung Chan Kirk Pruhs May 2, 2008 Abstrat Speed saling is a power management tehnique that involves dynamially hanging the speed of a proessor.

More information

arxiv: v2 [math.pr] 9 Dec 2016

arxiv: v2 [math.pr] 9 Dec 2016 Omnithermal Perfet Simulation for Multi-server Queues Stephen B. Connor 3th Deember 206 arxiv:60.0602v2 [math.pr] 9 De 206 Abstrat A number of perfet simulation algorithms for multi-server First Come First

More information

Methods of evaluating tests

Methods of evaluating tests Methods of evaluating tests Let X,, 1 Xn be i.i.d. Bernoulli( p ). Then 5 j= 1 j ( 5, ) T = X Binomial p. We test 1 H : p vs. 1 1 H : p>. We saw that a LRT is 1 if t k* φ ( x ) =. otherwise (t is the observed

More information

estimated pulse sequene we produe estimates of the delay time struture of ripple-red events. Formulation of the Problem We model the k th seismi trae

estimated pulse sequene we produe estimates of the delay time struture of ripple-red events. Formulation of the Problem We model the k th seismi trae Bayesian Deonvolution of Seismi Array Data Using the Gibbs Sampler Eri A. Suess Robert Shumway, University of California, Davis Rong Chen, Texas A&M University Contat: Eri A. Suess, Division of Statistis,

More information

Conformal Mapping among Orthogonal, Symmetric, and Skew-Symmetric Matrices

Conformal Mapping among Orthogonal, Symmetric, and Skew-Symmetric Matrices AAS 03-190 Conformal Mapping among Orthogonal, Symmetri, and Skew-Symmetri Matries Daniele Mortari Department of Aerospae Engineering, Texas A&M University, College Station, TX 77843-3141 Abstrat This

More information
