Lecture 1: Topics in Information Retrieval. Johan Bollen Old Dominion University Department of Computer Science
|
|
- Elizabeth Flowers
- 5 years ago
- Views:
Transcription
1 Lecture 1: Topics in Information Retrieval. Johan Bollen Old Dominion University Department of Computer Science jbollen January 16, 2003 Page 1
2 Structure 1. Class approach and schedule 2. Class methods and software: (a) LaTeX (b) Gnuplot (c) UNIX (d) Perl 3. Introduction to IR: (a) problem statement (b) definitions (c) history January 16, 2003 Page 2
3 Class Approach 1. LaTeX (a) Submission of reports, etc in PDF only (b) Use provided LaTeX templates to produce PDF (c) Word, Excel, etc: NO 2. Gnuplot (a) GPL licensed graph tool (b) Command-line driven (c) Available for most platforms 3. UNIX (a) Don t re-invent the wheel! (b) Use UNIX tools (c) sort, uniq, awk, grep, sed 4. Perl (a) For projects etc all programming languages are OK (b) Class examples: Perl (c) Strong recommendation to use Perl for your programming January 16, 2003 Page 3
4 Class Schedule 1. JAN 16: Class approach and schedule 2. JAN 23: IR models: taxonomy, formal characterization 3. JAN 30: Probabilistic models, Alternative Set Theoretic Models 4. FEB 6: Alternative Algabraic Models, Alternative Probabilistic Models. 5. FEB 13: Text Operations 6. FEB 20: Indexing and Searching. 7. FEB 27: Query Operations 8. MAR 6: Retrieval Evaluation 9. MAR 20: Human Computer Interaction 10. MAR 27: Digital Libraries 11. APR 3: Web IR, Citation analysis and link analysis 12. APR 10: Latent Semantic Analysis and Multi-Dimensional Scaling 13. APR 17: Recommender Systems 14. APR 24: Adaptive Hypertext and Hypermedia Subject to change January 16, 2003 Page 4
5 2 exams and 1 project 1. Exams: (a) Dates: i. Midterm: take home, scheduled Feb 27th (class 9), submission deadline: March 6th (class 10) ii. Final: Take home, May 8th - Exam (b) Submission: 2. Projects: i. LaTeX templates ii. PDF format only (a) I ll provide list of suggestions (b) Free to choose programming languages, software and tools (c) Proposal defenses: class 8 (d) Project progress report: class 12 (e) Final Submission and presentation: final exam 3. Grade distribution (a) Projects: 30% (b) Midterm: 30% (c) Final: 40% January 16, 2003 Page 5
6 Exams 1. Take home questions 3. choose points per question (a) 0=bad (b) 1=fair (c) 2=good (d) 3=excellent 5. Questions will cover: (a) Theory (b) Practice (c) Technical matters (d) Readings January 16, 2003 Page 6
7 Readings 1. Readings: (a) Seminal articles in IR (b) Additional Material (c) 2 readings per lecture 2. Important: (a) Connect to existing academic practices (b) Scholarly publications (c) IR is still very much alive 3. Presentations: (a) Two random students picked each start of class (b) 10 minute (precisely!) presentations (c) 1 Presentation = 1 exam question 4. Grading: (a) 0-3 scale (b) Subjective but we all know a bad presentation when we see one (c) You may not agree but your peers will! January 16, 2003 Page 7
8 Some words on Academic Honesty Everything you submit to me is supposed to be 1) your, 2) original work. 1. Your work: not that of others (a) Collaboration with other students: only when explicitly required (b) Copying other people s work, texts, software, articles, ideas and attributing it to yourself is fraud (c) Whether something is openly avalaible on the WWW or anywhere else is irrelevant 2. Original work and references: (a) Science is all about using other people s ideas as the foundation of your own (b) Your work can be based on other people s ideas as long as you identify the source (c) When you base your argument, idea, tenets, hypothesis on other people s work: i. Use a citation: clearly identify the source ii. Citations may not comprise more than 10% of your text (d) A citation or quotation does not constitute a valid response, answer or submission: I need to see your work January 16, 2003 Page 8
9 Last words Personally, I ABHOR academic dishonesty 1. The academic systems functions on trust: people building on what others have produced and identifying their sources 2. Open communication of ideas etc only works when people are honest enough to give due credit and work on their own ideas 3. People who steal words, texts, ideas are parasites within that fragile system and will worsen the situation of us all 4. In game theoretical terms, I believe in a tit-for-tat strategy: you cheat once, I cancel our cooperation. 5. The objective of this class is not to teach you how to copy and slavishly replay other people s ideas but to generate your own: be pretentious! Show some hubris! Be an iconoclast! Think for yourself! January 16, 2003 Page 9
10 Class software and tools: GnuPlot Gnuplot is free, powerful and generates great graphs! 1. Open Source product 2. Available for many platforms, including Windows 3. Command-line driven 4. Reads data files in plain text 5. Exports png, eps, and LaTeX! 6. Can be scripted 7. Generates great looking, no-frills graphs 8. Low overhead Did I mention it is free? January 16, 2003 Page 10
11 A Gnuplot Session (1): January 16, 2003 Page 11
12 A Gnuplot Session (2): January 16, 2003 Page 12
13 A Gnuplot Session (3): January 16, 2003 Page 13
14 Class software and tools: LaTeX LaTeX: submission of exams and project reports. 1. Why not use Word, WordPerfect, or your favorit word processor here (a) Proprietary formats that put burden on user to layout rather than write and structure content (b) Closed approach: you want your documents to be available anywhere on any system/os (c) Scientific publishing: LaTeX does math, references, etc much better and more transparantly than any other WYSIWYG processor (d) I prefer standardized submission format and layout for your project reports and exams (e) LaTeX is scientific standard among mathematicians, physicists, and computer scientists: essential for your career to learn principles 2. Principles (a) Markup language like HTML, SGML, etc (b) Markup specifies document structure and semantics, not layout (c) Writer takes care of content and structure, not how things look (d) LaTeX source code is compiled to good looking document by LaTeX compiler January 16, 2003 Page 14
15 Basics of LaTeX 1. LaTeX source code: text file generated in any editor you like (I prefer EMACS) 2. File consists of two parts: (a) Preamble i. Definition of document style: article, book, etc, ii. Definition of other general (b) Body document features (title, author, etc) i. Contains actual document body, interspersed with structural markup ii. Markup declares environments for text: indicate features and structure January 16, 2003 Page 15
16 LaTeX Compilation LaTeX latex pdflatex dvi PDF dvips PS 1. Compilation: (a) from LaTeX to postscript: latex compiler to dvi, dvips to postscript, ps2pdf postscript to pdf (b) from LaTeX to PDF: pdflatex 2. View final result (a) GhostView (b) Acrobat reader January 16, 2003 Page 16
17 Latex Example: \ documentclass { a r t i c l e } \ t i t l e { F i n a l Exam : CS695 T opics i n IR} \ author {You} \ begin { document } \ m a k e t i t l e \ s e c t i o n { I n t r o d u c t i o n } \ s u b s e c t i o n { Q u e s t i o n 1 } Etenim s p e c i e s e s t p u l c h r i s c o r p o r i b u s \ s u b s u b s e c t i o n { Q u e s t i o n 2 } Quid ego m i s e r i n t e amavi, o furtum meum Some math : $f ( x ) = x i ˆ2 $ and : $f ( x ) = \ sum { i =0}ˆ{10} w { i, j }$ \ s e c t i o n { C o n c l u s i o n } Invoco t e, deus meus, m i s e r i c o r d i a mea, q u i f e c i s t i me \end{ document } January 16, 2003 Page 16
18 Looks like: January 16, 2003 Page 17
19 Latex lists \ documentclass { a r t i c l e } \ begin { document } \ begin { enumerate } \ item F i r s t item \ begin { i t e m i z e } \ item B u l l e t 1 \ item B u l l e t 2 \end{ i t e m i z e } \ item Second item \end{ enumerate } \end{ document } January 16, 2003 Page 17
20 Looks like: January 16, 2003 Page 18
21 Latex lists \ documentclass { a r t i c l e } \ begin { document } Math e n v i r o n m e n t can be s t a r t e d with \ $ \\ and t e r m i n a t e d with \ $.\\ Exponents and i n d i c e s : $f ( x ) = x i ˆ2 $, or $f ( x ) = x { i, j }ˆ{12} $\\ Sigma : $f ( x ) = sum { i =0}ˆ{10} w { i, j }$\\ F r a c t i o n : $\ f r a c {\sum { i =0}ˆ{10} w { i, j }} {\ sum j w j }$\\ Roots and i n t e g r a l s :\\ $\ s q r t [ n ]{\ f r a c {1}{ sum { i =0}ˆ{10} w { i, j }}}$,\\ and $\ i n t a ˆ b f i ( x ) $\\ S e t t h e o r y :\\ $\ f o r a l l \ vec {x } \ in \ m a t h c a l {N} : \ e x i s t s \ vec {y } \ in A \ cup B$ \end{ document } January 16, 2003 Page 18
22 Looks like: January 16, 2003 Page 19
23 Latex lists \ documentclass [ p d f t e x ]{ a r t i c l e } \ usepackage [ d v i p s ]{ g r a p h i c x } \ begin { document } PNG, JPG and PDF images can be i n s e r t as f o l l o w s :\\ \ begin { f i g u r e } \ begin { c e n t e r } \ i n c l u d e g r a p h i c s [ s c a l e = 0. 4 ] { pngs / t e x i m a g e } \end{ c e n t e r } \ c a p t i o n {\ l a b e l { example } The f u n c t i o n $\ cos (\ f r a c {1}{\ alpha }) $} \end{ f i g u r e } The l a b e l i n c a p t i o n can be used t o r e f e r t o t h e image. For example, s e e f i g u r e \ r e f { example }. \end{ document } January 16, 2003 Page 19
24 Looks like: January 16, 2003 Page 20
25 Latex lists \ documentclass { a r t i c l e } \ begin { document } C i t a t i o n s a r e v ery easy. No more manually m a i n t a i n i n g a b i b l i o g r a p h y!\\ You keep a b i b t e x f i l e ( e. g. exam 1. b i b ) which s t o r e s your b i b l i o g r a p h i c r e c o r d s. In t h i s b i b t e x f i l e you a s s i g n each r e c o r d a key t h a t i s unique and makes s e n s e. Then c i t i n g a r e c o r d i s as easy as s i m p l y s t a t i n g : \ c i t e {key 1 }, i n your LaTeX code! Adding a n o t h e r c i t a t i o n? Simply use t h e r i g h t key : \ c i t e {key 2}.\\ To add a b i b l i o g r a p p h y s t o your document simply s t a t e : \ b i b l i o g r a p h y s t y l e { p l a i n } \ b i b l i o g r a p h y {exam1} \end{ document } January 16, 2003 Page 20
26 Looks like: January 16, 2003 Page 21
27 LaTeX Resources on the web http: // Best book: A Guide to LaTeX, by Helmut Kopka and Patrick W. Daly January 16, 2003 Page 22
28 Exams and projects I will provide templates for all project reports and exams You need to simply fill in your answers and text, and compile these documents to PDF files following the provided instructions This will provide you 1. An introduction into LaTeX authoring 2. A convenient means to produce professionally looking reports and take-home exams 3. An open standard for document exchange across multiple platforms January 16, 2003 Page 23
29 Perl programming and UNIX tools 1. Information Retrieval often deals with texts 2. Choose language designed to deal with text 3. Perl! (a) Convenient text parsing (b) Convenient access to high-level data objects e.g, associative arrays (c) Excellent database interface support (d) Easy to implement crawlers and HTTP clients (e) Perfect for UNIX scripting January 16, 2003 Page 24
30 Perl variable types #! / u s r / b i n / p e r l jbollen@cs.odu.edu # s c a l a r s $a= A l i s t of words d e l i m i t e d by s p a c e s ; $n = 0 ; # a l i s l i s t = ( item1, item2, item3 ) ; # f o r loop on l i s t i t e m s foreach $item l i s t ){ p r i n t $item. \ n ; } # s p l i t $a a c c o r d i n g t o l i s t 1 = s p l i t / \ s + /, $a ; foreach $item l i s t 1 ){ p r i n t $item. \ n ; } January 16, 2003 Page 24
31 Perl RE and associative arrays #! / u s r / b i n / p e r l $a= A l i s t of words d e l i m i t e d by s p a c e s l i s t 1 = s p l i t / \ s + /, $a ; # an a s s o c i a t i v e a r r a y ( marked %) %f r e q t a b l e = { } ; foreach $item l i s t 1 ){ # $item s p e c i f i e s key i n t o f r e q t a b l e # t h e n add 1 t o $ f r e q t a b l e { $item } $ f r e q t a b l e { $item }=1; } foreach $key ( keys % f r e q t a b l e ){ p r i n t $key. \ t. $ f r e q t a b l e { $key }. \ n ; } January 16, 2003 Page 24
32 Perl RE and associative arrays #! / u s r / b i n / p e r l # r e a d from STDIN while (<>){ # r e s u l t i s s t o r e d i n $ $ l i n e = $ ; chomp ( $ l i n e ) ; # s p l i t l i n e a t s p a c e l i s t = s p l i t / \ s + /, $ l i n e ; } foreach $item l i s t ){ $ f r e q t a b l e { $item }++; } foreach $key ( keys % f r e q t a b l e ){ p r i n t $ f r e q t a b l e { $key }. \ t. $key. \ n ; } January 16, 2003 Page 24
33 Perl: Reading and writing from and to files #! / u s r / b i n / p e r l p r i n t f i l e n a m e?\ n ; $ f i l e = <STDIN>; chomp ( $ f i l e ) ; open ( IN, < $ f i l e ) = < IN>; c l o s e ( IN ) ; $n =1; foreach $ l i n e ){ p r i n t $n. ). $ l i n e. \ n ; $n ++; } open ( OUT, > r e s u l t s ) ; f o r ( $ i = 0 ; $ i <10; $ i ++){ p r i n t OUT $ i. \ n ; } c l o s e ( OUT ) ; January 16, 2003 Page 24
34 UNIX tools 1. sort: alphabetically sorts a file 2. head: first n lines 3. tail: last n lines 5. wc: count number of lines and words in text 6. Pipe any of these tools: powerful combinations! 4. uniq: remove duplicates January 16, 2003 Page 25
35 Information Retrieval: problem statement 1. Rapid Growth in amount of information published (a) Moore s law (b) Papers published: (c) (d) Exponential growth (e) Doubling rate: years, 4-7% per annum for last 300 years (f) Scientists: exponential growth as well (g) Singularity approaching? (Vernor Vinge) Hyper-exponential growth 2. Finding a needle in a haystack (a) Stress inevitably needs to shift from mere storage to accomodating user information needs (b) Digitalization of scholarly communication is another important factor January 16, 2003 Page 26
36 Information Retrieval: origins 1. Library Science (a) Collection management (b) Indexes and organization (c) Rapid growth of volume and costs (d) Automation of indexes to automation of retrieval to recognition of user information needs 2. Information systems (a) Realization that computers can outperform humans in tedium of text indexing and analysis (b) Rapid growth of scholarly publication and need for supported analysis and synthesis January 16, 2003 Page 27
37 Information Retrieval: origins 1. First systems since 1950s (a) First applications of initial computer systems (b) Much of science developed in 1960 (c) Relatively slow growth in applications and data set since 1960s-1980s (d) Highly specific applications in following years 2. Then: Internet and World Wide Web (a) IR: a solution waiting for a problem (b) How about billion pages? (c) Hyperlinked, in a graphical markup language (HTML)? (d) Millions of users, hundreds of languages? (e) Mind the singularity! (f) Infinite user information needs? 3. Ideas, theories, systems surpassed by parallel developments 4. The future? January 16, 2003 Page 28
38 Relational models vs. Information Retrieval 1. DBs: Prevalent and best known approach (a) Oracle, MySQL, PostGreSQL, etc (b) DB tables (c) SQL queries (d) this is data retrieval 2. Information Retrieval? No. (a) no means to respond intelligently to user information needs (b) queries limited to Boolean queries (c) logical view of documents is very limited (d) approach is small and not very powerful subset of IR domain January 16, 2003 Page 29
39 Information Retrieval 1. Information Retrieval is about (a) representation (b) storage (c) organization (d) access 2. Mission: finding relevant information based on user information needs 3. Information vs. data retrieval (a) Data retrieval: i. Known location and nature ii. User query matches data objects exactly iii. Question of storage and retrieval (b) Information Retrieval: i. User information need can be specific to highly abstract/general ii. System to respond to such need: Find About problem iii. Relevancy of response is issue January 16, 2003 Page 30
40 Information Retrieval: common text substrate Framework 1. collection of text documents 2. User formulates text query (a) set of keyterms, words (b) indicates user information need 3. a subset of set of documents, or collections, is retrieved 4. user can accept or browse returbed results Basic concepts 1. Keyterm indexing 2. User task 3. Logical view of documents January 16, 2003 Page 31
41 Text representation January 16, 2003 Page 32
42 Index or keyterms 1. Means to represent semantics of document 2. Aboutness : what is the specific document about? (a) Define semantic layer over collection (b) Idea is that meaning of document can be represented by specific set of keyterms 3. Ties to library classification 4. Interpretation in IR is more general and natural 5. Issues: (a) Extraction: only works for text documents, unless human intervention (b) Languages: the world is not uni-lingual (c) Synonymy, homonymy (d) Jargon and user communities (e) Orthogonality January 16, 2003 Page 33
43 User task 1. User: formulates query which expresses information need 2. System: responds with answer set 3. User tasks: narrowness of interests (a) retrieval: specific information need (b) browsing: vague information need 4. Pull vs. push January 16, 2003 Page 34
44 Logical view 1. Description of content and semantics of document 2. Structure 3. Derivation of index terms: (a) removal of stopwords (b) stemming (c) identification of nouns 4. Pull vs. push January 16, 2003 Page 35
45 Retrieval Process 1. DB manager: (a) Text database (b) Logical view of document (c) Index 2. Retrieval process (a) user information need (b) query (c) query operations (d) retrieval January 16, 2003 Page 36
46 Information Retrieval is not a single application 1. Defined in functional terms (a) IR is defined as a process (b) different processes can lead to similar results (c) platonic ideals of relevancy (d) there does not exist a perfect system (e) IR toolbox 2. Many different models (a) Assumptions: text based, keyterms, (partial) matching, etc (b) Mechanisms: vector space, set-theory, probabilistic (c) Goals: push-pull, retrieval vs. recommendation, browsing 3. Let 10 6 flowers bloom! 4. Necessary to evaluate and compare results January 16, 2003 Page 37
47 An attempt to pigeonhole 1. Three basic classes re: retrieval (a) Boolean (Set theory) (b) Vector (c) Probabilistic 2. Expansion (a) Set theoretical: fuzzy sets, extended boolean) (b) Algebraic: Gen. Vector, LSI, NN (c) Probabilistic: Inference, belief netw. 3. Browsing: (a) Flat (b) Structure Guided (c) Hypertext, WWW January 16, 2003 Page 38
48 UserTask Retrieval Classic Model Boolean Fuzzy Ext Boolean Vect Gen. Vec LSI NN Prob Inf Netw Struct Models Non-overlap Lists Belief Netw Prox Nodes Browsing Flat Guided HT January 16, 2003 Page 38
49 Ad hoc, retrieval, filtering, browsing 1. Ad hoc: (a) Static collection (b) User formulates queries (c) User pull 2. Filtering (a) Collection changes (b) User doesn t? (c) Filter pushed information (d) User modeling (personal) 3. Browsing (a) Interactive environment of user push & pull (b) Hypertext: network structure + interface (c) User modeling and filtering: adaptive hypertext and hypermedia January 16, 2003 Page 39
50 Readings 1. Vannevar Bush (1945) As we may think. Atlantic Monthly, July. 2. Belkin (1992) Information Filtering and Information Retrieval. Communications of the ACM. 35(12). Download PDFs/URLs from web site: spring03_ir/readings. Presentations: 1. point-by-point overview minutes! 3. start with main message or problem statement 4. end by discussing how problem was solved 5. Be nice to your audience! January 16, 2003 Page 40
Extended IR Models. Johan Bollen Old Dominion University Department of Computer Science
Extended IR Models. Johan Bollen Old Dominion University Department of Computer Science jbollen@cs.odu.edu http://www.cs.odu.edu/ jbollen January 20, 2004 Page 1 UserTask Retrieval Classic Model Boolean
More informationFundamentals of Computational Science
Fundamentals of Computational Science Dr. Hyrum D. Carroll August 23, 2016 Introductions Each student: Name Undergraduate school & major Masters & major Previous research (if any) Why Computational Science
More informationExperimental Physics I & II "Junior Lab" Fall Spring 2008
MIT OpenCourseWare http://ocw.mit.edu 8.13-14 Experimental Physics I & II "Junior Lab" Fall 2007 - Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationHow to Make or Plot a Graph or Chart in Excel
This is a complete video tutorial on How to Make or Plot a Graph or Chart in Excel. To make complex chart like Gantt Chart, you have know the basic principles of making a chart. Though I have used Excel
More informationCSE 494/598 Lecture-4: Correlation Analysis. **Content adapted from last year s slides
CSE 494/598 Lecture-4: Correlation Analysis LYDIA MANIKONDA HT TP://WWW.PUBLIC.ASU.EDU/~LMANIKON / **Content adapted from last year s slides Announcements Project-1 Due: February 12 th 2016 Analysis report:
More informationSPRING 2014 Department of Physics & Astronomy, UGA PHYS 4202/6202 Electricity and Magnetism II (as of Jan. 07/2014)
SPRING 2014 Department of Physics & Astronomy, UGA PHYS 4202/6202 Electricity and Magnetism II (as of Jan. 07/2014) The course syllabus is a general plan for the course; deviations announced to the class
More informationLecture 5: Web Searching using the SVD
Lecture 5: Web Searching using the SVD Information Retrieval Over the last 2 years the number of internet users has grown exponentially with time; see Figure. Trying to extract information from this exponentially
More informationDigital Library Evaluation by Analysis of User Retrieval Patterns
Digital Library Evaluation by Analysis of User Retrieval Patterns Johan Bollen, Somasekhar Vemulappali and Weining Xu Old Dominion University Department of Computer Science jbollen@cs.odu.edu http://www.cs.odu.edu/
More informationCollaborative topic models: motivations cont
Collaborative topic models: motivations cont Two topics: machine learning social network analysis Two people: " boy Two articles: article A! girl article B Preferences: The boy likes A and B --- no problem.
More informationCS 6783 (Applied Algorithms) Lecture 3
CS 6783 (Applied Algorithms) Lecture 3 Antonina Kolokolova January 14, 2013 1 Representative problems: brief overview of the course In this lecture we will look at several problems which, although look
More information13 Searching the Web with the SVD
13 Searching the Web with the SVD 13.1 Information retrieval Over the last 20 years the number of internet users has grown exponentially with time; see Figure 1. Trying to extract information from this
More informationINFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from
INFO 4300 / CS4300 Information Retrieval slides adapted from Hinrich Schütze s, linked from http://informationretrieval.org/ IR 26/26: Feature Selection and Exam Overview Paul Ginsparg Cornell University,
More informationData. Notes. are required reading for the week. textbook reading and a few slides on data formats and data cleaning
CS 725/825 Information Visualization Fall 2017 Data Dr. Michele C. Weigle http://www.cs.odu.edu/~mweigle/cs725-f17/ Notes } We will not cover these slides in class, but they are required reading for the
More informationBASIC TECHNOLOGY Pre K starts and shuts down computer, monitor, and printer E E D D P P P P P P P P P P
BASIC TECHNOLOGY Pre K 1 2 3 4 5 6 7 8 9 10 11 12 starts and shuts down computer, monitor, and printer P P P P P P practices responsible use and care of technology devices P P P P P P opens and quits an
More informationCSE 311: Foundations of Computing I. Lecture 1: Propositional Logic
CSE 311: Foundations of Computing I Lecture 1: Propositional Logic About CSE 311 Some Perspective Computer Science and Engineering Programming CSE 14x Theory Hardware CSE 311 About the Course We will study
More information9 Searching the Internet with the SVD
9 Searching the Internet with the SVD 9.1 Information retrieval Over the last 20 years the number of internet users has grown exponentially with time; see Figure 1. Trying to extract information from this
More informationIntroduction. How to use this book. Linear algebra. Mathematica. Mathematica cells
Introduction How to use this book This guide is meant as a standard reference to definitions, examples, and Mathematica techniques for linear algebra. Complementary material can be found in the Help sections
More informationSampling for Frequent Itemset Mining The Power of PAC learning
Sampling for Frequent Itemset Mining The Power of PAC learning prof. dr Arno Siebes Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht The Papers Today
More informationTheory versus Experiment: Analysis and Measurements of Allocation Costs in Sorting (With Hints for Applying Similar Techniques to Garbage Collection)
Theory versus Experiment: Analysis and Measurements of Allocation Costs in Sorting (With Hints for Applying Similar Techniques to Garbage Collection) CS 152 Staff February, 2006 Introduction This handout
More informationPhysics 750: Exercise 2 Tuesday, August 29, 2017
Physics 750: Exercise 2 Tuesday, August 29, 207 Use the curl command to download from the class website everything you ll need for this exercise $ WEBPATH=http://wwwphyolemissedu/~kbeach/ $ curl $WEBPATH/courses/fall207/phys750/src/exercise2tgz
More informationPower Laws & Rich Get Richer
Power Laws & Rich Get Richer CMSC 498J: Social Media Computing Department of Computer Science University of Maryland Spring 2015 Hadi Amiri hadi@umd.edu Lecture Topics Popularity as a Network Phenomenon
More informationSnowden Cartography 1 GEOG 315:001 Cartography Thursdays 4:00 6:30PM F375 Fall 2010 Dr. Snowden Course Description
Snowden Cartography 1 www.drksnowden.com GEOG 315:001 Cartography Thursdays 4:00 6:30PM F375 Fall 2010 Dr. Snowden Course Description Principles and theory of basic map design, layout, and communication.
More informationComplex Numbers For High School Students
Complex Numbers For High School Students For the Love of Mathematics and Computing Saturday, October 14, 2017 Presented by: Rich Dlin Presented by: Rich Dlin Complex Numbers For High School Students 1
More informationCS246 Final Exam, Winter 2011
CS246 Final Exam, Winter 2011 1. Your name and student ID. Name:... Student ID:... 2. I agree to comply with Stanford Honor Code. Signature:... 3. There should be 17 numbered pages in this exam (including
More informationGIS FOR PLANNING. Course Overview. Schedule. Instructor. Prerequisites. Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158
GIS FOR PLANNING Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158 Schedule Class/Lab - SARUP 158 Thursdays 5:30pm - 8:10pm Office Hours - By Appointment Project Ideas - Week 4 Final - 5/10/2018 Instructor
More informationToday s Menu. Administrativia Two Problems Cutting a Pizza Lighting Rooms
Welcome! L01 Today s Menu Administrativia Two Problems Cutting a Pizza Lighting Rooms Administrativia Course page: https://www.cs.duke.edu/courses/spring13/compsci230/ Who we are: Instructor: TA: UTAs:
More informationBig Bang, Black Holes, No Math
ASTR/PHYS 109 Dr. David Toback Lecture 5 1 Prep For Today (is now due) L5 Reading: No new reading Unit 2 reading assigned at the end of class Pre-Lecture Reading Questions: Unit 1: Grades have been posted
More informationCSCE 561 Information Retrieval System Models
CSCE 561 Information Retrieval System Models Satya Katragadda 26 August 2015 Agenda Introduction to Information Retrieval Inverted Index IR System Models Boolean Retrieval Model 2 Introduction Information
More informationLaTeX in a nutshell. Master on Libre Software Miguel Vidal. September 24, GSyC/Libresoft URJC
Master on Libre Software 2010 mvidal@gsyc.urjc.es GSyC/Libresoft URJC September 24, 2010 (cc) 2010. Some rights reserved. This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License,
More informationClass Note #20. In today s class, the following four concepts were introduced: decision
Class Note #20 Date: 03/29/2006 [Overall Information] In today s class, the following four concepts were introduced: decision version of a problem, formal language, P and NP. We also discussed the relationship
More informationCS Data Structures and Algorithm Analysis
CS 483 - Data Structures and Algorithm Analysis Lecture II: Chapter 2 R. Paul Wiegand George Mason University, Department of Computer Science February 1, 2006 Outline 1 Analysis Framework 2 Asymptotic
More informationComputer Science 385 Analysis of Algorithms Siena College Spring Topic Notes: Limitations of Algorithms
Computer Science 385 Analysis of Algorithms Siena College Spring 2011 Topic Notes: Limitations of Algorithms We conclude with a discussion of the limitations of the power of algorithms. That is, what kinds
More informationGalactic Census: Population of the Galaxy grades 9 12
Galactic Census: Population of the Galaxy grades 9 12 Objective Introduce students to a range of celestial objects that populate the galaxy, having them calculate estimates of how common each object is
More informationEntropy-based data organization tricks for browsing logs and packet captures
Entropy-based data organization tricks for browsing logs and packet captures Department of Computer Science Dartmouth College Outline 1 Log browsing moves Pipes and tables Trees are better than pipes and
More informationIntroduction To Spectroscopy 4th Edition Solution Manual
We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with introduction to spectroscopy
More informationÁkos Tarcsay CHEMAXON SOLUTIONS
Ákos Tarcsay CHEMAXON SOLUTIONS FINDING NOVEL COMPOUNDS WITH IMPROVED OVERALL PROPERTY PROFILES Two faces of one world Structure Footprint Linked Data Reactions Analytical Batch Phys-Chem Assay Project
More informationCOLLEGE OF THE DESERT
COLLEGE OF THE DESERT Course Code MATH-010 Course Outline of Record 1. Course Code: MATH-010 2. a. Long Course Title: College Algebra b. Short Course Title: COLLEGE ALGEBRA 3. a. Catalog Course Description:
More informationAt the start of the term, we saw the following formula for computing the sum of the first n integers:
Chapter 11 Induction This chapter covers mathematical induction. 11.1 Introduction to induction At the start of the term, we saw the following formula for computing the sum of the first n integers: Claim
More informationCAS GE 365 Introduction to Geographical Information Systems. The Applications of GIS are endless
Spring 2007 CAS GE 365 Introduction to Geographical Information Systems Boston University Department of Geography and Environment The Applications of GIS are endless images from www.esri.com CAS GE 365
More informationA GUI FOR EVOLVE ZAMS
A GUI FOR EVOLVE ZAMS D. R. Schlegel Computer Science Department Here the early work on a new user interface for the Evolve ZAMS stellar evolution code is presented. The initial goal of this project is
More informationHow to make a presentation with L A TEX?
How to make a presentation with L A TEX? Department of computer science King Saud University November 23, 2015 1 / 49 Contents 1 to L A TEX 2 2 / 49 to L A TEX L A TEX( pronounced /"la:tex/, /"la:tek/,
More informationPhysics 18, Introductory Physics I for Biological Sciences Spring 2010
Physics 18 page 1/6 Physics 18, Introductory Physics I for Biological Sciences Spring 2010 - Course Description - Instructor: Dr. Derrick Kiley Office: AOB 176; Office Phone 209 228-3076 E-mail Address:
More informationSciFinder Scholar Guide to Getting Started
SciFinder Scholar Guide to Getting Started What is SciFinder Scholar? SciFinder Scholar is the online version of Chemical Abstracts, covering chemistry publications worldwide from 1907 to the present.
More informationPhysics 9, Introductory Physics II Spring 2010
Physics 9 page 1/6 Physics 9, Introductory Physics II Spring 2010 - Course Description - Instructor: Dr. Derrick Kiley Office: AOB 176; Office Phone 209 228-3076 E-mail Address: dkiley@ucmerced.edu. Course
More informationCMPT 310 Artificial Intelligence Survey. Simon Fraser University Summer Instructor: Oliver Schulte
CMPT 310 Artificial Intelligence Survey Simon Fraser University Summer 2017 Instructor: Oliver Schulte Assignment 3: Chapters 13, 14, 18, 20. Probabilistic Reasoning and Learning Instructions: The university
More informationComputational Cognitive Science
Computational Cognitive Science Lecture 9: A Bayesian model of concept learning Chris Lucas School of Informatics University of Edinburgh October 16, 218 Reading Rules and Similarity in Concept Learning
More informationBassam Raychouni Pre-Algebra. Course Title: Head of Department:
Course Title: Head of Department:, Bassam Raychouni (bassam@greenwood.sh.ae) Teacher(s) + e-mail: Mireille Chammas (Mireille.c@greenwood.sch.ae) Cycle/Division: Middle school Grade Level: 8 Credit Unit:
More informationComp 11 Lectures. Mike Shah. July 26, Tufts University. Mike Shah (Tufts University) Comp 11 Lectures July 26, / 45
Comp 11 Lectures Mike Shah Tufts University July 26, 2017 Mike Shah (Tufts University) Comp 11 Lectures July 26, 2017 1 / 45 Please do not distribute or host these slides without prior permission. Mike
More informationSUN SIGNS & PAST LIVES: YOUR SOUL'S EVOLUTIONARY PATH BY BERNIE ASHMAN
SUN SIGNS & PAST LIVES: YOUR SOUL'S EVOLUTIONARY PATH BY BERNIE ASHMAN DOWNLOAD EBOOK : SUN SIGNS & PAST LIVES: YOUR SOUL'S EVOLUTIONARY Click link bellow and free register to download ebook: SUN SIGNS
More informationMachine Learning: Homework 5
0-60 Machine Learning: Homework 5 Due 5:0 p.m. Thursday, March, 06 TAs: Travis Dick and Han Zhao Instructions Late homework policy: Homework is worth full credit if submitted before the due date, half
More informationhttps://u.osu.edu/miller.81/ Texts 1. Required Worboys, M. and Duckham, M. (2004) GIS: A Computing Perspective. Other readings see course schedule.
GEOGR 5212 Geospatial Databases for GIS - Spring Instructor Harvey J. Miller Lecture/lab meeting time Tuesdays 3:55PM - 5:15PM, Denney Hall 238 and location (classroom) Thursdays 3:55PM - 5:15PM Derby
More informationCSEP 521 Applied Algorithms. Richard Anderson Winter 2013 Lecture 1
CSEP 521 Applied Algorithms Richard Anderson Winter 2013 Lecture 1 CSEP 521 Course Introduction CSEP 521, Applied Algorithms Monday s, 6:30-9:20 pm CSE 305 and Microsoft Building 99 Instructor Richard
More informationCS 188: Artificial Intelligence Spring Today
CS 188: Artificial Intelligence Spring 2006 Lecture 9: Naïve Bayes 2/14/2006 Dan Klein UC Berkeley Many slides from either Stuart Russell or Andrew Moore Bayes rule Today Expectations and utilities Naïve
More informationboolean queries Inverted index query processing Query optimization boolean model January 15, / 35
boolean model January 15, 2017 1 / 35 Outline 1 boolean queries 2 3 4 2 / 35 taxonomy of IR models Set theoretic fuzzy extended boolean set-based IR models Boolean vector probalistic algebraic generalized
More informationCMPSCI 240: Reasoning about Uncertainty
CMPSCI 240: Reasoning about Uncertainty Lecture 1: Introduction Andrew McGregor University of Massachusetts Last Compiled: January 24, 2017 Outline 1 Introduction 2 Course Logistics 3 Sets and Elements
More informationCONCEPTS OF GENETICS BY ROBERT BROOKER DOWNLOAD EBOOK : CONCEPTS OF GENETICS BY ROBERT BROOKER PDF
CONCEPTS OF GENETICS BY ROBERT BROOKER DOWNLOAD EBOOK : CONCEPTS OF GENETICS BY ROBERT BROOKER PDF Click link bellow and free register to download ebook: CONCEPTS OF GENETICS BY ROBERT BROOKER DOWNLOAD
More informationHIRES 2017 Syllabus. Instructors:
HIRES 2017 Syllabus Instructors: Dr. Brian Vant-Hull: Steinman 185, 212-650-8514, brianvh@ce.ccny.cuny.edu Ms. Hannah Aizenman: NAC 7/311, 212-650-6295, haizenman@ccny.cuny.edu Dr. Tarendra Lakhankar:
More informationRetrieval by Content. Part 2: Text Retrieval Term Frequency and Inverse Document Frequency. Srihari: CSE 626 1
Retrieval by Content Part 2: Text Retrieval Term Frequency and Inverse Document Frequency Srihari: CSE 626 1 Text Retrieval Retrieval of text-based information is referred to as Information Retrieval (IR)
More informationEVERYTHING YOU NEED TO KNOW ABOUT THE SCORPIO ZODIAC SIGN - ASTROLOGY, COMPATIBILITY, LOVE, TRAITS AND PERSONALITY (EVERYTHING YOU NEED TO
Read Online and Download Ebook EVERYTHING YOU NEED TO KNOW ABOUT THE SCORPIO ZODIAC SIGN - ASTROLOGY, COMPATIBILITY, LOVE, TRAITS AND PERSONALITY (EVERYTHING YOU NEED TO DOWNLOAD EBOOK : EVERYTHING YOU
More informationChemistry 883 Computational Quantum Chemistry
Chemistry 883 Computational Quantum Chemistry Instructor Contact Information Professor Benjamin G. Levine levine@chemistry.msu.edu 215 Chemistry Building 517-353-1113 Office Hours Tuesday 9:00-11:00 am
More informationGTECH 380/722 Analytical and Computer Cartography Hunter College, CUNY Department of Geography
GTECH 380/722 Analytical and Computer Cartography Hunter College, CUNY Department of Geography Fall 2014 Mondays 5:35PM to 9:15PM Instructor: Doug Williamson, PhD Email: Douglas.Williamson@hunter.cuny.edu
More information1.1 Administrative Stuff
601.433 / 601.633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Introduction, Karatsuba/Strassen Date: 9/4/18 1.1 Administrative Stuff Welcome to Algorithms! In this class you will learn the
More informationTHE IGOS APPRENTICE HANDBOOK: ACTIVATING THE INNER MAGICAL BEING BY DR THOR TEMPLAR
THE IGOS APPRENTICE HANDBOOK: ACTIVATING THE INNER MAGICAL BEING BY DR THOR TEMPLAR DOWNLOAD EBOOK : THE IGOS APPRENTICE HANDBOOK: ACTIVATING THE Click link bellow and free register to download ebook:
More informationUsing Microsoft Excel
Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.
More informationInorganic Chemistry By James R. Bowser
Inorganic Chemistry By James R. Bowser Inorganic Chemistry By James R. Bowser Designed to show students what it is like to be an inorganic chemist today, this book reflects the discipline, the mix of theory
More informationWho am I? Math 1080: Numerical Linear Algebra. Books. Format
Who am I? Math 18: Numerical Linear Algebra M M Sussman sussmanm@mathpittedu Office Hours: MW 1:45PM-2:45PM, Thack 622 Part-time faculty in Math Dept Experience at Bettis lab Administer 27/271 Numerical
More informationMA 580; Numerical Analysis I
MA 580; Numerical Analysis I C. T. Kelley NC State University tim kelley@ncsu.edu Version of October 23, 2016 NCSU, Fall 2016 c C. T. Kelley, I. C. F. Ipsen, 2016 MA 580, Fall 2016 1 / 43 Contents 1 Introduction
More informationWireless Network Security Spring 2016
Wireless Network Security Spring 2016 Patrick Tague Class #3 Project Discussion; OMNET++ Intro 2016 Patrick Tague 1 Waitlists If you're currently registered for this class, but not planning to stay: please
More informationEssential facts about NP-completeness:
CMPSCI611: NP Completeness Lecture 17 Essential facts about NP-completeness: Any NP-complete problem can be solved by a simple, but exponentially slow algorithm. We don t have polynomial-time solutions
More information1. Ignoring case, extract all unique words from the entire set of documents.
CS 378 Introduction to Data Mining Spring 29 Lecture 2 Lecturer: Inderjit Dhillon Date: Jan. 27th, 29 Keywords: Vector space model, Latent Semantic Indexing(LSI), SVD 1 Vector Space Model The basic idea
More informationDescriptive Inorganic, Coordination, And Solid State Chemistry PDF
Descriptive Inorganic, Coordination, And Solid State Chemistry PDF This proven book introduces the basics of coordination, solid-state, and descriptive main-group chemistry in a uniquely accessible manner,
More informationHow to write maths (well)
How to write maths (well) Dr Euan Spence 29 September 2017 These are the slides from a talk I gave to the new first-year students at Bath, annotated with some of the things I said (which appear in boxes
More informationGTECH 380/722 Analytical and Computer Cartography Hunter College, CUNY Department of Geography
GTECH 380/722 Analytical and Computer Cartography Hunter College, CUNY Department of Geography Spring 2010 Wednesdays 5:35PM to 9:15PM Instructor: Doug Williamson, PhD Email: Douglas.Williamson@hunter.cuny.edu
More informationBoolean and Vector Space Retrieval Models CS 290N Some of slides from R. Mooney (UTexas), J. Ghosh (UT ECE), D. Lee (USTHK).
Boolean and Vector Space Retrieval Models 2013 CS 290N Some of slides from R. Mooney (UTexas), J. Ghosh (UT ECE), D. Lee (USTHK). 1 Table of Content Boolean model Statistical vector space model Retrieval
More informationICS141: Discrete Mathematics for Computer Science I
ICS141: Discrete Mathematics for Computer Science I Dept. Information & Computer Sci., Originals slides by Dr. Baek and Dr. Still, adapted by J. Stelovsky Based on slides Dr. M. P. Frank and Dr. J.L. Gross
More informationCollaborative Topic Modeling for Recommending Scientific Articles
Collaborative Topic Modeling for Recommending Scientific Articles Chong Wang and David M. Blei Best student paper award at KDD 2011 Computer Science Department, Princeton University Presented by Tian Cao
More informationCS103 Handout 08 Spring 2012 April 20, 2012 Problem Set 3
CS103 Handout 08 Spring 2012 April 20, 2012 Problem Set 3 This third problem set explores graphs, relations, functions, cardinalities, and the pigeonhole principle. This should be a great way to get a
More informationData Mining Recitation Notes Week 3
Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents
More informationCS155: Probability and Computing: Randomized Algorithms and Probabilistic Analysis
CS155: Probability and Computing: Randomized Algorithms and Probabilistic Analysis Eli Upfal Eli Upfal@brown.edu Office: 319 TA s: Lorenzo De Stefani and Sorin Vatasoiu cs155tas@cs.brown.edu It is remarkable
More informationDiscrete Structures Proofwriting Checklist
CS103 Winter 2019 Discrete Structures Proofwriting Checklist Cynthia Lee Keith Schwarz Now that we re transitioning to writing proofs about discrete structures like binary relations, functions, and graphs,
More informationMathematical notation and writing
Mathematical notation and writing 19 March 2013 Mathematical notation and writing Mathematical Writing Writing is difficult. Writing Mathematics is very difficult. Keep your prose simple and direct. Use
More informationText and multimedia languages and properties
Text and multimedia languages and properties (Modern IR, Ch. 6) I VP R 1 Introduction Metadata Text Markup Languages Multimedia Trends and Research Issues Bibliographical Discussion I VP R 2 Introduction
More informationCS 188: Artificial Intelligence Spring Announcements
CS 188: Artificial Intelligence Spring 2011 Lecture 18: HMMs and Particle Filtering 4/4/2011 Pieter Abbeel --- UC Berkeley Many slides over this course adapted from Dan Klein, Stuart Russell, Andrew Moore
More informationDRAFT. Algebra I Honors Course Syllabus
Algebra I Honors 1200320 2011-2012 Course Syllabus Teacher Information COMPLETED BY INDIVIDUAL TEACHER Name: Phone: Email: Blog and/or Website: Credit: 1.0 Student must take the State of Florida's Algebra
More informationCareer and College Readiness in Terms of Next Generation Science Standards (NGSS)
Career and College Readiness in Terms of Next Generation Science Standards (NGSS) Michigan An NGSS Lead State Partner Next Generation Science Standards for Today s Students and Tomorrow s Workforce Building
More informationCS 188: Artificial Intelligence Fall 2008
CS 188: Artificial Intelligence Fall 2008 Lecture 14: Bayes Nets 10/14/2008 Dan Klein UC Berkeley 1 1 Announcements Midterm 10/21! One page note sheet Review sessions Friday and Sunday (similar) OHs on
More informationComputational Biology Course Descriptions 12-14
Computational Biology Course Descriptions 12-14 Course Number and Title INTRODUCTORY COURSES BIO 311C: Introductory Biology I BIO 311D: Introductory Biology II BIO 325: Genetics CH 301: Principles of Chemistry
More information30. TRANSFORMING TOOL #1 (the Addition Property of Equality)
30 TRANSFORMING TOOL #1 (the Addition Property of Equality) sentences that look different, but always have the same truth values What can you DO to a sentence that will make it LOOK different, but not
More informationRecent Advances in Bayesian Inference Techniques
Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian
More informationCOMS 6100 Class Notes
COMS 6100 Class Notes Daniel Solus November 1, 2016 1 General Remarks We will start with a small overview of the material from the end of last class. Since we were interrupted by the fire alarm. 2 Review
More informationComputer Science and Logic A Match Made in Heaven
A Match Made in Heaven Luca Aceto Reykjavik University Reykjavik, 3 April 2009 Thanks to Moshe Vardi from whom I have drawn inspiration (read stolen ideas ) for this presentation. Why This Talk Today?
More informationAP Chemistry Syllabus
AP Chemistry Syllabus Classroom number: S202 Instructor: Mr. Krouba E-mail: patrick.krouba@douglas.k12.ga.us Website: http://lithiasprings.dce.schoolinsites.com/ https://sites.google.com/site/mrkroubaschems/home
More informationGENERAL, ORGANIC, AND BIOLOGICAL CHEMISTRY: STRUCTURES OF LIFE (4TH EDITION) BY KAREN C. TIMBERLAKE
Read Online and Download Ebook GENERAL, ORGANIC, AND BIOLOGICAL CHEMISTRY: STRUCTURES OF LIFE (4TH EDITION) BY KAREN C. TIMBERLAKE DOWNLOAD EBOOK : GENERAL, ORGANIC, AND BIOLOGICAL CHEMISTRY: STRUCTURES
More informationWelcome to Chemistry 376
CHM 376 Spring 2014 Welcome to Chemistry 376 This course will give you a chance to explore the experimental world of physical chemistry. In each of six experiments you will study how simple physical models,
More informationQuantum Physics III (8.06) Spring Term Paper
1 Project Summary Everyone in 8.06 will be expected to research, write and publish a short paper on a topic related to the content of 8.05 or 8.06. The paper can explain a physical effect or further explicate
More informationLECSS Physics 11 Introduction to Physics and Math Methods 1 Revised 8 September 2013 Don Bloomfield
LECSS Physics 11 Introduction to Physics and Math Methods 1 Physics 11 Introduction to Physics and Math Methods In this introduction, you will get a more in-depth overview of what Physics is, as well as
More informationCS 350 Algorithms and Complexity
CS 350 Algorithms and Complexity Winter 2019 Lecture 15: Limitations of Algorithmic Power Introduction to complexity theory Andrew P. Black Department of Computer Science Portland State University Lower
More informationCS 188: Artificial Intelligence Spring 2009
CS 188: Artificial Intelligence Spring 2009 Lecture 21: Hidden Markov Models 4/7/2009 John DeNero UC Berkeley Slides adapted from Dan Klein Announcements Written 3 deadline extended! Posted last Friday
More informationCS 350 Algorithms and Complexity
1 CS 350 Algorithms and Complexity Fall 2015 Lecture 15: Limitations of Algorithmic Power Introduction to complexity theory Andrew P. Black Department of Computer Science Portland State University Lower
More informationUrban Planning Internet Geographic Information Systems Fall 2010
Urban Planning - 794 Internet Geographic Information Systems Fall 2010 Instructor: Professor Huxhold (hux@uwm.edu) Lecturer: Melissa Mann (mmann@uwm.edu) Manager: Kurt Meingast (kurtm@uwm.edu) Schedule:
More information