Unied Presentation Contents Retrieval using Voice Information
|
|
- Harriet Flynn
- 5 years ago
- Views:
Transcription
1 DEWS2006 6c-o1,, UPRISE (Unified Presentation slide Retrieval by Impression Search Engine) UPRISE UPRISE 4 e-learning Unied Presentation Contents Retrieval using Voice Information Hiroaki OKAMOTO, Wataru NAKANO, Takashi KOBAYASHI, Satoshi NAOI,, Haruo YOKOTA,, Koji IWANO, and Sadaoki FURUI Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institute of Technology Global Scientic Information & Computing Center, Tokyo Institute of Technology FUJITSU LABORATORIES LTD {okamoto,wnakano}@decstitechacjp, tkobaya@gsictitechacjp,, naoisatoshi@jpfujitsucom, {yokota,iwano,furui}@cstitechacjp Abstract We have proposed the UPRISE (Unified Presentation slide Retrieval by Impression Search Engine) to store lecture videos and presentation slides used in the lectures It provides dedicated scene search functions for the metadata-based unified presentation contents, using the slide structure, scene duration, and context information In this paper, we use the voice information in the lecture videos to improve the precision of scene retrieval using the speech recognition technique We propose four ways to add the influence of voice to the scene search functions of the UPRISE, and evaluate the effect of them The experimental results indicate that the voice information is effective to improve the precision Key words information integration, information retrieval, e-learning 1 [1] [3] e-learning e-learning
2 UPRISE(Unied Presentation slide Retrieval by Impression Search Engine) [4] [11] UPRISE / "!$#%! 1 3 UPRISE UPRISE UPRISE 2 1 UPRISE UPRISE 1 ( CSJ) [12], [13] UPRISE [11] [14] UPRISE [15] [7] UPRISE UPRISE [16], [17] [6] 2 2 UPRISE I p I p UPRISE L I p (s, k) = P(s, l) C(s, k, l) 2 UPRISE UPRISE l=1 2 2 s k l P(s, l) &('*)
3 s l C(s, k, l) s l [11] k P(s, l) I d I d I p ( ) θ T(s) I d (s, k, θ, u) = I p (s, k) u UPRISE T(s) s θ u I c I c I d 3 1 I c (s, k, θ, u, δ, ε 1, ε 2 ) = s+δ γ=s δ E(γ s, ε 1, ε 2 ) I d (s, k, θ, u) δ E(γ s, ε 1, ε 2 ) E(γ s, ε 1, ε 2 ) exp(ε 1 x) (x < 0) E(x, ε 1, ε 2 ) = [19], [20] exp( ε 2 x) (x > = 0) [21] δ [21] (CSJ) [12], [13] ε δ = 4, ε 1 = 50, ε 2 = 05 4 CSJ 967 ( 300 ) UPRISE I p I d I c ipadic 3 ( ) ( 300 ) 22,860 [9] [10] [9], [10] ( 90 ) 3 UPRISE Julius 1 [18] Julius 2 2 UPRISE, [21] 3 2 Julius ( ) ( ) XML forgejp/ 2 UPIRISE 3
4 XML XML fragment fragment ( ) string voice I p 0 voice 1 1 voice string in ( ) constimemilli I c ( ) constimesec ( ) 0 cmscore cmscore IDF [22] 2 XML IVF I vas (VA if keyword 3 2 is in the Scene) I vas (s, k, δ, θ, u, ε 1, ε 2, α) s k I c (s, k, δ, θ, u, ε 1, ε 2 ) + α ( ) T(s) θ u VA(s, k) (Ip 0) ( VA(s, k)(voice Appearance) VA T(s) ) θ = I c (s, k, δ, θ, u, ε 1, ε 2 ) (I p = 0) u α α = 6 I c 1 1 α = 0 I c VC(Voice I vas and Context) I c 0 s+δ ( ) θ T(s) I vac (VA if keyword is in Contexts) VC(s, k, θ, u, δ, ε 1, ε 2 ) = E(γ s, ε 1, ε 2 ) VA(s, k) u γ=s δ IVF(Inverse Voice Frequency) IVF(k) = log k I vac (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + α ( ) T(s) θ u VA(s, k) (Ic 0) = 0 (I c = 0) I vac ( (VC) ) I vcs
5 I vcs (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + αvc(s, k, δ, θ, u, ε 1, ε 2 ) (I p 0) = I c (s, k, δ, θ, u, ε 1, ε 2 ) (I p = 0) ( ) 254% (I c ) 0, (VC) I vcc,, I vcc (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + αvc(s, k, δ, θ, u, ε 1, ε 2 ) (I c 0) = 0 (I c = 0) 3 3 IVF [9] I vas I vac I vcs I vcc IS FP IVF IS FC IS FC IVF IS FC(k) = log k θ = 05 u = 60 δ = 4 ε 1 = 50 ε 2 = 05 4 I c 124 I x IS FC IVF x vas vac vcs vcc I p I c I c I x IS FC IVF (s, k, δ, θ, u, ε 1, ε 2, α) I c IS FC(k) + αi x IVF(k) = I c IS FC(k) 4 (I p 0 I c 0) (I p = 0 I c = 0) IS FC IVF UPRISE IS FC α % 267% I vas I vac I vcs I vcc α UPRISE ( 11 ) ( 12 ) [21] , (recall) (pre ,036 cision) [23] 1 ( ) 1
6 %& ' $ +, - *!#" ')(! "#!$ "#%$& "#%$&$ 3 4 α (5 ) α (1 ) UPRISE 0 1 α = I p 1 N = 1 N 1 N i i= α α α I vas I vac I vcs I vcc 0=I c I c 0560(α = 0 ) I c I vas 0035 I vac I vcs 0059 I vcc 005 I vcs α I vac α = 9, 10 2 ' ' ' ' ' I c ' 3 α α ' ' I c 14 I vcs α α I c 5 2 (I vcs ) α = 0 α = 10 α = 100 α =
7 3 I vcs I vcs IS FC I vcs IS FC IVF α = α = α = α = α = ' ' I vcs IS FC I p 4 (α = 5) I vcs I vcs IS FC IVF I vcs IS FC ' ' ( ) ( ) α α = 5 α = I vcs ' ' 4 ' ' ' ' ' 7 ' 3 α = 5 70 ' ' ' ' I vcs IS FC IVF I vcs IS FC IVF I vcs IS FC IVF IS FC 2 I vcs IS FC 3 α = 0( ) 0 ( IS FC ) IS FC I vcs IS FC α = 5 I vcs α ' ' I vcs α = 20 I vcs IS FC IVF 0623 ' ' ' ' I vcs IS FC IVF I vcs IS FC I vcs α = 15 I c I vcs IS FC IVF 0063(63%) 0623
8 K "! # $ % & "' (*) +-,& 6!#"%$'&(!)*$ +-, 0/ ":$'&;<=)*$ +?>0@BA!#"%$'&E+->2@BA GF!>0@HA CBDJI CBD*3 7 ( ) CBD* ( ) [11] 3 3 IDF [22] 1 IVF UPRISE UPRISE I 4 2 c ( 0560) I I vcs vcs 0059(59%) 0619 α ( I vcs IS FC I vcs IS FC IVF ) α 1 I 5 1 c ( 0560) 0063(63%) 0623 I vcs 6 ( ) α = α = 3
9 In Proc of WIRI2005, pp 47 52, April 2005 [10] Wataru Nakano, Yuta Ochi, Takashi Kobayashi, Yutaka Katsuyama, Satoshi Naoi, and Haruo Yokota Unied Presentation Contents Retrieval Using Laser Pointer Information In Proc of SWOD, pp , April 2005 [11],,,, 2005-dbs-137 (ii) (78), [24] pp ,, July 2005 ( ) [12] gojp/ csj/public/ [25] [13] Kikuo Maekawa, Hanae Koiso, Sadaoki Furui, and Hitoshi Isahara Spontaneous speech corpus of japanese In Proc LREC2000, Vol 2, pp , Athens, Greece, May 2000 [14] S Johnson, P Jourlin, G Moore, K S Jones, and PWoodland The 4 3 Cambridge University spoken document retrieval system In Proceedings of ICASSP'99, pp 49 52, [15],, 8, March 2002 [16],, Technical Report 2005-SLP ,, February 2005 [17],, e-learning, March 2004 [18], Julius 1, Vol 20, No 1, pp 41 49, 2005 [19],,, SP ,, Dec 2000 [20] Takahiro Shinozaki, Chiori Hori, and Sadaoki Furui Towards automatic transcription of spontaneous presentations In Proc Eu- rospeech2001, Vol 1, pp , Aalborg, Denmark, Sep 2001 Julius / [21], February 2005 [22] G Salton Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer Addison-Wesley, 1988 [23] D A Grossman and O Frieder Information Retrieval Algorithm and ( , ) Heuristics Kluwer, 1998 CREST 21 [24] A Singhal and F Pereria Document expansion for speech retrieval COE In Proc of ACM SIGIR 99, pp 34 41, 1999 [25] S Srinivasan and D Petkovic Phonetic confusion matrix based spoken document retrieval In Proc of ACM SIGIR2000, pp 81 87, 2000 [1] R Müller and T Ottmann The Authoring on the Fly system for automated recording and replay of (tele)presentations Multimedia Syst, Vol 8, No 3, pp , 2000 [2] Carnegie Mellon University The Informedia Project Informedia ii digital video library [3] G D Abowd Classroom 2000: an experiment with the instrumentation of a living edu cational environment IBM Syst J, Vol 38, No 4, pp , 1999 [4] DBS ,, July 2001 [5],,,, In Proc of DBWeb2002, pp , 2002 [6] Haruo Yokota, Takashi Kobayashi, Taichi Muraki, and Satoshi Naoi UPRISE: Unied Presentation Slide Retrieval by Impression Search Engine IEICE Transactions on Information and Systems, Vol E87- D, No 2, pp , February 2004 [7],,,, Vol J88-D-I, No 3, pp , March 2005 [8],,, pp DEWS B 3 DE, March 2004 [9] Hiroaki Okamoto, Takashi Kobayashi, and Haruo Yokota Presentation Retrieval Method Considering the Scope of Targets and Outputs
Considering Granularity of Laser Pointer and Speech in Information Integration for Lecture Scene Retrieval
DEWS2007 E1-3,, 152 8552 2 12 1 152 8550 2 12 1 211 8588 4 1 1 E-mail: wnakano@de.cs.titech.ac.jp, tkobaya@gsic.titech.ac.jp,, naoi.satoshi@jp.fujitsu.com, {yokota,furui}@cs.titech.ac.jp UPRISE e-learning
More informationN-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition
2010 11 5 N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 1 48-106413 Abstract Large-Vocabulary Continuous Speech Recognition(LVCSR) system has rapidly been growing today.
More informationAn Algorithm for Fast Calculation of Back-off N-gram Probabilities with Unigram Rescaling
An Algorithm for Fast Calculation of Back-off N-gram Probabilities with Unigram Rescaling Masaharu Kato, Tetsuo Kosaka, Akinori Ito and Shozo Makino Abstract Topic-based stochastic models such as the probabilistic
More informationPivoted Length Normalization I. Summary idf II. Review
2 Feb 2006 1/11 COM S/INFO 630: Representing and Accessing [Textual] Digital Information Lecturer: Lillian Lee Lecture 3: 2 February 2006 Scribes: Siavash Dejgosha (sd82) and Ricardo Hu (rh238) Pivoted
More informationSelecting Good Speech Features for Recognition
ETRI Journal, volume 18, number 1, April 1996 29 Selecting Good Speech Features for Recognition Youngjik Lee and Kyu-Woong Hwang CONTENTS I. INTRODUCTION II. COMPARISON OF MEASURES III. ANALYSIS OF SPEECH
More informationBobby Hunt, Mariappan S. Nadar, Paul Keller, Eric VonColln, and Anupam Goyal III. ASSOCIATIVE RECALL BY A POLYNOMIAL MAPPING
Synthesis of a Nonrecurrent Associative Memory Model Based on a Nonlinear Transformation in the Spectral Domain p. 1 Bobby Hunt, Mariappan S. Nadar, Paul Keller, Eric VonColln, Anupam Goyal Abstract -
More informationEffectiveness of complex index terms in information retrieval
Effectiveness of complex index terms in information retrieval Tokunaga Takenobu, Ogibayasi Hironori and Tanaka Hozumi Department of Computer Science Tokyo Institute of Technology Abstract This paper explores
More informationShankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms
Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology, May 2004. Shankar
More informationFull-covariance model compensation for
compensation transms Presentation Toshiba, 12 Mar 2008 Outline compensation transms compensation transms Outline compensation transms compensation transms Noise model x clean speech; n additive ; h convolutional
More informationAutoregressive Neural Models for Statistical Parametric Speech Synthesis
Autoregressive Neural Models for Statistical Parametric Speech Synthesis シンワン Xin WANG 2018-01-11 contact: wangxin@nii.ac.jp we welcome critical comments, suggestions, and discussion 1 https://www.slideshare.net/kotarotanahashi/deep-learning-library-coyotecnn
More informationLanguage Models, Smoothing, and IDF Weighting
Language Models, Smoothing, and IDF Weighting Najeeb Abdulmutalib, Norbert Fuhr University of Duisburg-Essen, Germany {najeeb fuhr}@is.inf.uni-due.de Abstract In this paper, we investigate the relationship
More informationWhy Language Models and Inverse Document Frequency for Information Retrieval?
Why Language Models and Inverse Document Frequency for Information Retrieval? Catarina Moreira, Andreas Wichert Instituto Superior Técnico, INESC-ID Av. Professor Cavaco Silva, 2744-016 Porto Salvo, Portugal
More informationA Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 9, No 1 Sofia 2009 A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data Ch. Aswani Kumar 1,
More informationA Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement
A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,
More informationVideo and Motion Analysis Computer Vision Carnegie Mellon University (Kris Kitani)
Video and Motion Analysis 16-385 Computer Vision Carnegie Mellon University (Kris Kitani) Optical flow used for feature tracking on a drone Interpolated optical flow used for super slow-mo optical flow
More informationLatent Semantic Analysis. Hongning Wang
Latent Semantic Analysis Hongning Wang CS@UVa Recap: vector space model Represent both doc and query by concept vectors Each concept defines one dimension K concepts define a high-dimensional space Element
More informationPhys 102 Lecture 12 Currents & magnetic fields
Phys 102 Lecture 12 Currents & magnetic fields 1 Today we will... Learn how magnetic fields are created by currents Use specific examples Long straight wire Current loop Solenoid Apply these concepts Electromagnets
More informationPhoneme segmentation based on spectral metrics
Phoneme segmentation based on spectral metrics Xianhua Jiang, Johan Karlsson, and Tryphon T. Georgiou Introduction We consider the classic problem of segmenting speech signals into individual phonemes.
More informationIndependent Component Analysis and Unsupervised Learning
Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien National Cheng Kung University TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent
More informationA Constraint Generation Approach to Learning Stable Linear Dynamical Systems
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University NIPS 2007 poster W22 steam Application: Dynamic Textures
More informationLatent Dirichlet Allocation Based Multi-Document Summarization
Latent Dirichlet Allocation Based Multi-Document Summarization Rachit Arora Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai - 600 036, India. rachitar@cse.iitm.ernet.in
More informationCopyright Warning & Restrictions
Copyright Warning & Restrictions The copyright law of the United States (Title 17, United States Code) governs the making of photocopies or other reproductions of copyrighted material. Under certain conditions
More informationIndependent Component Analysis and Unsupervised Learning. Jen-Tzung Chien
Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent voices Nonparametric likelihood
More informationINSPIRE as a Powerful Tool for Managing and Opening Environmental Data
INSPIRE as a Powerful Tool for Managing and Opening Environmental Data Leszek Litwin Maria Lenartowicz 1, Alina Litwiak 2, Elżbieta Duraj 2, Łukasz Łukasiewicz 2, Leszek Litwin 2 1 Chief Inspectorate of
More informationModeling Timing Structure in Multimedia Signals
Modeling Timing Structure in Multimedia Signals Hiroaki Kawashima, Kimitaka Tsutsumi, and Takashi Matsuyama Kyoto University, Yoshida-Honmachi Sakyo, Kyoto 6068501, JAPAN, {kawashima,tm}@i.kyoto-u.ac.jp,
More informationRobust Sound Event Detection in Continuous Audio Environments
Robust Sound Event Detection in Continuous Audio Environments Haomin Zhang 1, Ian McLoughlin 2,1, Yan Song 1 1 National Engineering Laboratory of Speech and Language Information Processing The University
More informationAn Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis
ICASSP 07 New Orleans, USA An Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis Xin WANG, Shinji TAKAKI, Junichi YAMAGISHI National Institute of Informatics, Japan 07-03-07
More informationCOMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM
COMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM Akinori Ito Sung-Phil Heo Motoyuki Suzuki Shozo Makino Graduate School of Engineering Tohoku University Aoba 05, Aramaki, Sendai, 980-8579
More informationCan Vector Space Bases Model Context?
Can Vector Space Bases Model Context? Massimo Melucci University of Padua Department of Information Engineering Via Gradenigo, 6/a 35031 Padova Italy melo@dei.unipd.it Abstract Current Information Retrieval
More informationSTATISTICS FOR EFFICIENT LINEAR AND NON-LINEAR PICTURE ENCODING
STATISTICS FOR EFFICIENT LINEAR AND NON-LINEAR PICTURE ENCODING Item Type text; Proceedings Authors Kummerow, Thomas Publisher International Foundation for Telemetering Journal International Telemetering
More informationML (cont.): SUPPORT VECTOR MACHINES
ML (cont.): SUPPORT VECTOR MACHINES CS540 Bryan R Gibson University of Wisconsin-Madison Slides adapted from those used by Prof. Jerry Zhu, CS540-1 1 / 40 Support Vector Machines (SVMs) The No-Math Version
More informationIntroduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model
Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model Ranked retrieval Thus far, our queries have all been Boolean. Documents either
More informationCS630 Representing and Accessing Digital Information Lecture 6: Feb 14, 2006
Scribes: Gilly Leshed, N. Sadat Shami Outline. Review. Mixture of Poissons ( Poisson) model 3. BM5/Okapi method 4. Relevance feedback. Review In discussing probabilistic models for information retrieval
More informationWelcome to Pittsburgh!
Welcome to Pittsburgh! Overview of the IWSLT 2005 Evaluation Campaign atthias Eck and Chiori Hori InterACT Carnegie ellon niversity IWSLT2005 Evaluation campaign Working on the same test bed Training Corpus
More informationJust the Library Browsing the Library Back to the Front of the Library Search on Force and Collisions Results 2
Improved Web Support for Physics Teachers Brian Adrian and Dean Zollman Kansas State University and Scott Stevens and Mike Cristel Carnegie Mellon University Supported by the National Science Foundation
More informationINFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from
INFO 4300 / CS4300 Information Retrieval slides adapted from Hinrich Schütze s, linked from http://informationretrieval.org/ IR 26/26: Feature Selection and Exam Overview Paul Ginsparg Cornell University,
More informationQuery Performance Prediction: Evaluation Contrasted with Effectiveness
Query Performance Prediction: Evaluation Contrasted with Effectiveness Claudia Hauff 1, Leif Azzopardi 2, Djoerd Hiemstra 1, and Franciska de Jong 1 1 University of Twente, Enschede, the Netherlands {c.hauff,
More informationA Study for Evaluating the Importance of Various Parts of Speech (POS) for Information Retrieval (IR)
A Study for Evaluating the Importance of Various Parts of Speech (POS) for Information Retrieval (IR) Chirag Shah Dept. of CSE IIT Bombay, Powai Mumbai - 400 076, Maharashtra, India. Email: chirag@cse.iitb.ac.in
More informationBoundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks
INTERSPEECH 2014 Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks Ryu Takeda, Naoyuki Kanda, and Nobuo Nukaga Central Research Laboratory, Hitachi Ltd., 1-280, Kokubunji-shi,
More informationA Low-Distortion Noise Canceller and Its Learning Algorithm in Presence of Crosstalk
414 IEICE TRANS. FUNDAMENTALS, VOL.E84 A, NO.2 FEBRUARY 2001 PAPER Special Section on Noise Cancellation Reduction Techniques A Low-Distortion Noise Canceller Its Learning Algorithm in Presence of Crosstalk
More informationLecture 3: Pivoted Document Length Normalization
CS 6740: Advanced Language Technologies February 4, 2010 Lecture 3: Pivoted Document Length Normalization Lecturer: Lillian Lee Scribes: Lakshmi Ganesh, Navin Sivakumar Abstract In this lecture, we examine
More informationHigh Precision Control of Ball Screw Driven Stage Using Repetitive Control with Sharp Roll-off Learning Filter
High Precision Control of Ball Screw Driven Stage Using Repetitive Control with Sharp Roll-off Learning Filter Tadashi Takemura and Hiroshi Fujimoto The University of Tokyo --, Kashiwanoha, Kashiwa, Chiba,
More informationGeoPlot : Spatial Data Mining on Video Libraries
GeoPlot : Spatial Data Mining on Video Libraries Jia-Yu Pan, Christos Faloutsos Computer Science Department Carnegie Mellon University Pittsburgh, PA {jypan, christos}@cs.cmu.edu ABSTRACT Are tornado touchdowns
More informationHidden Markov Models and Gaussian Mixture Models
Hidden Markov Models and Gaussian Mixture Models Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 4&5 23&27 January 2014 ASR Lectures 4&5 Hidden Markov Models and Gaussian
More informationNoisy Speech Recognition using Wavelet Transform and Weighting Coefficients for a Specific Level
Proceedings of th International Congress on Acoustics, ICA 23-27 August, Sydney, Australia Noisy Speech Recognition using Wavelet Transform and Weighting Coefficients for a Specific Level Yoichi MIDORIKAWA,
More informationA Multi-Layered Bayesian Network Model for Structured Document Retrieval
A Multi-Layered Bayesian Network Model for Structured Document Retrieval Fabio Crestani 1, Luis M. de Campos 2, Juan M. Fernández-Luna 2, and Juan F. Huete 2 1 Department of Computer and Information Sciences.
More informationNon-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology
Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation
More informationChemDataExtractor: A toolkit for automated extraction of chemical information from the scientific literature
: A toolkit for automated extraction of chemical information from the scientific literature Callum Court Molecular Engineering, University of Cambridge Supervisor: Dr Jacqueline Cole 1 / 20 Overview 1
More informationGMM-Based Speech Transformation Systems under Data Reduction
GMM-Based Speech Transformation Systems under Data Reduction Larbi Mesbahi, Vincent Barreaud, Olivier Boeffard IRISA / University of Rennes 1 - ENSSAT 6 rue de Kerampont, B.P. 80518, F-22305 Lannion Cedex
More informationTopic tracking language model for speech recognition
Available online at www.sciencedirect.com Computer Speech and Language 25 (2011) 440 461 Topic tracing language model for speech recognition Shinji Watanabe a,, Tomoharu Iwata a, Taaai Hori a, Atsushi
More informationSpeech Recognition Lecture 5: N-gram Language Models. Eugene Weinstein Google, NYU Courant Institute Slide Credit: Mehryar Mohri
Speech Recognition Lecture 5: N-gram Language Models Eugene Weinstein Google, NYU Courant Institute eugenew@cs.nyu.edu Slide Credit: Mehryar Mohri Components Acoustic and pronunciation model: Pr(o w) =
More informationStatistical Methods for NLP
Statistical Methods for NLP Sequence Models Joakim Nivre Uppsala University Department of Linguistics and Philology joakim.nivre@lingfil.uu.se Statistical Methods for NLP 1(21) Introduction Structured
More informationThe Moments of the Profile in Random Binary Digital Trees
Journal of mathematics and computer science 6(2013)176-190 The Moments of the Profile in Random Binary Digital Trees Ramin Kazemi and Saeid Delavar Department of Statistics, Imam Khomeini International
More information10-701/ Recitation : Kernels
10-701/15-781 Recitation : Kernels Manojit Nandi February 27, 2014 Outline Mathematical Theory Banach Space and Hilbert Spaces Kernels Commonly Used Kernels Kernel Theory One Weird Kernel Trick Representer
More informationA Study of the Dirichlet Priors for Term Frequency Normalisation
A Study of the Dirichlet Priors for Term Frequency Normalisation ABSTRACT Ben He Department of Computing Science University of Glasgow Glasgow, United Kingdom ben@dcs.gla.ac.uk In Information Retrieval
More informationFORMULATION OF DRIVER JUDGMENT PROCESS AROUND CURVES FOR DEVIATED STATE DETECTION. Tokyo, Japan
FORMULATION OF DRIVER JUDGMENT PROCESS AROUND CURVES FOR DEVIATED STATE DETECTION Motoki Shino 1, Hiroshi Yoshitake 1, Machiko Hiramatsu 2, Takashi Sunda 2 & Minoru Kamata 1 1 The University of Tokyo 2
More informationBehaviour Recognition using the Event Calculus
Behaviour Recognition using the Event Calculus Alexander Artikis and George Paliouras Abstract We present a system for recognising human behaviour given a symbolic representation of surveillance videos.
More informationRelated Term Suggestion using Cooking Recipe Document Structure and its Application to Interactive Query Expansion
Related Term Suggestion using Cooking Recipe Document Structure and its Application to Interactive Query Expansion 1 Michiko Yasukawa 1 1 1 Faculty of Science and Technology, Gunma University Abstract:
More information1 Information retrieval fundamentals
CS 630 Lecture 1: 01/26/2006 Lecturer: Lillian Lee Scribes: Asif-ul Haque, Benyah Shaparenko This lecture focuses on the following topics Information retrieval fundamentals Vector Space Model (VSM) Deriving
More informationTowards Collaborative Information Retrieval
Towards Collaborative Information Retrieval Markus Junker, Armin Hust, and Stefan Klink German Research Center for Artificial Intelligence (DFKI GmbH), P.O. Box 28, 6768 Kaiserslautern, Germany {markus.junker,
More informationUniversity Physics With Modern Physics Wolfgang Bauer
We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with university physics with
More informationDeep Learning (CNNs)
10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Deep Learning (CNNs) Deep Learning Readings: Murphy 28 Bishop - - HTF - - Mitchell
More informationRecent Advances in Bayesian Inference Techniques
Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian
More informationSpeech and Language Processing. Chapter 9 of SLP Automatic Speech Recognition (II)
Speech and Language Processing Chapter 9 of SLP Automatic Speech Recognition (II) Outline for ASR ASR Architecture The Noisy Channel Model Five easy pieces of an ASR system 1) Language Model 2) Lexicon/Pronunciation
More informationPre-Initialized Composition For Large-Vocabulary Speech Recognition
Pre-Initialized Composition For Large-Vocabulary Speech Recognition Cyril Allauzen, Michael Riley Google Research, 76 Ninth Avenue, New York, NY, USA allauzen@google.com, riley@google.com Abstract This
More informationLecture 3: ASR: HMMs, Forward, Viterbi
Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The
More informationThe hadronization into the octet of pseudoscalar mesons in terms of SU(N) gauge invariant Lagrangian
The hadronization into the octet of pseudoscalar mesons in terms of SU(N gauge invariant Lagrangian National Research Nuclear University Moscow 115409, Moscow, Russia E-mail: a kosh@internets.ru By breaking
More informationStatistical Filters for Crowd Image Analysis
Statistical Filters for Crowd Image Analysis Ákos Utasi, Ákos Kiss and Tamás Szirányi Distributed Events Analysis Research Group, Computer and Automation Research Institute H-1111 Budapest, Kende utca
More informationEFFECTS OF ALONG-TRACK INTEGRATION ON DOPPLER VELOCITY BIAS WITH A SPACEBORNE CLOUD-PROFILING RADAR
P3.11 EFFECTS OF ALONG-TRACK INTEGRATION ON DOPPLER VELOCITY BIAS WITH A SPACEBORNE CLOUD-PROFILING RADAR Akihisa Uematsu 1 *, Yuichi Ohno 1, Hiroaki Horie 1,2, Hiroshi Kumagai 1, and Nick Schutgens 3
More informationLatent Semantic Analysis. Hongning Wang
Latent Semantic Analysis Hongning Wang CS@UVa VS model in practice Document and query are represented by term vectors Terms are not necessarily orthogonal to each other Synonymy: car v.s. automobile Polysemy:
More informationExploring Asymmetric Clustering for Statistical Language Modeling
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL, Philadelphia, July 2002, pp. 83-90. Exploring Asymmetric Clustering for Statistical Language Modeling Jianfeng
More informationBrief Introduction of Machine Learning Techniques for Content Analysis
1 Brief Introduction of Machine Learning Techniques for Content Analysis Wei-Ta Chu 2008/11/20 Outline 2 Overview Gaussian Mixture Model (GMM) Hidden Markov Model (HMM) Support Vector Machine (SVM) Overview
More informationLearning Sequential Patterns for Lipreading
ONG, BOWDEN: LEARNING SEQUENTIAL PATTERNS FOR LIPREADING 1 Learning Sequential Patterns for Lipreading Eng-Jon Ong e.ong@surrey.ac.uk Richard Bowden r.bowden@surrey.ac.uk The Centre for Vision, Speech
More informationProgram verification using Hoare Logic¹
Program verification using Hoare Logic¹ Automated Reasoning - Guest Lecture Petros Papapanagiotou Part 2 of 2 ¹Contains material from Mike Gordon s slides: Previously on Hoare Logic A simple while language
More informationLayered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval
Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval Jung-Woo Ha, Byoung-Hee Kim, Bado Lee, and Byoung-Tak Zhang Biointelligence
More informationWeibull-Gamma composite distribution: An alternative multipath/shadowing fading model
Weibull-Gamma composite distribution: An alternative multipath/shadowing fading model Petros S. Bithas Institute for Space Applications and Remote Sensing, National Observatory of Athens, Metaxa & Vas.
More informationModel Theory Based Fusion Framework with Application to. Multisensor Target Recognition. Zbigniew Korona and Mieczyslaw M. Kokar
Model Theory Based Framework with Application to Multisensor Target Recognition Abstract In this work, we present a model theory based fusion methodology for multisensor waveletfeatures based recognition
More informationDeep Learning for Speech Recognition. Hung-yi Lee
Deep Learning for Speech Recognition Hung-yi Lee Outline Conventional Speech Recognition How to use Deep Learning in acoustic modeling? Why Deep Learning? Speaker Adaptation Multi-task Deep Learning New
More informationESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY
ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu
More informationLecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen
Lecture 10 Discriminative Training, ROVER, and Consensus Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen IBM T.J. Watson Research Center Yorktown Heights, New York, USA {picheny,bhuvana,stanchen}@us.ibm.com
More informationWe Live in Exciting Times. CSCI-567: Machine Learning (Spring 2019) Outline. Outline. ACM (an international computing research society) has named
We Live in Exciting Times ACM (an international computing research society) has named CSCI-567: Machine Learning (Spring 2019) Prof. Victor Adamchik U of Southern California Apr. 2, 2019 Yoshua Bengio,
More informationAutomatic Generation of Shogi Commentary with a Log-Linear Language Model
1,a) 2, 1,b) 1,c) 3,d) 1,e) 2011 11 4, 2011 12 1 2 Automatic Generation of Shogi Commentary with a Log-Linear Language Model Hirotaka Kameko 1,a) Makoto Miwa 2, 1,b) Yoshimasa Tsuruoka 1,c) Shinsuke Mori
More informationA FAST AND ACCURATE ADAPTIVE NOTCH FILTER USING A MONOTONICALLY INCREASING GRADIENT. Yosuke SUGIURA
A FAST AND ACCURATE ADAPTIVE NOTCH FILTER USING A MONOTONICALLY INCREASING GRADIENT Yosuke SUGIURA Tokyo University of Science Faculty of Science and Technology ABSTRACT In this paper, we propose a new
More informationCS 154, Lecture 2: Finite Automata, Closure Properties Nondeterminism,
CS 54, Lecture 2: Finite Automata, Closure Properties Nondeterminism, Why so Many Models? Streaming Algorithms 0 42 Deterministic Finite Automata Anatomy of Deterministic Finite Automata transition: for
More informationFall CS646: Information Retrieval. Lecture 6 Boolean Search and Vector Space Model. Jiepu Jiang University of Massachusetts Amherst 2016/09/26
Fall 2016 CS646: Information Retrieval Lecture 6 Boolean Search and Vector Space Model Jiepu Jiang University of Massachusetts Amherst 2016/09/26 Outline Today Boolean Retrieval Vector Space Model Latent
More informationA NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY
A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY Amir Hossein Harati Nead Torbati and Joseph Picone College of Engineering, Temple University Philadelphia, Pennsylvania, USA
More informationRanked Retrieval (2)
Text Technologies for Data Science INFR11145 Ranked Retrieval (2) Instructor: Walid Magdy 31-Oct-2017 Lecture Objectives Learn about Probabilistic models BM25 Learn about LM for IR 2 1 Recall: VSM & TFIDF
More informationA Genetic Algorithm with Expansion and Exploration Operators for the Maximum Satisfiability Problem
Applied Mathematical Sciences, Vol. 7, 2013, no. 24, 1183-1190 HIKARI Ltd, www.m-hikari.com A Genetic Algorithm with Expansion and Exploration Operators for the Maximum Satisfiability Problem Anna Gorbenko
More information4214 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 11, NOVEMBER 2006
4214 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 11, NOVEMBER 2006 Closed-Form Design of Generalized Maxflat R-Regular FIR M th-band Filters Using Waveform Moments Xi Zhang, Senior Member, IEEE,
More informationCS 2336 Discrete Mathematics
CS 2336 Discrete Mathematics Lecture 8 Counting: Permutations and Combinations 1 Outline Definitions Permutation Combination Interesting Identities 2 Definitions Selection and arrangement of objects appear
More informationopic Difference Factor Extraction between wo Document Sets and its Application to ext Categorization akahiko Kawatani Hewlett-Packard Labs Japan 3-8-13, akaido-higashi, Suginami-Ku, okyo, 168-0072 Japan
More informationPhysics 112 Spring 2014
Physics 112 Spring 2014 Phys 112 (S12) Syllabus/introduction 1 Goals Deeper understanding of concepts: less mysterious Entropy Free energy Chemical potential Statistical mechanics fluctuations kinetic
More informationMath 3C Lecture 20. John Douglas Moore
Math 3C Lecture 20 John Douglas Moore May 18, 2009 TENTATIVE FORMULA I Midterm I: 20% Midterm II: 20% Homework: 10% Quizzes: 10% Final: 40% TENTATIVE FORMULA II Higher of two midterms: 30% Homework: 10%
More informationSoundex distance metric
Text Algorithms (4AP) Lecture: Time warping and sound Jaak Vilo 008 fall Jaak Vilo MTAT.03.90 Text Algorithms Soundex distance metric Soundex is a coarse phonetic indexing scheme, widely used in genealogy.
More informationThe Comparison Test & Limit Comparison Test
The Comparison Test & Limit Comparison Test Math4 Department of Mathematics, University of Kentucky February 5, 207 Math4 Lecture 3 / 3 Summary of (some of) what we have learned about series... Math4 Lecture
More informationSpatial Role Labeling CS365 Course Project
Spatial Role Labeling CS365 Course Project Amit Kumar, akkumar@iitk.ac.in Chandra Sekhar, gchandra@iitk.ac.in Supervisor : Dr.Amitabha Mukerjee ABSTRACT In natural language processing one of the important
More informationPrediction of Citations for Academic Papers
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationFoundations of Natural Language Processing Lecture 6 Spelling correction, edit distance, and EM
Foundations of Natural Language Processing Lecture 6 Spelling correction, edit distance, and EM Alex Lascarides (Slides from Alex Lascarides and Sharon Goldwater) 2 February 2019 Alex Lascarides FNLP Lecture
More informationAutomatic Evaluation in Text Summarization
1 Automatic Evaluation in Text Summarization Hidetsugu Nanba Tsutomu Hirao Hiroshima City University nanba@hiroshima-cuacjp, http://wwwnlpitshiroshima-cuacjp/~nanba/ NTT NTT Communication Science Laboratories
More informationVector Space Scoring Introduction to Information Retrieval Informatics 141 / CS 121 Donald J. Patterson
Vector Space Scoring Introduction to Information Retrieval Informatics 141 / CS 121 Donald J. Patterson Content adapted from Hinrich Schütze http://www.informationretrieval.org Querying Corpus-wide statistics
More information