Unied Presentation Contents Retrieval using Voice Information

Size: px
Start display at page:

Download "Unied Presentation Contents Retrieval using Voice Information"

Transcription

1 DEWS2006 6c-o1,, UPRISE (Unified Presentation slide Retrieval by Impression Search Engine) UPRISE UPRISE 4 e-learning Unied Presentation Contents Retrieval using Voice Information Hiroaki OKAMOTO, Wataru NAKANO, Takashi KOBAYASHI, Satoshi NAOI,, Haruo YOKOTA,, Koji IWANO, and Sadaoki FURUI Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institute of Technology Global Scientic Information & Computing Center, Tokyo Institute of Technology FUJITSU LABORATORIES LTD {okamoto,wnakano}@decstitechacjp, tkobaya@gsictitechacjp,, naoisatoshi@jpfujitsucom, {yokota,iwano,furui}@cstitechacjp Abstract We have proposed the UPRISE (Unified Presentation slide Retrieval by Impression Search Engine) to store lecture videos and presentation slides used in the lectures It provides dedicated scene search functions for the metadata-based unified presentation contents, using the slide structure, scene duration, and context information In this paper, we use the voice information in the lecture videos to improve the precision of scene retrieval using the speech recognition technique We propose four ways to add the influence of voice to the scene search functions of the UPRISE, and evaluate the effect of them The experimental results indicate that the voice information is effective to improve the precision Key words information integration, information retrieval, e-learning 1 [1] [3] e-learning e-learning

2 UPRISE(Unied Presentation slide Retrieval by Impression Search Engine) [4] [11] UPRISE / "!$#%! 1 3 UPRISE UPRISE UPRISE 2 1 UPRISE UPRISE 1 ( CSJ) [12], [13] UPRISE [11] [14] UPRISE [15] [7] UPRISE UPRISE [16], [17] [6] 2 2 UPRISE I p I p UPRISE L I p (s, k) = P(s, l) C(s, k, l) 2 UPRISE UPRISE l=1 2 2 s k l P(s, l) &('*)

3 s l C(s, k, l) s l [11] k P(s, l) I d I d I p ( ) θ T(s) I d (s, k, θ, u) = I p (s, k) u UPRISE T(s) s θ u I c I c I d 3 1 I c (s, k, θ, u, δ, ε 1, ε 2 ) = s+δ γ=s δ E(γ s, ε 1, ε 2 ) I d (s, k, θ, u) δ E(γ s, ε 1, ε 2 ) E(γ s, ε 1, ε 2 ) exp(ε 1 x) (x < 0) E(x, ε 1, ε 2 ) = [19], [20] exp( ε 2 x) (x > = 0) [21] δ [21] (CSJ) [12], [13] ε δ = 4, ε 1 = 50, ε 2 = 05 4 CSJ 967 ( 300 ) UPRISE I p I d I c ipadic 3 ( ) ( 300 ) 22,860 [9] [10] [9], [10] ( 90 ) 3 UPRISE Julius 1 [18] Julius 2 2 UPRISE, [21] 3 2 Julius ( ) ( ) XML forgejp/ 2 UPIRISE 3

4 XML XML fragment fragment ( ) string voice I p 0 voice 1 1 voice string in ( ) constimemilli I c ( ) constimesec ( ) 0 cmscore cmscore IDF [22] 2 XML IVF I vas (VA if keyword 3 2 is in the Scene) I vas (s, k, δ, θ, u, ε 1, ε 2, α) s k I c (s, k, δ, θ, u, ε 1, ε 2 ) + α ( ) T(s) θ u VA(s, k) (Ip 0) ( VA(s, k)(voice Appearance) VA T(s) ) θ = I c (s, k, δ, θ, u, ε 1, ε 2 ) (I p = 0) u α α = 6 I c 1 1 α = 0 I c VC(Voice I vas and Context) I c 0 s+δ ( ) θ T(s) I vac (VA if keyword is in Contexts) VC(s, k, θ, u, δ, ε 1, ε 2 ) = E(γ s, ε 1, ε 2 ) VA(s, k) u γ=s δ IVF(Inverse Voice Frequency) IVF(k) = log k I vac (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + α ( ) T(s) θ u VA(s, k) (Ic 0) = 0 (I c = 0) I vac ( (VC) ) I vcs

5 I vcs (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + αvc(s, k, δ, θ, u, ε 1, ε 2 ) (I p 0) = I c (s, k, δ, θ, u, ε 1, ε 2 ) (I p = 0) ( ) 254% (I c ) 0, (VC) I vcc,, I vcc (s, k, δ, θ, u, ε 1, ε 2, α) I c (s, k, δ, θ, u, ε 1, ε 2 ) + αvc(s, k, δ, θ, u, ε 1, ε 2 ) (I c 0) = 0 (I c = 0) 3 3 IVF [9] I vas I vac I vcs I vcc IS FP IVF IS FC IS FC IVF IS FC(k) = log k θ = 05 u = 60 δ = 4 ε 1 = 50 ε 2 = 05 4 I c 124 I x IS FC IVF x vas vac vcs vcc I p I c I c I x IS FC IVF (s, k, δ, θ, u, ε 1, ε 2, α) I c IS FC(k) + αi x IVF(k) = I c IS FC(k) 4 (I p 0 I c 0) (I p = 0 I c = 0) IS FC IVF UPRISE IS FC α % 267% I vas I vac I vcs I vcc α UPRISE ( 11 ) ( 12 ) [21] , (recall) (pre ,036 cision) [23] 1 ( ) 1

6 %& ' $ +, - *!#" ')(! "#!$ "#%$& "#%$&$ 3 4 α (5 ) α (1 ) UPRISE 0 1 α = I p 1 N = 1 N 1 N i i= α α α I vas I vac I vcs I vcc 0=I c I c 0560(α = 0 ) I c I vas 0035 I vac I vcs 0059 I vcc 005 I vcs α I vac α = 9, 10 2 ' ' ' ' ' I c ' 3 α α ' ' I c 14 I vcs α α I c 5 2 (I vcs ) α = 0 α = 10 α = 100 α =

7 3 I vcs I vcs IS FC I vcs IS FC IVF α = α = α = α = α = ' ' I vcs IS FC I p 4 (α = 5) I vcs I vcs IS FC IVF I vcs IS FC ' ' ( ) ( ) α α = 5 α = I vcs ' ' 4 ' ' ' ' ' 7 ' 3 α = 5 70 ' ' ' ' I vcs IS FC IVF I vcs IS FC IVF I vcs IS FC IVF IS FC 2 I vcs IS FC 3 α = 0( ) 0 ( IS FC ) IS FC I vcs IS FC α = 5 I vcs α ' ' I vcs α = 20 I vcs IS FC IVF 0623 ' ' ' ' I vcs IS FC IVF I vcs IS FC I vcs α = 15 I c I vcs IS FC IVF 0063(63%) 0623

8 K "! # $ % & "' (*) +-,& 6!#"%$'&(!)*$ +-, 0/ ":$'&;<=)*$ +?>0@BA!#"%$'&E+->2@BA GF!>0@HA CBDJI CBD*3 7 ( ) CBD* ( ) [11] 3 3 IDF [22] 1 IVF UPRISE UPRISE I 4 2 c ( 0560) I I vcs vcs 0059(59%) 0619 α ( I vcs IS FC I vcs IS FC IVF ) α 1 I 5 1 c ( 0560) 0063(63%) 0623 I vcs 6 ( ) α = α = 3

9 In Proc of WIRI2005, pp 47 52, April 2005 [10] Wataru Nakano, Yuta Ochi, Takashi Kobayashi, Yutaka Katsuyama, Satoshi Naoi, and Haruo Yokota Unied Presentation Contents Retrieval Using Laser Pointer Information In Proc of SWOD, pp , April 2005 [11],,,, 2005-dbs-137 (ii) (78), [24] pp ,, July 2005 ( ) [12] gojp/ csj/public/ [25] [13] Kikuo Maekawa, Hanae Koiso, Sadaoki Furui, and Hitoshi Isahara Spontaneous speech corpus of japanese In Proc LREC2000, Vol 2, pp , Athens, Greece, May 2000 [14] S Johnson, P Jourlin, G Moore, K S Jones, and PWoodland The 4 3 Cambridge University spoken document retrieval system In Proceedings of ICASSP'99, pp 49 52, [15],, 8, March 2002 [16],, Technical Report 2005-SLP ,, February 2005 [17],, e-learning, March 2004 [18], Julius 1, Vol 20, No 1, pp 41 49, 2005 [19],,, SP ,, Dec 2000 [20] Takahiro Shinozaki, Chiori Hori, and Sadaoki Furui Towards automatic transcription of spontaneous presentations In Proc Eu- rospeech2001, Vol 1, pp , Aalborg, Denmark, Sep 2001 Julius / [21], February 2005 [22] G Salton Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer Addison-Wesley, 1988 [23] D A Grossman and O Frieder Information Retrieval Algorithm and ( , ) Heuristics Kluwer, 1998 CREST 21 [24] A Singhal and F Pereria Document expansion for speech retrieval COE In Proc of ACM SIGIR 99, pp 34 41, 1999 [25] S Srinivasan and D Petkovic Phonetic confusion matrix based spoken document retrieval In Proc of ACM SIGIR2000, pp 81 87, 2000 [1] R Müller and T Ottmann The Authoring on the Fly system for automated recording and replay of (tele)presentations Multimedia Syst, Vol 8, No 3, pp , 2000 [2] Carnegie Mellon University The Informedia Project Informedia ii digital video library [3] G D Abowd Classroom 2000: an experiment with the instrumentation of a living edu cational environment IBM Syst J, Vol 38, No 4, pp , 1999 [4] DBS ,, July 2001 [5],,,, In Proc of DBWeb2002, pp , 2002 [6] Haruo Yokota, Takashi Kobayashi, Taichi Muraki, and Satoshi Naoi UPRISE: Unied Presentation Slide Retrieval by Impression Search Engine IEICE Transactions on Information and Systems, Vol E87- D, No 2, pp , February 2004 [7],,,, Vol J88-D-I, No 3, pp , March 2005 [8],,, pp DEWS B 3 DE, March 2004 [9] Hiroaki Okamoto, Takashi Kobayashi, and Haruo Yokota Presentation Retrieval Method Considering the Scope of Targets and Outputs

Considering Granularity of Laser Pointer and Speech in Information Integration for Lecture Scene Retrieval

Considering Granularity of Laser Pointer and Speech in Information Integration for Lecture Scene Retrieval DEWS2007 E1-3,, 152 8552 2 12 1 152 8550 2 12 1 211 8588 4 1 1 E-mail: wnakano@de.cs.titech.ac.jp, tkobaya@gsic.titech.ac.jp,, naoi.satoshi@jp.fujitsu.com, {yokota,furui}@cs.titech.ac.jp UPRISE e-learning

More information

N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition

N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 2010 11 5 N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 1 48-106413 Abstract Large-Vocabulary Continuous Speech Recognition(LVCSR) system has rapidly been growing today.

More information

An Algorithm for Fast Calculation of Back-off N-gram Probabilities with Unigram Rescaling

An Algorithm for Fast Calculation of Back-off N-gram Probabilities with Unigram Rescaling An Algorithm for Fast Calculation of Back-off N-gram Probabilities with Unigram Rescaling Masaharu Kato, Tetsuo Kosaka, Akinori Ito and Shozo Makino Abstract Topic-based stochastic models such as the probabilistic

More information

Pivoted Length Normalization I. Summary idf II. Review

Pivoted Length Normalization I. Summary idf II. Review 2 Feb 2006 1/11 COM S/INFO 630: Representing and Accessing [Textual] Digital Information Lecturer: Lillian Lee Lecture 3: 2 February 2006 Scribes: Siavash Dejgosha (sd82) and Ricardo Hu (rh238) Pivoted

More information

Selecting Good Speech Features for Recognition

Selecting Good Speech Features for Recognition ETRI Journal, volume 18, number 1, April 1996 29 Selecting Good Speech Features for Recognition Youngjik Lee and Kyu-Woong Hwang CONTENTS I. INTRODUCTION II. COMPARISON OF MEASURES III. ANALYSIS OF SPEECH

More information

Bobby Hunt, Mariappan S. Nadar, Paul Keller, Eric VonColln, and Anupam Goyal III. ASSOCIATIVE RECALL BY A POLYNOMIAL MAPPING

Bobby Hunt, Mariappan S. Nadar, Paul Keller, Eric VonColln, and Anupam Goyal III. ASSOCIATIVE RECALL BY A POLYNOMIAL MAPPING Synthesis of a Nonrecurrent Associative Memory Model Based on a Nonlinear Transformation in the Spectral Domain p. 1 Bobby Hunt, Mariappan S. Nadar, Paul Keller, Eric VonColln, Anupam Goyal Abstract -

More information

Effectiveness of complex index terms in information retrieval

Effectiveness of complex index terms in information retrieval Effectiveness of complex index terms in information retrieval Tokunaga Takenobu, Ogibayasi Hironori and Tanaka Hozumi Department of Computer Science Tokyo Institute of Technology Abstract This paper explores

More information

Shankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms

Shankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology, May 2004. Shankar

More information

Full-covariance model compensation for

Full-covariance model compensation for compensation transms Presentation Toshiba, 12 Mar 2008 Outline compensation transms compensation transms Outline compensation transms compensation transms Noise model x clean speech; n additive ; h convolutional

More information

Autoregressive Neural Models for Statistical Parametric Speech Synthesis

Autoregressive Neural Models for Statistical Parametric Speech Synthesis Autoregressive Neural Models for Statistical Parametric Speech Synthesis シンワン Xin WANG 2018-01-11 contact: wangxin@nii.ac.jp we welcome critical comments, suggestions, and discussion 1 https://www.slideshare.net/kotarotanahashi/deep-learning-library-coyotecnn

More information

Language Models, Smoothing, and IDF Weighting

Language Models, Smoothing, and IDF Weighting Language Models, Smoothing, and IDF Weighting Najeeb Abdulmutalib, Norbert Fuhr University of Duisburg-Essen, Germany {najeeb fuhr}@is.inf.uni-due.de Abstract In this paper, we investigate the relationship

More information

Why Language Models and Inverse Document Frequency for Information Retrieval?

Why Language Models and Inverse Document Frequency for Information Retrieval? Why Language Models and Inverse Document Frequency for Information Retrieval? Catarina Moreira, Andreas Wichert Instituto Superior Técnico, INESC-ID Av. Professor Cavaco Silva, 2744-016 Porto Salvo, Portugal

More information

A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data

A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 9, No 1 Sofia 2009 A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data Ch. Aswani Kumar 1,

More information

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,

More information

Video and Motion Analysis Computer Vision Carnegie Mellon University (Kris Kitani)

Video and Motion Analysis Computer Vision Carnegie Mellon University (Kris Kitani) Video and Motion Analysis 16-385 Computer Vision Carnegie Mellon University (Kris Kitani) Optical flow used for feature tracking on a drone Interpolated optical flow used for super slow-mo optical flow

More information

Latent Semantic Analysis. Hongning Wang

Latent Semantic Analysis. Hongning Wang Latent Semantic Analysis Hongning Wang CS@UVa Recap: vector space model Represent both doc and query by concept vectors Each concept defines one dimension K concepts define a high-dimensional space Element

More information

Phys 102 Lecture 12 Currents & magnetic fields

Phys 102 Lecture 12 Currents & magnetic fields Phys 102 Lecture 12 Currents & magnetic fields 1 Today we will... Learn how magnetic fields are created by currents Use specific examples Long straight wire Current loop Solenoid Apply these concepts Electromagnets

More information

Phoneme segmentation based on spectral metrics

Phoneme segmentation based on spectral metrics Phoneme segmentation based on spectral metrics Xianhua Jiang, Johan Karlsson, and Tryphon T. Georgiou Introduction We consider the classic problem of segmenting speech signals into individual phonemes.

More information

Independent Component Analysis and Unsupervised Learning

Independent Component Analysis and Unsupervised Learning Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien National Cheng Kung University TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent

More information

A Constraint Generation Approach to Learning Stable Linear Dynamical Systems

A Constraint Generation Approach to Learning Stable Linear Dynamical Systems A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University NIPS 2007 poster W22 steam Application: Dynamic Textures

More information

Latent Dirichlet Allocation Based Multi-Document Summarization

Latent Dirichlet Allocation Based Multi-Document Summarization Latent Dirichlet Allocation Based Multi-Document Summarization Rachit Arora Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai - 600 036, India. rachitar@cse.iitm.ernet.in

More information

Copyright Warning & Restrictions

Copyright Warning & Restrictions Copyright Warning & Restrictions The copyright law of the United States (Title 17, United States Code) governs the making of photocopies or other reproductions of copyrighted material. Under certain conditions

More information

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien

Independent Component Analysis and Unsupervised Learning. Jen-Tzung Chien Independent Component Analysis and Unsupervised Learning Jen-Tzung Chien TABLE OF CONTENTS 1. Independent Component Analysis 2. Case Study I: Speech Recognition Independent voices Nonparametric likelihood

More information

INSPIRE as a Powerful Tool for Managing and Opening Environmental Data

INSPIRE as a Powerful Tool for Managing and Opening Environmental Data INSPIRE as a Powerful Tool for Managing and Opening Environmental Data Leszek Litwin Maria Lenartowicz 1, Alina Litwiak 2, Elżbieta Duraj 2, Łukasz Łukasiewicz 2, Leszek Litwin 2 1 Chief Inspectorate of

More information

Modeling Timing Structure in Multimedia Signals

Modeling Timing Structure in Multimedia Signals Modeling Timing Structure in Multimedia Signals Hiroaki Kawashima, Kimitaka Tsutsumi, and Takashi Matsuyama Kyoto University, Yoshida-Honmachi Sakyo, Kyoto 6068501, JAPAN, {kawashima,tm}@i.kyoto-u.ac.jp,

More information

Robust Sound Event Detection in Continuous Audio Environments

Robust Sound Event Detection in Continuous Audio Environments Robust Sound Event Detection in Continuous Audio Environments Haomin Zhang 1, Ian McLoughlin 2,1, Yan Song 1 1 National Engineering Laboratory of Speech and Language Information Processing The University

More information

An Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis

An Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis ICASSP 07 New Orleans, USA An Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis Xin WANG, Shinji TAKAKI, Junichi YAMAGISHI National Institute of Informatics, Japan 07-03-07

More information

COMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM

COMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM COMPARISON OF FEATURES FOR DP-MATCHING BASED QUERY-BY-HUMMING SYSTEM Akinori Ito Sung-Phil Heo Motoyuki Suzuki Shozo Makino Graduate School of Engineering Tohoku University Aoba 05, Aramaki, Sendai, 980-8579

More information

Can Vector Space Bases Model Context?

Can Vector Space Bases Model Context? Can Vector Space Bases Model Context? Massimo Melucci University of Padua Department of Information Engineering Via Gradenigo, 6/a 35031 Padova Italy melo@dei.unipd.it Abstract Current Information Retrieval

More information

STATISTICS FOR EFFICIENT LINEAR AND NON-LINEAR PICTURE ENCODING

STATISTICS FOR EFFICIENT LINEAR AND NON-LINEAR PICTURE ENCODING STATISTICS FOR EFFICIENT LINEAR AND NON-LINEAR PICTURE ENCODING Item Type text; Proceedings Authors Kummerow, Thomas Publisher International Foundation for Telemetering Journal International Telemetering

More information

ML (cont.): SUPPORT VECTOR MACHINES

ML (cont.): SUPPORT VECTOR MACHINES ML (cont.): SUPPORT VECTOR MACHINES CS540 Bryan R Gibson University of Wisconsin-Madison Slides adapted from those used by Prof. Jerry Zhu, CS540-1 1 / 40 Support Vector Machines (SVMs) The No-Math Version

More information

Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model

Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model Introduction to Information Retrieval (Manning, Raghavan, Schutze) Chapter 6 Scoring term weighting and the vector space model Ranked retrieval Thus far, our queries have all been Boolean. Documents either

More information

CS630 Representing and Accessing Digital Information Lecture 6: Feb 14, 2006

CS630 Representing and Accessing Digital Information Lecture 6: Feb 14, 2006 Scribes: Gilly Leshed, N. Sadat Shami Outline. Review. Mixture of Poissons ( Poisson) model 3. BM5/Okapi method 4. Relevance feedback. Review In discussing probabilistic models for information retrieval

More information

Welcome to Pittsburgh!

Welcome to Pittsburgh! Welcome to Pittsburgh! Overview of the IWSLT 2005 Evaluation Campaign atthias Eck and Chiori Hori InterACT Carnegie ellon niversity IWSLT2005 Evaluation campaign Working on the same test bed Training Corpus

More information

Just the Library Browsing the Library Back to the Front of the Library Search on Force and Collisions Results 2

Just the Library Browsing the Library Back to the Front of the Library Search on Force and Collisions Results 2 Improved Web Support for Physics Teachers Brian Adrian and Dean Zollman Kansas State University and Scott Stevens and Mike Cristel Carnegie Mellon University Supported by the National Science Foundation

More information

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from INFO 4300 / CS4300 Information Retrieval slides adapted from Hinrich Schütze s, linked from http://informationretrieval.org/ IR 26/26: Feature Selection and Exam Overview Paul Ginsparg Cornell University,

More information

Query Performance Prediction: Evaluation Contrasted with Effectiveness

Query Performance Prediction: Evaluation Contrasted with Effectiveness Query Performance Prediction: Evaluation Contrasted with Effectiveness Claudia Hauff 1, Leif Azzopardi 2, Djoerd Hiemstra 1, and Franciska de Jong 1 1 University of Twente, Enschede, the Netherlands {c.hauff,

More information

A Study for Evaluating the Importance of Various Parts of Speech (POS) for Information Retrieval (IR)

A Study for Evaluating the Importance of Various Parts of Speech (POS) for Information Retrieval (IR) A Study for Evaluating the Importance of Various Parts of Speech (POS) for Information Retrieval (IR) Chirag Shah Dept. of CSE IIT Bombay, Powai Mumbai - 400 076, Maharashtra, India. Email: chirag@cse.iitb.ac.in

More information

Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks

Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks INTERSPEECH 2014 Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks Ryu Takeda, Naoyuki Kanda, and Nobuo Nukaga Central Research Laboratory, Hitachi Ltd., 1-280, Kokubunji-shi,

More information

A Low-Distortion Noise Canceller and Its Learning Algorithm in Presence of Crosstalk

A Low-Distortion Noise Canceller and Its Learning Algorithm in Presence of Crosstalk 414 IEICE TRANS. FUNDAMENTALS, VOL.E84 A, NO.2 FEBRUARY 2001 PAPER Special Section on Noise Cancellation Reduction Techniques A Low-Distortion Noise Canceller Its Learning Algorithm in Presence of Crosstalk

More information

Lecture 3: Pivoted Document Length Normalization

Lecture 3: Pivoted Document Length Normalization CS 6740: Advanced Language Technologies February 4, 2010 Lecture 3: Pivoted Document Length Normalization Lecturer: Lillian Lee Scribes: Lakshmi Ganesh, Navin Sivakumar Abstract In this lecture, we examine

More information

High Precision Control of Ball Screw Driven Stage Using Repetitive Control with Sharp Roll-off Learning Filter

High Precision Control of Ball Screw Driven Stage Using Repetitive Control with Sharp Roll-off Learning Filter High Precision Control of Ball Screw Driven Stage Using Repetitive Control with Sharp Roll-off Learning Filter Tadashi Takemura and Hiroshi Fujimoto The University of Tokyo --, Kashiwanoha, Kashiwa, Chiba,

More information

GeoPlot : Spatial Data Mining on Video Libraries

GeoPlot : Spatial Data Mining on Video Libraries GeoPlot : Spatial Data Mining on Video Libraries Jia-Yu Pan, Christos Faloutsos Computer Science Department Carnegie Mellon University Pittsburgh, PA {jypan, christos}@cs.cmu.edu ABSTRACT Are tornado touchdowns

More information

Hidden Markov Models and Gaussian Mixture Models

Hidden Markov Models and Gaussian Mixture Models Hidden Markov Models and Gaussian Mixture Models Hiroshi Shimodaira and Steve Renals Automatic Speech Recognition ASR Lectures 4&5 23&27 January 2014 ASR Lectures 4&5 Hidden Markov Models and Gaussian

More information

Noisy Speech Recognition using Wavelet Transform and Weighting Coefficients for a Specific Level

Noisy Speech Recognition using Wavelet Transform and Weighting Coefficients for a Specific Level Proceedings of th International Congress on Acoustics, ICA 23-27 August, Sydney, Australia Noisy Speech Recognition using Wavelet Transform and Weighting Coefficients for a Specific Level Yoichi MIDORIKAWA,

More information

A Multi-Layered Bayesian Network Model for Structured Document Retrieval

A Multi-Layered Bayesian Network Model for Structured Document Retrieval A Multi-Layered Bayesian Network Model for Structured Document Retrieval Fabio Crestani 1, Luis M. de Campos 2, Juan M. Fernández-Luna 2, and Juan F. Huete 2 1 Department of Computer and Information Sciences.

More information

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology

Non-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation

More information

ChemDataExtractor: A toolkit for automated extraction of chemical information from the scientific literature

ChemDataExtractor: A toolkit for automated extraction of chemical information from the scientific literature : A toolkit for automated extraction of chemical information from the scientific literature Callum Court Molecular Engineering, University of Cambridge Supervisor: Dr Jacqueline Cole 1 / 20 Overview 1

More information

GMM-Based Speech Transformation Systems under Data Reduction

GMM-Based Speech Transformation Systems under Data Reduction GMM-Based Speech Transformation Systems under Data Reduction Larbi Mesbahi, Vincent Barreaud, Olivier Boeffard IRISA / University of Rennes 1 - ENSSAT 6 rue de Kerampont, B.P. 80518, F-22305 Lannion Cedex

More information

Topic tracking language model for speech recognition

Topic tracking language model for speech recognition Available online at www.sciencedirect.com Computer Speech and Language 25 (2011) 440 461 Topic tracing language model for speech recognition Shinji Watanabe a,, Tomoharu Iwata a, Taaai Hori a, Atsushi

More information

Speech Recognition Lecture 5: N-gram Language Models. Eugene Weinstein Google, NYU Courant Institute Slide Credit: Mehryar Mohri

Speech Recognition Lecture 5: N-gram Language Models. Eugene Weinstein Google, NYU Courant Institute Slide Credit: Mehryar Mohri Speech Recognition Lecture 5: N-gram Language Models Eugene Weinstein Google, NYU Courant Institute eugenew@cs.nyu.edu Slide Credit: Mehryar Mohri Components Acoustic and pronunciation model: Pr(o w) =

More information

Statistical Methods for NLP

Statistical Methods for NLP Statistical Methods for NLP Sequence Models Joakim Nivre Uppsala University Department of Linguistics and Philology joakim.nivre@lingfil.uu.se Statistical Methods for NLP 1(21) Introduction Structured

More information

The Moments of the Profile in Random Binary Digital Trees

The Moments of the Profile in Random Binary Digital Trees Journal of mathematics and computer science 6(2013)176-190 The Moments of the Profile in Random Binary Digital Trees Ramin Kazemi and Saeid Delavar Department of Statistics, Imam Khomeini International

More information

10-701/ Recitation : Kernels

10-701/ Recitation : Kernels 10-701/15-781 Recitation : Kernels Manojit Nandi February 27, 2014 Outline Mathematical Theory Banach Space and Hilbert Spaces Kernels Commonly Used Kernels Kernel Theory One Weird Kernel Trick Representer

More information

A Study of the Dirichlet Priors for Term Frequency Normalisation

A Study of the Dirichlet Priors for Term Frequency Normalisation A Study of the Dirichlet Priors for Term Frequency Normalisation ABSTRACT Ben He Department of Computing Science University of Glasgow Glasgow, United Kingdom ben@dcs.gla.ac.uk In Information Retrieval

More information

FORMULATION OF DRIVER JUDGMENT PROCESS AROUND CURVES FOR DEVIATED STATE DETECTION. Tokyo, Japan

FORMULATION OF DRIVER JUDGMENT PROCESS AROUND CURVES FOR DEVIATED STATE DETECTION. Tokyo, Japan FORMULATION OF DRIVER JUDGMENT PROCESS AROUND CURVES FOR DEVIATED STATE DETECTION Motoki Shino 1, Hiroshi Yoshitake 1, Machiko Hiramatsu 2, Takashi Sunda 2 & Minoru Kamata 1 1 The University of Tokyo 2

More information

Behaviour Recognition using the Event Calculus

Behaviour Recognition using the Event Calculus Behaviour Recognition using the Event Calculus Alexander Artikis and George Paliouras Abstract We present a system for recognising human behaviour given a symbolic representation of surveillance videos.

More information

Related Term Suggestion using Cooking Recipe Document Structure and its Application to Interactive Query Expansion

Related Term Suggestion using Cooking Recipe Document Structure and its Application to Interactive Query Expansion Related Term Suggestion using Cooking Recipe Document Structure and its Application to Interactive Query Expansion 1 Michiko Yasukawa 1 1 1 Faculty of Science and Technology, Gunma University Abstract:

More information

1 Information retrieval fundamentals

1 Information retrieval fundamentals CS 630 Lecture 1: 01/26/2006 Lecturer: Lillian Lee Scribes: Asif-ul Haque, Benyah Shaparenko This lecture focuses on the following topics Information retrieval fundamentals Vector Space Model (VSM) Deriving

More information

Towards Collaborative Information Retrieval

Towards Collaborative Information Retrieval Towards Collaborative Information Retrieval Markus Junker, Armin Hust, and Stefan Klink German Research Center for Artificial Intelligence (DFKI GmbH), P.O. Box 28, 6768 Kaiserslautern, Germany {markus.junker,

More information

University Physics With Modern Physics Wolfgang Bauer

University Physics With Modern Physics Wolfgang Bauer We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with university physics with

More information

Deep Learning (CNNs)

Deep Learning (CNNs) 10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Deep Learning (CNNs) Deep Learning Readings: Murphy 28 Bishop - - HTF - - Mitchell

More information

Recent Advances in Bayesian Inference Techniques

Recent Advances in Bayesian Inference Techniques Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian

More information

Speech and Language Processing. Chapter 9 of SLP Automatic Speech Recognition (II)

Speech and Language Processing. Chapter 9 of SLP Automatic Speech Recognition (II) Speech and Language Processing Chapter 9 of SLP Automatic Speech Recognition (II) Outline for ASR ASR Architecture The Noisy Channel Model Five easy pieces of an ASR system 1) Language Model 2) Lexicon/Pronunciation

More information

Pre-Initialized Composition For Large-Vocabulary Speech Recognition

Pre-Initialized Composition For Large-Vocabulary Speech Recognition Pre-Initialized Composition For Large-Vocabulary Speech Recognition Cyril Allauzen, Michael Riley Google Research, 76 Ninth Avenue, New York, NY, USA allauzen@google.com, riley@google.com Abstract This

More information

Lecture 3: ASR: HMMs, Forward, Viterbi

Lecture 3: ASR: HMMs, Forward, Viterbi Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The

More information

The hadronization into the octet of pseudoscalar mesons in terms of SU(N) gauge invariant Lagrangian

The hadronization into the octet of pseudoscalar mesons in terms of SU(N) gauge invariant Lagrangian The hadronization into the octet of pseudoscalar mesons in terms of SU(N gauge invariant Lagrangian National Research Nuclear University Moscow 115409, Moscow, Russia E-mail: a kosh@internets.ru By breaking

More information

Statistical Filters for Crowd Image Analysis

Statistical Filters for Crowd Image Analysis Statistical Filters for Crowd Image Analysis Ákos Utasi, Ákos Kiss and Tamás Szirányi Distributed Events Analysis Research Group, Computer and Automation Research Institute H-1111 Budapest, Kende utca

More information

EFFECTS OF ALONG-TRACK INTEGRATION ON DOPPLER VELOCITY BIAS WITH A SPACEBORNE CLOUD-PROFILING RADAR

EFFECTS OF ALONG-TRACK INTEGRATION ON DOPPLER VELOCITY BIAS WITH A SPACEBORNE CLOUD-PROFILING RADAR P3.11 EFFECTS OF ALONG-TRACK INTEGRATION ON DOPPLER VELOCITY BIAS WITH A SPACEBORNE CLOUD-PROFILING RADAR Akihisa Uematsu 1 *, Yuichi Ohno 1, Hiroaki Horie 1,2, Hiroshi Kumagai 1, and Nick Schutgens 3

More information

Latent Semantic Analysis. Hongning Wang

Latent Semantic Analysis. Hongning Wang Latent Semantic Analysis Hongning Wang CS@UVa VS model in practice Document and query are represented by term vectors Terms are not necessarily orthogonal to each other Synonymy: car v.s. automobile Polysemy:

More information

Exploring Asymmetric Clustering for Statistical Language Modeling

Exploring Asymmetric Clustering for Statistical Language Modeling Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL, Philadelphia, July 2002, pp. 83-90. Exploring Asymmetric Clustering for Statistical Language Modeling Jianfeng

More information

Brief Introduction of Machine Learning Techniques for Content Analysis

Brief Introduction of Machine Learning Techniques for Content Analysis 1 Brief Introduction of Machine Learning Techniques for Content Analysis Wei-Ta Chu 2008/11/20 Outline 2 Overview Gaussian Mixture Model (GMM) Hidden Markov Model (HMM) Support Vector Machine (SVM) Overview

More information

Learning Sequential Patterns for Lipreading

Learning Sequential Patterns for Lipreading ONG, BOWDEN: LEARNING SEQUENTIAL PATTERNS FOR LIPREADING 1 Learning Sequential Patterns for Lipreading Eng-Jon Ong e.ong@surrey.ac.uk Richard Bowden r.bowden@surrey.ac.uk The Centre for Vision, Speech

More information

Program verification using Hoare Logic¹

Program verification using Hoare Logic¹ Program verification using Hoare Logic¹ Automated Reasoning - Guest Lecture Petros Papapanagiotou Part 2 of 2 ¹Contains material from Mike Gordon s slides: Previously on Hoare Logic A simple while language

More information

Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval

Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval Jung-Woo Ha, Byoung-Hee Kim, Bado Lee, and Byoung-Tak Zhang Biointelligence

More information

Weibull-Gamma composite distribution: An alternative multipath/shadowing fading model

Weibull-Gamma composite distribution: An alternative multipath/shadowing fading model Weibull-Gamma composite distribution: An alternative multipath/shadowing fading model Petros S. Bithas Institute for Space Applications and Remote Sensing, National Observatory of Athens, Metaxa & Vas.

More information

Model Theory Based Fusion Framework with Application to. Multisensor Target Recognition. Zbigniew Korona and Mieczyslaw M. Kokar

Model Theory Based Fusion Framework with Application to. Multisensor Target Recognition. Zbigniew Korona and Mieczyslaw M. Kokar Model Theory Based Framework with Application to Multisensor Target Recognition Abstract In this work, we present a model theory based fusion methodology for multisensor waveletfeatures based recognition

More information

Deep Learning for Speech Recognition. Hung-yi Lee

Deep Learning for Speech Recognition. Hung-yi Lee Deep Learning for Speech Recognition Hung-yi Lee Outline Conventional Speech Recognition How to use Deep Learning in acoustic modeling? Why Deep Learning? Speaker Adaptation Multi-task Deep Learning New

More information

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY

ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY ESTIMATING TRAFFIC NOISE LEVELS USING ACOUSTIC MONITORING: A PRELIMINARY STUDY Jean-Rémy Gloaguen, Arnaud Can Ifsttar - LAE Route de Bouaye - CS4 44344, Bouguenais, FR jean-remy.gloaguen@ifsttar.fr Mathieu

More information

Lecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen

Lecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen Lecture 10 Discriminative Training, ROVER, and Consensus Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen IBM T.J. Watson Research Center Yorktown Heights, New York, USA {picheny,bhuvana,stanchen}@us.ibm.com

More information

We Live in Exciting Times. CSCI-567: Machine Learning (Spring 2019) Outline. Outline. ACM (an international computing research society) has named

We Live in Exciting Times. CSCI-567: Machine Learning (Spring 2019) Outline. Outline. ACM (an international computing research society) has named We Live in Exciting Times ACM (an international computing research society) has named CSCI-567: Machine Learning (Spring 2019) Prof. Victor Adamchik U of Southern California Apr. 2, 2019 Yoshua Bengio,

More information

Automatic Generation of Shogi Commentary with a Log-Linear Language Model

Automatic Generation of Shogi Commentary with a Log-Linear Language Model 1,a) 2, 1,b) 1,c) 3,d) 1,e) 2011 11 4, 2011 12 1 2 Automatic Generation of Shogi Commentary with a Log-Linear Language Model Hirotaka Kameko 1,a) Makoto Miwa 2, 1,b) Yoshimasa Tsuruoka 1,c) Shinsuke Mori

More information

A FAST AND ACCURATE ADAPTIVE NOTCH FILTER USING A MONOTONICALLY INCREASING GRADIENT. Yosuke SUGIURA

A FAST AND ACCURATE ADAPTIVE NOTCH FILTER USING A MONOTONICALLY INCREASING GRADIENT. Yosuke SUGIURA A FAST AND ACCURATE ADAPTIVE NOTCH FILTER USING A MONOTONICALLY INCREASING GRADIENT Yosuke SUGIURA Tokyo University of Science Faculty of Science and Technology ABSTRACT In this paper, we propose a new

More information

CS 154, Lecture 2: Finite Automata, Closure Properties Nondeterminism,

CS 154, Lecture 2: Finite Automata, Closure Properties Nondeterminism, CS 54, Lecture 2: Finite Automata, Closure Properties Nondeterminism, Why so Many Models? Streaming Algorithms 0 42 Deterministic Finite Automata Anatomy of Deterministic Finite Automata transition: for

More information

Fall CS646: Information Retrieval. Lecture 6 Boolean Search and Vector Space Model. Jiepu Jiang University of Massachusetts Amherst 2016/09/26

Fall CS646: Information Retrieval. Lecture 6 Boolean Search and Vector Space Model. Jiepu Jiang University of Massachusetts Amherst 2016/09/26 Fall 2016 CS646: Information Retrieval Lecture 6 Boolean Search and Vector Space Model Jiepu Jiang University of Massachusetts Amherst 2016/09/26 Outline Today Boolean Retrieval Vector Space Model Latent

More information

A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY

A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY Amir Hossein Harati Nead Torbati and Joseph Picone College of Engineering, Temple University Philadelphia, Pennsylvania, USA

More information

Ranked Retrieval (2)

Ranked Retrieval (2) Text Technologies for Data Science INFR11145 Ranked Retrieval (2) Instructor: Walid Magdy 31-Oct-2017 Lecture Objectives Learn about Probabilistic models BM25 Learn about LM for IR 2 1 Recall: VSM & TFIDF

More information

A Genetic Algorithm with Expansion and Exploration Operators for the Maximum Satisfiability Problem

A Genetic Algorithm with Expansion and Exploration Operators for the Maximum Satisfiability Problem Applied Mathematical Sciences, Vol. 7, 2013, no. 24, 1183-1190 HIKARI Ltd, www.m-hikari.com A Genetic Algorithm with Expansion and Exploration Operators for the Maximum Satisfiability Problem Anna Gorbenko

More information

4214 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 11, NOVEMBER 2006

4214 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 11, NOVEMBER 2006 4214 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 11, NOVEMBER 2006 Closed-Form Design of Generalized Maxflat R-Regular FIR M th-band Filters Using Waveform Moments Xi Zhang, Senior Member, IEEE,

More information

CS 2336 Discrete Mathematics

CS 2336 Discrete Mathematics CS 2336 Discrete Mathematics Lecture 8 Counting: Permutations and Combinations 1 Outline Definitions Permutation Combination Interesting Identities 2 Definitions Selection and arrangement of objects appear

More information

opic Difference Factor Extraction between wo Document Sets and its Application to ext Categorization akahiko Kawatani Hewlett-Packard Labs Japan 3-8-13, akaido-higashi, Suginami-Ku, okyo, 168-0072 Japan

More information

Physics 112 Spring 2014

Physics 112 Spring 2014 Physics 112 Spring 2014 Phys 112 (S12) Syllabus/introduction 1 Goals Deeper understanding of concepts: less mysterious Entropy Free energy Chemical potential Statistical mechanics fluctuations kinetic

More information

Math 3C Lecture 20. John Douglas Moore

Math 3C Lecture 20. John Douglas Moore Math 3C Lecture 20 John Douglas Moore May 18, 2009 TENTATIVE FORMULA I Midterm I: 20% Midterm II: 20% Homework: 10% Quizzes: 10% Final: 40% TENTATIVE FORMULA II Higher of two midterms: 30% Homework: 10%

More information

Soundex distance metric

Soundex distance metric Text Algorithms (4AP) Lecture: Time warping and sound Jaak Vilo 008 fall Jaak Vilo MTAT.03.90 Text Algorithms Soundex distance metric Soundex is a coarse phonetic indexing scheme, widely used in genealogy.

More information

The Comparison Test & Limit Comparison Test

The Comparison Test & Limit Comparison Test The Comparison Test & Limit Comparison Test Math4 Department of Mathematics, University of Kentucky February 5, 207 Math4 Lecture 3 / 3 Summary of (some of) what we have learned about series... Math4 Lecture

More information

Spatial Role Labeling CS365 Course Project

Spatial Role Labeling CS365 Course Project Spatial Role Labeling CS365 Course Project Amit Kumar, akkumar@iitk.ac.in Chandra Sekhar, gchandra@iitk.ac.in Supervisor : Dr.Amitabha Mukerjee ABSTRACT In natural language processing one of the important

More information

Prediction of Citations for Academic Papers

Prediction of Citations for Academic Papers 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Foundations of Natural Language Processing Lecture 6 Spelling correction, edit distance, and EM

Foundations of Natural Language Processing Lecture 6 Spelling correction, edit distance, and EM Foundations of Natural Language Processing Lecture 6 Spelling correction, edit distance, and EM Alex Lascarides (Slides from Alex Lascarides and Sharon Goldwater) 2 February 2019 Alex Lascarides FNLP Lecture

More information

Automatic Evaluation in Text Summarization

Automatic Evaluation in Text Summarization 1 Automatic Evaluation in Text Summarization Hidetsugu Nanba Tsutomu Hirao Hiroshima City University nanba@hiroshima-cuacjp, http://wwwnlpitshiroshima-cuacjp/~nanba/ NTT NTT Communication Science Laboratories

More information

Vector Space Scoring Introduction to Information Retrieval Informatics 141 / CS 121 Donald J. Patterson

Vector Space Scoring Introduction to Information Retrieval Informatics 141 / CS 121 Donald J. Patterson Vector Space Scoring Introduction to Information Retrieval Informatics 141 / CS 121 Donald J. Patterson Content adapted from Hinrich Schütze http://www.informationretrieval.org Querying Corpus-wide statistics

More information