Transmogrification: The Magic of Feature Engineering Leah McGuire and Mayukh Bhaowal
|
|
- Alan Bruce
- 5 years ago
- Views:
Transcription
1 Transmogrification: The Magic of Feature Engineering Leah McGuire and Mayukh Bhaowal
2
3
4
5
6 ML algorithms take center stage in AI Modeling Raw Data Feature Engineering Bottleneck
7 Mythical Numeric Matrix X X2 X3 X4 X5 Y A B B A A
8 Use the data types
9 Automatic Feature Engineering Numeric Categorical Text Temporal Spatial Time difference Augment with external data e.g avg income Imputation Imputation Track null value Log transformation for large range Scaling - znormalize Smart Binning Track null value One Hot Encoding Dynamic Top K pivot Smart Binning LabelCount Encoding Category Embedding Tokenization Hash Encoding Circular Statistics Tf-Idf Word2Vec Sentiment Analysis Language Detection Time extraction (day, week, month, year) Closeness to major events Spatial fraudulent behavior e.g: impossible travel speed Geo-encoding
10 Transmogrification val featurevector = Seq(age, phone, , subject, zipcode).transmogrify()
11 Impact on Feature Engineering Is Spammy Top Domain Phone Country Code Age Phone Is Valid Age [-5] Vector Subject Age [5-35] Age [>35] Zipcode Top TF-IDF Terms Average Income
12
13
14 The Black Swan of Perfectly Interpretable Models Leah McGuire, Mayukh Bhaowal
15
16 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
17 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
18 The Question Why did the machine learning model make the decision that it did?
19 Translation # How do I fix this model? Data Scientist
20 Translation #2 Do we have our bases covered, in case of a regulatory audit? Legal Counsel
21 Translation #3 Does Einstein know what I know? How do I use this prediction? Non Technical End User
22 P(c f) Input Pk(c f) Pn(c f) Output Σ
23 Model Insights Report
24 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
25 Debuggability Top contributing features for surviving the Titanic: F. Gender 2. pclass 3. Body
26 Trust How can you trust a man that wears both a belt and suspenders? Man can't even trust his own pants.
27 Right Human Machine Wrong
28 Bias
29 Legal
30
31 Black defendant has higher risk scores
32 Actionable
33 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
34 It s complicated
35 Can you use a simple model? Feature Weights/ Importance Global Feature Weights/ Importance Local Are the raw features fed into the model interpretable? Does the consumer care about how features affect the model or just feature insights? Does the consumer care about individual predictions? Feature Impact Model Agnostic Global Secondary Model Global Feature Impact Model Agnostic Local Secondary Model Local
36 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
37 The best model or the model you can explain?
38 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
39 Where did you get the feature matrix? X X2 X3 X4 X5 Y A B B A A
40 Feature Engineering Is Spammy Top Domain Phone Country Code Age Phone Is Valid Age [-5] Vector Subject Age [5-35] Age [>35] Zipcode Top TF-IDF Terms Average Income
41 Metadata!!! The name of the feature the column was made from The name of the RAW feature(s) the column was made from Everything you did to get the column Any grouping information across columns Description of the value in the column
42 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
43 Interpretability: Global vs Local
44 Can you use a simple model? Feature Weights/ Importance Global Are the raw features fed into the model interpretable? Does the consumer care about how features affect the model or just feature insights? Feature Impact Model Agnostic Global Does the consumer care about individual predictions? Secondary Model Global
45 Feature Weight / Importance (Global)
46 Predict House Price
47 Predict Titanic Passenger Survival
48 P(c f) Input Pk(c f) Pn(c f) Output Σ
49 Feature Impact (Global - the hard way) X X2 X3 X4 X5 Y A B B A A
50 Feature Impact (Global - the hard way)
51 Issues with Feature Importance / Weight / Impact (Global)
52 Secondary Model Prediction Input Explanation
53 Secondary Model (Global)
54 Secondary Model (Global)
55 What we do: All the metadata about how you got the feature Correlation Mutual information Feature weight / importance Feature distribution
56 { "featurename" : "sex", What "derivedfeatures" : [ { "stagesapplied" : [ "pivottext_opsetvectorizer" ], "derivedfeaturevalue" : "Male", "corr" : , "mutualinformation" : , "contribution" : ,. }, { "stagesapplied" : [ "pivottext_opsetvectorizer" ], "derivedfeaturevalue" : "Female", "corr" : , "mutualinformation" : , "contribution" : ,. } } we do:
57 Roadmap for this talk What does it mean to explain your model? Why explain your model? How to explain your model? Interpretability vs accuracy tradeoff Complications of feature engineering Global (full model) solutions Local (record level) solutions
58 Can you use a simple model? Feature Weights/ Importance Local Are the raw features fed into the model interpretable? Does the consumer care about how features affect the model or just feature insights? Feature Impact Model Agnostic Local Does the consumer care about individual predictions? Secondary Model Local
59 Feature Weight (Local)
60 Predict House Price
61 Feature Weight (Local)
62 Feature Impact (LOCO) {"age":7., "embarked":"c", "name":"attalah, Miss. Malake", "pclass":"3", "parch":"", "sex":"female", "sibsp":"", "survived":., "ticket":"2627"} Score =.62 Why? sex = "female" (+.3), pclass = 3 (-.5),...
63 Secondary Model (LIME)
64 Secondary Model (Correlation) Norm (feature) * Corr
65 What we do: Use case determines LOCO or correlation Use case determines what level of features we show
66
67
Online Supplementary Material. MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data
Online Supplementary Material MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data PI : Subhadeep Mukhopadhyay Department of Statistics, Temple University Philadelphia, Pennsylvania,
More informationDecision Tree Ensembles
Decision Tree Ensembles Random Forest & Gradient Boosting CSE 416 Quiz Section 4/26/2018 Kaggle Titanic Data Passen gerid Survived Pclass Name Sex Age SibSp Parch Ticket Fare Cabin Embarked 1 0 3 Braund,
More informationMachine Learning Interpretability with H2O Driverless AI
Machine Learning Interpretability with H2O Driverless AI Patrick Hall, Navdeep Gill, Megan Kurka, & Wen Phan Edited by: Angela Bartz http://docs.h2o.ai September 2017: First Edition Machine Learning Interpretability
More informationHomework 3 COMS 4705 Fall 2017 Prof. Kathleen McKeown
Homework 3 COMS 4705 Fall 017 Prof. Kathleen McKeown The assignment consists of a programming part and a written part. For the programming part, make sure you have set up the development environment as
More informationsmart reply and implicit semantics Matthew Henderson and Brian Strope Google AI
smart reply and implicit semantics Matthew Henderson and Brian Strope Google AI collaborators include: Rami Al-Rfou, Yun-hsuan Sung Laszlo Lukacs, Ruiqi Guo, Sanjiv Kumar Balint Miklos, Ray Kurzweil and
More informationStats 170A: Project in Data Science Predictive Modeling: Regression
Stats 170A: Project in Data Science Predictive Modeling: Regression Padhraic Smyth Department of Computer Science Bren School of Information and Computer Sciences University of California, Irvine Reading,
More informationClassification: Analyzing Sentiment
Classification: Analyzing Sentiment STAT/CSE 416: Machine Learning Emily Fox University of Washington April 17, 2018 Predicting sentiment by topic: An intelligent restaurant review system 1 4/16/18 It
More informationHypothesis testing. Data to decisions
Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the
More informationClassification: Analyzing Sentiment
Classification: Analyzing Sentiment STAT/CSE 416: Machine Learning Emily Fox University of Washington April 17, 2018 Predicting sentiment by topic: An intelligent restaurant review system 1 It s a big
More informationLet s see if we can predict whether a student returns or does not return to St. Ambrose for their second year.
Assignment #13: GLM Scenario: Over the past few years, our first-to-second year retention rate has ranged from 77-80%. In other words, 77-80% of our first-year students come back to St. Ambrose for their
More informationFeature Engineering, Model Evaluations
Feature Engineering, Model Evaluations Giri Iyengar Cornell University gi43@cornell.edu Feb 5, 2018 Giri Iyengar (Cornell Tech) Feature Engineering Feb 5, 2018 1 / 35 Overview 1 ETL 2 Feature Engineering
More informationLogistic Regression. Some slides from Craig Burkett. STA303/STA1002: Methods of Data Analysis II, Summer 2016 Michael Guerzhoy
Logistic Regression Some slides from Craig Burkett STA303/STA1002: Methods of Data Analysis II, Summer 2016 Michael Guerzhoy Titanic Survival Case Study The RMS Titanic A British passenger liner Collided
More informationGenerative Clustering, Topic Modeling, & Bayesian Inference
Generative Clustering, Topic Modeling, & Bayesian Inference INFO-4604, Applied Machine Learning University of Colorado Boulder December 12-14, 2017 Prof. Michael Paul Unsupervised Naïve Bayes Last week
More information9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering
Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make
More informationBag of Words Meets Bags of Popcorn
Sentiment Analysis via and Natural Language Processing Tarleton State University July 16, 2015 Data Description Sentiment Score tf-idf NDSI AFINN List word score invincible 2 mirthful 3 flops -2 hypocritical
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationOnline Passive-Aggressive Algorithms. Tirgul 11
Online Passive-Aggressive Algorithms Tirgul 11 Multi-Label Classification 2 Multilabel Problem: Example Mapping Apps to smart folders: Assign an installed app to one or more folders Candy Crush Saga 3
More informationEssence of Machine Learning (and Deep Learning) Hoa M. Le Data Science Lab, HUST hoamle.github.io
Essence of Machine Learning (and Deep Learning) Hoa M. Le Data Science Lab, HUST hoamle.github.io 1 Examples https://www.youtube.com/watch?v=bmka1zsg2 P4 http://www.r2d3.us/visual-intro-to-machinelearning-part-1/
More informationTitanic: Data Analysis
Titanic: Data Analysis Victor Bernal Arzola May 2, 26 victor.bernal@mathmods.eu Introduction Data analysis is a process for obtaining raw data and converting it into information useful for decisionmaking
More informationhypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α
Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing
More informationContingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.
Contingency Tables Definition & Examples. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. (Using more than two factors gets complicated,
More informationNonparametric Distributed Learning Framework: Algorithm and Application to Variable Selection
Nonparametric Distributed Learning Framework: Algorithm and Application to Variable Selection Scott Bruce, Zeda Li, Hsiang-Chieh Yang, and Subhadeep Mukhopadhyay Temple University, Department of Statistics
More informationVocabulary: Samples and Populations
Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we
More informationFrom perceptrons to word embeddings. Simon Šuster University of Groningen
From perceptrons to word embeddings Simon Šuster University of Groningen Outline A basic computational unit Weighting some input to produce an output: classification Perceptron Classify tweets Written
More informationRegression. Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning)
Linear Regression Regression Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning) Example: Height, Gender, Weight Shoe Size Audio features
More informationContingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878
Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each
More informationRegression. Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning)
Linear Regression Regression Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning) Example: Height, Gender, Weight Shoe Size Audio features
More informationWeighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management
Data Management Department of Political Science and Government Aarhus University November 24, 2014 Data Management Weighting Handling missing data Categorizing missing data types Imputation Summary measures
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationSGA-2: WP6- Early estimates. SURS: Boro Nikić, Tomaž Špeh ESSnet Big Data Project Brussels 26.,
SGA-2: WP6- Early estimates SURS: Boro Nikić, Tomaž Špeh ESSnet Big Data Project Brussels 26.,27.10. 2017 Outline WP6 state of affairs WP6 pilot: Early estimates of economic indicators Methods Data sources
More informationHow to Present Results of Regression Models to Clinicians
How to Present Results of Regression Models to Clinicians Frank E Harrell Jr Department of Biostatistics Vanderbilt University School of Medicine f.harrell@vanderbilt.edu biostat.mc.vanderbilt.edu/fhhandouts
More informationALL OF NONPARAMETRIC STATISTICS SOLUTIONS
page 1 / 5 page 2 / 5 all of nonparametric statistics pdf Nonparametric statistics is the branch of statistics that is not based solely on parametrized families of probability distributions (common examples
More informationCS570 Introduction to Data Mining
CS570 Introduction to Data Mining Department of Mathematics and Computer Science Li Xiong Data Exploration and Data Preprocessing Data and Attributes Data exploration Data pre-processing Data cleaning
More informationStatistical and Econometric Methods
Statistical and Econometric Methods Assignment #1 (Continuous Data - Regression Analysis) You are given 151 observations of a travel survey collected in State College, Pennsylvania. All of the households
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationYour Virtual Workforce. On Demand. Worldwide. COMPANY PRESENTATION. clickworker GmbH 2017
Your Virtual Workforce. On Demand. Worldwide. COMPANY PRESENTATION 2017 CLICKWORKER AT A GLANCE Segment: Paid Crowdsourcing / Microtasking Services: Text Creation (incl. SEO Texts), AI-Training Data, Internet
More informationPart A. P (w 1 )P (w 2 w 1 )P (w 3 w 1 w 2 ) P (w M w 1 w 2 w M 1 ) P (w 1 )P (w 2 w 1 )P (w 3 w 2 ) P (w M w M 1 )
Part A 1. A Markov chain is a discrete-time stochastic process, defined by a set of states, a set of transition probabilities (between states), and a set of initial state probabilities; the process proceeds
More informationKnown unknowns : using multiple imputation to fill in the blanks for missing data
Known unknowns : using multiple imputation to fill in the blanks for missing data James Stanley Department of Public Health University of Otago, Wellington james.stanley@otago.ac.nz Acknowledgments Cancer
More informationRecurrent Neural Networks Deep Learning Lecture 5. Efstratios Gavves
Recurrent Neural Networks Deep Learning Lecture 5 Efstratios Gavves Sequential Data So far, all tasks assumed stationary data Neither all data, nor all tasks are stationary though Sequential Data: Text
More informationSimilarity and Dissimilarity
1//015 Similarity and Dissimilarity COMP 465 Data Mining Similarity of Data Data Preprocessing Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed.
More information1. True/False (40 points) total
Name: 1 2 3 4 5 6 7 total 40 20 45 45 20 15 15 200 UMBC CMSC 671 Final Exam December 20, 2009 Please write all of your answers on this exam. The exam is closed book and has seven problems that add up to
More informationCategorization ANLP Lecture 10 Text Categorization with Naive Bayes
1 Categorization ANLP Lecture 10 Text Categorization with Naive Bayes Sharon Goldwater 6 October 2014 Important task for both humans and machines object identification face recognition spoken word recognition
More informationANLP Lecture 10 Text Categorization with Naive Bayes
ANLP Lecture 10 Text Categorization with Naive Bayes Sharon Goldwater 6 October 2014 Categorization Important task for both humans and machines 1 object identification face recognition spoken word recognition
More informationApproaches to Spatial Analysis. Flora Vale, Linda Beale, Mark Harrower, Clint Brown Esri Redlands
Approaches to Spatial Analysis Flora Vale, Linda Beale, Mark Harrower, Clint Brown Esri Redlands Analysis (noun) Detailed examination of the elements or structure of something, as a basis for discussion,
More informationSampling Distribution Models. Chapter 17
Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling
More informationAIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)
AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will
More informationSTATS216v Introduction to Statistical Learning Stanford University, Summer Midterm Exam (Solutions) Duration: 1 hours
Instructions: STATS216v Introduction to Statistical Learning Stanford University, Summer 2017 Remember the university honor code. Midterm Exam (Solutions) Duration: 1 hours Write your name and SUNet ID
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -
More informationCS 570: Machine Learning Seminar. Fall 2016
CS 570: Machine Learning Seminar Fall 2016 Class Information Class web page: http://web.cecs.pdx.edu/~mm/mlseminar2016-2017/fall2016/ Class mailing list: cs570@cs.pdx.edu My office hours: T,Th, 2-3pm or
More informationCHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics
CHAPTER 4 & 5 Linear Regression with One Regressor Kazu Matsuda IBEC PHBU 430 Econometrics Introduction Simple linear regression model = Linear model with one independent variable. y = dependent variable
More informationAd Placement Strategies
Case Study 1: Estimating Click Probabilities Tackling an Unknown Number of Features with Sketching Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox 2014 Emily Fox January
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationStatistics for Managers Using Microsoft Excel
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap
More informationThe prediction of house price
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationGaussian and Linear Discriminant Analysis; Multiclass Classification
Gaussian and Linear Discriminant Analysis; Multiclass Classification Professor Ameet Talwalkar Slide Credit: Professor Fei Sha Professor Ameet Talwalkar CS260 Machine Learning Algorithms October 13, 2015
More informationSemantics with Dense Vectors. Reference: D. Jurafsky and J. Martin, Speech and Language Processing
Semantics with Dense Vectors Reference: D. Jurafsky and J. Martin, Speech and Language Processing 1 Semantics with Dense Vectors We saw how to represent a word as a sparse vector with dimensions corresponding
More informationLoss Functions, Decision Theory, and Linear Models
Loss Functions, Decision Theory, and Linear Models CMSC 678 UMBC January 31 st, 2018 Some slides adapted from Hamed Pirsiavash Logistics Recap Piazza (ask & answer questions): https://piazza.com/umbc/spring2018/cmsc678
More informationFeature Design. Feature Design. Feature Design. & Deep Learning
Artificial Intelligence and its applications Lecture 9 & Deep Learning Professor Daniel Yeung danyeung@ieee.org Dr. Patrick Chan patrickchan@ieee.org South China University of Technology, China Appropriately
More informationAPPLIED DEEP LEARNING PROF ALEXIEI DINGLI
APPLIED DEEP LEARNING PROF ALEXIEI DINGLI TECH NEWS TECH NEWS HOW TO DO IT? TECH NEWS APPLICATIONS TECH NEWS TECH NEWS NEURAL NETWORKS Interconnected set of nodes and edges Designed to perform complex
More informationSparse vectors recap. ANLP Lecture 22 Lexical Semantics with Dense Vectors. Before density, another approach to normalisation.
ANLP Lecture 22 Lexical Semantics with Dense Vectors Henry S. Thompson Based on slides by Jurafsky & Martin, some via Dorota Glowacka 5 November 2018 Previous lectures: Sparse vectors recap How to represent
More informationANLP Lecture 22 Lexical Semantics with Dense Vectors
ANLP Lecture 22 Lexical Semantics with Dense Vectors Henry S. Thompson Based on slides by Jurafsky & Martin, some via Dorota Glowacka 5 November 2018 Henry S. Thompson ANLP Lecture 22 5 November 2018 Previous
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationAverage weight of Eisenhower dollar: 23 grams
Average weight of Eisenhower dollar: 23 grams Average cost of dinner in Decatur: 23 dollars Would it be more surprising to see A dinner that costs more than 27 dollars, or An Eisenhower dollar that weighs
More informationDeep Learning for Speech Recognition. Hung-yi Lee
Deep Learning for Speech Recognition Hung-yi Lee Outline Conventional Speech Recognition How to use Deep Learning in acoustic modeling? Why Deep Learning? Speaker Adaptation Multi-task Deep Learning New
More informationHypotheses and Errors
Hypotheses and Errors Jonathan Bagley School of Mathematics, University of Manchester Jonathan Bagley, September 23, 2005 Hypotheses & Errors - p. 1/22 Overview Today we ll develop the standard framework
More informationInformation Retrieval. Lecture 6
Information Retrieval Lecture 6 Recap of the last lecture Parametric and field searches Zones in documents Scoring documents: zone weighting Index support for scoring tf idf and vector spaces This lecture
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationCh 7: Dummy (binary, indicator) variables
Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male
More informationwhere Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.
Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter
More informationMarch 14 th March 18 th
March 14 th March 18 th Unit 8: Linear Functions Jump Start Using your own words, what is the question asking? Explain a strategy you ve learned this year to solve this problem. Solve the problem! 1 Scatter
More information2.4 Solving an Equation
2.4 Solving an Equation Purpose The role this topic plays in quantitative reasoning You can solve some equations that arise in the real world by isolating a variable. You can use this method to solve the
More informationDraft Proof - Do not copy, post, or distribute
1 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1. Distinguish between descriptive and inferential statistics. Introduction to Statistics 2. Explain how samples and populations,
More informationLecture 5 Neural models for NLP
CS546: Machine Learning in NLP (Spring 2018) http://courses.engr.illinois.edu/cs546/ Lecture 5 Neural models for NLP Julia Hockenmaier juliahmr@illinois.edu 3324 Siebel Center Office hours: Tue/Thu 2pm-3pm
More informationWhy aquarius women ignore. Why aquarius women ignore
Paieška Paieška Paieška Why aquarius women ignore Why aquarius women ignore > > Jan 6, 2017. Aquarians don't intentionally ignore someone unless they're trying to. Aquarius women love a man who is striving
More informationMatrices, Row Reduction of Matrices
Matrices, Row Reduction of Matrices October 9, 014 1 Row Reduction and Echelon Forms In the previous section, we saw a procedure for solving systems of equations It is simple in that it consists of only
More informationText Categorization CSE 454. (Based on slides by Dan Weld, Tom Mitchell, and others)
Text Categorization CSE 454 (Based on slides by Dan Weld, Tom Mitchell, and others) 1 Given: Categorization A description of an instance, x X, where X is the instance language or instance space. A fixed
More informationYou know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?
You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David
More informationBusiness Statistics. Lecture 9: Simple Regression
Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals
More informationIntroduction to Gaussian Process
Introduction to Gaussian Process CS 778 Chris Tensmeyer CS 478 INTRODUCTION 1 What Topic? Machine Learning Regression Bayesian ML Bayesian Regression Bayesian Non-parametric Gaussian Process (GP) GP Regression
More informationChi-Squared Tests. Semester 1. Chi-Squared Tests
Semester 1 Goodness of Fit Up to now, we have tested hypotheses concerning the values of population parameters such as the population mean or proportion. We have not considered testing hypotheses about
More informationCS 175, Project in Artificial Intelligence Lecture 3: Document Classification
CS 175, Project in Artificial Intelligence Lecture 3: Document Classification Padhraic Smyth Department of Computer Science Bren School of Information and Computer Sciences University of California, Irvine
More information(Feed-Forward) Neural Networks Dr. Hajira Jabeen, Prof. Jens Lehmann
(Feed-Forward) Neural Networks 2016-12-06 Dr. Hajira Jabeen, Prof. Jens Lehmann Outline In the previous lectures we have learned about tensors and factorization methods. RESCAL is a bilinear model for
More informationMSP Research Note. RDQ Reliability, Validity and Norms
MSP Research Note RDQ Reliability, Validity and Norms Introduction This research note describes the technical properties of the RDQ. Evidence for the reliability and validity of the RDQ is presented against
More informationCLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition
CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition Ad Feelders Universiteit Utrecht Department of Information and Computing Sciences Algorithmic Data
More informationTEEN DRIVER ELECTRONIC DEVICE OBSERVATION FORM
TEEN OBSERVATION FORM High Schools will count only teen drivers. If your school does not have many teen drivers, please use the Junior High form to count adult drivers. Choose a day to conduct your pre-observations.
More informationChapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1
Chapter 8 Hypothesis Testing with Two Samples Copyright 2015, 2012, and 2009 Pearson Education, Inc 1 Two Sample Hypothesis Test Compares two parameters from two populations Sampling methods: Independent
More informationAP Statistics Bivariate Data Analysis Test Review. Multiple-Choice
Name Period AP Statistics Bivariate Data Analysis Test Review Multiple-Choice 1. The correlation coefficient measures: (a) Whether there is a relationship between two variables (b) The strength of the
More informationSection 4. Test-Level Analyses
Section 4. Test-Level Analyses Test-level analyses include demographic distributions, reliability analyses, summary statistics, and decision consistency and accuracy. Demographic Distributions All eligible
More informationB. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i
B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts
More informationA proportion is the fraction of individuals having a particular attribute. Can range from 0 to 1!
Proportions A proportion is the fraction of individuals having a particular attribute. It is also the probability that an individual randomly sampled from the population will have that attribute Can range
More informationC-14 Finding the Right Synergy from GLMs and Machine Learning
C-14 Finding the Right Synergy from GLMs and Machine Learning 2010 CAS Annual Meeting Claudine Modlin November 8, 2010 Slide 1 Definitions Parametric modeling Objective: build a predictive model User makes
More informationOptimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai
Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment
More informationCSE 105 THEORY OF COMPUTATION
CSE 105 THEORY OF COMPUTATION "Winter" 2018 http://cseweb.ucsd.edu/classes/wi18/cse105-ab/ Today's learning goals Sipser Ch 4.1 Explain what it means for a problem to be decidable. Justify the use of encoding.
More informationPS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence.
PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. Example by eye. On a hot day, nine cars were left in the sun in a car parking lot. The length of time each car was left
More informationIntroduction to Machine Learning (Pattern recognition and model fitting) for Master students
Introduction to Machine Learning (Pattern recognition and model fitting) for Master students Spring 007, ÖU/RAP Thorsteinn Rögnvaldsson thorsteinn.rognvaldsson@tech.oru.se Contents Machine learning algorithms
More informationStatistics 1L03 - Midterm #2 Review
Statistics 1L03 - Midterm # Review Atinder Bharaj Made with L A TEX October, 01 Introduction As many of you will soon find out, I will not be holding the next midterm review. To make it a bit easier on
More informationInferences About Two Population Proportions
Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion
More informationAnnouncements. Problem Set Four due Thursday at 7:00PM (right before the midterm).
Finite Automata Announcements Problem Set Four due Thursday at 7:PM (right before the midterm). Stop by OH with questions! Email cs3@cs.stanford.edu with questions! Review session tonight, 7PM until whenever
More informationPresented By: Omer Shmueli and Sivan Niv
Deep Speaker: an End-to-End Neural Speaker Embedding System Chao Li, Xiaokong Ma, Bing Jiang, Xiangang Li, Xuewei Zhang, Xiao Liu, Ying Cao, Ajay Kannan, Zhenyao Zhu Presented By: Omer Shmueli and Sivan
More informationAnalysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College
Introductory Statistics Lectures Analysis of Variance 1-Way ANOVA: Many sample test of means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written
More information