Application of Machine Learning to Materials Discovery and Development
|
|
- Aldous Benson
- 5 years ago
- Views:
Transcription
1 Application of Machine Learning to Materials Discovery and Development Ankit Agrawal and Alok Choudhary Department of Electrical Engineering and Computer Science Northwestern University Contributors: Surya Kalidindi (GaTech), Basavarsu (TRDDC), Chris Wolverton (NU), Ahmet Cecen (GaTech), Parijat Deshpande (TRDDC), Bryce Meredig (NU) MURI 3-Year Review June 22-23, 2015
2 Integrated Computational Materials Engineering (ICME) Goal/means Performance Properties Structure Processing Cause and effect Olson, G. B. (1997). Computational design of hierarchically structured materials. Science, 277(5330),
3 Project Collaboration Goal/means Processing Structure Properties Performance Project I. Multi-objective Structure-Property Optimization Cause and effect Olson, G. B. (1997). Computational design of hierarchically structured materials. Science, 277(5330),
4 Project Collaboration Goal/means Processing Structure Properties Performance Project I. Multi-objective Structure-Property Optimization Project II. Multiscale Prediction of Localization Relationships Cause and effect Olson, G. B. (1997). Computational design of hierarchically structured materials. Science, 277(5330),
5 Project Collaboration Goal/means Processing Structure Properties Performance Cause and effect Project I. Multi-objective Structure-Property Optimization Project II. Multiscale Prediction of Localization Relationships Project III. Exploring Composition-Processing- Property Relationships Project IV. Compositionbased Discovery of Stable Compounds Olson, G. B. (1997). Computational design of hierarchically structured materials. Science, 277(5330),
6 Predicting fatigue strength of steel from composition and processing parameters Collaborative project between Agrawal (NU), Choudhary (NU), Kalidindi (GaTech), Basavarsu (TRDDC) Objective: Employ data-driven approaches to the NIMS public domain materials database for exploring composition-processingproperty relationships and constructing predictive models for fatigue strength of steels. COMPOSITION MANUFACTURING PROCESSES CORRELATES TO CORRELATES TO PROPERTIES (FATIGUE STRENGTH)
7 NIMS Database Attributes Fatigue Data Sheet Information: Chemical composition - %C, %Si, %Mn, %P, %S, %Ni, %Cr, Cu %, Mo% (all in wt. %) Upstream processing details - Ingot size, Reduction ratio, Non-metallic inclusions Heat treatment conditions Temperature, Time and other process conditions for Normalizing, Carburizing-Quenching and Tempering processes Mechanical properties - YS, UTS, %EL (Elongation), %RA (Reduction in Area), Vickers Hardness, Charpy impact value (J/cm 2 ), Rotating bending fatigue 10 7 cycles Total data records Carbon and low alloy steels observations, Carburizing steels - 48 observations and Spring steels -18 observations Reference : 6
8 Steel Fatigue Strength Prediction Framework 7
9 Data Mining Modeling Classification/Regression Learning a predictive model based on supervised (labeled) training data, which can then be used to classify unseen data E.g. Decision trees, Neural Networks, Support Vector Machines, etc. Model evaluation Test-train split Split the labeled data into training and testing sets Cross-validation Test every instance in the dataset using a model that has not seen that instance Types k-fold cross validation Leave-one-out cross-validation (LOOCV) with k=n Training split Testing split 8
10 Cluster Visualization 9
11 Information Gain Based Feature Ranking 10
12 Evaluation Metrics Compare vectors of actual and predicted values Coefficient of correlation (R) Coefficient of determination (R 2 ) Mean Absolute Error (MAE) Root Mean Squared Error (RMSE) Standard Deviation of Error (SDE) Mean Absolute Error Fraction (MAE) Root Mean Squared Error Fraction (RMSE) Standard Deviation of Error Fraction (SDE)
13 Results Comparison 12
14 13
15 Results Comparison A. Agrawal, P. D. Deshpande, A. Cecen, G. P. Basavarsu, A. N. Choudhary, and S. R. Kalidindi, Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters, Integrating Materials 14 and Manufacturing Innovation, 3 (8): 1 19, 2014.
16 Discovery of stable compounds Collaborative project between Agrawal (NU), Choudhary (NU), Wolverton (NU)
17 Discovery Framework Database Construction Thousands of DFT formation energies Empirical elemental data Predictive Modeling Model 1: established heuristic Model 2: data mining Model Evaluation Test models on unseen formation energies (a) Prediction Run combinatorial list of compositions through models Ranking Combine heuristic and data mining predictions Validation Experiments Crystal structure prediction Millions of candidate ternary compositions Models Formation energy predictions Ranked highpotential candidates Compound discovery (b)
18 Model formation energy (ev/atom)! Model Validation: Numerical DM: binaries! R 2 = 0.87! MAE = 0.27 ev/at! DM: bin. + tern.! R 2 = 0.93! MAE = 0.16 ev/at! Heuristic! R 2 = 0.95! MAE = 0.12 ev/at! DFT formation energy (ev/atom)!
19 True positive rate (sensitivity)! Model Validation: Ranking 1! 0.8! perfect classifier Classify all unstable DM: bin. +4k tern.! classify all stable Combined model outperforms either alone in regime of interest 0.6! 0.4! combined! heuristic! Classifier becomes: more conservative less conservative 0.2! random guessing! classify all unstable 0! 0! 0.2! 0.4! 0.6! 0.8! 1! False positive rate (1 - specificity)!
20 What happens when we rank all possible ternaries by their likelihood of stability?
21 Predictions for Discovery Interesting insights: Fingerprint of entire unexplored ternary composition space! Highest ranked ternary: SiYb 3 F 5 Si acts as an anion Validated with structure and DFT calculations pnictides, chalcogenides, halides Pt-X-Y Pm 12 S 19 Se a missing binary Pm 2 S 3? Average of all A-B-X ternaries
22 Validation Example of discovered stable ternary compositions whose stability was explicitly confirmed with crystal structure prediction. Our method is successful at identifying new stable compounds across a wide variety of chemistries. B. Meredig*, A. Agrawal*, S. Kirklin, J. E. Saal, J. W. Doak, A. Thompson, K. Zhang, A. Choudhary, and C. Wolverton, Combinatorial screening for new materials in unconstrained composition space with machine learning, Phys. Rev. B, 89, , March
23 Summary Steel Fatigue Strength Prediction o NIMS database consisting of composition and processing parameters linked with performance (fatigue strength). o Neural networks, decision trees, multivariate polynomial regression able to achieve high R 2 values of >0.98. Stable Compound Discovery o A database of DFT calculations used to learn compositionproperty relationships, thus mimicking DFT for estimating stability. o The resulting predictive models used to scan the entire ternary composition space to discover likely stable compositions. o Many predictions explicitly confirmed with crystal structure prediction and DFT. 22
24 Future Outlook Goal/means Processing Structure Properties Performance Cause and effect Project I. Multi-objective Structure-Property Optimization Project II. Multiscale Prediction of Localization Relationships Project III. Exploring Composition-Processing- Property Relationships Project IV. Compositionbased Discovery of Stable Compounds 23
25 Publications A. Agrawal, P. D. Deshpande, A. Cecen, G. P. Basavarsu, A. N. Choudhary, and S. R. Kalidindi, Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters, Integrating Materials and Manufacturing Innovation, vol. 3, no. 8, pp. 1 19, B. Meredig, A. Agrawal, S. Kirklin, J. E. Saal, J. W. Doak, A. Thompson, K. Zhang, A. Choudhary, and C. Wolverton, Combinatorial screening for new materials in unconstrained composition space with machine learning, Physical Review B, vol. 89, no , pp. 1 7, BM and AA are co-first authors. Ruoqian Liu, Abhishek Kumar, Zhengzhang Chen, Ankit Agrawal, Veera Sundararaghavan, Alok Choudhary, A predictive machine learning approach for microstructure optimization and materials design, Scientific Reports, Nature Publishing Group, 2015, in press. R. Liu, Z. Chen, T. Fast, S. Kalidindi, A. Agrawal, and A. Choudhary, Predictive Modeling in Characterizing Localization Relationships TMS Annual Meeting & Exhibition, Symposium of Data Analytics for Materials Science and Manufacturing, Feb , San Diego, CA. R. Liu, A. Kumar, Z. Chen, A. Agrawal, V. Sundararaghavan, and A. Choudhary, A Data Mining Approach in Structure-Property Optimization TMS Annual Meeting & Exhibition, Symposium of Data Analytics for Materials Science and Manufacturing, Feb , San Diego, CA. P. D. Deshpande, B. P. Gautham, A. Cecen, S. Kalidindi, A. Agrawal, and A. Choudhary, Application of Statistical and Machine Learning Techniques for Correlating Properties to Composition and Manufacturing Processes of Steels, in 2nd World Congress on Integrated Computational Materials Engineering, July 7-11, 2013, Salt Lake City, Utah, 2013, pp R. Liu, Y. Yabansu, S. Kalidindi, A. Agrawal, and A. Choudhary, Predictive Modeling in Characterizing Localization Relationships. 2015, in preparation. 24
26 Thank You! Ankit Agrawal Research Associate Professor Dept. of Electrical Engineering and Computer Science Northwestern University 25
MultiscaleMaterialsDesignUsingInformatics. S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA
MultiscaleMaterialsDesignUsingInformatics S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA9550-12-1-0458 1 Hierarchical Material Structure Kalidindi and DeGraef, ARMS, 2015 Main Challenges
More information(S1) (S2) = =8.16
Formulae for Attributes As described in the manuscript, the first step of our method to create a predictive model of material properties is to compute attributes based on the composition of materials.
More informationA Hybrid Computational-Experimental Approach for Automated Crystal Structure Solution
Engineering Conferences International ECI Digital Archives Harnessing The Materials Genome: Accelerated Materials Development via Computational and Experimental Tools Proceedings Fall 10-3-2012 A Hybrid
More informationDeep Learning for Chemical Compound Stability Prediction
Deep Learning for Chemical Compound Stability Prediction Ruoqian Liu rosanne@northwestern.edu Logan Ward Materials Science and Engineering loganward2012@u.north western.edu Ankit Agrawal ankitag@northwestern.edu
More informationElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition
www.nature.com/scientificreports Received: 1 August 2018 Accepted: 6 November 2018 Published: xx xx xxxx OPEN ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition Dipendra
More informationData Mining. Preamble: Control Application. Industrial Researcher s Approach. Practitioner s Approach. Example. Example. Goal: Maintain T ~Td
Data Mining Andrew Kusiak 2139 Seamans Center Iowa City, Iowa 52242-1527 Preamble: Control Application Goal: Maintain T ~Td Tel: 319-335 5934 Fax: 319-335 5669 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak
More informationCS 229 Final Report: Data-Driven Prediction of Band Gap of Materials
CS 229 Final Report: Data-Driven Prediction of Band Gap of Materials Fariah Hayee and Isha Datye Department of Electrical Engineering, Stanford University Rahul Kini Department of Material Science and
More informationDATA ANALYTICS IN NANOMATERIALS DISCOVERY
DATA ANALYTICS IN NANOMATERIALS DISCOVERY Michael Fernandez OCE-Postdoctoral Fellow September 2016 www.data61.csiro.au Materials Discovery Process Materials Genome Project Integrating computational methods
More informationFinding Nature s Missing Ternary Oxide Compounds using Machine Learning and Density Functional Theory: supporting information
Finding Nature s Missing Ternary Oxide Compounds using Machine Learning and Density Functional Theory: supporting information Geoffroy Hautier, Christopher C. Fischer, Anubhav Jain, Tim Mueller, Gerbrand
More informationHoldout and Cross-Validation Methods Overfitting Avoidance
Holdout and Cross-Validation Methods Overfitting Avoidance Decision Trees Reduce error pruning Cost-complexity pruning Neural Networks Early stopping Adjusting Regularizers via Cross-Validation Nearest
More informationDISTINGUISH HARD INSTANCES OF AN NP-HARD PROBLEM USING MACHINE LEARNING
DISTINGUISH HARD INSTANCES OF AN NP-HARD PROBLEM USING MACHINE LEARNING ZHE WANG, TONG ZHANG AND YUHAO ZHANG Abstract. Graph properties suitable for the classification of instance hardness for the NP-hard
More informationIntroduction to Machine Learning
Introduction to Machine Learning CS4731 Dr. Mihail Fall 2017 Slide content based on books by Bishop and Barber. https://www.microsoft.com/en-us/research/people/cmbishop/ http://web4.cs.ucl.ac.uk/staff/d.barber/pmwiki/pmwiki.php?n=brml.homepage
More informationClassification Using Decision Trees
Classification Using Decision Trees 1. Introduction Data mining term is mainly used for the specific set of six activities namely Classification, Estimation, Prediction, Affinity grouping or Association
More informationIchiro Takeuchi University of Maryland
High-throughput Experimentation and Machine Learning for Materials Discovery 55 Å 45 Å 35Å Ferroelectric library t s (Å) 25 Å 20 Å 15 Å 10 Å 5 Å No impurity Ti (3 Å) Ti (6 Å) Ti (9 Å) Cu (3 Å) Cu (6Å)
More informationCS 6375 Machine Learning
CS 6375 Machine Learning Nicholas Ruozzi University of Texas at Dallas Slides adapted from David Sontag and Vibhav Gogate Course Info. Instructor: Nicholas Ruozzi Office: ECSS 3.409 Office hours: Tues.
More informationarxiv: v2 [stat.ml] 4 Jul 2017
Noname manuscript No. (will be inserted by the editor) High-Dimensional Materials and Process Optimization using Data-driven Experimental Design with Well-Calibrated Uncertainty Estimates Julia Ling* Maxwell
More informationCLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition
CLUe Training An Introduction to Machine Learning in R with an example from handwritten digit recognition Ad Feelders Universiteit Utrecht Department of Information and Computing Sciences Algorithmic Data
More informationWMO Aeronautical Meteorology Scientific Conference 2017
Session 1 Science underpinning meteorological observations, forecasts, advisories and warnings 1.6 Observation, nowcast and forecast of future needs 1.6.1 Advances in observing methods and use of observations
More informationThe exam is closed book, closed notes except your one-page (two sides) or two-page (one side) crib sheet.
CS 189 Spring 013 Introduction to Machine Learning Final You have 3 hours for the exam. The exam is closed book, closed notes except your one-page (two sides) or two-page (one side) crib sheet. Please
More informationORIE 4741: Learning with Big Messy Data. Train, Test, Validate
ORIE 4741: Learning with Big Messy Data Train, Test, Validate Professor Udell Operations Research and Information Engineering Cornell December 4, 2017 1 / 14 Exercise You run a hospital. A vendor wants
More informationMark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.
CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.
More informationLearning with multiple models. Boosting.
CS 2750 Machine Learning Lecture 21 Learning with multiple models. Boosting. Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott Square Learning with multiple models: Approach 2 Approach 2: use multiple models
More information6.036 midterm review. Wednesday, March 18, 15
6.036 midterm review 1 Topics covered supervised learning labels available unsupervised learning no labels available semi-supervised learning some labels available - what algorithms have you learned that
More informationMachine learning for ligand-based virtual screening and chemogenomics!
Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:
More informationMachine Learning, Midterm Exam: Spring 2009 SOLUTION
10-601 Machine Learning, Midterm Exam: Spring 2009 SOLUTION March 4, 2009 Please put your name at the top of the table below. If you need more room to work out your answer to a question, use the back of
More informationAlgorithm-Independent Learning Issues
Algorithm-Independent Learning Issues Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2007 c 2007, Selim Aksoy Introduction We have seen many learning
More informationCS534 Machine Learning - Spring Final Exam
CS534 Machine Learning - Spring 2013 Final Exam Name: You have 110 minutes. There are 6 questions (8 pages including cover page). If you get stuck on one question, move on to others and come back to the
More informationMicroarray Data Analysis: Discovery
Microarray Data Analysis: Discovery Lecture 5 Classification Classification vs. Clustering Classification: Goal: Placing objects (e.g. genes) into meaningful classes Supervised Clustering: Goal: Discover
More informationReal Estate Price Prediction with Regression and Classification CS 229 Autumn 2016 Project Final Report
Real Estate Price Prediction with Regression and Classification CS 229 Autumn 2016 Project Final Report Hujia Yu, Jiafu Wu [hujiay, jiafuwu]@stanford.edu 1. Introduction Housing prices are an important
More informationCMU-Q Lecture 24:
CMU-Q 15-381 Lecture 24: Supervised Learning 2 Teacher: Gianni A. Di Caro SUPERVISED LEARNING Hypotheses space Hypothesis function Labeled Given Errors Performance criteria Given a collection of input
More informationLearning Decision Trees
Learning Decision Trees Machine Learning Spring 2018 1 This lecture: Learning Decision Trees 1. Representation: What are decision trees? 2. Algorithm: Learning decision trees The ID3 algorithm: A greedy
More informationDATA MINING WITH DIFFERENT TYPES OF X-RAY DATA
DATA MINING WITH DIFFERENT TYPES OF X-RAY DATA 315 C. K. Lowe-Ma, A. E. Chen, D. Scholl Physical & Environmental Sciences, Research and Advanced Engineering Ford Motor Company, Dearborn, Michigan, USA
More informationMachine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.
Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted
More informationCombinatorial Heterogeneous Catalysis
Combinatorial Heterogeneous Catalysis 650 μm by 650 μm, spaced 100 μm apart Identification of a new blue photoluminescent (PL) composite material, Gd 3 Ga 5 O 12 /SiO 2 Science 13 March 1998: Vol. 279
More informationCS145: INTRODUCTION TO DATA MINING
CS145: INTRODUCTION TO DATA MINING 5: Vector Data: Support Vector Machine Instructor: Yizhou Sun yzsun@cs.ucla.edu October 18, 2017 Homework 1 Announcements Due end of the day of this Thursday (11:59pm)
More informationData Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Part I Introduction to Data Mining by Tan, Steinbach, Kumar Adapted by Qiang Yang (2010) Tan,Steinbach,
More informationMachine Learning Concepts in Chemoinformatics
Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics
More informationMachine Learning & Data Mining
Group M L D Machine Learning M & Data Mining Chapter 7 Decision Trees Xin-Shun Xu @ SDU School of Computer Science and Technology, Shandong University Top 10 Algorithm in DM #1: C4.5 #2: K-Means #3: SVM
More informationSUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION
SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION 1 Outline Basic terminology Features Training and validation Model selection Error and loss measures Statistical comparison Evaluation measures 2 Terminology
More informationSupporting Information. Kinetically-Driven Phase Transformation during Lithiation in Copper Sulfide Nanoflakes
Supporting Information Kinetically-Driven Phase Transformation during Lithiation in Copper Sulfide Nanoflakes Kai He,*,, Zhenpeng Yao, Sooyeon Hwang, Na Li, Ke Sun, Hong Gan, Yaping Du, Hua Zhang, Chris
More informationIncorporating Boosted Regression Trees into Ecological Latent Variable Models
Incorporating Boosted Regression Trees into Ecological Latent Variable Models Rebecca A. Hutchinson, Li-Ping Liu, Thomas G. Dietterich School of EECS, Oregon State University Motivation Species Distribution
More informationPREDICTION THE JOMINY CURVES BY MEANS OF NEURAL NETWORKS
Tomislav Filetin, Dubravko Majetić, Irena Žmak Faculty of Mechanical Engineering and Naval Architecture, University of Zagreb, Croatia PREDICTION THE JOMINY CURVES BY MEANS OF NEURAL NETWORKS ABSTRACT:
More informationMachine Learning Alternatives to Manual Knowledge Acquisition
Machine Learning Alternatives to Manual Knowledge Acquisition Interactive programs which elicit knowledge from the expert during the course of a conversation at the terminal. Programs which learn by scanning
More informationQualifying Exam in Machine Learning
Qualifying Exam in Machine Learning October 20, 2009 Instructions: Answer two out of the three questions in Part 1. In addition, answer two out of three questions in two additional parts (choose two parts
More informationCHAPTER 6 CONCLUSION AND FUTURE SCOPE
CHAPTER 6 CONCLUSION AND FUTURE SCOPE 146 CHAPTER 6 CONCLUSION AND FUTURE SCOPE 6.1 SUMMARY The first chapter of the thesis highlighted the need of accurate wind forecasting models in order to transform
More informationBasics of Multivariate Modelling and Data Analysis
Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 2. Overview of multivariate techniques 2.1 Different approaches to multivariate data analysis 2.2 Classification of multivariate techniques
More informationMachine Learning: Homework 5
0-60 Machine Learning: Homework 5 Due 5:0 p.m. Thursday, March, 06 TAs: Travis Dick and Han Zhao Instructions Late homework policy: Homework is worth full credit if submitted before the due date, half
More informationEvaluation Metrics for Intrusion Detection Systems - A Study
Evaluation Metrics for Intrusion Detection Systems - A Study Gulshan Kumar Assistant Professor, Shaheed Bhagat Singh State Technical Campus, Ferozepur (Punjab)-India 152004 Email: gulshanahuja@gmail.com
More informationFrom statistics to data science. BAE 815 (Fall 2017) Dr. Zifei Liu
From statistics to data science BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Why? How? What? How much? How many? Individual facts (quantities, characters, or symbols) The Data-Information-Knowledge-Wisdom
More informationDeep Generative Models. (Unsupervised Learning)
Deep Generative Models (Unsupervised Learning) CEng 783 Deep Learning Fall 2017 Emre Akbaş Reminders Next week: project progress demos in class Describe your problem/goal What you have done so far What
More informationGene Expression Data Classification with Revised Kernel Partial Least Squares Algorithm
Gene Expression Data Classification with Revised Kernel Partial Least Squares Algorithm Zhenqiu Liu, Dechang Chen 2 Department of Computer Science Wayne State University, Market Street, Frederick, MD 273,
More informationSupport Vector Ordinal Regression using Privileged Information
Support Vector Ordinal Regression using Privileged Information Fengzhen Tang 1, Peter Tiňo 2, Pedro Antonio Gutiérrez 3 and Huanhuan Chen 4 1,2,4- The University of Birmingham, School of Computer Science,
More informationRepresenting Material Structure. Surya R. Kalidindi. Funding: AFOSR-MURI, NIST
Representing Material Structure Surya R. Kalidindi Funding: AFOSR-MURI, NIST Hierarchical Material Structure Kalidindi and DeGraef, ARMS, 2015 Well-Separated Length Scales and RVEs Macroscale Object Mesoscale
More informationFinal Exam, Machine Learning, Spring 2009
Name: Andrew ID: Final Exam, 10701 Machine Learning, Spring 2009 - The exam is open-book, open-notes, no electronics other than calculators. - The maximum possible score on this exam is 100. You have 3
More informationReading, UK 1 2 Abstract
, pp.45-54 http://dx.doi.org/10.14257/ijseia.2013.7.5.05 A Case Study on the Application of Computational Intelligence to Identifying Relationships between Land use Characteristics and Damages caused by
More informationDecision Tree Learning
Decision Tree Learning Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Machine Learning, Chapter 3 2. Data Mining: Concepts, Models,
More informationApplied Machine Learning Annalisa Marsico
Applied Machine Learning Annalisa Marsico OWL RNA Bionformatics group Max Planck Institute for Molecular Genetics Free University of Berlin 22 April, SoSe 2015 Goals Feature Selection rather than Feature
More informationThe Changing Landscape of Land Administration
The Changing Landscape of Land Administration B r e n t J o n e s P E, PLS E s r i World s Largest Media Company No Journalists No Content Producers No Photographers World s Largest Hospitality Company
More informationBayesian Decision Theory
Introduction to Pattern Recognition [ Part 4 ] Mahdi Vasighi Remarks It is quite common to assume that the data in each class are adequately described by a Gaussian distribution. Bayesian classifier is
More informationEmpirical Risk Minimization, Model Selection, and Model Assessment
Empirical Risk Minimization, Model Selection, and Model Assessment CS6780 Advanced Machine Learning Spring 2015 Thorsten Joachims Cornell University Reading: Murphy 5.7-5.7.2.4, 6.5-6.5.3.1 Dietterich,
More informationMachine Learning for Signal Processing Bayes Classification and Regression
Machine Learning for Signal Processing Bayes Classification and Regression Instructor: Bhiksha Raj 11755/18797 1 Recap: KNN A very effective and simple way of performing classification Simple model: For
More informationSample questions for Fundamentals of Machine Learning 2018
Sample questions for Fundamentals of Machine Learning 2018 Teacher: Mohammad Emtiyaz Khan A few important informations: In the final exam, no electronic devices are allowed except a calculator. Make sure
More informationFinal Overview. Introduction to ML. Marek Petrik 4/25/2017
Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,
More informationInstitute for Functional Imaging of Materials (IFIM)
Institute for Functional Imaging of Materials (IFIM) Sergei V. Kalinin Guiding the design of materials tailored for functionality Dynamic matter: information dimension Static matter Functional matter Imaging
More informationCharacterization of Jet Charge at the LHC
Characterization of Jet Charge at the LHC Thomas Dylan Rueter, Krishna Soni Abstract The Large Hadron Collider (LHC) produces a staggering amount of data - about 30 petabytes annually. One of the largest
More informationLinear and Logistic Regression. Dr. Xiaowei Huang
Linear and Logistic Regression Dr. Xiaowei Huang https://cgi.csc.liv.ac.uk/~xiaowei/ Up to now, Two Classical Machine Learning Algorithms Decision tree learning K-nearest neighbor Model Evaluation Metrics
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationPredictive Analytics on Accident Data Using Rule Based and Discriminative Classifiers
Advances in Computational Sciences and Technology ISSN 0973-6107 Volume 10, Number 3 (2017) pp. 461-469 Research India Publications http://www.ripublication.com Predictive Analytics on Accident Data Using
More informationGlobal Scene Representations. Tilke Judd
Global Scene Representations Tilke Judd Papers Oliva and Torralba [2001] Fei Fei and Perona [2005] Labzebnik, Schmid and Ponce [2006] Commonalities Goal: Recognize natural scene categories Extract features
More informationA Gradient-based Adaptive Learning Framework for Efficient Personal Recommendation
A Gradient-based Adaptive Learning Framework for Efficient Personal Recommendation Yue Ning 1 Yue Shi 2 Liangjie Hong 2 Huzefa Rangwala 3 Naren Ramakrishnan 1 1 Virginia Tech 2 Yahoo Research. Yue Shi
More informationDETECTING HUMAN ACTIVITIES IN THE ARCTIC OCEAN BY CONSTRUCTING AND ANALYZING SUPER-RESOLUTION IMAGES FROM MODIS DATA INTRODUCTION
DETECTING HUMAN ACTIVITIES IN THE ARCTIC OCEAN BY CONSTRUCTING AND ANALYZING SUPER-RESOLUTION IMAGES FROM MODIS DATA Shizhi Chen and YingLi Tian Department of Electrical Engineering The City College of
More informationOutline Introduction OLS Design of experiments Regression. Metamodeling. ME598/494 Lecture. Max Yi Ren
1 / 34 Metamodeling ME598/494 Lecture Max Yi Ren Department of Mechanical Engineering, Arizona State University March 1, 2015 2 / 34 1. preliminaries 1.1 motivation 1.2 ordinary least square 1.3 information
More informationGaussian Process Regression with K-means Clustering for Very Short-Term Load Forecasting of Individual Buildings at Stanford
Gaussian Process Regression with K-means Clustering for Very Short-Term Load Forecasting of Individual Buildings at Stanford Carol Hsin Abstract The objective of this project is to return expected electricity
More informationDiscovery Through Situational Awareness
Discovery Through Situational Awareness BRETT AMIDAN JIM FOLLUM NICK BETZSOLD TIM YIN (UNIVERSITY OF WYOMING) SHIKHAR PANDEY (WASHINGTON STATE UNIVERSITY) Pacific Northwest National Laboratory February
More informationPrediction of Citations for Academic Papers
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationDecision Trees. CSC411/2515: Machine Learning and Data Mining, Winter 2018 Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore
Decision Trees Claude Monet, The Mulberry Tree Slides from Pedro Domingos, CSC411/2515: Machine Learning and Data Mining, Winter 2018 Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore Michael Guerzhoy
More informationComparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees
Comparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees Tomasz Maszczyk and W lodzis law Duch Department of Informatics, Nicolaus Copernicus University Grudzi adzka 5, 87-100 Toruń, Poland
More informationVC dimension, Model Selection and Performance Assessment for SVM and Other Machine Learning Algorithms
03/Feb/2010 VC dimension, Model Selection and Performance Assessment for SVM and Other Machine Learning Algorithms Presented by Andriy Temko Department of Electrical and Electronic Engineering Page 2 of
More informationGeneralization Error on Pruning Decision Trees
Generalization Error on Pruning Decision Trees Ryan R. Rosario Computer Science 269 Fall 2010 A decision tree is a predictive model that can be used for either classification or regression [3]. Decision
More informationHierarchical models for the rainfall forecast DATA MINING APPROACH
Hierarchical models for the rainfall forecast DATA MINING APPROACH Thanh-Nghi Do dtnghi@cit.ctu.edu.vn June - 2014 Introduction Problem large scale GCM small scale models Aim Statistical downscaling local
More informationScalable Bayesian Event Detection and Visualization
Scalable Bayesian Event Detection and Visualization Daniel B. Neill Carnegie Mellon University H.J. Heinz III College E-mail: neill@cs.cmu.edu This work was partially supported by NSF grants IIS-0916345,
More informationData Mining and Machine Learning (Machine Learning: Symbolische Ansätze)
Data Mining and Machine Learning (Machine Learning: Symbolische Ansätze) Learning Individual Rules and Subgroup Discovery Introduction Batch Learning Terminology Coverage Spaces Descriptive vs. Predictive
More informationA Deep Interpretation of Classifier Chains
A Deep Interpretation of Classifier Chains Jesse Read and Jaakko Holmén http://users.ics.aalto.fi/{jesse,jhollmen}/ Aalto University School of Science, Department of Information and Computer Science and
More information10-810: Advanced Algorithms and Models for Computational Biology. Optimal leaf ordering and classification
10-810: Advanced Algorithms and Models for Computational Biology Optimal leaf ordering and classification Hierarchical clustering As we mentioned, its one of the most popular methods for clustering gene
More informationStyle-aware Mid-level Representation for Discovering Visual Connections in Space and Time
Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time Experiment presentation for CS3710:Visual Recognition Presenter: Zitao Liu University of Pittsburgh ztliu@cs.pitt.edu
More informationDecision Trees. CSC411/2515: Machine Learning and Data Mining, Winter 2018 Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore
Decision Trees Claude Monet, The Mulberry Tree Slides from Pedro Domingos, CSC411/2515: Machine Learning and Data Mining, Winter 2018 Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore Michael Guerzhoy
More informationDecision Support. Dr. Johan Hagelbäck.
Decision Support Dr. Johan Hagelbäck johan.hagelback@lnu.se http://aiguy.org Decision Support One of the earliest AI problems was decision support The first solution to this problem was expert systems
More informationData Mining and Knowledge Discovery: Practice Notes
Data Mining and Knowledge Discovery: Practice Notes dr. Petra Kralj Novak Petra.Kralj.Novak@ijs.si 7.11.2017 1 Course Prof. Bojan Cestnik Data preparation Prof. Nada Lavrač: Data mining overview Advanced
More informationData Mining Based Anomaly Detection In PMU Measurements And Event Detection
Data Mining Based Anomaly Detection In PMU Measurements And Event Detection P. Banerjee, S. Pandey, M. Zhou, A. Srivastava, Y. Wu Smart Grid Demonstration and Research Investigation Lab (SGDRIL) Energy
More informationMachine Learning Practice Page 2 of 2 10/28/13
Machine Learning 10-701 Practice Page 2 of 2 10/28/13 1. True or False Please give an explanation for your answer, this is worth 1 pt/question. (a) (2 points) No classifier can do better than a naive Bayes
More informationWeighted Classification Cascades for Optimizing Discovery Significance
Weighted Classification Cascades for Optimizing Discovery Significance Lester Mackey Collaborators: Jordan Bryan and Man Yue Mo Stanford University December 13, 2014 Mackey (Stanford) Weighted Classification
More informationML in Practice: CMSC 422 Slides adapted from Prof. CARPUAT and Prof. Roth
ML in Practice: CMSC 422 Slides adapted from Prof. CARPUAT and Prof. Roth N-fold cross validation Instead of a single test-training split: train test Split data into N equal-sized parts Train and test
More informationCSCI-567: Machine Learning (Spring 2019)
CSCI-567: Machine Learning (Spring 2019) Prof. Victor Adamchik U of Southern California Mar. 19, 2019 March 19, 2019 1 / 43 Administration March 19, 2019 2 / 43 Administration TA3 is due this week March
More informationCOMP9444: Neural Networks. Vapnik Chervonenkis Dimension, PAC Learning and Structural Risk Minimization
: Neural Networks Vapnik Chervonenkis Dimension, PAC Learning and Structural Risk Minimization 11s2 VC-dimension and PAC-learning 1 How good a classifier does a learner produce? Training error is the precentage
More informationSelf Organizing Maps. We are drowning in information and starving for knowledge. A New Approach for Integrated Analysis of Geological Data.
Radiometrics Processed Landsat TM Self Organizing Maps A New Approach for Integrated Analysis of Geological Data. We are drowning in information and starving for knowledge. Rutherford D. Roger Stephen.Fraser@csiro.au
More informationMaking Our Cities Safer: A Study In Neighbhorhood Crime Patterns
Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns Aly Kane alykane@stanford.edu Ariel Sagalovsky asagalov@stanford.edu Abstract Equipped with an understanding of the factors that influence
More informationMachine Learning, Fall 2009: Midterm
10-601 Machine Learning, Fall 009: Midterm Monday, November nd hours 1. Personal info: Name: Andrew account: E-mail address:. You are permitted two pages of notes and a calculator. Please turn off all
More informationCPSC 340: Machine Learning and Data Mining. Linear Least Squares Fall 2016
CPSC 340: Machine Learning and Data Mining Linear Least Squares Fall 2016 Assignment 2 is due Friday: Admin You should already be started! 1 late day to hand it in on Wednesday, 2 for Friday, 3 for next
More informationData Mining. Supervised Learning. Hamid Beigy. Sharif University of Technology. Fall 1396
Data Mining Supervised Learning Hamid Beigy Sharif University of Technology Fall 1396 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1396 1 / 15 Table of contents 1 Introduction 2 Supervised
More informationhsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference
CS 229 Project Report (TR# MSB2010) Submitted 12/10/2010 hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference Muhammad Shoaib Sehgal Computer Science
More information