DATA ANALYTICS IN NANOMATERIALS DISCOVERY

Size: px
Start display at page:

Download "DATA ANALYTICS IN NANOMATERIALS DISCOVERY"

Transcription

1 DATA ANALYTICS IN NANOMATERIALS DISCOVERY Michael Fernandez OCE-Postdoctoral Fellow September

2 Materials Discovery Process Materials Genome Project Integrating computational methods and information with sophisticated computational and analytical tools to shorten the duration of materials development from years to 2 or 3 years. 3 Data Analytics for Nanomaterials Michael Fernandez

3 Materials and Molecular Modeling Sun Baichuan Team leader Amanda Barnard Collaborators Big data and HPC integration Piotr Szul Yulia Arzhaeva Deep learning and GPU computations Chris Watkins 2 Data Analytics for Nanomaterials Michael Fernandez

4 Outline Material Discovery Process Methods for atomistic simulations of materials Experimental confirmation Data-driven Computatioanl Nanomaterials Discovery Hypothetical material space sampling (structure generation and statistics) In silico high-throughput characterization (atomistic simulations and machine learning) Data storage, analytics, exploitation and integration 2 Data Analytics for Nanomaterials Michael Fernandez

5 Atomistic Simulations of Materials Theory can predict properties of materials Quantum chemistry methods can cover any chemistry Empirical potentials exist for large number of elements Computation is scalable and generically deployable N H = E nuclei ({R I })-σ e i=1 2 i + V nuclei r i + 1 σ N e 1 2 i j r i r j 4 Data Analytics for Nanomaterials Michael Fernandez

6 Atomistic Simulations of Materials Computational predictions later confirmed by experiments Self-assembly mechanism of nanodiamonds Barnard, A. S. et al. Nanoscale 3, (2011) DFTB simulations of the surface electrostatic potential of dodecahedral diamond nanoparticles of a) 2.2 nm and b) 2.5 nm The arrow indicates (111) (111) interface between two 4 nm sized nanodiamonds. 5 Data Analytics for Nanomaterials Michael Fernandez

7 Modern Nano- materials discovery cycle Hypothetical materials Experiment design Materials libraries Database Performance Measurement Polydispersive systems Exponential increase in complexity and diversity Nearly infinite combinatorial problem Theory, modeling and informatics In-silico screening Potyrailo, R. et al. ACS Comb. Sci. 13, (2011). Lead materials scale up Knowledge for rational design Banks of materials 6 Data Analytics for Nanomaterials Michael Fernandez

8 Nanomaterials Screening Departing from the Edisonian approach We can accurately predict a property, so it can be computed for entire materials spaces In silico structure generation vs. Combinatorial In silico design 7 Data Analytics for Nanomaterials Michael Fernandez

9 Nanomaterials Screening Systematic and extensive materials performance ranking. Big data discovery of structure-property relationships in unknown materials domains. Accelerated identification of high potential candidates and rational design principles. 7 Data Analytics for Nanomaterials Michael Fernandez

10 Data Analytics Challenges Information representation Fingerprints Information extraction Multivariate statistical analysis Knowledge discovery Data mining and machine learning Knowledge representation Visualization 7 Data Analytics for Nanomaterials Michael Fernandez

11 Polydispersity Challenge in Nanomaterials $$$ Controlled synthesis Purification Polydispersive sample Quasi-monodisperse sample Polydispersity can be detrimental for high-performing applications Purification of polydispersive nanoparticles samples is expensive 9 Data Analytics for Nanomaterials Michael Fernandez

12 Data Analytics of Nanocarbons Virtual structures relaxed using TB-DFT Nanodiamonds Graphenes 10 Data Analytics for Nanomaterials Michael Fernandez

13 Archetypal Analysis (AA) Finds a k m matrix Z that corresponds to the archetypal or pure patterns in the data in such a way that each data point can be represented as a mixture of those archetypes. In other words, the archetypal analysis yields the two n k coefficient matrices α and β, which minimize the residual sum of squares: Cutler, A. & Breiman, L. Archetypal Analysis. Technometrics 36, (1994). The predictors of X i are finite mixtures of archetypes Z j, which are convex combinations of the observations. 10 Data Analytics for Nanomaterials Michael Fernandez

14 Archetypal Analysis of Nanocarbons Nanodiamonds Graphene nanoflakes Fernandez, M. & Barnard, A. S. ACS Nano 9, (2015). 11 Data Analytics for Nanomaterials Michael Fernandez

15 Nanocarbons Prototypes Nanodiamonds prototypes Graphene prototypes Fernandez, M. & Barnard, A. S. ACS Nano 9, (2015). 13 Data Analytics for Nanomaterials Michael Fernandez

16 Estimation of Nanodiamonds Properties Fernandez, M. & Barnard, A. S. ACS Nano 9, (2015). 13 Data Analytics for Nanomaterials Michael Fernandez

17 Structural Diversity Challenge Graphene nanoflakes Trigonal Rectangular Hexagonal Defects, oxidation and edge passivation yield large structural diversity 13 Data Analytics for Nanomaterials Michael Fernandez

18 Structural Diversity Challenge Silicon qbits P P P Single Si substitution by P yields a large structural diversity 13 Data Analytics for Nanomaterials Michael Fernandez

19 Structural Diversity Challenge Metal-Organic Framework (MOF) ZIF-68 benzimidazole nitroimidazole 13 Data Analytics for Nanomaterials Michael Fernandez Zn 2+ CO 2 capture and sequestration (Science, 2008)

20 Structural Diversity Challenge Metal-Organic Framework (MOF) In-silico Combinatorial design 13 Data Analytics for Nanomaterials Michael Fernandez

21 Structural Diversity Challenge Metal-Organic Framework (MOF) MOF ZBP a) Organic Linkers b) Modification with of 35 functional groups gives a total of ~1.5 million (35 4 ) unique combinations 13 Data Analytics for Nanomaterials Michael Fernandez

22 Machine Learning Approach Machine learning prediction of functional properties Feature fingerprints Machine learning 13 Data Analytics for Nanomaterials Michael Fernandez

23 Data Analytics Challenge Binary decision tree of the Band Gap of graphene Accuracy 80% Features Surface area Number of atoms Shape aspect ratio Fernandez, M., Shi, H. & Barnard, A. S Carbon (2016). doi: /j.carbon Data Analytics for Nanomaterials Michael Fernandez

24 Machine Learning vs. Atomistic Simulations Estimation of the graphene Band Gap from topological features P i and P j, are the values of a bond order of the carbon atoms in graphene, while L is the topological distance, whilst L ij is a delta function delta function Fernandez, M. et al. ACS Comb. Sci. (2016) doi: /acscombsci.6b Data Analytics for Nanomaterials Michael Fernandez

25 Machine Learning of Graphene Radial Distribution Function (RDF) scores for graphene the summation is over the N atom pairs in the graphene structure, and r ij is the distance of these pairs and B is a smoothing parameter set to 10. Fernandez, M.; Shi, H.; Barnard, A et al. J. Chem. Inf. Model. (2015), 55, Data Analytics for Nanomaterials Michael Fernandez

26 Machine Learning vs. Atomistic Simulations Energy of the Fermi level Ionization Potential From RDF scores Fernandez, M.; Shi, H.; Barnard, A et al. J. Chem. Inf. Model. (2015), 55, Data Analytics for Nanomaterials Michael Fernandez

27 Machine Learning vs. Atomistic Simulations Machine learning prediction of gas adsorption in MOF Fernandez, M., et al. J. Phys. Chem. C 117, (2013). 13 Data Analytics for Nanomaterials Michael Fernandez

28 System size Accuracy and Coverage Challenges 5000 atoms TBDF/ Semiempirical 500 atoms Accuracy of electronic calculations methods vs. system size Density Functional Quantum Monte Carlo Coupled Cluster 100 atoms 20 atoms Accuracy 13 Data Analytics for Nanomaterials Michael Fernandez

29 Data-driven Challenge Machine learning for large material spaces Full Database Predictions Machine Learning 3 Partial Database Screening Ramakrishnan, R. et al. J. Chem. Theory Comput. 11, (2015). Fernandez, M et al. J. Chem. Inf. Model. (2015), 55, Machine learning predictions: -Functional property value or threshold =f( ) structure -Accuracy of different quantum-chemistry methods 15 Data Analytics for Nanomaterials Michael Fernandez

30 Challenges and Limitations Accuracy Gap Between Different Levels of Theory 6,095 isomers of C 7 H 10 O 2 E B3LYP QMC big gap big gap 13 Data Analytics for Nanomaterials Michael Fernandez

31 Accuracy Gap Predictions Predictions Machine learning calibration 13 Data Analytics for Nanomaterials Michael Fernandez

32 Data-driven High-throughput Screening Computational resources Input jobs Queue management Resubmit or kill failed runs Finished runs Data storage outputs Accuracy refinement 13 Data Analytics for Nanomaterials Michael Fernandez

33 Self-organization Map (SOM) of NPs SOM Ag-NP Electrostatic Potential 13 Data Analytics for Nanomaterials Michael Fernandez

34 Deep Learning for Nanomaterials Feature extraction Classification input 32x32 C 1 Feature maps 28x28 C 2 Feature maps S 1 10x10 Feature maps 14x14 S 2 Feature maps 5x5 B1 B2 Output Image 32x32 5x5 convolution 2x2 subsampling 5x5 convolution 2x2 subsampling Fully connected 13 Data Analytics for Nanomaterials Michael Fernandez

35 Thank you Data61 Michael Fernandez e michael.fernandezllamosa@csiro.au w

Institute for Functional Imaging of Materials (IFIM)

Institute for Functional Imaging of Materials (IFIM) Institute for Functional Imaging of Materials (IFIM) Sergei V. Kalinin Guiding the design of materials tailored for functionality Dynamic matter: information dimension Static matter Functional matter Imaging

More information

Computational Materials Design and Discovery Energy and Electronic Applications Synthesis Structure Properties

Computational Materials Design and Discovery Energy and Electronic Applications Synthesis Structure Properties Computational Materials Design and Discovery Energy and Electronic Applications Synthesis Structure Properties Supercapacitors Rechargeable batteries Supercomputer Photocatalysts Fuel cell catalysts First

More information

MultiscaleMaterialsDesignUsingInformatics. S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA

MultiscaleMaterialsDesignUsingInformatics. S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA MultiscaleMaterialsDesignUsingInformatics S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA9550-12-1-0458 1 Hierarchical Material Structure Kalidindi and DeGraef, ARMS, 2015 Main Challenges

More information

Material Surfaces, Grain Boundaries and Interfaces: Structure-Property Relationship Predictions

Material Surfaces, Grain Boundaries and Interfaces: Structure-Property Relationship Predictions Material Surfaces, Grain Boundaries and Interfaces: Structure-Property Relationship Predictions Susan B. Sinnott Department of Materials Science and Engineering Penn State University September 16, 2016

More information

New Materials and Process Development for Energy-Efficient Carbon Capture in the Presence of Water Vapor

New Materials and Process Development for Energy-Efficient Carbon Capture in the Presence of Water Vapor New Materials and Process Development for Energy-Efficient Carbon Capture in the Presence of Water Vapor Randy Snurr, 1 Joe Hupp, 2 Omar Farha, 2 Fengqi You 1 1 Department of Chemical & Biological Engineering

More information

Introduction. Monday, January 6, 14

Introduction. Monday, January 6, 14 Introduction 1 Introduction Why to use a simulation Some examples of questions we can address 2 Molecular Simulations Molecular dynamics: solve equations of motion Monte Carlo: importance sampling Calculate

More information

Machine learning for ligand-based virtual screening and chemogenomics!

Machine learning for ligand-based virtual screening and chemogenomics! Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:

More information

A report (dated September 20, 2011) on. scientific research carried out under Grant: FA

A report (dated September 20, 2011) on. scientific research carried out under Grant: FA A report (dated September 20, 2011) on scientific research carried out under Grant: FA2386-10-1-4150 First-principles determination of thermal properties in nano-structured hexagonal solids with doping

More information

Electronic structure and transport in silicon nanostructures with non-ideal bonding environments

Electronic structure and transport in silicon nanostructures with non-ideal bonding environments Purdue University Purdue e-pubs Other Nanotechnology Publications Birck Nanotechnology Center 9-15-2008 Electronic structure and transport in silicon nanostructures with non-ideal bonding environments

More information

(S1) (S2) = =8.16

(S1) (S2) = =8.16 Formulae for Attributes As described in the manuscript, the first step of our method to create a predictive model of material properties is to compute attributes based on the composition of materials.

More information

Application of Machine Learning to Materials Discovery and Development

Application of Machine Learning to Materials Discovery and Development Application of Machine Learning to Materials Discovery and Development Ankit Agrawal and Alok Choudhary Department of Electrical Engineering and Computer Science Northwestern University {ankitag,choudhar}@eecs.northwestern.edu

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

André Schleife Department of Materials Science and Engineering

André Schleife Department of Materials Science and Engineering André Schleife Department of Materials Science and Engineering Yesterday you (should have) learned this: http://upload.wikimedia.org/wikipedia/commons/e/ea/ Simple_Harmonic_Motion_Orbit.gif 1. deterministic

More information

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30 NED Mission: Provide a comprehensive, reliable and easy-to-use synthesis of multi-wavelength data from NASA missions, published catalogs, and the refereed literature, to enhance and enable astrophysical

More information

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Cheminformatics Role in Pharmaceutical Industry Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Agenda The big picture for pharmaceutical industry Current technological/scientific issues Types

More information

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013 Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and

More information

Modeling and Computation Core (MCC)

Modeling and Computation Core (MCC) List of Research by Research Cluster Modeling and Computation Core (MCC) GOAL 1: Develop multiscale theories and materials databank that complement experimental approaches for materials design Objective

More information

Defects and diffusion in metal oxides: Challenges for first-principles modelling

Defects and diffusion in metal oxides: Challenges for first-principles modelling Defects and diffusion in metal oxides: Challenges for first-principles modelling Karsten Albe, FG Materialmodellierung, TU Darmstadt Johan Pohl, Peter Agoston, Paul Erhart, Manuel Diehm FUNDING: ICTP Workshop

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear

More information

Topology based deep learning for biomolecular data

Topology based deep learning for biomolecular data Topology based deep learning for biomolecular data Guowei Wei Departments of Mathematics Michigan State University http://www.math.msu.edu/~wei American Institute of Mathematics July 23-28, 2017 Grant

More information

Kd = koff/kon = [R][L]/[RL]

Kd = koff/kon = [R][L]/[RL] Taller de docking y cribado virtual: Uso de herramientas computacionales en el diseño de fármacos Docking program GLIDE El programa de docking GLIDE Sonsoles Martín-Santamaría Shrödinger is a scientific

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

Early Stages of Drug Discovery in the Pharmaceutical Industry

Early Stages of Drug Discovery in the Pharmaceutical Industry Early Stages of Drug Discovery in the Pharmaceutical Industry Daniel Seeliger / Jan Kriegl, Discovery Research, Boehringer Ingelheim September 29, 2016 Historical Drug Discovery From Accidential Discovery

More information

Deep learning quantum matter

Deep learning quantum matter Deep learning quantum matter Machine-learning approaches to the quantum many-body problem Joaquín E. Drut & Andrew C. Loheac University of North Carolina at Chapel Hill SAS, September 2018 Outline Why

More information

Large scale classification of chemical reactions from patent data

Large scale classification of chemical reactions from patent data Large scale classification of chemical reactions from patent data Gregory Landrum NIBR Informatics, Basel Novartis Institutes for BioMedical Research 10th International Conference on Chemical Structures/

More information

Combinatorial Heterogeneous Catalysis

Combinatorial Heterogeneous Catalysis Combinatorial Heterogeneous Catalysis 650 μm by 650 μm, spaced 100 μm apart Identification of a new blue photoluminescent (PL) composite material, Gd 3 Ga 5 O 12 /SiO 2 Science 13 March 1998: Vol. 279

More information

GECP Hydrogen Project: "Nanomaterials Engineering for Hydrogen Storage"

GECP Hydrogen Project: Nanomaterials Engineering for Hydrogen Storage GECP Hydrogen Project: "Nanomaterials Engineering for Hydrogen Storage" PI: KJ Cho Students and Staff Members: Zhiyong Zhang, Wei Xiao, Byeongchan Lee, Experimental Collaboration: H. Dai, B. Clemens, A.

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

Introduction to Benchmark Test for Multi-scale Computational Materials Software

Introduction to Benchmark Test for Multi-scale Computational Materials Software Introduction to Benchmark Test for Multi-scale Computational Materials Software Shun Xu*, Jian Zhang, Zhong Jin xushun@sccas.cn Computer Network Information Center Chinese Academy of Sciences (IPCC member)

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput

More information

Controlled healing of graphene nanopore

Controlled healing of graphene nanopore Controlled healing of graphene nanopore Konstantin Zakharchenko Alexander Balatsky Zakharchenko K.V., Balatsky A.V. Controlled healing of graphene nanopore. Carbon (80), December 2014, pp. 12 18. http://dx.doi.org/10.1016/j.carbon.2014.07.085

More information

Wulff construction and molecular dynamics simulations for Au nanoparticles

Wulff construction and molecular dynamics simulations for Au nanoparticles J. Chem. Eng. Chem. Res. Journal of Chemical Engineering and Chemistry Research Vol. **, No. **, 2014, pp. Received: ** **, 2014, Published: ** **, 2014 Wulff construction and molecular dynamics simulations

More information

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS 179 Molecular Informatics: Confronting Complexity, May 13 th - 16 th 2002, Bozen, Italy PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS CARLETON R. SAGE, KEVIN R. HOLME, NIANISH

More information

Strong Correlation Effects in Fullerene Molecules and Solids

Strong Correlation Effects in Fullerene Molecules and Solids Strong Correlation Effects in Fullerene Molecules and Solids Fei Lin Physics Department, Virginia Tech, Blacksburg, VA 2461 Fei Lin (Virginia Tech) Correlations in Fullerene SESAPS 211, Roanoke, VA 1 /

More information

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations Capturing Chemistry What you see is what you get In the world of mechanism and chemical transformations Dr. Stephan Schürer ead of Intl. Sci. Content Libraria, Inc. sschurer@libraria.com Distribution of

More information

Hydrogenated Graphene

Hydrogenated Graphene Hydrogenated Graphene Stefan Heun NEST, Istituto Nanoscienze-CNR and Scuola Normale Superiore Pisa, Italy Outline Epitaxial Graphene Hydrogen Chemisorbed on Graphene Hydrogen-Intercalated Graphene Outline

More information

Statistical Clustering of Vesicle Patterns Practical Aspects of the Analysis of Large Datasets with R

Statistical Clustering of Vesicle Patterns Practical Aspects of the Analysis of Large Datasets with R Statistical Clustering of Vesicle Patterns Mirko Birbaumer Rmetrics Workshop 3th July 2008 1 / 23 Statistical Clustering of Vesicle Patterns Practical Aspects of the Analysis of Large Datasets with R Mirko

More information

Nano Materials and Devices

Nano Materials and Devices Nano Materials and Devices Professor Michael Austin Platform Technologies Research Institute Nano Materials and Devices Program Aim: to develop an integrated capability in nanotechnology Design and modelling

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

Quantum Mechanics, Chemical Space, And Machine Learning Or

Quantum Mechanics, Chemical Space, And Machine Learning Or Quantum Mechanics, Chemical Space, And Machine Learning Or B Anatole von Lilienfeld Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL),

More information

How many materials are left to discover?

How many materials are left to discover? How many materials are left to discover? An exploration of quaternary space Presentation by Michael Sluydts Research performed by Michael Sluydts, Michiel Larmuseau, Karel Dumon, Titus Crepain, Kurt Lejaeghere

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Modeling Coating Thermal Noise at Gravitational Wave Detectors

Modeling Coating Thermal Noise at Gravitational Wave Detectors Modeling Coating Thermal Noise at Gravitational Wave Detectors Hai-Ping Cheng, Chris Billman Quantum Theory Project, University of Florida GWADW 5/24/2015 LIGO-G1601068 1 Set Goal Computational Screening

More information

Report on Atomistic Modeling of Bonding in Carbon-Based Nanostructures

Report on Atomistic Modeling of Bonding in Carbon-Based Nanostructures Report on Atomistic Modeling of Bonding in Carbon-Based Nanostructures Timothy Stillings Department of Physics, Astronomy and Materials Science Missouri State University Advisor: Ridwan Sakidja Abstract

More information

MATH36061 Convex Optimization

MATH36061 Convex Optimization M\cr NA Manchester Numerical Analysis MATH36061 Convex Optimization Martin Lotz School of Mathematics The University of Manchester Manchester, September 26, 2017 Outline General information What is optimization?

More information

Potentials, periodicity

Potentials, periodicity Potentials, periodicity Lecture 2 1/23/18 1 Survey responses 2 Topic requests DFT (10), Molecular dynamics (7), Monte Carlo (5) Machine Learning (4), High-throughput, Databases (4) NEB, phonons, Non-equilibrium

More information

2. Surface geometric and electronic structure: a primer

2. Surface geometric and electronic structure: a primer 2. Surface geometric and electronic structure: a primer 2.1 Surface crystallography 2.1.1. Crystal structures - A crystal structure is made up of two basic elements: lattice + basis Basis: Lattice: simplest

More information

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver

More information

RESEARCH HIGHLIGHTS. Computationally-guided Design of Polymer Electrolytes

RESEARCH HIGHLIGHTS. Computationally-guided Design of Polymer Electrolytes RESEARCH HIGHLIGHTS From the Resnick Sustainability Institute Graduate Research Fellows at the California Institute of Technology Computationally-guided Design of Polymer Electrolytes Global Significance

More information

Nanomaterials special properties are primarily due to size and shape

Nanomaterials special properties are primarily due to size and shape Nanomaterials special properties are primarily due to size and shape Office of Basic Energy Sciences Office of Science, U.S. DOE Version 05-26-06, pmd inflow ATMOSPHERE Particle Phase outflow desorption

More information

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.

Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted

More information

O WILEY- MODERN NUCLEAR CHEMISTRY. WALTER D. LOVELAND Oregon State University. DAVID J. MORRISSEY Michigan State University

O WILEY- MODERN NUCLEAR CHEMISTRY. WALTER D. LOVELAND Oregon State University. DAVID J. MORRISSEY Michigan State University MODERN NUCLEAR CHEMISTRY WALTER D. LOVELAND Oregon State University DAVID J. MORRISSEY Michigan State University GLENN T. SEABORG University of California, Berkeley O WILEY- INTERSCIENCE A JOHN WILEY &

More information

Exploring the chemical space of screening results

Exploring the chemical space of screening results Exploring the chemical space of screening results Edmund Champness, Matthew Segall, Chris Leeding, James Chisholm, Iskander Yusof, Nick Foster, Hector Martinez ACS Spring 2013, 7 th April 2013 Optibrium,

More information

A phylogenomic toolbox for assembling the tree of life

A phylogenomic toolbox for assembling the tree of life A phylogenomic toolbox for assembling the tree of life or, The Phylota Project (http://www.phylota.org) UC Davis Mike Sanderson Amy Driskell U Pennsylvania Junhyong Kim Iowa State Oliver Eulenstein David

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information

Nanoscale Modeling and Simulation. George C. Schatz Northwestern University

Nanoscale Modeling and Simulation. George C. Schatz Northwestern University Nanoscale Modeling and Simulation George C. Schatz Northwestern University Where can simulation play a role in nanoscience, both now and in the future? 1. Structures of disordered nanomaterials: peptide

More information

Molecular and Electronic Dynamics Using the OpenAtom Software

Molecular and Electronic Dynamics Using the OpenAtom Software Molecular and Electronic Dynamics Using the OpenAtom Software Sohrab Ismail-Beigi (Yale Applied Physics) Subhasish Mandal (Yale), Minjung Kim (Yale), Raghavendra Kanakagiri (UIUC), Kavitha Chandrasekar

More information

Stephen McDonald and Mark D. Wrona Waters Corporation, Milford, MA, USA INT RO DU C T ION. Batch-from-Structure Analysis WAT E R S SO LU T IONS

Stephen McDonald and Mark D. Wrona Waters Corporation, Milford, MA, USA INT RO DU C T ION. Batch-from-Structure Analysis WAT E R S SO LU T IONS Chemical Intelligence in UNIFI: Enhancing in-silico Decision Making to Provide Confident and Accurate Results for Metabolite Identification Experiments Stephen McDonald and Mark D. Wrona Waters Corporation,

More information

Binding energy of 2D materials using Quantum Monte Carlo

Binding energy of 2D materials using Quantum Monte Carlo Quantum Monte Carlo in the Apuan Alps IX International Workshop, 26th July to 2nd August 2014 The Apuan Alps Centre for Physics @ TTI, Vallico Sotto, Tuscany, Italy Binding energy of 2D materials using

More information

Breakout Report on Correlated Materials. Identification of Grand Challenges

Breakout Report on Correlated Materials. Identification of Grand Challenges Breakout Report on Correlated Materials Identification of Grand Challenges Breakout - correlated materials Chairs: Littlewood (Argonne), Parkin (IBM) Speakers: Terris (HGST), Kotliar(Rutgers) Correlated

More information

Improving L-BFGS Initialization for Trust-Region Methods in Deep Learning

Improving L-BFGS Initialization for Trust-Region Methods in Deep Learning Improving L-BFGS Initialization for Trust-Region Methods in Deep Learning Jacob Rafati http://rafati.net jrafatiheravi@ucmerced.edu Ph.D. Candidate, Electrical Engineering and Computer Science University

More information

Probabilistic photometric redshifts in the era of Petascale Astronomy

Probabilistic photometric redshifts in the era of Petascale Astronomy Probabilistic photometric redshifts in the era of Petascale Astronomy Matías Carrasco Kind NCSA/Department of Astronomy University of Illinois at Urbana-Champaign Tools for Astronomical Big Data March

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 2. Overview of multivariate techniques 2.1 Different approaches to multivariate data analysis 2.2 Classification of multivariate techniques

More information

Machine Learning Overview

Machine Learning Overview Machine Learning Overview Sargur N. Srihari University at Buffalo, State University of New York USA 1 Outline 1. What is Machine Learning (ML)? 2. Types of Information Processing Problems Solved 1. Regression

More information

A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery

A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery AtomNet A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery Izhar Wallach, Michael Dzamba, Abraham Heifets Victor Storchan, Institute for Computational and

More information

Expérimentation haut-débit Science ou loterie? David FARRUSSENG

Expérimentation haut-débit Science ou loterie? David FARRUSSENG Expérimentation haut-débit Science ou loterie? David FARRUSSENG 10 years of HT Catalysis Pharma Fine Chemicals Polymers Chemicals Refining Green chemistry 1970s 1980s 1990s 1999 2000 2001 2008 Symyx hte

More information

Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method

Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method Ágnes Peragovics Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method PhD Theses Supervisor: András Málnási-Csizmadia DSc. Associate Professor Structural Biochemistry Doctoral

More information

Application of Optimization Methods and Edge AI

Application of Optimization Methods and Edge AI and Hai-Liang Zhao hliangzhao97@gmail.com November 17, 2018 This slide can be downloaded at Link. Outline 1 How to Design Novel Models with Methods Embedded Naturally? Outline 1 How to Design Novel Models

More information

Carbon Nanomaterials: Nanotubes and Nanobuds and Graphene towards new products 2030

Carbon Nanomaterials: Nanotubes and Nanobuds and Graphene towards new products 2030 Carbon Nanomaterials: Nanotubes and Nanobuds and Graphene towards new products 2030 Prof. Dr. Esko I. Kauppinen Helsinki University of Technology (TKK) Espoo, Finland Forecast Seminar February 13, 2009

More information

High pressure core structures of Si nanoparticles for solar energy conversion

High pressure core structures of Si nanoparticles for solar energy conversion High pressure core structures of Si nanoparticles for solar energy conversion S. Wippermann, M. Vörös, D. Rocca, A. Gali, G. Zimanyi, G. Galli [Phys. Rev. Lett. 11, 4684 (213)] NSF/Solar DMR-135468 NISE-project

More information

Constraint Reasoning and Kernel Clustering for Pattern Decomposition With Scaling

Constraint Reasoning and Kernel Clustering for Pattern Decomposition With Scaling Constraint Reasoning and Kernel Clustering for Pattern Decomposition With Scaling Ronan LeBras Theodoros Damoulas Ashish Sabharwal Carla P. Gomes John M. Gregoire Bruce van Dover Computer Science Computer

More information

Relative Stability of Highly Charged Fullerenes Produced in Energetic Collisions

Relative Stability of Highly Charged Fullerenes Produced in Energetic Collisions Relative Stability of Highly Charged Fullerenes Produced in Energetic Collisions Yang Wang Departamento de Química, Módulo 13 & Institute for Advanced Research in Chemical Sciences (IAdChem) Universidad

More information

Chemical Space: Modeling Exploration & Understanding

Chemical Space: Modeling Exploration & Understanding verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK

More information

Machine Learning with Quantum-Inspired Tensor Networks

Machine Learning with Quantum-Inspired Tensor Networks Machine Learning with Quantum-Inspired Tensor Networks E.M. Stoudenmire and David J. Schwab Advances in Neural Information Processing 29 arxiv:1605.05775 RIKEN AICS - Mar 2017 Collaboration with David

More information

An Integrated Approach to in-silico

An Integrated Approach to in-silico An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals

More information

Interatomic potentials with error bars. Gábor Csányi Engineering Laboratory

Interatomic potentials with error bars. Gábor Csányi Engineering Laboratory Interatomic potentials with error bars Gábor Csányi Engineering Laboratory What makes a potential Ingredients Desirable properties Representation of atomic neighbourhood smoothness, faithfulness, continuity

More information

Opportunities for Spectroscopic Analysis with ALMA (and EVLA)

Opportunities for Spectroscopic Analysis with ALMA (and EVLA) Opportunities for Spectroscopic Analysis with ALMA (and EVLA) Brooks Pate Department of Chemistry University of Virginia Tony Remijan (NRAO), Phil Jewel (NRAO), Mike McCarthy (CfA), Susanna Widicus Weaver

More information

Quantum Computing & Scientific Applications at ORNL

Quantum Computing & Scientific Applications at ORNL Quantum Computing & Scientific Applications at ORNL Pavel Lougovski Oak Ridge National Laboratory ORNL is managed by UT-Battelle for the US Department of Energy ORNL at a glance: DOE s largest science

More information

Computational and Statistical Aspects of Statistical Machine Learning. John Lafferty Department of Statistics Retreat Gleacher Center

Computational and Statistical Aspects of Statistical Machine Learning. John Lafferty Department of Statistics Retreat Gleacher Center Computational and Statistical Aspects of Statistical Machine Learning John Lafferty Department of Statistics Retreat Gleacher Center Outline Modern nonparametric inference for high dimensional data Nonparametric

More information

Si-nanoparticles embedded in solid matrices for solar energy conversion: electronic and optical properties from first principles

Si-nanoparticles embedded in solid matrices for solar energy conversion: electronic and optical properties from first principles Si-nanoparticles embedded in solid matrices for solar energy conversion: electronic and optical properties from first principles S. Wippermann, M. Vörös, D. Rocca, T. Li, A. Gali, G. Zimanyi, F. Gygi,

More information

Maximum Margin Matrix Factorization for Collaborative Ranking

Maximum Margin Matrix Factorization for Collaborative Ranking Maximum Margin Matrix Factorization for Collaborative Ranking Joint work with Quoc Le, Alexandros Karatzoglou and Markus Weimer Alexander J. Smola sml.nicta.com.au Statistical Machine Learning Program

More information

CS145: INTRODUCTION TO DATA MINING

CS145: INTRODUCTION TO DATA MINING CS145: INTRODUCTION TO DATA MINING 5: Vector Data: Support Vector Machine Instructor: Yizhou Sun yzsun@cs.ucla.edu October 18, 2017 Homework 1 Announcements Due end of the day of this Thursday (11:59pm)

More information

9.2 Support Vector Machines 159

9.2 Support Vector Machines 159 9.2 Support Vector Machines 159 9.2.3 Kernel Methods We have all the tools together now to make an exciting step. Let us summarize our findings. We are interested in regularized estimation problems of

More information

Computational Materials Science. Krishnendu Biswas CY03D0031

Computational Materials Science. Krishnendu Biswas CY03D0031 Computational Materials Science Krishnendu Biswas CY03D0031 Outline Introduction Fundamentals How to start Application examples Softwares Sophisticated methods Summary References 2 Introduction Uses computers

More information

Machine Learning for the Materials Scientist

Machine Learning for the Materials Scientist Machine Learning for the Materials Scientist Chris Fischer*, Kevin Tibbetts, Gerbrand Ceder Massachusetts Institute of Technology, Cambridge, MA Dane Morgan University of Wisconsin, Madison, WI Motivation:

More information

Design of Manufacturing Systems Manufacturing Cells

Design of Manufacturing Systems Manufacturing Cells Design of Manufacturing Systems Manufacturing Cells Outline General features Examples Strengths and weaknesses Group technology steps System design Virtual cellular manufacturing 2 Manufacturing cells

More information

Classification Using Decision Trees

Classification Using Decision Trees Classification Using Decision Trees 1. Introduction Data mining term is mainly used for the specific set of six activities namely Classification, Estimation, Prediction, Affinity grouping or Association

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

Supplementary Figure 1(a) The trajectory of the levitated pyrolytic graphite test sample (blue curve) and

Supplementary Figure 1(a) The trajectory of the levitated pyrolytic graphite test sample (blue curve) and Supplementary Figure 1(a) The trajectory of the levitated pyrolytic graphite test sample (blue curve) and the envelope from free vibration theory (red curve). (b) The FFT of the displacement-time curve

More information

Observation of topological phenomena in a programmable lattice of 1800 superconducting qubits

Observation of topological phenomena in a programmable lattice of 1800 superconducting qubits Observation of topological phenomena in a programmable lattice of 18 superconducting qubits Andrew D. King Qubits North America 218 Nature 56 456 46, 218 Interdisciplinary teamwork Theory Simulation QA

More information

Detecting unfolded regions in protein sequences. Anne Poupon Génomique Structurale de la Levure IBBMC Université Paris-Sud / CNRS France

Detecting unfolded regions in protein sequences. Anne Poupon Génomique Structurale de la Levure IBBMC Université Paris-Sud / CNRS France Detecting unfolded regions in protein sequences Anne Poupon Génomique Structurale de la Levure IBBMC Université Paris-Sud / CNRS France Large proteins and complexes: a domain approach Structural studies

More information

Currently, worldwide major semiconductor alloy epitaxial growth is divided into two material groups.

Currently, worldwide major semiconductor alloy epitaxial growth is divided into two material groups. ICQNM 2014 Currently, worldwide major semiconductor alloy epitaxial growth is divided into two material groups. Cubic: Diamond structures: group IV semiconductors (Si, Ge, C), Cubic zinc-blende structures:

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

Numerical Statistics and Quantum Algorithms. Valerii Fedorov ICON

Numerical Statistics and Quantum Algorithms. Valerii Fedorov ICON Numerical Statistics and Quantum Algorithms Valerii Fedorov ICON 1 Quantum Computing and Healthcare Statistics In partnership with ICON, Lockheed Martin is exploring computational challenges in healthcare

More information

Supporting Information. Kinetically-Driven Phase Transformation during Lithiation in Copper Sulfide Nanoflakes

Supporting Information. Kinetically-Driven Phase Transformation during Lithiation in Copper Sulfide Nanoflakes Supporting Information Kinetically-Driven Phase Transformation during Lithiation in Copper Sulfide Nanoflakes Kai He,*,, Zhenpeng Yao, Sooyeon Hwang, Na Li, Ke Sun, Hong Gan, Yaping Du, Hua Zhang, Chris

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

Computational Methods for Mass Spectrometry Proteomics

Computational Methods for Mass Spectrometry Proteomics Computational Methods for Mass Spectrometry Proteomics Eidhammer, Ingvar ISBN-13: 9780470512975 Table of Contents Preface. Acknowledgements. 1 Protein, Proteome, and Proteomics. 1.1 Primary goals for studying

More information