Center for Spintronic Materials, Interfaces, and Novel Architectures. Spintronics Enabled Efficient Neuromorphic Computing: Prospects and Perspectives

Similar documents
NEUROMORPHIC COMPUTING WITH MAGNETO-METALLIC NEURONS & SYNAPSES: PROSPECTS AND PERSPECTIVES

RE-ENGINEERING COMPUTING WITH NEURO- MIMETIC DEVICES, CIRCUITS, AND ALGORITHMS

Neuromorphic computing with Memristive devices. NCM group

Synaptic Devices and Neuron Circuits for Neuron-Inspired NanoElectronics

Magnetic tunnel junction beyond memory from logic to neuromorphic computing WANJUN PARK DEPT. OF ELECTRONIC ENGINEERING, HANYANG UNIVERSITY

Addressing Challenges in Neuromorphic Computing with Memristive Synapses

An Overview of Spin-based Integrated Circuits

A 68 Parallel Row Access Neuromorphic Core with 22K Multi-Level Synapses Based on Logic- Compatible Embedded Flash Memory Technology

Proposal for an All-Spin Artificial Neural Network: Emulating Neural and Synaptic Functionalities Through Domain Wall Motion in Ferromagnets

From Spin Torque Random Access Memory to Spintronic Memristor. Xiaobin Wang Seagate Technology

Memory and computing beyond CMOS

BEYOND CHARGE-BASED COMPUTING

Artificial Neural Network and Fuzzy Logic

Power Dissipation. Where Does Power Go in CMOS?

This document is an author-formatted work. The definitive version for citation appears as:

The N3XT Technology for. Brain-Inspired Computing

Neuromorphic architectures: challenges and opportunites in the years to come

A Technology-Agnostic MTJ SPICE Model with User-Defined Dimensions for STT-MRAM Scalability Studies

Lecture 6 NEW TYPES OF MEMORY

28.7: Pairwise Coupled Hybrid Vanadium Dioxide-MOSFET (HVFET) Oscillators for Non- Boolean Associative Computing

Neural Networks Introduction

Page 1. A portion of this study was supported by NEDO.

Emerging spintronics-based logic technologies

Artificial Neural Network

Post Von Neumann Computing

Applications of Memristors in ANNs

EE115C Winter 2017 Digital Electronic Circuits. Lecture 6: Power Consumption

Machine Learning. Neural Networks. (slides from Domingos, Pardo, others)

Machine Learning. Neural Networks. (slides from Domingos, Pardo, others)

Machine Learning. Neural Networks. (slides from Domingos, Pardo, others)

Large-scale neural modeling

Semiconductor Memories

Announcements. EE141- Fall 2002 Lecture 7. MOS Capacitances Inverter Delay Power

EE04 804(B) Soft Computing Ver. 1.2 Class 2. Neural Networks - I Feb 23, Sasidharan Sreedharan

New Delhi & Affiliated to VTU, Belgavi ) Oorgaum, Kolar Dist ,Karnataka

Novel VLSI Implementation for Triplet-based Spike-Timing Dependent Plasticity

Ultralow-Power Reconfigurable Computing with Complementary Nano-Electromechanical Carbon Nanotube Switches

MTJ-Based Nonvolatile Logic-in-Memory Architecture and Its Application

Neuromorphic Network Based on Carbon Nanotube/Polymer Composites

A Hybrid CMOS/Memristive Nanoelectronic Circuit for Programming Synaptic Weights

A High-Yield Area-Power Efficient DWT Hardware for Implantable Neural Interface Applications

Consider the following spike trains from two different neurons N1 and N2:

EE 466/586 VLSI Design. Partha Pande School of EECS Washington State University

Realization of 2:4 reversible decoder and its applications

Introduction Biologically Motivated Crude Model Backpropagation

Data Mining Part 5. Prediction

GaAs and InGaAs Single Electron Hex. Title. Author(s) Kasai, Seiya; Hasegawa, Hideki. Citation 13(2-4): Issue Date DOI

Design of Basic Logic Gates using CMOS and Artificial Neural Networks (ANN)

Balance of Electric and Diffusion Forces

Digital Integrated Circuits A Design Perspective

Administrative Stuff

Case Studies of Logical Computation on Stochastic Bit Streams

Author : Fabrice BERNARD-GRANGER September 18 th, 2014

Introduction and Perceptron Learning

Neuron. Detector Model. Understanding Neural Components in Detector Model. Detector vs. Computer. Detector. Neuron. output. axon

INTRODUCTION TO NEURAL NETWORKS

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

New Approaches to Reducing Energy Consumption of MRAM write cycles, Ultra-high efficient writing (Voltage-Control) Spintronics Memory (VoCSM)

Liquid Computing in a Simplified Model of Cortical Layer IV: Learning to Balance a Ball

Using a Hopfield Network: A Nuts and Bolts Approach

MESL: Proposal for a Non-volatile Cascadable Magneto-Electric Spin Logic

Artificial Neural Networks

Center for Spintronic Materials, Interfaces, and Novel Architectures. Voltage Controlled Antiferromagnetics and Future Spin Memory

Kaushik Roy Department of Electrical and Computer Engineering, Purdue University, West Lafayette, IN

Instituto Tecnológico y de Estudios Superiores de Occidente Departamento de Electrónica, Sistemas e Informática. Introductory Notes on Neural Networks

Quasi Analog Formal Neuron and Its Learning Algorithm Hardware

Compartmental Modelling

3 Detector vs. Computer

Stand-Alone Hardware-Based Learning System. Trevor CLARKSON and Chi Kwong NG 1. Department of Electronic and Electrical Engineering,

! Charge Leakage/Charge Sharing. " Domino Logic Design Considerations. ! Logic Comparisons. ! Memory. " Classification. " ROM Memories.

Implementing Synaptic Plasticity in a VLSI Spiking Neural Network Model

Introduction to Neural Networks U. Minn. Psy 5038 Spring, 1999 Daniel Kersten. Lecture 2a. The Neuron - overview of structure. From Anderson (1995)

Moores Law for DRAM. 2x increase in capacity every 18 months 2006: 4GB

Analog CMOS Circuits Implementing Neural Segmentation Model Based on Symmetric STDP Learning

Digital Integrated Circuits A Design Perspective. Semiconductor. Memories. Memories

Artificial Neural Networks D B M G. Data Base and Data Mining Group of Politecnico di Torino. Elena Baralis. Politecnico di Torino

Advanced Topics In Solid State Devices EE290B. Will a New Milli-Volt Switch Replace the Transistor for Digital Applications?

Merging the Interface: Power, Area and Accuracy Co-optimization for RRAM Crossbar-based Mixed-Signal Computing System

CISC 3250 Systems Neuroscience

Reservoir Computing with Stochastic Bitstream Neurons

arxiv: v2 [cs.et] 26 Jan 2018

Last Lecture. Power Dissipation CMOS Scaling. EECS 141 S02 Lecture 8

Carbon Nanotube Synaptic Transistor Network for. Pattern Recognition. Supporting Information for

EECS 427 Lecture 11: Power and Energy Reading: EECS 427 F09 Lecture Reminders

Design Considerations for Integrated Semiconductor Control Electronics for a Large-scale Solid State Quantum Processor

Event-Driven Random Backpropagation: Enabling Neuromorphic Deep Learning Machines

Hopfield Neural Network and Associative Memory. Typical Myelinated Vertebrate Motoneuron (Wikipedia) Topic 3 Polymers and Neurons Lecture 5

CMSC 421: Neural Computation. Applications of Neural Networks

3/10/2013. Lecture #1. How small is Nano? (A movie) What is Nanotechnology? What is Nanoelectronics? What are Emerging Devices?

Artificial Neural Networks

Design of A Efficient Hybrid Adder Using Qca

Lecture 4: Perceptrons and Multilayer Perceptrons

STDP Learning of Image Patches with Convolutional Spiking Neural Networks

Neural networks. Chapter 19, Sections 1 5 1

Emerging Interconnect Technologies for CMOS and beyond-cmos Circuits

The Spin Torque Lego

A design methodology and device/circuit/ architecture compatible simulation framework for low-power magnetic quantum cellular automata systems

Decoding. How well can we learn what the stimulus is by looking at the neural responses?

Artifical Neural Networks

Lecture 25. Semiconductor Memories. Issues in Memory

Transcription:

Center for Spintronic Materials, Interfaces, and Novel Architectures Spintronics Enabled Efficient Neuromorphic Computing: Prospects and Perspectives KAUSHIK ROY ABHRONIL SENGUPTA, KARTHIK YOGENDRA, DELIANG FAN, SYED SARWAR, PRIYA PANDA, GOPAL SRINIVASAN, JASON ALLRED, ZUBAIR AZIM, A. RAGHUNATHAN Purdue University C-SPIN, Sep 20, 2016

The Computational Efficiency Gap IBM Watson playing Jeopardy, 2011 ~200000 W 20 W 20 W IBM Blue Gene supercomputer, equipped with 147456 CPUs and 144TB of memory, consumed 1.4MW of power to simulate 5 secs of brain activity of a cat at 83 times slower firing rates C-SPIN, Sep 20, 2016 2

Neuromorphic Computing Technologies Approximate Neural Nets, ISLPED 14 Conditional Deep Learning, DATE 2016. Approximate Computing, Semantic Decomposition, Conditional DLN Spintronics-Enabled Hardware Accelerators SW (Multicores/GPUs) 1 uj/neuron C-SPIN, Sep 20, 2016 3 Spin neuron, IJCNN 12, APL 15, TNANO, DAC, DRC, IEDM Spintronic Deep Learning Engine, ISLPED 14 Spin synapse, APL 15. 3

Device/Circuit/Algorithm Co-Design: Spin/ANN/SNN C-SPIN, Sep 20, 2016 4

BUILDING PRIMITIVES: MEMORY, NEURONS, SYNAPSES Lateral Spin Valve (Local & Non-local) I Domain Wall transistor D G I Spin current m1 DWM m2 I REF ± I in I REF Integrator V OUT I + = + v i g i Excitory Input Preset I = v i g i Inhibitory Input Synaptic current In = (In + In ) F MN M L CH + m L CH MgO PL FL C-SPIN, Sep 20, 2016 5 SHM Clock

DW-MTJ: Domain Wall Motion/MTJ Three terminal device structure provides decoupled write and read current paths Write current flowing through heavy metal programs domain wall position Read current is modulated by device conductance which varies linearly with domain wall position Universal device: Suitable for memory, neuron, synapse, interconnects C-SPIN, Sep 20, 2016 6

DW-MTJ for Interconnects/Memory Interconnect Design Memory Bit-Cell Energy-efficient interconnect design can circumvent the energy and delay penalties in CMOS based global interconnects for scaled technology nodes DW-MTJ memory bit cell with decoupled write and read current paths C-SPIN, Sep 20, 2016 7

Simple ANN: Activation Artificial NN synapses axon w 1 w 2 transmitting neuron Signal transmission w n Summation of weighted inputs Thresholding function Spin Hall based Switching DW-MTJ MgO PL FL SHM Switch a magnet using spin current, read using TMR effect C-SPIN, Sep 20, 2016 8 8

Step and Analog ANN Neurons OUT OUT IN IN Step Neuron Analog Neuron Neuron, acting as the computing element, provides an output current (IOUT) which is a function of the input current (IIN) Axon functionality is implemented by the CMOS transistor Note: Stochastic nature of switching of MTJ can be in Stochastic Neural nets 9 C-SPIN, Sep 20, 2016 9

Benchmarking with CMOS Implementation Neurons Power Speed Energy Function technology CMOS Analog neuron 1 [1] CMOS Analog neuron 2 [2] CMOS Analog neuron 3 [5] ~12µW (assume 1V supply) 65ns 780fJ Sigmoid / 15µW / / Sigmoid 180nm 70µW 10ns 700fJ Step 45nm Digital Neuron [3] 83.62µW 10ns 832.6fJ 5-bit tanh 45nm Hard-Limiting Spin-Neuron Soft-Limiting Spin-Neuron 0.81µW 1ns 0.81fJ Step / 1.25µW 3ns 3.75fJ Rational/ Hyperbolic Compared with analog/ digital CMOS based neuron design, spin based neuron designs have the potential to achieve more than two orders lower energy consumption [1]: A. J. Annema, Hardware realisation of a neuron transfer function and its derivative, Electronics Letters, 1994 [2]: M. T. Abuelma ati, etc, A reconfigurable satlin/sigmoid/gaussian/triangular basis functions, APCCAS, 2006 [3]: S. Ramasubramanian, et al., "SPINDLE: SPINtronic Deep Learning Engine for large-scale neuromorphic computing", ISLPED, 2014 [4]: D. Coue, etc A four-quadrant subthreshold mode multiplier for analog neural network applications, TNN, 1996 [5]: M. Sharad, etc, Spin-neurons: A possible path to energy-efficient neuromorphic computers, JAP, 2013 C-SPIN, Sep 20, 2016 10 /

In-Memory Computing (Dot Product) Artificial NN nucleolus axon synapses w 1 w 2 nucleolus transmitting neuron Signal transmission w n Summation of weighted inputs Thresholding function Input Voltages V 3 V 2 V 1 w11 w12 w13 w21 w22 w23 w31 w32 w33 V i1 w i1 V i2 w i2 V i3 w i2 Programmable resistors/dwm C-SPIN, Sep 20, 2016 11

All-Spin Artificial Neural Network Biological Neural Network Spin-synapse Spin-neuron All-spin ANN where spintronic devices directly mimic neuron and synapse functionalities and axon (CMOS transistor) transmits the neuron s output to the next stage Ultra-low voltage (~100mV) operation of spintronic synaptic crossbar array made possible by magneto-metallic spin-neurons System level simulations for character recognition shows maximum energy consumption of 0.32fJ per neuron which is ~100x lower in comparison to analog and digital CMOS neurons (45nm technology) Spintronic Neural Network All-spin Neuromorphic Architecture C-SPIN, Sep 20, 2016 12

Spiking Neural Networks (Self-Learning) C-SPIN, Sep 20, 2016

Spiking Neuron Membrane Potential Biological Spiking Neuron MTJ Spiking Neuron LIF Equation: LLGS Equation: The leaky fire and integrate can be approximated by an MTJ the magnetization dynamics mimics the leaky fire and integrate operation C-SPIN, Sep 20, 2016 14

MTJ as a Spiking Neuron Spikes at 3ns interval Spikes at 6ns interval MTJ magnetization leaks and integrates input spikes (LLG equation) in presence of thermal noise Associated write and read energy consumption is ~ 1fJ and ~1.6fJ per time-step which is much lower than state-of-the-art CMOS spiking neuron designs (267pJ [1] and 41.3pJ [2] per spike) C-SPIN, Sep 20, 2016 15

Spiking Neurons LLGS Based Spiking Neuron DW-MTJ base IF Neurons Input Spikes Membrane Potential Input Spikes MTJ conductance LLG Equation Mimicking Spiking Neurons Output Spikes DW Integrating Property Mimicking IF Neuron C-SPIN, Sep 20, 2016 16 16

Arrangement of DW-MTJ Synapses in Array for STDP Learning Spike-Timing Dependent Plasticity Spintronic synapse in spiking neural networks exhibits spike timing dependent plasticity observed in biological synapses Programming current flowing through heavy metal varies in a similar nature as STDP curve Decoupled spike transmission and programming current paths assist online learning 15fJ energy consumption per synaptic event which is ~10-100x lower in comparison to SRAM based synapses /emerging devices like PCM C-SPIN, Sep 20, 2016 17

Comparison with Other Synapses Device Reference Dimension Prog. Energy Prog. Time GeSbTe memristor GeSbTe memristor D. Modha ACM JETCAS, 2013 (IBM ) H.-S. P. Wong Nano Letters, 2012 (Stanford) 40nm mushroom and 10nm pore 75nm electrode diameter Average 2.74 pj/ event 50pJ (reset) 0.675pJ (set) Terminals Prog. Mechanism ~60ns 2 Programmed by Joule heating (Phase change) 10ns 2 Programmed by Joule heating (Phase change) Ag-Si memristor Wei Lu Nano Letters, 2010 (U Michigan) 100nmx100nm Threshold voltage~2.2v ~300µs 2 Movement of Ag ions FeFET Y. Nishitani JJAP, 2013 (Panasonic, Japan) Channel Length-3µm Maximum gate voltage 4V 10µs 3 Gate voltage modulation of ferroelectric polarization Floating gate transistor P. Hasler IEEE TBIOCAS, 2011 (GaTech) 1.8µm/0.6µm (0.35µm CMOS technology) Vdd - 4.2V Tunneling Voltage 15V 100µs (injection) 2ms (tunneling) 3 Injection and tunneling currents SRAM synapse B. Rajendran IEEE TED, 2013 (IIT Bombay) 0.3µm 2 (10nm CMOS technology) Average 328fJ for 4-bit synapse - - Digital counter based circuits Spintronic synapse NRL Purdue 340nmx20nm Maximum 48fJ /event 1ns 3 Spin-orbit torque C-SPIN, Sep 20, 2016 18

MTJ Enabled All-Spin Spiking Neural Network Probabilistic Spiking Neuron A pre-neuronal spike modulated by synapse to generate current that controls the post-neuronal spiking probability. Exploit stochastic switching behavior of MTJ in presence of thermal noise. C-SPIN, Sep 20, 2016 20

MTJ Enabled All-Spin Spiking Neural Network Stochastic Binary Synapse Synaptic strength proportional to temporal correlation between pre- and post-spike trains. Stochastic STDP Synaptic learning embedded in the switching probability of binary synapses. C-SPIN, Sep 20, 2016 21

MTJ Enabled All-Spin Spiking Neural Network Stochastic SNN Hardware Implementation Crossbar arrangement of the spin neurons and synapses for energy efficiency. Average neuronal energy of 1fJ and 1.6fJ per timestep for write and read operations, and 4.5fJ for reset. Average synaptic programming energy of 70fJ per training epoch. Classification accuracy of 73% for MNIST digit recognition. C-SPIN, Sep 20, 2016 22

Computing with Coupled STNOs C-SPIN, Sep 20, 2016

Coupled Spin-Torque Oscillators I REF ± I in I REF V OUT Coupled Integrator V OUT = V HIGH when unlocked V LOW when locked Processor (STNO_P) Reference (STNO_R) When I in is within the locking range (neuron threshold), two oscillators are locked. Otherwise, they are unlocked. STNOs can be used to provide thresholding functionality with tunable threshold C-SPIN, Sep 20, 2016 24

Image Edge Detection using STNOs I bias based on pixel intensity Gilbert damping constant (alpha) = 0.01 Saturation magnetization = 800 emu/cc Magnet volume (IMA) = 20x20x2 nm3 Eb = 30kT lambda = 2 epsilon prime = 0 P = 0.9 Hext = 11k Oe at 0.45 degrees from normal to the plane; Ibias range = 10uA to 50uA (i.e 10uA for black and 50uA for white pixels); Distance between STOs = 70nm (for coupling) C-SPIN, Sep 20, 2016 25

Summary Spintronics do show promise for low-power non- Boolean/brain-inspired computing Need for new leaning techniques suitable for emerging devices Materials research, new physics, new devices, simulation models An exciting path ahead C-SPIN, Sep 20, 2016 26