Deep convolutional framelets:

Similar documents
Deep Learning: Approximation of Functions by Composition

Introduction to Convolutional Neural Networks 2018 / 02 / 23

Bayesian approach to image reconstruction in photoacoustic tomography

Source Reconstruction for 3D Bioluminescence Tomography with Sparse regularization

Machine Learning for Computer Vision 8. Neural Networks and Deep Learning. Vladimir Golkov Technical University of Munich Computer Vision Group

<Special Topics in VLSI> Learning for Deep Neural Networks (Back-propagation)

Artificial Neural Networks D B M G. Data Base and Data Mining Group of Politecnico di Torino. Elena Baralis. Politecnico di Torino

Deep Learning Autoencoder Models

Comments. Assignment 3 code released. Thought questions 3 due this week. Mini-project: hopefully you have started. implement classification algorithms

EE-559 Deep learning 9. Autoencoders and generative models

University of Cyprus. Reflectance and Diffuse Spectroscopy

Machine Learning: Basis and Wavelet 김화평 (CSE ) Medical Image computing lab 서진근교수연구실 Haar DWT in 2 levels

Encoder Based Lifelong Learning - Supplementary materials

11/10/2014. Chapter 1: Introduction to Medical Imaging. Projection (Transmission) vs. Emission Imaging. Emission Imaging

Improved Bayesian Compression

CS 229 Project Final Report: Reinforcement Learning for Neural Network Architecture Category : Theory & Reinforcement Learning

Forward and inverse wave problems:quantum billiards and brain imaging p.1

Convolutional Neural Networks

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

KAI ZHANG, SHIYU LIU, AND QING WANG

Super-Resolution. Shai Avidan Tel-Aviv University

Imaging In Challenging Weather Conditions

arxiv: v5 [stat.ml] 25 Jan 2018

Inverse problem and optimization

Diffuse Optical Spectroscopy: Analysis of BiophotonicsSignals Short Course in Computational Biophotonics Albert Cerussi Day 4/Lecture #2

MEG Source Localization Using an MLP With Distributed Output Representation

Trends in hybrid data tomography Workshop at DTU Compute Wednesday January 24, 2018 Room 324/050

Inverse Transport Problems and Applications. II. Optical Tomography and Clear Layers. Guillaume Bal

Autoencoders and Score Matching. Based Models. Kevin Swersky Marc Aurelio Ranzato David Buchman Benjamin M. Marlin Nando de Freitas

Convolutional Neural Networks II. Slides from Dr. Vlad Morariu

Super-Resolution. Dr. Yossi Rubner. Many slides from Miki Elad - Technion

Statistical and Computational Inverse Problems with Applications Part 2: Introduction to inverse problems and example applications

IN the field of ElectroMagnetics (EM), Boundary Value

STOCHASTIC & DETERMINISTIC SOLVERS

Mathematical analysis of ultrafast ultrasound imaging

Tutorial on Methods for Interpreting and Understanding Deep Neural Networks. Part 3: Applications & Discussion

Sparse Approximation: from Image Restoration to High Dimensional Classification

AN NONNEGATIVELY CONSTRAINED ITERATIVE METHOD FOR POSITRON EMISSION TOMOGRAPHY. Johnathan M. Bardsley

Simple Techniques for Improving SGD. CS6787 Lecture 2 Fall 2017

Study of Pair-Monitor for ILC using Deep Learning

Framing U-Net via Deep Convolutional Framelets: Application to Sparse-view CT

OPTIMIZATION METHODS IN DEEP LEARNING

Linear Regression. CSL603 - Fall 2017 Narayanan C Krishnan

Linear Regression. CSL465/603 - Fall 2016 Narayanan C Krishnan

Lecture 14: Deep Generative Learning

Agenda. Digit Classification using CNN Digit Classification using SAE Visualization: Class models, filters and saliency 2 DCT

Improving L-BFGS Initialization for Trust-Region Methods in Deep Learning

Deep Learning. Convolutional Neural Network (CNNs) Ali Ghodsi. October 30, Slides are partially based on Book in preparation, Deep Learning

Quantitative in vivo measurements of blood oxygen saturation using multiwavelength photoacoustic imaging

Dreem Challenge report (team Bussanati)

CENG 783. Special topics in. Deep Learning. AlchemyAPI. Week 8. Sinan Kalkan

Joint Neural Network Equalizer and Decoder

WHY ARE DEEP NETS REVERSIBLE: A SIMPLE THEORY,

Linear Inverse Problems

Lightweight Probabilistic Deep Networks Supplemental Material

Introduction to Machine Learning Midterm Exam

How to do backpropagation in a brain

Machine Learning Ensemble Learning I Hamid R. Rabiee Jafar Muhammadi, Alireza Ghasemi Spring /

Singing Voice Separation using Generative Adversarial Networks

Self-Calibration and Biconvex Compressive Sensing

Dictionary Learning for photo-z estimation

Generative adversarial networks

Deep Learning & Neural Networks Lecture 4

Nonlinear Optimization Methods for Machine Learning

Overview I. Approaches to Reconstruction in Photoacoustic Tomography. Bell and the Photoacoustic Effect I. Overview II

Opportunities and challenges in quantum-enhanced machine learning in near-term quantum computers

CS 179: LECTURE 16 MODEL COMPLEXITY, REGULARIZATION, AND CONVOLUTIONAL NETS

Global Optimality in Matrix and Tensor Factorization, Deep Learning & Beyond

Non-uniqueness result for a hybrid inverse problem

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

End-to-End Event Classification of High-Energy Physics Data

The Deep Ritz method: A deep learning-based numerical algorithm for solving variational problems

Maxout Networks. Hien Quoc Dang

Detection of Subsurface Defects using Active Infrared Thermography

A Fast Augmented Lagrangian Algorithm for Learning Low-Rank Matrices

PDEs in Image Processing, Tutorials

Non-uniqueness result for a hybrid inverse problem

Change Detection in Multivariate Data

Deep learning / Ian Goodfellow, Yoshua Bengio and Aaron Courville. - Cambridge, MA ; London, Spis treści

Phononic Crystals. J.H. Page

Feature Design. Feature Design. Feature Design. & Deep Learning

Mini-project in scientific computing

GENERATIVE ADVERSARIAL LEARNING

Cheng Soon Ong & Christian Walder. Canberra February June 2018

MODULE -4 BAYEIAN LEARNING

BACKPROPAGATION. Neural network training optimization problem. Deriving backpropagation

Sajid Anwar, Kyuyeon Hwang and Wonyong Sung

Image enhancement. Why image enhancement? Why image enhancement? Why image enhancement? Example of artifacts caused by image encoding

On double integrals over spheres

PATTERN RECOGNITION AND MACHINE LEARNING

Deep learning on 3D geometries. Hope Yao Design Informatics Lab Department of Mechanical and Aerospace Engineering

The XOR problem. Machine learning for vision. The XOR problem. The XOR problem. x 1 x 2. x 2. x 1. Fall Roland Memisevic

Adapting Wavenet for Speech Enhancement DARIO RETHAGE JULY 12, 2017

Training Neural Networks Practical Issues

Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite Sum Structure

WEAK-VALUES AMPLIFICATION: WHEN PRECISION MEASUREMENTS BECOME EASY

WAVELET BASED DENOISING FOR IMAGES DEGRADED BY POISSON NOISE

TUTORIAL PART 1 Unsupervised Learning

Data Mining Based Anomaly Detection In PMU Measurements And Event Detection

Single-photon and two-photon absorption induced charge model calibration

Transcription:

Deep convolutional framelets: application to diffuse optical tomography : learning based approach for inverse scattering problems Jaejun Yoo NAVER Clova ML

OUTLINE (bottom up!) I. INTRODUCTION II. EXPERIMENTS III.THEORY NETWORK DESIGN FRAMEWORK IV. CONCLUSION

0. AGENDA SETUP

INVERSE PROBLEMS noise? System (G)? original data model d = G(x) + n observed data? Inverse problem unknown

INVERSE PROBLEMS e.g. natural images noise Blurring, Down Sampling original image model d = G(x) + n observed image Inverse problem denoising & super-resolution* Restored image * Bae W and Yoo J, CVPRW, 2017

INVERSE PROBLEMS e.g. electromagnetic, or acoustic wave noise Maxwell equation Láme equation original density model d = G(x) + n observed field Inverse problem physical property reconstruction* or source localization Restored density Yoo J et al., SIAM 2016 Yoo J et al., SIAM 2018 (minor revision)

INVERSE PROBLEMS General statement of the problem To find the best model such that d = G x In linear system, e.g. X-ray CT, we minimize the following cost function: In signal processing society: Gx = 1, 0, 1, 0 0, 1, 0, 1 1, 1, 0, 0 0, 0, 1 1 d = Gx, φ = d Gx 2 2 A B C D, rank G = 4 1 0 න G(C + D) dx A C B 1 0 0 D 0 body well posed problem, G 1 σ 1 0 more constraints, assumptions, regularization, iterative methods, etc.

INVERSE PROBLEMS General statement of the problem To find the best model such that d = G x In linear system, e.g. X-ray CT, we minimize the following cost function: In machine learning society: d = Gx, φ = d Gx 2 2 CNN Conv, ReLU, pooling, etc. d G T, where G T G = I x

AGENDA Deep learning works extremely well but why? Sometimes blind application of these techniques provides even better performance than mathematically driven classical signal processing approaches. The more we observe impressive empirical results in image reconstruction problems, the more unanswered questions we encounter: Why convolution? Why do we need a pooling and unpooling in some architectures? etc. What is the link between the classical signal processing theory and deep learning? Can deep learning go beyond? Would it be possible to train the network learn the complicated non-linear physics? How?

I. INTRODUCTION A. Inverse scattering problem (DOT) B. What to solve? & How to solve?

INVERSE SCATTERING PROBLEMS Photon/Wave scattering in a turbid media (Very non-linear, ill-posed ) Trajectories/Interactions (absorption, scattering, reflection, etc.) Electromagnetic, Elastic, Optical, Acoustic waves : d = G x more generalized model photon trajectories Light interaction with matter

INVERSE SCATTERING PROBLEMS Inverse Scattering Problems in Medical Imaging Applications Near Infra-Red (NIR), Ultrasound, Photo-acoustic, EEG, etc. Non-invasive, non-destructive examination Fast, cheap, and portable machine

INVERSE SCATTERING PROBLEMS Near-infrared (NIR, ~650-950 nm) Near-infrared light can travel deep in tissue, as a result of the relatively small absorption of water and hemoglobin. * Durduran et al., MICCAI, 2010 The goal of DOT is to reconstruct the spatial distribution of optical/physiological properties at each point (or volume element) in the tissue from measurements of fluence rate on the tissue surface.

INVERSE SCATTERING PROBLEMS Lipmann-Schwinger Equation Mapping between the 3D distribution of optical properties f and the measurements g 1) 광센서로부터광학계수분포 f 에대한측정데이터 g 를얻습니다. 3D distribution f Measurement g (f comes from a smoothly varying perturbation δμ) 2) 이때, 둘사이의관계는적분방정식으로표현할수있습니다. Measurement g 3D distribution f??? non-linear, ill-posed G 1???

INVERSE SCATTERING PROBLEMS Lipmann-Schwinger Equation Mapping between the 3D distribution of optical properties f and the measurements g 1) 광센서로부터광학계수분포 f 에대한측정데이터 g 를얻습니다. DEEP LEARNING!!!...?? 3D distribution f Measurement g (f comes from a smoothly varying perturbation δμ) 2) 이때, 둘사이의관계는적분방정식으로표현할수있습니다. OKAY.. Measurement g 3D distribution f??? non-linear, ill-posed

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs are unsatisfactory since they rely on heavy assumptions and linearization. e.g. Born approximation i u m i x u m s (x)

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs are unsatisfactory since they rely on heavy assumptions and linearization. Photoacoustic tomography (PAT) [3] [1] ODT, Kamilov et al., Optica, 2015 [2] Electron scattering, Broek and Koch, Physical review letter [3] PAT, Antholzer et al. arxiv, 2017

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs are unsatisfactory since they rely on heavy assumptions and linearization. 2. Domain & dimension mismatch (How to design the network???) The measurements g and optical distribution image f live in different domain with different dimension (1D 3D, severely ill-posed). 3. Commercial device was not available. Still laboratory or research level usage. Proto-type device available (not calibrated or validated). 4. Lack of real data At the early stage, only a single data measured by proto-type device were available

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs DCNN is certainly the trend of this era are unsatisfactory since they rely on heavy assumptions and linearization. 2. Domain & dimension mismatch (How to design the network???) The measurements g and optical distribution image f live in different domain with We need a new design of network architecture different dimension (1D 3D, severely ill-posed). 3. Commercial device was not available. Still laboratory or research level usage. Proto-type device available (not calibrated or validated). 4. Lack of real data At the early stage, only a single data measured by proto-type device were available

ISSUES #1,2 : SOLUTION Neural network for inverting Lipmann-Schwinger Equation The inverting operator is naturally found during the training phase. Achieve a denoised signal with a good signal representation which is trained via data without any assumption.

ISSUES #1,2 : SOLUTION The overall structure of the networks is remained same across different set of experiment data The number of convolution layers and their filters vary dependent on the data but this is chosen to show the best performance not due to the failure of the network Note that the rest of parameters such as Gaussian noise variance and dropout rate are remained same for every case

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs are unsatisfactory since they rely on heavy assumptions and linearization. 2. Domain & dimension mismatch The measurements g and optical distribution image f live in different domain with different dimension (1D 3D, severely ill-posed). 3. Commercial device was not available. Still laboratory or research level usage. Make your hands dirty. Start from the system. Proto-type device available (not calibrated or validated). 4. Lack of real data At the early stage, only If a you single don t data have measured the data. by Make proto-type it up. device were available

ISSUES #3,4 : SOLUTION System calibration and data acquisition Based on the data from hardware system (KERI), performed the signal analysis to calibrate the hardware system. Phantom with known optical values are used. S1, D1 S1, D2 S1, D3 S1, Dm S2, D1 Sn, Dm Schematic illustration of DOT system DOT system (KERI)

ISSUES #3,4 : SOLUTION System calibration and data acquisition

ISSUES #3,4 : SOLUTION Data preprocessing Discard the measurement pairs over the src-det distance limit (51 mm) Find the optical coefficients based on the homogeneous model Source <51 mm Data preprocessing based on src-det distance Detector

ISSUES #3,4 : SOLUTION Simulation data generation Using the finite element method (FEM) based solver NIRFAST Mesh data are re-gridded to matrix form Up to three anomalies with different size (radius: 2 mm ~ 13 mm) and optical properties at various position (x, y, z) Anomaly has two to five times bigger optical properties than the homogeneous background (similar to tumor compared to the normal tissue). 1500 data (1000 for training / 500 for validation)

ISSUES #3,4 : SOLUTION Data preprocessing Discard the measurement pairs over the src-det distance limit (51 mm) Find the optical coefficients based on the homogeneous model Domain adaptation from real to simulation data (matching the signal envelope, amplitudes, etc.) Matching the signal envelop i C = u simul x./ui exp (x)

ISSUES #3,4 : SOLUTION Data preprocessing Discard the measurement pairs over the src-det distance limit (51 mm) Find the optical coefficients based on the homogeneous model Domain adaptation from real to simulation data (matching the signal envelope, amplitudes, etc.) EXPERIMENT DESIGN, DEVICE CALIBRATION, DATA PREPROCESSING, UNDERSTANING PHYSICS, DATA GENERATION, EXPERIMENT, EXPERIMENT, EXPERIMENT TRIALS AND ERRORS DL MODEL DESIGN This took 80% of the research Matching the signal envelop i C = u simul x./ui exp (x)

ISSUES TO OVERCOME 1. Lack of good conventional algorithms Applying conventional inversion algorithms and denoising the artifacts using the CNNs are unsatisfactory since they rely on heavy assumptions and linearization. 2. Domain & dimension mismatch The measurements g and optical distribution image f live in different domain with different dimension (1D 3D, severely ill-posed). 3. Commercial device was not available. Still laboratory or research level usage. Proto-type device available (not calibrated or validated). 4. Lack of real data At the early stage, only a single data measured by proto-type device were available

II. EXPERIMENTS A. Results B. Take home messages

SIMULATION TRAINING Additive Gaussian noise (σ = 0.2) Dropout (p = 0.7) for FC layer Background μ a values are subtracted Data is centered and normalized to range between (-1,1) MSE loss ADAM optimizer (default setup) Batch size: 64 Early stoppling (no improvement in validation loss for 10 epochs) Training time: ~380 SEC

PHANTOM

PHANTOM DBT Image DBT Image

PHANTOM DBT Image DBT Image

IN VIVO Mouse (normal) Iterative method middle bottom Mouse (normal) Proposed middle bottom DBT Image Mouse (tumor) Proposed middle bottom

IN VIVO Mouse (normal) Iterative method middle bottom Mouse (normal) Proposed middle bottom DBT Image Mouse (tumor) Proposed middle bottom

TAKE HOME MESSAGE 1) Domain knowledge matters 2) Data preprocessing is important * DL NEEDS BABYSITTING A LOT! GARBAGE IN GARBAGE OUT 측정신호에대한이해 (DOMAIN KNOWLEDGE) 를바탕으로충분한 PREPROCESSING 필요하다. 3) Do not afraid to make your hands dirty 4) Quick trial and errors * MAKE YOUR WORKING ENVIRONMENT IDEA가생겼을때바로실험을해볼수있는환경을만드는것이중요하다. EXPERIMENT DESIGN, DEVICE CALIBRATION, DATA PREPROCESSING, UNDERSTANING PHYSICS, DATA GENERATION, EXPERIMENT, EXPERIMENT, EXPERIMENT TRIALS AND ERRORS DL MODEL DESIGN This took 80% of the research

TAKE HOME MESSAGE 1) Domain knowledge matters 2) Data preprocessing is important * DL NEEDS BABYSITTING A LOT! GARBAGE IN GARBAGE OUT 측정신호에대한이해 (DOMAIN KNOWLEDGE) 를바탕으로충분한 PREPROCESSING 필요하다. 3) Do not afraid to make your hands dirty 4) Quick trial and errors * MAKE YOUR WORKING ENVIRONMENT IDEA가생겼을때바로실험을해볼수있는환경을만드는것이중요하다. 20% 가 20% 일수있었던이유 EXPERIMENT DESIGN, DEVICE CALIBRATION, DATA PREPROCESSING, UNDERSTANING PHYSICS, DATA GENERATION, EXPERIMENT, EXPERIMENT, EXPERIMENT TRIALS AND ERRORS DL MODEL DESIGN

OKAY THAT WORKS BUT WHY?

IV. THEORY A. Deep Convolutional Framelets

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 1 Lifted Hankel matrix of noiseless signal f is often low-ranked whereas that of noise ε is usually full-ranked [1-3] f 1 f 2 For an image patch f, smooth or structured signals has a sparse coefficients in Fourier domain. Let f R n be a vectorized patch with n number of pixels. Here, is a Hankel matrix of signal f with size d. f 3 f: 1 2 3 1 2 3 2 3 1 3 1 2 Ye JC et al., 2017

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 1 Lifted Hankel matrix of noiseless signal f is often low-ranked whereas that of noise ε is usually full-ranked [1-3] f 1 f 2 f 3 For an image patch f, መf R n = f + ε Matrix factorization : low-ranked : full-ranked (or sparse) 1 2 3 4 5 2 3 4 5 6 3 4 5 6 7 4 5 6 7 8 5 6 7 8 9 6 7 7 8 8 9 9 10 10 11 8 9 10 11 1 9 10 11 1 2 10 11 1 2 3 11 1 2 3 4 Ye JC et al., 2017 R n d

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 1 Lifted Hankel matrix of noiseless signal f is often low-ranked whereas that of noise ε is usually full-ranked [1-3] For an image patch with missing pixels f R n, R n d is a rank-deficient Hankel matrix, Matrix completion, or Netflix problem Jin et al., IEEE TIP, 2015

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 2 There is a close relationship between Hankel matrix and convolution operation, which leads us to CNN. Convolution convolution, WiKipedia

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 2 There is a close relationship between Hankel matrix and convolution operation, which leads us to CNN. Convolution f R n ψ R d y 1 y ch

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 2 There is a close relationship between Hankel matrix and convolution operation, which leads us to CNN. c.f. Single-input multi-output (SIMO) Multi-input single-output (MISO)

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 2 There is a close relationship between Hankel matrix and convolution operation, which leads us to CNN. encoder f f decoder

DEEP CONVOLUTIONAL FRAMELETS OBSERVATION 3 In the signal processing point of view, what CNNs are doing is to find an energy-compacting signal representations (low-ranked) by training a set of local bases Ψ for a given non-local bases Φ. encoder decoder Yoshua Bengio's slides, 2013

SUMMARY SO FAR Observation 1 Lifted Hankel matrix of noiseless signal f is often low-ranked whereas that of noise ε is usually full-ranked [1-3]. Observation 2 There is a close relationship between Hankel matrix and Convolution. Observation 3 In signal processing perspective, what CNN actually does is to find a new signal representation by learning a set of local bases from the data while the global bases are fixed. [1] Ye JC et al., IEEE TIT, 2017 [2] Yin et al., SIAM, 2017 [3] Ye and han, 2017

PROPOSED FRAMEWORK Neural network for inverting Lipmann-Schwinger Equation * Deep Convolutional Framelets The inverting operator is naturally found during the training phase. Achieve a denoised signal with a good signal representation which is trained via data without any assumption.

CONCLUSION I PROPOSED A NOVEL DEEP LEARNING FRAMEWORK FOR INVERSE SCATTERING PROBLEMS Developed deep learning framework inverting Lippmann-Schwinger equation Showed that the physical intuition is directly mapped to each layer of network Showed that the framework successfully works in various examples ADVANTAGES OVER THE OTHER APPROACHES 1. end-to-end system / simple architecture no explicit modeling and boundary conditions data-driven (benefit from the data set) fast and efficient learning and inference no post-processing for parameter tuning conventional analytic approaches

CONCLUSION I PROPOSED A NOVEL DEEP LEARNING FRAMEWORK FOR INVERSE SCATTERING PROBLEMS Developed deep learning framework inverting Lippmann-Schwinger equation Showed that the physical intuition is directly mapped to each layer of network Showed that the framework successfully works in various examples ADVANTAGES OVER THE OTHER APPROACHES 2. extensibility / practicality for different modalities and final image sizes for different experimental conditions trainable with numerical data standard CNNs (learning the signal representation)