arxiv: v1 [cs.cv] 30 Apr 2018

Similar documents
A Multi-Affine Model for Tensor Decomposition

Diffeomorphic Diffusion Registration of Lung CT Images

d = Dx subject to Stochastic programming methods for

N-mode Analysis (Tensor Framework) Behrouz Saghafi

Tensor Decompositions for Machine Learning. G. Roeder 1. UBC Machine Learning Reading Group, June University of British Columbia

/16/$ IEEE 1728

A Tensor Approximation Approach to Dimensionality Reduction

arxiv: v1 [math.ra] 13 Jan 2009

Discriminative Direction for Kernel Classifiers

PCA & ICA. CE-717: Machine Learning Sharif University of Technology Spring Soleymani

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

Karhunen Loéve Expansion of a Set of Rotated Templates

HST.582J/6.555J/16.456J

Subspace Methods for Visual Learning and Recognition

Two-View Segmentation of Dynamic Scenes from the Multibody Fundamental Matrix

Wafer Pattern Recognition Using Tucker Decomposition

Postgraduate Course Signal Processing for Big Data (MSc)

Introduction to Machine Learning. PCA and Spectral Clustering. Introduction to Machine Learning, Slides: Eran Halperin

Statistical Analysis Aspects of Resting State Functional Connectivity

F3-A2/F3-A3: Tensor-based formulation for Spectral Computed Tomography (CT) with novel regularization techniques

Multilinear Singular Value Decomposition for Two Qubits

CSE 554 Lecture 7: Alignment

Tensor-Based Dictionary Learning for Multidimensional Sparse Recovery. Florian Römer and Giovanni Del Galdo

PCA and LDA. Man-Wai MAK

Regularized Nonrigid Registration of Lung CT Images by Preserving Tissue Volume and Vesselness Measure

Multilinear Subspace Analysis of Image Ensembles

Principal Component Analysis (PCA)

A Study of Kruppa s Equation for Camera Self-calibration

Reducing false discovery rates for on-line model checking based detection of respiratory motion artifacts

Truncation Strategy of Tensor Compressive Sensing for Noisy Video Sequences

A concise proof of Kruskal s theorem on tensor decomposition

On the convergence of higher-order orthogonality iteration and its extension

Linear Combinations of Optic Flow Vectors for Estimating Self-Motion a Real-World Test of a Neural Model

Kronecker Product Approximation with Multiple Factor Matrices via the Tensor Product Algorithm

Deformable image registration with automated motion-mask extraction

Principal Component Analysis and Linear Discriminant Analysis

Accelerated MRI Image Reconstruction

Parallel Tensor Compression for Large-Scale Scientific Data

Probabilistic Latent Semantic Analysis

Discriminant Uncorrelated Neighborhood Preserving Projections

Linear Algebra Review. Fei-Fei Li

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

Donald Goldfarb IEOR Department Columbia University UCLA Mathematics Department Distinguished Lecture Series May 17 19, 2016

Data Preprocessing Tasks

Data Mining Techniques

Third-Order Tensor Decompositions and Their Application in Quantum Chemistry

Uncorrelated Multilinear Principal Component Analysis through Successive Variance Maximization

Multiscale Tensor Decomposition

Higher Order Separable LDA Using Decomposed Tensor Classifiers

Unsupervised Machine Learning and Data Mining. DS 5230 / DS Fall Lecture 7. Jan-Willem van de Meent

Multilinear Analysis of Image Ensembles: TensorFaces

PCA and LDA. Man-Wai MAK

Improved Kalman Filter Initialisation using Neurofuzzy Estimation

Principal Component Analysis and Singular Value Decomposition. Volker Tresp, Clemens Otte Summer 2014

Part 01 Introduction to X-ray imaging. Overview Module 07 Part 1 X-ray

Ke Li and Guang-Hong Chen

Phase Retrieval from Local Measurements: Deterministic Measurement Constructions and Efficient Recovery Algorithms

RESEARCH ON COMPLEX THREE ORDER CUMULANTS COUPLING FEATURES IN FAULT DIAGNOSIS

Determining the Translational Speed of a Camera from Time-Varying Optical Flow

A Modular NMF Matching Algorithm for Radiation Spectra

GI07/COMPM012: Mathematical Programming and Research Methods (Part 2) 2. Least Squares and Principal Components Analysis. Massimiliano Pontil

20th European Signal Processing Conference (EUSIPCO 2012) Bucharest, Romania, August 27-31, 2012

Supplementary Material SPNet: Shape Prediction using a Fully Convolutional Neural Network

Matrix-Tensor and Deep Learning in High Dimensional Data Analysis

OBJECT DETECTION AND RECOGNITION IN DIGITAL IMAGES

the tensor rank is equal tor. The vectorsf (r)

Tutorial on Principal Component Analysis

CS4495/6495 Introduction to Computer Vision. 3D-L3 Fundamental matrix

Covariance-Based PCA for Multi-Size Data

Machine Learning (Spring 2012) Principal Component Analysis

Face Detection and Recognition

Myoelectrical signal classification based on S transform and two-directional 2DPCA

Symmetric Two Dimensional Linear Discriminant Analysis (2DLDA)

Linear Algebra and Dirac Notation, Pt. 3

System 1 (last lecture) : limited to rigidly structured shapes. System 2 : recognition of a class of varying shapes. Need to:

What is Principal Component Analysis?

Online prediction of respiratory motion: multidimensional processing with low-dimensional

Statistical Pattern Recognition

Vision 3D articielle Session 2: Essential and fundamental matrices, their computation, RANSAC algorithm

L26: Advanced dimensionality reduction

Covariance and Principal Components

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Maximal Entropy for Reconstruction of Back Projection Images

Reconstruction for Proton Computed Tomography: A Monte Carlo Study

Singular value decomposition. If only the first p singular values are nonzero we write. U T o U p =0

Data Acquisition and Image Formation Methods for Multi Energy CT

Initial Certification

CS281 Section 4: Factor Analysis and PCA

Notes on Latent Semantic Analysis

Face Recognition. Face Recognition. Subspace-Based Face Recognition Algorithms. Application of Face Recognition

CVPR A New Tensor Algebra - Tutorial. July 26, 2017

A Constraint Generation Approach to Learning Stable Linear Dynamical Systems

Restoration of Missing Data in Limited Angle Tomography Based on Helgason- Ludwig Consistency Conditions

Statistical and Computational Analysis of Locality Preserving Projection

Linear Regression and Its Applications

Extraction of Fetal ECG from the Composite Abdominal Signal

A new truncation strategy for the higher-order singular value decomposition

Learning with Singular Vectors

A Coupled Helmholtz Machine for PCA

Transcription:

Decoupling Respiratory and Angular Variation in Rotational X-ray Scans Using a Prior Bilinear Model arxiv:1804.11227v1 [cs.cv] 30 Apr 2018 T. Geimer 1,2,3, P. Keall 4, K. Breininger 1, V. Caillet 4, M. Dunbar 4, C. Bert 2,3, and A. Maier 1,2 1 Pattern Recognition Lab, Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany 2 Erlangen Graduate School in Advanced Optical Technologies (SAOT), Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany 3 Department of Radiation Oncology, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany 4 ACRF Image X Institute, The University of Sydney, Australia Abstract. Data-driven respiratory signal extraction from rotational X- ray scans has always been challenging due to angular effects overlapping with respiration-induced change in the scene. In the context of motion modelling their main drawback is the fact that most of these methods only extract a 1D signal, that, at best, can be decomposed into amplitude and phase. In this paper, we use the linearity of the Radon operator to propose a bilinear model based on a prior 4D scan to separate angular and respiratory variation. Out-of-sample extension is enhanced by a B- spline interpolation using prior knowledge about the trajectory angle to extract respiratory feature weights independent of the acquisition angle. Though the prerequisite of a prior 4D scan seems steep, our proposed application of respiratory motion estimation in radiation therapy usually meets this requirement. We test our approach on DRRs of a patient s 4D CT in a leave-one-out manner and achieve a mean estimation error of 5.2% in the gray values for previously unseen viewing angles. 1 Introduction Extracting information about a patient s breathing state from X-ray projections is important for many time-resolved application, such as CT reconstruction or motion compensation in radiation therapy [6]. Data-driven approaches have been proposed ranging from the established Amsterdam Shroud [11] to more sophisticated approaches based on epipolar consistency [1]. In the context of motion models [9], their main drawback is the fact that most of these methods only extract a 1D signal, that, at best, can be decomposed into amplitude and phase [4]. When the motion representation is covered by an active shape model [9], it is highly desirable to extract more features resulting in a lower reconstruction error. A mathematical foundation to decompose multiple sources of variation was given by De Lathauwer et al. [3] who formulated Higher-order Singular Value

2 Decomposition (HOSVD) on a tensor of arbitrary dimensionality. Meanwhile, bilinear models have seen use in many fields including medical applications. Among others, Tenenbaum et al. [10] used a bilinear model to separate pose and identity from face images, while Çimen et al. [2] constructed a spatio-temporal bilinear model of coronary artery centerlines. In this work, we show that due to the linearity of the radon transform, a bilinear decomposition of respiratory and angular variation exists (Sec. 2.1). Subsequently, Sec. 2.2 and 2.3 will cover both model training and its application in feature estimation. With out-of-sample extension being an ill-posed problem, we propose a B-spline interpolation of rotational weights based on prior knowledge about the trajectory angle. We validated our model in a leave-one-out manner using digitally reconstructed radiographs (DRRs) of a patient s 4D CT. 2 Material & Methods 2.1 Radon Transform under Respiratory and Angular Variation An X-ray projection p i,j R N 2 1 at rotation angle φ i [0, 2π) and respiratory phase t j [0, 1) is given by the Radon transform R i R N 2 N 3 applied to the volume v j R N 3 1 p i,j = R i v j, (1) where N indicates the arbitrary dimension of the volume and projection image. It has been shown that the respiratory-induced changes in the anatomy can be described by an active shape model of the internal anatomy [9]: v j = M a j + v, (2) where v is the data mean of the mode, M R N 3 f contains the eigenvectors corresponding to the first f principal components, and a j R f 1 are the model weights corresponding to phase t j. Thus, Eq. 1 can be further processed p i,j = R i (M a j + v) = M R i a j + p i. (3) Now, M R i R N 2 f represents a model (with mean p i = R i v) for describing the projection image p i,j under the fixed angle φ i given the respiratory weights a j. The inversion of the Radon transform R itself is ill-posed for a single projection. However, M R i can be inverted more easily as a j mostly encodes variation in superior-inferior direction observable in the projection. As a result, respiratory model weights can be estimated from a single projection image if the angledependent model matrix is known [5]. Furthermore, we propose a Radon transform R i to be approximated by a linear combination of other Radon transforms, such that R i = k b k,i R k. (4)

3 The resulting scalar factors b k,i form the weight vector b i = [..., b k,i,... ]. Note that in this formulation, we implicitly assume a continuous trajectory and that the breathing motion is observable from each view. This gives rise to a bilinear formulation to any given projection image p i,j. However, bilinear models typically do not operate on mean-normalized data. Therefore, we use the decomposition described in Eq. 2 without mean subtraction: p i,j = R i (M a j ) = k b k,i R k (M a j ) = k b k,i M R k a j = M 1 a j 2 b i, (5) where M R N 2 f g is a model tensor with respiratory and rotational feature dimensionality f and g. Here, denotes the mode product along the given mode. For more details on tensor notation please refer to [7]. 2.2 Model Training For model training, a prior 4D CT scan is required yielding F phase-binned volumes v j, j {1,..., F }. DRRs are computed at G angles φ i, i {1,..., G} along a circular trajectory. The resulting F G projection images p i,j form the data tensor D R N 2 F G. Using HOSVD [3] we perform dimensionality reduction on the data tensor. First, D is unfolded along mode k: D (k) R d k ( d l ) l k. (6) Fig. 1 illustrates the unfolding process. Second, SVD is performed on each unfolded matrix D (k) = U k S k V k, (7) which yields the tensor basis {U k } K k=1 to project D onto (see Fig. 2): Finally, D can be described by a model tensor M : D }{{} N 2 F G D = M K k=1 U k. (8) = M }{{} N 2 f g 1 A }{{} F f 2 B }{{}, (9) G g where A and B carry the low-dimensional (f F, g G) model weights for respiratory and angular variation, respectively.

4 D (0) D D (1) D D (2) D Fig. 1. Tensor unfolding concatenates the slices of a tensor along a selected mode. F f G g A N 2 F D N 2 f M G B g Fig. 2. Dimensionality reduction (HOSVD) of the rotational and respiratory domain. 2.3 Weight Estimation Given an observed projection image p x,y at unknown respiratory phase t y and angle φ x, our objective is to find coefficients a y, b x for respiration and rotation to best represent the observation in terms of the model: p x,y = M 1 a y 2 b x. (10) However, as a y and b x need to be optimized simultaneously, this task is highly ill-posed. Tenenbaum et al. [10] used an Expectation-Maximization algorithm to cope with this problem. However, they benefit from the fact that only the pose is a continuous variable whereas identity is a discrete state, drastically simplifying their EM-approach. In our case, both respiratory and angular variation have to be considered continuous. Fortunately, we can incorporate prior knowledge about the trajectory into the estimation process. From the trajectory, the angle of each projection image is known even though the corresponding weights b x are only given for particular angles within the training samples. Content B-spline Interpolation. Using the rotation angle as prior knowledge, we propose extending the bilinear model with a B-spline curve fitted to

5 the rotational weights b i from training: b(u) = G b i N i (u), (11) i=1 with N i being the B-spline basis functions. Using a uniform parametrization with respect to the training angles, u(φ) of new angle φ is given as u(φ) = φ φ min φ max φ min. (12) Style Weight Computation. With the rotational weight interpolated, multiplying M and b (u (φ x )), first removes the angular variation: M R x = M 2 b (u (φ x )) R N 2 f 1. (13) Collapsing the 1-dimension results in the angle-dependent model matrix M R x R N 2 f for the new angle φ x. Closing the loop to Eq. 3 without mean, computation of the respiratory weights simplifies to a linear problem solved via the pseudo-inverse of M R x : ( ) 1 a y = M R x px,y. (14) 2.4 Data & Experiments Evaluation was performed on a patient s 4D CT consisting of eight phase-binned volumes at respiratory states 0 %, 15 %, 50 %, 85 %, 100 % inhale, and 85 %, 50 %, 15 % exhale. Using CONRAD [8], DRRs of size 512 512 with an 0.8 mm isotropic pixel spacing were created by forward projecting each of the F = 8 volumes at G = 60 angles over a circular trajectory of 354. Consequently, the final data tensor D featured a dimensionality of 512 2 12 60. A dense bilinear model was trained on the entire data tensor to assess the variance explained by the respiratory domain. For comparison, we investigated a linear PCA with mean normalization of the 4D CT volumes to assess weights and variance prior to the influence of the Radon transform. The dense bilinear model also provided a reference for the features in the subsequent leave-oneout evaluation where each phase and every 6th angle were removed one after another from the data tensor prior to training. Subsequently, the corresponding projection image and its respective angle were fed to the model and the rotational and respiratory weights were estimated. Accuracy was assessed with respect to the mean gray-value error between the reconstructed image and the original. 3 Results & Discussion Fig. 3 shows the weights of the first three principal components for the linear PCA of the volumes as well as each phase and angle in the projection images.

6 feature weights feature weights 1.5 1 0.5 0-0.5-1 -1.5 0.6 0.4 0.2 0-0.2-0.4-0.6 Linear PCA of 4D CT 1. PC 2. PC 3. PC 0 In 15 In 50 In 85 In 100 In 85 Ex 50 Ex 15 Ex respiratory phase Bilinear respiratory weights PC 3. PC 1. PC 2. 0 In 15 In 50 In 85 In 100 In 85 Ex 50 Ex 15 Ex respiratory phase explained variance / % feature weights Respiratory variance 100 95 90 85 80 75 Bilinear model Linear PCA 70 1 2 3 4 5 6 7 8 principal components Bilinear rotational weights 0.3 0.2 0.1 0-0.1-0.2 1. PC 2. PC 3. PC -0.3 0 30 60 90 120 150 180 210 240 270 300 330 rotation angle Fig. 3. Feature weights for the first three principal components for linear PCA on the 4D CT (t. l.), bilinear respiratory (b. l.) and rotational weights (b. r.) as well as the explained variance caused by respiration in the linear and bilinear case (t. r.). Fig. 4. Eigenimages in model tensor M corresponding to rotational and respiratory features both starting from the same mean. Notably, both first bilinear components are near constant. Unlike linear PCA (Eq. 2), data in the bilinear model does not have zero-mean (Eq. 5). Consequently, the first component points to the data mean while variation in the respective domain is encoded starting from the second component [10]. Appropriately, the (n + 1)-th bilinear respiratory component corresponds to the n-th linear component of the 4D CT indicating that separation of respiratory and angular variance in the projections is in fact achieved. Additionally, the respiratory variance explained by the principal components is plotted (top right). Most of the variance in the bilinear case is already explained by movement towards the mean. Thus, the linear components better reflect that four components accurately describe over 90% of the data variance. Fig. 4 shows the eigenimages corresponding to angular and respiratory variation, respectively. Noticeably, the rotational eigenimages contain mostly low-frequent variation inherent to the moving gantry whereas the respiratory direction encodes comparably high-frequent changes. For evaluation, Tab. 1 lists the mean gray value error in the reconstructed projection images for each test-angle averaged over all estimated phases. The mean gray value in the original image is listed for reference. The average error

7 Fig. 5. Example reconstruction for 85% exhale phase at 234 angle. (a) Original DRR sample. (b) Leave-one-out bilinear reconstruction. (c) Difference image with level/window 0.15/3.75. (d) Rotational weights from dense bilinear model and interpolated weights for the loo estimation. (e) Respiratory weight estimate. was 1.50 ± 1.65 (5.2%) compared to a reference mean value of 28.87 ± 12.21. Exemplarily for one phase-angle combination, Fig. 5 shows the leave-one-out estimation result for 85%Ex and 234. The proposed B-spline interpolation yields rotational weights close to the dense bilinear model up to the 10th component. Since both sets of weights correspond to slightly different eigenvectors, due to one model being trained on less data, deviation especially in the lower components is to be expected. Still, four to five respiratory weights are estimated accurately which, as shown previously, is sufficient to recover over 90% respiratory variance. As such, we believe they contain much more information than just the respiratory phase. In future work, we will investigate their use to predict entire dense deformation fields. Table 1. Mean gray-value error and standard deviation for leave-one-out evaluation of representing the projection images in terms of the bilinear model. 18 54 90 126 162 198 234 1.50 1.07 1.60 1.48 1.35 0.87 0.75 gray value error ±1.49 ±0.97 ±1.64 ±1.45 ±1.31 ±0.82 ±0.68 27.62 25.89 27.94 31.15 31.57 27.71 25.43 reference mean ±6.93 ±8.76 ±9.95 ±17.59 ±13.74 ±6.48 ±9.00 trajectory angle 270 306 342 1.53 2.61 2.20 ±1.40 ±2.99 ±2.33 29.23 31.88 30.29 ±8.44 ±19.16 ±14.38 Our current leave-one-out evaluation assumed two simplifications, that will pose additional challenges. First, a perfect baseline registration of the training CT to the projection images may not be the case in every scenario. However,

8 for the case of radiation therapy accurate alignment of patient and system is a prerequisite for optimal treatment. Second, no anatomical changes between the 4D CT and the rotational scan are taken into account. Further investigation on how these effects interfere with the decomposition are subject to future work. 4 Conclusion In this paper, we demonstrate that the Radon transform under respiratory and angular variation can be expressed in terms of a bilinear model given a continuous trajectory and that motion is observable in every projection. Using a prior 4D CT, we show that projection images on the trajectory can be bilinearly decomposed into rotational and respiratory components. Prior knowledge about the gantry angle is used to solve this ill-posed out-of-sample problem. Results demonstrate that up to five components of the respiratory variance are recovered independent of the view-angle. This explains more than 90 % of the volumetric variation. As such, recovery of 3D motion seems possible. Currently our study is limited by two simplifications, namely perfect alignment and no inter-acquisition changes. Their investigation is subject to future work. Acknowledgement This work was conducted at the ACRF Image X Institute as part of a visiting research scholar program. The authors gratefully acknowledge funding of this stay by the Erlangen Graduate School in Advanced Optical Technologies (SAOT). References 1. Aichert, A., Berger, M., Wang, J., Maass, N., Doerfler, A., Hornegger, J., Maier, A.K.: Epipolar Consistency in Transmission Imaging. IEEE Trans Med Imag 34(11), 2205 2219 (2015) 2. Çimen, S., Hoogendoorn, C., Morris, P.D., Gunn, J., Frangi, A.F.: Reconstruction of Coronary Trees from 3DRA Using a 3D+t Statistical Cardiac Prior. In: Med Image Comput Comput Assist Interv (MICCAI). pp. 619 626 (2014) 3. De Lathauwer, L., De Moor, B., Vandewalle, J.: A Multilinear Singular Value Decomposition. SIAM J Matrix Anal Appl 21(4), 1253 1278 (Jan 2000) 4. Fassi, A., Seregni, M., Riboldi, M., Cerveri, P., Sarrut, D., Ivaldi, G.B., de Fatis, P.T., Liotta, M., Baroni, G.: Surrogate-driven deformable motion model for organ motion tracking in particle radiation therapy. Phys Med Biol 60(4), 1565 1582 (2015) 5. Fischer, P., Pohl, T., Faranesh, A., Maier, A., Hornegger, J.: Unsupervised Learning for Robust Respiratory Signal Estimation from X-Ray Fluoroscopy. IEEE Trans Med Imag 36(4), 865 877 (2017) 6. Keall, P.J., Aun Ng, J., O Brien, R., Colvill, E., Huang, C.Y., Rugaard Poulsen, P., Fledelius, W., Juneja, P., Simpson, E., Bell, L., Alfieri, F., Eade, T., Kneebone, A., Booth, J.T.: The first clinical treatment with kilovoltage intrafraction monitoring (KIM): a real-time image guidance method. Med Phys 42(1), 354 358 (2015)

7. Kolda, T.G., Bader, B.W.: Tensor Decompositions and Applications. SIAM Rev 51(3), 455 500 (2009) 8. Maier, A., Hofmann, H.G., Berger, M., Fischer, P., Schwemmer, C., Wu, H., Müller, K., Hornegger, J., Choi, J.H., Riess, C., Keil, A., Fahrig, R.: CONRAD a software framework for cone-beam imaging in radiology. Med Phys 40(11), 111914 (2013) 9. McClelland, J.R., Hawkes, D.J., Schaeffter, T., King, A.P.: Respiratory motion models: A review. Med Image Anal 17(1), 19 42 (2013) 10. Tenenbaum, J.B., Freeman, W.T.: Separating Style And Content With Bilinear Models. Neural Comput 12(6), 1247 1283 (2000) 11. Yan, H., Wang, X., Yin, W., Pan, T., Ahmad, M., Mou, X., Cerviño, L., Jia, X., Jiang, S.B.: Extracting respiratory signals from thoracic cone beam CT projections. Phys Med Biol 58(5), 1447 64 (2013) 9