ARTEFACT DETECTION IN ASTROPHYSICAL IMAGE DATA USING INDEPENDENT COMPONENT ANALYSIS. Maria Funaro, Erkki Oja, and Harri Valpola

Similar documents
Independent Component Analysis and Its Applications. By Qing Xue, 10/15/2004

Independent Component Analysis (ICA) Bhaskar D Rao University of California, San Diego

One-unit Learning Rules for Independent Component Analysis

Independent component analysis and blind source separation

7. Variable extraction and dimensionality reduction

TWO METHODS FOR ESTIMATING OVERCOMPLETE INDEPENDENT COMPONENT BASES. Mika Inki and Aapo Hyvärinen

Extraction of Sleep-Spindles from the Electroencephalogram (EEG)

THEORETICAL CONCEPTS & APPLICATIONS OF INDEPENDENT COMPONENT ANALYSIS

Semiblind Source Separation of Climate Data Detects El Niño as the Component with the Highest Interannual Variability

NONLINEAR INDEPENDENT FACTOR ANALYSIS BY HIERARCHICAL MODELS

Introduction to Independent Component Analysis. Jingmei Lu and Xixi Lu. Abstract

Artificial Intelligence Module 2. Feature Selection. Andrea Torsello

Advanced Introduction to Machine Learning CMU-10715

Independent Component Analysis

Using Kernel PCA for Initialisation of Variational Bayesian Nonlinear Blind Source Separation Method

MULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES

Simple-Cell-Like Receptive Fields Maximize Temporal Coherence in Natural Video

Simple-Cell-Like Receptive Fields Maximize Temporal Coherence in Natural Video

ACENTRAL problem in neural-network research, as well

1 Introduction Independent component analysis (ICA) [10] is a statistical technique whose main applications are blind source separation, blind deconvo

Independent Component Analysis and Its Application on Accelerator Physics

Independent Component Analysis

c Springer, Reprinted with permission.

Shape Measurement: An introduction to KSB

Temporal Coherence, Natural Image Sequences, and the Visual Cortex

Non-orthogonal Support-Width ICA

Blind separation of sources that have spatiotemporal variance dependencies

Lecture'12:' SSMs;'Independent'Component'Analysis;' Canonical'Correla;on'Analysis'

ICA [6] ICA) [7, 8] ICA ICA ICA [9, 10] J-F. Cardoso. [13] Matlab ICA. Comon[3], Amari & Cardoso[4] ICA ICA

Clutter Rejection for Doppler Ultrasound Based on Blind Source Separation

Massoud BABAIE-ZADEH. Blind Source Separation (BSS) and Independent Componen Analysis (ICA) p.1/39

Frequency-Based Separation of Climate Signals

Blind Source Separation Using Artificial immune system

ICA. Independent Component Analysis. Zakariás Mátyás

Independent component analysis: algorithms and applications

Separation of the EEG Signal using Improved FastICA Based on Kurtosis Contrast Function

Reduction procedure of long-slit optical spectra. Astrophysical observatory of Asiago

FEATURE EXTRACTION USING SUPERVISED INDEPENDENT COMPONENT ANALYSIS BY MAXIMIZING CLASS DISTANCE

STUDIES OF SELECTED VOIDS. SURFACE PHOTOMETRY OF FAINT GALAXIES IN THE DIRECTION OF IN HERCULES VOID

Localization of Multiple Deep Epileptic Sources in a Realistic Head Model via Independent Component Analysis

An Ensemble Learning Approach to Nonlinear Dynamic Blind Source Separation Using State-Space Models

Emergence of Phase- and Shift-Invariant Features by Decomposition of Natural Images into Independent Feature Subspaces

Bayesian ensemble learning of generative models

NON-NEGATIVE SPARSE CODING

Lab 4 Radial Velocity Determination of Membership in Open Clusters

Principal Component Analysis vs. Independent Component Analysis for Damage Detection

Study of Physical Characteristics of High Apogee Space Debris

Modern Image Processing Techniques in Astronomical Sky Surveys

THE functional role of simple and complex cells has

Introduction to Machine Learning

DETECTING PROCESS STATE CHANGES BY NONLINEAR BLIND SOURCE SEPARATION. Alexandre Iline, Harri Valpola and Erkki Oja

102 Mm PAIR OF GALILEO AND SANTILLI TELESCOPES

A Constrained EM Algorithm for Independent Component Analysis

New Machine Learning Methods for Neuroimaging

An Adaptive Autoguider using a Starlight Xpress SX Camera S. B. Foulkes, Westward, Ashperton, Nr. Ledbury, HR8 2RY. Abstract

A Ramble Through the Night Sky

The shapes of faint galaxies: A window unto mass in the universe

The Challenge of AZ Cas-Part 1. John Menke Barnesville, MD Abstract

More Optical Telescopes

Improved Rotational Invariance for Statistical Inverse in Electrical Impedance Tomography

Estimating Overcomplete Independent Component Bases for Image Windows

A Calibration Method for Wide Field Multicolor. Photometric System 1

Amateur Astronomer Participation in the TESS Exoplanet Mission

Astronomy. Optics and Telescopes

EXTENSIONS OF ICA AS MODELS OF NATURAL IMAGES AND VISUAL PROCESSING. Aapo Hyvärinen, Patrik O. Hoyer and Jarmo Hurri

BLIND SEPARATION OF POSITIVE SOURCES USING NON-NEGATIVE PCA

The Optical Microvariability and Spectral Changes of the BL Lacertae Object S

Iterative face image feature extraction with Generalized Hebbian Algorithm and a Sanger-like BCM rule

CIFAR Lectures: Non-Gaussian statistics and natural images

Introduction of near-infrared (NIR) spectroscopy. Ken-ichi Tadaki (NAOJ)

Recursive Generalized Eigendecomposition for Independent Component Analysis

Part two of a year-long introduction to astrophysics:

Transiting Exoplanet in the Near Infra-red for the XO-3 System

Estimation of linear non-gaussian acyclic models for latent factors

Speed and Accuracy Enhancement of Linear ICA Techniques Using Rational Nonlinear Functions

AUTOMATIC MORPHOLOGICAL CLASSIFICATION OF GALAXIES. 1. Introduction

Lecture 8. October 25, 2017 Lab 5

Gatsby Theoretical Neuroscience Lectures: Non-Gaussian statistics and natural images Parts I-II

Independent component analysis for noisy data MEG data analysis

Neural Networks and Machine Learning research at the Laboratory of Computer and Information Science, Helsinki University of Technology

Fundamentals of Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Independent Vector Analysis (IVA)

ORIENTED PCA AND BLIND SIGNAL SEPARATION

Observations of gravitational microlensing events with OSIRIS. A Proposal for a Cruise Science Observation

OPTION E, ASTROPHYSICS TEST REVIEW

Independent component analysis and blind source separation

IMPROVING THE DECONVOLUTION METHOD FOR ASTEROID IMAGES: OBSERVING 511 DAVIDA, 52 EUROPA, AND 12 VICTORIA

C. Watson, E. Churchwell, R. Indebetouw, M. Meade, B. Babler, B. Whitney

Transiting Hot Jupiters near the Galactic Center

A Random Walk Through Astrometry

DOME C AS A SETTING FOR THE PERMANENT ALL SKY SURVEY (PASS)

Mirror Alignment and Optical Quality of the H.E.S.S. Imaging Atmospheric Cherenkov Telescopes

CHAPTER 4 PRINCIPAL COMPONENT ANALYSIS-BASED FUSION

12.2 Dimensionality Reduction

CCD Star Images: On the Determination of Moffat s PSF Shape Parameters

Dark Matter and Dark Energy

Astronomical Experiments for the Chang E-2 Project

arxiv: v2 [astro-ph.sr] 16 May 2013

You, too, can make useful and beautiful astronomical images at Mees: Lesson 3

A Ramble Through the Night Sky

Detection of Artificial Satellites in Images Acquired in Track Rate Mode.

Transcription:

ARTEFACT DETECTION IN ASTROPHYSICAL IMAGE DATA USING INDEPENDENT COMPONENT ANALYSIS Maria Funaro, Erkki Oja, and Harri Valpola Neural Networks Research Centre, Helsinki University of Technology P.O.Box 54, 215 HUT, Finland maria.funaro,erkki.oja,harri.valpola @hut.fi ABSTRACT This paper is the first reported application of ICA on astrophysical image data. When studying far-out galaxies from a series of consequent telescope images, there are several sources for artefacts that influence all the images such as camera noise, atmospheric fluctuations and disturbances, and stars in our own galaxy. For this problem, the linear ICA model holds very accurately, because the independence of such artefacts is guaranteed. Using image data on the M31 Galaxy, it is shown that several clear artefacts can be detected and recognized based on their temporal pixel luminosity profiles and independent component images. Once these are removed, it is possible to concentrate on the real physical events like gravitational lensing. ICA might provide a very useful preprocessing for the large amounts of available telescope image data. 1. INTRODUCTION In modern astrophysics, one of the main research directions is understanding the dark matter in the universe. Possible candidates include compact objects such as small black holes, dwarf stars, or planets. When such an object passes near the line of sight of a star, the luminosity of the star will increase an effect called gravitational lensing, predicted by the general theory of relativity. In studying other galaxies than our own, individual stars cannot be resolved, but a whole group of unresolved stars is registered in a single pixel element of a telescope CCD camera. In a new technique called pixel lensing (see [1]), the pixel luminosity variations over time are monitored, and using these time series the lensing events can yet be detected even in the case of unresolved stars. A problem in the analysis of the images and luminosity variations is the presence of artefacts. One of the possible artefacts are the resolved or individual stars between the farout galaxy and the camera, which emerge sharply from the This work was supported by the Centre of Excellence Program of the Academy of Finland, project New Information Processing Principles, 44886. The corresponding author is E. Oja luminosity background. Other artefacts are cosmic rays and noise in the CCD camera. Separating these artefacts from possible physical events is one of the steps in the analysis of pixel lensing data. The new idea proposed here is to use ICA for artefact detection and removal. This is motivated by the fact that for astrophysical data, the independence of the artefacts is often theoretically guaranteed, and also the linear mixing model holds exactly. This is an almost ideal application for ICA. The ICA technique has been quite successful in artefact removal for biomedical signals [7]. The difference in the basic set-up between the biomedical signals and the astrophysical data is that in the latter case, the signals are digital images. Up to now, there have been few applications of ICA on the global analysis of image data, with functional MRI imaging the most advanced one [4]. From a mathematical point of view, our problem has similarities with the fmri analysis. The contents of this paper are as follows: our basic ICA approach for the spatial - temporal image data is outlined in Section 2. Section 3 describes the test for pixel lensing data and gives the results. Some conclusions are drawn in Section 4. 2. ICA FOR TIME-VARYING IMAGES In the astrophysical data, we have a number of digital images, recorded over consequent nights when the conditions are favourable, and carefully calibrated for geometrical and photometric alignments. Let be the number of pixels in an image and be the number of image samples. Let be the data matrix whose rows are the individual images, stacked row by row into vectors, and whose columns are the single pixel luminosity time series, here simply called light curves. In this case, formally similar to functional neuroimaging, we have two possibilities for performing both ICA and the preceding PCA decorrelation / compression: spatial or temporal [6]. The spatial ICA model is (1) 43

where is an mixing matrix and is an matrix whose rows are. The temporal ICA model is independent source images, with where is another mixing matrix 1 and the rows of are the individual light curves. For astronomical data, the temporal model is not feasible because the spatial dimension is very much larger than the temporal dimension and reliable estimation of matrix would be difficult in this case [3]. Moreover, the spatial model is quite natural because, for instance, the interference caused by fixed individual stars between the desired far-out objects and the camera are superimposed on all the images, and they are even in theory totally independent of the extragalactic events. Let us write the spatial ICA model of eq. (1) in the more conventional vector form as (2) Now is the dimensional vector representing the -th images, and is the pixel light curve through all the corresponding source vector with independent components. Written as we see that the columns of or mixing vectors can also be interpreted as virtual light curves, whose linear combinations give the observed light curves. The mixing vector characterizes the temporal behaviour of the -th source, while the source image characterizes the spatial behaviour over the pixel field. Both of these can be used to interpret the physical meaning of a given term in the sum. For ICA analysis, we have chosen to use the FastICA algorithm [2, 3] because of its appealing convergence properties. Preliminary sphering of the data is recommendable to simplify the algorithm and to reduce noise. This means transforming the vectors new vectors "! (3) into such that the have uncorrelated and unit variance elements. One of the methods to accomplish this is classical PCA. Computing the eigenvalues of the data covariance matrix gives indications about the number of sources to be used in the model. After whitening, the mixing model becomes #! $! $%& % where the matrix is % orthogonal. To compute matrix by the FastICA algorithm, its individual columns ')( are updated by the iteration '*(,+ #-/. 1 2' ( 4365 -/. 2' 1798 should not be confused with the transpose of 7 ( 43'*( % followed by orthonormalization of the matrix after each updating step. Function in the update rule is an odd nonlinear function and is its derivative. The choice of a suitable function is discussed in detail in [3]. In our case, the function % was 2: tanh :. When has been estimated, ;! the % original mixing matrix is approximated from. The independent components are obtained from #%, 3. EXPERIMENTAL RESULTS <. The digital images used in our study have been recorded over 35 unevenly sampled nights at the MDM observatory [5] with the telescope McGraw-Hill pointed toward the M31 Galaxy, equipped with a CCD camera of 248 248 pixels. The exposure time was 6 minutes for each of the frames. We tested the performance of ICA for artefact removal in some windows of 1 1 or 11 11 pixels, randomly taken in each of the four fields in which the CCD camera is divided. In this section we report the results obtained on two of them. To have an idea of how the CCD camera images look like after preprocessing, see Fig. 1a-b. Note that the temporally averaged intensity has been subtracted from each pixel since only the changes of intensity are of interest for lensing events, not the intensity per se. Dark areas in Fig. 1a-b correspond to pixels with an unusually low intensity for those particular pixels although the actual intensity might be higher than in some other pixels which are shown white in the figure. In our ICA model (2), is now a 35-dimensional vector representing the -th pixel light curve. The sample size =>= is =>=?=>=>=>= =@ or =@ <=BAC=@, according to the area we are examining. In the preliminary whitening, we have also reduced the dimension of the whitened vectors to 1. This choice is justified by the fact that for all the tests we have done, the first ten eigenvectors always contribute to more than 9% of the energy content. Thus, is a 35 1 matrix, and we have 1 independent components or source images. The mixing vectors are 35-dimensional and they can be plotted as temporal light curves over the 35 observation times. image window, whose coordinates are those of Fig. 1a. Looking at the ten mixing vectors and the corresponding source images, we find four mixing vectors whose temporal profiles, shown in Fig. 2, have a conspicuous spike. These are characteristic of cosmic rays. When a cosmic ray hits a pixel at time t, the luminosity value of this pixel has a sudden increase and also some other neighboring pixels are influenced and have a similar behaviour. Note that in each of these four mixing vectors, the luminous peak on the temporal axis occurs at a different time. The occurrence of cosmic rays is further evidenced by the corresponding independent component im- The first results concern the =>= =>= 44

ages, shown in Fig. 3. The cosmic rays have occurred at different times and at different locations of the image field and are thus fully independent events. Note also that from the original camera images, an example of which is Fig. 1a, these cosmic rays are not easily detected. One might question whether these mixing vectors and source images could be in fact artificial results of the statistical estimation process and not any real physical events at all. Now that the locations of the cosmic rays have been detected, it is easy to check the original light curves at these pixel locations. The light curves have the same spiky behaviour as the mixing vectors, proving that the artefacts are real. Looking at the other six mixing vectors for this image window, we have found that two of them show systematic increase and decrease in time which could be related to possible physical events like gravitational lensing. The other mixing vectors, instead, represent noisy fluctuations and their independent component images do not show evidence of any particular structure. This kind of random fluctuations over the 35 separate observations are due to atmospheric disturbances and camera noise. =@ The second results concern the =@ image window, whose coordinates are those of Fig. 1b. Now we have three mixing vectors which obviously characterize resolved stars, because they show large fluctuations around a constant value. In particular, one of them has larger values compared to the other two. We show this mixing vector in Fig. 4a. Its independent component image, shown in Fig. 4b, shows a particular circular structure, which is characteristic of resolved stars. This star can be distinguished also in the original image (Fig. 1, but in the independent component image it has been accentuated and separated from the rest of the star field. Of the remaining seven independent components, one shows a bright line on the CCD camera (Fig. 5; we do not know exactly what it is but it is probably another artefact. Because the mixing vector is just noise except for one of the observation times, where there is a strong luminosity spike, this might be a lighted object passing throuh the sky. Again, we have three other mixing vectors which show increase and decrease in time, related to possible interesting astrophysical events. Concerning the remaining three mixing vectors, they show noisy fluctuations and their independent component images do not indicate any particular structure. 4. CONCLUSION We claim here that standard linear ICA can be used to detect artefacts present in astrophysical data. To the best of our knowledge, this is the first application of ICA on this specific problem. Especially, when studying far-out galaxies, the possible artefacts caused by camera noise, atmospheric fluctuations or disturbances, or stars in our own galaxy, are all theoretically independent of the real physical pixel lensing effects that the investigators are interested in. Because ICA is an unsupervised technique, qualitative and domain-dependent expertise is necessary for interpretation of the found independent components. To corroborate our claim, we showed that the mixing vectors which characterize cosmic rays and resolved stars have a well-defined profile and their corresponding independent component images indicate particular structures (Figs. 2, 3 and 4). Some other mixing vectors characterize the artefacts due to noisy fluctuations in the camera and the atmosphere, and their independent component images do not show any typical structure. Using ICA, the large amounts of telescope image data could be preprocessed automatically to pinpoint the exact locations and occurrence times of possible artefacts and to accentuate the physical pixel lensing events. This affirmation is also confirmed by the further tests we are performing on other areas of the CCD camera. 5. REFERENCES [1] A. Gould, The theory of pixel lensing, Astrophys. J. 47, 1996, pp. 21-21. [2] A. Hyvärinen and E. Oja, A fast fixed-point algorithm for independent component analysis, Neural Comp. 9, 1997, pp. 1483-1492. [3] A. Hyvärinen, J. Karhunen, and E. Oja: Independent Component Analysis. Wiley Interscience, New York, 21. [4] T. P. Jung, S. Makeig, T. W. Lee, M. J. McKeown, G. Brown, A. T. Bell, and T. J. Sejnowski, Independent component analysis of biomedical signals, Proc. 2nd Int. Workshop on ICA and BSS, June 19-22, 2, Helsinki, Finland, pp. 633-644. [5] MDM Observatory: http: //www.astro.lsa.umich.edu/obs/mdm/ [6] K. S. Petersen, L. K. Hansen, and T. Kolenda, On the independent components of functional neuroimages, Proc. 2nd Int. Workshop on ICA and BSS, June 19-22, 2, Helsinki, Finland, pp. 615-62. [7] R. Vigario, J. Särelä, V. Jousmäki, M. Hämäläinen, and E. Oja, Independent component approach to the analysis of EEG and MEG recordings, IEEE Tr. Biomed. Eng. 47, 2, pp. 589-593. 45

9 2 88 18 86 16 84 14 82 12 22 24 26 28 3 1 1 12 14 16 18 2 Fig. 1. Image of the pixel luminosity values for one of the 35 observation times in the fourth camera field, with coordinates AC=@ =>= =@ >=>= and. Another image of the pixel luminosity values for one of the 35 observations times, with the pixel coordinates =>= #AC=>= and =>= #AC=>=..5 x 1 3.5 1 x 1 3 3.5.5 145 15 155 16 165 17 175.5 x 1 3.5.5.5 3 3.5 145 15 155 16 165 17 175 c) 4 5 6 7 145 15 155 16 165 17 175 4 6 8 2 4 4 x 1 4 2 6 145 15 155 16 165 17 175 d) Fig. 2. The 35 elements of four mixing vectors. On the horizontal axis are the observation times. These profiles are typical for cosmic rays. 46

9 9 88 88 86 86 84 84 82 82 9 22 24 26 28 3 9 22 24 26 28 3 88 88 86 86 84 84 82 82 22 24 26 28 3 c) 22 24 26 28 3 d) Fig. 3. Independent component images related to cosmic rays. Note that dark values here correspond to high intensity because the mixing vectors have their signs inverted. This can happen because the sign of independent components cannot be determined without prior knowledge about the physical setting. 47

2 4 x 1 3 18 2 16 14 4 6 12 8 145 15 155 16 165 17 175 1 1 12 14 16 18 2 Fig. 4. One of the three mixing vectors ( corresponding to a resolved star (. 2 1.5 x 1 3 18 1.5 16.5.5 14 12.5 3 145 15 155 16 165 17 175 1 1 12 14 16 18 2 Fig. 5. The mixing vector ( corresponding to a bright line (. This is probably another artefact. 48