Generalized Transfer Component Analysis for Mismatched Jpeg Steganalysis

Size: px

Start display at page:

Download "Generalized Transfer Component Analysis for Mismatched Jpeg Steganalysis"

Cory Hart
5 years ago
Views:

1 Generalized Transfer Component Analysis for Mismatched Jpeg Steganalysis Xiaofeng Li 1, Xiangwei Kong 1, Bo Wang 1, Yanqing Guo 1,Xingang You 2 1 School of Information and Communication Engineering Dalian University of Technology, Dalian, , China 2 Beijing Institute of Electronic Technology and Application Beijing, , China 1

2 Contents 1 Motivation of Our Work 2 Proposed Method 3 Experiments and Results 4 Summary and Future Work 2

3 Contents 1 Motivation of Our Work 2 Proposed Method 3 Experiments and Results 4 Summary and Future Work 3

4 Motivation of Our Work Battle of steganography and steganalysis Steganography embed message signal into cover images to get stego images; message undetectable in covert communication steganalysis features sensitive to change due to embedding build decision model using machine learning recognize stego images from plain cover images Steganalysis seem to win the battle recently [Kodovsky and Fridrich 12]: Rich model perform well to detect six modern steganographic schemes at low embedding rate. 4

Motivation of Our Work Steganalysis Really Win the Battle? Success of state of the art steganalysis methods rely on having prior knowledge of steganography to build the training set.

5 Motivation of Our Work Steganalysis Really Win the Battle? Success of state of the art steganalysis methods rely on having prior knowledge of steganography to build the training set. cover images, embedding algorithm is known. Matched steganalysis train set and test set: matched cover images matched embedding algorithm laboratory Mismatched steganalysis train set and test set: mismatched cover images mismatched embedding algorithm real world 5

using rich models [2] [Tomas Pevny and Jessica Fridrich

6 Motivation of Our Work Steganalysis Really Win the battle? [1] [Kodovsky and Fridrich 12]:Steganalysis of jpeg images using rich models [2] [Tomas Pevny and Jessica Fridrich 07]:Merging markovand dct features for multi-class jpeg steganalysis 6

Motivation of Our Work New Challenge in Steganalysis State-of-the-art steganalysis method could not be used effectively in the real world. Moving steganalysis from the laboratory to the real world.

7 Motivation of Our Work New Challenge in Steganalysis State-of-the-art steganalysis method could not be used effectively in the real world. Moving steganalysis from the laboratory to the real world. [Andrew D. Ker, Patrick Bas, Rainer Bohme, Remi Cogranne, Scott Craver Tomas Filler, Jessica Fridrich, Tomas Pevny 13]: Moving steganography And steganalysis from the laboratory to the real world. laboratory matched How? Real world mismatched 7

8 Related Work Motivation of Our Work [Ivans Lubenko, Andrew D. Ker 13]: Steganalysis with Mismatched Covers: Do Simple Classifiers Help? Large data large samples diverse enough rich features Simple classifier Ensemble Fisher Linear Discriminant Online Ensemble Average Perceptron general model Limitation: It costs much labor to collect images for such a training set. Can we train a model robust to mismatched steganalysis using a small set of samples? only a single set, not diverse, small number. 8

9 Contents 1 Motivation of Our Work 2 Proposed Method 3 Experiments and Results 4 Summary and Future Work 9

10 Main idea Proposed Method Train set: Source domain D = {( x, y ), i = 1, 2,, N} ~ P ( X, Y ) S i i S Test set: Target domain D = {( x,?), i = 1, 2,, M} ~ P ( X, Y ) T i T The two distributions are not the same! The two distributions are similar! 10

11 Proposed Method domain adaptation & transfer learning For Mismatch in other area: Natural Language Processing Object recognition Ex. [J. Blitzer et al EMNLP 2006] Ex. [R. Gopalan et al ICCV 2011] Video analysis Text classification Ex. [Jeff Donahue et al CVPR 2013] Ex. [Pan et al IEEE Tran-NN 2011] domain adaptation & transfer learning Learning a shared representation Assumption: a latent feature space exists in which classification hypotheses fit both domains. min P ( f ( X ), Y )- P ( f ( X ), Y ) f S T 11

12 Proposed Method Main idea Such a latent feature space leads to loss of some information, and may not be sensitive to embedding. According to target domain, transform source domain to an intermediate domain. Then find a latent feature space between target domain and intermediate domain. 12

13 Proposed Method Generalized Transfer Component Analysis Domain Alignment: transform source domain to intermediate domain. Learn Shared Feature Space: find a latent feature space between target and intermediate domain. Map Samples into the Feature Space Construct Classifier and Make Decision in the New Feature Space 13

14 Domain Alignment Proposed Method The aim is to transform source to an intermediate that is close to target. similar to 0-1 normalization liner transformation to hold the feature sensitivity to different categories. Objective : E ( ϕ ( X ), Y ) = E ( X, Y ) s σ ( ϕ ( X ), Y ) = σ ( X, Y ) s i i σ ( X t, yi ) ϕ ( xs ) = ( xs E ( X s, yi )) + E ( X t, yi ) σ ( X, y ) No labels in test set (target domain). We can t get the values. s t t i 14

15 Domain Alignment Proposed Method Objective : Liner transformation : E ( ψ ( X )) = E( X ) s s t σ ( ψ ( X )) = σ ( X ) s i i σ ( X t ) ϕ( xs ) = ( xs E( X s )) + E( X t ) σ ( X ) s t Train Model p ( y x ) s t t 1 i i E( Xt Y) xt p( y xt ) i p( y x ) σ 1 t i 2 i ( Xt Y) ( xt - E( Xt, Y)) p( y xt ) i p( y xt ) 15

16 Proposed Method Find shared feature space Objective: Simplify: min P ( f ( X ), Y )- P ( f ( X ), Y ) f P( X, Y ) = P( Y X ) P( X ) P ( Y X ) = P ( Y X ) s S t T min P ( f ( X))- P ( f ( X)) f S T Measure the distance of two distribution: ns nt 1 i 1 i Dis( PS ( X ), PT ( X )) = φ( xs ) φ( xt ) ns i= 1 nt i= 1 Dis( P ( X ), P ( X )) = trace( KL) S T φ (.) RHKS K. M. Borgwardt, A. Gretton, M. J. Rasch, H. P. Kriegel, B. Scholkopf, and A. J. Smola, Integrating structured biological data by kernel maximum mean discrepancy, Bioinformatics,

17 Proposed Method Find shared feature space define a non-liner kernel feature extraction matrix W as transformation: X = KW Update the new K: Update the new distance: T Dis( P ( X ), P ( X )) = trace( KL) = trace( KWW KL) S T new T T K new = X new X new = KWW K min trace( KWW T KL) W S. Pan, I. Tsang, J. Kwok, and Q.Yang, Domain adap-tation via transfer component analysis, IEEE Trans-actions on Neural Networks,

18 Proposed Method Find shared feature space To avoid the solution W=0, we add a constrain that which can preserve (or maximize) the initial data variance in the new space: T W KHKW The final kernel learning problem is then set up as: T T min tr( W W ) + µ tr( KWW KL) W = T s. t. W KHKW = I 1 W ( I µ KLK ) KHK I + (M leading eigenvectors) S. Pan, I. Tsang, J. Kwok, and Q.Yang, Domain adap-tation via transfer component analysis, IEEE Trans-actions on Neural Networks,

19 Proposed Method source domain Domain alignment intermediate domain target domain w Mapped source Mapped target Train model 19

20 Contents 1 Motivation of Our Work 2 Proposed Method 3 Experiments and Results 4 Summary and Future Work

21 Experiments and Results Experimental Setup Database: eight mismatched domains F5 Jsteg MBS A-F A-J A-M Set A: 1800 Outguess F5 A-O B-F Each domain Cover:450 Stego:450 Jsteg B-J MBS B-M Set B:1800 Outguess B-O 21

22 Experiments and Results Experimental Setup Database: eight mismatched domains Features: PF274 + our approach (GTCA) Classifier: lib-svm Approach compared with: Orig-Fea: PF-274+ lib-svm [Pevny and Fridrich 07]:Merging markov and dct features for multi-class jpeg steganalysis OEAP: JRM features + OEAP [Kodovsky and Fridrich 12]:Steganalysis of jpeg images using rich models [Ivans Lubenko, Andrew D. Ker 13]: Steganalysis with Mismatched Covers: Do Simple Classifiers Help? TCA: PF274+ TCA+ lib-svm [Pan et al 2011] Domain adap-tation via transfer component analysis 22

23 Experiments and Results Mismatched Experiment 1 Mismatched covers: different quantization table 23

24 Experiments and Results Mismatched Experiment 2 Mismatched stegos: different embedding algorithm 24

25 Experiments and Results Mismatched Experiment 3 Mismatched covers and stegos: different quantization table and different embedding algorithm 25

26 Contents 1 Motivation of Our Work 2 Proposed Method 3 Experiments and Results 4 Summary

27 Summary Mismatched steganalysis Important in real application Traditional steganalysis methods perform badly Two distributions are not the same Generalized Transfer Component Analysis (GTCA) Learn new representations to correct mismatches A small set of training samples Empirically successful New Strategy for Mismatched Steganalysis Domain adaptation, transfer learning 27

28 28

Information Hiding and Covert Communication

Information Hiding and Covert Communication Andrew Ker adk @ comlab.ox.ac.uk Royal Society University Research Fellow Oxford University Computing Laboratory Foundations of Security Analysis and Design