Domain adaptation for deep learning

Size: px

Start display at page:

Download "Domain adaptation for deep learning"

Gerard Griffin
6 years ago
Views:

1 What you saw is not what you get Domain adaptation for deep learning Kate Saenko

2 Successes of Deep Learning in AI A Learning Advance in Artificial Intelligence Rivals Human Abilities Deep Learning for self-driving cars Translate FLOWER FIELD SKY CLOUDS Google's DeepMind Masters Atari Games Face Recognition 2

3 So is AI solved? pedestrian detection FAIL 3

4 Major limitation of deep learning Not data efficient: Learning requires millions of labeled examples, models do not generalize well to new domains; not like humans! New domain?

5 What you saw is not what you get What your net is trained on Dataset Bias Domain Shift Domain Adaptation Domain Transfer What it s asked to label

6 Example: scene segmentation sky building road building people road Train on Cityscapes, Test on Cityscapes FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016

7 Domain shift: Cityscapes to SF FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016 tree car sign road car tree Train on Cityscapes, Test on San Francisco Dashcam

8 No tunnels in CityScapes?... FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016 tree building

9 Applications to different types of domain shift From dataset to dataset From RGB to depth From simulated to real control From CAD models to real images

10 Today Show that deep models can be adapted without labels Propose two deep adaptation methods: adversarial alignment correlation alignment Show applications

11 Background: Domain Adaptation from source to target distribution Adapt backpack chair bike?? bike Source Domain lots of labeled data Target Domain unlabeled or limited labels?

Background: unsupervised domain adaptation Sample re-weighting NO labels in target domain Roughly, three categories of methods Sample re-weighting Subspace matching Deep methods Subspace alignment

12 Background: unsupervised domain adaptation Sample re-weighting NO labels in target domain Roughly, three categories of methods Sample re-weighting Subspace matching Deep methods Subspace alignment Deep alignment B. Femando, A. Habrard, M. Sebban, and T. Tuytelaars. Unsupervised visual domain adaptation using subspace alignment. In ICCV, B. Gong, Y. Shi, F. Sha, and K. Grauman. Geodesic flow kernel for unsupervised domain adaptation. In CVPR, 2012 Y. Ganin and V. Lempitsky. Unsupervised domain adaptation by back propagation. In ICML

13 How to adapt a deep network? backpack chair bike conv1 conv5 fc 6 fc 7 fc 8 classification loss Source Data

14 How to adapt a deep network? backpack chair bike conv1 conv5 fc 6 fc 7 fc 8 classification loss Source Data backpack Target Data Applying source classifier to target domain can yield inferior performance?

15 How to adapt a deep network? IDEA: align feature distributions backpack chair bike conv1 conv5 fc 6 fc 7 source data fc 8 Source Data shared shared shared shared shared classification loss backpack? Target Data same conv1 conv5 fc 6 fc 7 labeled target data Fine tune?..zero or few labels in target domain Siamese network?..no paired / aligned instance examples! fc 8

16 Deep distribution alignment by minimizing distance between distributions, e.g. Maximum Mean Discrepancy M. Long, et al. ICML 2015 CORrelation ALignment Sun and Saenko, AAAI 2016 or by adversarial domain alignment, e.g. Domain Confusion E. Tzeng et al. ICCV 2015 Reverse Gradient Y. Ganin and V. Lempitsky ICML

17 Adversarial Domain Adaptation Eric Tzeng UC Berkeley Judy Hoffman UC Berkeley Trevor Darrell UC Berkeley 18

18 Adversarial networks

19 Adversarial networks P P Q Q

20 Adversarial domain adaptation backpack chair bike Source Data + Labels conv1 conv5 fc 6 fc 7 Classifier classification loss Unlabeled Target Data conv1 conv5 fc 6 fc 7?

21 Adversarial domain adaptation backpack chair bike Source Data + Labels Encoder Classifier classification loss can be shared Unlabeled Target Data Encoder?

22 Adversarial domain adaptation backpack chair bike Source Data + Labels Encoder Classifier classification loss can be shared Unlabeled Target Data Encoder Discriminator Adversarial loss?

23 Adversarial domain adaptation backpack chair bike Source Data + Labels Encoder Classifier classification loss can be shared Unlabeled Target Data Encoder Discriminator Adversarial loss?

Design choices in adversarial adaptation backpack chair bike Source Data + Labels Generative or discriminative? Unlabeled Target Data Encoder Shared or not?

24 Design choices in adversarial adaptation backpack chair bike Source Data + Labels Generative or discriminative? Unlabeled Target Data Encoder Shared or not? Encoder Classifier Discriminator classification loss confusion Which loss? Adversarial loss? Ming-Yu Liu and Oncel Tuzel. Coupled generative adversarial networks, NIPS 2016

25 Deep domain confusion [Tzeng ICCV15 ] backpack chair bike conv1 conv5 fc 6 fc 7 source data fc 8 Source Data Unlabeled Target Data shared shared shared shared fc D domain confusion loss domain classifier loss shared classification loss? conv1 conv5 fc 6 fc 7 labeled target data fc 8 Adversarial Training of domain label predictor and domain confusion loss: Domain Label Cross-entropy with uniform distribution

domain classifier loss domain classifier prediction domain classifier (fixed) network parameters

26 Deep domain confusion [Tzeng ICCV15 ] Train a network to minimize classification loss AND confuse two domains source inputs target inputs network parameters (fixed) domain classifier (learn) domain classifier loss domain classifier prediction domain classifier (fixed) network parameters (learn) = pp(yy DD = 1 xx) domain confusion loss iterate (cross-entropy with uniform distribution) 27

27 Applications to different types of domain shift From dataset to dataset From RGB to depth From simulated to real control From CAD models to real images

28 ImageNet adapted to Caltech [Tzeng ICCV15 ] Multiclass Accuracy Number of labeled target examples

29 FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016 Results on Cityscapes to SF adaptation Before domain confusion After domain confusion

30 Adversarial Loss Functions Confusion loss [Tzeng 2015] Minimax loss [Ganin 2015] GAN loss [Goodfellow 2014] stronger gradients

discriminative? Unlabeled Target Data Encoder Shared or not?

31 Adversarial Discriminative Domain Adaptation (ADDA) (in submission) backpack chair bike Source Data + Labels Generative or discriminative? Unlabeled Target Data Encoder Shared or not? Encoder Classifier Discriminator classification loss Which loss? Adversarial loss GAN?

32 ADDA: Adaptation on digits (in submission)

33 Applications to different types of domain shift From dataset to dataset From RGB to depth From simulated to real control From CAD models to real images

34 ADDA: Adaptation on RGB-D (in submission) Train on RGB Test on depth

35 ADDA: Adaptation on RGB-D (in submission)

36 Applications to different types of domain shift From dataset to dataset From RGB to depth From simulated to real control From CAD models to real images

37 Adapting Deep Visuomotor Representations with Weak Pairwise Constraints Eric Tzeng1, Coline Devin1, Judy Hoffman1, Chelsea Finn1, Pieter Abbeel1, Sergey Levine1, Kate Saenko2, Trevor Darrell1 1 University of California, Berkeley 2 Boston University

38 From simulation to real world control [Tzeng, Devin, et al 16]

39 Weak pairwise constraints [Tzeng, Devin, et al 16]

40 Robotic task: place rope on scale [Tzeng, Devin, et al 16]

41 Applications to different types of domain shift From dataset to dataset From RGB to depth From simulated to real control From CAD models to real images

42 Domain Adaptation via Correlation Alignment Baochen Sun Microsoft Xingchao Peng Boston University 49

43 Deep CORAL: Correlation Alignment for Deep Domain Adaptation [Sun 2016]

44 Deep CORAL: Correlation Alignment for Deep Domain Adaptation [Sun 2016]

45 Generative CORAL Network (in submission)

46 Synthetic to real adaptation for object recognition (in submission) Train on synthetic Test on real

47 Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

48 Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

49 Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

50 Summary Deep models can be adapted to new domains without labels Proposed two deep feature alignment methods: adversarial alignment correlation alignment Many potential applications

51 Thank you References Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko, Simultaneous Deep Transfer Across Domains and Tasks, ICCV 2015 Eric Tzeng, Coline Devin, Judy Hoffman, Chelsea Finn, Pieter Abbeel, Sergey Levine, Kate Saenko, Trevor Darrell, Adapting Deep Visuomotor Representations with Weak Pairwise Constraints, WAFR 2016 Baochen Sun, Jiashi Feng, Kate Saenko, Return of Frustratingly Easy Domain Adaptation, AAAI 2016 Baochen Sun, Kate Saenko, Deep CORAL: Correlation Alignment for Deep Domain Adaptation, TASK-CV Workshop at ICCV 2016 Adversarial Discriminative Domain Adaptation, in submission Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks, in submission

Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net

Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net Supplementary Material Xingang Pan 1, Ping Luo 1, Jianping Shi 2, and Xiaoou Tang 1 1 CUHK-SenseTime Joint Lab, The Chinese University