Singlets. Multi-resolution Motion Singularities for Soccer Video Abstraction. Katy Blanc, Diane Lingrand, Frederic Precioso

Size: px

Start display at page:

Download "Singlets. Multi-resolution Motion Singularities for Soccer Video Abstraction. Katy Blanc, Diane Lingrand, Frederic Precioso"

Jack Gibbs
6 years ago
Views:

1 Singlets Multi-resolution Motion Singularities for Soccer Video Abstraction Katy Blanc, Diane Lingrand, Frederic Precioso I3S Laboratory, Sophia Antipolis, France Katy Blanc Diane Lingrand Frederic Precioso 1

2 Overview VIDEO & MOTION SINGULARITIES & SINGLETS SOCCER SALIENT MOMENTS 2

databases : Youtube 8M [1] much analyzed

and sports Diverse applications : browsing

3 Video Analysis Burst of video content production new sources of videos big databases : Youtube 8M [1] much analyzed types : meeting/conferences, movies, news and sports Diverse applications : browsing in database, automatic video surveillance, driverless car, Exponential amount of information: a match of soccer of HDTV images of pixels each Collaboration with Wildmoka, themselves in collaboration with l INA and BeIN. 3

4 Related works: Video description Modelling human motion: [2] Handcrafted features: Stip [3], idt [4] Deep learning representations [5] 4

Related works : sport abstraction Clue Detection

color, shot segmentation and view classification

score appearances and goal mouth position [7]

5 Related works : sport abstraction Clue Detection to detect highlights: ground color, jersey color, shot segmentation and view classification Line marks positions and shot detection [6] Ye et al. Goals, attacks and other events using logo and score appearances and goal mouth position [7] Zawbaa et al. Face and skin detection, whistle detector and user specifications [8] Raventos et al. 5

6 Overview VIDEO & MOTION SINGULARITIES & SINGLETS SOCCER SALIENT MOMENTS 6

7 Inspiration: fluid movement Inspired from the work of Druon et al. [9] and the further work of Kihl et al. [10] 7

Optical Flow Approximation Optical Flow = discrete bivariable vector field F Ω R 2 x 1, x 2 U x 1, x 2, V x 1, x 2 with Ω = 1, 1 2 Polynomial subspace and the Legendre Basis P K,L

8 Optical Flow Approximation Optical Flow = discrete bivariable vector field F Ω R 2 x 1, x 2 U x 1, x 2, V x 1, x 2 with Ω = 1, 1 2 Polynomial subspace and the Legendre Basis P K,L x 1, x 2 = K L k=0 l=0 γ k,l. x 1 k x 2 l Projection on the Legendre Basis with K + L < D U = u 0,0 P 0,0 + u 0,1 P 0,1 + u 1,0 P 1,0 V = v 0,0 P 0,0 + v 0,1 P 0,1 + v 1,0 P 1,0 8

Polynomial projection = (, ) Original Flow F Flow U Flow V Projection U V = 1.38 + 0 + 0.

9 Polynomial projection = (, ) Original Flow F Flow U Flow V Projection U V = E 5 U V = x 1 x

Approximation and coefficient analysis From a simple analysis example to production, scenarization, event sementization U V = 1.

10 Approximation and coefficient analysis From a simple analysis example to production, scenarization, event sementization U V = Coefficient value counterattack u 0,0 horizontal global displacement v 0,0 vertical global displacement u 0,0 horizontal global position v 0,0 vertical global position Frame number 10

11 Singularity First projection on the Legendre basis U = u 0,0 P 0,0 + u 0,1 P 0,1 + u 1,0 P 1,0 V = v 0,0 P 0,0 + v 0,1 P 0,1 + v 1,0 P 1,0 Then on the canonical basis U V = A. x 1 x 2 + b with A M 2,2 and b R 2 6 types of singularities A = tr A 2 4. det A, λ 1 and λ 2 the Eigenvalues of A 11

12 Singularities extraction From a multi-resolution analysis of the optical flow U V = A. x 1 x 2 + b 12

13 Singlets: match singularities during time From singularity on optical flow frame to tracks of singularities: Singlets 13

14 Overview VIDEO & MOTION SINGULARITIES & SINGLETS SOCCER SALIENT MOMENTS 14

15 Application on soccer abstraction Our database Zoom Slow Motion Global Excitement Soccer Saliant Moment 15

16 Soccer Summarization: Our database Lack of standard benchmark for comparison sake Germany vs Portugal Nigeria vs Argentina France vs Honduras Switzerland vs France

17 Soccer Summarization: Our database 17

singularity it is a star or improper node : A = 0 time consistent : last for one

18 Zoom detection Zoom motion is a pure star node singularity. Zoom combined with a translation is an improper node Conditions: there is a singularity it is a star or improper node : A = 0 time consistent : last for one second Positives eigenvalues -> zoom out Negatives eigenvalues -> zoom in Determine the zoom direction center 18

19 Zoom detection A = 0 A 0.2 Threshold on an average A on a second set to 0,2 Comparison - Global motion estimation [6,11] - Duan et al. method [12] : Two histograms, one on magnitude one on angles to detect diagonal pattern (DL) 19

20 Slow Motion Detection How to differentiate a fast motion that has been artificially slowed down from a slow motion? 20

21 Slow Motion Detection Fast Motion Slow Motion 21

22 Global excitement Spatial histogram of 3x3 per frame Sum spatial histograms on 10 frames Threshold set on 1500 to select the most agitated moments 22

23 Soccer Saliant Moment Within 30 seconds: - at least two zoom direction changes - an activity peaks higher than 1500 in the farthest view - a slow motion replay in a close up view 23

24 Example of Summarization Zooms Agitation Slow Motion 24

25 Extension to other sports Set and Trained on Soccer and Tested on Handball All hyperparameters, parameters and SVM were trained and set on soccer videos and we test our framework on 5 minutes of Qatar Handball World Championship final without any adjustement or retraining. 25

approximation Motion description Global distribution Singlets Track singularities Length histograms

26 Conclusion Optical Flow Approximation Possibility to use : - different basis - different degree - other space than the polynomial one Singularity extraction Vanishing point 6 types for the affine approximation Motion description Global distribution Singlets Track singularities Length histograms Compute a description like for IDT Sport Video Abstraction Zoom detection Slow Motion Detection Global excitement 26

27 Because there is not only soccer in life Abstraction of concert Facial Emotion Action recognition 27

28 Vision in Sports: CVSports 28

29 References [1] S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, and S. ijayanarasimhan. Youtube-8m: A large-scale video classification benchmark. CoRR, abs/ , [2] Gorelick, Lena, et al. "Actions as space-time shapes." IEEE transactions on pattern analysis and machine intelligence (2007): [3] I. Laptev. On space-time interest points. Int. J. Comput. Vision, 64(2-3): , Sept [4] H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Action Recognition by Dense Trajectories. In IEEE Conference on Computer Vision & Pattern Recognition, pages , Colorado Springs, United States, June [5] D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Learning spatiotemporal features with 3d convolutional net-works. In roceedings of the IEEE International Conference on Computer Vision, pages , [6] Q. Ye, Q. Huang, W. Gao, and S. Jiang. Exciting event detection in broadcast soccer video with mid-level description and incremental learning. In Proceedings of the 13th annual ACM international conference on Multimedia, [7] H. M. Zawbaa, N. El-Bendary, A. E. Hassanien, and T. Kim. Event detection based approach for soccer video summarization using machine learning. International Journal of Multimedia and Ubiquitous Engineering, 2012 [8] A. Raventos, R. Quijada, L. Torres, and F. Tarres. Automatic summarization of soccer highlights using audio-visual descriptors. arxiv preprint arxiv: , [9] Druon, Martin. Modélisation du mouvement par polynômes orthogonaux: application à l'étude d'écoulements fluides. Diss. Université de Poitiers, [10] O. Kihl, B. Tremblais, and B. Augereau. Multivariate orthogonal polynomials to extract singular points. In IEEE International Conference on Image Processing ICIP 2008 San Diego, CA, United States, Oct [11] X. Qian. Global Motion Estimation and Its Applications. INTECH Open Access Publisher, [12] L.-Y. Duan, M. Xu, Q. Tian, C.-S. Xu, and J. S. Jin. A unified framework for semantic shot classification in sports video. IEEE Transactions on Multimedia,

30 Optical Flow Extraction Gunnar Farneback Method Pixel Neighborhood Approximation Approximation translation Relations Speed vector estimation 30

31 Polynomial Space Projection Scalar product Legendre basis with w x 1, x 2 = 1 and Ω = 1,1 2 Polynomial degree D nd = (D+1)(D+2) polynomials in the basis and so nd 2 coefficients Flot optique blurring (decreasing with the degree) 31

32 Multi-Resolution Singularity One sliding windows -> possibly one singularity The same singularity can be at different size and aprroximately the same position: We keep the one with less angular deviation from the original flow. dev = ω Ω 1 2 sin θ ω θ ω With θ ω the angle of the motion vector at the pixel position ω for the originalf flow and θ ω for the polynomial approximation 32

Classification of Hand-Written Digits Using Scattering Convolutional Network

Mid-year Progress Report Classification of Hand-Written Digits Using Scattering Convolutional Network Dongmian Zou Advisor: Professor Radu Balan Co-Advisor: Dr. Maneesh Singh (SRI) Background Overview