Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1

Size: px
Start display at page:

Download "Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1"

Transcription

1 Machine Learning for Large-Scale Data Analysis and Decision Making A Week #1

2 Today Introduction to machine learning The course (syllabus) Math review (probability + linear algebra) The future of machine learning Why are you enrolled and what do you want from this class?

3 Machine Learning (ML) Science that studies statistical and computational aspects of modeling data for predictive purposes (Mostly) Empirical science

4 Data ML Predictions Task: Predict whether an image contains a malignant tumor. Task: Predict the next movie a person should watch.

5

6 Subjective view of how ML relates to other fields

7 Historical View (Modern) Statistics: ~1900 Machine Learning and Data Mining: ~1960 Data Science: ~2000

8 Computer Science + Engineering Machine Learning Statistics + Mathematics Danger Zone Data Science (BI) Traditional Research Substantive Expertise [Inspired by Drew Conway]

9 Data analysis, machine learning and data mining are various names given to the practice of statistical inference, depending on the context. Probability Data generating process Observed data Inference Larry Wasserman in All of Statistics: A Concise Course in Statistical Inference.

10 [Reproduced from R. Neal: ]

11 ML vs. other fields Traditional statistics Causality and understanding are more important than prediction. Data size is less important. Less computational aspects. Simpler, linear relationships. Data mining (somewhat synonymous of ML) Computational Efficiency [ ] more important than statistical sophistication. Often business-related problems. [Ideas and text borrowed from R. Neal: ]

12 Data View Statistics Designs experiments to collect data Machine Learning Uses available data but also designs collection mechanisms Data Science Uses data made available by other processes

13 Artificial Machine Learning Intelligence Deep Learning

14 Applications of ML

15

16

17

18

19

20

21

22 Medicine: personalized, automate diagnostics Social sciences: prediction problem (e.g., predict recidivism) Engineering: to propose new design, evaluate without building Finance: capture uncertainty, short-term trading Marketing: to understand and quantify user experience, advertising efficacy Many others: conservation, social projects Your domain of expertise

23 Course Introduction & Goals

24 Course webpage Google my name. There s a link from my home page.

25 Fit with other courses HEC PhD level Computationally oriented Other ML courses in Montreal (U.Montreal, Polytechnique, Mcgill) More applied Emphasize on large-scale data using distributed computational methods

26 Short review of linear algebra, statistics, and probabilities

27 Based on chapters 2 and 3 of Deep Learning

28 Linear algebra Scalar: a single value. a R, a N a = 3 Vector: an array of values. a R D, a N D a = Matrix: a table of values. A R D 1 D 2, A N D 1 D 2 [ ] A = 1 2 9

29 Indexing notation Indexing elements of a vector: 3 a = 4 2 a 1 a i Convention: The first element is the zero th. Indexing elements of a matrix: a ij [ ] A = a 12

30 Simple operations Transpose a 0 a = a 1 a 2 (A ij ) = A ji a = a 0 a 1 a 2 Addition Vectors and matrices w. the same shape b 0 a 0 + b 0 b = b 1 a + b = b 2 a 1 + b 1 a 2 + b 2 (A + B) ij = A ij + B ij

31 Simple operations Multiply by a scalar a 0 a = a 1 a 2 Vector product. The dot product a a = i a i a i Note: it yields a scalar. Element-wise product: a a = a 0 a 0 a 1 a 1 Also known as Hadamard product a 2 a 2

32 Operations Matrix product (dot product): C ij = k A ik B kj A's columns must equal B s rows (order is important) A R D 1 D 2, B R D 2 D 3 Distributive: A(B + C) =AB + AC Associative: A(BC) =(AB)C (AB) = B A

33 Inverse We denote a matrix s inverse as A 1 A matrix has an inverse iff: it s square. D 1 = D 2 its columns are linearly independent. A square matrix not inversible is singular No column can be recovered using a combination of other columns Inverses are useful to solve systems of equations: Ax = b x = A 1 b

34 Norms norm. Size of a vector (or matrix) L p 1/p a p = a i p i Standard norms in ML: Euclidean norm (p=2) a 2 = a i 2 i Dot product w. 2-norm: a t b = a 2 b 2 cos xy Frobenius norm (matrix): A 2 = i j a ij 2

35 Special matrices & vectors Identity. Denoted I n I 3 = All zeros except for ones on the main diagonal Symmetric: A = A Unit vector: a 2 = 1 Orthogonal vectors: a b = 0 Orthonormal vectors: unit and orthogonal Orthogonal matrix: Orthonormal rows & columns A A = AA = I

36 Skip eigendecomposition, SVD, pseudo-inverse, determinants (Sections ). We will get back to them if/when needed in the course.

37 On to probabilities Chapter 3 of Deep Learning I ve adapted some of the lecture slides from the book. Thanks to Ian Goodfellow for providing slides.

38 Why probabilities? To capture uncertainty E.g., What time will I get home tonight? Probabilities provide a formalism for making statements about data generating processes (L. Wasserman) E.g., what happens when I flip a fair coin? Probability Data generating process Observed data Inference

39 The example Generate data by throwing a fair die. What do we know about a single throw? 6 possible outcomes. (sample space) Each outcome (e.g., 1). (element, state) A subset of outcomes (e.g., <3). (event) Outcomes are equiprobable. (uniform distribution)

40 Random variables and probabilities A random variable (r.v.) is a probabilistic outcome. For example, Die throw (X) The actual outcome is {1, 2, 3, 4, 5, 6}. (x) A probability function (P) assigns a real number to each possible event: P(x) 0, x X P x = 1

41 <latexit sha1_base64="xudvadcuhlkgwfqr9uwkndxf7yo=">aaadfhicfvjnb9naen2yr2k+wjhysygqpviu2qgjlqbktkihckeibaq4isabsbrketfahzdelv8eby7wu7ghrtz5i5xzpwhvcwkktz7em5l9450kk8jsgp5sefeuxrt+y+umf+v2nbv3tnfun1idg449rqu2/qqssqgwr4ik9jodkcyst5pzy0o/pudjhvbvazhhmiwpehpbgrzv77bmz+cjstfaboadcbnbjohwomlw0r3tnh7fy83zfbvxcdyoojcjyqggbjdy+nfumqm+gykohfsqoh0ws8nlsouyctdrxh1fwzk9xffaau0itvxmcnrm17wk/jc2ygnybfgileweil9cnmllqdqopg/gwianuxaaubhoa8dpwaan949qtyrazwgs2wzjynfoc0nc6a+1wyqlgqx5nz3nsna9xjvw0pwmlp5ubjfsekqavthcey7obwrvmme/qutbya0dmrvk26/dc6n2k4m429soqfu7dh+lzjp9ef7x6nk2hx+ge0qdx67kbyao1kaid8sirpoksxfylr7blgh9lzbbyenofhaid0+a+y9wo7pfhrjhrmui9pttsypwzt3gmwqf2wf2xfvkffw+ed8vur3gquybq4x34zdf4aht</latexit> <latexit sha1_base64="xudvadcuhlkgwfqr9uwkndxf7yo=">aaadfhicfvjnb9naen2yr2k+wjhysygqpviu2qgjlqbktkihckeibaq4isabsbrketfahzdelv8eby7wu7ghrtz5i5xzpwhvcwkktz7em5l9450kk8jsgp5sefeuxrt+y+umf+v2nbv3tnfun1idg449rqu2/qqssqgwr4ik9jodkcyst5pzy0o/pudjhvbvazhhmiwpehpbgrzv77bmz+cjstfaboadcbnbjohwomlw0r3tnh7fy83zfbvxcdyoojcjyqggbjdy+nfumqm+gykohfsqoh0ws8nlsouyctdrxh1fwzk9xffaau0itvxmcnrm17wk/jc2ygnybfgileweil9cnmllqdqopg/gwianuxaaubhoa8dpwaan949qtyrazwgs2wzjynfoc0nc6a+1wyqlgqx5nz3nsna9xjvw0pwmlp5ubjfsekqavthcey7obwrvmme/qutbya0dmrvk26/dc6n2k4m429soqfu7dh+lzjp9ef7x6nk2hx+ge0qdx67kbyao1kaid8sirpoksxfylr7blgh9lzbbyenofhaid0+a+y9wo7pfhrjhrmui9pttsypwzt3gmwqf2wf2xfvkffw+ed8vur3gquybq4x34zdf4aht</latexit> <latexit sha1_base64="xudvadcuhlkgwfqr9uwkndxf7yo=">aaadfhicfvjnb9naen2yr2k+wjhysygqpviu2qgjlqbktkihckeibaq4isabsbrketfahzdelv8eby7wu7ghrtz5i5xzpwhvcwkktz7em5l9450kk8jsgp5sefeuxrt+y+umf+v2nbv3tnfun1idg449rqu2/qqssqgwr4ik9jodkcyst5pzy0o/pudjhvbvazhhmiwpehpbgrzv77bmz+cjstfaboadcbnbjohwomlw0r3tnh7fy83zfbvxcdyoojcjyqggbjdy+nfumqm+gykohfsqoh0ws8nlsouyctdrxh1fwzk9xffaau0itvxmcnrm17wk/jc2ygnybfgileweil9cnmllqdqopg/gwianuxaaubhoa8dpwaan949qtyrazwgs2wzjynfoc0nc6a+1wyqlgqx5nz3nsna9xjvw0pwmlp5ubjfsekqavthcey7obwrvmme/qutbya0dmrvk26/dc6n2k4m429soqfu7dh+lzjp9ef7x6nk2hx+ge0qdx67kbyao1kaid8sirpoksxfylr7blgh9lzbbyenofhaid0+a+y9wo7pfhrjhrmui9pttsypwzt3gmwqf2wf2xfvkffw+ed8vur3gquybq4x34zdf4aht</latexit> <latexit sha1_base64="xudvadcuhlkgwfqr9uwkndxf7yo=">aaadfhicfvjnb9naen2yr2k+wjhysygqpviu2qgjlqbktkihckeibaq4isabsbrketfahzdelv8eby7wu7ghrtz5i5xzpwhvcwkktz7em5l9450kk8jsgp5sefeuxrt+y+umf+v2nbv3tnfun1idg449rqu2/qqssqgwr4ik9jodkcyst5pzy0o/pudjhvbvazhhmiwpehpbgrzv77bmz+cjstfaboadcbnbjohwomlw0r3tnh7fy83zfbvxcdyoojcjyqggbjdy+nfumqm+gykohfsqoh0ws8nlsouyctdrxh1fwzk9xffaau0itvxmcnrm17wk/jc2ygnybfgileweil9cnmllqdqopg/gwianuxaaubhoa8dpwaan949qtyrazwgs2wzjynfoc0nc6a+1wyqlgqx5nz3nsna9xjvw0pwmlp5ubjfsekqavthcey7obwrvmme/qutbya0dmrvk26/dc6n2k4m429soqfu7dh+lzjp9ef7x6nk2hx+ge0qdx67kbyao1kaid8sirpoksxfylr7blgh9lzbbyenofhaid0+a+y9wo7pfhrjhrmui9pttsypwzt3gmwqf2wf2xfvkffw+ed8vur3gquybq4x34zdf4aht</latexit> <latexit sha1_base64="xudvadcuhlkgwfqr9uwkndxf7yo=">aaadfhicfvjnb9naen2yr2k+wjhysygqpviu2qgjlqbktkihckeibaq4isabsbrketfahzdelv8eby7wu7ghrtz5i5xzpwhvcwkktz7em5l9450kk8jsgp5sefeuxrt+y+umf+v2nbv3tnfun1idg449rqu2/qqssqgwr4ik9jodkcyst5pzy0o/pudjhvbvazhhmiwpehpbgrzv77bmz+cjstfaboadcbnbjohwomlw0r3tnh7fy83zfbvxcdyoojcjyqggbjdy+nfumqm+gykohfsqoh0ws8nlsouyctdrxh1fwzk9xffaau0itvxmcnrm17wk/jc2ygnybfgileweil9cnmllqdqopg/gwianuxaaubhoa8dpwaan949qtyrazwgs2wzjynfoc0nc6a+1wyqlgqx5nz3nsna9xjvw0pwmlp5ubjfsekqavthcey7obwrvmme/qutbya0dmrvk26/dc6n2k4m429soqfu7dh+lzjp9ef7x6nk2hx+ge0qdx67kbyao1kaid8sirpoksxfylr7blgh9lzbbyenofhaid0+a+y9wo7pfhrjhrmui9pttsypwzt3gmwqf2wf2xfvkffw+ed8vur3gquybq4x34zdf4aht</latexit> <latexit sha1_base64="qqamk0eaqxemjspazau5fqosd54=">aaadf3icfvjnaxrbeo0dv+l4lejry+asilasmylosqimmiprcg4s2flctw/tppme7qg7jmzp5md48ki/xzt49egf8wzpzpxmrlgww+o9qurx1zwvulik45+d4mrva9dvrn0mb92+c/fe+sb9q6srw3hatdtmoaoluigckccjx6vbkdkjr1n+stgpztbyodv7mpu4kmcqxerwie8n04kb7pla5fxjejfux/oivkgyaf22iiotjc6vdkx5vaailshayrkxnhjgshcjdzhwfkvgouxx6kgcau3izt3x0aznxtfeg/8piubs5qohhbwzivozbdcpxdya8l/asklj85etqqwifb84afljihtudcaac4oc5mwd4ez4rxe/bt8g8mnqnzjpnrnktgfgwkxxvjke0r9af3nzayxynntekch1gjdysedkoa43u4tugfdn3d0eyjp0pib6gxx+vx3frt7aevnbtvfav5jqvtki+eovkla/ff+z6d3th+f/rs5nh+ko+qc0uo9l3pbosw1culu7tgmszw63rko/kcnyxqycwyf9jo4n7552t18sdmanpwsp2bzl2do2zfbyarswzjt7yd6zl8gn4gvwlfh+krp0fjupwcuch78bawid+q==</latexit> <latexit sha1_base64="qqamk0eaqxemjspazau5fqosd54=">aaadf3icfvjnaxrbeo0dv+l4lejry+asilasmylosqimmiprcg4s2flctw/tppme7qg7jmzp5md48ki/xzt49egf8wzpzpxmrlgww+o9qurx1zwvulik45+d4mrva9dvrn0mb92+c/fe+sb9q6srw3hatdtmoaoluigckccjx6vbkdkjr1n+stgpztbyodv7mpu4kmcqxerwie8n04kb7pla5fxjejfux/oivkgyaf22iiotjc6vdkx5vaailshayrkxnhjgshcjdzhwfkvgouxx6kgcau3izt3x0aznxtfeg/8piubs5qohhbwzivozbdcpxdya8l/asklj85etqqwifb84afljihtudcaac4oc5mwd4ez4rxe/bt8g8mnqnzjpnrnktgfgwkxxvjke0r9af3nzayxynntekch1gjdysedkoa43u4tugfdn3d0eyjp0pib6gxx+vx3frt7aevnbtvfav5jqvtki+eovkla/ff+z6d3th+f/rs5nh+ko+qc0uo9l3pbosw1culu7tgmszw63rko/kcnyxqycwyf9jo4n7552t18sdmanpwsp2bzl2do2zfbyarswzjt7yd6zl8gn4gvwlfh+krp0fjupwcuch78bawid+q==</latexit> <latexit sha1_base64="qqamk0eaqxemjspazau5fqosd54=">aaadf3icfvjnaxrbeo0dv+l4lejry+asilasmylosqimmiprcg4s2flctw/tppme7qg7jmzp5md48ki/xzt49egf8wzpzpxmrlgww+o9qurx1zwvulik45+d4mrva9dvrn0mb92+c/fe+sb9q6srw3hatdtmoaoluigckccjx6vbkdkjr1n+stgpztbyodv7mpu4kmcqxerwie8n04kb7pla5fxjejfux/oivkgyaf22iiotjc6vdkx5vaailshayrkxnhjgshcjdzhwfkvgouxx6kgcau3izt3x0aznxtfeg/8piubs5qohhbwzivozbdcpxdya8l/asklj85etqqwifb84afljihtudcaac4oc5mwd4ez4rxe/bt8g8mnqnzjpnrnktgfgwkxxvjke0r9af3nzayxynntekch1gjdysedkoa43u4tugfdn3d0eyjp0pib6gxx+vx3frt7aevnbtvfav5jqvtki+eovkla/ff+z6d3th+f/rs5nh+ko+qc0uo9l3pbosw1culu7tgmszw63rko/kcnyxqycwyf9jo4n7552t18sdmanpwsp2bzl2do2zfbyarswzjt7yd6zl8gn4gvwlfh+krp0fjupwcuch78bawid+q==</latexit> <latexit sha1_base64="qqamk0eaqxemjspazau5fqosd54=">aaadf3icfvjnaxrbeo0dv+l4lejry+asilasmylosqimmiprcg4s2flctw/tppme7qg7jmzp5md48ki/xzt49egf8wzpzpxmrlgww+o9qurx1zwvulik45+d4mrva9dvrn0mb92+c/fe+sb9q6srw3hatdtmoaoluigckccjx6vbkdkjr1n+stgpztbyodv7mpu4kmcqxerwie8n04kb7pla5fxjejfux/oivkgyaf22iiotjc6vdkx5vaailshayrkxnhjgshcjdzhwfkvgouxx6kgcau3izt3x0aznxtfeg/8piubs5qohhbwzivozbdcpxdya8l/asklj85etqqwifb84afljihtudcaac4oc5mwd4ez4rxe/bt8g8mnqnzjpnrnktgfgwkxxvjke0r9af3nzayxynntekch1gjdysedkoa43u4tugfdn3d0eyjp0pib6gxx+vx3frt7aevnbtvfav5jqvtki+eovkla/ff+z6d3th+f/rs5nh+ko+qc0uo9l3pbosw1culu7tgmszw63rko/kcnyxqycwyf9jo4n7552t18sdmanpwsp2bzl2do2zfbyarswzjt7yd6zl8gn4gvwlfh+krp0fjupwcuch78bawid+q==</latexit> <latexit sha1_base64="qqamk0eaqxemjspazau5fqosd54=">aaadf3icfvjnaxrbeo0dv+l4lejry+asilasmylosqimmiprcg4s2flctw/tppme7qg7jmzp5md48ki/xzt49egf8wzpzpxmrlgww+o9qurx1zwvulik45+d4mrva9dvrn0mb92+c/fe+sb9q6srw3hatdtmoaoluigckccjx6vbkdkjr1n+stgpztbyodv7mpu4kmcqxerwie8n04kb7pla5fxjejfux/oivkgyaf22iiotjc6vdkx5vaailshayrkxnhjgshcjdzhwfkvgouxx6kgcau3izt3x0aznxtfeg/8piubs5qohhbwzivozbdcpxdya8l/asklj85etqqwifb84afljihtudcaac4oc5mwd4ez4rxe/bt8g8mnqnzjpnrnktgfgwkxxvjke0r9af3nzayxynntekch1gjdysedkoa43u4tugfdn3d0eyjp0pib6gxx+vx3frt7aevnbtvfav5jqvtki+eovkla/ff+z6d3th+f/rs5nh+ko+qc0uo9l3pbosw1culu7tgmszw63rko/kcnyxqycwyf9jo4n7552t18sdmanpwsp2bzl2do2zfbyarswzjt7yd6zl8gn4gvwlfh+krp0fjupwcuch78bawid+q==</latexit> Discrete RVs An RV is discrete if it takes a finite number of values 1 P(x = x i ) 0, i E.g., uniform distribution: i P(x = x i )=1 P(x = x i ) 1 k P(x = x i )= 1 k, i E.g., Poisson distribution: x i exp P(x = x i ; )= x i! 1. technically: it must be countable both images are from: wikipedia.org

42 Continuous RVs An RV is continuous if f(x) 0, x X f(x)dx = 1 b P(a < x < b) = a f(x)dx f(x) is a probability density function (PDF) E.g., (continuous) uniform distribution: u(x = x i ; a, b) = 1 b a E.g., Gaussian distribution from: wikipedia.org

43 A few useful properties (shown for discrete variables for simplicity) Sum rule: P(x) = y P(x, y) Product rule: P(x, y) =P(x y)p(y) Chain rule: P(x 1,...,x n )=P(x 1 )P(x 2 x 1 )P(x 3 x 2, x 1 ) = P(x 1 ) P(x i x 1,...,x i 1 ) If x and y are independent: P(x, y) =P(x)P(y) Bayes Rule: P(Y X) = P(X Y)P(Y) P(X)

44 <latexit sha1_base64="js8ou7gs2gl0dn4g0ziansvjgmw=">aaadq3icfvlnihnbeo6mf2v8y+rrs2nysgaim6lorvjyxd2dqyuy3zvmcd2dstkkp3vorokzhnksn8adf30cn8gbidfbnmyutsiwdppxfvxvvduvp1jydikvne/s5stxr21dr9+4eev2ncb23rorm8ohy7xu5ixmfqrq0ewbes5sayyjjzzg071kp52bsukrt5in0e/ywimr4awdnwg8jsxet+z+3qbpadqyjbcrwhylpt0rw2f+u3zzrfamezay0yxiy0gjgxschdfnec5bkyztelbd+xknnc8sumgls7yxbin2c2zqcallpcospixp2rh6diqwgo0xi/5kuuoyir1p445cumavrhqsstzpyuezmjzyda0i/6x1mhw97rdcprmc4ucpjtjjudnqwhqodhcuuqomg+fqpxzc3jdqjxtllvjrkbly+swylvtjjley/x6lswjrqap8lz1nsna9hdvw4hwnk+s7kqvmmfbv78uhybm4ohh9brn8vv3esm7ti7fa6790v6v8fwzg2t4iwcl35c4lrmb8i/wv1uxverqp7isnhlmq1yk4upsiond7uiwj4+kglotuu8l1vdgejw87ydaj3zxq7j5f7swwuu8ekbyjyroysw7jmekstj6qj+qz+ej98r55370f565ebrlzj6yy9+s3yhau4g==</latexit> <latexit sha1_base64="js8ou7gs2gl0dn4g0ziansvjgmw=">aaadq3icfvlnihnbeo6mf2v8y+rrs2nysgaim6lorvjyxd2dqyuy3zvmcd2dstkkp3vorokzhnksn8adf30cn8gbidfbnmyutsiwdppxfvxvvduvp1jydikvne/s5stxr21dr9+4eev2ncb23rorm8ohy7xu5ixmfqrq0ewbes5sayyjjzzg071kp52bsukrt5in0e/ywimr4awdnwg8jsxet+z+3qbpadqyjbcrwhylpt0rw2f+u3zzrfamezay0yxiy0gjgxschdfnec5bkyztelbd+xknnc8sumgls7yxbin2c2zqcallpcospixp2rh6diqwgo0xi/5kuuoyir1p445cumavrhqsstzpyuezmjzyda0i/6x1mhw97rdcprmc4ucpjtjjudnqwhqodhcuuqomg+fqpxzc3jdqjxtllvjrkbly+swylvtjjley/x6lswjrqap8lz1nsna9hdvw4hwnk+s7kqvmmfbv78uhybm4ohh9brn8vv3esm7ti7fa6790v6v8fwzg2t4iwcl35c4lrmb8i/wv1uxverqp7isnhlmq1yk4upsiond7uiwj4+kglotuu8l1vdgejw87ydaj3zxq7j5f7swwuu8ekbyjyroysw7jmekstj6qj+qz+ej98r55370f565ebrlzj6yy9+s3yhau4g==</latexit> <latexit sha1_base64="js8ou7gs2gl0dn4g0ziansvjgmw=">aaadq3icfvlnihnbeo6mf2v8y+rrs2nysgaim6lorvjyxd2dqyuy3zvmcd2dstkkp3vorokzhnksn8adf30cn8gbidfbnmyutsiwdppxfvxvvduvp1jydikvne/s5stxr21dr9+4eev2ncb23rorm8ohy7xu5ixmfqrq0ewbes5sayyjjzzg071kp52bsukrt5in0e/ywimr4awdnwg8jsxet+z+3qbpadqyjbcrwhylpt0rw2f+u3zzrfamezay0yxiy0gjgxschdfnec5bkyztelbd+xknnc8sumgls7yxbin2c2zqcallpcospixp2rh6diqwgo0xi/5kuuoyir1p445cumavrhqsstzpyuezmjzyda0i/6x1mhw97rdcprmc4ucpjtjjudnqwhqodhcuuqomg+fqpxzc3jdqjxtllvjrkbly+swylvtjjley/x6lswjrqap8lz1nsna9hdvw4hwnk+s7kqvmmfbv78uhybm4ohh9brn8vv3esm7ti7fa6790v6v8fwzg2t4iwcl35c4lrmb8i/wv1uxverqp7isnhlmq1yk4upsiond7uiwj4+kglotuu8l1vdgejw87ydaj3zxq7j5f7swwuu8ekbyjyroysw7jmekstj6qj+qz+ej98r55370f565ebrlzj6yy9+s3yhau4g==</latexit> <latexit sha1_base64="js8ou7gs2gl0dn4g0ziansvjgmw=">aaadq3icfvlnihnbeo6mf2v8y+rrs2nysgaim6lorvjyxd2dqyuy3zvmcd2dstkkp3vorokzhnksn8adf30cn8gbidfbnmyutsiwdppxfvxvvduvp1jydikvne/s5stxr21dr9+4eev2ncb23rorm8ohy7xu5ixmfqrq0ewbes5sayyjjzzg071kp52bsukrt5in0e/ywimr4awdnwg8jsxet+z+3qbpadqyjbcrwhylpt0rw2f+u3zzrfamezay0yxiy0gjgxschdfnec5bkyztelbd+xknnc8sumgls7yxbin2c2zqcallpcospixp2rh6diqwgo0xi/5kuuoyir1p445cumavrhqsstzpyuezmjzyda0i/6x1mhw97rdcprmc4ucpjtjjudnqwhqodhcuuqomg+fqpxzc3jdqjxtllvjrkbly+swylvtjjley/x6lswjrqap8lz1nsna9hdvw4hwnk+s7kqvmmfbv78uhybm4ohh9brn8vv3esm7ti7fa6790v6v8fwzg2t4iwcl35c4lrmb8i/wv1uxverqp7isnhlmq1yk4upsiond7uiwj4+kglotuu8l1vdgejw87ydaj3zxq7j5f7swwuu8ekbyjyroysw7jmekstj6qj+qz+ej98r55370f565ebrlzj6yy9+s3yhau4g==</latexit> <latexit sha1_base64="js8ou7gs2gl0dn4g0ziansvjgmw=">aaadq3icfvlnihnbeo6mf2v8y+rrs2nysgaim6lorvjyxd2dqyuy3zvmcd2dstkkp3vorokzhnksn8adf30cn8gbidfbnmyutsiwdppxfvxvvduvp1jydikvne/s5stxr21dr9+4eev2ncb23rorm8ohy7xu5ixmfqrq0ewbes5sayyjjzzg071kp52bsukrt5in0e/ywimr4awdnwg8jsxet+z+3qbpadqyjbcrwhylpt0rw2f+u3zzrfamezay0yxiy0gjgxschdfnec5bkyztelbd+xknnc8sumgls7yxbin2c2zqcallpcospixp2rh6diqwgo0xi/5kuuoyir1p445cumavrhqsstzpyuezmjzyda0i/6x1mhw97rdcprmc4ucpjtjjudnqwhqodhcuuqomg+fqpxzc3jdqjxtllvjrkbly+swylvtjjley/x6lswjrqap8lz1nsna9hdvw4hwnk+s7kqvmmfbv78uhybm4ohh9brn8vv3esm7ti7fa6790v6v8fwzg2t4iwcl35c4lrmb8i/wv1uxverqp7isnhlmq1yk4upsiond7uiwj4+kglotuu8l1vdgejw87ydaj3zxq7j5f7swwuu8ekbyjyroysw7jmekstj6qj+qz+ej98r55370f565ebrlzj6yy9+s3yhau4g==</latexit> Moments Expectation: E[X] = P(x = x i )x i E[aX] =ae[x] i Variance: 2 = E[(X E[X]) 2 ] Covariance: Cov(X, Y) =E (X E[X])(Y E[Y]) Cov(X, Y) correlation: (x, y) = x y

45 What is the goal of ML?

46 A bit of historical context When I started my PhD very few in ML talked about AI Recent ML makes progress toward AI tasks Examples of AI tasks: translation, object recognition In that context: create a machine with human-like capacities? Or a machine that can help humans?

47 My (imperfect) view: Understand data through predictive models Understand the world through predictive models

48 Further Reading Prologue to The Master Algorithm Ch. 1 of Hastie et al. Math Preparation Ch.2 of Bishop Ch.2-3 of Goodfellow et al. Slightly more advanced:

49 Why did you enroll? Why are you taking this class? What do you hope to learn from it? How do you plan on using it? Any particular topics you would like us to cover?

[POLS 8500] Review of Linear Algebra, Probability and Information Theory

[POLS 8500] Review of Linear Algebra, Probability and Information Theory [POLS 8500] Review of Linear Algebra, Probability and Information Theory Professor Jason Anastasopoulos ljanastas@uga.edu January 12, 2017 For today... Basic linear algebra. Basic probability. Programming

More information

Today. Probability and Statistics. Linear Algebra. Calculus. Naïve Bayes Classification. Matrix Multiplication Matrix Inversion

Today. Probability and Statistics. Linear Algebra. Calculus. Naïve Bayes Classification. Matrix Multiplication Matrix Inversion Today Probability and Statistics Naïve Bayes Classification Linear Algebra Matrix Multiplication Matrix Inversion Calculus Vector Calculus Optimization Lagrange Multipliers 1 Classical Artificial Intelligence

More information

Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology

Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology M. Soleymani Fall 2012 Some slides have been adopted from Prof. H.R. Rabiee s and also Prof. R. Gutierrez-Osuna

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Review (Probability & Linear Algebra)

Review (Probability & Linear Algebra) Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint

More information

Linear Algebra Review. Vectors

Linear Algebra Review. Vectors Linear Algebra Review 9/4/7 Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka http://cs.gmu.edu/~kosecka/cs682.html Virginia de Sa (UCSD) Cogsci 8F Linear Algebra review Vectors

More information

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x =

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x = Linear Algebra Review Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1 x x = 2. x n Vectors of up to three dimensions are easy to diagram.

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is

More information

Vectors and Matrices Statistics with Vectors and Matrices

Vectors and Matrices Statistics with Vectors and Matrices Vectors and Matrices Statistics with Vectors and Matrices Lecture 3 September 7, 005 Analysis Lecture #3-9/7/005 Slide 1 of 55 Today s Lecture Vectors and Matrices (Supplement A - augmented with SAS proc

More information

CS 143 Linear Algebra Review

CS 143 Linear Algebra Review CS 143 Linear Algebra Review Stefan Roth September 29, 2003 Introductory Remarks This review does not aim at mathematical rigor very much, but instead at ease of understanding and conciseness. Please see

More information

Lecture 3 Linear Algebra Background

Lecture 3 Linear Algebra Background Lecture 3 Linear Algebra Background Dan Sheldon September 17, 2012 Motivation Preview of next class: y (1) w 0 + w 1 x (1) 1 + w 2 x (1) 2 +... + w d x (1) d y (2) w 0 + w 1 x (2) 1 + w 2 x (2) 2 +...

More information

1 Proof techniques. CS 224W Linear Algebra, Probability, and Proof Techniques

1 Proof techniques. CS 224W Linear Algebra, Probability, and Proof Techniques 1 Proof techniques Here we will learn to prove universal mathematical statements, like the square of any odd number is odd. It s easy enough to show that this is true in specific cases for example, 3 2

More information

PROBABILITY THEORY REVIEW

PROBABILITY THEORY REVIEW PROBABILITY THEORY REVIEW CMPUT 466/551 Martha White Fall, 2017 REMINDERS Assignment 1 is due on September 28 Thought questions 1 are due on September 21 Chapters 1-4, about 40 pages If you are printing,

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Introduction to Probabilistic Methods Varun Chandola Computer Science & Engineering State University of New York at Buffalo Buffalo, NY, USA chandola@buffalo.edu Chandola@UB

More information

A Introduction to Matrix Algebra and the Multivariate Normal Distribution

A Introduction to Matrix Algebra and the Multivariate Normal Distribution A Introduction to Matrix Algebra and the Multivariate Normal Distribution PRE 905: Multivariate Analysis Spring 2014 Lecture 6 PRE 905: Lecture 7 Matrix Algebra and the MVN Distribution Today s Class An

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Review of Basic Probability The fundamentals, random variables, probability distributions Probability mass/density functions

More information

Large Scale Data Analysis Using Deep Learning

Large Scale Data Analysis Using Deep Learning Large Scale Data Analysis Using Deep Learning Linear Algebra U Kang Seoul National University U Kang 1 In This Lecture Overview of linear algebra (but, not a comprehensive survey) Focused on the subset

More information

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner Fundamentals CS 281A: Statistical Learning Theory Yangqing Jia Based on tutorial slides by Lester Mackey and Ariel Kleiner August, 2011 Outline 1 Probability 2 Statistics 3 Linear Algebra 4 Optimization

More information

Linear Algebra (Review) Volker Tresp 2018

Linear Algebra (Review) Volker Tresp 2018 Linear Algebra (Review) Volker Tresp 2018 1 Vectors k, M, N are scalars A one-dimensional array c is a column vector. Thus in two dimensions, ( ) c1 c = c 2 c i is the i-th component of c c T = (c 1, c

More information

Elementary maths for GMT

Elementary maths for GMT Elementary maths for GMT Linear Algebra Part 2: Matrices, Elimination and Determinant m n matrices The system of m linear equations in n variables x 1, x 2,, x n a 11 x 1 + a 12 x 2 + + a 1n x n = b 1

More information

Review of Basic Probability Theory

Review of Basic Probability Theory Review of Basic Probability Theory James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 35 Review of Basic Probability Theory

More information

CS 246 Review of Linear Algebra 01/17/19

CS 246 Review of Linear Algebra 01/17/19 1 Linear algebra In this section we will discuss vectors and matrices. We denote the (i, j)th entry of a matrix A as A ij, and the ith entry of a vector as v i. 1.1 Vectors and vector operations A vector

More information

Introduction to Stochastic Processes

Introduction to Stochastic Processes Stat251/551 (Spring 2017) Stochastic Processes Lecture: 1 Introduction to Stochastic Processes Lecturer: Sahand Negahban Scribe: Sahand Negahban 1 Organization Issues We will use canvas as the course webpage.

More information

01 Probability Theory and Statistics Review

01 Probability Theory and Statistics Review NAVARCH/EECS 568, ROB 530 - Winter 2018 01 Probability Theory and Statistics Review Maani Ghaffari January 08, 2018 Last Time: Bayes Filters Given: Stream of observations z 1:t and action data u 1:t Sensor/measurement

More information

Recitation. Shuang Li, Amir Afsharinejad, Kaushik Patnaik. Machine Learning CS 7641,CSE/ISYE 6740, Fall 2014

Recitation. Shuang Li, Amir Afsharinejad, Kaushik Patnaik. Machine Learning CS 7641,CSE/ISYE 6740, Fall 2014 Recitation Shuang Li, Amir Afsharinejad, Kaushik Patnaik Machine Learning CS 7641,CSE/ISYE 6740, Fall 2014 Probability and Statistics 2 Basic Probability Concepts A sample space S is the set of all possible

More information

Knowledge Discovery and Data Mining 1 (VO) ( )

Knowledge Discovery and Data Mining 1 (VO) ( ) Knowledge Discovery and Data Mining 1 (VO) (707.003) Review of Linear Algebra Denis Helic KTI, TU Graz Oct 9, 2014 Denis Helic (KTI, TU Graz) KDDM1 Oct 9, 2014 1 / 74 Big picture: KDDM Probability Theory

More information

Review of Linear Algebra

Review of Linear Algebra Review of Linear Algebra Definitions An m n (read "m by n") matrix, is a rectangular array of entries, where m is the number of rows and n the number of columns. 2 Definitions (Con t) A is square if m=

More information

Linear Algebra (Review) Volker Tresp 2017

Linear Algebra (Review) Volker Tresp 2017 Linear Algebra (Review) Volker Tresp 2017 1 Vectors k is a scalar (a number) c is a column vector. Thus in two dimensions, c = ( c1 c 2 ) (Advanced: More precisely, a vector is defined in a vector space.

More information

Linear Algebra for Machine Learning. Sargur N. Srihari

Linear Algebra for Machine Learning. Sargur N. Srihari Linear Algebra for Machine Learning Sargur N. srihari@cedar.buffalo.edu 1 Overview Linear Algebra is based on continuous math rather than discrete math Computer scientists have little experience with it

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Algorithms for Uncertainty Quantification Tobias Neckel, Ionuț-Gabriel Farcaș Lehrstuhl Informatik V Summer Semester 2017 Lecture 2: Repetition of probability theory and statistics Example: coin flip Example

More information

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra. DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1

More information

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question Linear regression Reminders Thought questions should be submitted on eclass Please list the section related to the thought question If it is a more general, open-ended question not exactly related to a

More information

STAT 430/510 Probability Lecture 7: Random Variable and Expectation

STAT 430/510 Probability Lecture 7: Random Variable and Expectation STAT 430/510 Probability Lecture 7: Random Variable and Expectation Pengyuan (Penelope) Wang June 2, 2011 Review Properties of Probability Conditional Probability The Law of Total Probability Bayes Formula

More information

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3 Probability Paul Schrimpf January 23, 2018 Contents 1 Definitions 2 2 Properties 3 3 Random variables 4 3.1 Discrete........................................... 4 3.2 Continuous.........................................

More information

Introduction to Machine Learning

Introduction to Machine Learning What does this mean? Outline Contents Introduction to Machine Learning Introduction to Probabilistic Methods Varun Chandola December 26, 2017 1 Introduction to Probability 1 2 Random Variables 3 3 Bayes

More information

ME751 Advanced Computational Multibody Dynamics

ME751 Advanced Computational Multibody Dynamics ME751 Advanced Computational Multibody Dynamics Review: Elements of Linear Algebra & Calculus September 9, 2016 Dan Negrut University of Wisconsin-Madison Quote of the day If you can't convince them, confuse

More information

Linear Algebra in Computer Vision. Lecture2: Basic Linear Algebra & Probability. Vector. Vector Operations

Linear Algebra in Computer Vision. Lecture2: Basic Linear Algebra & Probability. Vector. Vector Operations Linear Algebra in Computer Vision CSED441:Introduction to Computer Vision (2017F Lecture2: Basic Linear Algebra & Probability Bohyung Han CSE, POSTECH bhhan@postech.ac.kr Mathematics in vector space Linear

More information

Review of Probability. CS1538: Introduction to Simulations

Review of Probability. CS1538: Introduction to Simulations Review of Probability CS1538: Introduction to Simulations Probability and Statistics in Simulation Why do we need probability and statistics in simulation? Needed to validate the simulation model Needed

More information

Lecture 3: Matrix and Matrix Operations

Lecture 3: Matrix and Matrix Operations Lecture 3: Matrix and Matrix Operations Representation, row vector, column vector, element of a matrix. Examples of matrix representations Tables and spreadsheets Scalar-Matrix operation: Scaling a matrix

More information

Deep Learning Book Notes Chapter 2: Linear Algebra

Deep Learning Book Notes Chapter 2: Linear Algebra Deep Learning Book Notes Chapter 2: Linear Algebra Compiled By: Abhinaba Bala, Dakshit Agrawal, Mohit Jain Section 2.1: Scalars, Vectors, Matrices and Tensors Scalar Single Number Lowercase names in italic

More information

Intro to Probability. Andrei Barbu

Intro to Probability. Andrei Barbu Intro to Probability Andrei Barbu Some problems Some problems A means to capture uncertainty Some problems A means to capture uncertainty You have data from two sources, are they different? Some problems

More information

Lecture Note 1: Probability Theory and Statistics

Lecture Note 1: Probability Theory and Statistics Univ. of Michigan - NAME 568/EECS 568/ROB 530 Winter 2018 Lecture Note 1: Probability Theory and Statistics Lecturer: Maani Ghaffari Jadidi Date: April 6, 2018 For this and all future notes, if you would

More information

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30 Problem Set 2 MAS 622J/1.126J: Pattern Recognition and Analysis Due: 5:00 p.m. on September 30 [Note: All instructions to plot data or write a program should be carried out using Matlab. In order to maintain

More information

Probability Theory for Machine Learning. Chris Cremer September 2015

Probability Theory for Machine Learning. Chris Cremer September 2015 Probability Theory for Machine Learning Chris Cremer September 2015 Outline Motivation Probability Definitions and Rules Probability Distributions MLE for Gaussian Parameter Estimation MLE and Least Squares

More information

a b = a T b = a i b i (1) i=1 (Geometric definition) The dot product of two Euclidean vectors a and b is defined by a b = a b cos(θ a,b ) (2)

a b = a T b = a i b i (1) i=1 (Geometric definition) The dot product of two Euclidean vectors a and b is defined by a b = a b cos(θ a,b ) (2) This is my preperation notes for teaching in sections during the winter 2018 quarter for course CSE 446. Useful for myself to review the concepts as well. More Linear Algebra Definition 1.1 (Dot Product).

More information

Discrete Mathematics and Probability Theory Fall 2015 Lecture 21

Discrete Mathematics and Probability Theory Fall 2015 Lecture 21 CS 70 Discrete Mathematics and Probability Theory Fall 205 Lecture 2 Inference In this note we revisit the problem of inference: Given some data or observations from the world, what can we infer about

More information

SEBASTIAN RASCHKA. Introduction to Artificial Neural Networks and Deep Learning. with Applications in Python

SEBASTIAN RASCHKA. Introduction to Artificial Neural Networks and Deep Learning. with Applications in Python SEBASTIAN RASCHKA Introduction to Artificial Neural Networks and Deep Learning with Applications in Python Introduction to Artificial Neural Networks with Applications in Python Sebastian Raschka Last

More information

Review of Basic Concepts in Linear Algebra

Review of Basic Concepts in Linear Algebra Review of Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University September 7, 2017 Math 565 Linear Algebra Review September 7, 2017 1 / 40 Numerical Linear Algebra

More information

Statistics for scientists and engineers

Statistics for scientists and engineers Statistics for scientists and engineers February 0, 006 Contents Introduction. Motivation - why study statistics?................................... Examples..................................................3

More information

Mathematical foundations - linear algebra

Mathematical foundations - linear algebra Mathematical foundations - linear algebra Andrea Passerini passerini@disi.unitn.it Machine Learning Vector space Definition (over reals) A set X is called a vector space over IR if addition and scalar

More information

Introduction to Computational Finance and Financial Econometrics Matrix Algebra Review

Introduction to Computational Finance and Financial Econometrics Matrix Algebra Review You can t see this text! Introduction to Computational Finance and Financial Econometrics Matrix Algebra Review Eric Zivot Spring 2015 Eric Zivot (Copyright 2015) Matrix Algebra Review 1 / 54 Outline 1

More information

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Linear Algebra & Geometry why is linear algebra useful in computer vision? Linear Algebra & Geometry why is linear algebra useful in computer vision? References: -Any book on linear algebra! -[HZ] chapters 2, 4 Some of the slides in this lecture are courtesy to Prof. Octavia

More information

ECS130 Scientific Computing. Lecture 1: Introduction. Monday, January 7, 10:00 10:50 am

ECS130 Scientific Computing. Lecture 1: Introduction. Monday, January 7, 10:00 10:50 am ECS130 Scientific Computing Lecture 1: Introduction Monday, January 7, 10:00 10:50 am About Course: ECS130 Scientific Computing Professor: Zhaojun Bai Webpage: http://web.cs.ucdavis.edu/~bai/ecs130/ Today

More information

Probability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014

Probability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014 Probability Machine Learning and Pattern Recognition Chris Williams School of Informatics, University of Edinburgh August 2014 (All of the slides in this course have been adapted from previous versions

More information

Jeffrey D. Ullman Stanford University

Jeffrey D. Ullman Stanford University Jeffrey D. Ullman Stanford University 2 Often, our data can be represented by an m-by-n matrix. And this matrix can be closely approximated by the product of two matrices that share a small common dimension

More information

Linear Algebra V = T = ( 4 3 ).

Linear Algebra V = T = ( 4 3 ). Linear Algebra Vectors A column vector is a list of numbers stored vertically The dimension of a column vector is the number of values in the vector W is a -dimensional column vector and V is a 5-dimensional

More information

Matrices and Vectors. Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A =

Matrices and Vectors. Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A = 30 MATHEMATICS REVIEW G A.1.1 Matrices and Vectors Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A = a 11 a 12... a 1N a 21 a 22... a 2N...... a M1 a M2... a MN A matrix can

More information

3 Multiple Discrete Random Variables

3 Multiple Discrete Random Variables 3 Multiple Discrete Random Variables 3.1 Joint densities Suppose we have a probability space (Ω, F,P) and now we have two discrete random variables X and Y on it. They have probability mass functions f

More information

Linear Algebra and Matrix Inversion

Linear Algebra and Matrix Inversion Jim Lambers MAT 46/56 Spring Semester 29- Lecture 2 Notes These notes correspond to Section 63 in the text Linear Algebra and Matrix Inversion Vector Spaces and Linear Transformations Matrices are much

More information

Chapter 1: Systems of linear equations and matrices. Section 1.1: Introduction to systems of linear equations

Chapter 1: Systems of linear equations and matrices. Section 1.1: Introduction to systems of linear equations Chapter 1: Systems of linear equations and matrices Section 1.1: Introduction to systems of linear equations Definition: A linear equation in n variables can be expressed in the form a 1 x 1 + a 2 x 2

More information

Grundlagen der Künstlichen Intelligenz

Grundlagen der Künstlichen Intelligenz Grundlagen der Künstlichen Intelligenz Uncertainty & Probabilities & Bandits Daniel Hennes 16.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Uncertainty Probability

More information

Probability and Information Theory. Sargur N. Srihari

Probability and Information Theory. Sargur N. Srihari Probability and Information Theory Sargur N. srihari@cedar.buffalo.edu 1 Topics in Probability and Information Theory Overview 1. Why Probability? 2. Random Variables 3. Probability Distributions 4. Marginal

More information

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas Dimensionality Reduction: PCA Nicholas Ruozzi University of Texas at Dallas Eigenvalues λ is an eigenvalue of a matrix A R n n if the linear system Ax = λx has at least one non-zero solution If Ax = λx

More information

Week 12-13: Discrete Probability

Week 12-13: Discrete Probability Week 12-13: Discrete Probability November 21, 2018 1 Probability Space There are many problems about chances or possibilities, called probability in mathematics. When we roll two dice there are possible

More information

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016 Probabilistic classification CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2016 Topics Probabilistic approach Bayes decision theory Generative models Gaussian Bayes classifier

More information

Human-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015

Human-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015 Probability Refresher Kai Arras, University of Freiburg Winter term 2014/2015 Probability Refresher Introduction to Probability Random variables Joint distribution Marginalization Conditional probability

More information

Statistics 202: Data Mining. c Jonathan Taylor. Week 2 Based in part on slides from textbook, slides of Susan Holmes. October 3, / 1

Statistics 202: Data Mining. c Jonathan Taylor. Week 2 Based in part on slides from textbook, slides of Susan Holmes. October 3, / 1 Week 2 Based in part on slides from textbook, slides of Susan Holmes October 3, 2012 1 / 1 Part I Other datatypes, preprocessing 2 / 1 Other datatypes Document data You might start with a collection of

More information

Machine Learning. B. Unsupervised Learning B.2 Dimensionality Reduction. Lars Schmidt-Thieme, Nicolas Schilling

Machine Learning. B. Unsupervised Learning B.2 Dimensionality Reduction. Lars Schmidt-Thieme, Nicolas Schilling Machine Learning B. Unsupervised Learning B.2 Dimensionality Reduction Lars Schmidt-Thieme, Nicolas Schilling Information Systems and Machine Learning Lab (ISMLL) Institute for Computer Science University

More information

Part I. Other datatypes, preprocessing. Other datatypes. Other datatypes. Week 2 Based in part on slides from textbook, slides of Susan Holmes

Part I. Other datatypes, preprocessing. Other datatypes. Other datatypes. Week 2 Based in part on slides from textbook, slides of Susan Holmes Week 2 Based in part on slides from textbook, slides of Susan Holmes Part I Other datatypes, preprocessing October 3, 2012 1 / 1 2 / 1 Other datatypes Other datatypes Document data You might start with

More information

Matrix Operations. Linear Combination Vector Algebra Angle Between Vectors Projections and Reflections Equality of matrices, Augmented Matrix

Matrix Operations. Linear Combination Vector Algebra Angle Between Vectors Projections and Reflections Equality of matrices, Augmented Matrix Linear Combination Vector Algebra Angle Between Vectors Projections and Reflections Equality of matrices, Augmented Matrix Matrix Operations Matrix Addition and Matrix Scalar Multiply Matrix Multiply Matrix

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 89 Part II

More information

L14. 1 Lecture 14: Crash Course in Probability. July 7, Overview and Objectives. 1.2 Part 1: Probability

L14. 1 Lecture 14: Crash Course in Probability. July 7, Overview and Objectives. 1.2 Part 1: Probability L14 July 7, 2017 1 Lecture 14: Crash Course in Probability CSCI 1360E: Foundations for Informatics and Analytics 1.1 Overview and Objectives To wrap up the fundamental concepts of core data science, as

More information

Review of Linear Algebra

Review of Linear Algebra Review of Linear Algebra Dr Gerhard Roth COMP 40A Winter 05 Version Linear algebra Is an important area of mathematics It is the basis of computer vision Is very widely taught, and there are many resources

More information

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Bastian Steder

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Bastian Steder Introduction to Mobile Robotics Compact Course on Linear Algebra Wolfram Burgard, Bastian Steder Reference Book Thrun, Burgard, and Fox: Probabilistic Robotics Vectors Arrays of numbers Vectors represent

More information

What is the Matrix? Linear control of finite-dimensional spaces. November 28, 2010

What is the Matrix? Linear control of finite-dimensional spaces. November 28, 2010 What is the Matrix? Linear control of finite-dimensional spaces. November 28, 2010 Scott Strong sstrong@mines.edu Colorado School of Mines What is the Matrix? p. 1/20 Overview/Keywords/References Advanced

More information

Tensor Decompositions for Machine Learning. G. Roeder 1. UBC Machine Learning Reading Group, June University of British Columbia

Tensor Decompositions for Machine Learning. G. Roeder 1. UBC Machine Learning Reading Group, June University of British Columbia Network Feature s Decompositions for Machine Learning 1 1 Department of Computer Science University of British Columbia UBC Machine Learning Group, June 15 2016 1/30 Contact information Network Feature

More information

CPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia

CPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia CPSC340 Probability Nando de Freitas September, 2012 University of British Columbia Outline of the lecture This lecture is intended at revising probabilistic concepts that play an important role in the

More information

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R In probabilistic models, a random variable is a variable whose possible values are numerical outcomes of a random phenomenon. As a function or a map, it maps from an element (or an outcome) of a sample

More information

Regression. Oscar García

Regression. Oscar García Regression Oscar García Regression methods are fundamental in Forest Mensuration For a more concise and general presentation, we shall first review some matrix concepts 1 Matrices An order n m matrix is

More information

An Introduction to Matrix Algebra

An Introduction to Matrix Algebra An Introduction to Matrix Algebra EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #8 EPSY 905: Matrix Algebra In This Lecture An introduction to matrix algebra Ø Scalars, vectors, and matrices

More information

Gaussian random variables inr n

Gaussian random variables inr n Gaussian vectors Lecture 5 Gaussian random variables inr n One-dimensional case One-dimensional Gaussian density with mean and standard deviation (called N, ): fx x exp. Proposition If X N,, then ax b

More information

CS100: DISCRETE STRUCTURES. Lecture 3 Matrices Ch 3 Pages:

CS100: DISCRETE STRUCTURES. Lecture 3 Matrices Ch 3 Pages: CS100: DISCRETE STRUCTURES Lecture 3 Matrices Ch 3 Pages: 246-262 Matrices 2 Introduction DEFINITION 1: A matrix is a rectangular array of numbers. A matrix with m rows and n columns is called an m x n

More information

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples Machine Learning Bayes Basics Bayes, probabilities, Bayes theorem & examples Marc Toussaint U Stuttgart So far: Basic regression & classification methods: Features + Loss + Regularization & CV All kinds

More information

Linear Algebra Section 2.6 : LU Decomposition Section 2.7 : Permutations and transposes Wednesday, February 13th Math 301 Week #4

Linear Algebra Section 2.6 : LU Decomposition Section 2.7 : Permutations and transposes Wednesday, February 13th Math 301 Week #4 Linear Algebra Section. : LU Decomposition Section. : Permutations and transposes Wednesday, February 1th Math 01 Week # 1 The LU Decomposition We learned last time that we can factor a invertible matrix

More information

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

STAT2201. Analysis of Engineering & Scientific Data. Unit 3 STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random

More information

Phys 201. Matrices and Determinants

Phys 201. Matrices and Determinants Phys 201 Matrices and Determinants 1 1.1 Matrices 1.2 Operations of matrices 1.3 Types of matrices 1.4 Properties of matrices 1.5 Determinants 1.6 Inverse of a 3 3 matrix 2 1.1 Matrices A 2 3 7 =! " 1

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 4 for Applied Multivariate Analysis Outline 1 Eigen values and eigen vectors Characteristic equation Some properties of eigendecompositions

More information

Basic Concepts in Linear Algebra

Basic Concepts in Linear Algebra Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University February 2, 2015 Grady B Wright Linear Algebra Basics February 2, 2015 1 / 39 Numerical Linear Algebra Linear

More information

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for 1 Logistics Notes for 2016-08-29 General announcement: we are switching from weekly to bi-weekly homeworks (mostly because the course is much bigger than planned). If you want to do HW but are not formally

More information

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize

More information

Review of Probability Theory

Review of Probability Theory Review of Probability Theory Arian Maleki and Tom Do Stanford University Probability theory is the study of uncertainty Through this class, we will be relying on concepts from probability theory for deriving

More information

Quantum Computing Lecture 2. Review of Linear Algebra

Quantum Computing Lecture 2. Review of Linear Algebra Quantum Computing Lecture 2 Review of Linear Algebra Maris Ozols Linear algebra States of a quantum system form a vector space and their transformations are described by linear operators Vector spaces

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012 Machine Learning CSE6740/CS7641/ISYE6740, Fall 2012 Principal Components Analysis Le Song Lecture 22, Nov 13, 2012 Based on slides from Eric Xing, CMU Reading: Chap 12.1, CB book 1 2 Factor or Component

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 1: Course Overview & Matrix-Vector Multiplication Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 20 Outline 1 Course

More information

Singular Value Decomposition and Principal Component Analysis (PCA) I

Singular Value Decomposition and Principal Component Analysis (PCA) I Singular Value Decomposition and Principal Component Analysis (PCA) I Prof Ned Wingreen MOL 40/50 Microarray review Data per array: 0000 genes, I (green) i,i (red) i 000 000+ data points! The expression

More information

L3: Review of linear algebra and MATLAB

L3: Review of linear algebra and MATLAB L3: Review of linear algebra and MATLAB Vector and matrix notation Vectors Matrices Vector spaces Linear transformations Eigenvalues and eigenvectors MATLAB primer CSCE 666 Pattern Analysis Ricardo Gutierrez-Osuna

More information

Linear Algebra Massoud Malek

Linear Algebra Massoud Malek CSUEB Linear Algebra Massoud Malek Inner Product and Normed Space In all that follows, the n n identity matrix is denoted by I n, the n n zero matrix by Z n, and the zero vector by θ n An inner product

More information

Statistical Learning Theory

Statistical Learning Theory Statistical Learning Theory Part I : Mathematical Learning Theory (1-8) By Sumio Watanabe, Evaluation : Report Part II : Information Statistical Mechanics (9-15) By Yoshiyuki Kabashima, Evaluation : Report

More information