Financial Informatics XVII: Unsupervised Learning Khurshid Ahmad, Professor of Computer Science, Department of Computer Science Trinity College, Dublin-, IRELAND November 9 th, 8. https://www.cs.tcd.ie/khurshid.ahmad/teaching.html
Preamble Neural Networks 'learn' by adapting in accordance with a training regimen: Five key algorithms. ERROR-CORRECTION OR PERFORMANCE LEARNING HEBBIAN OR COINCIDENCE LEARNING BOLTZMAN LEARNING STOCHASTIC NET LEARNING COMPETITIVE LEARNING FILTER LEARNING GROSSBERG'S NETS
Preamble Neural Networks 'learn' by adapting in accordance with a training regimen: Five key algorithms. California sought to have the license of one of the largest auditing firms Ernst & Young removed because of their role in the well-publicized collapse of Lincoln Savings & Loan Association. Further, regulators could use a bankruptcy 3
ANN Learning Algorithms ENVIRONMENT Vector describing the environment TEACHER LEARNING SYSTEM Desired response Actual Σ Response - + Error Signal 4
ANN Learning Algorithms ENVIRONMENT Vector describing state of the environment LEARNING SYSTEM 5
ANN Learning Algorithms State-vector input ENVIRONMENT CRITIC Primary Reinforcement Heuristic Reinforcement Actions LEARNING SYSTEM 6
Hebbian Learning DONALD HEBB, a Canadian psychologist, was interested in investigating PLAUSIBLE MECHANISMS FOR LEARNING AT THE CELLULAR LEVELS IN THE BRAIN. see for eample, Donald Hebb's 949 The Organisation of Behaviour. New York: Wiley 7
Hebbian Learning HEBB s POSTULATE: When an aon of cell A is near enough to ecite a cell B and repeatedly or persistently takes part in firing it, some growth process or metabolic changes take place in one or both cells such that A's efficiency as one of the cells firing B, is increased. 8
Hebbian Learning Hebbian Learning laws CAUSE WEIGHT CHANGES IN RESPONSE TO EVENTS WITHIN A PROCESSING ELEMENT THAT HAPPEN SIMULTANEOUSLY. THE LEARNING LAWS IN THIS CATEGORY ARE CHARACTERIZED BY THEIR COMPLETELY LOCAL - BOTH IN SPACE AND IN TIME-CHARACTER. 9
Hebbian Learning LINEAR ASSOCIATOR: A substrate for Hebbian Learning Systems y y y y y 3 y 3 Output y w w w 3 w w w 3 w 3 w 3 w 33 Input 3
Hebbian Learning A simple form of Hebbian Learning Rule new old w y k j k j k w + η j, where h is the so-called rate of learning and and y are the input and output respectively. This rule is also called the activity product rule.
Hebbian Learning A simple form of Hebbian Learning Rule If there are "m" pairs of vectors, v v L v v to be stored in a network,then the training sequence will change the weight-matri, w, from its initial value of ZERO to its final state by simply adding together all of the incremental weight change caused by the "m" applications of Hebb's law: v v v + v v +L+
3 Hebbian Learning A worked eample: Consider the Hebbian learning of three input vectors:.5 ;.5.5 ;.5 3 in a network with the following initial weight vector:.5 w
4 Hebbian Learning A worked eample: Consider the Hebbian learning of three input vectors:.5 ;.5.5 ;.5 3 in a network with the following initial weight vector:.5 w
5 Hebbian Learning A worked eample: Consider the Hebbian learning of three input vectors:.5 ;.5.5 ;.5 3 in a network with the following initial weight vector:.5 w
6 Hebbian Learning A worked eample: Consider the Hebbian learning of three input vectors:.5 ;.5.5 ;.5 3 in a network with the following initial weight vector:.5 w
Hebbian Learning The worked eample shows that with discrete fnet and η, the weight change involves ADDING or SUBTRACTING the entire input pattern vectors to and from the weight vectors respectively. Consider the case when the activation function is a continuous one. For eample, take the bipolar continuous activation function: f net net + ep λ * ; where λ. 7
Hebbian Learning The worked eample shows that with bipolar continuous activation function indicates that the weight adjustments are tapered for the continuous function but are generally in the same direction: Vector Discrete Bipolar fnet Continuous Bipolar fnet.95 - -.77 3 - -.93 8
9 Hebbian Learning The details of the computation for the three steps with a discrete bipolar activation function are presented below in the notes pages. The input vectors and the initial weight vector are:.5 ;.5.5 ;.5 3 5. w
Hebbian Learning The details of the computation for the three steps with a continuous bipolar activation function are presented below in the notes pages. The input vectors and the initial weight vector are:.5 ;.5.5 ;.5 3 5. w
Hebbian Learning Recall that the simple form of Hebbian learning law suggests that the repeated application of the presynaptic signal j leads to an increase in y k and therefore eponential growth that finally drives the synaptic connection into saturation. new old w w y k j k j k j k w η A number of researchers have proposed ways in which such saturation can be avoided. Sejnowski has suggested that j w η k j y k y j, where the time averaged value of y the time averaged value of y j k ; &.
Hebbian Learning The Hebbian synapse described below is said to involve the use of POSITIVE FEEDBACK. new old w w y k j k j k j k w η j
Hebbian Learning What is the principal limitation of this simplest form of learning? new old w w y k j k j k j k w η j The above equation suggests that the repeated application of the input signal leads to an increase in, and therefore eponential growth that finally drives the synaptic connection into saturation. At that point of saturation no information cannot be stored in the synapse and selectivity will be lost. Graphically the relationship with the postsynaptic activityis a simple one: it is linear with a slope. 3
Hebbian Learning The so-called covariance hypothesis was introduced to deal with the principal limitation of the simplest form of Hebbian learning and is given as w kj k j n η y n y n where and denote the time-averaged values of the pre-synaptic and postsynaptic signals. 4
5 Hebbian Learning If we epand the above equation: the last term in the above equation is a constant and the first term is what we have for the simplest Hebbian learning rule: y n y n y n n y n w k j j k kj η η η η + y n y n y n w n w k j kj kj Simple Modified η η η + n y n y n w j k kj η
Hebbian Learning Graphically the relationship w ij with the postsynaptic activity y k is still linear but with a slope η j n and the assurance that the straight line curve changes its rate of change at and the minimum value of the weight change w ij is η y j n 6