A Theory of Universal AI

Similar documents
Framework for functional tree simulation applied to 'golden delicious' apple trees

ETIKA V PROFESII PSYCHOLÓGA


An Example file... log.txt

Matrices and Determinants

Planning for Reactive Behaviors in Hide and Seek

An Introduction to Optimal Control Applied to Disease Models

LA PRISE DE CALAIS. çoys, çoys, har - dis. çoys, dis. tons, mantz, tons, Gas. c est. à ce. C est à ce. coup, c est à ce

THE APPENDIX FOR THE PAPER: INCENTIVE-AWARE JOB ALLOCATION FOR ONLINE SOCIAL CLOUDS. Appendix A 1

Optimal Control of PDEs


QUESTIONS ON QUARKONIUM PRODUCTION IN NUCLEAR COLLISIONS

Constructive Decision Theory

Principal Secretary to Government Haryana, Town & Country Planning Department, Haryana, Chandigarh.

" #$ P UTS W U X [ZY \ Z _ `a \ dfe ih j mlk n p q sr t u s q e ps s t x q s y i_z { U U z W } y ~ y x t i e l US T { d ƒ ƒ ƒ j s q e uˆ ps i ˆ p q y

OC330C. Wiring Diagram. Recommended PKH- P35 / P50 GALH PKA- RP35 / RP50. Remarks (Drawing No.) No. Parts No. Parts Name Specifications

Vectors. Teaching Learning Point. Ç, where OP. l m n

Redoing the Foundations of Decision Theory

AN IDENTIFICATION ALGORITHM FOR ARMAX SYSTEMS

The University of Bath School of Management is one of the oldest established management schools in Britain. It enjoys an international reputation for

A Brief Introduction to Markov Chains and Hidden Markov Models

236 Chapter 4 Applications of Derivatives

Machine Learning: Logistic Regression. Lecture 04

CS 331: Artificial Intelligence Propositional Logic 2. Review of Last Time

Examination paper for TFY4240 Electromagnetic theory

Loop parallelization using compiler analysis

Laplace Examples, Inverse, Rational Form

Books. Book Collection Editor. Editor. Name Name Company. Title "SA" A tree pattern. A database instance

Max. Input Power (W) Input Current (Arms) Dimming. Enclosure

Pharmacological and genomic profiling identifies NF-κB targeted treatment strategies for mantle cell lymphoma

Front-end. Organization of a Modern Compiler. Middle1. Middle2. Back-end. converted to control flow) Representation

SOLAR MORTEC INDUSTRIES.

General Neoclassical Closure Theory: Diagonalizing the Drift Kinetic Operator

A Beamforming Method for Blind Calibration of Time-Interleaved A/D Converters

Towards a numerical solution of the "1/2 vs. 3/2" puzzle

Juan Juan Salon. EH National Bank. Sandwich Shop Nail Design. OSKA Beverly. Chase Bank. Marina Rinaldi. Orogold. Mariposa.

February 17, 2015 REQUEST FOR PROPOSALS. For Columbus Metropolitan Library. Issued by: Purchasing Division 96 S. Grant Ave. Columbus, OH 43215

SCOTT PLUMMER ASHTON ANTONETTI

Damping Ring Requirements for 3 TeV CLIC

Complex Analysis. PH 503 Course TM. Charudatt Kadolkar Indian Institute of Technology, Guwahati

New method for solving nonlinear sum of ratios problem based on simplicial bisection

New BaBar Results on Rare Leptonic B Decays

Symbols and dingbats. A 41 Α a 61 α À K cb ➋ à esc. Á g e7 á esc. Â e e5 â. Ã L cc ➌ ã esc ~ Ä esc : ä esc : Å esc * å esc *

Optimization of a parallel 3d-FFT with non-blocking collective operations

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents

F O R SOCI AL WORK RESE ARCH

PART IV LIVESTOCK, POULTRY AND FISH PRODUCTION

UNIQUE FJORDS AND THE ROYAL CAPITALS UNIQUE FJORDS & THE NORTH CAPE & UNIQUE NORTHERN CAPITALS

1. Allstate Group Critical Illness Claim Form: When filing Critical Illness claim, please be sure to include the following:

4.3 Laplace Transform in Linear System Analysis

A Functional Quantum Programming Language

Vector analysis. 1 Scalars and vectors. Fields. Coordinate systems 1. 2 The operator The gradient, divergence, curl, and Laplacian...

Lecture 16: Modern Classification (I) - Separating Hyperplanes

Connection equations with stream variables are generated in a model when using the # $ % () operator or the & ' %

ALTER TABLE Employee ADD ( Mname VARCHAR2(20), Birthday DATE );

$%! & (, -3 / 0 4, 5 6/ 6 +7, 6 8 9/ 5 :/ 5 A BDC EF G H I EJ KL N G H I. ] ^ _ ` _ ^ a b=c o e f p a q i h f i a j k e i l _ ^ m=c n ^

Approximate Planning in Large POMDPs via Reusable Trajectories

ISO/IEC JTC1/SC2/WG2 N2

B œ c " " ã B œ c 8 8. such that substituting these values for the B 3 's will make all the equations true

Queues, Stack Modules, and Abstract Data Types. CS2023 Winter 2004

MARKET AREA UPDATE Report as of: 1Q 2Q 3Q 4Q

Research of Application the Virtual Reality Technology in Chemistry Education

MAT 167: Advanced Linear Algebra

The EM Algorithm applied to determining new limit points of Mahler measures

Ed S MArket. NarROW } ] T O P [ { U S E R S G U I D E. urrrrrrrrrrrv

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch

Explicit overall risk minimization transductive bound

Stat 155 Game theory, Yuval Peres Fall Lectures 4,5,6

. ffflffluary 7, 1855.

1) Standards and Methodology for the Evaluation of Financial Statements 2) Letter of Credit Instructions

SECTION A - EMPLOYEE DETAILS Termination date Employee number Contract/ Department SECTION C - STANDARD DOCUMENTS REQUIRED FOR ALL TERMINATIONS

LEAD IN DRINKING WATER REPORT

THE ANALYSIS OF THE CONVECTIVE-CONDUCTIVE HEAT TRANSFER IN THE BUILDING CONSTRUCTIONS. Zbynek Svoboda

A Fuzzy Approach to Co-Ordinated and Non Co-Ordinated Two Stage Supply Chain

Expectation-Maximization for Estimating Parameters for a Mixture of Poissons

Human genome contains ~30K genes Not all genes are expressed at same time

TELEMATICS LINK LEADS

Harlean. User s Guide

BP neural network-based sports performance prediction model applied research

S T A T E B U D G E T : T A X A M E N D M E N T S

Adorn. sans condensed Smooth. v22622x

: œ Ö: =? À =ß> real numbers. œ the previous plane with each point translated by : Ðfor example,! is translated to :)

Statistical Learning Theory: A Primer

Plasma diagnostics and abundance determinations for planetary nebulae current status. Xiaowei Liu Department of Astronomy, Peking University

Continuing Education and Workforce Training Course Schedule Winter / Spring 2010

1. CALL TO ORDER 2. ROLL CALL 3. PLEDGE OF ALLEGIANCE 4. ORAL COMMUNICATIONS (NON-ACTION ITEM) 5. APPROVAL OF AGENDA 6. EWA SEJPA INTEGRATION PROPOSAL

Efficiently Generating Random Bits from Finite State Markov Chains

The incident energy per unit area on the electron is given by the Poynting vector, '

Asynchronous Control for Coupled Markov Decision Systems

Some emission processes are intrinsically polarised e.g. synchrotron radiation.

On the Goal Value of a Boolean Function

Inductive Bias: How to generalize on novel data. CS Inductive Bias 1

A Novel Learning Method for Elman Neural Network Using Local Search

Gradient Ascent Chris Piech CS109, Stanford University

This document has been prepared by Sunder Kidambi with the blessings of

Towards a Universal Theory of Artificial Intelligence based on Algorithmic Probability and Sequential Decisions

Algorithms to solve massively under-defined systems of multivariate quadratic equations

Math 131 Exam 4 (Final Exam) F04M

International Journal "Information Technologies & Knowledge" Vol.5, Number 1,

Obtain from theory/experiment a value for the absorbtion cross-section. Calculate the number of atoms/ions/molecules giving rise to the line.

Transcription:

A Theory of Universa AI Literature Marcus Hutter Kircherr, Li, and Vitanyi Presenter Prashant J. Doshi CS594: Optima Decision Maing A Theory of Universa Artificia Inteigence p.1/18

Roadmap Caim Bacground Key Concepts The AI Mode in Functiona Form The AI Mode in Recursive Form The Universa AI Mode Exampe Constants and Limits Appication Seuentia Decision Theory Concusions A Theory of Universa Artificia Inteigence p.2/18

Caim Deveopment of a universay optima AI mode Universa Optima faster parameteress, unbiased, mode-free No other program can earn or sove the tas A Theory of Universa Artificia Inteigence p.3/18

Bacground Decision Theory Soves the probem of rationa agent behaviour in uncertain ords given an environment Knon prior probabiity distribution over the environment Soomonoff s Universa Induction Soves the probem of seuence prediction for an unnon prior distribution Predict the continuation of a given binary seuence A Theory of Universa Artificia Inteigence p.4/18

. # < (. ; 98 ( Bacground Soomonoff s resut: is finite and expected Eucidean distance beteen (-, ' )+* ( $&% "! Convergence 43 )/1 2 is the Universa Probabiity Distribution ;, 9: 8 576 is the Komogorov Compexity of $>= A Theory of Universa Artificia Inteigence p.5/18

O B D C B A?@ I N P O K L B A C B D Q Q@ S Key Concepts The Cybernetic or Agent Mode KML? J I EGFH is partia function / chronoogica Turing machine J I FH I ERFH is partia function / chronoogica Turing machine A Theory of Universa Artificia Inteigence p.6/18

e c b d c T X b e c b d c e d c c b X b b [ d c e d c e b b Key Concepts History : [ X a` [ T a` X]_^ ^ ^ T] X\ T\ X[ T[ YZ X U TVU Probabiity of input given the history X [f TG[f e Z X U T U X U T U ^ e [ ^^ T ` [ a` X U [ a` T U X ` T X U T U e Z X [f Tg[f T[ X[ T\ \ X U \ T U X\ ^ ^ ^ A Theory of Universa Artificia Inteigence p.7/18

i h u v o z n z z y # The AI Mode in Functiona Form Tas: Derive hich maximizes the tota credit over a predefined ifetime(t) For a non deterministic environment s ts r _npo m Optima poicy } n o 6 n { _npo xu v n For a prior distribution over environments be the set of a environments that ~4 Let produces the history ~ ~ ~ ~ ~ # ~ ~ ~ ~ ~ npo m ˆ r ~ ~ _n ƒ m A Theory of Universa Artificia Inteigence p.8/18

Ž v n ; yy v v n n y i h The AI Mode in Functiona Form AI maximizes expected future reard over the next (horizon) cyces Œg Š Z Optima poicy ~ ~ _n ƒ m xu ˆ 9 v n ~ v n ~ ~ ~ ~ ~ n ƒ xu ~ Best output are finite Š and, is computabe if A Theory of Universa Artificia Inteigence p.9/18

Œ Š Ž Š u v v u v u u ž The AI Mode in Recursive Form to Tas: Derive expected reard sum in cyces using expected reard sum in cyces to Œ ~ ~ ~ ~ v ƒ $ a ˆ ~ ~ ƒ m Optima expected reard ~ ~ ƒ m ~ ~ ƒ m ~ ~ ~ ~ ƒ $ ˆ ~ ~ ƒ output œ Vš ~ ~ ƒ m xu ~ Expectimax seuence xu ~ ž ~ ~ A Theory of Universa Artificia Inteigence p.1/18

Ÿ Ÿ c f The AI Mode in Recursive Form Functiona AI Recursive AI e cm e X [f TR[f G ªp«G A Theory of Universa Artificia Inteigence p.11/18

; yy ; ; yy ; The Universa AI Mode Tas: Repace the true but unnon prior probabiity ith mode In the Functiona AI _npo m xu ~ ˆ ˆ 9 9 _npo m ;9 8 ( xu ~ ˆ ˆ 9 9 ˆ A Theory of Universa Artificia Inteigence p.12/18

u u ž u u ž The Universa AI Mode Tas: Repace the true but unnon prior probabiity ith mode In the Recursive AI ˆ xu ~ ž ~ ~ ˆ xu ~ ž ~ ~ A Theory of Universa Artificia Inteigence p.13/18

s. # Ž Z The Universa AI Mode Tas: Sho the convergence of AI to AI Utiize the Soomonoff s resut generaized from arbitrary aphabet are pure spectators = 2 to an (, ' )+* ( $t%! 3 )/ 2 Outputs of the AI of the AI mode converge to the outputs mode ateast for the bounded horizon T T Œ Š A Theory of Universa Artificia Inteigence p.14/18

d d Œ µ ³ X T ² ± ¾» º] º ] \ ¼ \ º[ ¹ ¹ ¹ ¹ Exampe Constants and Limits 1 1 ½ (a) The agents interface is ide (b) The interface can be sufficienty expored (c) The death is far aay (d) Most input/output combinations do not occur These imits are never used in proofs but eare ony interested in theorems hich do not degenerate under the above imits A Theory of Universa Artificia Inteigence p.15/18

vã Â vã vã Â n o À À Á Á Í Ù Û Ú Ê Ç Î Ð Ì Ë Í Ù Û É å â Ç Í Ù Û É å â Ù Û Ç Ü Ç Ç É Í Í Ò É Ò å â Ç ÒÇ ç Ç Ç å Appication Seuentia Decision Theory (MDP) Beman euation for optima poicy Ä À sá ˆ / Å / Ä À sá xu / v Appy the AI mode Ò É+Ù Ø Ç ÔÖÕ Ó ÍÏÎ ÉÊ ÆÈÇ Ò>Û Ò Î Ç Ñ Ò>Û á Íæ +ß É Ù Þ á +ß äã Í áéü à ÍÏÎ +ß É Ù Þ ÍÉÜ Ý Ò>Û ÒÛ Î á â è ã Íæ É Ù Þ Ù á Í áéç à ÍÏÎ É+Ù Þ ÍÉç Ý ÙG Î Ù A Theory of Universa Artificia Inteigence p.16/18

Appication Observations e use the compete history as the environment state The AI mode does not assume Marovian property stationary environment accessibe environment Other appications Game Paying Function Minimization Supervised Learning Bod Caim: AI is the most genera mode A Theory of Universa Artificia Inteigence p.17/18

Concusion A parameteress mode of AI based on Decision Theory and Agorithmic Probabiity is presented Maes minima assumptions about the environment Is the AI Future or mode computabe? Derive vaue and reard bounds for AI Appy AI mode to more probem casses mode A Theory of Universa Artificia Inteigence p.18/18