
CS 380: ARTIFICIAL INTELLIGENCE

RATIONAL AGENTS

Santiago Ontañón (so367@drexel.edu)

Outline
• What is an Agent?
• Rationality
• Agents and Environments
• Agent Types

(these slides are adapted from Russell & Norvig’s official slides)

What is an Agent?
• Agent: an autonomous entity that observes and acts upon an environment, and directs its activity towards achieving goals.
• Example agents:
  • Humans
  • Robots
  • Software agents
  • ATMs
  • etc.

What is an Agent?
• Theoretically: an agent is a function from percept sequences to actions,

  f : P* → A

  (f maps sequences of percepts to actions)

• In practice: the agent runs on a physical system (e.g., a computer) to produce f.
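The definition f : P* → A can be sketched directly in code. Below is a minimal illustration (the names and the two-entry table are hypothetical, not from the slides): an agent implemented as a lookup table over its percept history.

```python
# An agent as a function from percept sequences to actions (f : P* -> A),
# implemented as a lookup table over the percept history so far.
# The table contents below are purely illustrative.

def make_table_agent(table, default_action):
    """Return an agent function that maps the full percept sequence
    (as a tuple) to an action via a lookup table."""
    percepts = []  # the agent's percept history so far

    def agent(percept):
        percepts.append(percept)
        return table.get(tuple(percepts), default_action)

    return agent

# Hypothetical table for the vacuum world from the next slide:
table = {
    (("A", "Dirty"),): "Suck",
    (("A", "Dirty"), ("A", "Clean")): "Right",
}
agent = make_table_agent(table, "Wait")
print(agent(("A", "Dirty")))   # Suck
print(agent(("A", "Clean")))   # Right
```

The table grows with the length of the percept sequence, which is why the slides ask whether (and how) f can be implemented as a practical program.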

Example: Vacuum Cleaner

• Percepts: location sensor and dirt sensor; e.g., [A, Dirty], [A, Clean]

• Actions: Left, Right, Suck, Wait (NoOp in Russell & Norvig’s version)

• Environment: the vacuum-cleaner world, two rooms, A and B

Two questions:

• What is the function?

• Can it be implemented as a computer program? (How?)
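One possible answer to the second question, as a sketch (this is a simple reflex program, not presented as the official solution from the slides): act on the current percept [location, status] alone.

```python
# A simple reflex program for the vacuum world: decides using only the
# current percept (location, status), ignoring the percept history.

def reflex_vacuum_agent(percept):
    """percept is (location, status); returns one of the slide's actions."""
    location, status = percept
    if status == "Dirty":
        return "Suck"
    if location == "A":
        return "Right"
    return "Left"

print(reflex_vacuum_agent(("A", "Dirty")))  # Suck
print(reflex_vacuum_agent(("A", "Clean")))  # Right
```

Note that this program never returns Wait: without memory, it cannot know that both rooms are already clean.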

Outline
• What is an Agent?
• Rationality
• Agents and Environments
• Agent Types

Rationality
• As we discussed last class, “intelligence” is an ill-defined concept.
• Let us instead assume a performance measure, which is given to (and usually external to) the agent. For example:
  • $1 per room cleaned
  • $10 per room cleaned, minus $1 per movement
  • etc.
• Rational agent: chooses actions that maximize the expected value of the performance measure, given the current percept sequence.

Rationality
• Imagine two agents A and B whose performance measure is how many dollars they make at the casino.
  • A and B each play one hand of blackjack.
  • A plays the move that probabilistically has the highest chance of winning.
  • B plays a different, riskier move.
  • B ends up winning.
• Which agent is rational? A is the rational agent.
  • Rationality does not imply clairvoyance.
  • Rationality does not imply omniscience.
  • Thus, rational does not mean successful.
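The blackjack point can be made concrete with expected values. The payoffs and probabilities below are made up for illustration (they are not real blackjack odds): A’s safe move has the higher expected value even though B’s risky move can pay off on any single hand.

```python
# "Maximize expected value" with hypothetical numbers: the rational
# choice is the one with the higher expectation, not the one that
# happened to win a particular hand.

def expected_value(outcomes):
    """outcomes: list of (probability, payoff) pairs."""
    return sum(p * payoff for p, payoff in outcomes)

safe_move  = [(0.55, 10), (0.45, -10)]   # A: win $10 with p = 0.55
risky_move = [(0.30, 25), (0.70, -10)]   # B: win $25 with p = 0.30

print(round(expected_value(safe_move), 2))   # 1.0
print(round(expected_value(risky_move), 2))  # 0.5
```

With these (hypothetical) numbers, A’s move is the rational one; B winning a single hand does not change that.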

Outline
• What is an Agent?
• Rationality
• Agents and Environments
• Agent Types

How do we Design a Rational Agent?
• To design a rational agent, we first need to define its environment, performance measure, etc.
• PEAS:
  • Performance
  • Environment
  • Actions
  • Sensors (percepts)
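A PEAS description is just structured data, so it can be written down as one. The sketch below (the `PEAS` class name is illustrative) records the vacuum-cleaner example from earlier as a PEAS description.

```python
# A lightweight container for PEAS descriptions, filled in with the
# vacuum-cleaner example from the earlier slide.

from dataclasses import dataclass

@dataclass
class PEAS:
    performance: list
    environment: list
    actions: list
    sensors: list

vacuum = PEAS(
    performance=["rooms cleaned"],
    environment=["rooms A and B", "dirt"],
    actions=["Left", "Right", "Suck", "Wait"],
    sensors=["location", "dirt"],
)
print(vacuum.actions)  # ['Left', 'Right', 'Suck', 'Wait']
```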

Example (informal): Automated Taxi
• Performance: profit, safety, legality, …
• Environment: streets, pedestrians, cars, weather, traffic lights/signs, …
• Actions: steering, throttle, brake, horn, display, …
• Sensors: GPS, video, sonar, keyboard, accelerometers, microphones, …

Example: Chess Agent
• Performance: once a terminal state is reached (win, draw, loss): −1 for losing, 0 for a draw, 1 for winning.
• Environment: chess board, opponent, chess rules.
• Actions: pairs of coordinates, (x1, y1) → (x2, y2), specifying the source piece and the target position (for castling, the king’s move is specified).
• Sensors: the board, perceived as an 8×8 matrix. Each element of the matrix can take one of the following values: E, BP, WP, BB, WB, BN, WN, BR, WR, BQ, WQ, BK, WK.
• The environment has many more features (piece size and material, the opponent’s facial expression, sunny vs. rainy weather, etc.). For a rational agent we only need to build sensors for the information that is necessary to maximize the performance measure.
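The percept and action representations described above can be sketched in a few lines (variable and function names are illustrative, and no legality checking is attempted): the board as an 8×8 matrix of piece codes, and an action as a pair of coordinates (x1, y1) → (x2, y2).

```python
# The chess agent's percept: an 8x8 matrix of piece codes ("E" = empty),
# laid out as on the slide's figure. An action moves the piece at the
# source square to the target square (no rule checking in this sketch).

board = [
    ["WR", "WN", "WB", "WK", "WQ", "WB", "WN", "WR"],
    ["WP"] * 8,
    ["E"] * 8,
    ["E"] * 8,
    ["E"] * 8,
    ["E"] * 8,
    ["BP"] * 8,
    ["BR", "BN", "BB", "BK", "BQ", "BB", "BN", "BR"],
]

def apply_move(board, move):
    """move = ((x1, y1), (x2, y2)): move the piece at the source
    square to the target square, leaving the source empty."""
    (x1, y1), (x2, y2) = move
    board[y2][x2] = board[y1][x1]
    board[y1][x1] = "E"
    return board

apply_move(board, ((4, 1), (4, 3)))  # advance a white pawn two squares
print(board[3][4])  # WP
```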

Example: Chess Agent

Environment → Percept: the initial board, as the 8×8 matrix

WR WN WB WK WQ WB WN WR
WP WP WP WP WP WP WP WP
E  E  E  E  E  E  E  E
E  E  E  E  E  E  E  E
E  E  E  E  E  E  E  E
E  E  E  E  E  E  E  E
BP BP BP BP BP BP BP BP
BR BN BB BK BQ BB BN BR

Properties of Environments
• Observable: the agent can sense the entire state of the environment.
• Deterministic: the outcome of an action is fully determined by the current state and the action executed.
• Sequential: action outcomes can be affected by prior actions (the alternative is episodic).
• Static: the state of the world doesn’t change during deliberation (between actions).
• Discrete: actions can only be performed in a small, fixed number of ways.
• Single-Agent: there is only one agent of change.

Environment Types

            Observable  Deterministic  Sequential  Static  Discrete  Single-Agent
Solitaire   No          Yes            Yes         Yes     Yes       Yes
Chess       Yes         Yes            Yes         Yes     Yes       No
Backgammon  Yes         No             Yes         Yes     Yes       No
Taxi        No          No             Yes         No      No        No
StarCraft   No          Almost         Yes         No      Yes       No
Real Life   No          No (?)         Yes         No      No (?)    No

Agent Types
• We will cover 4 basic types:
  • Reactive agents (reflex agents)
  • State-based agents
  • Goal-based agents
  • Utility-based agents
• They can all incorporate learning.
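The difference between the first two types can be sketched in code (a sketch with illustrative names, using the vacuum world from earlier): a reflex agent conditions only on the current percept, while a state-based agent maintains internal state across percepts.

```python
# Reflex vs. state-based agents in the vacuum world. The reflex agent
# uses condition-action rules on the current percept only; the
# state-based agent remembers what it has seen and can decide to Wait
# once it knows both rooms are clean.

def reflex_agent(percept):
    location, status = percept
    return "Suck" if status == "Dirty" else ("Right" if location == "A" else "Left")

class StateBasedAgent:
    def __init__(self):
        self.state = {}                  # internal model: room -> last known status

    def act(self, percept):
        location, status = percept
        self.state[location] = status    # update the model from the percept
        if status == "Dirty":
            return "Suck"
        # use remembered state: stop once both rooms are known clean
        if self.state.get("A") == "Clean" and self.state.get("B") == "Clean":
            return "Wait"
        return "Right" if location == "A" else "Left"
```

A goal-based agent would additionally choose actions by their predicted effect on an explicit goal, and a utility-based agent by their expected utility.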

Reactive Agents: choose actions based only on the current percept, via condition-action rules.

State-based Agents: maintain internal state to track aspects of the world that are not evident in the current percept.

Goal-based Agents: use explicit goal information to choose actions that lead to desirable states.

Utility-based Agents: use a utility function to choose among ways of achieving goals, maximizing expected utility.

Learning Agents: improve their own performance over time from experience.

• What we have covered so far:
  • What is AI?
  • What is a rational agent?
• In the remaining weeks, we will cover:
  • How can rational agents solve problems?
  • How can rational agents represent knowledge?
  • How can rational agents learn?
