13. Reinforcement Learning
[Read Chapter 13]
[Exercises 13.1, 13.2, 13.4]
Control learning
Control policies that choose optimal actions
Q learning
Convergence
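As a preview of the Q-learning material covered later in the chapter, the deterministic update Q(s,a) <- r + gamma * max_a' Q(s',a') can be sketched on a toy problem. The corridor environment, goal reward of 100, and hyperparameters below are illustrative choices, not taken from the slides:

```python
import random

# Toy deterministic corridor: states 0..4, goal at state 4.
# Environment and constants here are illustrative assumptions.
GOAL, N_STATES, GAMMA = 4, 5, 0.9
ACTIONS = (-1, +1)  # move left / move right

def step(s, a):
    """Deterministic transition; reward 100 on reaching the goal, else 0."""
    s2 = min(max(s + a, 0), N_STATES - 1)
    return s2, (100 if s2 == GOAL else 0)

Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

random.seed(0)
for _ in range(200):                      # training episodes
    s = random.randrange(N_STATES)
    while s != GOAL:
        a = random.choice(ACTIONS)        # random exploration
        s2, r = step(s, a)
        # Deterministic Q-learning update: Q(s,a) <- r + gamma * max_a' Q(s',a')
        Q[(s, a)] = r + GAMMA * max(Q[(s2, a2)] for a2 in ACTIONS)
        s = s2

# Greedy policy read off the learned Q values: move right toward the goal.
policy = {s: max(ACTIONS, key=lambda a: Q[(s, a)]) for s in range(N_STATES)}
```

After training, the Q values decay geometrically with distance from the goal (100, 90, 81, ...), which is the gamma-discounting behavior the convergence discussion relies on.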
Lecture slides for textbook Machine Learning, T. Mitchell, McGraw Hill, 1997
Control Learning
Consider learning to choose actions, e.g.,
Robot learning to dock on battery charger
Learning to choose actions to optimize factory output
Learning to play Backgammon
Note several problem characteristics:
Delayed reward
Opportunity for active exploration
Possibility that state only partially observable
Possible need to learn multiple tasks with same sensors/effectors
One Example: TD-Gammon
[Tesauro, 1995]
Learn to play Backgammon
Immediate reward
+100 if win
-100 if lose
0 for all other states
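The sparse reward scheme above can be written directly as a reward function. This is a minimal sketch; the string outcome labels are a stand-in for TD-Gammon's actual board-state encoding, which is not described on this slide:

```python
def reward(outcome):
    """Sparse reward used in the TD-Gammon example:
    +100 on a win, -100 on a loss, 0 everywhere else.
    `outcome` labels are hypothetical; the real system inspects the board state."""
    if outcome == "win":
        return 100
    if outcome == "lose":
        return -100
    return 0
```

Because the reward is zero for all intermediate states, credit for a win must propagate backward through many moves, which is the delayed-reward characteristic noted on the previous slide.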
Trained by playing 1.5 million games against itself
Now approximately equal to best human player