DK3079106T3 - SELECTING REINFORCEMENT LEARNING ACTIONS USING GOALS AND OBSERVATIONS

Info

Publication number: DK3079106T3
Authority: DK; Denmark
Prior art keywords: observations; objectives; reinforcement learning; learning actions; selecting
Prior art date: 2015-04-06

Application number

DK16164072.7T

Inventor

Tom Schaul

Daniel George Horgan

Karol Gregor

David Silver

Original Assignee

Deepmind Tech Ltd

Priority date

2015-04-06

Filing date

2016-04-06

Publication date

2022-08-01

2016-04-06 Application filed by Deepmind Tech Ltd

2022-08-01 Application granted

2022-08-01 Publication of DK3079106T3