CN117910543A
(zh)
|
2015-11-12 |
2024-04-19 |
渊慧科技有限公司 |
使用优先化经验存储器训练神经网络
|
KR102156303B1
(ko)
|
2015-11-12 |
2020-09-15 |
딥마인드 테크놀로지스 리미티드 |
비동기 심층 강화 학습
|
KR102424893B1
(ko)
*
|
2016-11-04 |
2022-07-25 |
딥마인드 테크놀로지스 리미티드 |
보조 작업들을 통한 강화 학습
|
CN117521725A
(zh)
*
|
2016-11-04 |
2024-02-06 |
渊慧科技有限公司 |
加强学习系统
|
EP3593292B8
(en)
*
|
2017-06-09 |
2024-08-14 |
DeepMind Technologies Limited |
Training action selection neural networks
|
KR102100350B1
(ko)
*
|
2017-10-16 |
2020-04-14 |
농업회사법인 상상텃밭 주식회사 |
온실 시스템의 제어 모델 생성 방법
|
US11604941B1
(en)
*
|
2017-10-27 |
2023-03-14 |
Deepmind Technologies Limited |
Training action-selection neural networks from demonstrations using multiple losses
|
US12020167B2
(en)
*
|
2018-05-17 |
2024-06-25 |
Magic Leap, Inc. |
Gradient adversarial training of neural networks
|
EP3776370A1
(en)
*
|
2018-05-18 |
2021-02-17 |
Deepmind Technologies Limited |
Graph neural network systems for behavior prediction and reinforcement learning in multple agent environments
|
WO2019219965A1
(en)
*
|
2018-05-18 |
2019-11-21 |
Deepmind Technologies Limited |
Meta-gradient updates for training return functions for reinforcement learning systems
|
TWI745693B
(zh)
*
|
2018-05-18 |
2021-11-11 |
宏達國際電子股份有限公司 |
控制方法以及醫學系統
|
EP3576023A1
(en)
*
|
2018-05-25 |
2019-12-04 |
Royal Bank Of Canada |
Trade platform with reinforcement learning network and matching engine
|
EP3807823A1
(en)
|
2018-06-12 |
2021-04-21 |
Intergraph Corporation |
Artificial intelligence applications for computer-aided dispatch systems
|
CN109239661A
(zh)
*
|
2018-09-18 |
2019-01-18 |
广西大学 |
一种基于深度q网络的rfid室内定位系统及算法
|
US12026610B2
(en)
*
|
2018-09-25 |
2024-07-02 |
International Business Machines Corporation |
Reinforcement learning by sharing individual data within dynamic groups
|
WO2020065024A1
(en)
*
|
2018-09-27 |
2020-04-02 |
Deepmind Technologies Limited |
Stacked convolutional long short-term memory for model-free reinforcement learning
|
US11663441B2
(en)
|
2018-09-27 |
2023-05-30 |
Deepmind Technologies Limited |
Action selection neural network training using imitation learning in latent space
|
CA3060914A1
(en)
*
|
2018-11-05 |
2020-05-05 |
Royal Bank Of Canada |
Opponent modeling with asynchronous methods in deep rl
|
CA3060900A1
(en)
|
2018-11-05 |
2020-05-05 |
Royal Bank Of Canada |
System and method for deep reinforcement learning
|
US20220059083A1
(en)
*
|
2018-12-10 |
2022-02-24 |
Interactive-Ai, Llc |
Neural modulation codes for multilingual and style dependent speech and language processing
|
US11313950B2
(en)
|
2019-01-15 |
2022-04-26 |
Image Sensing Systems, Inc. |
Machine learning based highway radar vehicle classification across multiple lanes and speeds
|
US11074480B2
(en)
*
|
2019-01-31 |
2021-07-27 |
StradVision, Inc. |
Learning method and learning device for supporting reinforcement learning by using human driving data as training data to thereby perform personalized path planning
|
DE102019105280A1
(de)
*
|
2019-03-01 |
2020-09-03 |
Friedrich-Alexander-Universität Erlangen-Nürnberg |
Autonomes selbstlernendes System
|
KR102267316B1
(ko)
*
|
2019-03-05 |
2021-06-21 |
네이버랩스 주식회사 |
심층 강화 학습에 기반한 자율주행 에이전트의 학습 방법 및 시스템
|
KR102596158B1
(ko)
|
2019-03-20 |
2023-11-01 |
소니그룹주식회사 |
이중 액터 크리틱 알고리즘을 통한 강화 학습
|
US11308362B2
(en)
*
|
2019-03-26 |
2022-04-19 |
Shenzhen Keya Medical Technology Corporation |
Method and system for generating a centerline for an object, and computer readable medium
|
US11587552B2
(en)
|
2019-04-30 |
2023-02-21 |
Sutherland Global Services Inc. |
Real time key conversational metrics prediction and notability
|
WO2021007019A1
(en)
*
|
2019-07-08 |
2021-01-14 |
Google Llc |
Optimizing a cellular network using machine learning
|
KR102082113B1
(ko)
*
|
2019-07-23 |
2020-02-27 |
주식회사 애자일소다 |
데이터 기반 강화 학습 장치 및 방법
|
US11676064B2
(en)
*
|
2019-08-16 |
2023-06-13 |
Mitsubishi Electric Research Laboratories, Inc. |
Constraint adaptor for reinforcement learning control
|
US20220331962A1
(en)
*
|
2019-09-15 |
2022-10-20 |
Google Llc |
Determining environment-conditioned action sequences for robotic tasks
|
KR102155055B1
(ko)
*
|
2019-10-28 |
2020-09-11 |
라온피플 주식회사 |
강화학습 기반 신호 제어 장치 및 신호 제어 방법
|
CN110852438B
(zh)
*
|
2019-11-11 |
2023-08-04 |
北京百度网讯科技有限公司 |
模型生成方法和装置
|
US20210158196A1
(en)
*
|
2019-11-25 |
2021-05-27 |
Deepmind Technologies Limited |
Non-stationary delayed bandits with intermediate signals
|
KR102173579B1
(ko)
*
|
2019-12-02 |
2020-11-03 |
한국기술교육대학교 산학협력단 |
연합강화학습을 통한 다중 디바이스 제어 시스템 및 그 방법
|
US11579575B2
(en)
*
|
2019-12-03 |
2023-02-14 |
Baidu Usa Llc |
Inverse reinforcement learning with model predictive control
|
CN111026272B
(zh)
*
|
2019-12-09 |
2023-10-31 |
网易(杭州)网络有限公司 |
虚拟对象行为策略的训练方法及装置、电子设备、存储介质
|
CN111130698B
(zh)
*
|
2019-12-26 |
2022-05-31 |
南京中感微电子有限公司 |
无线通信接收窗口预测方法、装置及无线通信设备
|
WO2021156511A1
(en)
*
|
2020-02-07 |
2021-08-12 |
Deepmind Technologies Limited |
Recurrent unit for generating or processing a sequence of images
|
KR102440817B1
(ko)
*
|
2020-02-19 |
2022-09-06 |
사회복지법인 삼성생명공익재단 |
기록된 데이터에서 인과성을 식별하는 강화학습 방법, 장치 및 프로그램
|
KR102100686B1
(ko)
*
|
2020-02-19 |
2020-04-14 |
주식회사 애자일소다 |
손실률을 낮추기 위한 데이터 기반 강화 학습 장치 및 방법
|
KR102100688B1
(ko)
*
|
2020-02-19 |
2020-04-14 |
주식회사 애자일소다 |
한도 소진률을 높이기 위한 데이터 기반 강화 학습 장치 및 방법
|
CN111416774B
(zh)
*
|
2020-03-17 |
2023-03-21 |
深圳市赛为智能股份有限公司 |
网络拥塞控制方法、装置、计算机设备及存储介质
|
CN111461325B
(zh)
*
|
2020-03-30 |
2023-06-20 |
华南理工大学 |
一种用于稀疏奖励环境问题的多目标分层强化学习算法
|
CN112533681B
(zh)
*
|
2020-04-02 |
2024-07-12 |
支付宝(杭州)信息技术有限公司 |
确定执行设备的动作选择方针
|
KR102195433B1
(ko)
*
|
2020-04-07 |
2020-12-28 |
주식회사 애자일소다 |
학습의 목표와 보상을 연계한 데이터 기반 강화 학습 장치 및 방법
|
KR102272501B1
(ko)
*
|
2020-04-24 |
2021-07-01 |
연세대학교 산학협력단 |
분산 강화 학습 장치 및 방법
|
CN111496794B
(zh)
*
|
2020-04-29 |
2022-04-01 |
华中科技大学 |
一种基于仿真工业机器人的运动学自抓取学习方法和系统
|
CN111666149B
(zh)
*
|
2020-05-06 |
2023-04-07 |
西北工业大学 |
基于深度强化学习的超密边缘计算网络移动性管理方法
|
US20230144995A1
(en)
*
|
2020-06-05 |
2023-05-11 |
Deepmind Technologies Limited |
Learning options for action selection with meta-gradients in multi-task reinforcement learning
|
US11528347B2
(en)
*
|
2020-06-25 |
2022-12-13 |
Nokia Solutions And Networks Oy |
Inter-packet communication of machine learning information
|
CN111882030B
(zh)
*
|
2020-06-29 |
2023-12-05 |
武汉钢铁有限公司 |
一种基于深度强化学习的加锭策略方法
|
CN111818570B
(zh)
*
|
2020-07-25 |
2022-04-01 |
清华大学 |
一种面向真实网络环境的智能拥塞控制方法及系统
|
JP7523665B2
(ja)
|
2020-07-28 |
2024-07-26 |
ディープマインド テクノロジーズ リミテッド |
観測埋込みを制御する補助タスクを使用する、アクション選択ニューラルネットワークのトレーニング
|
DE102020209685B4
(de)
|
2020-07-31 |
2023-07-06 |
Robert Bosch Gesellschaft mit beschränkter Haftung |
Verfahren zum steuern einer robotervorrichtung und robotervorrichtungssteuerung
|
CN112002321B
(zh)
*
|
2020-08-11 |
2023-09-19 |
海信电子科技(武汉)有限公司 |
显示设备、服务器及语音交互方法
|
CN113422751B
(zh)
*
|
2020-08-27 |
2023-12-05 |
阿里巴巴集团控股有限公司 |
基于在线强化学习的流媒体处理方法、装置及电子设备
|
KR102345267B1
(ko)
*
|
2020-10-12 |
2021-12-31 |
서울대학교산학협력단 |
목표 지향적 강화학습 방법 및 이를 수행하기 위한 장치
|
CN112347104B
(zh)
*
|
2020-11-06 |
2023-09-29 |
中国人民大学 |
一种基于深度强化学习的列存储布局优化方法
|
CN112541835A
(zh)
*
|
2020-12-08 |
2021-03-23 |
香港中文大学(深圳) |
一种基于混合模型的风电场控制学习方法
|
CN112949988B
(zh)
*
|
2021-02-01 |
2024-01-05 |
浙江大学 |
一种基于强化学习的服务流程构造方法
|
KR102599363B1
(ko)
*
|
2021-02-04 |
2023-11-09 |
박근식 |
사용자기반의 ai에너지 절감 및 수요예측시스템
|
GB2604640B
(en)
*
|
2021-03-12 |
2024-06-19 |
Samsung Electronics Co Ltd |
Performing an image processing task instructed by an image processing application
|
US20220303191A1
(en)
*
|
2021-03-18 |
2022-09-22 |
Nokia Solutions And Networks Oy |
Network management
|
WO2022199792A1
(en)
*
|
2021-03-22 |
2022-09-29 |
Telefonaktiebolaget Lm Ericsson (Publ) |
Reward estimation for a target policy
|
CN113242469B
(zh)
*
|
2021-04-21 |
2022-07-12 |
南京大学 |
一种自适应视频传输配置方法和系统
|
CN113156958B
(zh)
*
|
2021-04-27 |
2024-05-31 |
东莞理工学院 |
基于卷积长短期记忆网络的自主移动机器人自监督学习及导航方法
|
EP4102405A1
(en)
*
|
2021-06-10 |
2022-12-14 |
Naver Corporation |
Demonstration-conditioned reinforcement learning for few-shot imitation
|
CN113420806B
(zh)
*
|
2021-06-21 |
2023-02-03 |
西安电子科技大学 |
一种人脸检测质量评分方法及系统
|
WO2023023848A1
(en)
*
|
2021-08-24 |
2023-03-02 |
Royal Bank Of Canada |
System and method for machine learning architecture with multiple policy heads
|
CN113810954B
(zh)
*
|
2021-09-08 |
2023-12-29 |
国网宁夏电力有限公司信息通信公司 |
基于流量预测与深度强化学习的虚拟资源动态扩缩容方法
|
CN113849313B
(zh)
*
|
2021-09-30 |
2024-09-13 |
郑州大学 |
一种节能的云-边弹性光网络中计算任务链部署方法
|
CN116330310B
(zh)
*
|
2023-02-14 |
2023-11-07 |
河南泽远网络科技有限公司 |
一种低延时机器人交互方法
|
CN116453706B
(zh)
*
|
2023-06-14 |
2023-09-08 |
之江实验室 |
一种基于强化学习的血液透析方案制定方法及系统
|