SG11202006017SA - System reinforcement learning method and apparatus, electronic device, and computer storage medium - Google Patents
System reinforcement learning method and apparatus, electronic device, and computer storage mediumInfo
- Publication number
- SG11202006017SA SG11202006017SA SG11202006017SA SG11202006017SA SG11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA
- Authority
- SG
- Singapore
- Prior art keywords
- electronic device
- storage medium
- computer storage
- learning method
- reinforcement learning
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2155—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/088—Non-supervised learning, e.g. competitive learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810428099.3A CN108776834B (zh) | 2018-05-07 | 2018-05-07 | 系统增强学习方法和装置、电子设备、计算机存储介质 |
PCT/CN2019/078520 WO2019214344A1 (zh) | 2018-05-07 | 2019-03-18 | 系统增强学习方法和装置、电子设备、计算机存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202006017SA true SG11202006017SA (en) | 2020-07-29 |
Family
ID=64026991
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202006017SA SG11202006017SA (en) | 2018-05-07 | 2019-03-18 | System reinforcement learning method and apparatus, electronic device, and computer storage medium |
Country Status (6)
Country | Link |
---|---|
US (1) | US11669711B2 (zh) |
JP (1) | JP6896176B2 (zh) |
KR (1) | KR102420715B1 (zh) |
CN (1) | CN108776834B (zh) |
SG (1) | SG11202006017SA (zh) |
WO (1) | WO2019214344A1 (zh) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108776834B (zh) | 2018-05-07 | 2021-08-06 | 上海商汤智能科技有限公司 | 系统增强学习方法和装置、电子设备、计算机存储介质 |
CN110211122A (zh) * | 2019-06-12 | 2019-09-06 | 京东方科技集团股份有限公司 | 一种检测图像处理方法及装置 |
CN110472029B (zh) * | 2019-08-01 | 2024-03-19 | 腾讯科技(深圳)有限公司 | 一种数据处理方法、装置以及计算机可读存储介质 |
CN110610534B (zh) * | 2019-09-19 | 2023-04-07 | 电子科技大学 | 基于Actor-Critic算法的口型动画自动生成方法 |
CN111488806A (zh) * | 2020-03-25 | 2020-08-04 | 天津大学 | 一种基于并行分支神经网络的多尺度人脸识别方法 |
CN111766782B (zh) * | 2020-06-28 | 2021-07-13 | 浙江大学 | 基于深度强化学习中Actor-Critic框架的策略选择方法 |
US20220253724A1 (en) * | 2021-02-10 | 2022-08-11 | Ford Global Technologies, Llc | Variance of gradient based active learning framework for training perception algorithms |
JP7507712B2 (ja) | 2021-03-18 | 2024-06-28 | 株式会社日本製鋼所 | 強化学習方法、コンピュータプログラム、強化学習装置及び成形機 |
CN114494081B (zh) * | 2022-04-01 | 2022-07-05 | 武汉大学 | 一种无人机遥感测绘图像增强方法 |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7001243B1 (en) * | 2003-06-27 | 2006-02-21 | Lam Research Corporation | Neural network control of chemical mechanical planarization |
JP2008542859A (ja) | 2005-05-07 | 2008-11-27 | エル ターラー、ステフエン | 有用な情報を自律的にブートストラッピングする装置 |
CN103020602B (zh) * | 2012-10-12 | 2015-10-14 | 北京建筑工程学院 | 基于神经网络的人脸识别方法 |
US9749188B2 (en) * | 2014-05-13 | 2017-08-29 | Cisco Technology, Inc. | Predictive networking architecture for next-generation multiservice, multicarrier WANs |
JP6072103B2 (ja) * | 2015-02-04 | 2017-02-01 | エヌ・ティ・ティ・コムウェア株式会社 | 学習装置、学習方法、およびプログラム |
CN105279555B (zh) * | 2015-10-28 | 2017-10-17 | 清华大学 | 一种基于进化算法的自适应学习神经网络实现方法 |
CN106709565A (zh) * | 2016-11-16 | 2017-05-24 | 广州视源电子科技股份有限公司 | 一种神经网络的优化方法及装置 |
CN108154222B (zh) * | 2016-12-02 | 2020-08-11 | 北京市商汤科技开发有限公司 | 深度神经网络训练方法和系统、电子设备 |
CN106651774B (zh) * | 2016-12-27 | 2020-12-04 | 深圳市捷顺科技实业股份有限公司 | 一种车牌超分辨率模型重建方法及装置 |
CN106934346B (zh) * | 2017-01-24 | 2019-03-15 | 北京大学 | 一种目标检测性能优化的方法 |
CN106941602B (zh) * | 2017-03-07 | 2020-10-13 | 中国铁路总公司 | 机车司机行为识别方法及装置 |
CN107301383B (zh) * | 2017-06-07 | 2020-11-24 | 华南理工大学 | 一种基于Fast R-CNN的路面交通标志识别方法 |
CN107704857B (zh) * | 2017-09-25 | 2020-07-24 | 北京邮电大学 | 一种端到端的轻量级车牌识别方法及装置 |
TWI699816B (zh) * | 2017-12-26 | 2020-07-21 | 雲象科技股份有限公司 | 自動化顯微鏡系統之控制方法、顯微鏡系統及電腦可讀取記錄媒體 |
CN108073910B (zh) * | 2017-12-29 | 2021-05-07 | 百度在线网络技术(北京)有限公司 | 用于生成人脸特征的方法和装置 |
CN108776834B (zh) * | 2018-05-07 | 2021-08-06 | 上海商汤智能科技有限公司 | 系统增强学习方法和装置、电子设备、计算机存储介质 |
-
2018
- 2018-05-07 CN CN201810428099.3A patent/CN108776834B/zh active Active
-
2019
- 2019-03-18 JP JP2020535040A patent/JP6896176B2/ja active Active
- 2019-03-18 SG SG11202006017SA patent/SG11202006017SA/en unknown
- 2019-03-18 KR KR1020207026754A patent/KR102420715B1/ko active IP Right Grant
- 2019-03-18 WO PCT/CN2019/078520 patent/WO2019214344A1/zh active Application Filing
-
2020
- 2020-06-18 US US16/904,915 patent/US11669711B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
KR102420715B1 (ko) | 2022-07-14 |
JP2021507421A (ja) | 2021-02-22 |
KR20200119873A (ko) | 2020-10-20 |
CN108776834B (zh) | 2021-08-06 |
CN108776834A (zh) | 2018-11-09 |
JP6896176B2 (ja) | 2021-06-30 |
WO2019214344A1 (zh) | 2019-11-14 |
US11669711B2 (en) | 2023-06-06 |
US20200349431A1 (en) | 2020-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202006017SA (en) | System reinforcement learning method and apparatus, electronic device, and computer storage medium | |
SG11202100959RA (en) | Data sharing method, apparatus, and system, and electronic device | |
SG11202005729TA (en) | Collision control method and apparatus, and electronic device and storage medium | |
SG11202103527XA (en) | Interactive plot implementation method, device, computer apparatus, and storage medium | |
SG11202102267XA (en) | Image processing method and apparatus, electronic device, and computer readable storage medium | |
EP3734475A4 (en) | METHOD AND DEVICE FOR TRAINING DATA, STORAGE MEDIUM AND ELECTRONIC DEVICE | |
SG11202102960XA (en) | Image processing method and apparatus, electronic device, and computer readable storage medium | |
SG11202000065TA (en) | Gaze point determination method and apparatus, electronic device, and computer storage medium | |
EP3627808A4 (en) | SCREEN EXTINGUISHING CONTROL METHOD AND APPARATUS, STORAGE MEDIUM, AND ELECTRONIC DEVICE | |
EP3813379A4 (en) | DATA PROCESSING APPARATUS AND METHOD, ELECTRONIC DEVICE, RECORDING SYSTEM AND MEDIA | |
SG11202005736VA (en) | Collision control method and apparatus, and electronic device and storage medium | |
EP3627429A4 (en) | INFORMATION PROCESSING METHOD AND DEVICE, ELECTRONIC DEVICE AND STORAGE MEDIUM | |
EP3993436C0 (en) | DATA PROCESSING METHOD AND APPARATUS, COMPUTER-READABLE STORAGE MEDIUM, AND ELECTRONIC DEVICE | |
EP3531290A4 (en) | DATA PROTECTION, DEVICE, ELECTRONIC DEVICE, STORAGE MEDIUM AND SYSTEM | |
SG11202007158UA (en) | Object prediction method and apparatus, electronic device and storage medium | |
SG11202010699XA (en) | Risk control method, risk control apparatus, electronic device, and storage medium | |
SG11202005080VA (en) | Method, apparatus and system for liveness detection, electronic device, and storage medium | |
EP3627461A4 (en) | INFORMATION PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE AND INFORMATION MEDIUM | |
EP3676702A4 (en) | METHOD AND DEVICE FOR DATA COMPILATION, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM | |
SG11202010510XA (en) | Data processing method and apparatus, electronic device and storage medium | |
SG11202102886TA (en) | Teaching system and method, electronic device, and storage medium | |
EP3509270A4 (en) | BACKUP PROCESS AND DEVICE, STORAGE MEDIUM AND ELECTRONIC DEVICE | |
SG11202103119UA (en) | Method and device for processing control information, electronic equipment, and storage medium | |
EP3796530A4 (en) | METHOD, DEVICE AND DEVICE FOR CURRENT EQUALIZATION AND COMPUTER-READABLE STORAGE MEDIUM | |
EP3843386A4 (en) | DATA PROCESSING PROCESS AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIA |