SG11202006017SA - System reinforcement learning method and apparatus, electronic device, and computer storage medium - Google Patents

System reinforcement learning method and apparatus, electronic device, and computer storage medium

Info

Publication number
SG11202006017SA
SG11202006017SA SG11202006017SA SG11202006017SA SG11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA SG 11202006017S A SG11202006017S A SG 11202006017SA
Authority
SG
Singapore
Prior art keywords
electronic device
storage medium
computer storage
learning method
reinforcement learning
Prior art date
Application number
SG11202006017SA
Other languages
English (en)
Inventor
Shuqin Xie
Zitian Chen
Chao Xu
Cewu Lu
Original Assignee
Shanghai Sensetime Intelligent Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Sensetime Intelligent Tech Co Ltd filed Critical Shanghai Sensetime Intelligent Tech Co Ltd
Publication of SG11202006017SA publication Critical patent/SG11202006017SA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/2155Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
SG11202006017SA 2018-05-07 2019-03-18 System reinforcement learning method and apparatus, electronic device, and computer storage medium SG11202006017SA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810428099.3A CN108776834B (zh) 2018-05-07 2018-05-07 系统增强学习方法和装置、电子设备、计算机存储介质
PCT/CN2019/078520 WO2019214344A1 (zh) 2018-05-07 2019-03-18 系统增强学习方法和装置、电子设备、计算机存储介质

Publications (1)

Publication Number Publication Date
SG11202006017SA true SG11202006017SA (en) 2020-07-29

Family

ID=64026991

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202006017SA SG11202006017SA (en) 2018-05-07 2019-03-18 System reinforcement learning method and apparatus, electronic device, and computer storage medium

Country Status (6)

Country Link
US (1) US11669711B2 (zh)
JP (1) JP6896176B2 (zh)
KR (1) KR102420715B1 (zh)
CN (1) CN108776834B (zh)
SG (1) SG11202006017SA (zh)
WO (1) WO2019214344A1 (zh)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108776834B (zh) 2018-05-07 2021-08-06 上海商汤智能科技有限公司 系统增强学习方法和装置、电子设备、计算机存储介质
CN110211122A (zh) * 2019-06-12 2019-09-06 京东方科技集团股份有限公司 一种检测图像处理方法及装置
CN110472029B (zh) * 2019-08-01 2024-03-19 腾讯科技(深圳)有限公司 一种数据处理方法、装置以及计算机可读存储介质
CN110610534B (zh) * 2019-09-19 2023-04-07 电子科技大学 基于Actor-Critic算法的口型动画自动生成方法
CN111488806A (zh) * 2020-03-25 2020-08-04 天津大学 一种基于并行分支神经网络的多尺度人脸识别方法
CN111766782B (zh) * 2020-06-28 2021-07-13 浙江大学 基于深度强化学习中Actor-Critic框架的策略选择方法
US20220253724A1 (en) * 2021-02-10 2022-08-11 Ford Global Technologies, Llc Variance of gradient based active learning framework for training perception algorithms
JP7507712B2 (ja) 2021-03-18 2024-06-28 株式会社日本製鋼所 強化学習方法、コンピュータプログラム、強化学習装置及び成形機
CN114494081B (zh) * 2022-04-01 2022-07-05 武汉大学 一种无人机遥感测绘图像增强方法

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7001243B1 (en) * 2003-06-27 2006-02-21 Lam Research Corporation Neural network control of chemical mechanical planarization
JP2008542859A (ja) 2005-05-07 2008-11-27 エル ターラー、ステフエン 有用な情報を自律的にブートストラッピングする装置
CN103020602B (zh) * 2012-10-12 2015-10-14 北京建筑工程学院 基于神经网络的人脸识别方法
US9749188B2 (en) * 2014-05-13 2017-08-29 Cisco Technology, Inc. Predictive networking architecture for next-generation multiservice, multicarrier WANs
JP6072103B2 (ja) * 2015-02-04 2017-02-01 エヌ・ティ・ティ・コムウェア株式会社 学習装置、学習方法、およびプログラム
CN105279555B (zh) * 2015-10-28 2017-10-17 清华大学 一种基于进化算法的自适应学习神经网络实现方法
CN106709565A (zh) * 2016-11-16 2017-05-24 广州视源电子科技股份有限公司 一种神经网络的优化方法及装置
CN108154222B (zh) * 2016-12-02 2020-08-11 北京市商汤科技开发有限公司 深度神经网络训练方法和系统、电子设备
CN106651774B (zh) * 2016-12-27 2020-12-04 深圳市捷顺科技实业股份有限公司 一种车牌超分辨率模型重建方法及装置
CN106934346B (zh) * 2017-01-24 2019-03-15 北京大学 一种目标检测性能优化的方法
CN106941602B (zh) * 2017-03-07 2020-10-13 中国铁路总公司 机车司机行为识别方法及装置
CN107301383B (zh) * 2017-06-07 2020-11-24 华南理工大学 一种基于Fast R-CNN的路面交通标志识别方法
CN107704857B (zh) * 2017-09-25 2020-07-24 北京邮电大学 一种端到端的轻量级车牌识别方法及装置
TWI699816B (zh) * 2017-12-26 2020-07-21 雲象科技股份有限公司 自動化顯微鏡系統之控制方法、顯微鏡系統及電腦可讀取記錄媒體
CN108073910B (zh) * 2017-12-29 2021-05-07 百度在线网络技术(北京)有限公司 用于生成人脸特征的方法和装置
CN108776834B (zh) * 2018-05-07 2021-08-06 上海商汤智能科技有限公司 系统增强学习方法和装置、电子设备、计算机存储介质

Also Published As

Publication number Publication date
KR102420715B1 (ko) 2022-07-14
JP2021507421A (ja) 2021-02-22
KR20200119873A (ko) 2020-10-20
CN108776834B (zh) 2021-08-06
CN108776834A (zh) 2018-11-09
JP6896176B2 (ja) 2021-06-30
WO2019214344A1 (zh) 2019-11-14
US11669711B2 (en) 2023-06-06
US20200349431A1 (en) 2020-11-05

Similar Documents

Publication Publication Date Title
SG11202006017SA (en) System reinforcement learning method and apparatus, electronic device, and computer storage medium
SG11202100959RA (en) Data sharing method, apparatus, and system, and electronic device
SG11202005729TA (en) Collision control method and apparatus, and electronic device and storage medium
SG11202103527XA (en) Interactive plot implementation method, device, computer apparatus, and storage medium
SG11202102267XA (en) Image processing method and apparatus, electronic device, and computer readable storage medium
EP3734475A4 (en) METHOD AND DEVICE FOR TRAINING DATA, STORAGE MEDIUM AND ELECTRONIC DEVICE
SG11202102960XA (en) Image processing method and apparatus, electronic device, and computer readable storage medium
SG11202000065TA (en) Gaze point determination method and apparatus, electronic device, and computer storage medium
EP3627808A4 (en) SCREEN EXTINGUISHING CONTROL METHOD AND APPARATUS, STORAGE MEDIUM, AND ELECTRONIC DEVICE
EP3813379A4 (en) DATA PROCESSING APPARATUS AND METHOD, ELECTRONIC DEVICE, RECORDING SYSTEM AND MEDIA
SG11202005736VA (en) Collision control method and apparatus, and electronic device and storage medium
EP3627429A4 (en) INFORMATION PROCESSING METHOD AND DEVICE, ELECTRONIC DEVICE AND STORAGE MEDIUM
EP3993436C0 (en) DATA PROCESSING METHOD AND APPARATUS, COMPUTER-READABLE STORAGE MEDIUM, AND ELECTRONIC DEVICE
EP3531290A4 (en) DATA PROTECTION, DEVICE, ELECTRONIC DEVICE, STORAGE MEDIUM AND SYSTEM
SG11202007158UA (en) Object prediction method and apparatus, electronic device and storage medium
SG11202010699XA (en) Risk control method, risk control apparatus, electronic device, and storage medium
SG11202005080VA (en) Method, apparatus and system for liveness detection, electronic device, and storage medium
EP3627461A4 (en) INFORMATION PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE AND INFORMATION MEDIUM
EP3676702A4 (en) METHOD AND DEVICE FOR DATA COMPILATION, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIUM
SG11202010510XA (en) Data processing method and apparatus, electronic device and storage medium
SG11202102886TA (en) Teaching system and method, electronic device, and storage medium
EP3509270A4 (en) BACKUP PROCESS AND DEVICE, STORAGE MEDIUM AND ELECTRONIC DEVICE
SG11202103119UA (en) Method and device for processing control information, electronic equipment, and storage medium
EP3796530A4 (en) METHOD, DEVICE AND DEVICE FOR CURRENT EQUALIZATION AND COMPUTER-READABLE STORAGE MEDIUM
EP3843386A4 (en) DATA PROCESSING PROCESS AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIA