Text recognition method and apparatus, electronic device and storage medium

Info

Publication number
SG11202010916SA
Authority
SG
Singapore
Prior art keywords
electronic device
storage medium
recognition method
text recognition
text
Prior art date
Application number
SG11202010916SA
Other languages
English (en)
Inventor
Xuebo Liu
Original Assignee
Beijing Sensetime Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-03-29
Filing date
2020-01-17
Publication date
2020-12-30
Application filed by Beijing Sensetime Technology Development Co Ltd
Publication of SG11202010916SA

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/62 Text, e.g. of license plates, overlay texts or captions on TV images
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/16 Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/24 Aligning, centring, orientation detection or correction of the image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Databases & Information Systems (AREA)
  • Algebra (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Character Discrimination (AREA)
  • Image Analysis (AREA)
SG11202010916SA 2019-03-29 2020-01-17 Text recognition method and apparatus, electronic device and storage medium SG11202010916SA (en)

Applications Claiming Priority (2)

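Application Number Publication Priority Date Filing Date Title
CN201910251661.4A CN111753822B (zh) 2019-03-29 2019-03-29 Text recognition method and apparatus, electronic device and storage medium
PCT/CN2020/072804 WO2020199730A1 (zh) 2019-03-29 2020-01-17 Text recognition method and apparatus, electronic device and storage medium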

Publications (1)

Publication Number Publication Date
SG11202010916SA (en) 2020-12-30

Family

ID=72664623

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
SG11202010916SA SG11202010916SA (en) 2019-03-29 2020-01-17 Text recognition method and apparatus, electronic device and storage medium

Country Status (6)

Country Link
US (1) US12014275B2 (en)
JP (1) JP7153088B2 (ja)
CN (1) CN111753822B (zh)
SG (1) SG11202010916SA (en)
TW (1) TW202036464A (zh)
WO (1) WO2020199730A1 (zh)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7363107B2 (ja) * 2019-06-04 2023-10-18 コニカミノルタ株式会社 Idea support device, idea support system, and program
CN110569846A (zh) * 2019-09-16 2019-12-13 北京百度网讯科技有限公司 Image character recognition method, apparatus, device and storage medium
US11227009B1 (en) * 2019-09-30 2022-01-18 Amazon Technologies, Inc. Text de-obfuscation with image recognition of text
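CN112487826A (zh) * 2020-11-30 2021-03-12 北京百度网讯科技有限公司 Information extraction method, extraction model training method, apparatus and electronic device
CN112733830A (zh) * 2020-12-31 2021-04-30 上海芯翌智能科技有限公司 Shop signboard recognition method and apparatus, storage medium and computer device
CN112949477B (zh) * 2021-03-01 2024-03-15 苏州美能华智能科技有限公司 Information recognition method, apparatus and storage medium based on graph convolutional neural network
CN113190643B (zh) * 2021-04-13 2023-02-03 安阳师范学院 Information generation method, terminal device and computer-readable medium
CN113762050B (zh) * 2021-05-12 2024-05-24 腾讯云计算(北京)有限责任公司 Image data processing method, apparatus, device and medium
CN113326887B (zh) * 2021-06-16 2024-03-29 深圳思谋信息科技有限公司 Text detection method, apparatus and computer device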
US20220405524A1 (en) * 2021-06-17 2022-12-22 International Business Machines Corporation Optical character recognition training with semantic constraints
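CN113448477B (zh) * 2021-08-31 2021-11-23 南昌航空大学 Interactive image editing method, apparatus, readable storage medium and electronic device
CN113704478B (zh) * 2021-09-07 2023-08-22 平安银行股份有限公司 Text element extraction method, apparatus, electronic device and medium
CN113792741B (zh) * 2021-09-17 2023-08-11 平安普惠企业管理有限公司 Character recognition method, apparatus, device and storage medium
CN113837965B (zh) * 2021-09-26 2024-06-18 北京百度网讯科技有限公司 Image sharpness recognition method, apparatus, electronic device and storage medium
CN113869426B (zh) * 2021-09-29 2024-07-26 北京搜狗科技发展有限公司 Formula recognition method and apparatus
CN113688955B (zh) * 2021-10-25 2022-02-15 北京世纪好未来教育科技有限公司 Text recognition method, apparatus, device and medium
CN114067327A (zh) * 2021-11-18 2022-02-18 北京有竹居网络技术有限公司 Text recognition method, apparatus, readable medium and electronic device
CN114298054A (zh) * 2021-11-29 2022-04-08 北京捷通鸿泰科技有限公司 Text recognition method, apparatus, electronic device and readable storage medium
CN114239598A (zh) * 2021-12-17 2022-03-25 上海高德威智能交通系统有限公司 Method and apparatus for determining reading order of text elements, electronic device and storage medium
CN113963358B (zh) * 2021-12-20 2022-03-04 北京易真学思教育科技有限公司 Text recognition model training method, text recognition method, apparatus and electronic device
CN114207673A (zh) * 2021-12-20 2022-03-18 商汤国际私人有限公司 Sequence recognition method and apparatus, electronic device and storage medium
CN114495102B (zh) * 2022-01-12 2024-09-06 北京百度网讯科技有限公司 Text recognition method, and training method and apparatus for text recognition network
CN114495101A (zh) * 2022-01-12 2022-05-13 北京百度网讯科技有限公司 Text detection method, and training method and apparatus for text detection network
CN114492437B (zh) * 2022-02-16 2023-07-18 平安科技(深圳)有限公司 Keyword recognition method, apparatus, electronic device and storage medium
CN115062118B (zh) * 2022-07-26 2023-01-31 神州医疗科技股份有限公司 Dual-channel information extraction method, apparatus, electronic device and medium
CN115601752A (zh) * 2022-10-26 2023-01-13 维沃移动通信有限公司(Cn) Character recognition method, apparatus, electronic device and medium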

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0954814A (ja) * 1995-08-04 1997-02-25 At & T Corp Analysis of input symbol expressions and scoring system for possible interpretations of input symbol expressions
GB201511887D0 (en) * 2015-07-07 2015-08-19 Touchtype Ltd Improved artificial neural network for language modelling and prediction
JP6057112B1 (ja) * 2016-04-19 2017-01-11 AI inside株式会社 Character recognition device, method and program
WO2018094295A1 (en) 2016-11-18 2018-05-24 Salesforce.Com, Inc. Adaptive attention model for image captioning
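CN108287858B (zh) * 2017-03-02 2021-08-10 腾讯科技(深圳)有限公司 Semantic extraction method and apparatus for natural language
CN107168952B (zh) * 2017-05-15 2021-06-04 北京百度网讯科技有限公司 Artificial intelligence-based information generation method and apparatus
CN108228686B (zh) * 2017-06-15 2021-03-23 北京市商汤科技开发有限公司 Method, apparatus and electronic device for image-text matching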
US10628668B2 (en) * 2017-08-09 2020-04-21 Open Text Sa Ulc Systems and methods for generating and using semantic images in deep learning for classification and data extraction
CN107590192B (zh) * 2017-08-11 2023-05-05 深圳市腾讯计算机系统有限公司 Mathematical processing method, apparatus, device and storage medium for text problems
CN107644209A (zh) * 2017-09-21 2018-01-30 百度在线网络技术(北京)有限公司 Face detection method and apparatus
US10438371B2 (en) 2017-09-22 2019-10-08 Zoox, Inc. Three-dimensional bounding box from two-dimensional image and point cloud data
CN107797985B (zh) * 2017-09-27 2022-02-25 百度在线网络技术(北京)有限公司 Method and apparatus for establishing a synonymy discrimination model and discriminating synonymous text
US10810467B2 (en) * 2017-11-17 2020-10-20 Hong Kong Applied Science and Technology Research Institute Company Limited Flexible integrating recognition and semantic processing
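CN108288078B (zh) * 2017-12-07 2020-09-29 腾讯科技(深圳)有限公司 Method, apparatus and medium for recognizing characters in an image
CN108287585A (zh) 2018-01-25 2018-07-17 西安文理学院 Regulated power supply
CN108615036B (zh) * 2018-05-09 2021-10-01 中国科学技术大学 Natural scene text recognition method based on convolutional attention network
CN108874174B (zh) * 2018-05-29 2020-04-24 腾讯科技(深圳)有限公司 Text error correction method, apparatus and related device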
US10585988B2 (en) * 2018-06-08 2020-03-10 Microsoft Technology Licensing, Llc Graph representations for identifying a next word
CN108960330B (zh) * 2018-07-09 2021-09-10 西安电子科技大学 Remote sensing image semantic generation method based on fast region-based convolutional neural network
EP3598339B1 (en) * 2018-07-19 2024-09-04 Tata Consultancy Services Limited Systems and methods for end-to-end handwritten text recognition using neural networks
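CN109389091B (zh) * 2018-10-22 2022-05-03 重庆邮电大学 Character recognition system and method combining neural network and attention mechanism
CN109446328A (zh) * 2018-11-02 2019-03-08 成都四方伟业软件股份有限公司 Text recognition method, apparatus and storage medium thereof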
US11010560B2 (en) * 2018-11-08 2021-05-18 International Business Machines Corporation Multi-resolution convolutional neural networks for sequence modeling
CN109471945B (zh) * 2018-11-12 2021-11-23 中山大学 Deep learning-based medical text classification method, apparatus and storage medium

Also Published As

Publication number Publication date
CN111753822A (zh) 2020-10-09
JP2021520002A (ja) 2021-08-12
CN111753822B (zh) 2024-05-24
WO2020199730A1 (zh) 2020-10-08
US12014275B2 (en) 2024-06-18
US20210042474A1 (en) 2021-02-11
JP7153088B2 (ja) 2022-10-13
TW202036464A (zh) 2020-10-01

Similar Documents

Publication Publication Date Title
SG11202010916SA (en) Text recognition method and apparatus, electronic device and storage medium
SG11202105174XA (en) Text sequence recognition method and apparatus, electronic device, and storage medium
SG11202110565RA (en) Face recognition method and apparatus, electronic device, and storage medium
SG11202006192YA (en) Face recognition method and apparatus, electronic device, and storage medium
EP3770905C0 (en) SPEECH RECOGNITION METHOD, DEVICE AND APPARATUS AND STORAGE MEDIUM
SG11202006328YA (en) Image clustering method and apparatus, electronic device, and storage medium
EP4013007A4 (en) VEHICLE-ROAD COOPERATION APPARATUS AND METHOD, ELECTRONIC DEVICE AND STORAGE MEDIA
SG11202109192QA (en) Interaction method and apparatus, electronic device and storage medium
SG11202003818YA (en) Key point detection method and apparatus, electronic device, and storage medium
SG11201911625YA (en) Target recognition method and apparatus, storage medium, and electronic device
SG11202000076WA (en) Target object recognition method and device, storage medium, and electronic apparatus
EP3840484A4 (en) WAKE-UP METHOD, WAKE-UP DEVICE, ELECTRONIC DEVICE AND COMPUTER-READABLE STORAGE MEDIA
EP3648099A4 (en) VOICE RECOGNITION METHOD, DEVICE, DEVICE AND STORAGE MEDIUM
EP3779800A4 (en) IMAGE RECOGNITION PROCESS, APPARATUS, ELECTRONIC DEVICE AND INFORMATION MEDIA
SG11201912620YA (en) Voiceprint recognition method, device, terminal apparatus and storage medium
SG11201913865PA (en) Method and apparatus for recognizing sequence in image, electronic device, and storage medium
EP3993436C0 (en) DATA PROCESSING METHOD AND APPARATUS, COMPUTER-READABLE STORAGE MEDIUM, AND ELECTRONIC DEVICE
EP3584786A4 (en) VOICE RECOGNITION METHOD, ELECTRONIC DEVICE, AND COMPUTER STORAGE MEDIUM
SG11202005736VA (en) Collision control method and apparatus, and electronic device and storage medium
EP3627429A4 (en) INFORMATION PROCESSING METHOD AND DEVICE, ELECTRONIC DEVICE AND STORAGE MEDIUM
EP3907653A4 (en) ACTION RECOGNITION METHOD, EQUIPMENT AND DEVICE AND STORAGE MEDIA
SG11202007158UA (en) Object prediction method and apparatus, electronic device and storage medium
EP3627461A4 (en) INFORMATION PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE AND INFORMATION MEDIUM
SG11202010699XA (en) Risk control method, risk control apparatus, electronic device, and storage medium
EP3848798A4 (en) APPARATUS AND METHOD FOR PROCESSING INFORMATION, DATA MEDIA, AND ELECTRONIC DEVICE