CN114359905B - 一种文本识别方法、装置、电子设备及存储介质 - Google Patents

一种文本识别方法、装置、电子设备及存储介质 Download PDF

Info

Publication number
CN114359905B
CN114359905B CN202210013631.1A CN202210013631A CN114359905B CN 114359905 B CN114359905 B CN 114359905B CN 202210013631 A CN202210013631 A CN 202210013631A CN 114359905 B CN114359905 B CN 114359905B
Authority
CN
China
Prior art keywords
dimension
feature
feature map
map
dimensional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210013631.1A
Other languages
English (en)
Chinese (zh)
Other versions
CN114359905A (zh
Inventor
吕鹏原
范森
王晓燕
庾悦晨
章成全
姚锟
韩钧宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202210013631.1A priority Critical patent/CN114359905B/zh
Publication of CN114359905A publication Critical patent/CN114359905A/zh
Priority to JP2022140728A priority patent/JP7418517B2/ja
Priority to US17/946,464 priority patent/US20230010031A1/en
Priority to KR1020220147012A priority patent/KR20220155948A/ko
Application granted granted Critical
Publication of CN114359905B publication Critical patent/CN114359905B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761Proximity, similarity or dissimilarity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/19007Matching; Proximity measures
    • G06V30/19093Proximity measures, i.e. similarity or distance measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19127Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Character Discrimination (AREA)
CN202210013631.1A 2022-01-06 2022-01-06 一种文本识别方法、装置、电子设备及存储介质 Active CN114359905B (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CN202210013631.1A CN114359905B (zh) 2022-01-06 2022-01-06 一种文本识别方法、装置、电子设备及存储介质
JP2022140728A JP7418517B2 (ja) 2022-01-06 2022-09-05 テキスト認識の方法、装置、電子機器、記憶媒体およびコンピュータプログラム
US17/946,464 US20230010031A1 (en) 2022-01-06 2022-09-16 Method for recognizing text, electronic device and storage medium
KR1020220147012A KR20220155948A (ko) 2022-01-06 2022-11-07 텍스트 인식 방법, 장치, 전자 기기 및 저장 매체

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210013631.1A CN114359905B (zh) 2022-01-06 2022-01-06 一种文本识别方法、装置、电子设备及存储介质

Publications (2)

Publication Number Publication Date
CN114359905A CN114359905A (zh) 2022-04-15
CN114359905B true CN114359905B (zh) 2023-05-26

Family

ID=81107773

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210013631.1A Active CN114359905B (zh) 2022-01-06 2022-01-06 一种文本识别方法、装置、电子设备及存储介质

Country Status (4)

Country Link
US (1) US20230010031A1 (ko)
JP (1) JP7418517B2 (ko)
KR (1) KR20220155948A (ko)
CN (1) CN114359905B (ko)

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102081731B (zh) * 2009-11-26 2013-01-23 中国移动通信集团广东有限公司 一种从图像中提取文本的方法和装置
CN106599773B (zh) * 2016-10-31 2019-12-24 清华大学 用于智能驾驶的深度学习图像识别方法、系统及终端设备
CN111126410B (zh) 2019-12-31 2022-11-18 讯飞智元信息科技有限公司 字符识别方法、装置、设备及可读存储介质
JP7479925B2 (ja) 2020-05-14 2024-05-09 キヤノン株式会社 画像処理システム、画像処理方法、及びプログラム
CN111914843B (zh) 2020-08-20 2021-04-16 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) 文字检测方法、系统、设备及存储介质
CN112801103B (zh) * 2021-01-19 2024-02-27 网易(杭州)网络有限公司 文本方向识别及文本方向识别模型训练方法、装置
CN113435210B (zh) * 2021-06-30 2024-10-15 平安科技(深圳)有限公司 社交图片文本识别方法、装置、计算机设备及存储介质
CN113591862A (zh) 2021-07-09 2021-11-02 上海智臻智能网络科技股份有限公司 文本识别的方法及装置

Also Published As

Publication number Publication date
JP2022172292A (ja) 2022-11-15
US20230010031A1 (en) 2023-01-12
JP7418517B2 (ja) 2024-01-19
CN114359905A (zh) 2022-04-15
KR20220155948A (ko) 2022-11-24

Similar Documents

Publication Publication Date Title
CN113343982B (zh) 多模态特征融合的实体关系提取方法、装置和设备
CN114186632B (zh) 关键点检测模型的训练方法、装置、设备、存储介质
CN113657397B (zh) 循环生成网络模型的训练方法、建立字库的方法和装置
US20020150298A1 (en) System and method for signal matching and characterization
CN113393468A (zh) 图像处理方法、模型训练方法、装置和电子设备
CN113888410A (zh) 图像超分辨率方法、装置、设备、存储介质以及程序产品
US20230005171A1 (en) Visual positioning method, related apparatus and computer program product
CN114792355A (zh) 虚拟形象生成方法、装置、电子设备和存储介质
CN114092708A (zh) 特征图像的处理方法、装置和存储介质
CN114049491A (zh) 指纹分割模型训练、指纹分割方法、装置、设备及介质
CN113344213A (zh) 知识蒸馏方法、装置、电子设备及计算机可读存储介质
CN117746125A (zh) 图像处理模型的训练方法、装置及电子设备
CN114359905B (zh) 一种文本识别方法、装置、电子设备及存储介质
CN115760614A (zh) 图像去噪方法、装置、电子设备及存储介质
CN113436292B (zh) 图像处理方法、图像处理模型的训练方法、装置及设备
US20220318950A1 (en) Video enhancement method and apparatus, and electronic device and storage medium
CN112784967B (zh) 信息处理方法、装置以及电子设备
CN114359903B (zh) 一种文本识别方法、装置、设备及存储介质
CN114723796A (zh) 一种三维点云生成方法、装置及电子设备
CN114842489A (zh) 表格解析方法及装置
CN113903071A (zh) 人脸识别方法、装置、电子设备和存储介质
CN114282664A (zh) 自反馈模型训练方法、装置、路侧设备及云控平台
CN113205131A (zh) 图像数据的处理方法、装置、路侧设备和云控平台
CN112991451A (zh) 图像识别方法、相关装置及计算机程序产品
CN114581676B (zh) 特征图像的处理方法、装置和存储介质

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant