CN113748429A - 单词识别方法、设备及存储介质 - Google Patents

单词识别方法、设备及存储介质 Download PDF

Info

Publication number
CN113748429A
CN113748429A CN202080000447.2A CN202080000447A CN113748429A CN 113748429 A CN113748429 A CN 113748429A CN 202080000447 A CN202080000447 A CN 202080000447A CN 113748429 A CN113748429 A CN 113748429A
Authority
CN
China
Prior art keywords
word
recognized
image
taking
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080000447.2A
Other languages
English (en)
Inventor
黄光伟
李月
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BOE Technology Group Co Ltd filed Critical BOE Technology Group Co Ltd
Publication of CN113748429A publication Critical patent/CN113748429A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19173Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/24Aligning, centring, orientation detection or correction of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Discrimination (AREA)

Abstract

本发明提出一种单词识别方法、设备及存储介质,其中,方法包括:采集待识别的单词图像;从待识别的单词图像中识别待识别的单词的每个字符的边缘范围,确定待识别的单词的几何位置,将待识别的单词的几何位置拉伸成水平位置;识别处于水平位置的待识别的单词。解决现有技术中当文本发生倾斜、透视、弯曲等情况可能使得文字相对模糊时,对文本识别准确性还有待进一步提高的技术问题。

Description

PCT国内申请,说明书已公开。

Claims (21)

  1. PCT国内申请,权利要求书已公开。
CN202080000447.2A 2020-03-31 2020-03-31 单词识别方法、设备及存储介质 Pending CN113748429A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/082566 WO2021196013A1 (zh) 2020-03-31 2020-03-31 单词识别方法、设备及存储介质

Publications (1)

Publication Number Publication Date
CN113748429A true CN113748429A (zh) 2021-12-03

Family

ID=77926983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080000447.2A Pending CN113748429A (zh) 2020-03-31 2020-03-31 单词识别方法、设备及存储介质

Country Status (3)

Country Link
US (1) US11651604B2 (zh)
CN (1) CN113748429A (zh)
WO (1) WO2021196013A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111783760B (zh) * 2020-06-30 2023-08-08 北京百度网讯科技有限公司 文字识别的方法、装置、电子设备及计算机可读存储介质
CN113902046B (zh) * 2021-12-10 2022-02-18 北京惠朗时代科技有限公司 一种特效字体识别方法及装置
CN115527215A (zh) * 2022-10-10 2022-12-27 杭州睿胜软件有限公司 包含文本的图像处理方法、系统及存储介质

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08315077A (ja) * 1995-05-15 1996-11-29 Nippon Telegr & Teleph Corp <Ntt> 単語認識方法
JPH09319824A (ja) * 1996-05-30 1997-12-12 Hitachi Ltd 帳票認識方法
KR20010078127A (ko) * 2000-01-28 2001-08-20 니시무로 타이죠 단어인식방법과 단어인식 프로그램을 기억한 기억매체
JP2005202533A (ja) * 2004-01-14 2005-07-28 Hitachi Ltd 情報処理装置、情報処理方法及びソフトウェア
US20100073735A1 (en) * 2008-05-06 2010-03-25 Compulink Management Center, Inc. Camera-based document imaging
CN107516096A (zh) * 2016-06-15 2017-12-26 阿里巴巴集团控股有限公司 一种字符识别方法及装置
CN108985137A (zh) * 2017-06-02 2018-12-11 杭州海康威视数字技术股份有限公司 一种车牌识别方法、装置及系统
CN110569830A (zh) * 2019-08-01 2019-12-13 平安科技(深圳)有限公司 多语言文本识别方法、装置、计算机设备及存储介质
WO2020010547A1 (zh) * 2018-07-11 2020-01-16 深圳前海达闼云端智能科技有限公司 字符识别方法、装置、存储介质及电子设备

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI319547B (en) * 2006-12-01 2010-01-11 Compal Electronics Inc Method for generating typographical line
US9171204B2 (en) 2012-12-12 2015-10-27 Qualcomm Incorporated Method of perspective correction for devanagari text
CN104239861A (zh) * 2014-09-10 2014-12-24 深圳市易讯天空网络技术有限公司 卷曲文本图像预处理方法和彩票扫描识别方法
US9977976B2 (en) * 2016-06-29 2018-05-22 Konica Minolta Laboratory U.S.A., Inc. Path score calculating method for intelligent character recognition
CN106951896B (zh) * 2017-02-22 2020-01-03 武汉黄丫智能科技发展有限公司 一种车牌图像倾斜校正方法
CN110321755A (zh) * 2018-03-28 2019-10-11 中移(苏州)软件技术有限公司 一种识别方法及装置
US10783400B2 (en) * 2018-04-06 2020-09-22 Dropbox, Inc. Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
CN108647681B (zh) * 2018-05-08 2019-06-14 重庆邮电大学 一种带有文本方向校正的英文文本检测方法

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08315077A (ja) * 1995-05-15 1996-11-29 Nippon Telegr & Teleph Corp <Ntt> 単語認識方法
JPH09319824A (ja) * 1996-05-30 1997-12-12 Hitachi Ltd 帳票認識方法
KR20010078127A (ko) * 2000-01-28 2001-08-20 니시무로 타이죠 단어인식방법과 단어인식 프로그램을 기억한 기억매체
JP2005202533A (ja) * 2004-01-14 2005-07-28 Hitachi Ltd 情報処理装置、情報処理方法及びソフトウェア
US20100073735A1 (en) * 2008-05-06 2010-03-25 Compulink Management Center, Inc. Camera-based document imaging
CN107516096A (zh) * 2016-06-15 2017-12-26 阿里巴巴集团控股有限公司 一种字符识别方法及装置
CN108985137A (zh) * 2017-06-02 2018-12-11 杭州海康威视数字技术股份有限公司 一种车牌识别方法、装置及系统
WO2020010547A1 (zh) * 2018-07-11 2020-01-16 深圳前海达闼云端智能科技有限公司 字符识别方法、装置、存储介质及电子设备
CN110569830A (zh) * 2019-08-01 2019-12-13 平安科技(深圳)有限公司 多语言文本识别方法、装置、计算机设备及存储介质

Also Published As

Publication number Publication date
US11651604B2 (en) 2023-05-16
US20220036112A1 (en) 2022-02-03
WO2021196013A1 (zh) 2021-10-07

Similar Documents

Publication Publication Date Title
CN110232311B (zh) 手部图像的分割方法、装置及计算机设备
US5410611A (en) Method for identifying word bounding boxes in text
US7302099B2 (en) Stroke segmentation for template-based cursive handwriting recognition
US7369702B2 (en) Template-based cursive handwriting recognition
CN113748429A (zh) 单词识别方法、设备及存储介质
CN110287952B (zh) 一种维语图片字符的识别方法及系统
CN111626238B (zh) 文本识别方法、电子设备及存储介质
JP5229050B2 (ja) 画像からの文書領域抽出装置、方法、及びプログラム
CN111353501A (zh) 一种基于深度学习的书本点读方法及系统
CN107545223B (zh) 图像识别方法及电子设备
US8406467B2 (en) Method and system for actively detecting and recognizing placards
CN113378764B (zh) 基于聚类算法的视频人脸采集方法、装置、设备及介质
JP4704601B2 (ja) 文字認識方法,プログラム及び記録媒体
CN111492407A (zh) 用于绘图美化的系统和方法
CN112419207A (zh) 一种图像矫正方法及装置、系统
JP6542230B2 (ja) 投影ひずみを補正するための方法及びシステム
CN111753812A (zh) 文本识别方法及设备
CN114511865A (zh) 一种结构化信息的生成方法、装置和计算机可读存储介质
CN113628113A (zh) 一种图像拼接方法及其相关设备
CN111340040B (zh) 一种纸张字符识别方法、装置、电子设备及存储介质
CN117392698A (zh) 手绘电路图的识别方法、装置、设备和存储介质
JPH10307889A (ja) 文字認識方法、装置及び文字認識プログラムを記録した記録媒体
US20150186718A1 (en) Segmentation of Overwritten Online Handwriting Input
CN109978829B (zh) 一种待检测对象的检测方法及其系统
CN113780040A (zh) 唇部关键点的定位方法及装置、存储介质、电子设备

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination