CN113748429A - 单词识别方法、设备及存储介质 - Google Patents
单词识别方法、设备及存储介质 Download PDFInfo
- Publication number
- CN113748429A CN113748429A CN202080000447.2A CN202080000447A CN113748429A CN 113748429 A CN113748429 A CN 113748429A CN 202080000447 A CN202080000447 A CN 202080000447A CN 113748429 A CN113748429 A CN 113748429A
- Authority
- CN
- China
- Prior art keywords
- word
- recognized
- image
- taking
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 68
- 102100032202 Cornulin Human genes 0.000 claims description 33
- 101000920981 Homo sapiens Cornulin Proteins 0.000 claims description 33
- 238000011176 pooling Methods 0.000 claims description 19
- 238000005070 sampling Methods 0.000 claims description 17
- 238000013527 convolutional neural network Methods 0.000 claims description 12
- 238000013528 artificial neural network Methods 0.000 claims description 10
- 230000000306 recurrent effect Effects 0.000 claims description 8
- 230000002457 bidirectional effect Effects 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 7
- 238000013518 transcription Methods 0.000 claims description 7
- 230000035897 transcription Effects 0.000 claims description 7
- 239000011159 matrix material Substances 0.000 claims description 6
- 230000009466 transformation Effects 0.000 claims description 6
- 238000013519 translation Methods 0.000 description 20
- 230000006870 function Effects 0.000 description 18
- 238000010606 normalization Methods 0.000 description 13
- 230000004913 activation Effects 0.000 description 12
- 238000012545 processing Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 11
- 238000012937 correction Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 238000000605 extraction Methods 0.000 description 4
- 238000005452 bending Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 125000004122 cyclic group Chemical group 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/19—Recognition using electronic means
- G06V30/191—Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06V30/19173—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/24—Aligning, centring, orientation detection or correction of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/153—Segmentation of character regions using recognition of characters or words
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Character Discrimination (AREA)
Abstract
本发明提出一种单词识别方法、设备及存储介质,其中,方法包括:采集待识别的单词图像;从待识别的单词图像中识别待识别的单词的每个字符的边缘范围,确定待识别的单词的几何位置,将待识别的单词的几何位置拉伸成水平位置;识别处于水平位置的待识别的单词。解决现有技术中当文本发生倾斜、透视、弯曲等情况可能使得文字相对模糊时,对文本识别准确性还有待进一步提高的技术问题。
Description
PCT国内申请,说明书已公开。
Claims (21)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/082566 WO2021196013A1 (zh) | 2020-03-31 | 2020-03-31 | 单词识别方法、设备及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113748429A true CN113748429A (zh) | 2021-12-03 |
Family
ID=77926983
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080000447.2A Pending CN113748429A (zh) | 2020-03-31 | 2020-03-31 | 单词识别方法、设备及存储介质 |
Country Status (3)
Country | Link |
---|---|
US (1) | US11651604B2 (zh) |
CN (1) | CN113748429A (zh) |
WO (1) | WO2021196013A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111783760B (zh) * | 2020-06-30 | 2023-08-08 | 北京百度网讯科技有限公司 | 文字识别的方法、装置、电子设备及计算机可读存储介质 |
CN113902046B (zh) * | 2021-12-10 | 2022-02-18 | 北京惠朗时代科技有限公司 | 一种特效字体识别方法及装置 |
CN115527215A (zh) * | 2022-10-10 | 2022-12-27 | 杭州睿胜软件有限公司 | 包含文本的图像处理方法、系统及存储介质 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08315077A (ja) * | 1995-05-15 | 1996-11-29 | Nippon Telegr & Teleph Corp <Ntt> | 単語認識方法 |
JPH09319824A (ja) * | 1996-05-30 | 1997-12-12 | Hitachi Ltd | 帳票認識方法 |
KR20010078127A (ko) * | 2000-01-28 | 2001-08-20 | 니시무로 타이죠 | 단어인식방법과 단어인식 프로그램을 기억한 기억매체 |
JP2005202533A (ja) * | 2004-01-14 | 2005-07-28 | Hitachi Ltd | 情報処理装置、情報処理方法及びソフトウェア |
US20100073735A1 (en) * | 2008-05-06 | 2010-03-25 | Compulink Management Center, Inc. | Camera-based document imaging |
CN107516096A (zh) * | 2016-06-15 | 2017-12-26 | 阿里巴巴集团控股有限公司 | 一种字符识别方法及装置 |
CN108985137A (zh) * | 2017-06-02 | 2018-12-11 | 杭州海康威视数字技术股份有限公司 | 一种车牌识别方法、装置及系统 |
CN110569830A (zh) * | 2019-08-01 | 2019-12-13 | 平安科技(深圳)有限公司 | 多语言文本识别方法、装置、计算机设备及存储介质 |
WO2020010547A1 (zh) * | 2018-07-11 | 2020-01-16 | 深圳前海达闼云端智能科技有限公司 | 字符识别方法、装置、存储介质及电子设备 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI319547B (en) * | 2006-12-01 | 2010-01-11 | Compal Electronics Inc | Method for generating typographical line |
US9171204B2 (en) | 2012-12-12 | 2015-10-27 | Qualcomm Incorporated | Method of perspective correction for devanagari text |
CN104239861A (zh) * | 2014-09-10 | 2014-12-24 | 深圳市易讯天空网络技术有限公司 | 卷曲文本图像预处理方法和彩票扫描识别方法 |
US9977976B2 (en) * | 2016-06-29 | 2018-05-22 | Konica Minolta Laboratory U.S.A., Inc. | Path score calculating method for intelligent character recognition |
CN106951896B (zh) * | 2017-02-22 | 2020-01-03 | 武汉黄丫智能科技发展有限公司 | 一种车牌图像倾斜校正方法 |
CN110321755A (zh) * | 2018-03-28 | 2019-10-11 | 中移(苏州)软件技术有限公司 | 一种识别方法及装置 |
US10783400B2 (en) * | 2018-04-06 | 2020-09-22 | Dropbox, Inc. | Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks |
CN108647681B (zh) * | 2018-05-08 | 2019-06-14 | 重庆邮电大学 | 一种带有文本方向校正的英文文本检测方法 |
-
2020
- 2020-03-31 WO PCT/CN2020/082566 patent/WO2021196013A1/zh active Application Filing
- 2020-03-31 CN CN202080000447.2A patent/CN113748429A/zh active Pending
- 2020-03-31 US US17/263,418 patent/US11651604B2/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08315077A (ja) * | 1995-05-15 | 1996-11-29 | Nippon Telegr & Teleph Corp <Ntt> | 単語認識方法 |
JPH09319824A (ja) * | 1996-05-30 | 1997-12-12 | Hitachi Ltd | 帳票認識方法 |
KR20010078127A (ko) * | 2000-01-28 | 2001-08-20 | 니시무로 타이죠 | 단어인식방법과 단어인식 프로그램을 기억한 기억매체 |
JP2005202533A (ja) * | 2004-01-14 | 2005-07-28 | Hitachi Ltd | 情報処理装置、情報処理方法及びソフトウェア |
US20100073735A1 (en) * | 2008-05-06 | 2010-03-25 | Compulink Management Center, Inc. | Camera-based document imaging |
CN107516096A (zh) * | 2016-06-15 | 2017-12-26 | 阿里巴巴集团控股有限公司 | 一种字符识别方法及装置 |
CN108985137A (zh) * | 2017-06-02 | 2018-12-11 | 杭州海康威视数字技术股份有限公司 | 一种车牌识别方法、装置及系统 |
WO2020010547A1 (zh) * | 2018-07-11 | 2020-01-16 | 深圳前海达闼云端智能科技有限公司 | 字符识别方法、装置、存储介质及电子设备 |
CN110569830A (zh) * | 2019-08-01 | 2019-12-13 | 平安科技(深圳)有限公司 | 多语言文本识别方法、装置、计算机设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
US11651604B2 (en) | 2023-05-16 |
US20220036112A1 (en) | 2022-02-03 |
WO2021196013A1 (zh) | 2021-10-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110232311B (zh) | 手部图像的分割方法、装置及计算机设备 | |
US5410611A (en) | Method for identifying word bounding boxes in text | |
US7302099B2 (en) | Stroke segmentation for template-based cursive handwriting recognition | |
US7369702B2 (en) | Template-based cursive handwriting recognition | |
CN113748429A (zh) | 单词识别方法、设备及存储介质 | |
CN110287952B (zh) | 一种维语图片字符的识别方法及系统 | |
CN111626238B (zh) | 文本识别方法、电子设备及存储介质 | |
JP5229050B2 (ja) | 画像からの文書領域抽出装置、方法、及びプログラム | |
CN111353501A (zh) | 一种基于深度学习的书本点读方法及系统 | |
CN107545223B (zh) | 图像识别方法及电子设备 | |
US8406467B2 (en) | Method and system for actively detecting and recognizing placards | |
CN113378764B (zh) | 基于聚类算法的视频人脸采集方法、装置、设备及介质 | |
JP4704601B2 (ja) | 文字認識方法,プログラム及び記録媒体 | |
CN111492407A (zh) | 用于绘图美化的系统和方法 | |
CN112419207A (zh) | 一种图像矫正方法及装置、系统 | |
JP6542230B2 (ja) | 投影ひずみを補正するための方法及びシステム | |
CN111753812A (zh) | 文本识别方法及设备 | |
CN114511865A (zh) | 一种结构化信息的生成方法、装置和计算机可读存储介质 | |
CN113628113A (zh) | 一种图像拼接方法及其相关设备 | |
CN111340040B (zh) | 一种纸张字符识别方法、装置、电子设备及存储介质 | |
CN117392698A (zh) | 手绘电路图的识别方法、装置、设备和存储介质 | |
JPH10307889A (ja) | 文字認識方法、装置及び文字認識プログラムを記録した記録媒体 | |
US20150186718A1 (en) | Segmentation of Overwritten Online Handwriting Input | |
CN109978829B (zh) | 一种待检测对象的检测方法及其系统 | |
CN113780040A (zh) | 唇部关键点的定位方法及装置、存储介质、电子设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |