CN105825214B - 一种基于tesseract引擎的文字识别方法及装置 - Google Patents
一种基于tesseract引擎的文字识别方法及装置 Download PDFInfo
- Publication number
- CN105825214B CN105825214B CN201610143955.1A CN201610143955A CN105825214B CN 105825214 B CN105825214 B CN 105825214B CN 201610143955 A CN201610143955 A CN 201610143955A CN 105825214 B CN105825214 B CN 105825214B
- Authority
- CN
- China
- Prior art keywords
- server
- recognition result
- literal pool
- cloud server
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/768—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using context analysis, e.g. recognition aided by known co-occurring patterns
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Character Discrimination (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610143955.1A CN105825214B (zh) | 2016-03-14 | 2016-03-14 | 一种基于tesseract引擎的文字识别方法及装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610143955.1A CN105825214B (zh) | 2016-03-14 | 2016-03-14 | 一种基于tesseract引擎的文字识别方法及装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105825214A CN105825214A (zh) | 2016-08-03 |
CN105825214B true CN105825214B (zh) | 2019-02-05 |
Family
ID=56987765
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610143955.1A Active CN105825214B (zh) | 2016-03-14 | 2016-03-14 | 一种基于tesseract引擎的文字识别方法及装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105825214B (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107707458A (zh) * | 2017-10-01 | 2018-02-16 | 李子盈 | 一种传输图片格式文字信息的通信方法及系统与设备 |
CN107862312A (zh) * | 2017-11-22 | 2018-03-30 | 朱秋华 | 一种基于tesseract引擎的文字识别方法、装置、设备及存储介质 |
CN108846419A (zh) * | 2018-05-25 | 2018-11-20 | 平安科技(深圳)有限公司 | 单页高负载图像识别方法、装置、计算机设备及存储介质 |
CN110895924B (zh) * | 2018-08-23 | 2023-01-03 | 珠海金山办公软件有限公司 | 一种文档内容朗读方法、装置、电子设备及可读存储介质 |
CN109389084A (zh) * | 2018-10-09 | 2019-02-26 | 郑州云海信息技术有限公司 | 一种处理图像信息的方法及装置 |
CN109829516A (zh) * | 2019-03-07 | 2019-05-31 | 苏州达家迎信息技术有限公司 | 图像处理方法及装置、设备及存储介质 |
CN112800240A (zh) * | 2021-01-22 | 2021-05-14 | 中信银行股份有限公司 | 字库更新方法、身份识别方法、装置及电子设备 |
CN113936285A (zh) * | 2021-11-03 | 2022-01-14 | 重庆海创云链数字科技有限公司 | 一种ocr自动识别方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101976148A (zh) * | 2010-10-28 | 2011-02-16 | 广东开心信息技术有限公司 | 一种手写输入系统和方法 |
CN103247291A (zh) * | 2013-05-07 | 2013-08-14 | 华为终端有限公司 | 一种语音识别设备的更新方法、装置及系统 |
CN103366151A (zh) * | 2012-03-30 | 2013-10-23 | 佳能株式会社 | 手写字符识别方法以及设备 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004027673A1 (en) * | 2002-09-20 | 2004-04-01 | Siemens Dematic Postal Automation, L.P. | Hand held ocr apparatus and method |
-
2016
- 2016-03-14 CN CN201610143955.1A patent/CN105825214B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101976148A (zh) * | 2010-10-28 | 2011-02-16 | 广东开心信息技术有限公司 | 一种手写输入系统和方法 |
CN103366151A (zh) * | 2012-03-30 | 2013-10-23 | 佳能株式会社 | 手写字符识别方法以及设备 |
CN103247291A (zh) * | 2013-05-07 | 2013-08-14 | 华为终端有限公司 | 一种语音识别设备的更新方法、装置及系统 |
Non-Patent Citations (1)
Title |
---|
基于跳变检测和Tesseract的机打发票识别算法;邬满;《信息与电脑》;20150923(第18期);第43-45页,第1.5节以及图4 |
Also Published As
Publication number | Publication date |
---|---|
CN105825214A (zh) | 2016-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105825214B (zh) | 一种基于tesseract引擎的文字识别方法及装置 | |
JP7474587B2 (ja) | 対話型インタフェース及びデータベースクエリを用いた文書画像からの情報抽出の方法及びシステム | |
CN109858453B (zh) | 一种通用的多引擎票据识别系统及方法 | |
US8014604B2 (en) | OCR of books by word recognition | |
US10943106B2 (en) | Recognizing text in image data | |
US8108764B2 (en) | Document recognition using static and variable strings to create a document signature | |
CN110569341B (zh) | 配置聊天机器人的方法、装置、计算机设备和存储介质 | |
CN105701488A (zh) | 一种身份证识别方法 | |
WO2011029011A1 (en) | System and method for the localization of statistical classifiers based on machine translation | |
CN105184289B (zh) | 字符识别方法和装置 | |
US20220044013A1 (en) | Enhancing electronic documents for character recognition | |
US20190095447A1 (en) | Method, apparatus, device and storage medium for establishing error correction model based on error correction platform | |
CN104239853A (zh) | 一种图像的处理方法和装置 | |
Ul-Hasan et al. | Ocroract: A sequence learning ocr system trained on isolated characters | |
CN106708963B (zh) | 一种人工智能模式下的网站编辑器文章录入方法及系统 | |
Kaló et al. | Key-Value Pair Searhing System via Tesseract OCR and Post Processing | |
CN112801923A (zh) | 文字处理方法、系统、可读存储介质及计算机设备 | |
KR20220019501A (ko) | 딥러닝 기반 전자책 자동변환 서비스 제공 방법 | |
US20190333516A1 (en) | Speech recognition device and method of identifying speech | |
CN116627460A (zh) | 固件升级方法及装置 | |
CN116680261A (zh) | 数据报送方法、系统以及装置 | |
CN112307251B (zh) | 英语词汇知识点图谱自适应识别关联系统和方法 | |
US20230125177A1 (en) | Methods and systems for matching and optimizing technology solutions to requested enterprise products | |
CN111046864A (zh) | 一种合同扫描件五要素自动提取方法及系统 | |
US11934447B2 (en) | Agnostic image digitizer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200722 Address after: Hangzhou City, Zhejiang Province, Binjiang District Puyan Street 310000 Albert Road No. 1 building 4, room 105, 103 Patentee after: HANGZHOU CCRFID MICROELECTRONICS Co.,Ltd. Address before: 210096 Jiangsu city Nanjing Province four pailou No. 2 Patentee before: SOUTHEAST University |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A character recognition method and device based on Tesseract engine Effective date of registration: 20210604 Granted publication date: 20190205 Pledgee: China Minsheng Banking Corp Hangzhou branch Pledgor: HANGZHOU CCRFID MICROELECTRONICS Co.,Ltd. Registration number: Y2021330000513 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20220315 Granted publication date: 20190205 Pledgee: China Minsheng Banking Corp Hangzhou branch Pledgor: HANGZHOU CCRFID MICROELECTRONICS Co.,Ltd. Registration number: Y2021330000513 |