CN102385707A - Method, apparatus, and crawler server for digital image recognition - Google Patents

Method, apparatus, and crawler server for digital image recognition

Info

Publication number
CN102385707A
Authority
CN
China
Prior art keywords
image
sub
module
recognized
thinned
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102704561A
Other languages
English (en)
Chinese (zh)
Inventor
孙翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2010-08-30
Filing date
2010-08-30
Publication date
2012-03-21
Application filed by Alibaba Group Holding Ltd
Priority to CN2010102704561A (CN102385707A)
Priority to US13/199,332 (US8781227B2)
Priority to JP2013525899A (JP5701388B2)
Priority to EP11822246.2A (EP2572317B1)
Priority to PCT/US2011/001512 (WO2012030384A1)
Publication of CN102385707A
Priority to US14/302,277 (US8958643B2)
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/32 Digital ink
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/18 Extraction of features or characteristics of the image
    • G06V30/1801 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)
  • Character Input (AREA)
CN2010102704561A 2010-08-30 2010-08-30 Method, apparatus, and crawler server for digital image recognition Pending CN102385707A (zh)

Priority Applications (6)

Application Number Priority Date Filing Date Title
CN2010102704561A CN102385707A (zh) 2010-08-30 2010-08-30 Method, apparatus, and crawler server for digital image recognition
US13/199,332 US8781227B2 (en) 2010-08-30 2011-08-25 Recognition of numerical characters in digital images
JP2013525899A JP5701388B2 (ja) 2010-08-30 2011-08-26 Recognition of digital images
EP11822246.2A EP2572317B1 (en) 2010-08-30 2011-08-26 Recognition of digital images
PCT/US2011/001512 WO2012030384A1 (en) 2010-08-30 2011-08-26 Recognition of digital images
US14/302,277 US8958643B2 (en) 2010-08-30 2014-06-11 Recognition of numerical characters in digital images

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102704561A CN102385707A (zh) 2010-08-30 2010-08-30 Method, apparatus, and crawler server for digital image recognition

Publications (1)

Publication Number Publication Date
CN102385707A (zh) 2012-03-21

Family

ID=45697350

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102704561A Pending CN102385707A (zh) 2010-08-30 2010-08-30 Method, apparatus, and crawler server for digital image recognition

Country Status (5)

Country Link
US (2) US8781227B2
EP (1) EP2572317B1
JP (1) JP5701388B2
CN (1) CN102385707A
WO (1) WO2012030384A1

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
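CN104915664A (zh) * 2015-05-22 2015-09-16 腾讯科技(深圳)有限公司 Method and apparatus for obtaining contact object identifiers
CN105184328A (zh) * 2015-08-17 2015-12-23 浪潮软件集团有限公司 Method and apparatus for recognizing images
CN105117723B (zh) * 2015-08-17 2018-07-06 浪潮金融信息技术有限公司 Image recognition method and apparatus
TWI697795B (zh) * 2019-02-12 2020-07-01 緯創資通股份有限公司 Data capturing method and system thereof
CN111680688A (zh) * 2020-06-10 2020-09-18 创新奇智(成都)科技有限公司 Character recognition method and apparatus, electronic device, and storage medium
CN115116064A (zh) * 2022-05-18 2022-09-27 腾讯科技(深圳)有限公司 Method, apparatus, device, and storage medium for recognizing characters in a character image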

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9053359B2 (en) 2012-06-07 2015-06-09 Konica Minolta Laboratory U.S.A., Inc. Method and system for document authentication using Krawtchouk decomposition of image patches for image comparison
US9230383B2 (en) 2012-12-28 2016-01-05 Konica Minolta Laboratory U.S.A., Inc. Document image compression method and its application in document authentication
US10725650B2 (en) * 2014-03-17 2020-07-28 Kabushiki Kaisha Kawai Gakki Seisakusho Handwritten music sign recognition device and program
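CN104933138A (zh) * 2015-06-16 2015-09-23 携程计算机技术(上海)有限公司 Web crawler system and web crawling method
CN106407932B (zh) * 2016-09-20 2019-05-28 中国石油大学(华东) Handwritten digit recognition method based on fractional calculus and generalized inverse neural network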
US10438098B2 (en) * 2017-05-19 2019-10-08 Hand Held Products, Inc. High-speed OCR decode using depleted centerlines
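JP6919990B2 (ja) * 2017-10-17 2021-08-18 株式会社日立製作所 Online recognition device, online recognition method, and setting screen used therefor
JP7003617B2 (ja) * 2017-12-12 2022-01-20 富士通株式会社 Estimation device, estimation method, and estimation program
CN108363943B (zh) * 2017-12-27 2020-12-01 苏州工业园区报关有限公司 Customs clearance robot based on intelligent recognition technology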
WO2023105291A2 (en) * 2021-12-06 2023-06-15 Hanzifinder Llc Systems and methods for representing and searching characters

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101201902A (zh) * 2006-12-14 2008-06-18 汤浩钧 Recognition system and recognition method based on a hexagonal grid
US20080247674A1 (en) * 2003-02-28 2008-10-09 Walch Mark A Systems and methods for source language word pattern matching
US20100215276A1 (en) * 2009-02-25 2010-08-26 Fujitsu Limited Storage medium storing character recognition program, character recognition method, and character recognition apparatus

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3280099D1 (de) * 1981-09-11 1990-03-08 Burroughs Corp Geometric character recognition with representation of the skeleton and the stroke width
JPS61182182A (ja) * 1985-02-06 1986-08-14 Omron Tateisi Electronics Co Character recognition device
US4742556A (en) 1985-09-16 1988-05-03 Davis Jr Ray E Character recognition method
US5097517A (en) 1987-03-17 1992-03-17 Holt Arthur W Method and apparatus for processing bank checks, drafts and like financial documents
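JP2619427B2 (ja) * 1987-10-30 1997-06-11 グローリー工業株式会社 Character pattern recognition method
JPH01229388A (ja) * 1988-03-09 1989-09-13 Nippon Steel Corp Numeral recognition method and apparatus
JPH0324682A (ja) * 1989-06-21 1991-02-01 Aisin Seiki Co Ltd Character feature extraction method
JP3260843B2 (ja) * 1992-08-25 2002-02-25 株式会社リコー Character recognition method
JP4742404B2 (ja) 2000-05-17 2011-08-10 コニカミノルタビジネステクノロジーズ株式会社 Image recognition device, image forming device, image recognition method, and computer-readable recording medium storing an image recognition program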
WO2002037933A2 (en) 2000-11-08 2002-05-16 New York University System, process and software arrangement for recognizing handwritten characters
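JP3965983B2 (ja) * 2001-11-30 2007-08-29 松下電工株式会社 Image processing method and apparatus therefor
WO2006088222A1 (ja) * 2005-02-15 2006-08-24 Kite Image Technologies Inc. Handwritten character recognition method, handwritten character recognition system, handwritten character recognition program, and storage medium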
WO2006091156A1 (en) * 2005-02-28 2006-08-31 Zi Decuma Ab Recognition graph
US20070058856A1 (en) * 2005-09-15 2007-03-15 Honeywell International Inc. Character recoginition in video data
WO2007082187A2 (en) 2006-01-11 2007-07-19 Gannon Technologies Group, Llc Pictographic recognition technology applied to distinctive characteristics of handwritten arabic text
US7860313B2 (en) * 2006-01-11 2010-12-28 Gannon Technologies Group, Llc Methods and apparatuses for extending dynamic handwriting recognition to recognize static handwritten and machine generated text
JP5253788B2 (ja) 2007-10-31 2013-07-31 富士通株式会社 Image recognition device, image recognition program, and image recognition method
US8452108B2 (en) * 2008-06-25 2013-05-28 Gannon Technologies Group Llc Systems and methods for image recognition using graph-based pattern matching
WO2010087886A1 (en) 2009-01-27 2010-08-05 Gannon Technologies Group Llc Systems and methods for graph-based pattern recognition technology applied to the automated identification of fingerprints

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
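CN104915664A (zh) * 2015-05-22 2015-09-16 腾讯科技(深圳)有限公司 Method and apparatus for obtaining contact object identifiers
CN104915664B (zh) * 2015-05-22 2021-02-09 腾讯科技(深圳)有限公司 Method and apparatus for obtaining contact object identifiers
CN105184328A (zh) * 2015-08-17 2015-12-23 浪潮软件集团有限公司 Method and apparatus for recognizing images
CN105117723B (zh) * 2015-08-17 2018-07-06 浪潮金融信息技术有限公司 Image recognition method and apparatus
CN105184328B (zh) * 2015-08-17 2018-11-27 浪潮金融信息技术有限公司 Method and apparatus for recognizing images
TWI697795B (zh) * 2019-02-12 2020-07-01 緯創資通股份有限公司 Data capturing method and system thereof
CN111553340A (zh) * 2019-02-12 2020-08-18 昆山纬绩资通有限公司 Data capturing method and system thereof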
US11062171B2 (en) 2019-02-12 2021-07-13 Wistron Corp. Data capturing method and system thereof
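CN111680688A (zh) * 2020-06-10 2020-09-18 创新奇智(成都)科技有限公司 Character recognition method and apparatus, electronic device, and storage medium
CN111680688B (zh) * 2020-06-10 2023-08-08 创新奇智(成都)科技有限公司 Character recognition method and apparatus, electronic device, and storage medium
CN115116064A (zh) * 2022-05-18 2022-09-27 腾讯科技(深圳)有限公司 Method, apparatus, device, and storage medium for recognizing characters in a character image
CN115116064B (zh) * 2022-05-18 2025-08-29 腾讯科技(深圳)有限公司 Method, apparatus, device, and storage medium for recognizing characters in a character image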

Also Published As

Publication number Publication date
US20140328541A1 (en) 2014-11-06
JP5701388B2 (ja) 2015-04-15
JP2013536958A (ja) 2013-09-26
EP2572317A4 (en) 2017-05-17
US8781227B2 (en) 2014-07-15
EP2572317B1 (en) 2020-10-07
EP2572317A1 (en) 2013-03-27
US8958643B2 (en) 2015-02-17
WO2012030384A1 (en) 2012-03-08
US20120051645A1 (en) 2012-03-01

Similar Documents

Publication Publication Date Title
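CN102385707A (zh) Method, apparatus, and crawler server for digital image recognition
CN107690657B (zh) Discovering merchants from images
CN112699775B (zh) Deep-learning-based certificate recognition method, apparatus, device, and storage medium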
US20210216765A1 (en) Receipt identification method, apparatus, device and storage medium
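CN111640130A (zh) Table restoration method and apparatus
CN109685052A (zh) Text image processing method, apparatus, electronic device, and computer-readable medium
WO2018233055A1 (zh) Method, apparatus, computer device, and storage medium for entering insurance policy information
CN114120305A (zh) Training method for a text classification model, and method and apparatus for recognizing text content
CN113673528B (zh) Text processing method, apparatus, electronic device, and readable storage medium
CN113850060A (zh) Method and system for recognizing and entering civil aviation document data
CN110880000A (zh) Method, apparatus, computer device, and storage medium for locating text in pictures
CN114332883A (zh) Invoice information recognition method, apparatus, computer device, and storage medium
CN112396059A (zh) Certificate recognition method, apparatus, computer device, and storage medium
CN113936286B (zh) Image text recognition method, apparatus, computer device, and storage medium
CN111291758A (zh) Method and apparatus for recognizing seal text
CN110737687A (zh) Data query method, apparatus, device, and storage medium
CN120932217A (zh) Method, apparatus, device, and medium for recognizing merchant storefront photos
CN115311451A (zh) Method, apparatus, computer device, and storage medium for evaluating image blurriness
CN110414497A (zh) Method, apparatus, server, and storage medium for digitizing objects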
TWI497425B (zh) Method, apparatus and crawler server for digital image recognition
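CN113591657B (zh) OCR layout recognition method, apparatus, electronic device, and medium
CN116090422A (zh) Method and apparatus for entering electric power business expansion forms
CN116543400A (zh) Method, apparatus, device, and medium for recognizing wrongly written characters
CN110427820B (zh) Neural-network-based PPT border recognition method and related device
CN114254616A (zh) Text comparison method, electronic device, storage medium, and program product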

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code (Ref country code: HK; Ref legal event code: DE; Ref document number: 1163885; Country of ref document: HK)

C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120321