KR101690981B1 - 형태 인식 방법 및 디바이스 - Google Patents

형태 인식 방법 및 디바이스 Download PDF

Info

Publication number
KR101690981B1
KR101690981B1 KR1020157000030A KR20157000030A KR101690981B1 KR 101690981 B1 KR101690981 B1 KR 101690981B1 KR 1020157000030 A KR1020157000030 A KR 1020157000030A KR 20157000030 A KR20157000030 A KR 20157000030A KR 101690981 B1 KR101690981 B1 KR 101690981B1
Authority
KR
South Korea
Prior art keywords
shape
bounds
straight line
boundaries
pixel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020157000030A
Other languages
English (en)
Korean (ko)
Other versions
KR20150017755A (ko
Inventor
후이 수
Original Assignee
알리바바 그룹 홀딩 리미티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 알리바바 그룹 홀딩 리미티드 filed Critical 알리바바 그룹 홀딩 리미티드
Publication of KR20150017755A publication Critical patent/KR20150017755A/ko
Application granted granted Critical
Publication of KR101690981B1 publication Critical patent/KR101690981B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/224Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • G06K9/18
    • G06K9/00449
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Character Input (AREA)
  • Image Analysis (AREA)
  • Document Processing Apparatus (AREA)
KR1020157000030A 2012-07-24 2013-07-23 형태 인식 방법 및 디바이스 Active KR101690981B1 (ko)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
CN201210258883.7A CN103577817B (zh) 2012-07-24 2012-07-24 表单识别方法与装置
CN201210258883.7 2012-07-24
US13/947,412 2013-07-22
US13/947,412 US9047529B2 (en) 2012-07-24 2013-07-22 Form recognition method and device
PCT/US2013/051576 WO2014018482A2 (en) 2012-07-24 2013-07-23 Form recognition method and device

Publications (2)

Publication Number Publication Date
KR20150017755A KR20150017755A (ko) 2015-02-17
KR101690981B1 true KR101690981B1 (ko) 2016-12-29

Family

ID=49994954

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020157000030A Active KR101690981B1 (ko) 2012-07-24 2013-07-23 형태 인식 방법 및 디바이스

Country Status (6)

Country Link
US (1) US9047529B2 (enExample)
JP (1) JP6000455B2 (enExample)
KR (1) KR101690981B1 (enExample)
CN (1) CN103577817B (enExample)
TW (1) TWI536277B (enExample)
WO (1) WO2014018482A2 (enExample)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9235755B2 (en) * 2013-08-15 2016-01-12 Konica Minolta Laboratory U.S.A., Inc. Removal of underlines and table lines in document images while preserving intersecting character strokes
AU2013273778A1 (en) * 2013-12-20 2015-07-09 Canon Kabushiki Kaisha Text line fragments for text line analysis
US9256780B1 (en) * 2014-09-22 2016-02-09 Intel Corporation Facilitating dynamic computations for performing intelligent body segmentations for enhanced gesture recognition on computing devices
US10395133B1 (en) * 2015-05-08 2019-08-27 Open Text Corporation Image box filtering for optical character recognition
US10997407B2 (en) * 2015-10-02 2021-05-04 Hewlett-Packard Development Company, L.P. Detecting document objects
CN105550633B (zh) * 2015-10-30 2018-12-11 小米科技有限责任公司 区域识别方法及装置
US9865038B2 (en) * 2015-11-25 2018-01-09 Konica Minolta Laboratory U.S.A., Inc. Offsetting rotated tables in images
US9697423B1 (en) * 2015-12-31 2017-07-04 Konica Minolta Laboratory U.S.A., Inc. Identifying the lines of a table
US10002306B2 (en) * 2016-06-30 2018-06-19 Konica Minolta Laboratory U.S.A., Inc. Merging overlapping broken lines of a table
CN108090068B (zh) * 2016-11-21 2021-05-25 医渡云(北京)技术有限公司 医院数据库中的表的分类方法及装置
CN106875408B (zh) * 2017-02-27 2020-03-17 网易(杭州)网络有限公司 用于截图的方法、装置及终端设备
JP7059514B2 (ja) * 2017-03-15 2022-04-26 オムロン株式会社 文字認識装置、文字認識方法、および、文字認識プログラム
CN108734687B (zh) * 2017-04-21 2020-04-28 游诚曦 一种斜拉线不受力缺陷识别方法及装置
CN107085734A (zh) * 2017-05-24 2017-08-22 南京华设科技股份有限公司 智能业务受理机器人
US10331949B2 (en) * 2017-07-25 2019-06-25 Konica Minolta Laboratory U.S.A., Inc. Splitting merged table cells
US10268920B2 (en) * 2017-08-31 2019-04-23 Konica Minolta Laboratory U.S.A., Inc. Detection of near rectangular cells
CN107679024B (zh) * 2017-09-11 2023-04-18 畅捷通信息技术股份有限公司 识别表格的方法、系统、计算机设备、可读存储介质
TWI682327B (zh) 2018-01-02 2020-01-11 虹光精密工業股份有限公司 影像整合列印系統以及影像整合列印方法
CN108416377B (zh) * 2018-02-26 2021-12-10 阿博茨德(北京)科技有限公司 柱状图中的信息提取方法及装置
CN108763606B (zh) * 2018-03-12 2019-12-10 江苏艾佳家居用品有限公司 一种基于机器视觉的户型图元素自动提取方法与系统
JP6487100B1 (ja) * 2018-05-24 2019-03-20 株式会社東芝 帳票処理装置及び帳票処理方法
CN109214385B (zh) * 2018-08-15 2021-06-08 腾讯科技(深圳)有限公司 数据采集方法、数据采集装置及存储介质
CN109460544A (zh) * 2018-10-26 2019-03-12 长沙通诺信息科技有限责任公司 电子表单生成方法及装置、计算机设备及存储介质
CN109635633B (zh) * 2018-10-26 2024-10-29 平安科技(深圳)有限公司 电子装置、票据识别方法及存储介质
CN109684957A (zh) * 2018-12-14 2019-04-26 新博卓畅技术(北京)有限公司 一种自动按照纸质表单展现系统数据的方法及系统
CN109934160B (zh) * 2019-03-12 2023-06-02 天津瑟威兰斯科技有限公司 基于表格识别的表格文字信息提取的方法及系统
CN110084117B (zh) * 2019-03-22 2021-07-20 中国科学院自动化研究所 基于二值图分段投影的文档表格线检测方法、系统
CN109977910B (zh) * 2019-04-04 2021-08-20 厦门商集网络科技有限责任公司 基于彩色线段的票据快速定位方法及其系统
CN110188336B (zh) * 2019-05-27 2022-06-10 厦门商集网络科技有限责任公司 一种基于oa申请单生成报销单的方法和装置
CN110598575B (zh) * 2019-08-21 2023-06-02 科大讯飞股份有限公司 表格版面分析与提取方法及相关装置
WO2021062896A1 (zh) * 2019-09-30 2021-04-08 北京市商汤科技开发有限公司 表单识别方法、表格提取方法及相关装置
JP2022504454A (ja) * 2019-09-30 2022-01-13 北京市商▲湯▼科技▲開▼▲發▼有限公司 フォーム認識方法、フォーム抽出方法および関連する装置
CN110796031B (zh) * 2019-10-11 2024-08-02 腾讯科技(深圳)有限公司 基于人工智能的表格识别方法、装置及电子设备
KR102645291B1 (ko) * 2019-10-30 2024-03-07 선문대학교 산학협력단 상품 정보 제공 및 상품 주문이 가능한 어플리케이션과 연동되는 스마트 자판기 관리 장치
CN111144081B (zh) * 2019-12-10 2024-05-24 东软集团股份有限公司 表单生成方法、装置、存储介质及电子设备
CN111091090A (zh) * 2019-12-11 2020-05-01 上海眼控科技股份有限公司 一种银行报表ocr识别方法、装置、平台和终端
CN113139370B (zh) 2020-01-16 2025-01-10 京东方科技集团股份有限公司 一种表格提取方法、装置及触控显示装置
CN111553187B (zh) * 2020-03-20 2023-06-02 广联达科技股份有限公司 识别cad图纸中表格的方法及系统
CN111626027B (zh) * 2020-05-20 2023-03-24 北京百度网讯科技有限公司 表格结构还原方法、装置、设备、系统和可读存储介质
CN111695553B (zh) * 2020-06-05 2023-09-08 北京百度网讯科技有限公司 表格识别方法、装置、设备和介质
CN111882534B (zh) * 2020-07-17 2025-04-11 广联达科技股份有限公司 一种识别线型的方法、设备及可读存储介质
US11990214B2 (en) 2020-07-21 2024-05-21 International Business Machines Corporation Handling form data errors arising from natural language processing
CN112464955B (zh) * 2020-12-03 2025-03-25 上海连尚网络科技集团有限公司 图像重合度确定方法、电子设备及计算机可读存储介质
US11816913B2 (en) 2021-03-02 2023-11-14 Tata Consultancy Services Limited Methods and systems for extracting information from document images
CN113065536B (zh) * 2021-06-03 2021-09-14 北京欧应信息技术有限公司 处理表格的方法、计算设备和计算机可读存储介质
CN115273116A (zh) * 2022-07-29 2022-11-01 天翼云科技有限公司 表格检测识别方法、装置、设备和存储介质
CN116109651A (zh) * 2022-09-16 2023-05-12 国网湖北省电力有限公司超高压公司 一种电气设备图纸自动分割方法
CN115512385B (zh) * 2022-09-30 2024-09-27 中交第二航务工程局有限公司 一种位图格式钢筋图纸数据提取方法和系统
CN117454859B (zh) * 2023-12-19 2024-04-02 四川弘和数智集团有限公司 油气站数据自动录入方法、装置、电子设备及存储介质

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052808A (ja) * 1996-12-27 2007-03-01 Fujitsu Ltd フォーム識別方法

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5851372A (ja) * 1981-09-22 1983-03-26 Ricoh Co Ltd 高画質化方法
JPS61877A (ja) * 1984-06-14 1986-01-06 Amada Co Ltd 形状認識装置
JPS6232581A (ja) * 1985-08-05 1987-02-12 Nippon Telegr & Teleph Corp <Ntt> 掌形認識方法
JPH027183A (ja) * 1988-06-25 1990-01-11 Toshiba Corp 文字切出装置
JP3096481B2 (ja) * 1991-02-22 2000-10-10 グローリー工業株式会社 帳票類の種類判別方法
WO1993005481A1 (en) * 1991-08-30 1993-03-18 Trw Financial Systems, Inc. Method and apparatus for converting documents between paper medium and electronic media
US5680479A (en) * 1992-04-24 1997-10-21 Canon Kabushiki Kaisha Method and apparatus for character recognition
JPH07141471A (ja) * 1993-11-19 1995-06-02 Sharp Corp 文字認識方法
JPH0877294A (ja) * 1994-09-06 1996-03-22 Toshiba Corp 文書画像処理装置
US5841905A (en) * 1996-10-25 1998-11-24 Eastman Kodak Company Business form image identification using projected profiles of graphical lines and text string lines
JPH11232382A (ja) * 1998-02-10 1999-08-27 Hitachi Ltd 罫線抽出方法及び罫線除去方法
JP2002324236A (ja) 2001-04-25 2002-11-08 Hitachi Ltd 帳票識別方法及び帳票登録方法
US6898317B2 (en) 2001-05-07 2005-05-24 Hewlett-Packard Development Company, L.P. Method and system for fit-to-form scanning with a scanning device
US7725834B2 (en) 2005-03-04 2010-05-25 Microsoft Corporation Designer-created aspect for an electronic form template
US7583841B2 (en) * 2005-12-21 2009-09-01 Microsoft Corporation Table detection in ink notes
US8320674B2 (en) * 2008-09-03 2012-11-27 Sony Corporation Text localization for image and video OCR
CN101676930A (zh) * 2008-09-17 2010-03-24 北大方正集团有限公司 一种识别扫描图像中表格单元的方法及装置
CN101908136B (zh) * 2009-06-08 2013-02-13 比亚迪股份有限公司 一种表格识别处理方法及系统
US8274523B2 (en) 2009-07-30 2012-09-25 Eastman Kodak Company Processing digital templates for image display

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007052808A (ja) * 1996-12-27 2007-03-01 Fujitsu Ltd フォーム識別方法

Also Published As

Publication number Publication date
CN103577817A (zh) 2014-02-12
US20140029853A1 (en) 2014-01-30
WO2014018482A2 (en) 2014-01-30
KR20150017755A (ko) 2015-02-17
JP2015528960A (ja) 2015-10-01
TWI536277B (zh) 2016-06-01
JP6000455B2 (ja) 2016-09-28
CN103577817B (zh) 2017-03-01
US9047529B2 (en) 2015-06-02
WO2014018482A3 (en) 2014-03-20
TW201405440A (zh) 2014-02-01

Similar Documents

Publication Publication Date Title
KR101690981B1 (ko) 형태 인식 방법 및 디바이스
CN110008809B (zh) 表格数据的获取方法、装置和服务器
US10354133B2 (en) Method for structural analysis and recognition of handwritten mathematical formula in natural scene image
Xiao et al. Text region extraction in a document image based on the Delaunay tessellation
CN100474340C (zh) 图像处理方法和图像处理装置
US20150063699A1 (en) Line segmentation method applicable to document images containing handwriting and printed text characters or skewed text lines
JP7132050B2 (ja) テキスト行の区分化方法
JP2002133426A (ja) 多値画像から罫線を抽出する罫線抽出装置
US11074443B2 (en) Method and device for acquiring slant value of slant image, terminal and storage medium
Roy et al. Text line extraction in graphical documents using background and foreground information
JP2019102061A5 (enExample)
Aldavert et al. Manuscript text line detection and segmentation using second-order derivatives
JP6542230B2 (ja) 投影ひずみを補正するための方法及びシステム
Tran et al. Hybrid page segmentation using multilevel homogeneity structure
JP2011248702A (ja) 画像処理装置、画像処理方法、画像処理プログラム及びプログラム記憶媒体
CN107437084B (zh) 一种脱机手写体文本识别的字符重心定位方法
CN110321887B (zh) 文档图像处理方法、文档图像处理装置及存储介质
Haji et al. A novel segmentation and skew correction approach for handwritten Malayalam documents
US7769234B2 (en) Ruled line extracting program, ruled line extracting apparatus and ruled line extracting method
CN107145887B (zh) 一种针对物体删除的缝裁剪图像定位取证方法
CN104992161B (zh) 一种基于部件识别的汉字部件分割与结构判定方法
Ziaratban et al. Adaptive script-independent text line extraction
JP4244692B2 (ja) 文字認識装置及び文字認識プログラム
Kumar et al. Quad: Quality assessment of documents
JP6580201B2 (ja) 被写体検出装置、被写体検出方法及びプログラム

Legal Events

Date Code Title Description
A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

D13-X000 Search requested

St.27 status event code: A-1-2-D10-D13-srh-X000

D14-X000 Search report completed

St.27 status event code: A-1-2-D10-D14-srh-X000

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U12-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

P22-X000 Classification modified

St.27 status event code: A-4-4-P10-P22-nap-X000

FPAY Annual fee payment

Payment date: 20191213

Year of fee payment: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 5

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 6

P22-X000 Classification modified

St.27 status event code: A-4-4-P10-P22-nap-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 7

P22-X000 Classification modified

St.27 status event code: A-4-4-P10-P22-nap-X000

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 8

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 9

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 10