CN104517112A - 一种表格识别方法与系统 - Google Patents
一种表格识别方法与系统 Download PDFInfo
- Publication number
- CN104517112A CN104517112A CN201310455065.0A CN201310455065A CN104517112A CN 104517112 A CN104517112 A CN 104517112A CN 201310455065 A CN201310455065 A CN 201310455065A CN 104517112 A CN104517112 A CN 104517112A
- Authority
- CN
- China
- Prior art keywords
- cutting plate
- table recognition
- feature
- directed graph
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 119
- 239000011159 matrix material Substances 0.000 claims abstract description 18
- 238000005520 cutting process Methods 0.000 claims description 120
- 230000006870 function Effects 0.000 claims description 50
- 239000000284 extract Substances 0.000 claims description 34
- 230000008569 process Effects 0.000 claims description 34
- 230000011218 segmentation Effects 0.000 claims description 34
- 238000001514 detection method Methods 0.000 claims description 22
- 238000000605 extraction Methods 0.000 claims description 19
- 238000012549 training Methods 0.000 claims description 16
- 238000007373 indentation Methods 0.000 claims description 6
- 239000012634 fragment Substances 0.000 abstract 4
- 238000013467 fragmentation Methods 0.000 abstract 2
- 238000006062 fragmentation reaction Methods 0.000 abstract 2
- 238000010586 diagram Methods 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 10
- 230000008859 change Effects 0.000 description 8
- 238000004590 computer program Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000008520 organization Effects 0.000 description 3
- 241000139306 Platt Species 0.000 description 2
- 238000007621 cluster analysis Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 238000013316 zoning Methods 0.000 description 2
- 241000931705 Cicada Species 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012946 outsourcing Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/412—Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
- G06F16/94—Hypermedia
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Graphics (AREA)
- Databases & Information Systems (AREA)
- Geometry (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Business, Economics & Management (AREA)
- Business, Economics & Management (AREA)
- Image Analysis (AREA)
- Character Input (AREA)
Abstract
Description
Claims (20)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310455065.0A CN104517112B (zh) | 2013-09-29 | 2013-09-29 | 一种表格识别方法与系统 |
US14/096,532 US9268999B2 (en) | 2013-09-29 | 2013-12-04 | Table recognizing method and table recognizing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310455065.0A CN104517112B (zh) | 2013-09-29 | 2013-09-29 | 一种表格识别方法与系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104517112A true CN104517112A (zh) | 2015-04-15 |
CN104517112B CN104517112B (zh) | 2017-11-28 |
Family
ID=52740244
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310455065.0A Active CN104517112B (zh) | 2013-09-29 | 2013-09-29 | 一种表格识别方法与系统 |
Country Status (2)
Country | Link |
---|---|
US (1) | US9268999B2 (zh) |
CN (1) | CN104517112B (zh) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104881641A (zh) * | 2015-05-18 | 2015-09-02 | 上海交通大学 | 基于移动设备的问卷和表格数字化识别方法及系统 |
CN106446881A (zh) * | 2016-07-29 | 2017-02-22 | 北京交通大学 | 从医疗化验单图像中提取化验结果信息的方法 |
CN107066997A (zh) * | 2016-12-16 | 2017-08-18 | 浙江工业大学 | 一种基于图像识别的电气元件报价方法 |
CN107679024A (zh) * | 2017-09-11 | 2018-02-09 | 畅捷通信息技术股份有限公司 | 识别表格的方法、系统、计算机设备、可读存储介质 |
CN108470021A (zh) * | 2018-03-26 | 2018-08-31 | 阿博茨德(北京)科技有限公司 | Pdf文档中表格的定位方法及装置 |
CN108614898A (zh) * | 2018-05-10 | 2018-10-02 | 爱因互动科技发展(北京)有限公司 | 文档解析方法与装置 |
CN109284495A (zh) * | 2018-11-03 | 2019-01-29 | 上海犀语科技有限公司 | 一种对文本进行无表格线切表的方法及装置 |
CN109522816A (zh) * | 2018-10-26 | 2019-03-26 | 北京慧流科技有限公司 | 表格识别方法及装置、计算机存储介质 |
CN109858468A (zh) * | 2019-03-04 | 2019-06-07 | 汉王科技股份有限公司 | 一种表格线识别方法及装置 |
CN109961008A (zh) * | 2019-02-13 | 2019-07-02 | 平安科技(深圳)有限公司 | 基于文字定位识别的表格解析方法、介质及计算机设备 |
CN110348294A (zh) * | 2019-05-30 | 2019-10-18 | 平安科技(深圳)有限公司 | Pdf文档中图表的定位方法、装置及计算机设备 |
CN110472209A (zh) * | 2019-07-04 | 2019-11-19 | 重庆金融资产交易所有限责任公司 | 基于深度学习的表格生成方法、装置和计算机设备 |
CN111104871A (zh) * | 2019-11-28 | 2020-05-05 | 北京明略软件系统有限公司 | 表格区域识别模型生成方法、装置及表格定位方法、装置 |
CN111695371A (zh) * | 2019-03-12 | 2020-09-22 | 珠海金山办公软件有限公司 | 一种表格识别的方法、装置、电子设备及存储介质 |
CN111860257A (zh) * | 2020-07-10 | 2020-10-30 | 上海交通大学 | 融合多种文本特征及几何信息的表格识别方法及系统 |
WO2020233379A1 (zh) * | 2019-05-17 | 2020-11-26 | 上海肇观电子科技有限公司 | 版面分析方法、阅读辅助设备、电路及介质 |
CN112380812A (zh) * | 2020-10-09 | 2021-02-19 | 北京中科凡语科技有限公司 | Pdf不完整框线表格提取方法、装置、设备及存储介质 |
WO2021124715A1 (ja) * | 2019-12-19 | 2021-06-24 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
JP2021096774A (ja) * | 2019-12-19 | 2021-06-24 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
CN113408256A (zh) * | 2021-06-30 | 2021-09-17 | 平安科技(深圳)有限公司 | 一种表格图片的表格重构方法、装置及相关设备 |
CN113728321A (zh) * | 2019-04-08 | 2021-11-30 | 微软技术许可有限责任公司 | 利用训练表的集合来准确预测各种表内的错误 |
CN113903016A (zh) * | 2021-12-09 | 2022-01-07 | 深圳佑驾创新科技有限公司 | 分岔点检测方法、装置、计算机设备和计算机程序产品 |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160055376A1 (en) * | 2014-06-21 | 2016-02-25 | iQG DBA iQGATEWAY LLC | Method and system for identification and extraction of data from structured documents |
CN106097313B (zh) * | 2016-06-02 | 2020-05-29 | 甘肃读者动漫科技有限公司 | 图像分割方法及装置 |
US9965678B2 (en) | 2016-06-29 | 2018-05-08 | Konica Minolta Laboratory U.S.A., Inc. | Method for recognizing table and flowchart in document images |
US9984471B2 (en) * | 2016-07-26 | 2018-05-29 | Intuit Inc. | Label and field identification without optical character recognition (OCR) |
US10303938B2 (en) * | 2016-12-29 | 2019-05-28 | Factset Research Systems Inc | Identifying a structure presented in portable document format (PDF) |
US20180260389A1 (en) * | 2017-03-08 | 2018-09-13 | Fujitsu Limited | Electronic document segmentation and relation discovery between elements for natural language processing |
US10223585B2 (en) | 2017-05-08 | 2019-03-05 | Adobe Systems Incorporated | Page segmentation of vector graphics documents |
US10339212B2 (en) * | 2017-08-14 | 2019-07-02 | Adobe Inc. | Detecting the bounds of borderless tables in fixed-format structured documents using machine learning |
US10831704B1 (en) * | 2017-10-16 | 2020-11-10 | BlueOwl, LLC | Systems and methods for automatically serializing and deserializing models |
US11379655B1 (en) | 2017-10-16 | 2022-07-05 | BlueOwl, LLC | Systems and methods for automatically serializing and deserializing models |
US11650970B2 (en) | 2018-03-09 | 2023-05-16 | International Business Machines Corporation | Extracting structure and semantics from tabular data |
US10241992B1 (en) * | 2018-04-27 | 2019-03-26 | Open Text Sa Ulc | Table item information extraction with continuous machine learning through local and global models |
CN110610495B (zh) * | 2018-06-15 | 2022-06-07 | 北京京东尚科信息技术有限公司 | 图像处理方法、系统和电子设备 |
CN110619325B (zh) * | 2018-06-20 | 2024-03-08 | 北京搜狗科技发展有限公司 | 一种文本识别方法及装置 |
US11200413B2 (en) * | 2018-07-31 | 2021-12-14 | International Business Machines Corporation | Table recognition in portable document format documents |
CN109543690B (zh) * | 2018-11-27 | 2020-04-07 | 北京百度网讯科技有限公司 | 用于提取信息的方法和装置 |
US11450125B2 (en) | 2018-12-04 | 2022-09-20 | Leverton Holding Llc | Methods and systems for automated table detection within documents |
CN109902724B (zh) * | 2019-01-31 | 2023-09-01 | 平安科技(深圳)有限公司 | 基于支持向量机的文字识别方法、装置和计算机设备 |
US10614345B1 (en) | 2019-04-12 | 2020-04-07 | Ernst & Young U.S. Llp | Machine learning based extraction of partition objects from electronic documents |
US11062133B2 (en) | 2019-06-24 | 2021-07-13 | International Business Machines Corporation | Data structure generation for tabular information in scanned images |
US11113518B2 (en) | 2019-06-28 | 2021-09-07 | Eygs Llp | Apparatus and methods for extracting data from lineless tables using Delaunay triangulation and excess edge removal |
US11048933B2 (en) * | 2019-07-31 | 2021-06-29 | Intuit Inc. | Generating structured representations of forms using machine learning |
US11915465B2 (en) | 2019-08-21 | 2024-02-27 | Eygs Llp | Apparatus and methods for converting lineless tables into lined tables using generative adversarial networks |
US11625934B2 (en) | 2020-02-04 | 2023-04-11 | Eygs Llp | Machine learning based end-to-end extraction of tables from electronic documents |
CN111368695B (zh) * | 2020-02-28 | 2023-06-20 | 上海汇航捷讯网络科技有限公司 | 一种表格结构提取方法 |
US11222201B2 (en) | 2020-04-14 | 2022-01-11 | International Business Machines Corporation | Vision-based cell structure recognition using hierarchical neural networks |
US11734576B2 (en) | 2020-04-14 | 2023-08-22 | International Business Machines Corporation | Cooperative neural networks with spatial containment constraints |
CN111709339B (zh) * | 2020-06-09 | 2023-09-19 | 北京百度网讯科技有限公司 | 一种票据图像识别方法、装置、设备及存储介质 |
CN111695517B (zh) * | 2020-06-12 | 2023-08-18 | 北京百度网讯科技有限公司 | 图像的表格提取方法、装置、电子设备及存储介质 |
CN111860502B (zh) * | 2020-07-15 | 2024-07-16 | 北京思图场景数据科技服务有限公司 | 图片表格的识别方法、装置、电子设备及存储介质 |
US20220147843A1 (en) * | 2020-11-12 | 2022-05-12 | Samsung Electronics Co., Ltd. | On-device knowledge extraction from visually rich documents |
US11688193B2 (en) | 2020-11-13 | 2023-06-27 | International Business Machines Corporation | Interactive structure annotation with artificial intelligence |
CN112257400B (zh) * | 2020-11-13 | 2024-09-03 | 腾讯科技(深圳)有限公司 | 表格数据提取方法、装置、计算机设备和存储介质 |
US11727215B2 (en) | 2020-11-16 | 2023-08-15 | SparkCognition, Inc. | Searchable data structure for electronic documents |
US11599711B2 (en) * | 2020-12-03 | 2023-03-07 | International Business Machines Corporation | Automatic delineation and extraction of tabular data in portable document format using graph neural networks |
US11681734B2 (en) * | 2020-12-09 | 2023-06-20 | International Business Machines Corporation | Organizing fragments of meaningful text |
US11721119B2 (en) * | 2020-12-18 | 2023-08-08 | Konica Minolta Business Solutions U.S.A., Inc. | Finding natural images in document pages |
US12056171B2 (en) * | 2021-01-11 | 2024-08-06 | Tata Consultancy Services Limited | System and method for automated information extraction from scanned documents |
US11887393B2 (en) * | 2021-03-02 | 2024-01-30 | Claritrics Inc. | End-to-end system for extracting tabular data present in electronic documents and method thereof |
CN112860905A (zh) * | 2021-04-08 | 2021-05-28 | 深圳壹账通智能科技有限公司 | 文本信息抽取方法、装置、设备及可读存储介质 |
EP4099215B1 (en) | 2021-06-03 | 2024-01-10 | Telefonica Cibersecurity & Cloud Tech S.L.U. | Computer vision method for detecting document regions that will be excluded from an embedding process and computer programs thereof |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090144614A1 (en) * | 2007-12-03 | 2009-06-04 | Microsoft Corporation | Document layout extraction |
CN101770446A (zh) * | 2008-12-26 | 2010-07-07 | 北大方正集团有限公司 | 一种版式文件中表格识别方法及系统 |
CN101887413A (zh) * | 2009-05-14 | 2010-11-17 | 北大方正集团有限公司 | 版式表格的结构处理方法和系统 |
CN102184395A (zh) * | 2011-06-08 | 2011-09-14 | 天津大学 | 基于字符串核的手绘草图识别方法 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5528701A (en) * | 1994-09-02 | 1996-06-18 | Panasonic Technologies, Inc. | Trie based method for indexing handwritten databases |
US7451140B2 (en) * | 2005-01-11 | 2008-11-11 | Xerox Corporation | System and method for proofing individual documents of variable information document runs using document quality measurements |
US8869023B2 (en) * | 2007-08-06 | 2014-10-21 | Ricoh Co., Ltd. | Conversion of a collection of data to a structured, printable and navigable format |
US8645819B2 (en) * | 2011-06-17 | 2014-02-04 | Xerox Corporation | Detection and extraction of elements constituting images in unstructured document files |
US11631265B2 (en) * | 2012-05-24 | 2023-04-18 | Esker, Inc. | Automated learning of document data fields |
US9224207B2 (en) * | 2012-09-17 | 2015-12-29 | Raytheon Bbn Technologies Corp. | Segmentation co-clustering |
US9443132B2 (en) * | 2013-02-05 | 2016-09-13 | Children's National Medical Center | Device and method for classifying a condition based on image analysis |
US9558396B2 (en) * | 2013-10-22 | 2017-01-31 | Samsung Electronics Co., Ltd. | Apparatuses and methods for face tracking based on calculated occlusion probabilities |
US9324038B2 (en) * | 2013-11-15 | 2016-04-26 | Xerox Corporation | Method and system for clustering, modeling, and visualizing process models from noisy logs |
-
2013
- 2013-09-29 CN CN201310455065.0A patent/CN104517112B/zh active Active
- 2013-12-04 US US14/096,532 patent/US9268999B2/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090144614A1 (en) * | 2007-12-03 | 2009-06-04 | Microsoft Corporation | Document layout extraction |
CN101770446A (zh) * | 2008-12-26 | 2010-07-07 | 北大方正集团有限公司 | 一种版式文件中表格识别方法及系统 |
CN101887413A (zh) * | 2009-05-14 | 2010-11-17 | 北大方正集团有限公司 | 版式表格的结构处理方法和系统 |
CN102184395A (zh) * | 2011-06-08 | 2011-09-14 | 天津大学 | 基于字符串核的手绘草图识别方法 |
Non-Patent Citations (2)
Title |
---|
房婧等: "版式电子文档表格自动检测与性能评估", 《北京大学学报(自然科学版)》 * |
贺岩等: "基于加权无向图的表格分割方法", 《计算机应用》 * |
Cited By (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104881641A (zh) * | 2015-05-18 | 2015-09-02 | 上海交通大学 | 基于移动设备的问卷和表格数字化识别方法及系统 |
CN104881641B (zh) * | 2015-05-18 | 2019-01-25 | 上海交通大学 | 基于移动设备的问卷和表格数字化识别方法及系统 |
CN106446881B (zh) * | 2016-07-29 | 2019-05-21 | 北京交通大学 | 从医疗化验单图像中提取化验结果信息的方法 |
CN106446881A (zh) * | 2016-07-29 | 2017-02-22 | 北京交通大学 | 从医疗化验单图像中提取化验结果信息的方法 |
CN107066997A (zh) * | 2016-12-16 | 2017-08-18 | 浙江工业大学 | 一种基于图像识别的电气元件报价方法 |
CN107066997B (zh) * | 2016-12-16 | 2019-07-30 | 浙江工业大学 | 一种基于图像识别的电气元件报价方法 |
CN107679024A (zh) * | 2017-09-11 | 2018-02-09 | 畅捷通信息技术股份有限公司 | 识别表格的方法、系统、计算机设备、可读存储介质 |
CN108470021A (zh) * | 2018-03-26 | 2018-08-31 | 阿博茨德(北京)科技有限公司 | Pdf文档中表格的定位方法及装置 |
CN108470021B (zh) * | 2018-03-26 | 2022-06-03 | 阿博茨德(北京)科技有限公司 | Pdf文档中表格的定位方法及装置 |
CN108614898A (zh) * | 2018-05-10 | 2018-10-02 | 爱因互动科技发展(北京)有限公司 | 文档解析方法与装置 |
CN109522816A (zh) * | 2018-10-26 | 2019-03-26 | 北京慧流科技有限公司 | 表格识别方法及装置、计算机存储介质 |
CN109284495A (zh) * | 2018-11-03 | 2019-01-29 | 上海犀语科技有限公司 | 一种对文本进行无表格线切表的方法及装置 |
CN109284495B (zh) * | 2018-11-03 | 2023-02-07 | 上海犀语科技有限公司 | 一种对文本进行无表格线切表的方法及装置 |
CN109961008A (zh) * | 2019-02-13 | 2019-07-02 | 平安科技(深圳)有限公司 | 基于文字定位识别的表格解析方法、介质及计算机设备 |
CN109858468A (zh) * | 2019-03-04 | 2019-06-07 | 汉王科技股份有限公司 | 一种表格线识别方法及装置 |
CN111695371B (zh) * | 2019-03-12 | 2024-05-03 | 珠海金山办公软件有限公司 | 一种表格识别的方法、装置、电子设备及存储介质 |
CN111695371A (zh) * | 2019-03-12 | 2020-09-22 | 珠海金山办公软件有限公司 | 一种表格识别的方法、装置、电子设备及存储介质 |
CN113728321A (zh) * | 2019-04-08 | 2021-11-30 | 微软技术许可有限责任公司 | 利用训练表的集合来准确预测各种表内的错误 |
WO2020233379A1 (zh) * | 2019-05-17 | 2020-11-26 | 上海肇观电子科技有限公司 | 版面分析方法、阅读辅助设备、电路及介质 |
CN110348294A (zh) * | 2019-05-30 | 2019-10-18 | 平安科技(深圳)有限公司 | Pdf文档中图表的定位方法、装置及计算机设备 |
WO2020238054A1 (zh) * | 2019-05-30 | 2020-12-03 | 平安科技(深圳)有限公司 | Pdf文档中图表的定位方法、装置及计算机设备 |
CN110348294B (zh) * | 2019-05-30 | 2024-04-16 | 平安科技(深圳)有限公司 | Pdf文档中图表的定位方法、装置及计算机设备 |
CN110472209B (zh) * | 2019-07-04 | 2024-02-06 | 深圳同奈信息科技有限公司 | 基于深度学习的表格生成方法、装置和计算机设备 |
CN110472209A (zh) * | 2019-07-04 | 2019-11-19 | 重庆金融资产交易所有限责任公司 | 基于深度学习的表格生成方法、装置和计算机设备 |
CN111104871B (zh) * | 2019-11-28 | 2023-11-07 | 北京明略软件系统有限公司 | 表格区域识别模型生成方法、装置及表格定位方法、装置 |
CN111104871A (zh) * | 2019-11-28 | 2020-05-05 | 北京明略软件系统有限公司 | 表格区域识别模型生成方法、装置及表格定位方法、装置 |
JP2021095275A (ja) * | 2019-12-19 | 2021-06-24 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
JP7418200B2 (ja) | 2019-12-19 | 2024-01-19 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
US11930142B2 (en) | 2019-12-19 | 2024-03-12 | Canon Kabushiki Kaisha | Identification apparatus, processing apparatus, processing method, and storage medium |
JP2021096774A (ja) * | 2019-12-19 | 2021-06-24 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
WO2021124715A1 (ja) * | 2019-12-19 | 2021-06-24 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
JP7361594B2 (ja) | 2019-12-19 | 2023-10-16 | キヤノン株式会社 | 識別装置、処理装置、処理方法、およびプログラム |
CN111860257B (zh) * | 2020-07-10 | 2022-11-11 | 上海交通大学 | 融合多种文本特征及几何信息的表格识别方法及系统 |
CN111860257A (zh) * | 2020-07-10 | 2020-10-30 | 上海交通大学 | 融合多种文本特征及几何信息的表格识别方法及系统 |
CN112380812A (zh) * | 2020-10-09 | 2021-02-19 | 北京中科凡语科技有限公司 | Pdf不完整框线表格提取方法、装置、设备及存储介质 |
CN113408256B (zh) * | 2021-06-30 | 2023-12-19 | 平安科技(深圳)有限公司 | 一种表格图片的表格重构方法、装置及相关设备 |
CN113408256A (zh) * | 2021-06-30 | 2021-09-17 | 平安科技(深圳)有限公司 | 一种表格图片的表格重构方法、装置及相关设备 |
CN113903016B (zh) * | 2021-12-09 | 2022-05-13 | 深圳佑驾创新科技有限公司 | 分岔点检测方法、装置、计算机设备和存储介质 |
CN113903016A (zh) * | 2021-12-09 | 2022-01-07 | 深圳佑驾创新科技有限公司 | 分岔点检测方法、装置、计算机设备和计算机程序产品 |
Also Published As
Publication number | Publication date |
---|---|
CN104517112B (zh) | 2017-11-28 |
US9268999B2 (en) | 2016-02-23 |
US20150093021A1 (en) | 2015-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104517112A (zh) | 一种表格识别方法与系统 | |
CN104517106A (zh) | 一种列表识别方法与系统 | |
AU2018247340B2 (en) | Dvqa: understanding data visualizations through question answering | |
Kim et al. | Transparency and accountability in AI decision support: Explaining and visualizing convolutional neural networks for text information | |
Zhao et al. | Recognition of building group patterns using graph convolutional network | |
CN107169485B (zh) | 一种数学公式识别方法和装置 | |
CN111095296A (zh) | 使用机器学习对字符串进行分类 | |
Wilkinson et al. | Neural Ctrl-F: segmentation-free query-by-string word spotting in handwritten manuscript collections | |
CN112949476B (zh) | 基于图卷积神经网络的文本关系检测方法、装置及存储介质 | |
CN106844481B (zh) | 字体相似度及字体替换方法 | |
Deng et al. | Recognizing building groups for generalization: a comparative study | |
US11769341B2 (en) | System and method to extract information from unstructured image documents | |
US20230138491A1 (en) | Continuous learning for document processing and analysis | |
CN104142912A (zh) | 一种精确的语料类别标注方法及装置 | |
CN106407392A (zh) | 一种基于标记语言的节点映射关系抽取方法及系统 | |
US10402484B2 (en) | Aligning annotation of fields of documents | |
CN112416992B (zh) | 基于大数据和关键词的行业类型识别方法、系统及设备 | |
CN115936003A (zh) | 基于神经网络的软件功能点查重方法、装置、设备及介质 | |
Chen et al. | A deep learning-based method for deep information extraction from multimodal data for geological reports to support geological knowledge graph construction | |
CN111046934B (zh) | 一种swift报文软条款识别方法及装置 | |
CN110399984A (zh) | 一种信息的预测方法、系统以及电子设备 | |
Touya | Lessons learned from research on multimedia summarization | |
Wang et al. | [Retracted] Deep‐Learning‐Guided Point Cloud Modeling with Applications in Intelligent Manufacturing | |
Jayawardhana et al. | Sketch based database querying system | |
Qi et al. | A mixed image segmentation method based on intelligent equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220629 Address after: 3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031 Patentee after: New founder holdings development Co.,Ltd. Patentee after: Beijing Fangzheng apapi Technology Co., Ltd. Address before: 100871, Beijing, Haidian District Cheng Fu Road 298, founder building, 9 floor Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Patentee before: Beijing Fangzheng apapi Technology Co., Ltd. |
|
TR01 | Transfer of patent right |