CN114365202B - 经由上下文无关的递归文档分解来学习的可扩展结构 - Google Patents
经由上下文无关的递归文档分解来学习的可扩展结构 Download PDFInfo
- Publication number
- CN114365202B CN114365202B CN202080063240.XA CN202080063240A CN114365202B CN 114365202 B CN114365202 B CN 114365202B CN 202080063240 A CN202080063240 A CN 202080063240A CN 114365202 B CN114365202 B CN 114365202B
- Authority
- CN
- China
- Prior art keywords
- image
- sum values
- row
- bitmap
- frequency
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/186—Extraction of features or characteristics of the image by deriving mathematical or geometrical properties from the whole image
- G06V30/187—Frequency domain transformation; Autocorrelation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/416—Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/43—Editing text-bitmaps, e.g. alignment, spacing; Semantic analysis of bitmaps of text without OCR
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Physics (AREA)
- Mathematical Analysis (AREA)
- Computer Graphics (AREA)
- Geometry (AREA)
- Algebra (AREA)
- Character Discrimination (AREA)
- Character Input (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/571,301 | 2019-09-16 | ||
US16/571,301 US11188748B2 (en) | 2019-09-16 | 2019-09-16 | Scalable structure learning via context-free recursive document decomposition |
PCT/IB2020/058572 WO2021053510A1 (en) | 2019-09-16 | 2020-09-15 | Scalable structure learning via context-free recursive document decomposition |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114365202A CN114365202A (zh) | 2022-04-15 |
CN114365202B true CN114365202B (zh) | 2022-09-20 |
Family
ID=74869686
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080063240.XA Active CN114365202B (zh) | 2019-09-16 | 2020-09-15 | 经由上下文无关的递归文档分解来学习的可扩展结构 |
Country Status (5)
Country | Link |
---|---|
US (1) | US11188748B2 (de) |
CN (1) | CN114365202B (de) |
DE (1) | DE112020003002T5 (de) |
GB (1) | GB2602229B (de) |
WO (1) | WO2021053510A1 (de) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11232454B2 (en) | 2019-11-14 | 2022-01-25 | Bank Of America Corporation | Authentication framework for real-time document processing |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7046848B1 (en) * | 2001-08-22 | 2006-05-16 | Olcott Peter L | Method and system for recognizing machine generated character glyphs and icons in graphic images |
US7400768B1 (en) * | 2001-08-24 | 2008-07-15 | Cardiff Software, Inc. | Enhanced optical recognition of digitized images through selective bit insertion |
US8311331B2 (en) * | 2010-03-09 | 2012-11-13 | Microsoft Corporation | Resolution adjustment of an image that includes text undergoing an OCR process |
US8739022B2 (en) * | 2007-09-27 | 2014-05-27 | The Research Foundation For The State University Of New York | Parallel approach to XML parsing |
CN108460385A (zh) * | 2018-03-02 | 2018-08-28 | 山东超越数控电子股份有限公司 | 一种文本图像分割方法与装置 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0358815B1 (de) | 1988-09-12 | 1993-05-26 | Océ-Nederland B.V. | System und Verfahren für automatische Segmentierung |
US5335290A (en) | 1992-04-06 | 1994-08-02 | Ricoh Corporation | Segmentation of text, picture and lines of a document image |
US6307962B1 (en) | 1995-09-01 | 2001-10-23 | The University Of Rochester | Document data compression system which automatically segments documents and generates compressed smart documents therefrom |
US7751596B2 (en) * | 1996-11-12 | 2010-07-06 | Digimarc Corporation | Methods and arrangements employing digital content items |
US6853854B1 (en) * | 1998-09-18 | 2005-02-08 | Q Step Technologies, Llc | Noninvasive measurement system |
US8249344B2 (en) | 2005-07-01 | 2012-08-21 | Microsoft Corporation | Grammatical parsing of document visual structures |
US7889885B2 (en) * | 2005-11-23 | 2011-02-15 | Pitney Bowes Inc. | Method for detecting perforations on the edge of an image of a form |
US7961959B2 (en) * | 2006-08-24 | 2011-06-14 | Dell Products L.P. | Methods and apparatus for reducing storage size |
JP6129759B2 (ja) * | 2014-02-03 | 2017-05-17 | 満男 江口 | Simd型超並列演算処理装置向け超解像処理方法、装置、プログラム及び記憶媒体 |
US10140548B2 (en) * | 2014-08-15 | 2018-11-27 | Lenovo (Singapore) Pte. Ltd. | Statistical noise analysis for motion detection |
US10158840B2 (en) | 2015-06-19 | 2018-12-18 | Amazon Technologies, Inc. | Steganographic depth images |
US10070009B2 (en) | 2016-09-22 | 2018-09-04 | Kyocera Document Solutions Inc. | Selection of halftoning technique based on microstructure detection |
US10515606B2 (en) * | 2016-09-28 | 2019-12-24 | Samsung Electronics Co., Ltd. | Parallelizing display update |
US10489502B2 (en) | 2017-06-30 | 2019-11-26 | Accenture Global Solutions Limited | Document processing |
US10922540B2 (en) * | 2018-07-03 | 2021-02-16 | Neural Vision Technologies LLC | Clustering, classifying, and searching documents using spectral computer vision and neural networks |
-
2019
- 2019-09-16 US US16/571,301 patent/US11188748B2/en active Active
-
2020
- 2020-09-15 CN CN202080063240.XA patent/CN114365202B/zh active Active
- 2020-09-15 WO PCT/IB2020/058572 patent/WO2021053510A1/en active Application Filing
- 2020-09-15 GB GB2203443.3A patent/GB2602229B/en active Active
- 2020-09-15 DE DE112020003002.4T patent/DE112020003002T5/de active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7046848B1 (en) * | 2001-08-22 | 2006-05-16 | Olcott Peter L | Method and system for recognizing machine generated character glyphs and icons in graphic images |
US7400768B1 (en) * | 2001-08-24 | 2008-07-15 | Cardiff Software, Inc. | Enhanced optical recognition of digitized images through selective bit insertion |
US8739022B2 (en) * | 2007-09-27 | 2014-05-27 | The Research Foundation For The State University Of New York | Parallel approach to XML parsing |
US8311331B2 (en) * | 2010-03-09 | 2012-11-13 | Microsoft Corporation | Resolution adjustment of an image that includes text undergoing an OCR process |
CN108460385A (zh) * | 2018-03-02 | 2018-08-28 | 山东超越数控电子股份有限公司 | 一种文本图像分割方法与装置 |
Also Published As
Publication number | Publication date |
---|---|
GB202203443D0 (en) | 2022-04-27 |
DE112020003002T5 (de) | 2022-03-10 |
US11188748B2 (en) | 2021-11-30 |
US20210081662A1 (en) | 2021-03-18 |
JP2022547962A (ja) | 2022-11-16 |
WO2021053510A1 (en) | 2021-03-25 |
GB2602229A (en) | 2022-06-22 |
CN114365202A (zh) | 2022-04-15 |
GB2602229B (en) | 2023-05-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110135411B (zh) | 名片识别方法和装置 | |
US9619735B1 (en) | Pure convolutional neural network localization | |
EP3117369B1 (de) | Erkennung und extraktion von bilddokumentkomponenten zur erzeugung eines flussdokuments | |
US8965127B2 (en) | Method for segmenting text words in document images | |
RU2571545C1 (ru) | Классификация изображений документов на основании контента | |
US8718365B1 (en) | Text recognition for textually sparse images | |
US9535910B2 (en) | Corpus generation based upon document attributes | |
CN111209827B (zh) | 一种基于特征检测的ocr识别票据问题的方法及系统 | |
US20220292803A1 (en) | Systems and methods for stamp detection and classification | |
JP2016110647A (ja) | 画像処理装置及び画像処理方法 | |
CN113221918B (zh) | 目标检测方法、目标检测模型的训练方法及装置 | |
JP5539488B2 (ja) | 参照背景色に基づく透明化塗りつぶしの判定 | |
Akinbade et al. | An adaptive thresholding algorithm-based optical character recognition system for information extraction in complex images | |
CN114365202B (zh) | 经由上下文无关的递归文档分解来学习的可扩展结构 | |
CN112508000B (zh) | 一种用于ocr图像识别模型训练数据生成的方法及设备 | |
CN112839185B (zh) | 用于处理图像的方法、装置、设备和介质 | |
US10963690B2 (en) | Method for identifying main picture in web page | |
CN112528610A (zh) | 一种数据标注方法、装置、电子设备及存储介质 | |
Li et al. | A text-line segmentation method for historical Tibetan documents based on baseline detection | |
JP7486574B2 (ja) | コンテキスト・フリーの再帰的な文書分解による拡張性のある構造学習 | |
US20220051009A1 (en) | Systems and methods for automatic context-based annotation | |
CN111783572B (zh) | 一种文本检测方法和装置 | |
US11281475B2 (en) | Reusable asset performance estimation | |
Sumetphong et al. | Modeling broken characters recognition as a set-partitioning problem | |
CN109614463B (zh) | 文本匹配处理方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |