CN115019310B - Method and equipment for image and text recognition - Google Patents
Method and equipment for image and text recognition
- Publication number
- CN115019310B (application CN202210934997.2A)
- Authority
- CN
- China
- Prior art keywords
- text
- image
- abscissa
- text box
- corners
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/1444—Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
- G06V30/1448—Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields based on markings or identifiers characterising the document or the area
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210934997.2A | 2022-08-05 | 2022-08-05 | Method and equipment for image and text recognition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210934997.2A | 2022-08-05 | 2022-08-05 | Method and equipment for image and text recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115019310A | 2022-09-06 |
CN115019310B | 2022-11-29 |
Family
ID=83065495
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210934997.2A (Active) | Method and equipment for image and text recognition | 2022-08-05 | 2022-08-05 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115019310B |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115375689B * | 2022-10-25 | 2023-07-07 | 深圳华付技术股份有限公司 | Machine-vision-based tobacco shred barrel detection method, apparatus, device, and medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
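CN104574422A * | 2015-01-30 | 2015-04-29 | 北京控制工程研究所 | Low signal-to-noise-ratio infrared earth image information processing method |
CN112085022A * | 2020-09-09 | 2020-12-15 | 上海蜜度信息技术有限公司 | Method, system, and device for recognizing text |
CN113435449A * | 2021-08-03 | 2021-09-24 | 全知科技(杭州)有限责任公司 | Deep-learning-based OCR image text recognition and paragraph output method |
CN114330247A * | 2021-11-09 | 2022-04-12 | 世纪保众(北京)网络科技有限公司 | Automated insurance clause parsing method based on image recognition |
CN114429542A * | 2021-12-10 | 2022-05-03 | 北京航空航天大学 | Structured recognition method for medical laboratory test reports |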
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101398902B * | 2008-09-27 | 2012-07-04 | 宁波新然电子信息科技发展有限公司 | Online recognition method for natural handwritten Arabic letters |
Also Published As
Publication number | Publication date |
---|---|
CN115019310A | 2022-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
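CN105868758B | | Method, apparatus, and electronic device for detecting text regions in images |
CA2668413C | | Media material analysis of continuing article portions |
CN113158808B | | Method, medium, and device for character recognition, paragraph grouping, and layout reconstruction of ancient Chinese books |
WO2020133442A1 | | Text recognition method and terminal device |
CN112085022B | | Method, system, and device for recognizing text |
CN112597773B | | Document structuring method, system, terminal, and medium |
CN111460927B | | Method for extracting structured information from property ownership certificate images |
CN112883926B | | Method and apparatus for recognizing table-type medical images |
CN111626145B | | Simple and effective method for incomplete table recognition and cross-page stitching |
CN112241730A | | Machine-learning-based table extraction method and system |
CN110991403A | | Fragmented document information extraction method based on visual deep learning |
CN114004204A | | Computer-vision-based table structure reconstruction and text extraction method and system |
CN115019310B | | Method and equipment for image and text recognition |
CN112541922A | | Digital-image-based test paper layout segmentation method, electronic device, and storage medium |
CN114529773A | | Structural-unit-based table recognition method, system, terminal, and medium |
CN116824608A | | Answer sheet layout analysis method based on object detection |
CN110443235B | | Intelligent method and system for recognizing total scores on paper test papers |
CN114998905A | | Method, apparatus, and device for verifying the content of complex structured documents |
CN114386504A | | Text recognition method for engineering drawings |
CN114463770A | | Intelligent question segmentation method for general test paper questions |
CN111832497B | | Geometric-feature-based post-processing method for text detection |
CN112784932A | | Font recognition method, apparatus, and storage medium |
Yuan et al. | | An OpenCV-based framework for table information extraction |
CN112766269B | | Image text retrieval method, intelligent terminal, and storage medium |
CN115205881A | | Table recognition method, device, and medium |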
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Method and equipment for image and text recognition. Effective date of registration: 20230215. Granted publication date: 20221129. Pledgee: Shanghai Rural Commercial Bank Co.,Ltd. Pudong branch. Pledgor: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2023310000031 |
CP01 | Change in the name or title of a patent holder | Address after: Room 301ab, No.10, Lane 198, Zhangheng Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai 201204. Patentee after: Shanghai Mido Technology Co.,Ltd. Address before: Room 301ab, No.10, Lane 198, Zhangheng Road, China (Shanghai) Pilot Free Trade Zone, Pudong New Area, Shanghai 201204. Patentee before: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd. |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Granted publication date: 20221129. Pledgee: Shanghai Rural Commercial Bank Co.,Ltd. Pudong branch. Pledgor: SHANGHAI MDATA INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2023310000031 |