CN112836484A - 一种文本对齐方法、装置、电子设备、计算机可读存储介质 - Google Patents
一种文本对齐方法、装置、电子设备、计算机可读存储介质 Download PDFInfo
- Publication number
- CN112836484A CN112836484A CN202110421920.0A CN202110421920A CN112836484A CN 112836484 A CN112836484 A CN 112836484A CN 202110421920 A CN202110421920 A CN 202110421920A CN 112836484 A CN112836484 A CN 112836484A
- Authority
- CN
- China
- Prior art keywords
- text
- keywords
- matching
- keyword
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/189—Automatic justification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/186—Templates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
- G06V10/751—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421920.0A CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421920.0A CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112836484A true CN112836484A (zh) | 2021-05-25 |
CN112836484B CN112836484B (zh) | 2021-08-27 |
Family
ID=75929858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110421920.0A Active CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112836484B (zh) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113223661A (zh) * | 2021-05-26 | 2021-08-06 | 杭州比康信息科技有限公司 | 中药处方传输系统 |
CN113779308A (zh) * | 2021-11-12 | 2021-12-10 | 冠传网络科技(南京)有限公司 | 一种短视频检测和多分类方法、装置及存储介质 |
CN113987593A (zh) * | 2021-12-28 | 2022-01-28 | 北京妙医佳健康科技集团有限公司 | 一种数据处理方法 |
CN114241487A (zh) * | 2021-12-20 | 2022-03-25 | 北京妙医佳健康科技集团有限公司 | 一种ocr识别方法 |
CN115482537A (zh) * | 2022-10-14 | 2022-12-16 | 北京中科万国互联网技术有限公司 | 基于迭代聚类处理ocr识别结果的文本对齐方法及系统 |
CN117792806A (zh) * | 2023-12-26 | 2024-03-29 | 安徽思宇微电子技术有限责任公司 | 一种基于poe供电的用电信息采集终端 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070244879A1 (en) * | 2006-04-14 | 2007-10-18 | Clausner Timothy C | System and method for retrieving task information using task-based semantic indexes |
CN101996631A (zh) * | 2009-08-28 | 2011-03-30 | 国际商业机器公司 | 用于对齐文本的方法和装置 |
CN106156082A (zh) * | 2015-03-31 | 2016-11-23 | 华为技术有限公司 | 一种本体对齐方法及装置 |
CN108647319A (zh) * | 2018-05-10 | 2018-10-12 | 思派(北京)网络科技有限公司 | 一种基于短文本聚类的标注系统及其方法 |
CN109033060A (zh) * | 2018-08-16 | 2018-12-18 | 科大讯飞股份有限公司 | 一种信息对齐方法、装置、设备及可读存储介质 |
CN112541062A (zh) * | 2020-11-27 | 2021-03-23 | 北京百分点信息科技有限公司 | 平行语料对齐方法、装置、存储介质及电子设备 |
-
2021
- 2021-04-20 CN CN202110421920.0A patent/CN112836484B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070244879A1 (en) * | 2006-04-14 | 2007-10-18 | Clausner Timothy C | System and method for retrieving task information using task-based semantic indexes |
CN101996631A (zh) * | 2009-08-28 | 2011-03-30 | 国际商业机器公司 | 用于对齐文本的方法和装置 |
CN106156082A (zh) * | 2015-03-31 | 2016-11-23 | 华为技术有限公司 | 一种本体对齐方法及装置 |
CN108647319A (zh) * | 2018-05-10 | 2018-10-12 | 思派(北京)网络科技有限公司 | 一种基于短文本聚类的标注系统及其方法 |
CN109033060A (zh) * | 2018-08-16 | 2018-12-18 | 科大讯飞股份有限公司 | 一种信息对齐方法、装置、设备及可读存储介质 |
CN112541062A (zh) * | 2020-11-27 | 2021-03-23 | 北京百分点信息科技有限公司 | 平行语料对齐方法、装置、存储介质及电子设备 |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113223661A (zh) * | 2021-05-26 | 2021-08-06 | 杭州比康信息科技有限公司 | 中药处方传输系统 |
CN113779308A (zh) * | 2021-11-12 | 2021-12-10 | 冠传网络科技(南京)有限公司 | 一种短视频检测和多分类方法、装置及存储介质 |
CN114241487A (zh) * | 2021-12-20 | 2022-03-25 | 北京妙医佳健康科技集团有限公司 | 一种ocr识别方法 |
CN113987593A (zh) * | 2021-12-28 | 2022-01-28 | 北京妙医佳健康科技集团有限公司 | 一种数据处理方法 |
CN113987593B (zh) * | 2021-12-28 | 2022-03-15 | 北京妙医佳健康科技集团有限公司 | 一种数据处理方法 |
CN115482537A (zh) * | 2022-10-14 | 2022-12-16 | 北京中科万国互联网技术有限公司 | 基于迭代聚类处理ocr识别结果的文本对齐方法及系统 |
CN115482537B (zh) * | 2022-10-14 | 2024-03-12 | 北京中科万国互联网技术有限公司 | 基于迭代聚类处理ocr识别结果的文本对齐方法及系统 |
CN117792806A (zh) * | 2023-12-26 | 2024-03-29 | 安徽思宇微电子技术有限责任公司 | 一种基于poe供电的用电信息采集终端 |
Also Published As
Publication number | Publication date |
---|---|
CN112836484B (zh) | 2021-08-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112836484B (zh) | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 | |
WO2022105122A1 (zh) | 基于人工智能的答案生成方法、装置、计算机设备及介质 | |
EP2565804B1 (en) | Text-based searching of image data | |
US20230004604A1 (en) | Ai-augmented auditing platform including techniques for automated document processing | |
CN108664595B (zh) | 领域知识库构建方法、装置、计算机设备和存储介质 | |
US9652695B2 (en) | Label consistency for image analysis | |
CN111626048A (zh) | 文本纠错方法、装置、设备及存储介质 | |
US20200125954A1 (en) | Systems and methods for selecting and generating log parsers using neural networks | |
CN110781460A (zh) | 版权认证方法、装置、设备、系统及计算机可读存储介质 | |
CN112307164A (zh) | 信息推荐方法、装置、计算机设备和存储介质 | |
CN110362798B (zh) | 裁决信息检索分析方法、装置、计算机设备和存储介质 | |
US11507901B1 (en) | Apparatus and methods for matching video records with postings using audiovisual data processing | |
CN112966626A (zh) | 人脸识别方法和装置 | |
CN113159013A (zh) | 基于机器学习的段落识别方法、装置、计算机设备和介质 | |
CN113705691B (zh) | 基于人工智能的图像标注校验方法、装置、设备及介质 | |
CN113033271A (zh) | 利用人工智能模块学习脸部辨识的处理方法 | |
CN110874326A (zh) | 测试用例生成方法、装置、计算机设备及存储介质 | |
CN114547087B (zh) | 提案自动识别并生成报告的方法、装置、设备和介质 | |
CN116384344A (zh) | 一种文档转换方法、装置及存储介质 | |
CN112989820B (zh) | 法律文书定位方法、装置、设备及存储介质 | |
US11880798B2 (en) | Determining section conformity and providing recommendations | |
CN113806472A (zh) | 一种对文字图片和图像型扫描件实现全文检索的方法及设备 | |
Tornés et al. | Receipt Dataset for Document Forgery Detection | |
CN112329468B (zh) | 异质关系网络的构建方法、装置、计算机设备及存储介质 | |
CN113688268B (zh) | 图片信息抽取方法、装置、计算机设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information | ||
CB03 | Change of inventor or designer information |
Inventor after: Liu Chaozhen Inventor after: Wang Hai Inventor after: Liu Bangchang Inventor after: Chang Dejie Inventor after: Li Dongdong Inventor after: Zhao Hongwen Inventor after: Gu Shufeng Inventor after: Zhao Jin Inventor after: Luo Xiaobin Inventor before: Liu Chaozhen Inventor before: Wang Hai Inventor before: Liu Bangchang Inventor before: Chang Dejie Inventor before: Li Dongdong Inventor before: Zhao Hongwen Inventor before: Gu Shufeng Inventor before: Zhao Jin Inventor before: Luo Xiaobin |