CN112836484B - 一种文本对齐方法、装置、电子设备、计算机可读存储介质 - Google Patents
一种文本对齐方法、装置、电子设备、计算机可读存储介质 Download PDFInfo
- Publication number
- CN112836484B CN112836484B CN202110421920.0A CN202110421920A CN112836484B CN 112836484 B CN112836484 B CN 112836484B CN 202110421920 A CN202110421920 A CN 202110421920A CN 112836484 B CN112836484 B CN 112836484B
- Authority
- CN
- China
- Prior art keywords
- text
- matching
- keywords
- keyword
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/189—Automatic justification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/186—Templates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
- G06V10/751—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421920.0A CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110421920.0A CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112836484A CN112836484A (zh) | 2021-05-25 |
CN112836484B true CN112836484B (zh) | 2021-08-27 |
Family
ID=75929858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110421920.0A Active CN112836484B (zh) | 2021-04-20 | 2021-04-20 | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112836484B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113223661B (zh) * | 2021-05-26 | 2023-07-21 | 杭州比康信息科技有限公司 | 中药处方传输系统 |
CN113779308B (zh) * | 2021-11-12 | 2022-02-25 | 冠传网络科技(南京)有限公司 | 一种短视频检测和多分类方法、装置及存储介质 |
CN114241487B (zh) * | 2021-12-20 | 2022-12-16 | 北京妙医佳健康科技集团有限公司 | 一种ocr识别方法 |
CN113987593B (zh) * | 2021-12-28 | 2022-03-15 | 北京妙医佳健康科技集团有限公司 | 一种数据处理方法 |
CN115482537B (zh) * | 2022-10-14 | 2024-03-12 | 北京中科万国互联网技术有限公司 | 基于迭代聚类处理ocr识别结果的文本对齐方法及系统 |
CN117792806A (zh) * | 2023-12-26 | 2024-03-29 | 安徽思宇微电子技术有限责任公司 | 一种基于poe供电的用电信息采集终端 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7979452B2 (en) * | 2006-04-14 | 2011-07-12 | Hrl Laboratories, Llc | System and method for retrieving task information using task-based semantic indexes |
CN101996631B (zh) * | 2009-08-28 | 2014-12-03 | 国际商业机器公司 | 用于对齐文本的方法和装置 |
CN106156082B (zh) * | 2015-03-31 | 2019-09-20 | 华为技术有限公司 | 一种本体对齐方法及装置 |
CN108647319B (zh) * | 2018-05-10 | 2021-07-06 | 思派(北京)网络科技有限公司 | 一种基于短文本聚类的标注系统及其方法 |
CN109033060B (zh) * | 2018-08-16 | 2023-01-17 | 科大讯飞股份有限公司 | 一种信息对齐方法、装置、设备及可读存储介质 |
CN112541062B (zh) * | 2020-11-27 | 2022-11-25 | 北京百分点科技集团股份有限公司 | 平行语料对齐方法、装置、存储介质及电子设备 |
-
2021
- 2021-04-20 CN CN202110421920.0A patent/CN112836484B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN112836484A (zh) | 2021-05-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112836484B (zh) | 一种文本对齐方法、装置、电子设备、计算机可读存储介质 | |
US10049096B2 (en) | System and method of template creation for a data extraction tool | |
US20220004878A1 (en) | Systems and methods for synthetic document and data generation | |
US20100150453A1 (en) | Determining near duplicate "noisy" data objects | |
US20190164109A1 (en) | Similarity Learning System and Similarity Learning Method | |
EP4363993A1 (en) | Ai-augmented auditing platform including techniques for automated document processing | |
US9652695B2 (en) | Label consistency for image analysis | |
CN112307164A (zh) | 信息推荐方法、装置、计算机设备和存储介质 | |
US20230237395A1 (en) | Apparatus and methods for matching video records with postings using audiovisual data processing | |
US20210406351A1 (en) | Non-face-to-face authentication system | |
CN112966626A (zh) | 人脸识别方法和装置 | |
CN113159013A (zh) | 基于机器学习的段落识别方法、装置、计算机设备和介质 | |
CN112801099A (zh) | 一种图像处理方法、装置、终端设备及介质 | |
CN113705691B (zh) | 基于人工智能的图像标注校验方法、装置、设备及介质 | |
CN113033271A (zh) | 利用人工智能模块学习脸部辨识的处理方法 | |
CN114495139A (zh) | 一种基于图像的作业查重系统及方法 | |
CN113705749A (zh) | 基于深度学习的二维码识别方法、装置、设备及存储介质 | |
KR102145858B1 (ko) | 문서 이미지로부터 인식된 용어를 표준화하기 위한 방법 | |
CN116384344A (zh) | 一种文档转换方法、装置及存储介质 | |
CN110874326A (zh) | 测试用例生成方法、装置、计算机设备及存储介质 | |
CN114547087B (zh) | 提案自动识别并生成报告的方法、装置、设备和介质 | |
CN112989820B (zh) | 法律文书定位方法、装置、设备及存储介质 | |
CN115294593A (zh) | 一种图像信息抽取方法、装置、计算机设备及存储介质 | |
CN114611471A (zh) | 一种电子文档的读取方法、装置、电子设备及存储介质 | |
US20200104588A1 (en) | Character authenticity determination |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Liu Chaozhen Inventor after: Wang Hai Inventor after: Liu Bangchang Inventor after: Chang Dejie Inventor after: Li Dongdong Inventor after: Zhao Hongwen Inventor after: Gu Shufeng Inventor after: Zhao Jin Inventor after: Luo Xiaobin Inventor before: Liu Chaozhen Inventor before: Wang Hai Inventor before: Liu Bangchang Inventor before: Chang Dejie Inventor before: Li Dongdong Inventor before: Zhao Hongwen Inventor before: Gu Shufeng Inventor before: Zhao Jin Inventor before: Luo Xiaobin |
|
CB03 | Change of inventor or designer information |