CN111881257B - Automatic matching method, system and storage medium based on subject words and sentence gist
- Publication number
- CN111881257B
- Authority
- CN
- China
- Prior art keywords
- text
- subject
- matching
- coding
- prediction model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Concept | Category | Sections | Count |
---|---|---|---|
method | Methods | title, claims, abstract, description | 65 |
storage | Methods | title, claims, abstract, description | 11 |
vector | Substances | claims, abstract, description | 35 |
training | Methods | claims, abstract, description | 14 |
extraction | Methods | claims, abstract, description | 12 |
recruitment | Effects | claims, description | 17 |
segmentation | Effects | claims, description | 14 |
function | Effects | claims, description | 12 |
matrix material | Substances | claims, description | 12 |
deep learning | Methods | claims, description | 7 |
cleaning | Methods | claims, description | 4 |
activation | Effects | claims, description | 3 |
process | Effects | abstract, description | 15 |
processing | Methods | abstract, description | 8 |
benefit | Effects | abstract, description | 6 |
natural language processing | Methods | abstract, description | 3 |
perception | Effects | abstract, description | 3 |
convolutional neural network | Methods | description | 6 |
algorithm | Methods | description | 5 |
engineering process | Methods | description | 5 |
artificial neural network | Methods | description | 4 |
diagram | Methods | description | 3 |
manufacturing process | Methods | description | 3 |
cyclic group | Chemical group | description | 2 |
design | Methods | description | 2 |
effects | Effects | description | 2 |
material | Substances | description | 2 |
model calculation | Methods | description | 2 |
optical | Effects | description | 2 |
optimization | Methods | description | 2 |
screening | Methods | description | 2 |
alteration | Effects | description | 1 |
beneficial effect | Effects | description | 1 |
bidirectional effect | Effects | description | 1 |
communication | Methods | description | 1 |
defect | Effects | description | 1 |
development | Methods | description | 1 |
developmental process | Effects | description | 1 |
enhancing effect | Effects | description | 1 |
improvement | Effects | description | 1 |
integration | Effects | description | 1 |
labelling | Methods | description | 1 |
modification | Methods | description | 1 |
modification | Effects | description | 1 |
optical fiber | Substances | description | 1 |
product | Substances | description | 1 |
research | Methods | description | 1 |
substitution reaction | Methods | description | 1 |
supplement | Substances | description | 1 |
transformation | Effects | description | 1 |
transformation | Methods | description | 1 |
washing | Methods | description | 1 |
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/105—Human resources
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Entrepreneurship & Innovation (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Biomedical Technology (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010720583.0A CN111881257B (zh) | 2020-07-24 | 2020-07-24 | Automatic matching method, system and storage medium based on subject words and sentence gist |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010720583.0A CN111881257B (zh) | 2020-07-24 | 2020-07-24 | Automatic matching method, system and storage medium based on subject words and sentence gist |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111881257A (zh) | 2020-11-03 |
CN111881257B (zh) | 2022-06-03 |
Family ID: 73200235
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010720583.0A Active CN111881257B (zh) | Automatic matching method, system and storage medium based on subject words and sentence gist | 2020-07-24 | 2020-07-24 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111881257B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115544213B (zh) * | 2022-11-28 | 2023-03-10 | 上海朝阳永续信息技术股份有限公司 | Method, device and storage medium for obtaining information from text |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
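CN109086375A (zh) * | 2018-07-24 | 2018-12-25 | 武汉大学 | A short-text topic extraction method based on word-vector enhancement |
CN109815336A (zh) * | 2019-01-28 | 2019-05-28 | 无码科技(杭州)有限公司 | A text aggregation method and system |
CN109918510A (zh) * | 2019-03-26 | 2019-06-21 | 中国科学技术大学 | Cross-domain keyword extraction method |
CN109992648A (zh) * | 2019-04-10 | 2019-07-09 | 北京神州泰岳软件股份有限公司 | Deep text matching method and device based on word transfer learning |
CN110287494A (zh) * | 2019-07-01 | 2019-09-27 | 济南浪潮高新科技投资发展有限公司 | A short-text similarity matching method based on the deep-learning BERT algorithm |
CN110413785A (zh) * | 2019-07-25 | 2019-11-05 | 淮阴工学院 | An automatic text classification method based on BERT and feature fusion |
CN110866095A (zh) * | 2019-10-10 | 2020-03-06 | 重庆金融资产交易所有限责任公司 | A method for determining text similarity and related equipment |
CN111241828A (zh) * | 2020-01-10 | 2020-06-05 | 平安科技(深圳)有限公司 | Intelligent emotion recognition method, device and computer-readable storage medium |
CN111368038A (zh) * | 2020-03-09 | 2020-07-03 | 广州市百果园信息技术有限公司 | Keyword extraction method and device, computer equipment and storage medium |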
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2003256456A1 (en) * | 2002-07-03 | 2004-01-23 | Word Data Corp. | Text-representation, text-matching and text-classification code, system and method |
US20200159863A1 (en) * | 2018-11-20 | 2020-05-21 | Sap Se | Memory networks for fine-grain opinion mining |
CN109670029B (zh) * | 2018-12-28 | 2021-09-07 | 百度在线网络技术(北京)有限公司 | Method, device, computer equipment and storage medium for determining answers to questions |
Non-Patent Citations (1)
Title |
---|
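A survey of knowledge graph construction techniques; Liu Qiao; Journal of Computer Research and Development (《计算机研究与发展》); 2016-03-15; pp. 582-596 * |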
Also Published As
Publication number | Publication date |
---|---|
CN111881257A (zh) | 2020-11-03 |
Similar Documents
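Publication | Publication Date | Title |
---|---|---|
US10380236B1 (en) | | Machine learning system for annotating unstructured text |
CN109992782B (zh) | | Named entity recognition method and device for legal documents, and computer equipment |
CN106502985B (zh) | | A neural network modeling method and device for generating titles |
CN106202010B (zh) | | Method and device for constructing syntax trees of legal texts based on deep neural networks |
CN113254610B (zh) | | Multi-turn dialogue generation method for patent consultation |
CN114118065B (zh) | | A Chinese text error correction method and device for the electric power domain, storage medium and computing equipment |
Shanmugavadivel et al. | | An analysis of machine learning models for sentiment analysis of Tamil code-mixed data |
CN108256066B (zh) | | End-to-end hierarchical-decoding task-oriented dialogue system |
CN112528637A (zh) | | Text processing model training method and device, computer equipment and storage medium |
CN108363685B (zh) | | Text representation method for self-media data based on a recursive variational autoencoder model |
CN116737938A (zh) | | Fine-grained sentiment detection method and device for online data networks based on a fine-tuned large model |
CN115831102A (zh) | | Speech recognition method and device based on pre-trained feature representations, and electronic equipment |
CN113033182A (zh) | | Text creation assistance method, device and server |
CN116932762A (zh) | | A few-shot financial text classification method, system, medium and equipment |
CN113111190A (zh) | | A knowledge-driven dialogue generation method and device |
CN112183106A (zh) | | A semantic understanding method and device based on phoneme association and deep learning |
CN111881257B (zh) | | Automatic matching method, system and storage medium based on subject words and sentence gist |
CN114529917A (zh) | | A zero-shot Chinese single-character recognition method, system, device and storage medium |
US11941360B2 (en) | | Acronym definition network |
CN117436522A (zh) | | Biological event relation extraction method and method for constructing a large-scale cancer-themed biological event relation knowledge base |
CN117453917A (zh) | | Model training method and device, storage medium and electronic equipment |
CN117316140A (zh) | | Speech synthesis method, device, equipment, storage medium and program product |
CN116909435A (zh) | | A data processing method and device, electronic equipment and storage medium |
CN116795970A (zh) | | A dialogue generation method and its application in emotional companionship |
CN111259673A (zh) | | A legal judgment prediction method and system based on feedback-sequence multi-task learning |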
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2024-08-08; Address after: 1003, Building A, Zhiyun Industrial Park, No. 13 Huaxing Road, Tongsheng Community, Dalang Street, Longhua District, Shenzhen City, Guangdong Province, 518000; Patentee after: Shenzhen Wanzhida Enterprise Management Co.,Ltd.; Country or region after: China; Address before: 510006 No. 230 West Ring Road, University of Guangdong, Guangzhou; Patentee before: Guangzhou University; Country or region before: China |
TR01 | Transfer of patent right | Effective date of registration: 2024-08-09; Address after: 518000 E-1, 7th Floor, Building A, Jinfeng Building, No. 1001 and 1005 Shangbu South Road, Binjiang Community, Nanyuan Street, Futian District, Shenzhen, Guangdong Province; Patentee after: Shenzhen Jinzong Talent Network Service Co.,Ltd.; Country or region after: China; Address before: 1003, Building A, Zhiyun Industrial Park, No. 13 Huaxing Road, Tongsheng Community, Dalang Street, Longhua District, Shenzhen City, Guangdong Province, 518000; Patentee before: Shenzhen Wanzhida Enterprise Management Co.,Ltd.; Country or region before: China |