CN110134777A - 问题去重方法、装置、电子设备和计算机可读存储介质 - Google Patents
问题去重方法、装置、电子设备和计算机可读存储介质 Download PDFInfo
- Publication number
- CN110134777A CN110134777A CN201910457996.1A CN201910457996A CN110134777A CN 110134777 A CN110134777 A CN 110134777A CN 201910457996 A CN201910457996 A CN 201910457996A CN 110134777 A CN110134777 A CN 110134777A
- Authority
- CN
- China
- Prior art keywords
- corpus
- vocabulary
- word
- typical
- category
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (12)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910457996.1A CN110134777B (zh) | 2019-05-29 | 2019-05-29 | 问题去重方法、装置、电子设备和计算机可读存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910457996.1A CN110134777B (zh) | 2019-05-29 | 2019-05-29 | 问题去重方法、装置、电子设备和计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110134777A true CN110134777A (zh) | 2019-08-16 |
CN110134777B CN110134777B (zh) | 2021-11-26 |
Family
ID=67582640
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910457996.1A Active CN110134777B (zh) | 2019-05-29 | 2019-05-29 | 问题去重方法、装置、电子设备和计算机可读存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110134777B (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110543551A (zh) * | 2019-09-04 | 2019-12-06 | 北京香侬慧语科技有限责任公司 | 一种问题语句处理方法和装置 |
CN111159370A (zh) * | 2019-12-20 | 2020-05-15 | 中国建设银行股份有限公司 | 一种短会话新问题生成方法、存储介质和人机交互装置 |
CN111241239A (zh) * | 2020-01-07 | 2020-06-05 | 科大讯飞股份有限公司 | 重题检测方法、相关设备及可读存储介质 |
CN112613295A (zh) * | 2020-12-21 | 2021-04-06 | 竹间智能科技(上海)有限公司 | 语料识别方法及装置、电子设备、存储介质 |
CN112883715A (zh) * | 2019-11-29 | 2021-06-01 | 武汉渔见晚科技有限责任公司 | 一种词向量的构建方法及装置 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106445907A (zh) * | 2015-08-06 | 2017-02-22 | 北京国双科技有限公司 | 一种领域词典的生成方法及装置 |
US20180032897A1 (en) * | 2016-07-26 | 2018-02-01 | International Business Machines Corporation | Event clustering and classification with document embedding |
CN107844533A (zh) * | 2017-10-19 | 2018-03-27 | 云南大学 | 一种智能问答系统及分析方法 |
US20180097749A1 (en) * | 2016-10-03 | 2018-04-05 | Nohold, Inc. | Interactive virtual conversation interface systems and methods |
US20180173697A1 (en) * | 2013-09-09 | 2018-06-21 | Ayasdi, Inc. | Automated discovery using textual analysis |
CN108345672A (zh) * | 2018-02-09 | 2018-07-31 | 平安科技(深圳)有限公司 | 智能应答方法、电子装置及存储介质 |
CN108595696A (zh) * | 2018-05-09 | 2018-09-28 | 长沙学院 | 一种基于云平台的人机交互智能问答方法和系统 |
CN109033221A (zh) * | 2018-06-29 | 2018-12-18 | 上海银赛计算机科技有限公司 | 答案生成方法、装置及服务器 |
CN105045812B (zh) * | 2015-06-18 | 2019-01-29 | 上海高欣计算机系统有限公司 | 文本主题的分类方法及系统 |
-
2019
- 2019-05-29 CN CN201910457996.1A patent/CN110134777B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180173697A1 (en) * | 2013-09-09 | 2018-06-21 | Ayasdi, Inc. | Automated discovery using textual analysis |
CN105045812B (zh) * | 2015-06-18 | 2019-01-29 | 上海高欣计算机系统有限公司 | 文本主题的分类方法及系统 |
CN106445907A (zh) * | 2015-08-06 | 2017-02-22 | 北京国双科技有限公司 | 一种领域词典的生成方法及装置 |
US20180032897A1 (en) * | 2016-07-26 | 2018-02-01 | International Business Machines Corporation | Event clustering and classification with document embedding |
US20180097749A1 (en) * | 2016-10-03 | 2018-04-05 | Nohold, Inc. | Interactive virtual conversation interface systems and methods |
CN107844533A (zh) * | 2017-10-19 | 2018-03-27 | 云南大学 | 一种智能问答系统及分析方法 |
CN108345672A (zh) * | 2018-02-09 | 2018-07-31 | 平安科技(深圳)有限公司 | 智能应答方法、电子装置及存储介质 |
CN108595696A (zh) * | 2018-05-09 | 2018-09-28 | 长沙学院 | 一种基于云平台的人机交互智能问答方法和系统 |
CN109033221A (zh) * | 2018-06-29 | 2018-12-18 | 上海银赛计算机科技有限公司 | 答案生成方法、装置及服务器 |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110543551A (zh) * | 2019-09-04 | 2019-12-06 | 北京香侬慧语科技有限责任公司 | 一种问题语句处理方法和装置 |
CN110543551B (zh) * | 2019-09-04 | 2022-11-08 | 北京香侬慧语科技有限责任公司 | 一种问题语句处理方法和装置 |
CN112883715A (zh) * | 2019-11-29 | 2021-06-01 | 武汉渔见晚科技有限责任公司 | 一种词向量的构建方法及装置 |
CN112883715B (zh) * | 2019-11-29 | 2023-11-07 | 武汉渔见晚科技有限责任公司 | 一种词向量的构建方法及装置 |
CN111159370A (zh) * | 2019-12-20 | 2020-05-15 | 中国建设银行股份有限公司 | 一种短会话新问题生成方法、存储介质和人机交互装置 |
CN111241239A (zh) * | 2020-01-07 | 2020-06-05 | 科大讯飞股份有限公司 | 重题检测方法、相关设备及可读存储介质 |
CN111241239B (zh) * | 2020-01-07 | 2022-12-02 | 科大讯飞股份有限公司 | 重题检测方法、相关设备及可读存储介质 |
CN112613295A (zh) * | 2020-12-21 | 2021-04-06 | 竹间智能科技(上海)有限公司 | 语料识别方法及装置、电子设备、存储介质 |
CN112613295B (zh) * | 2020-12-21 | 2023-12-22 | 竹间智能科技(上海)有限公司 | 语料识别方法及装置、电子设备、存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN110134777B (zh) | 2021-11-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106570708B (zh) | 一种智能客服知识库的管理方法及系统 | |
CN110134777A (zh) | 问题去重方法、装置、电子设备和计算机可读存储介质 | |
CN112711953B (zh) | 一种基于注意力机制和gcn的文本多标签分类方法和系统 | |
CN111581354A (zh) | 一种faq问句相似度计算方法及其系统 | |
CN109189767B (zh) | 数据处理方法、装置、电子设备及存储介质 | |
CN111767716B (zh) | 企业多级行业信息的确定方法、装置及计算机设备 | |
CN104573130B (zh) | 基于群体计算的实体解析方法及装置 | |
CN109271514B (zh) | 短文本分类模型的生成方法、分类方法、装置及存储介质 | |
CN108416032A (zh) | 一种文本分类方法、装置及存储介质 | |
CN113780007A (zh) | 语料筛选方法、意图识别模型优化方法、设备及存储介质 | |
CN113254655B (zh) | 文本分类方法、电子设备及计算机存储介质 | |
CN110019820A (zh) | 一种病历中主诉与现病史症状时间一致性检测方法 | |
Ranjan et al. | Document classification using lstm neural network | |
CN114610865A (zh) | 召回文本推荐方法、装置、设备及存储介质 | |
CN112131453A (zh) | 一种基于bert的网络不良短文本检测方法、装置及存储介质 | |
CN111144453A (zh) | 构建多模型融合计算模型的方法及设备、网站数据识别方法及设备 | |
CN110348497A (zh) | 一种基于WT-GloVe词向量构建的文本表示方法 | |
Yafooz et al. | Enhancing multi-class web video categorization model using machine and deep learning approaches | |
Suhasini et al. | A Hybrid TF-IDF and N-Grams Based Feature Extraction Approach for Accurate Detection of Fake News on Twitter Data | |
CN111341404B (zh) | 一种基于ernie模型的电子病历数据组解析方法及系统 | |
Al Mahmud et al. | A New Approach to Analysis of Public Sentiment on Padma Bridge in Bangla Text | |
KR102155692B1 (ko) | 소셜 네트워크 서비스 메시지의 감정 분석을 위한 POS(part of speech) 특징기반의 감정 분석 방법 및 이를 수행하는 감정 분석 장치 | |
CN113761104A (zh) | 知识图谱中实体关系的检测方法、装置和电子设备 | |
CN115310564B (zh) | 一种分类标签更新方法及系统 | |
CN116304058B (zh) | 企业负面信息的识别方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200727 Address after: 518057 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer Applicant after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. Address before: 100029, Beijing, Chaoyang District new East Street, building No. 2, -3 to 25, 101, 8, 804 rooms Applicant before: Tricorn (Beijing) Technology Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200927 Address after: 518057 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer Applicant after: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. Applicant after: BEIJING RESEARCH CENTER FOR INFORMATION TECHNOLOGY IN AGRICULTURE Applicant after: NONGXIN TECHNOLOGY (BEIJING) Co.,Ltd. Address before: 518057 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer Applicant before: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |