CN104778256A - A fast incremental clustering method for domain question answering system consultation - Google Patents
- Publication number
- CN104778256A (application CN201510187231.2A)
- Authority
- CN
- China
- Prior art keywords
- cluster
- similarity
- consulting
- class
- sentence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510187231.2A CN104778256B (zh) | 2015-04-20 | 2015-04-20 | A fast incremental clustering method for domain question answering system consultation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510187231.2A CN104778256B (zh) | 2015-04-20 | 2015-04-20 | A fast incremental clustering method for domain question answering system consultation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104778256A (zh) | 2015-07-15 |
CN104778256B (zh) | 2017-10-17 |
Family
ID=53619720
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510187231.2A Active CN104778256B (zh) | A fast incremental clustering method for domain question answering system consultation | 2015-04-20 | 2015-04-20 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104778256B (zh) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
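CN105512106A (zh) * | 2015-12-09 | 2016-04-20 | Jiangsu University of Science and Technology | An automatic recognition method for Chinese separable words |
CN105824955A (zh) * | 2016-03-30 | 2016-08-03 | Beijing Xiaomi Mobile Software Co., Ltd. | Short message clustering method and device |
CN106445920A (zh) * | 2016-09-29 | 2017-02-22 | Beijing Institute of Technology | Sentence similarity calculation method using sentence-meaning structure features |
CN106446148A (zh) * | 2016-09-21 | 2017-02-22 | China Academy of Launch Vehicle Technology | A clustering-based text duplicate checking method |
CN107341157A (zh) * | 2016-04-29 | 2017-11-10 | Alibaba Group Holding Limited | A customer service dialogue clustering method and device |
CN109461037A (zh) * | 2018-12-17 | 2019-03-12 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Comment opinion clustering method, device and terminal |
CN110162604A (zh) * | 2019-01-24 | 2019-08-23 | Tencent Technology (Shenzhen) Co., Ltd. | Sentence generation method, apparatus, device and storage medium |
CN110472055A (zh) * | 2019-08-21 | 2019-11-19 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Method and apparatus for labeling data |
CN110727779A (zh) * | 2019-10-16 | 2020-01-24 | Sunyard System Engineering Co., Ltd. | Question answering method and system based on multi-model fusion |
CN112599120A (zh) * | 2020-12-11 | 2021-04-02 | Shanghai Zhongtongji Network Technology Co., Ltd. | Semantic determination method and device based on a custom-weighted WMD algorithm |
CN113836275A (zh) * | 2020-06-08 | 2021-12-24 | Cainiao Smart Logistics Holding Limited | Dialogue model building method and device |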
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090012926A1 (en) * | 2006-03-01 | 2009-01-08 | Nec Corporation | Question answering device, question answering method, and question answering program |
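CN101630312A (zh) * | 2009-08-19 | 2010-01-20 | Tencent Technology (Shenzhen) Co., Ltd. | A clustering method and system for questions in a question answering platform |
CN102682000A (zh) * | 2011-03-09 | 2012-09-19 | Beijing Baidu Netcom Science and Technology Co., Ltd. | A text clustering method, and a question answering system and search engine using the method |
CN102955856A (zh) * | 2012-11-09 | 2013-03-06 | Beihang University | A Chinese short text classification method based on feature expansion |
CN104008166A (zh) * | 2014-05-30 | 2014-08-27 | East China Normal University | A dialogue short text clustering method based on morphological and semantic similarity |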
Non-Patent Citations (4)
Title |
---|
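Liu Liangliang et al.: "Research and implementation of a domain-specific Chinese question answering system based on query templates", Journal of Jiangsu University of Science and Technology (Natural Science Edition) * |
Pan Min et al.: "Research on incremental text clustering based on cluster features", Journal of Jiangxi Normal University (Natural Science Edition) * |
Wang Shi et al.: "A collocation-based method for computing the semantic similarity of Chinese words", Journal of Chinese Information Processing * |
Wang Jinquan et al.: "Research on sentence similarity based on N-gram and the vector space model", Modern Foreign Languages (Quarterly) * |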
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
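CN105512106B (zh) * | 2015-12-09 | 2018-04-06 | Jiangsu University of Science and Technology | An automatic recognition method for Chinese separable words |
CN105512106A (zh) * | 2015-12-09 | 2016-04-20 | Jiangsu University of Science and Technology | An automatic recognition method for Chinese separable words |
CN105824955A (zh) * | 2016-03-30 | 2016-08-03 | Beijing Xiaomi Mobile Software Co., Ltd. | Short message clustering method and device |
CN107341157B (zh) * | 2016-04-29 | 2021-01-22 | Alibaba Group Holding Limited | A customer service dialogue clustering method and device |
CN107341157A (zh) * | 2016-04-29 | 2017-11-10 | Alibaba Group Holding Limited | A customer service dialogue clustering method and device |
CN106446148A (zh) * | 2016-09-21 | 2017-02-22 | China Academy of Launch Vehicle Technology | A clustering-based text duplicate checking method |
CN106446148B (zh) * | 2016-09-21 | 2019-08-09 | China Academy of Launch Vehicle Technology | A clustering-based text duplicate checking method |
CN106445920A (zh) * | 2016-09-29 | 2017-02-22 | Beijing Institute of Technology | Sentence similarity calculation method using sentence-meaning structure features |
CN109461037B (zh) * | 2018-12-17 | 2022-10-28 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Comment opinion clustering method, device and terminal |
CN109461037A (zh) * | 2018-12-17 | 2019-03-12 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Comment opinion clustering method, device and terminal |
CN110162604A (zh) * | 2019-01-24 | 2019-08-23 | Tencent Technology (Shenzhen) Co., Ltd. | Sentence generation method, apparatus, device and storage medium |
WO2020151690A1 (zh) * | 2019-01-24 | 2020-07-30 | Tencent Technology (Shenzhen) Co., Ltd. | Sentence generation method, apparatus, device and storage medium |
CN110162604B (zh) * | 2019-01-24 | 2023-09-12 | Tencent Technology (Shenzhen) Co., Ltd. | Sentence generation method, apparatus, device and storage medium |
CN110472055A (zh) * | 2019-08-21 | 2019-11-19 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Method and apparatus for labeling data |
CN110472055B (zh) * | 2019-08-21 | 2021-09-14 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Method and apparatus for labeling data |
CN110727779A (zh) * | 2019-10-16 | 2020-01-24 | Sunyard System Engineering Co., Ltd. | Question answering method and system based on multi-model fusion |
CN113836275A (zh) * | 2020-06-08 | 2021-12-24 | Cainiao Smart Logistics Holding Limited | Dialogue model building method and device |
CN113836275B (zh) * | 2020-06-08 | 2023-09-05 | Cainiao Smart Logistics Holding Limited | Dialogue model building method and device, non-volatile storage medium and electronic device |
CN112599120A (zh) * | 2020-12-11 | 2021-04-02 | Shanghai Zhongtongji Network Technology Co., Ltd. | Semantic determination method and device based on a custom-weighted WMD algorithm |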
Also Published As
Publication number | Publication date |
---|---|
CN104778256B (zh) | 2017-10-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104778256A (zh) | | A fast incremental clustering method for domain question answering system consultation |
CN110852087B (zh) | | Chinese error correction method and device, storage medium and electronic device |
CN107766324B (zh) | | A text consistency analysis method based on a deep neural network |
CN108038205B (zh) | | Opinion analysis prototype system for Chinese microblogs |
Fetaya et al. | | Restoration of fragmentary Babylonian texts using recurrent neural networks |
CN114065758B (zh) | | A document keyword extraction method based on hypergraph random walk |
Suleiman et al. | | The use of hidden Markov model in natural ARABIC language processing: a survey |
Saloot et al. | | An architecture for Malay Tweet normalization |
CN111222318B (zh) | | Trigger word recognition method based on a dual-channel bidirectional LSTM-CRF network |
CN104598588A (zh) | | Automatic microblog user tag generation algorithm based on biclustering |
Zu et al. | | Resume information extraction with a novel text block segmentation algorithm |
Gharatkar et al. | | Review preprocessing using data cleaning and stemming technique |
CN103324626A (zh) | | A method for building a multi-granularity dictionary, a word segmentation method, and devices thereof |
CN105956158A (zh) | | Method for automatically extracting new internet words based on massive microblog text and user information |
Ali et al. | | SiNER: A large dataset for Sindhi named entity recognition |
Jia et al. | | A Chinese unknown word recognition method for micro-blog short text based on improved FP-growth |
CN115269834A (zh) | | A BERT-based high-precision text classification method and device |
Sembok et al. | | Arabic word stemming algorithms and retrieval effectiveness |
Andrews et al. | | Robust entity clustering via phylogenetic inference |
Han | | Improving the utility of social media with natural language processing |
CN110929022A (zh) | | A text summary generation method and system |
Čibej et al. | | Normalisation, tokenisation and sentence segmentation of Slovene tweets |
Havrashenko et al. | | Analysis of text augmentation algorithms in artificial language machine translation systems |
Adak | | A bilingual machine translation system: English & Bengali |
Hammad et al. | | Sentiment analysis of sindhi tweets dataset using supervised machine learning techniques |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
EXSB | Decision made by SIPO to initiate substantive examination | |
SE01 | Entry into force of request for substantive examination | |
CB03 | Change of inventor or designer information | Inventor after: Wu Jiankang, Liu Liangliang, Li Hongmei, Ma Jian. Inventor before: Ma Jian, Liu Liangliang, Wu Jiankang, Li Hongmei. |
CB03 | Change of inventor or designer information | |
GR01 | Patent grant | |
GR01 | Patent grant | |
EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 20150715. Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY. Contract record no.: X2020980007325. Denomination of invention: A fast incremental clustering method for domain question answering system consultation. Granted publication date: 20171017. License type: Common License. Record date: 20201029. |
EE01 | Entry into force of recordation of patent licensing contract | |
EC01 | Cancellation of recordation of patent licensing contract | Assignee: JIANGSU KEDA HUIFENG SCIENCE AND TECHNOLOGY Co.,Ltd. Assignor: JIANGSU University OF SCIENCE AND TECHNOLOGY. Contract record no.: X2020980007325. Date of cancellation: 20201223. |
EC01 | Cancellation of recordation of patent licensing contract | |
TR01 | Transfer of patent right | Effective date of registration: 20221228. Address after: Room 02A-084, Building C (Second Floor), No. 28, Xinxi Road, Haidian District, Beijing 100085. Patentee after: Jingchuang United (Beijing) Intellectual Property Service Co.,Ltd. Address before: 212003, No. 2, Mengxi Road, Zhenjiang, Jiangsu. Patentee before: JIANGSU University OF SCIENCE AND TECHNOLOGY. |
TR01 | Transfer of patent right | Effective date of registration: 20221228. Address after: Room 606-609, Compound Office Complex Building, No. 757, Dongfeng East Road, Yuexiu District, Guangzhou, Guangdong Province, 510699. Patentee after: China Southern Power Grid Internet Service Co.,Ltd. Address before: Room 02A-084, Building C (Second Floor), No. 28, Xinxi Road, Haidian District, Beijing 100085. Patentee before: Jingchuang United (Beijing) Intellectual Property Service Co.,Ltd. |