SG11201802373WA - Method and device for processing question clustering in automatic question and answering system
Info
- Publication number
- SG11201802373WA
- Authority
- SG
- Singapore
- Prior art keywords
- question
- clustering
- feature
- clustered
- processing
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/53—Processing of non-Latin text
Abstract
The present invention provides a method and a device for processing question clustering in an automatic question and answering system. The method comprises: receiving a clustering request input by a writer; acquiring, based on the clustering request, a question set to be clustered from a database of unanswered questions; performing feature extraction on the question set to be clustered with a text feature extraction algorithm to output a question feature set; determining whether the question feature set meets a preset splitting condition; if the preset splitting condition is met, performing segmenting clustering on the question feature set with a segmenting clustering algorithm to output at least two question feature subsets; taking each question feature subset as a new question feature set and determining again whether it meets the preset splitting condition; and, if the preset splitting condition is not met, outputting the question feature set as a clustering class cluster. With this method and device, the question set to be clustered can be clustered automatically, which helps the writer understand what users are asking about and improves the coverage of the written question-and-answer pairs. (FIG. 1)
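The loop described in the abstract (extract features, test a splitting condition, split, repeat on each subset) can be sketched roughly as follows. The abstract does not name the feature extraction algorithm, the splitting condition, or the segmenting clustering algorithm, so everything concrete here is an assumption made for illustration: bag-of-words counts as features, a mean cosine similarity to the centroid below `min_cohesion` as the splitting condition, and a two-way k-means-style split as the segmenting step. All function and parameter names are hypothetical.

```python
import math
from collections import Counter


def extract_features(question):
    """Assumed feature extractor: lowercase bag-of-words term counts."""
    return Counter(question.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def centroid(vectors):
    """Sum of the vectors; its direction is all that cosine() needs."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return total


def meets_split_condition(vectors, min_cohesion):
    """Assumed condition: split when mean similarity to the centroid is low."""
    if len(vectors) < 2:
        return False
    c = centroid(vectors)
    return sum(cosine(v, c) for v in vectors) / len(vectors) < min_cohesion


def split_in_two(items, iterations=10):
    """Assumed segmenting step: two-way split around two moving centers."""
    vectors = [f for _, f in items]
    center_a = vectors[0]
    # Seed the second center with the vector least similar to the first.
    center_b = min(vectors[1:], key=lambda v: cosine(center_a, v))
    left, right = [], []
    for _ in range(iterations):
        left = [it for it in items
                if cosine(it[1], center_a) >= cosine(it[1], center_b)]
        right = [it for it in items
                 if cosine(it[1], center_a) < cosine(it[1], center_b)]
        if not left or not right:
            break
        center_a = centroid([f for _, f in left])
        center_b = centroid([f for _, f in right])
    if not left or not right:
        # Fallback so that a split always yields two non-empty subsets.
        half = len(items) // 2
        left, right = items[:half], items[half:]
    return [left, right]


def cluster_questions(questions, min_cohesion=0.5):
    """Split feature sets repeatedly until none meets the split condition."""
    if not questions:
        return []
    pending = [[(q, extract_features(q)) for q in questions]]
    clusters = []
    while pending:
        current = pending.pop()
        if meets_split_condition([f for _, f in current], min_cohesion):
            pending.extend(split_in_two(current))  # subsets are re-checked
        else:
            clusters.append([q for q, _ in current])
    return clusters


if __name__ == "__main__":
    unanswered = [
        "reset my password",
        "reset my password",
        "track my order",
        "track my order",
    ]
    for cluster in cluster_questions(unanswered, min_cohesion=0.9):
        print(cluster)
```

Because each split returns two non-empty subsets that are strictly smaller than their parent, the loop terminates even when the threshold is set very high. A real system would likely use weighted features and a tuned condition in place of these placeholders.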
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611002092.2A CN107656948B (en) | 2016-11-14 | 2016-11-14 | The problems in automatically request-answering system clustering processing method and device |
PCT/CN2017/099708 WO2018086401A1 (en) | 2016-11-14 | 2017-08-30 | Cluster processing method and device for questions in automatic question and answering system |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201802373WA (en) | 2018-06-28 |
Family
ID=61127345
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11201802373WA (en) | 2016-11-14 | 2017-08-30 | Method and device for processing question clustering in automatic question and answering system |
Country Status (8)
Country | Link |
---|---|
US (1) | US20190073416A1 (en) |
EP (1) | EP3540612A4 (en) |
JP (1) | JP6634515B2 (en) |
KR (1) | KR102113413B1 (en) |
CN (1) | CN107656948B (en) |
AU (1) | AU2017329098B2 (en) |
SG (1) | SG11201802373WA (en) |
WO (1) | WO2018086401A1 (en) |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108804567A (en) * | 2018-05-22 | 2018-11-13 | 平安科技(深圳)有限公司 | Improve method, equipment, storage medium and the device of intelligent customer service response rate |
CN109002434A (en) * | 2018-05-31 | 2018-12-14 | 青岛理工大学 | Customer service question and answer matching process, server and storage medium |
CN109189901B (en) * | 2018-08-09 | 2021-05-18 | 北京中关村科金技术有限公司 | Method for automatically discovering new classification and corresponding corpus in intelligent customer service system |
CN109145118B (en) * | 2018-09-06 | 2021-01-26 | 北京京东尚科信息技术有限公司 | Information management method and device |
CN110110084A (en) * | 2019-04-23 | 2019-08-09 | 北京科技大学 | The recognition methods of high quality user-generated content |
CN110728298A (en) * | 2019-09-05 | 2020-01-24 | 北京三快在线科技有限公司 | Multi-task classification model training method, multi-task classification method and device |
CN110767224B (en) * | 2019-10-15 | 2020-08-07 | 上海云从企业发展有限公司 | Service management method, system, equipment and medium based on characteristic right level |
CN111046158B (en) * | 2019-12-13 | 2020-12-15 | 腾讯科技(深圳)有限公司 | Question-answer matching method, model training method, device, equipment and storage medium |
CN111191687B (en) * | 2019-12-14 | 2023-02-10 | 贵州电网有限责任公司 | Power communication data clustering method based on improved K-means algorithm |
CN111259154B (en) * | 2020-02-07 | 2021-04-13 | 腾讯科技(深圳)有限公司 | Data processing method and device, computer equipment and storage medium |
CN111309881A (en) * | 2020-02-11 | 2020-06-19 | 深圳壹账通智能科技有限公司 | Method and device for processing unknown questions in intelligent question answering, computer equipment and medium |
CN111352988B (en) * | 2020-02-29 | 2023-05-23 | 重庆百事得大牛机器人有限公司 | Big data warehouse storage, analysis and extraction system aiming at legal information |
CN111813905A (en) * | 2020-06-17 | 2020-10-23 | 平安科技(深圳)有限公司 | Corpus generation method and device, computer equipment and storage medium |
KR102445841B1 (en) * | 2020-10-16 | 2022-09-22 | 성균관대학교산학협력단 | Medical Chatbot System Using Multiple Search Methods |
CN112650841A (en) * | 2020-12-07 | 2021-04-13 | 北京有竹居网络技术有限公司 | Information processing method and device and electronic equipment |
CN112995719B (en) * | 2021-04-21 | 2021-07-27 | 平安科技(深圳)有限公司 | Bullet screen text-based problem set acquisition method and device and computer equipment |
CN113220853B (en) * | 2021-05-12 | 2022-10-04 | 燕山大学 | Automatic generation method and system for legal questions |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6675159B1 (en) * | 2000-07-27 | 2004-01-06 | Science Applic Int Corp | Concept-based search and retrieval system |
JP4081065B2 (en) * | 2004-10-22 | 2008-04-23 | クオリカ株式会社 | FAQ data creation apparatus, method, and program |
SG138575A1 (en) * | 2006-06-23 | 2008-01-28 | Colorzip Media Inc | Method of classifying colors of color based image code |
CN101308496A (en) * | 2008-07-04 | 2008-11-19 | 沈阳格微软件有限责任公司 | Large scale text data external clustering method and system |
CN101477563B (en) * | 2009-01-21 | 2010-11-10 | 北京百问百答网络技术有限公司 | Short text clustering method and system, and its data processing device |
CN101599071B (en) * | 2009-07-10 | 2012-04-18 | 华中科技大学 | Automatic extraction method of conversation text topic |
CN101630312A (en) * | 2009-08-19 | 2010-01-20 | 腾讯科技(深圳)有限公司 | Clustering method for question sentences in question-and-answer platform and system thereof |
JP5574842B2 (en) * | 2010-06-21 | 2014-08-20 | 株式会社野村総合研究所 | FAQ candidate extraction system and FAQ candidate extraction program |
US9230009B2 (en) * | 2013-06-04 | 2016-01-05 | International Business Machines Corporation | Routing of questions to appropriately trained question and answer system pipelines using clustering |
CN103559175B (en) * | 2013-10-12 | 2016-08-10 | 华南理工大学 | A kind of Spam Filtering System based on cluster and method |
CN103699695B (en) * | 2014-01-14 | 2017-02-01 | 吉林大学 | Centroid method-based self-adaption text clustering algorithm |
WO2015151162A1 (en) * | 2014-03-31 | 2015-10-08 | 楽天株式会社 | Similarity calculation system, similarity calculation method, and program |
CN104142918B (en) * | 2014-07-31 | 2017-04-05 | 天津大学 | Short text clustering and focus subject distillation method based on TF IDF features |
US10387430B2 (en) * | 2015-02-26 | 2019-08-20 | International Business Machines Corporation | Geometry-directed active question selection for question answering systems |
KR101720972B1 (en) * | 2015-04-16 | 2017-03-30 | 주식회사 플런티코리아 | Recommendation Reply Apparatus and Method |
CN105975460A (en) * | 2016-05-30 | 2016-09-28 | 上海智臻智能网络科技股份有限公司 | Question information processing method and device |
-
2016
- 2016-11-14 CN CN201611002092.2A patent/CN107656948B/en active Active
-
2017
- 2017-08-30 AU AU2017329098A patent/AU2017329098B2/en active Active
- 2017-08-30 WO PCT/CN2017/099708 patent/WO2018086401A1/en active Application Filing
- 2017-08-30 JP JP2018513838A patent/JP6634515B2/en active Active
- 2017-08-30 SG SG11201802373WA patent/SG11201802373WA/en unknown
- 2017-08-30 US US16/093,610 patent/US20190073416A1/en not_active Abandoned
- 2017-08-30 EP EP17847762.6A patent/EP3540612A4/en not_active Ceased
- 2017-08-30 KR KR1020187015559A patent/KR102113413B1/en active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
AU2017329098A1 (en) | 2018-05-31 |
JP6634515B2 (en) | 2020-01-22 |
KR20180077261A (en) | 2018-07-06 |
CN107656948A (en) | 2018-02-02 |
WO2018086401A1 (en) | 2018-05-17 |
CN107656948B (en) | 2019-05-07 |
EP3540612A4 (en) | 2020-06-17 |
AU2017329098B2 (en) | 2020-01-23 |
KR102113413B1 (en) | 2020-05-21 |
EP3540612A1 (en) | 2019-09-18 |
JP2019504371A (en) | 2019-02-14 |
US20190073416A1 (en) | 2019-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11201802373WA (en) | Method and device for processing question clustering in automatic question and answering system | |
CO2017011540A2 (en) | Automatic extraction of commitments and requests for communications and content | |
AU2016409886A1 (en) | Intelligent list reading | |
AU2018323509A1 (en) | Method and system for characterization for female reproductive system-related conditions associated with microorganisms | |
WO2019118469A3 (en) | Methods and systems for management of media content associated with message context on mobile computing devices | |
EP3767620A3 (en) | Speech endpointing based on word comparisons | |
PH12017550118A1 (en) | Management of commitments and requests extracted from communications and content | |
MX367096B (en) | Discriminating ambiguous expressions to enhance user experience. | |
MX2019000222A (en) | Systems and methods for identifying matching content. | |
NZ700273A (en) | Negative example (anti-word) based performance improvement for speech recognition | |
MX2017014355A (en) | System and method for extracting and sharing application-related user data. | |
GB2574969A (en) | Systems and methods of matching style attributes | |
EP3154055A3 (en) | Dynamic threshold for speaker verification | |
EP4280210A3 (en) | Hotword detection on multiple devices | |
EP2787449A3 (en) | Text data processing method and corresponding electronic device | |
PH12018501016A1 (en) | Information recommendation method and apparatus | |
IN2014MU00919A (en) | ||
WO2015138497A3 (en) | Systems and methods for rapid data analysis | |
GB2509667A (en) | System & method for analyzing conceptually-related portions of text | |
SG11201810237YA (en) | Method and device for creating underwriting decision tree, computer device and storage medium | |
GB2556283A (en) | Defect discrimination apparatus, methods, and systems | |
GB2501633A (en) | A voice based system and method for data input | |
SG10201610585WA (en) | Passsword management system and process | |
EP2892051A3 (en) | Apparatus and method for structuring contents of meeting | |
MX2019006981A (en) | Livestock biosecurity system and method of use. |