SG11201802373WA - Method and device for processing question clustering in automatic question and answering system - Google Patents

Method and device for processing question clustering in automatic question and answering system

Info

Publication number
SG11201802373WA
SG11201802373WA SG11201802373WA SG11201802373WA SG11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA
Authority
SG
Singapore
Prior art keywords
question
clustering
feature
clustered
processing
Prior art date
Application number
SG11201802373WA
Inventor
Jianzong Wang
Weiqiang Yuan
Maokun Han
Jing Xiao
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Publication of SG11201802373WA publication Critical patent/SG11201802373WA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Abstract

METHOD AND DEVICE FOR PROCESSING QUESTION CLUSTERING IN AUTOMATIC QUESTION AND ANSWERING SYSTEM The present invention provides a method and a device for processing question clustering in an automatic question and answering system. The method comprises: receiving a clustering request input by a writer; acquiring a question set to be clustered from a database of unanswered questions based on the clustering request; performing feature extraction on the question set to be clustered with a text feature extraction algorithm to output a question feature set; determining whether the question feature set meets a preset splitting condition; performing segmenting clustering on the question feature set with a segmenting clustering algorithm if the preset splitting condition is met to output at least two question feature subsets; updating the question feature subsets to a question feature set, and determining whether the question feature set meets the preset splitting condition; and outputting the question feature set as a clustering class cluster if the preset splitting condition is not met. In the method and device for processing question clustering in the automatic question and answering system, the question set to be clustered may be automatically clustered to help the writer understand question consultation requirements and improve the coverage of the written question and answering pairs. FIG. 1
SG11201802373WA 2016-11-14 2017-08-30 Method and device for processing question clustering in automatic question and answering system SG11201802373WA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201611002092.2A CN107656948B (en) 2016-11-14 2016-11-14 The problems in automatically request-answering system clustering processing method and device
PCT/CN2017/099708 WO2018086401A1 (en) 2016-11-14 2017-08-30 Cluster processing method and device for questions in automatic question and answering system

Publications (1)

Publication Number Publication Date
SG11201802373WA true SG11201802373WA (en) 2018-06-28

Family

ID=61127345

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201802373WA SG11201802373WA (en) 2016-11-14 2017-08-30 Method and device for processing question clustering in automatic question and answering system

Country Status (8)

Country Link
US (1) US20190073416A1 (en)
EP (1) EP3540612A4 (en)
JP (1) JP6634515B2 (en)
KR (1) KR102113413B1 (en)
CN (1) CN107656948B (en)
AU (1) AU2017329098B2 (en)
SG (1) SG11201802373WA (en)
WO (1) WO2018086401A1 (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108804567A (en) * 2018-05-22 2018-11-13 平安科技(深圳)有限公司 Improve method, equipment, storage medium and the device of intelligent customer service response rate
CN109002434A (en) * 2018-05-31 2018-12-14 青岛理工大学 Customer service question and answer matching process, server and storage medium
CN109189901B (en) * 2018-08-09 2021-05-18 北京中关村科金技术有限公司 Method for automatically discovering new classification and corresponding corpus in intelligent customer service system
CN109145118B (en) * 2018-09-06 2021-01-26 北京京东尚科信息技术有限公司 Information management method and device
CN110110084A (en) * 2019-04-23 2019-08-09 北京科技大学 The recognition methods of high quality user-generated content
CN110728298A (en) * 2019-09-05 2020-01-24 北京三快在线科技有限公司 Multi-task classification model training method, multi-task classification method and device
CN110767224B (en) * 2019-10-15 2020-08-07 上海云从企业发展有限公司 Service management method, system, equipment and medium based on characteristic right level
CN111046158B (en) * 2019-12-13 2020-12-15 腾讯科技(深圳)有限公司 Question-answer matching method, model training method, device, equipment and storage medium
CN111191687B (en) * 2019-12-14 2023-02-10 贵州电网有限责任公司 Power communication data clustering method based on improved K-means algorithm
CN111259154B (en) * 2020-02-07 2021-04-13 腾讯科技(深圳)有限公司 Data processing method and device, computer equipment and storage medium
CN111309881A (en) * 2020-02-11 2020-06-19 深圳壹账通智能科技有限公司 Method and device for processing unknown questions in intelligent question answering, computer equipment and medium
CN111352988B (en) * 2020-02-29 2023-05-23 重庆百事得大牛机器人有限公司 Big data warehouse storage, analysis and extraction system aiming at legal information
CN111813905A (en) * 2020-06-17 2020-10-23 平安科技(深圳)有限公司 Corpus generation method and device, computer equipment and storage medium
KR102445841B1 (en) * 2020-10-16 2022-09-22 성균관대학교산학협력단 Medical Chatbot System Using Multiple Search Methods
CN112650841A (en) * 2020-12-07 2021-04-13 北京有竹居网络技术有限公司 Information processing method and device and electronic equipment
CN112995719B (en) * 2021-04-21 2021-07-27 平安科技(深圳)有限公司 Bullet screen text-based problem set acquisition method and device and computer equipment
CN113220853B (en) * 2021-05-12 2022-10-04 燕山大学 Automatic generation method and system for legal questions

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675159B1 (en) * 2000-07-27 2004-01-06 Science Applic Int Corp Concept-based search and retrieval system
JP4081065B2 (en) * 2004-10-22 2008-04-23 クオリカ株式会社 FAQ data creation apparatus, method, and program
SG138575A1 (en) * 2006-06-23 2008-01-28 Colorzip Media Inc Method of classifying colors of color based image code
CN101308496A (en) * 2008-07-04 2008-11-19 沈阳格微软件有限责任公司 Large scale text data external clustering method and system
CN101477563B (en) * 2009-01-21 2010-11-10 北京百问百答网络技术有限公司 Short text clustering method and system, and its data processing device
CN101599071B (en) * 2009-07-10 2012-04-18 华中科技大学 Automatic extraction method of conversation text topic
CN101630312A (en) * 2009-08-19 2010-01-20 腾讯科技(深圳)有限公司 Clustering method for question sentences in question-and-answer platform and system thereof
JP5574842B2 (en) * 2010-06-21 2014-08-20 株式会社野村総合研究所 FAQ candidate extraction system and FAQ candidate extraction program
US9230009B2 (en) * 2013-06-04 2016-01-05 International Business Machines Corporation Routing of questions to appropriately trained question and answer system pipelines using clustering
CN103559175B (en) * 2013-10-12 2016-08-10 华南理工大学 A kind of Spam Filtering System based on cluster and method
CN103699695B (en) * 2014-01-14 2017-02-01 吉林大学 Centroid method-based self-adaption text clustering algorithm
WO2015151162A1 (en) * 2014-03-31 2015-10-08 楽天株式会社 Similarity calculation system, similarity calculation method, and program
CN104142918B (en) * 2014-07-31 2017-04-05 天津大学 Short text clustering and focus subject distillation method based on TF IDF features
US10387430B2 (en) * 2015-02-26 2019-08-20 International Business Machines Corporation Geometry-directed active question selection for question answering systems
KR101720972B1 (en) * 2015-04-16 2017-03-30 주식회사 플런티코리아 Recommendation Reply Apparatus and Method
CN105975460A (en) * 2016-05-30 2016-09-28 上海智臻智能网络科技股份有限公司 Question information processing method and device

Also Published As

Publication number Publication date
AU2017329098A1 (en) 2018-05-31
JP6634515B2 (en) 2020-01-22
KR20180077261A (en) 2018-07-06
CN107656948A (en) 2018-02-02
WO2018086401A1 (en) 2018-05-17
CN107656948B (en) 2019-05-07
EP3540612A4 (en) 2020-06-17
AU2017329098B2 (en) 2020-01-23
KR102113413B1 (en) 2020-05-21
EP3540612A1 (en) 2019-09-18
JP2019504371A (en) 2019-02-14
US20190073416A1 (en) 2019-03-07

Similar Documents

Publication Publication Date Title
SG11201802373WA (en) Method and device for processing question clustering in automatic question and answering system
CO2017011540A2 (en) Automatic extraction of commitments and requests for communications and content
AU2016409886A1 (en) Intelligent list reading
AU2018323509A1 (en) Method and system for characterization for female reproductive system-related conditions associated with microorganisms
WO2019118469A3 (en) Methods and systems for management of media content associated with message context on mobile computing devices
EP3767620A3 (en) Speech endpointing based on word comparisons
PH12017550118A1 (en) Management of commitments and requests extracted from communications and content
MX367096B (en) Discriminating ambiguous expressions to enhance user experience.
MX2019000222A (en) Systems and methods for identifying matching content.
NZ700273A (en) Negative example (anti-word) based performance improvement for speech recognition
MX2017014355A (en) System and method for extracting and sharing application-related user data.
GB2574969A (en) Systems and methods of matching style attributes
EP3154055A3 (en) Dynamic threshold for speaker verification
EP4280210A3 (en) Hotword detection on multiple devices
EP2787449A3 (en) Text data processing method and corresponding electronic device
PH12018501016A1 (en) Information recommendation method and apparatus
IN2014MU00919A (en)
WO2015138497A3 (en) Systems and methods for rapid data analysis
GB2509667A (en) System & method for analyzing conceptually-related portions of text
SG11201810237YA (en) Method and device for creating underwriting decision tree, computer device and storage medium
GB2556283A (en) Defect discrimination apparatus, methods, and systems
GB2501633A (en) A voice based system and method for data input
SG10201610585WA (en) Passsword management system and process
EP2892051A3 (en) Apparatus and method for structuring contents of meeting
MX2019006981A (en) Livestock biosecurity system and method of use.