SG11201802373WA - Method and device for processing question clustering in automatic question and answering system - Google Patents

Method and device for processing question clustering in automatic question and answering system

Info

Publication number
SG11201802373WA
SG11201802373WA SG11201802373WA SG11201802373WA SG11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA SG 11201802373W A SG11201802373W A SG 11201802373WA
Authority
SG
Singapore
Prior art keywords
question
clustering
feature
clustered
processing
Prior art date
Application number
SG11201802373WA
Other languages
English (en)
Inventor
Jianzong Wang
Weiqiang Yuan
Maokun Han
Jing Xiao
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Publication of SG11201802373WA publication Critical patent/SG11201802373WA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
SG11201802373WA 2016-11-14 2017-08-30 Method and device for processing question clustering in automatic question and answering system SG11201802373WA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201611002092.2A CN107656948B (zh) 2016-11-14 2016-11-14 自动问答系统中的问题聚类处理方法及装置
PCT/CN2017/099708 WO2018086401A1 (zh) 2016-11-14 2017-08-30 自动问答系统中的问题聚类处理方法及装置

Publications (1)

Publication Number Publication Date
SG11201802373WA true SG11201802373WA (en) 2018-06-28

Family

ID=61127345

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201802373WA SG11201802373WA (en) 2016-11-14 2017-08-30 Method and device for processing question clustering in automatic question and answering system

Country Status (8)

Country Link
US (1) US20190073416A1 (ko)
EP (1) EP3540612A4 (ko)
JP (1) JP6634515B2 (ko)
KR (1) KR102113413B1 (ko)
CN (1) CN107656948B (ko)
AU (1) AU2017329098B2 (ko)
SG (1) SG11201802373WA (ko)
WO (1) WO2018086401A1 (ko)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108804567A (zh) * 2018-05-22 2018-11-13 平安科技(深圳)有限公司 提高智能客服应答率的方法、设备、存储介质及装置
CN109002434A (zh) * 2018-05-31 2018-12-14 青岛理工大学 客服问答匹配方法、服务器及存储介质
CN109189901B (zh) * 2018-08-09 2021-05-18 北京中关村科金技术有限公司 一种智能客服系统中自动发现新分类以及对应语料的方法
CN109145118B (zh) * 2018-09-06 2021-01-26 北京京东尚科信息技术有限公司 信息管理方法和装置
CN110110084A (zh) * 2019-04-23 2019-08-09 北京科技大学 高质量用户生成内容的识别方法
CN110728298A (zh) * 2019-09-05 2020-01-24 北京三快在线科技有限公司 多任务分类模型训练方法、多任务分类方法及装置
CN110767224B (zh) * 2019-10-15 2020-08-07 上海云从企业发展有限公司 一种基于特征权级的业务管理方法、系统、设备和介质
CN111046158B (zh) * 2019-12-13 2020-12-15 腾讯科技(深圳)有限公司 问答匹配方法及模型训练方法、装置、设备、存储介质
CN111191687B (zh) * 2019-12-14 2023-02-10 贵州电网有限责任公司 基于改进K-means算法的电力通信数据聚类方法
CN111259154B (zh) * 2020-02-07 2021-04-13 腾讯科技(深圳)有限公司 一种数据处理方法、装置、计算机设备及存储介质
CN111309881A (zh) * 2020-02-11 2020-06-19 深圳壹账通智能科技有限公司 智能问答中未知问题处理方法、装置、计算机设备和介质
CN111352988B (zh) * 2020-02-29 2023-05-23 重庆百事得大牛机器人有限公司 针对法务信息的大数据仓库存储、分析、提取系统
CN111813905B (zh) * 2020-06-17 2024-05-10 平安科技(深圳)有限公司 语料生成方法、装置、计算机设备及存储介质
KR102445841B1 (ko) * 2020-10-16 2022-09-22 성균관대학교산학협력단 다중 검색 방식을 이용한 의료 챗봇 시스템
CN112650841A (zh) * 2020-12-07 2021-04-13 北京有竹居网络技术有限公司 信息处理方法、装置和电子设备
CN112559723B (zh) * 2020-12-28 2024-05-28 广东国粒教育技术有限公司 一种基于深度学习的faq检索式问答构建方法及系统
CN112995719B (zh) * 2021-04-21 2021-07-27 平安科技(深圳)有限公司 基于弹幕文本的问题集获取方法、装置及计算机设备
CN113010664B (zh) * 2021-04-27 2024-06-14 数网金融有限公司 一种数据处理方法、装置及计算机设备
CN113220853B (zh) * 2021-05-12 2022-10-04 燕山大学 一种法律提问自动生成方法及系统

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675159B1 (en) * 2000-07-27 2004-01-06 Science Applic Int Corp Concept-based search and retrieval system
JP4081065B2 (ja) * 2004-10-22 2008-04-23 クオリカ株式会社 Faqデータ作成装置、方法、及びプログラム
SG138575A1 (en) * 2006-06-23 2008-01-28 Colorzip Media Inc Method of classifying colors of color based image code
CN101308496A (zh) * 2008-07-04 2008-11-19 沈阳格微软件有限责任公司 大规模文本数据的外部聚类方法及系统
CN101477563B (zh) * 2009-01-21 2010-11-10 北京百问百答网络技术有限公司 一种短文本聚类的方法、系统及其数据处理装置
CN101599071B (zh) * 2009-07-10 2012-04-18 华中科技大学 对话文本主题的自动提取方法
CN101630312A (zh) * 2009-08-19 2010-01-20 腾讯科技(深圳)有限公司 一种用于问答平台中问句的聚类方法及系统
JP5574842B2 (ja) * 2010-06-21 2014-08-20 株式会社野村総合研究所 Faq候補抽出システムおよびfaq候補抽出プログラム
US9230009B2 (en) * 2013-06-04 2016-01-05 International Business Machines Corporation Routing of questions to appropriately trained question and answer system pipelines using clustering
CN103559175B (zh) * 2013-10-12 2016-08-10 华南理工大学 一种基于聚类的垃圾邮件过滤系统及方法
CN103699695B (zh) * 2014-01-14 2017-02-01 吉林大学 基于中心法的自适应文本聚类算法
US10678765B2 (en) * 2014-03-31 2020-06-09 Rakuten, Inc. Similarity calculation system, method of calculating similarity, and program
CN104142918B (zh) * 2014-07-31 2017-04-05 天津大学 基于tf‑idf特征的短文本聚类以及热点主题提取方法
US10387430B2 (en) * 2015-02-26 2019-08-20 International Business Machines Corporation Geometry-directed active question selection for question answering systems
KR101720972B1 (ko) * 2015-04-16 2017-03-30 주식회사 플런티코리아 답변 추천 장치 및 방법
CN105975460A (zh) * 2016-05-30 2016-09-28 上海智臻智能网络科技股份有限公司 问句信息处理方法及装置

Also Published As

Publication number Publication date
JP6634515B2 (ja) 2020-01-22
AU2017329098A1 (en) 2018-05-31
AU2017329098B2 (en) 2020-01-23
JP2019504371A (ja) 2019-02-14
EP3540612A1 (en) 2019-09-18
KR20180077261A (ko) 2018-07-06
EP3540612A4 (en) 2020-06-17
US20190073416A1 (en) 2019-03-07
KR102113413B1 (ko) 2020-05-21
WO2018086401A1 (zh) 2018-05-17
CN107656948B (zh) 2019-05-07
CN107656948A (zh) 2018-02-02

Similar Documents

Publication Publication Date Title
SG11201802373WA (en) Method and device for processing question clustering in automatic question and answering system
AU2018323509A1 (en) Method and system for characterization for female reproductive system-related conditions associated with microorganisms
AU2016409886A1 (en) Intelligent list reading
WO2019118469A3 (en) Methods and systems for management of media content associated with message context on mobile computing devices
GB2549875A (en) Automated content classification/filtering
MX2019000212A (es) Sistemas y metodos para identificar contenido coincidente.
SG10201802554YA (en) Blockchain-based digital identity management method
PH12017550118A1 (en) Management of commitments and requests extracted from communications and content
MX2017008583A (es) Discriminacion de expresiones ambiguas para mejorar la experiencia del usuario.
MX2019000222A (es) Sistemas y metodos para identificar contenido coincidente.
EP2499582A4 (en) SYSTEM AND METHOD FOR HYBRID PROCESSING IN AN ENVIRONMENT OF TELEPHONE SERVICES IN NATURAL LANGUAGE
NZ700273A (en) Negative example (anti-word) based performance improvement for speech recognition
GB2556850A (en) Sequentially overlaying media content
PH12018501016A1 (en) Information recommendation method and apparatus
EP3154055A3 (en) Dynamic threshold for speaker verification
EP4280210A3 (en) Hotword detection on multiple devices
SG11201810237YA (en) Method and device for creating underwriting decision tree, computer device and storage medium
EP2680258A3 (en) Providing audio-activated resource access for user devices based on speaker voiceprint
GB2509667A (en) System & method for analyzing conceptually-related portions of text
GB2556283A (en) Defect discrimination apparatus, methods, and systems
GB2501633A (en) A voice based system and method for data input
EP2892051A3 (en) Apparatus and method for structuring contents of meeting
MX2019006981A (es) Sistema de bioseguridad de ganado y metodo de uso.
EP2966644A3 (en) Methods and systems for managing speech recognition in a multi-speech system environment
MX2021009164A (es) Dispositivos y metodos de recomendacion de alimento para mascotas.