CA3156718A1 - Unsupervised induction of user intents from conversational customer service corpora - Google Patents
Info
- Publication number
- CA3156718A1
- Authority
- CA
- Canada
- Prior art keywords
- intent
- keywords
- conversational
- customer service
- user intents
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
A method and system are presented for inducing user intents from a corpus and storing them in an intent library. To detect intent accurately, the corpus is first cleaned of nonsensical words and symbols and then syntactically analyzed to extract words and the dependencies between them. These are semantically analyzed to select keywords indicative of intent, and the keywords are mapped to ordered broad semantic categories of three types: action, modifier, and object. The keywords are then converted into embedding vectors, which are reduced in dimension and clustered according to category and order. Relations between the clusters are calculated across the semantic categories, and intents are then derived with the help of intent templates and word dictionaries.
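The flow from cleaned text to a templated intent label can be illustrated with a minimal Python sketch. It is not the patented implementation. A small hand-written lexicon (`CATEGORY_LEXICON`, hypothetical) stands in for the syntactic and semantic analysis, and the embedding, dimensionality reduction, and clustering steps are omitted entirely. All function names and the `action_modifier_object` label format are assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical lexicon standing in for the syntactic and semantic analysis;
# a real system would derive these roles from a dependency parse.
CATEGORY_LEXICON = {
    "cancel": "action", "change": "action", "reset": "action",
    "monthly": "modifier", "online": "modifier",
    "subscription": "object", "password": "object", "order": "object",
}

# Assumed ordering of the broad semantic categories in an intent template.
CATEGORY_ORDER = ("action", "modifier", "object")


def clean(utterance):
    """Lowercase the text and keep only alphabetic tokens."""
    return re.findall(r"[a-z]+", utterance.lower())


def extract_keywords(tokens):
    """Return the first keyword found for each semantic category."""
    found = {}
    for token in tokens:
        category = CATEGORY_LEXICON.get(token)
        if category and category not in found:
            found[category] = token
    return found


def fill_template(keywords):
    """Join keywords in category order; None if action or object is missing."""
    if "action" not in keywords or "object" not in keywords:
        return None
    return "_".join(keywords[c] for c in CATEGORY_ORDER if c in keywords)


def induce_intents(corpus):
    """Count the intent labels induced from a list of utterances."""
    labels = (fill_template(extract_keywords(clean(u))) for u in corpus)
    return Counter(label for label in labels if label)


if __name__ == "__main__":
    corpus = [
        "I want to CANCEL my monthly subscription!!! #@",
        "pls cancel the monthly subscription",
        "How do I reset my password?",
        "hello there",
    ]
    for intent, count in induce_intents(corpus).most_common():
        print(intent, count)
```

In this sketch, utterances without both an action and an object keyword produce no intent, so the greeting in the sample corpus is skipped. A fuller version would group near-synonymous keywords by clustering their embedding vectors before filling the templates.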
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2019/076984 WO2021063524A1 (en) | 2019-10-04 | 2019-10-04 | Unsupervised induction of user intents from conversational customer service corpora |
Publications (1)
Publication Number | Publication Date |
---|---|
CA3156718A1 (en) | 2021-04-08 |
Family
ID=68165563
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA3156718A Pending CA3156718A1 (en) | 2019-10-04 | 2019-10-04 | Unsupervised induction of user intents from conversational customer service corpora |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4038538A1 (en) |
CA (1) | CA3156718A1 (en) |
WO (1) | WO2021063524A1 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114611524B (en) * | 2022-02-08 | 2023-11-17 | 马上消费金融股份有限公司 | Text error correction method and device, electronic equipment and storage medium |
CN115618968B (en) * | 2022-12-02 | 2023-03-31 | 北京红棉小冰科技有限公司 | New idea discovery method and device, electronic device and storage medium |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7672831B2 (en) * | 2005-10-24 | 2010-03-02 | Invention Machine Corporation | System and method for cross-language knowledge searching |
EP2569716A1 (en) * | 2010-03-26 | 2013-03-20 | Virtuoz, Inc. | Semantic clustering |
US10339122B2 (en) * | 2015-09-10 | 2019-07-02 | Conduent Business Services, Llc | Enriching how-to guides by linking actionable phrases |
US10740566B2 (en) * | 2018-03-23 | 2020-08-11 | Servicenow, Inc. | Method and system for automated intent mining, classification and disposition |
2019
- 2019-10-04 WO PCT/EP2019/076984 patent/WO2021063524A1/en unknown
- 2019-10-04 CA CA3156718A patent/CA3156718A1/en active Pending
- 2019-10-04 EP EP19783514.3A patent/EP4038538A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4038538A1 (en) | 2022-08-10 |
WO2021063524A1 (en) | 2021-04-08 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Pranckevičius et al. | Application of logistic regression with part-of-the-speech tagging for multi-class text classification | |
US20180173694A1 (en) | Methods and computer systems for named entity verification, named entity verification model training, and phrase expansion | |
US11144723B2 (en) | Method, device, and program for text classification | |
CA3156718A1 (en) | Unsupervised induction of user intents from conversational customer service corpora | |
US20150178274A1 (en) | Speech translation apparatus and speech translation method | |
US20230214382A1 (en) | Systems and methods for interpreting natural language search queries | |
US20220215167A1 (en) | Deep learning based automatic ontology extraction to detect new domain knowledge | |
EP2988298A1 (en) | Response generation method, response generation apparatus, and response generation program | |
KR20170018620A (en) | similar meaning detection method and detection device using same | |
Nowson et al. | XRCE personal language analytics engine for multilingual author profiling | |
Yuwana et al. | On part of speech tagger for Indonesian language | |
Cho et al. | CRF-based disfluency detection using semantic features for German to English spoken language translation | |
Gao et al. | Improving language model size reduction using better pruning criteria | |
Labbé et al. | Is my automatic audio captioning system so bad? SPIDEr-max: a metric to consider several caption candidates | |
Ventura et al. | New techniques for relevant word ranking and extraction | |
US9953652B1 (en) | Selective generalization of search queries | |
Büyük et al. | Leveraging the information in in-domain datasets for transformer-based intent detection | |
Chadha et al. | Code switched and code mixed speech recognition for Indic languages | |
CN110321404B (en) | Vocabulary entry selection method and device for vocabulary learning, electronic equipment and storage medium | |
KR102117281B1 (en) | Method for generating chatbot utterance using frequency table | |
JP7131130B2 (en) | Classification method, device and program | |
Chandramouli et al. | Unsupervised paradigm for information extraction from transcripts using BERT | |
Müller et al. | Improved modeling of out-of-vocabulary words using morphological classes | |
Bullard et al. | Computational analysis to explore authors’ depiction of characters | |
Hosier et al. | Lightweight domain adaptation: A filtering pipeline to improve accuracy of an Automatic Speech Recognition (ASR) engine |