CA3156718A1 - Unsupervised induction of user intents from conversational customer service corpora - Google Patents

Unsupervised induction of user intents from conversational customer service corpora

Info

Publication number
CA3156718A1
CA3156718A1 CA3156718A CA3156718A CA3156718A1 CA 3156718 A1 CA3156718 A1 CA 3156718A1 CA 3156718 A CA3156718 A CA 3156718A CA 3156718 A CA3156718 A CA 3156718A CA 3156718 A1 CA3156718 A1 CA 3156718A1
Authority
CA
Canada
Prior art keywords
intent
keywords
conversational
customer service
user intents
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CA3156718A
Other languages
French (fr)
Inventor
Konstantinos GKIKAS
Paraskevi GKOTSOULIA
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Omilia Natural Language Solutions Ltd
Original Assignee
Omilia Natural Language Solutions Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Omilia Natural Language Solutions Ltd filed Critical Omilia Natural Language Solutions Ltd
Publication of CA3156718A1 publication Critical patent/CA3156718A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

A methodology and system are presented for inducing user intent in a corpus and storing this intent in an intent library. To accurately detect intent, the corpus is first cleaned of nonsensical words and symbols and then syntactically analyzed to extract words and dependencies between them, which are then semantically analyzed to select keywords that are indicative of intent, and map the keywords to ordered broad semantic categories of the types of action, modifier and object. Keywords are then converted into embedding vectors whose dimensions are reduced and clustered according to category and order. Relations are calculated for the clusters across the semantic categories and intent is then calculated with the help of intent templates and word dictionaries.
CA3156718A 2019-10-04 2019-10-04 Unsupervised induction of user intents from conversational customer service corpora Pending CA3156718A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2019/076984 WO2021063524A1 (en) 2019-10-04 2019-10-04 Unsupervised induction of user intents from conversational customer service corpora

Publications (1)

Publication Number Publication Date
CA3156718A1 true CA3156718A1 (en) 2021-04-08

Family

ID=68165563

Family Applications (1)

Application Number Title Priority Date Filing Date
CA3156718A Pending CA3156718A1 (en) 2019-10-04 2019-10-04 Unsupervised induction of user intents from conversational customer service corpora

Country Status (3)

Country Link
EP (1) EP4038538A1 (en)
CA (1) CA3156718A1 (en)
WO (1) WO2021063524A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114611524B (en) * 2022-02-08 2023-11-17 马上消费金融股份有限公司 Text error correction method and device, electronic equipment and storage medium
CN115618968B (en) * 2022-12-02 2023-03-31 北京红棉小冰科技有限公司 New idea discovery method and device, electronic device and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7672831B2 (en) * 2005-10-24 2010-03-02 Invention Machine Corporation System and method for cross-language knowledge searching
EP2569716A1 (en) * 2010-03-26 2013-03-20 Virtuoz, Inc. Semantic clustering
US10339122B2 (en) * 2015-09-10 2019-07-02 Conduent Business Services, Llc Enriching how-to guides by linking actionable phrases
US10740566B2 (en) * 2018-03-23 2020-08-11 Servicenow, Inc. Method and system for automated intent mining, classification and disposition

Also Published As

Publication number Publication date
EP4038538A1 (en) 2022-08-10
WO2021063524A1 (en) 2021-04-08

Similar Documents

Publication Publication Date Title
Pranckevičius et al. Application of logistic regression with part-of-the-speech tagging for multi-class text classification
US20180173694A1 (en) Methods and computer systems for named entity verification, named entity verification model training, and phrase expansion
US11144723B2 (en) Method, device, and program for text classification
CA3156718A1 (en) Unsupervised induction of user intents from conversational customer service corpora
US20150178274A1 (en) Speech translation apparatus and speech translation method
US20230214382A1 (en) Systems and methods for interpreting natural language search queries
US20220215167A1 (en) Deep learning based automatic ontology extraction to detect new domain knowledge
EP2988298A1 (en) Response generation method, response generation apparatus, and response generation program
KR20170018620A (en) similar meaning detection method and detection device using same
Nowson et al. XRCE personal language analytics engine for multilingual author profiling
Yuwana et al. On part of speech tagger for Indonesian language
Cho et al. Crf-based disfluency detection using semantic features for german to english spoken language translation
Gao et al. Improving language model size reduction using better pruning criteria
Labbé et al. Is my automatic audio captioning system so bad? spider-max: a metric to consider several caption candidates
Ventura et al. New techniques for relevant word ranking and extraction
US9953652B1 (en) Selective generalization of search queries
Büyük et al. Leveraging the information in in-domain datasets for transformer-based intent detection
Chadha et al. Code switched and code mixed speech recognition for indic languages
CN110321404B (en) Vocabulary entry selection method and device for vocabulary learning, electronic equipment and storage medium
KR102117281B1 (en) Method for generating chatbot utterance using frequency table
JP7131130B2 (en) Classification method, device and program
Chandramouli et al. Unsupervised paradigm for information extraction from transcripts using BERT
Müller et al. Improved modeling of out-of-vocabulary words using morphological classes
Bullard et al. Computational analysis to explore authors’ depiction of characters
Hosier et al. Lightweight domain adaptation: A filtering pipeline to improve accuracy of an Automatic Speech Recognition (ASR) engine