JP2023519713A5 - Google Patents

Info

Publication number
JP2023519713A5
Authority
JP
Japan
Prior art keywords
text
utterances
utterance
training set
intent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022559639A
Other languages
English (en)
Japanese (ja)
Other versions
JP7721559B2 (ja)
JP2023519713A (ja)
Filing date
Publication date
Application filed
Priority claimed from PCT/US2020/050342 (WO2021201907A1)
Publication of JP2023519713A
Publication of JP2023519713A5
Priority to JP2025125324A (JP2025170253A)
Application granted
Publication of JP7721559B2
Current legal status: Active
Anticipated expiration

JP2022559639A 2020-03-30 2020-09-11 Noise data augmentation for natural language processing Active JP7721559B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2025125324A JP2025170253A (ja) 2020-03-30 2025-07-28 Noise data augmentation for natural language processing

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202063002066P 2020-03-30 2020-03-30
US63/002,066 2020-03-30
PCT/US2020/050342 WO2021201907A1 (en) 2020-03-30 2020-09-11 Noise data augmentation for natural language processing

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2025125324A Division JP2025170253A (ja) 2020-03-30 2025-07-28 Noise data augmentation for natural language processing

Publications (3)

Publication Number Publication Date
JP2023519713A JP2023519713A (ja) 2023-05-12
JP2023519713A5 JP2023519713A5 2023-06-15
JP7721559B2 JP7721559B2 (ja) 2025-08-12

Family

ID=72659890

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2022559639A Active JP7721559B2 (ja) 2020-03-30 2020-09-11 Noise data augmentation for natural language processing
JP2025125324A Pending JP2025170253A (ja) 2020-03-30 2025-07-28 Noise data augmentation for natural language processing

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2025125324A Pending JP2025170253A (ja) 2020-03-30 2025-07-28 Noise data augmentation for natural language processing

Country Status (5)

Country Link
US (2) US11538457B2
EP (1) EP4128010A1
JP (2) JP7721559B2
CN (1) CN115398436B
WO (1) WO2021201907A1

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118642630A (zh) * 2018-08-21 2024-09-13 谷歌有限责任公司 Method for automated assistant invocation
US11538457B2 (en) * 2020-03-30 2022-12-27 Oracle International Corporation Noise data augmentation for natural language processing
US11556788B2 (en) * 2020-06-15 2023-01-17 International Business Machines Corporation Text-based response environment action selection
US11599721B2 (en) * 2020-08-25 2023-03-07 Salesforce, Inc. Intelligent training set augmentation for natural language processing tasks
DK202170043A1 (en) * 2021-01-29 2022-12-12 A P Moeller Mærsk As A method for autonomous reconciliation of invoice data and related electronic device
US12026471B2 (en) * 2021-04-16 2024-07-02 Accenture Global Solutions Limited Automated generation of chatbot
US12242816B2 (en) * 2021-06-30 2025-03-04 Microsoft Technology Licensing, Llc Task-action prediction engine for a task management system
US12321428B2 (en) * 2021-07-08 2025-06-03 Nippon Telegraph And Telephone Corporation User authentication device, user authentication method, and user authentication computer program
EP4363965A1 (en) * 2021-08-06 2024-05-08 Siemens Aktiengesellschaft Source code synthesis for domain specific languages from natural language text
US12468938B2 (en) * 2021-09-21 2025-11-11 International Business Machines Corporation Training example generation to create new intents for chatbots
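CN114491048B (zh) * 2022-02-16 2025-08-15 北京微播易科技股份有限公司 Data augmentation method, and training method and apparatus for a text classification model
CN115878765B (zh) * 2022-04-18 2024-09-13 北京中关村科金技术有限公司 Method and apparatus for mining debt-collection scripts incorporating intent recognition and noise reduction
CN114881130A (zh) * 2022-04-26 2022-08-09 华北电力大学 Bagging-model-based method for grading relay protection defect texts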
US12451141B2 (en) 2022-06-08 2025-10-21 International Business Machines Corporation Generating multi-turn dialog datasets
US12579448B2 (en) 2022-06-22 2026-03-17 Oracle International Corporation Techniques for positive entity aware augmentation using two-stage augmentation
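CN117668216A (zh) * 2022-08-12 2024-03-08 南方电网大数据服务有限公司 Intent recognition model training method, intent recognition method, and apparatus
CN116150311A (zh) * 2022-08-16 2023-05-23 马上消费金融股份有限公司 Training method for a text matching model, intent recognition method, and apparatus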
US12499385B2 (en) * 2022-08-22 2025-12-16 Oracle International Corporation Adaptive training data augmentation to facilitate training named entity recognition models
CN115909354B (zh) * 2022-11-11 2023-11-10 北京百度网讯科技有限公司 Training method for a text generation model, text acquisition method, and apparatus
US12512089B2 (en) * 2022-12-07 2025-12-30 International Business Machines Corporation Testing cascaded deep learning pipelines comprising a speech-to-text model and a text intent classifier
JP2024098791A (ja) * 2023-01-11 2024-07-24 株式会社東芝 Information processing apparatus, information processing method, and information processing program
US12231378B2 (en) * 2023-06-08 2025-02-18 Sap Se Realtime conversation AI insights and deployment
US20250008021A1 (en) * 2023-06-28 2025-01-02 Jpmorgan Chase Bank, N.A. Systems and methods for artificial intelligence-based coaching using microlearning
US12367342B1 (en) * 2025-01-15 2025-07-22 Conversational AI Ltd Automated analysis of computerized conversational agent conversational data

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110289025A1 (en) * 2010-05-19 2011-11-24 Microsoft Corporation Learning user intent from rule-based training data
US20160055240A1 (en) 2014-08-22 2016-02-25 Microsoft Corporation Orphaned utterance detection system and method
WO2016055240A1 (en) * 2014-10-06 2016-04-14 Zentrum Mikroelektronik Dresden Ag Pulsed linear power converter
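CN105786798B (zh) * 2016-02-25 2018-11-02 上海交通大学 Method for natural language intent understanding in human-computer interaction
KR20180055189A (ko) 2016-11-16 2018-05-25 삼성전자주식회사 Natural language processing method and apparatus, and method and apparatus for training a natural language processing model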
US10510336B2 (en) * 2017-06-12 2019-12-17 International Business Machines Corporation Method, apparatus, and system for conflict detection and resolution for competing intent classifiers in modular conversation system
CN107515857B (zh) 2017-08-31 2020-08-18 科大讯飞股份有限公司 Semantic understanding method and system based on customized skills
US10303978B1 (en) * 2018-03-26 2019-05-28 Clinc, Inc. Systems and methods for intelligently curating machine learning training data and improving machine learning model performance
US10726204B2 (en) * 2018-05-24 2020-07-28 International Business Machines Corporation Training data expansion for natural language classification
US11093707B2 (en) * 2019-01-15 2021-08-17 International Business Machines Corporation Adversarial training data augmentation data for text classifiers
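CN110223674B (zh) * 2019-04-19 2023-05-26 平安科技(深圳)有限公司 Speech corpus training method, apparatus, computer device, and storage medium
CN110457447A (zh) * 2019-05-15 2019-11-15 国网浙江省电力有限公司电力科学研究院 Task-oriented dialogue system for power grids
CN110209791B (zh) * 2019-06-12 2021-03-26 百融云创科技股份有限公司 Multi-turn dialogue intelligent voice interaction system and apparatus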
US11538457B2 (en) * 2020-03-30 2022-12-27 Oracle International Corporation Noise data augmentation for natural language processing

Similar Documents

Publication Publication Date Title
JP2023519713A5
JP2022547631A5
US8589163B2 (en) Adapting language models with a bit mask for a subset of related words
CN105632499B Method and apparatus for optimizing speech recognition results
US12159627B2 (en) Improving custom keyword spotting system accuracy with text-to-speech-based data augmentation
JP2020537224A5
JP2005010691A Speech recognition apparatus, speech recognition method, conversation control apparatus, conversation control method, and programs therefor
JP2017515147A5
Trmal et al. A keyword search system using open source software
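CN105280177A Speech synthesis dictionary creation device, speech synthesizer, and speech synthesis dictionary creation method
TW202020854A Speech recognition system and method thereof, and computer program product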
Le et al. Developing STT and KWS systems using limited language resources.
KR20160080915A Speech recognition method and speech recognition apparatus
Lileikytė et al. Conversational telephone speech recognition for Lithuanian
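WO2019113911A1 Device control method, cloud device, smart device, computer medium, and device
CN115668360A Method and system for building a speech recognition model and for speech processing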
Kipyatkova et al. A study of neural network Russian language models for automatic continuous speech recognition systems
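CN104021117A Language processing method and electronic device
CN108899016B Speech text normalization method, apparatus, device, and readable storage medium
JP6485941B2 Language model generation device, program therefor, and speech recognition device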
Pakoci et al. Language model optimization for a deep neural network based speech recognition system for Serbian
Liu et al. The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation.
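KR20200102309A Speech recognition system using word similarity and method thereof
CN106294310A Tibetan tone prediction method and system
JP2011154061A Dictionary creation device, computer program therefor, and data processing method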