JP6675463B2 - Bidirectional probabilistic natural language rewriting and selection - Google Patents

Bidirectional probabilistic natural language rewriting and selection

Info

Publication number
JP6675463B2
JP6675463B2 JP2018189730A JP2018189730A JP6675463B2 JP 6675463 B2 JP6675463 B2 JP 6675463B2 JP 2018189730 A JP2018189730 A JP 2018189730A JP 2018189730 A JP2018189730 A JP 2018189730A JP 6675463 B2 JP6675463 B2 JP 6675463B2
Authority
JP
Japan
Prior art keywords
token
rewrite
tokens
probability
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2018189730A
Other languages
English (en)
Japanese (ja)
Other versions
JP2019070799A5 (ja)
JP2019070799A (ja)
Inventor
Luke Lefebure
Pranav Singh
Original Assignee
SoundHound, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SoundHound, Inc.
Publication of JP2019070799A
Publication of JP2019070799A5
Application granted
Publication of JP6675463B2
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2453 Query optimisation
    • G06F16/24534 Query rewriting; Transformation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3338 Query expansion
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/211 Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/253 Grammatical analysis; Style critique
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/55 Rule-based translation
    • G06F40/56 Natural language generation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computing arrangements based on specific mathematical models
    • G06N7/01 Probabilistic graphical models, e.g. probabilistic networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Evolutionary Computation (AREA)
  • Pure & Applied Mathematics (AREA)
  • Computational Mathematics (AREA)
  • Algebra (AREA)
  • Probability & Statistics with Applications (AREA)
  • Mathematical Analysis (AREA)
  • Software Systems (AREA)
  • Mathematical Optimization (AREA)
  • Computing Systems (AREA)
  • Machine Translation (AREA)
JP2018189730A 2017-10-06 2018-10-05 Bidirectional probabilistic natural language rewriting and selection Active JP6675463B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/726,394 2017-10-06
US15/726,394 US10599645B2 (en) 2017-10-06 2017-10-06 Bidirectional probabilistic natural language rewriting and selection

Publications (3)

Publication Number Publication Date
JP2019070799A (ja) 2019-05-09
JP2019070799A5 (ja) 2020-01-09
JP6675463B2 (ja) 2020-04-01

Family

ID=65992537

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2018189730A Active JP6675463B2 (ja) 2017-10-06 2018-10-05 Bidirectional probabilistic natural language rewriting and selection

Country Status (3)

Country Link
US (1) US10599645B2 (en)
JP (1) JP6675463B2 (ja)
CN (1) CN109635270B (zh)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109325227A (zh) * 2018-09-14 2019-02-12 Beijing ByteDance Network Technology Co., Ltd. Method and apparatus for generating corrected sentences
US11437025B2 (en) * 2018-10-04 2022-09-06 Google Llc Cross-lingual speech recognition
CN112151024B (zh) * 2019-06-28 2023-09-22 SoundHound, Inc. Method and apparatus for generating an edited transcription of speech audio
US11205052B2 (en) 2019-07-02 2021-12-21 Servicenow, Inc. Deriving multiple meaning representations for an utterance in a natural language understanding (NLU) framework
US11886461B2 (en) * 2019-07-31 2024-01-30 Salesforce, Inc. Machine-learnt field-specific standardization
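CN110660384B (zh) * 2019-10-14 2022-03-22 Inner Mongolia University of Technology End-to-end acoustic modeling method for Mongolian heterographic homophones
KR20210044056A (ko) * 2019-10-14 2021-04-22 Samsung Electronics Co., Ltd. Natural language processing method and apparatus using duplicate token embedding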
US11276391B2 (en) * 2020-02-06 2022-03-15 International Business Machines Corporation Generation of matched corpus for language model training
US11373657B2 (en) * 2020-05-01 2022-06-28 Raytheon Applied Signal Technology, Inc. System and method for speaker identification in audio data
US11315545B2 (en) * 2020-07-09 2022-04-26 Raytheon Applied Signal Technology, Inc. System and method for language identification in audio data
US12020697B2 (en) 2020-07-15 2024-06-25 Raytheon Applied Signal Technology, Inc. Systems and methods for fast filtering of audio keyword search
US12387720B2 (en) 2020-11-20 2025-08-12 SoundHound AI IP, LLC. Neural sentence generator for virtual assistants
US11489793B2 (en) 2020-11-22 2022-11-01 International Business Machines Corporation Response qualification monitoring in real-time chats
CN112528980B (zh) * 2020-12-16 2022-02-15 Beijing Huayu Information Technology Co., Ltd. OCR recognition result correction method, and terminal and system thereof
US20220284193A1 (en) * 2021-03-04 2022-09-08 Tencent America LLC Robust dialogue utterance rewriting as sequence tagging
US11847111B2 (en) * 2021-04-09 2023-12-19 Bitdefender IPR Management Ltd. Anomaly detection systems and methods
US11711469B2 (en) * 2021-05-10 2023-07-25 International Business Machines Corporation Contextualized speech to text conversion
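CN113869069B (zh) * 2021-09-10 2024-08-06 Xiamen University Machine translation method based on dynamic selection of decoding paths in a translation tree structure
CN118715561A (zh) * 2021-12-14 2024-09-27 Google LLC Lattice speech corrections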
US12223948B2 (en) * 2022-02-03 2025-02-11 Soundhound, Inc. Token confidence scores for automatic speech recognition
CN115064170B (zh) * 2022-08-17 2022-12-13 Guangzhou Xiaopeng Motors Technology Co., Ltd. Voice interaction method, server, and storage medium
US12394411B2 (en) 2022-10-27 2025-08-19 SoundHound AI IP, LLC. Domain specific neural sentence generator for multi-domain virtual assistants
US20250045523A1 (en) * 2023-08-02 2025-02-06 Mediatek Inc. Execution Methods of a Machine Learning Model
CN120236568A (zh) 2023-12-28 2025-07-01 GM Global Technology Operations LLC Speech recognition system based on occupant intent actions

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6848080B1 (en) * 1999-11-05 2005-01-25 Microsoft Corporation Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
US7822597B2 (en) * 2004-12-21 2010-10-26 Xerox Corporation Bi-dimensional rewriting rules for natural language processing
GB2424742A (en) * 2005-03-31 2006-10-04 Ibm Automatic speech recognition
US20080270110A1 (en) * 2007-04-30 2008-10-30 Yurick Steven J Automatic speech recognition with textual content input
US9552355B2 (en) * 2010-05-20 2017-01-24 Xerox Corporation Dynamic bi-phrases for statistical machine translation
US8762156B2 (en) * 2011-09-28 2014-06-24 Apple Inc. Speech recognition repair using contextual information
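CN103198149B (zh) * 2013-04-23 2017-02-08 Institute of Computing Technology, Chinese Academy of Sciences Query error correction method and system
CN104157285B (zh) * 2013-05-14 2016-01-20 Tencent Technology (Shenzhen) Co., Ltd. Speech recognition method, apparatus, and electronic device
CN105912521A (zh) * 2015-12-25 2016-08-31 Leshi Zhixin Electronic Technology (Tianjin) Co., Ltd. Method and apparatus for parsing speech content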

Also Published As

Publication number Publication date
US10599645B2 (en) 2020-03-24
US20190108257A1 (en) 2019-04-11
CN109635270A (zh) 2019-04-16
JP2019070799A (ja) 2019-05-09
CN109635270B (zh) 2023-03-07

Similar Documents

Publication Publication Date Title
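JP6675463B2 (ja) Bidirectional probabilistic natural language rewriting and selection
CN108124477B (zh) Improving a tokenizer based on pseudo data to process natural language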
Mairesse et al. Stochastic language generation in dialogue using factored language models
US8688435B2 (en) Systems and methods for normalizing input media
US20050154580A1 (en) Automated grammar generator (AGG)
CN109637537B (zh) Method for automatically acquiring labeled data to optimize a custom wake-up model
Păiş et al. Capitalization and punctuation restoration: a survey
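CN113225612B (zh) Subtitle generation method and apparatus, computer-readable storage medium, and electronic device
JP2016529603A (ja) Online speech translation method and apparatus
CN113614825A (zh) Word lattice augmentation for automatic speech recognition
CN106570180A (zh) Artificial intelligence-based voice search method and apparatus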
US10553203B2 (en) Training data optimization for voice enablement of applications
KR101677859B1 (ko) Method for generating a system response using a knowledge base and apparatus for performing the same
CN106354716A (zh) Method and device for converting text
US10565982B2 (en) Training data optimization in a service computing system for voice enablement of applications
CN105912521A (zh) Method and apparatus for parsing speech content
US20200394258A1 (en) Generation of edited transcription for speech audio
Palmer et al. Robust information extraction from automatically generated speech transcriptions
CN113705202A (zh) Search input error correction method and apparatus, electronic device, and storage medium
Fenogenova et al. A general method applicable to the search for anglicisms in russian social network texts
Mekki et al. COTA 2.0: An automatic corrector of Tunisian Arabic social media texts
US11361761B2 (en) Pattern-based statement attribution
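CN116052672A (zh) Punctuation prediction method, apparatus, device, and storage medium
CN112151024B (zh) Method and apparatus for generating an edited transcription of speech audio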
Mckeown System for Cross-Language Information Processing, Translation and Summarization (SCRIPTS)

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20190911

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20191121

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20191121

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20191125

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20200212

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20200310

R150 Certificate of patent or registration of utility model

Ref document number: 6675463

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250