CN109635270B - Bidirectional probabilistic natural language rewriting and selection - Google Patents

Bidirectional probabilistic natural language rewriting and selection

Info

Publication number
CN109635270B
Authority
CN
China
Prior art keywords
token
tokens
sequence
rewrite
new
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811151807.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN109635270A (zh)
Inventor
冯鹿
邢博纳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SoundHound Inc
Original Assignee
SoundHound Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SoundHound Inc
Publication of CN109635270A
Application granted
Publication of CN109635270B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2453 Query optimisation
    • G06F16/24534 Query rewriting; Transformation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3338 Query expansion
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/211 Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/253 Grammatical analysis; Style critique
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/55 Rule-based translation
    • G06F40/56 Natural language generation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computing arrangements based on specific mathematical models
    • G06N7/01 Probabilistic graphical models, e.g. probabilistic networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Evolutionary Computation (AREA)
  • Pure & Applied Mathematics (AREA)
  • Computational Mathematics (AREA)
  • Algebra (AREA)
  • Probability & Statistics with Applications (AREA)
  • Mathematical Analysis (AREA)
  • Software Systems (AREA)
  • Mathematical Optimization (AREA)
  • Computing Systems (AREA)
  • Machine Translation (AREA)
CN201811151807.XA 2017-10-06 2018-09-29 Bidirectional probabilistic natural language rewriting and selection Active CN109635270B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/726,394 2017-10-06
US15/726,394 US10599645B2 (en) 2017-10-06 2017-10-06 Bidirectional probabilistic natural language rewriting and selection

Publications (2)

Publication Number Publication Date
CN109635270A (zh) 2019-04-16
CN109635270B (zh) 2023-03-07

Family

ID=65992537

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811151807.XA Active CN109635270B (zh) 2017-10-06 2018-09-29 Bidirectional probabilistic natural language rewriting and selection

Country Status (3)

Country Link
US (1) US10599645B2 (en)
JP (1) JP6675463B2 (ja)
CN (1) CN109635270B (zh)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109325227A (zh) * 2018-09-14 2019-02-12 北京字节跳动网络技术有限公司 Method and apparatus for generating corrected sentences
US11437025B2 (en) * 2018-10-04 2022-09-06 Google LLC Cross-lingual speech recognition
CN112151024B (zh) * 2019-06-28 2023-09-22 SoundHound Inc Method and apparatus for generating an edited transcription of speech audio
US11205052B2 (en) 2019-07-02 2021-12-21 Servicenow, Inc. Deriving multiple meaning representations for an utterance in a natural language understanding (NLU) framework
US11886461B2 (en) * 2019-07-31 2024-01-30 Salesforce, Inc. Machine-learnt field-specific standardization
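CN110660384B (zh) * 2019-10-14 2022-03-22 内蒙古工业大学 An end-to-end acoustic modeling method for Mongolian heterographic homophones
KR20210044056A (ko) * 2019-10-14 2021-04-22 삼성전자주식회사 Natural language processing method and apparatus using duplicate token embedding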
US11276391B2 (en) * 2020-02-06 2022-03-15 International Business Machines Corporation Generation of matched corpus for language model training
US11373657B2 (en) * 2020-05-01 2022-06-28 Raytheon Applied Signal Technology, Inc. System and method for speaker identification in audio data
US11315545B2 (en) * 2020-07-09 2022-04-26 Raytheon Applied Signal Technology, Inc. System and method for language identification in audio data
US12020697B2 (en) 2020-07-15 2024-06-25 Raytheon Applied Signal Technology, Inc. Systems and methods for fast filtering of audio keyword search
US12387720B2 (en) 2020-11-20 2025-08-12 SoundHound AI IP, LLC. Neural sentence generator for virtual assistants
US11489793B2 (en) 2020-11-22 2022-11-01 International Business Machines Corporation Response qualification monitoring in real-time chats
CN112528980B (zh) * 2020-12-16 2022-02-15 北京华宇信息技术有限公司 OCR recognition result correction method, and terminal and system thereof
US20220284193A1 (en) * 2021-03-04 2022-09-08 Tencent America LLC Robust dialogue utterance rewriting as sequence tagging
US11847111B2 (en) * 2021-04-09 2023-12-19 Bitdefender IPR Management Ltd. Anomaly detection systems and methods
US11711469B2 (en) * 2021-05-10 2023-07-25 International Business Machines Corporation Contextualized speech to text conversion
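CN113869069B (zh) * 2021-09-10 2024-08-06 厦门大学 Machine translation method based on dynamic selection of decoding paths in a translation tree structure
CN118715561A (zh) * 2021-12-14 2024-09-27 Google LLC Lattice speech correction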
US12223948B2 (en) * 2022-02-03 2025-02-11 Soundhound, Inc. Token confidence scores for automatic speech recognition
CN115064170B (zh) * 2022-08-17 2022-12-13 广州小鹏汽车科技有限公司 Voice interaction method, server, and storage medium
US12394411B2 (en) 2022-10-27 2025-08-19 SoundHound AI IP, LLC. Domain specific neural sentence generator for multi-domain virtual assistants
US20250045523A1 (en) * 2023-08-02 2025-02-06 Mediatek Inc. Execution Methods of a Machine Learning Model
CN120236568A (zh) 2023-12-28 2025-07-01 GM Global Technology Operations LLC Speech recognition system based on occupant intent actions

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
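CN1387650A (zh) * 1999-11-05 2002-12-25 Microsoft Corporation Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
CN103198149A (zh) * 2013-04-23 2013-07-10 中国科学院计算技术研究所 A query error correction method and system
US8812316B1 (en) * 2011-09-28 2014-08-19 Apple Inc. Speech recognition repair using contextual information
CN104157285A (zh) * 2013-05-14 2014-11-19 腾讯科技(深圳)有限公司 Speech recognition method, apparatus, and electronic device
CN105912521A (zh) * 2015-12-25 2016-08-31 乐视致新电子科技(天津)有限公司 A method and apparatus for parsing speech content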

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7822597B2 (en) * 2004-12-21 2010-10-26 Xerox Corporation Bi-dimensional rewriting rules for natural language processing
GB2424742A (en) * 2005-03-31 2006-10-04 Ibm Automatic speech recognition
US20080270110A1 (en) * 2007-04-30 2008-10-30 Yurick Steven J Automatic speech recognition with textual content input
US9552355B2 (en) * 2010-05-20 2017-01-24 Xerox Corporation Dynamic bi-phrases for statistical machine translation

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
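ASR error detection using recurrent neural network language model and complementary ASR; Yik-Cheung Tam et al.; 《https://www.researchgate.net/publication/262308966》; 20140515; 2331-2335 *
Word lattice generation algorithm for speech recognition based on forward and backward language models; 李伟 et al.; 《计算机应用》; 20101031; Vol. 30, No. 10; 2563-2566, 2571 *
Research on error handling in speech recognition post-processing; 苏建雷; 《中国优秀硕士学位论文全文数据库 信息科技辑》; 20150815; I136-137 *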

Also Published As

Publication number Publication date
US10599645B2 (en) 2020-03-24
JP6675463B2 (ja) 2020-04-01
US20190108257A1 (en) 2019-04-11
CN109635270A (zh) 2019-04-16
JP2019070799A (ja) 2019-05-09

Similar Documents

Publication Publication Date Title
CN109635270B (zh) Bidirectional probabilistic natural language rewriting and selection
US8527272B2 (en) Method and apparatus for aligning texts
US8571849B2 (en) System and method for enriching spoken language translation with prosodic information
US9501470B2 (en) System and method for enriching spoken language translation with dialog acts
US20170372693A1 (en) System and method for translating real-time speech using segmentation based on conjunction locations
CN109858038B (zh) A method and apparatus for determining text punctuation
Păiş et al. Capitalization and punctuation restoration: a survey
CN112580340A (zh) Word-by-word lyrics generation method and apparatus, storage medium, and electronic device
US20110213610A1 (en) Processor Implemented Systems and Methods for Measuring Syntactic Complexity on Spontaneous Non-Native Speech Data by Using Structural Event Detection
CN113614825A (zh) Word lattice augmentation for automatic speech recognition
CN108899013A (zh) Voice search method and apparatus, and speech recognition system
US20200394258A1 (en) Generation of edited transcription for speech audio
US11900072B1 (en) Quick lookup for speech translation
Ayan et al. “Can you give me another word for hyperbaric?”: Improving speech translation using targeted clarification questions
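CN106649278A (zh) Method and system for expanding a spoken dialogue system corpus
CN117094329A (zh) A speech translation method and apparatus for resolving speech ambiguity
KR20120045906A (ko) Apparatus and method for correcting corpus errors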
US6772116B2 (en) Method of decoding telegraphic speech
Wang Porting the galaxy system to Mandarin Chinese
Prochazka et al. Performance of Czech Speech Recognition with Language Models Created from Public Resources.
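KR102592623B1 (ko) Method for training a real-time simultaneous interpretation model based on external alignment information, and simultaneous interpretation method and system
JP2003162524A (ja) Language processing device
KR20110119478A (ko) Speech recognition apparatus and speech recognition method
CN112151024B (zh) Method and apparatus for generating an edited transcription of speech audio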
Lee et al. Interlingua-based English–Korean Two-way Speech Translation of Doctor–Patient Dialogues with CCLINC

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TG01 Patent term adjustment