CN110720104A - 一种语音信息处理方法、装置及终端 - Google Patents

一种语音信息处理方法、装置及终端 Download PDF

Info

Publication number
CN110720104A
CN110720104A CN201780091549.8A CN201780091549A CN110720104A CN 110720104 A CN110720104 A CN 110720104A CN 201780091549 A CN201780091549 A CN 201780091549A CN 110720104 A CN110720104 A CN 110720104A
Authority
CN
China
Prior art keywords
event
text information
probability
terminal
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201780091549.8A
Other languages
English (en)
Other versions
CN110720104B (zh
Inventor
隋志成
李艳明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN110720104A publication Critical patent/CN110720104A/zh
Application granted granted Critical
Publication of CN110720104B publication Critical patent/CN110720104B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Telephone Function (AREA)
  • Machine Translation (AREA)

Abstract

本申请实施例提供一种语音信息处理方法、装置及终端,涉及计算机技术领域,可以提高终端执行语义理解结果对应的事件的效率,并节省进行语义理解消耗的网络流量。具体方案包括:终端接收语音信息,将该语音信息转换为文本信息;获取文本信息归属于预设M个事件领域中的每个事件领域的领域概率;获取文本信息归属于N个事件领域中的每个事件领域的先验概率,N≤M;获取文本信息归属于N个事件领域中的每个事件领域的置信度;根据文本信息归属于N个事件领域中的每个事件领域的领域概率、先验概率和置信度,计算文本信息分别归属于N个事件领域的N个概率值;输出根据N个概率值中概率值最高的事件领域对文本信息进行语义理解的语义理解结果。

Description

PCT国内申请,说明书已公开。

Claims (18)

  1. PCT国内申请,权利要求书已公开。
CN201780091549.8A 2017-10-09 2017-10-13 一种语音信息处理方法、装置及终端 Active CN110720104B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN2017109315049 2017-10-09
CN201710931504 2017-10-09
PCT/CN2017/106168 WO2019071607A1 (zh) 2017-10-09 2017-10-13 一种语音信息处理方法、装置及终端

Publications (2)

Publication Number Publication Date
CN110720104A true CN110720104A (zh) 2020-01-21
CN110720104B CN110720104B (zh) 2021-11-19

Family

ID=66101210

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201780091549.8A Active CN110720104B (zh) 2017-10-09 2017-10-13 一种语音信息处理方法、装置及终端

Country Status (5)

Country Link
US (1) US11308965B2 (zh)
EP (1) EP3686758A4 (zh)
CN (1) CN110720104B (zh)
AU (1) AU2017435621B2 (zh)
WO (1) WO2019071607A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20210064594A (ko) * 2019-11-26 2021-06-03 삼성전자주식회사 전자장치 및 그 제어방법
CN112652307A (zh) * 2020-12-02 2021-04-13 北京博瑞彤芸科技股份有限公司 一种语音触发抽奖的方法、系统及电子设备
CN117059095B (zh) * 2023-07-21 2024-04-30 广州市睿翔通信科技有限公司 基于ivr的服务提供方法、装置、计算机设备及存储介质

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080052080A1 (en) * 2005-11-30 2008-02-28 University Of Southern California Emotion Recognition System
US20090222329A1 (en) * 2005-09-14 2009-09-03 Jorey Ramer Syndication of a behavioral profile associated with an availability condition using a monetization platform
US20120191719A1 (en) * 2000-05-09 2012-07-26 Cbs Interactive Inc. Content aggregation method and apparatus for on-line purchasing system
CN104050160A (zh) * 2014-03-12 2014-09-17 北京紫冬锐意语音科技有限公司 一种机器与人工翻译相融合的口语翻译方法和装置
CN104424290A (zh) * 2013-09-02 2015-03-18 佳能株式会社 基于语音的问答系统和用于交互式语音系统的方法
CN105378830A (zh) * 2013-05-31 2016-03-02 朗桑有限公司 音频数据的处理
CN106205607A (zh) * 2015-05-05 2016-12-07 联想(北京)有限公司 语音信息处理方法和语音信息处理装置
CN106407333A (zh) * 2016-09-05 2017-02-15 北京百度网讯科技有限公司 基于人工智能的口语查询识别方法及装置
CN107004140A (zh) * 2014-12-05 2017-08-01 星球智能有限责任公司 文本识别方法和计算机程序产品

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6904405B2 (en) * 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
ES2343786T3 (es) * 2002-03-27 2010-08-10 University Of Southern California Modelo de probabilidad de union basado en frases para traduccion automatica estadistica.
US8015143B2 (en) 2002-05-22 2011-09-06 Estes Timothy W Knowledge discovery agent system and method
CN1719438A (zh) 2004-07-06 2006-01-11 台达电子工业股份有限公司 整合式对话系统及其方法
US7640160B2 (en) * 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US8326599B2 (en) * 2009-04-21 2012-12-04 Xerox Corporation Bi-phrase filtering for statistical machine translation
CN101587493B (zh) * 2009-06-29 2012-07-04 中国科学技术大学 文本分类方法
US8798984B2 (en) 2011-04-27 2014-08-05 Xerox Corporation Method and system for confidence-weighted learning of factored discriminative language models
US20130031476A1 (en) * 2011-07-25 2013-01-31 Coin Emmett Voice activated virtual assistant
US8914288B2 (en) * 2011-09-01 2014-12-16 At&T Intellectual Property I, L.P. System and method for advanced turn-taking for interactive spoken dialog systems
KR20140089862A (ko) 2013-01-07 2014-07-16 삼성전자주식회사 디스플레이 장치 및 그의 제어 방법
US9269354B2 (en) 2013-03-11 2016-02-23 Nuance Communications, Inc. Semantic re-ranking of NLU results in conversational dialogue applications
JP6245846B2 (ja) * 2013-05-30 2017-12-13 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation 音声認識における読み精度を改善するシステム、方法、およびプログラム
KR102222122B1 (ko) * 2014-01-21 2021-03-03 엘지전자 주식회사 감성음성 합성장치, 감성음성 합성장치의 동작방법, 및 이를 포함하는 이동 단말기
EP2933067B1 (en) * 2014-04-17 2019-09-18 Softbank Robotics Europe Method of performing multi-modal dialogue between a humanoid robot and user, computer program product and humanoid robot for implementing said method
US10431214B2 (en) 2014-11-26 2019-10-01 Voicebox Technologies Corporation System and method of determining a domain and/or an action related to a natural language input
US9805713B2 (en) * 2015-03-13 2017-10-31 Google Inc. Addressing missing features in models
US11250218B2 (en) 2015-12-11 2022-02-15 Microsoft Technology Licensing, Llc Personalizing natural language understanding systems
CN105632487B (zh) * 2015-12-31 2020-04-21 北京奇艺世纪科技有限公司 一种语音识别方法和装置
CN105869629B (zh) * 2016-03-30 2018-03-20 乐视控股(北京)有限公司 语音识别方法及装置
CN106095834A (zh) 2016-06-01 2016-11-09 竹间智能科技(上海)有限公司 基于话题的智能对话方法及系统
CN107092593B (zh) * 2017-04-12 2020-11-03 华中师范大学 初等数学分层抽样应用题的句子语义角色识别方法及系统
CN107193973B (zh) 2017-05-25 2021-07-20 百度在线网络技术(北京)有限公司 语义解析信息的领域识别方法及装置、设备及可读介质

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120191719A1 (en) * 2000-05-09 2012-07-26 Cbs Interactive Inc. Content aggregation method and apparatus for on-line purchasing system
US20090222329A1 (en) * 2005-09-14 2009-09-03 Jorey Ramer Syndication of a behavioral profile associated with an availability condition using a monetization platform
US20080052080A1 (en) * 2005-11-30 2008-02-28 University Of Southern California Emotion Recognition System
CN105378830A (zh) * 2013-05-31 2016-03-02 朗桑有限公司 音频数据的处理
CN104424290A (zh) * 2013-09-02 2015-03-18 佳能株式会社 基于语音的问答系统和用于交互式语音系统的方法
CN104050160A (zh) * 2014-03-12 2014-09-17 北京紫冬锐意语音科技有限公司 一种机器与人工翻译相融合的口语翻译方法和装置
CN107004140A (zh) * 2014-12-05 2017-08-01 星球智能有限责任公司 文本识别方法和计算机程序产品
CN106205607A (zh) * 2015-05-05 2016-12-07 联想(北京)有限公司 语音信息处理方法和语音信息处理装置
CN106407333A (zh) * 2016-09-05 2017-02-15 北京百度网讯科技有限公司 基于人工智能的口语查询识别方法及装置

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JAVIER TEJEDO 等: ""Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection"", 《JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY》 *
付跃文 等: ""基于Word Lattice结构的语音识别置信度算法"", 《计算机工程与应用》 *

Also Published As

Publication number Publication date
US20200273463A1 (en) 2020-08-27
AU2017435621A1 (en) 2020-05-07
US11308965B2 (en) 2022-04-19
AU2017435621B2 (en) 2022-01-27
EP3686758A1 (en) 2020-07-29
CN110720104B (zh) 2021-11-19
EP3686758A4 (en) 2020-12-16
WO2019071607A1 (zh) 2019-04-18

Similar Documents

Publication Publication Date Title
EP3092555B1 (en) Audio triggers based on context
US9769634B2 (en) Providing personalized content based on historical interaction with a mobile device
EP3611663A1 (en) Image recognition method, terminal and storage medium
KR101758302B1 (ko) 컨텍스트에 기초한 음성 인식 문법 선택
KR101894499B1 (ko) 상태-종속 쿼리 응답
CN104123937B (zh) 提醒设置方法、装置和系统
US20140025371A1 (en) Method and apparatus for recommending texts
US9754581B2 (en) Reminder setting method and apparatus
US20140337861A1 (en) Method of using use log of portable terminal and apparatus using the same
US20140222435A1 (en) Navigation system with user dependent language mechanism and method of operation thereof
CN109219953B (zh) 一种闹钟提醒方法、电子设备及计算机可读存储介质
CN110720104B (zh) 一种语音信息处理方法、装置及终端
CN112257436A (zh) 文本检测方法及装置
US20200380076A1 (en) Contextual feedback to a natural understanding system in a chat bot using a knowledge model
CN112673367A (zh) 用于预测用户意图的电子设备和方法
CN113838479B (zh) 单词发音评测方法、服务器及系统
CN116403573A (zh) 一种语音识别方法
CN111052050A (zh) 一种输入信息的方法及终端
CN110178130B (zh) 一种生成相册标题的方法及设备
CN111639217A (zh) 一种口语评级方法、终端设备及存储介质
CN112925963B (zh) 数据推荐方法和装置
CN111768788B (zh) 用于转换信息的方法、装置、电子设备和计算机可读介质
CN109101586B (zh) 电影信息获取方法、装置及移动终端
CN114171028A (zh) 一种语音识别方法及装置

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant