EP3965101C0 - Spracherkennungsverfahren, -vorrichtung und -gerät sowie computerlesbares speichermedium - Google Patents

Spracherkennungsverfahren, -vorrichtung und -gerät sowie computerlesbares speichermedium

Info

Publication number
EP3965101C0
EP3965101C0 EP20814489.9A EP20814489A EP3965101C0 EP 3965101 C0 EP3965101 C0 EP 3965101C0 EP 20814489 A EP20814489 A EP 20814489A EP 3965101 C0 EP3965101 C0 EP 3965101C0
Authority
EP
European Patent Office
Prior art keywords
computer
storage medium
readable storage
speech recognition
recognition method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP20814489.9A
Other languages
English (en)
French (fr)
Other versions
EP3965101B1 (de
EP3965101A1 (de
EP3965101A4 (de
Inventor
Weiran Nie
Fuliang Weng
youjia Huang
Hai Yu
Shumang Hu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yinwang Intelligenttechnologies Co Ltd
Original Assignee
Shenzhen Yinwang Intelligent Technologies Co Ltd
Shenzhen Yinwang Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yinwang Intelligent Technologies Co Ltd, Shenzhen Yinwang Intelligent Technology Co Ltd filed Critical Shenzhen Yinwang Intelligent Technologies Co Ltd
Publication of EP3965101A1 publication Critical patent/EP3965101A1/de
Publication of EP3965101A4 publication Critical patent/EP3965101A4/de
Application granted granted Critical
Publication of EP3965101C0 publication Critical patent/EP3965101C0/de
Publication of EP3965101B1 publication Critical patent/EP3965101B1/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193Formal grammars, e.g. finite state automata, context free grammars or word networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
EP20814489.9A 2019-05-31 2020-03-16 Spracherkennungsverfahren, -vorrichtung und -gerät sowie computerlesbares speichermedium Active EP3965101B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910470966.4A CN112017642B (zh) 2019-05-31 2019-05-31 语音识别的方法、装置、设备及计算机可读存储介质
PCT/CN2020/079522 WO2020238341A1 (zh) 2019-05-31 2020-03-16 语音识别的方法、装置、设备及计算机可读存储介质

Publications (4)

Publication Number Publication Date
EP3965101A1 EP3965101A1 (de) 2022-03-09
EP3965101A4 EP3965101A4 (de) 2022-06-29
EP3965101C0 true EP3965101C0 (de) 2025-08-27
EP3965101B1 EP3965101B1 (de) 2025-08-27

Family

ID=73501103

Family Applications (1)

Application Number Title Priority Date Filing Date
EP20814489.9A Active EP3965101B1 (de) 2019-05-31 2020-03-16 Spracherkennungsverfahren, -vorrichtung und -gerät sowie computerlesbares speichermedium

Country Status (5)

Country Link
US (1) US12087289B2 (de)
EP (1) EP3965101B1 (de)
JP (1) JP7343087B2 (de)
CN (2) CN112017642B (de)
WO (1) WO2020238341A1 (de)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112331210B (zh) * 2021-01-05 2021-05-18 太极计算机股份有限公司 一种语音识别装置
US11984125B2 (en) * 2021-04-23 2024-05-14 Cisco Technology, Inc. Speech recognition using on-the-fly-constrained language model per utterance
US12300223B2 (en) * 2022-01-04 2025-05-13 Sap Se Support for syntax analysis during processing instructions for execution
CN114882886B (zh) * 2022-04-27 2024-10-01 卡斯柯信号有限公司 Ctc仿真实训语音识别处理方法、存储介质和电子设备
CN115810359A (zh) * 2022-09-28 2023-03-17 海尔优家智能科技(北京)有限公司 语音的识别方法和装置、存储介质及电子装置
CN117112065B (zh) * 2023-08-30 2024-06-25 北京百度网讯科技有限公司 大模型插件调用方法、装置、设备及介质

Family Cites Families (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5384892A (en) * 1992-12-31 1995-01-24 Apple Computer, Inc. Dynamic language model for speech recognition
US6754626B2 (en) * 2001-03-01 2004-06-22 International Business Machines Corporation Creating a hierarchical tree of language models for a dialog system based on prompt and dialog context
US7328155B2 (en) * 2002-09-25 2008-02-05 Toyota Infotechnology Center Co., Ltd. Method and system for speech recognition using grammar weighted based upon location information
US20040148170A1 (en) * 2003-01-23 2004-07-29 Alejandro Acero Statistical classifiers for spoken language understanding and command/control scenarios
JP3991914B2 (ja) * 2003-05-08 2007-10-17 日産自動車株式会社 移動体用音声認識装置
US7228278B2 (en) * 2004-07-06 2007-06-05 Voxify, Inc. Multi-slot dialog systems and methods
JP2006023345A (ja) * 2004-07-06 2006-01-26 Alpine Electronics Inc テレビ画像自動キャプチャー方法及び装置
US7716056B2 (en) 2004-09-27 2010-05-11 Robert Bosch Corporation Method and system for interactive conversational dialogue for cognitively overloaded device users
JP4846336B2 (ja) * 2005-10-21 2011-12-28 株式会社ユニバーサルエンターテインメント 会話制御装置
JP5149737B2 (ja) * 2008-08-20 2013-02-20 株式会社ユニバーサルエンターテインメント 自動会話システム、並びに会話シナリオ編集装置
US8239129B2 (en) 2009-07-27 2012-08-07 Robert Bosch Gmbh Method and system for improving speech recognition accuracy by use of geographic information
US8990085B2 (en) * 2009-09-30 2015-03-24 At&T Intellectual Property I, L.P. System and method for handling repeat queries due to wrong ASR output by modifying an acoustic, a language and a semantic model
KR20100012051A (ko) * 2010-01-12 2010-02-04 주식회사 다날 스타 음성 메시지 청취 시스템
US8938391B2 (en) * 2011-06-12 2015-01-20 Microsoft Corporation Dynamically adding personalization features to language models for voice search
US9082403B2 (en) * 2011-12-15 2015-07-14 Microsoft Technology Licensing, Llc Spoken utterance classification training for a speech recognition system
CN105027197B (zh) * 2013-03-15 2018-12-14 苹果公司 训练至少部分语音命令系统
JP6280342B2 (ja) * 2013-10-22 2018-02-14 株式会社Nttドコモ 機能実行指示システム及び機能実行指示方法
US9286892B2 (en) * 2014-04-01 2016-03-15 Google Inc. Language modeling in speech recognition
US10460720B2 (en) * 2015-01-03 2019-10-29 Microsoft Technology Licensing, Llc. Generation of language understanding systems and methods
CN105529030B (zh) * 2015-12-29 2020-03-03 百度在线网络技术(北京)有限公司 语音识别处理方法和装置
CN105590626B (zh) * 2015-12-29 2020-03-03 百度在线网络技术(北京)有限公司 持续语音人机交互方法和系统
CN105632495B (zh) * 2015-12-30 2019-07-05 百度在线网络技术(北京)有限公司 语音识别方法和装置
US10832664B2 (en) * 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US10217458B2 (en) * 2016-09-23 2019-02-26 Intel Corporation Technologies for improved keyword spotting
CN106486120B (zh) * 2016-10-21 2019-11-12 上海智臻智能网络科技股份有限公司 交互式语音应答方法及应答系统
CN106448670B (zh) * 2016-10-21 2019-11-19 竹间智能科技(上海)有限公司 基于深度学习和强化学习的自动回复对话系统
CN107240394A (zh) * 2017-06-14 2017-10-10 北京策腾教育科技有限公司 一种动态自适应语音分析技术以用于人机口语考试的方法及系统
KR20190004495A (ko) * 2017-07-04 2019-01-14 삼성에스디에스 주식회사 챗봇을 이용한 태스크 처리 방법, 장치 및 시스템
US10083006B1 (en) * 2017-09-12 2018-09-25 Google Llc Intercom-style communication using multiple computing devices
CN108735215A (zh) * 2018-06-07 2018-11-02 爱驰汽车有限公司 车载语音交互系统、方法、设备和存储介质
CN109003611B (zh) * 2018-09-29 2022-05-27 阿波罗智联(北京)科技有限公司 用于车辆语音控制的方法、装置、设备和介质
US11004449B2 (en) * 2018-11-29 2021-05-11 International Business Machines Corporation Vocal utterance based item inventory actions
CN109616108B (zh) * 2018-11-29 2022-05-31 出门问问创新科技有限公司 多轮对话交互处理方法、装置、电子设备及存储介质
US10997968B2 (en) * 2019-04-30 2021-05-04 Microsofttechnology Licensing, Llc Using dialog context to improve language understanding

Also Published As

Publication number Publication date
US12087289B2 (en) 2024-09-10
US20220093087A1 (en) 2022-03-24
JP2022534242A (ja) 2022-07-28
CN118379989A (zh) 2024-07-23
WO2020238341A1 (zh) 2020-12-03
EP3965101B1 (de) 2025-08-27
CN112017642A (zh) 2020-12-01
CN112017642B (zh) 2024-04-26
JP7343087B2 (ja) 2023-09-12
EP3965101A1 (de) 2022-03-09
EP3965101A4 (de) 2022-06-29

Similar Documents

Publication Publication Date Title
EP3770905C0 (de) Spracherkennungsverfahren, gerät und vorrichtung sowie speichermedium
EP4156176C0 (de) Spracherkennungsverfahren, -vorrichtung und -vorrichtung sowie speichermedium
EP4053835A4 (de) Spracherkennungsverfahren und -gerät sowie vorrichtung sowie speichermedium
EP4044175C0 (de) Spracherkennungsverfahren und -gerät und computerlesbares speichermedium
EP3937165C0 (de) Sprachsyntheseverfahren und -vorrichtung sowie computerlesbares speichermedium
EP3965101C0 (de) Spracherkennungsverfahren, -vorrichtung und -gerät sowie computerlesbares speichermedium
EP4152052A4 (de) Standortbestimmungsverfahren, -vorrichtung und -system sowie computerlesbares speichermedium
EP4425484A4 (de) Spracherkennungsverfahren und -vorrichtung, vorrichtung und speichermedium
SG11201912620YA (en) Voiceprint recognition method, device, terminal apparatus and storage medium
EP3648099A4 (de) Spracherkennungsverfahren, vorrichtung, einrichtung und speichermedium
SG11202001627XA (en) Speech recognition method, apparatus, and computer readable storage medium
EP4068280C0 (de) Spracherkennungsfehlerkorrekturverfahren, zugehörige vorrichtungen und lesbares speichermedium
EP4401074A4 (de) Spracherkennungsverfahren, -vorrichtung und -vorrichtung sowie speichermedium
KR102351008B9 (ko) 감정 인식 장치 및 감정 인식 방법
EP3193328A4 (de) Verfahren und vorrichtung zur durchführung von spracherkennung mit einem grammatikmodell
EP3584790A4 (de) Stimmabdruckerkennungsverfahren, vorrichtung, speichermedium und hintergrundserver
EP3806505C0 (de) Kommunikationsverfahren und -vorrichtung sowie speichermedium
KR102244013B9 (ko) 얼굴 인식 방법 및 장치
EP4060458A4 (de) Gestenerkennungsverfahren und -vorrichtung sowie speichermedium
EP3839944A4 (de) Sprachverarbeitungsverfahren und -vorrichtung sowie computerspeichermedium
EP4242912A4 (de) Bilderkennungsverfahren, -vorrichtung und -vorrichtung sowie computerlesbares speichermedium
SG11202107826QA (en) Facial recognition method and apparatus
SG11202101838VA (en) Speech recognition method, system and storage medium
EP3757874A4 (de) Verfahren und vorrichtung zur aktionserkennung
EP3576016A4 (de) Gesichtserkennungsverfahren und -vorrichtung sowie mobiles endgerät und speichermedium

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20211202

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

A4 Supplementary search report drawn up and despatched

Effective date: 20220601

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/04 20130101ALN20220525BHEP

Ipc: G10L 15/193 20130101ALI20220525BHEP

Ipc: G10L 15/22 20060101ALI20220525BHEP

Ipc: G10L 15/18 20130101AFI20220525BHEP

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20230914

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SHENZHEN YINWANG INTELLIGENTTECHNOLOGIES CO., LTD.

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/08 20060101ALN20250304BHEP

Ipc: G10L 15/04 20130101ALN20250304BHEP

Ipc: G10L 15/193 20130101ALI20250304BHEP

Ipc: G10L 15/22 20060101ALI20250304BHEP

Ipc: G10L 15/18 20130101AFI20250304BHEP

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/08 20060101ALN20250307BHEP

Ipc: G10L 15/04 20130101ALN20250307BHEP

Ipc: G10L 15/193 20130101ALI20250307BHEP

Ipc: G10L 15/22 20060101ALI20250307BHEP

Ipc: G10L 15/18 20130101AFI20250307BHEP

INTG Intention to grant announced

Effective date: 20250321

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

U01 Request for unitary effect filed

Effective date: 20250827

U07 Unitary effect registered

Designated state(s): AT BE BG DE DK EE FI FR IT LT LU LV MT NL PT RO SE SI

Effective date: 20250902