EP3797414A4 - Verfahren und vorrichtung zur spracherkennung in einer umgebung mit mehreren geräten - Google Patents

Verfahren und vorrichtung zur spracherkennung in einer umgebung mit mehreren geräten Download PDF

Info

Publication number
EP3797414A4
EP3797414A4 EP19874900.4A EP19874900A EP3797414A4 EP 3797414 A4 EP3797414 A4 EP 3797414A4 EP 19874900 A EP19874900 A EP 19874900A EP 3797414 A4 EP3797414 A4 EP 3797414A4
Authority
EP
European Patent Office
Prior art keywords
apparatuses
speech recognition
recognition method
environment including
including plurality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP19874900.4A
Other languages
English (en)
French (fr)
Other versions
EP3797414A1 (de
Inventor
Keunseok CHO
Jaeyoung ROH
Jiwon HYUNG
Donghan JANG
Jaewon Lee
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd filed Critical Samsung Electronics Co Ltd
Priority claimed from PCT/KR2019/013903 external-priority patent/WO2020085769A1/en
Publication of EP3797414A1 publication Critical patent/EP3797414A1/de
Publication of EP3797414A4 publication Critical patent/EP3797414A4/de
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/08Use of distortion metrics or a particular distance between probe pattern and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/12Score normalisation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Telephonic Communication Services (AREA)
EP19874900.4A 2018-10-24 2019-10-22 Verfahren und vorrichtung zur spracherkennung in einer umgebung mit mehreren geräten Withdrawn EP3797414A4 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR20180127696 2018-10-24
KR1020190110772A KR20200047311A (ko) 2018-10-24 2019-09-06 복수의 장치들이 있는 환경에서의 음성 인식 방법 및 장치
PCT/KR2019/013903 WO2020085769A1 (en) 2018-10-24 2019-10-22 Speech recognition method and apparatus in environment including plurality of apparatuses

Publications (2)

Publication Number Publication Date
EP3797414A1 EP3797414A1 (de) 2021-03-31
EP3797414A4 true EP3797414A4 (de) 2021-08-25

Family

ID=70733911

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19874900.4A Withdrawn EP3797414A4 (de) 2018-10-24 2019-10-22 Verfahren und vorrichtung zur spracherkennung in einer umgebung mit mehreren geräten

Country Status (3)

Country Link
EP (1) EP3797414A4 (de)
KR (1) KR20200047311A (de)
CN (1) CN112639965A (de)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11915697B2 (en) 2020-11-11 2024-02-27 Samsung Electronics Co., Ltd. Electronic device, system and control method thereof
KR20220099831A (ko) 2021-01-07 2022-07-14 삼성전자주식회사 전자 장치 및 전자 장치에서 사용자 발화 처리 방법

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130073293A1 (en) * 2011-09-20 2013-03-21 Lg Electronics Inc. Electronic device and method for controlling the same
US20170076720A1 (en) * 2015-09-11 2017-03-16 Amazon Technologies, Inc. Arbitration between voice-enabled devices
WO2018067528A1 (en) * 2016-10-03 2018-04-12 Google Llc Device leadership negotiation among voice interface devices
US20180182397A1 (en) * 2016-12-22 2018-06-28 Google Inc. Collaborative voice controlled devices

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130073293A1 (en) * 2011-09-20 2013-03-21 Lg Electronics Inc. Electronic device and method for controlling the same
US20170076720A1 (en) * 2015-09-11 2017-03-16 Amazon Technologies, Inc. Arbitration between voice-enabled devices
WO2018067528A1 (en) * 2016-10-03 2018-04-12 Google Llc Device leadership negotiation among voice interface devices
US20180182397A1 (en) * 2016-12-22 2018-06-28 Google Inc. Collaborative voice controlled devices

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
QIN JIN ET AL: "Far-Field Speaker Recognition", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 2006. ICASSP 2006 PROCEEDINGS . 2006 IEEE INTERNATIONAL CONFERENCE ON TOULOUSE, FRANCE 14-19 MAY 2006, PISCATAWAY, NJ, USA,IEEE, PISCATAWAY, NJ, USA, 14 May 2006 (2006-05-14), pages I, XP031330964, ISBN: 978-1-4244-0469-8, DOI: 10.1109/ICASSP.2006.1660176 *
See also references of WO2020085769A1 *

Also Published As

Publication number Publication date
KR20200047311A (ko) 2020-05-07
CN112639965A (zh) 2021-04-09
EP3797414A1 (de) 2021-03-31

Similar Documents

Publication Publication Date Title
EP3767619A4 (de) Spracherkennung und spracherkennungsmodelltrainingsverfahren und -vorrichtung
EP3501023A4 (de) Spracherkennungsverfahren und -vorrichtung
EP4016330A4 (de) Sprachdialogverarbeitungsverfahren und vorrichtung
EP3479376A4 (de) Spracherkennungsverfahren und vorrichtung auf der basis von sprechererkennung
EP3857546A4 (de) Verfahren und vorrichtung zur verarbeitung von gesprochenen sprachdaten
EP3751569A4 (de) Verfahren und vorrichtung zur trennung der stimmen von mehreren personen
EP3735662A4 (de) Verfahren zur durchführung des lernens eines tiefen neuronalen netzwerks und vorrichtung dafür
EP3373293A4 (de) Spracherkennungsverfahren und -vorrichtung
EP3504703A4 (de) Spracherkennungsverfahren und -vorrichtung
EP3779972A4 (de) Stimmaufweckverfahren und -vorrichtung
EP3933693A4 (de) Objekterkennungsverfahren und -vorrichtung
EP3533052A4 (de) Spracherkennungsverfahren und -vorrichtung
SG11202107826QA (en) Facial recognition method and apparatus
EP3757873A4 (de) Gesichtserkennungsverfahren und -vorrichtung
EP3701521A4 (de) Vorrichtung zur spracherkennung und betriebsverfahren dafür
EP3497696A4 (de) Verfahren und vorrichtung zur sprachverarbeitung
EP3757874A4 (de) Verfahren und vorrichtung zur aktionserkennung
EP3621240A4 (de) Verfahren zum lernen der anzahl von ressourceneinheiten in einem kommunikationsverfahren und zugehörige vorrichtung
EP4064123A4 (de) Texterkennungsverfahren und -apparat
EP3759263A4 (de) Vorrichtung und verfahren zur katalyse
EP3744152A4 (de) Vorrichtung und verfahren für datenschutzbewahrende voiceprint-authentifizierung
EP3667517A4 (de) Verfahren und vorrichtung zur verarbeitung von natürlicher sprache
EP3819810A4 (de) Gesichtserkennungsverfahren und -vorrichtung
EP3837634A4 (de) Verfahren und vorrichtung zur gesichtserkennung
EP3897821A4 (de) Mikrostrom-stimulationstherapiegerät und verfahren

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20201222

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

A4 Supplementary search report drawn up and despatched

Effective date: 20210722

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/22 20060101AFI20210716BHEP

Ipc: G06F 3/16 20060101ALI20210716BHEP

Ipc: G10L 17/00 20130101ALI20210716BHEP

Ipc: G10L 15/32 20130101ALN20210716BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20211105