SG11201901419QA - Information processing apparatus, speech recognition system, and information processing method - Google Patents

Information processing apparatus, speech recognition system, and information processing method

Info

Publication number
SG11201901419QA
SG11201901419QA SG11201901419QA SG11201901419QA SG11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA
Authority
SG
Singapore
Prior art keywords
speech
information processing
controller
processing apparatus
obtainer
Prior art date
Application number
SG11201901419QA
Inventor
Masayuki Kozuka
Tomoki Ogawa
Yoshihiro Mori
Original Assignee
Panasonic Ip Man Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Panasonic Ip Man Co Ltd filed Critical Panasonic Ip Man Co Ltd
Publication of SG11201901419QA publication Critical patent/SG11201901419QA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/162Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
  • Computer And Data Communications (AREA)

Abstract

INFORMATION PROCESSING APPARATUS, SPEECH RECOGNITION SYSTEM, AND INFORMATION PROCESSING METHOD 5 An information processing apparatus (10b) includes: a speech obtainer (11) which obtains speech of a user; a first controller (12b) which, when the first controller (12b) recognizes that the speech obtained by the speech obtainer (11) is a first activation word, outputs a speech signal corresponding to the first activation word; and a second controller (13b). In the first speech 10 transmission process in which the speech signal of the speech obtained by speech obtainer (11) is transmitted to the VPA cloud server (120b), the first controller (12b) determines whether to output a speech signal corresponding to a second activation word to the second controller (13b) based on a predetermined priority level when the first controller (12b) recognizes that the 15 speech obtained by the speech obtainer indicates the second activation word for causing the second controller (13b) to start a second speech transmission process. Fig. 12 20
SG11201901419QA 2017-08-02 2018-02-02 Information processing apparatus, speech recognition system, and information processing method SG11201901419QA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762540415P 2017-08-02 2017-08-02
PCT/JP2018/003522 WO2019026314A1 (en) 2017-08-02 2018-02-02 Information processing device, voice recognition system, and information processing method

Publications (1)

Publication Number Publication Date
SG11201901419QA true SG11201901419QA (en) 2019-03-28

Family

ID=65232459

Family Applications (2)

Application Number Title Priority Date Filing Date
SG11201901419QA SG11201901419QA (en) 2017-08-02 2018-02-02 Information processing apparatus, speech recognition system, and information processing method
SG11201901441QA SG11201901441QA (en) 2017-08-02 2018-02-02 Information processing apparatus, speech recognition system, and information processing method

Family Applications After (1)

Application Number Title Priority Date Filing Date
SG11201901441QA SG11201901441QA (en) 2017-08-02 2018-02-02 Information processing apparatus, speech recognition system, and information processing method

Country Status (8)

Country Link
US (2) US11145311B2 (en)
EP (2) EP3663905B1 (en)
JP (2) JP6928882B2 (en)
CN (2) CN109601017B (en)
BR (2) BR112019002636A2 (en)
MX (2) MX2019001803A (en)
SG (2) SG11201901419QA (en)
WO (2) WO2019026313A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102498007B1 (en) * 2018-01-08 2023-02-08 엘지전자 주식회사 Laundry Treating Apparatus Control System by Voice Recognition and operating Method the same
JP7412414B2 (en) * 2019-03-22 2024-01-12 三菱重工サーマルシステムズ株式会社 Control device, equipment control system, control method and program
US11501761B2 (en) * 2019-04-05 2022-11-15 Samsung Electronics Co., Ltd. Method and apparatus for speech recognition
JP7236919B2 (en) * 2019-04-12 2023-03-10 三菱電機株式会社 VOICE INPUT DEVICE, VOICE OPERATION SYSTEM, VOICE OPERATION METHOD AND PROGRAM
JP2020178177A (en) * 2019-04-16 2020-10-29 シャープ株式会社 Network system
CN110570859B (en) * 2019-09-20 2022-05-27 Oppo广东移动通信有限公司 Intelligent sound box control method, device and system and storage medium
JP7248564B2 (en) * 2019-12-05 2023-03-29 Tvs Regza株式会社 Information processing device and program
JP7264071B2 (en) * 2020-01-23 2023-04-25 トヨタ自動車株式会社 Information processing system, information processing device, and program
CN111353771A (en) * 2020-02-19 2020-06-30 北京声智科技有限公司 Method, device, equipment and medium for remotely controlling payment
CN111768783B (en) 2020-06-30 2024-04-02 北京百度网讯科技有限公司 Voice interaction control method, device, electronic equipment, storage medium and system
CN114726830A (en) * 2020-12-18 2022-07-08 阿里巴巴集团控股有限公司 Voice service access method, system and vehicle

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005031758A (en) 2003-07-07 2005-02-03 Canon Inc Voice processing device and method
KR100719776B1 (en) * 2005-02-25 2007-05-18 에이디정보통신 주식회사 Portable cord recognition voice output device
JP2009080183A (en) * 2007-09-25 2009-04-16 Panasonic Electric Works Co Ltd Speech recognition control device
JP5658641B2 (en) 2011-09-15 2015-01-28 株式会社Nttドコモ Terminal device, voice recognition program, voice recognition method, and voice recognition system
US9117449B2 (en) * 2012-04-26 2015-08-25 Nuance Communications, Inc. Embedded system for construction of small footprint speech recognition with user-definable constraints
US10381001B2 (en) * 2012-10-30 2019-08-13 Google Technology Holdings LLC Voice control user interface during low-power mode
JP2015011170A (en) * 2013-06-28 2015-01-19 株式会社ATR−Trek Voice recognition client device performing local voice recognition
CN103383134B (en) * 2013-08-06 2016-12-28 四川长虹电器股份有限公司 A kind of intelligent air-conditioning system and air conditioning control method
EP3047481A4 (en) * 2013-09-20 2017-03-01 Amazon Technologies Inc. Local and remote speech processing
US9508345B1 (en) * 2013-09-24 2016-11-29 Knowles Electronics, Llc Continuous voice sensing
CN105280180A (en) * 2014-06-11 2016-01-27 中兴通讯股份有限公司 Terminal control method, device, voice control device and terminal
JP6229071B2 (en) * 2014-10-24 2017-11-08 株式会社ソニー・インタラクティブエンタテインメント Control device, control method, program, and information storage medium
JP2016095383A (en) * 2014-11-14 2016-05-26 株式会社ATR−Trek Voice recognition client device and server-type voice recognition device
TWI525532B (en) 2015-03-30 2016-03-11 Yu-Wei Chen Set the name of the person to wake up the name for voice manipulation
US9996316B2 (en) * 2015-09-28 2018-06-12 Amazon Technologies, Inc. Mediation of wakeword response for multiple devices
JP2017117371A (en) * 2015-12-25 2017-06-29 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America Control method, control device, and program
JP2017138476A (en) 2016-02-03 2017-08-10 ソニー株式会社 Information processing device, information processing method, and program
US10133612B2 (en) 2016-03-17 2018-11-20 Nuance Communications, Inc. Session processing interaction between two or more virtual assistants
US10115400B2 (en) * 2016-08-05 2018-10-30 Sonos, Inc. Multiple voice services
US10685656B2 (en) * 2016-08-31 2020-06-16 Bose Corporation Accessing multiple virtual personal assistants (VPA) from a single device
US10437841B2 (en) 2016-10-10 2019-10-08 Microsoft Technology Licensing, Llc Digital assistant extension automatic ranking and selection
US10127908B1 (en) * 2016-11-11 2018-11-13 Amazon Technologies, Inc. Connected accessory for a voice-controlled device
US10559309B2 (en) * 2016-12-22 2020-02-11 Google Llc Collaborative voice controlled devices
US11164570B2 (en) * 2017-01-17 2021-11-02 Ford Global Technologies, Llc Voice assistant tracking and activation
CA3052978A1 (en) * 2017-02-07 2018-08-16 Lutron Technology Company Llc Audio-based load control system
US10748531B2 (en) * 2017-04-13 2020-08-18 Harman International Industries, Incorporated Management layer for multiple intelligent personal assistant services
US20190013019A1 (en) * 2017-07-10 2019-01-10 Intel Corporation Speaker command and key phrase management for muli -virtual assistant systems

Also Published As

Publication number Publication date
US20190214015A1 (en) 2019-07-11
WO2019026313A1 (en) 2019-02-07
EP3663906A4 (en) 2020-07-22
CN109601016B (en) 2023-07-28
SG11201901441QA (en) 2019-03-28
EP3663905B1 (en) 2020-12-09
JP7033713B2 (en) 2022-03-11
EP3663905A4 (en) 2020-06-17
JPWO2019026314A1 (en) 2020-06-18
JPWO2019026313A1 (en) 2020-05-28
EP3663906A1 (en) 2020-06-10
CN109601017A (en) 2019-04-09
MX2019001807A (en) 2019-06-06
US10803872B2 (en) 2020-10-13
MX2019001803A (en) 2019-07-04
US20190187953A1 (en) 2019-06-20
BR112019002636A2 (en) 2019-05-28
BR112019002607A2 (en) 2019-05-28
CN109601016A (en) 2019-04-09
JP6928882B2 (en) 2021-09-01
EP3663906B1 (en) 2024-04-03
WO2019026314A1 (en) 2019-02-07
US11145311B2 (en) 2021-10-12
EP3663905A1 (en) 2020-06-10
CN109601017B (en) 2024-05-03

Similar Documents

Publication Publication Date Title
SG11201901419QA (en) Information processing apparatus, speech recognition system, and information processing method
EP3754497A8 (en) Data processing method and related products
SG10201707702YA (en) Collaborative Voice Controlled Devices
EP3373292A3 (en) Method for controlling artificial intelligence system that performs multilingual processing
PH12019502894A1 (en) Automated response server device, terminal device, response system, response method, and program
AU2019268131A1 (en) Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal
KR20180084592A (en) System and method for provining sercive in response to voice command
MX2018015642A (en) Information processing device, reception device, and information processing method.
EP3751561A3 (en) Hotword recognition
MX2021013237A (en) Customized output to optimize for user preference in a distributed system.
GB2602211A (en) Account association with device
MX2015009063A (en) Image processing apparatus, control method thereof, and image processing system.
PH12019500347A1 (en) Method for determining change in distance, location prompting method and apparatus and system thereof
EP4280112A3 (en) Data processing method and end-cloud collaboration system
JP2019139211A (en) Voice wake-up method and device
EP4280210A3 (en) Hotword detection on multiple devices
EP4414977A3 (en) Speech endpointing
AU2018212531A8 (en) Data content filter
WO2020050882A3 (en) Hot-word free adaptation of automated assistant function(s)
MX2019011211A (en) Transform method in image coding system and apparatus for same.
WO2019118469A3 (en) Methods and systems for management of media content associated with message context on mobile computing devices
EP4235395A3 (en) Device voice control
SG11201809812WA (en) Method, apparatus and device for voiceprint recognition, and medium
EP3851972A3 (en) Display apparatus and control methods thereof
EP4235648A3 (en) Language model biasing