SG11201901419QA - Information processing apparatus, speech recognition system, and information processing method - Google Patents
Information processing apparatus, speech recognition system, and information processing methodInfo
- Publication number
- SG11201901419QA SG11201901419QA SG11201901419QA SG11201901419QA SG11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA SG 11201901419Q A SG11201901419Q A SG 11201901419QA
- Authority
- SG
- Singapore
- Prior art keywords
- speech
- information processing
- controller
- processing apparatus
- obtainer
- Prior art date
Links
- 230000010365 information processing Effects 0.000 title abstract 5
- 238000003672 processing method Methods 0.000 title abstract 2
- 230000004913 activation Effects 0.000 abstract 4
- 230000005540 biological transmission Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/162—Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Telephonic Communication Services (AREA)
- Telephone Function (AREA)
- Computer And Data Communications (AREA)
Abstract
INFORMATION PROCESSING APPARATUS, SPEECH RECOGNITION SYSTEM, AND INFORMATION PROCESSING METHOD 5 An information processing apparatus (10b) includes: a speech obtainer (11) which obtains speech of a user; a first controller (12b) which, when the first controller (12b) recognizes that the speech obtained by the speech obtainer (11) is a first activation word, outputs a speech signal corresponding to the first activation word; and a second controller (13b). In the first speech 10 transmission process in which the speech signal of the speech obtained by speech obtainer (11) is transmitted to the VPA cloud server (120b), the first controller (12b) determines whether to output a speech signal corresponding to a second activation word to the second controller (13b) based on a predetermined priority level when the first controller (12b) recognizes that the 15 speech obtained by the speech obtainer indicates the second activation word for causing the second controller (13b) to start a second speech transmission process. Fig. 12 20
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762540415P | 2017-08-02 | 2017-08-02 | |
PCT/JP2018/003522 WO2019026314A1 (en) | 2017-08-02 | 2018-02-02 | Information processing device, voice recognition system, and information processing method |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201901419QA true SG11201901419QA (en) | 2019-03-28 |
Family
ID=65232459
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11201901419QA SG11201901419QA (en) | 2017-08-02 | 2018-02-02 | Information processing apparatus, speech recognition system, and information processing method |
SG11201901441QA SG11201901441QA (en) | 2017-08-02 | 2018-02-02 | Information processing apparatus, speech recognition system, and information processing method |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11201901441QA SG11201901441QA (en) | 2017-08-02 | 2018-02-02 | Information processing apparatus, speech recognition system, and information processing method |
Country Status (8)
Country | Link |
---|---|
US (2) | US11145311B2 (en) |
EP (2) | EP3663905B1 (en) |
JP (2) | JP6928882B2 (en) |
CN (2) | CN109601017B (en) |
BR (2) | BR112019002636A2 (en) |
MX (2) | MX2019001803A (en) |
SG (2) | SG11201901419QA (en) |
WO (2) | WO2019026313A1 (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102498007B1 (en) * | 2018-01-08 | 2023-02-08 | 엘지전자 주식회사 | Laundry Treating Apparatus Control System by Voice Recognition and operating Method the same |
JP7412414B2 (en) * | 2019-03-22 | 2024-01-12 | 三菱重工サーマルシステムズ株式会社 | Control device, equipment control system, control method and program |
US11501761B2 (en) * | 2019-04-05 | 2022-11-15 | Samsung Electronics Co., Ltd. | Method and apparatus for speech recognition |
JP7236919B2 (en) * | 2019-04-12 | 2023-03-10 | 三菱電機株式会社 | VOICE INPUT DEVICE, VOICE OPERATION SYSTEM, VOICE OPERATION METHOD AND PROGRAM |
JP2020178177A (en) * | 2019-04-16 | 2020-10-29 | シャープ株式会社 | Network system |
CN110570859B (en) * | 2019-09-20 | 2022-05-27 | Oppo广东移动通信有限公司 | Intelligent sound box control method, device and system and storage medium |
JP7248564B2 (en) * | 2019-12-05 | 2023-03-29 | Tvs Regza株式会社 | Information processing device and program |
JP7264071B2 (en) * | 2020-01-23 | 2023-04-25 | トヨタ自動車株式会社 | Information processing system, information processing device, and program |
CN111353771A (en) * | 2020-02-19 | 2020-06-30 | 北京声智科技有限公司 | Method, device, equipment and medium for remotely controlling payment |
CN111768783B (en) | 2020-06-30 | 2024-04-02 | 北京百度网讯科技有限公司 | Voice interaction control method, device, electronic equipment, storage medium and system |
CN114726830A (en) * | 2020-12-18 | 2022-07-08 | 阿里巴巴集团控股有限公司 | Voice service access method, system and vehicle |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2005031758A (en) | 2003-07-07 | 2005-02-03 | Canon Inc | Voice processing device and method |
KR100719776B1 (en) * | 2005-02-25 | 2007-05-18 | 에이디정보통신 주식회사 | Portable cord recognition voice output device |
JP2009080183A (en) * | 2007-09-25 | 2009-04-16 | Panasonic Electric Works Co Ltd | Speech recognition control device |
JP5658641B2 (en) | 2011-09-15 | 2015-01-28 | 株式会社Nttドコモ | Terminal device, voice recognition program, voice recognition method, and voice recognition system |
US9117449B2 (en) * | 2012-04-26 | 2015-08-25 | Nuance Communications, Inc. | Embedded system for construction of small footprint speech recognition with user-definable constraints |
US10381001B2 (en) * | 2012-10-30 | 2019-08-13 | Google Technology Holdings LLC | Voice control user interface during low-power mode |
JP2015011170A (en) * | 2013-06-28 | 2015-01-19 | 株式会社ATR−Trek | Voice recognition client device performing local voice recognition |
CN103383134B (en) * | 2013-08-06 | 2016-12-28 | 四川长虹电器股份有限公司 | A kind of intelligent air-conditioning system and air conditioning control method |
EP3047481A4 (en) * | 2013-09-20 | 2017-03-01 | Amazon Technologies Inc. | Local and remote speech processing |
US9508345B1 (en) * | 2013-09-24 | 2016-11-29 | Knowles Electronics, Llc | Continuous voice sensing |
CN105280180A (en) * | 2014-06-11 | 2016-01-27 | 中兴通讯股份有限公司 | Terminal control method, device, voice control device and terminal |
JP6229071B2 (en) * | 2014-10-24 | 2017-11-08 | 株式会社ソニー・インタラクティブエンタテインメント | Control device, control method, program, and information storage medium |
JP2016095383A (en) * | 2014-11-14 | 2016-05-26 | 株式会社ATR−Trek | Voice recognition client device and server-type voice recognition device |
TWI525532B (en) | 2015-03-30 | 2016-03-11 | Yu-Wei Chen | Set the name of the person to wake up the name for voice manipulation |
US9996316B2 (en) * | 2015-09-28 | 2018-06-12 | Amazon Technologies, Inc. | Mediation of wakeword response for multiple devices |
JP2017117371A (en) * | 2015-12-25 | 2017-06-29 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America | Control method, control device, and program |
JP2017138476A (en) | 2016-02-03 | 2017-08-10 | ソニー株式会社 | Information processing device, information processing method, and program |
US10133612B2 (en) | 2016-03-17 | 2018-11-20 | Nuance Communications, Inc. | Session processing interaction between two or more virtual assistants |
US10115400B2 (en) * | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
US10685656B2 (en) * | 2016-08-31 | 2020-06-16 | Bose Corporation | Accessing multiple virtual personal assistants (VPA) from a single device |
US10437841B2 (en) | 2016-10-10 | 2019-10-08 | Microsoft Technology Licensing, Llc | Digital assistant extension automatic ranking and selection |
US10127908B1 (en) * | 2016-11-11 | 2018-11-13 | Amazon Technologies, Inc. | Connected accessory for a voice-controlled device |
US10559309B2 (en) * | 2016-12-22 | 2020-02-11 | Google Llc | Collaborative voice controlled devices |
US11164570B2 (en) * | 2017-01-17 | 2021-11-02 | Ford Global Technologies, Llc | Voice assistant tracking and activation |
CA3052978A1 (en) * | 2017-02-07 | 2018-08-16 | Lutron Technology Company Llc | Audio-based load control system |
US10748531B2 (en) * | 2017-04-13 | 2020-08-18 | Harman International Industries, Incorporated | Management layer for multiple intelligent personal assistant services |
US20190013019A1 (en) * | 2017-07-10 | 2019-01-10 | Intel Corporation | Speaker command and key phrase management for muli -virtual assistant systems |
-
2018
- 2018-02-02 SG SG11201901419QA patent/SG11201901419QA/en unknown
- 2018-02-02 JP JP2018568454A patent/JP6928882B2/en active Active
- 2018-02-02 JP JP2018567322A patent/JP7033713B2/en active Active
- 2018-02-02 EP EP18842080.6A patent/EP3663905B1/en active Active
- 2018-02-02 MX MX2019001803A patent/MX2019001803A/en unknown
- 2018-02-02 BR BR112019002636A patent/BR112019002636A2/en unknown
- 2018-02-02 US US16/325,793 patent/US11145311B2/en active Active
- 2018-02-02 US US16/325,844 patent/US10803872B2/en active Active
- 2018-02-02 BR BR112019002607A patent/BR112019002607A2/en unknown
- 2018-02-02 CN CN201880003041.2A patent/CN109601017B/en active Active
- 2018-02-02 MX MX2019001807A patent/MX2019001807A/en unknown
- 2018-02-02 SG SG11201901441QA patent/SG11201901441QA/en unknown
- 2018-02-02 CN CN201880003037.6A patent/CN109601016B/en active Active
- 2018-02-02 WO PCT/JP2018/003521 patent/WO2019026313A1/en unknown
- 2018-02-02 WO PCT/JP2018/003522 patent/WO2019026314A1/en unknown
- 2018-02-02 EP EP18842220.8A patent/EP3663906B1/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20190214015A1 (en) | 2019-07-11 |
WO2019026313A1 (en) | 2019-02-07 |
EP3663906A4 (en) | 2020-07-22 |
CN109601016B (en) | 2023-07-28 |
SG11201901441QA (en) | 2019-03-28 |
EP3663905B1 (en) | 2020-12-09 |
JP7033713B2 (en) | 2022-03-11 |
EP3663905A4 (en) | 2020-06-17 |
JPWO2019026314A1 (en) | 2020-06-18 |
JPWO2019026313A1 (en) | 2020-05-28 |
EP3663906A1 (en) | 2020-06-10 |
CN109601017A (en) | 2019-04-09 |
MX2019001807A (en) | 2019-06-06 |
US10803872B2 (en) | 2020-10-13 |
MX2019001803A (en) | 2019-07-04 |
US20190187953A1 (en) | 2019-06-20 |
BR112019002636A2 (en) | 2019-05-28 |
BR112019002607A2 (en) | 2019-05-28 |
CN109601016A (en) | 2019-04-09 |
JP6928882B2 (en) | 2021-09-01 |
EP3663906B1 (en) | 2024-04-03 |
WO2019026314A1 (en) | 2019-02-07 |
US11145311B2 (en) | 2021-10-12 |
EP3663905A1 (en) | 2020-06-10 |
CN109601017B (en) | 2024-05-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11201901419QA (en) | Information processing apparatus, speech recognition system, and information processing method | |
EP3754497A8 (en) | Data processing method and related products | |
SG10201707702YA (en) | Collaborative Voice Controlled Devices | |
EP3373292A3 (en) | Method for controlling artificial intelligence system that performs multilingual processing | |
PH12019502894A1 (en) | Automated response server device, terminal device, response system, response method, and program | |
AU2019268131A1 (en) | Speech recognition method, speech wakeup apparatus, speech recognition apparatus, and terminal | |
KR20180084592A (en) | System and method for provining sercive in response to voice command | |
MX2018015642A (en) | Information processing device, reception device, and information processing method. | |
EP3751561A3 (en) | Hotword recognition | |
MX2021013237A (en) | Customized output to optimize for user preference in a distributed system. | |
GB2602211A (en) | Account association with device | |
MX2015009063A (en) | Image processing apparatus, control method thereof, and image processing system. | |
PH12019500347A1 (en) | Method for determining change in distance, location prompting method and apparatus and system thereof | |
EP4280112A3 (en) | Data processing method and end-cloud collaboration system | |
JP2019139211A (en) | Voice wake-up method and device | |
EP4280210A3 (en) | Hotword detection on multiple devices | |
EP4414977A3 (en) | Speech endpointing | |
AU2018212531A8 (en) | Data content filter | |
WO2020050882A3 (en) | Hot-word free adaptation of automated assistant function(s) | |
MX2019011211A (en) | Transform method in image coding system and apparatus for same. | |
WO2019118469A3 (en) | Methods and systems for management of media content associated with message context on mobile computing devices | |
EP4235395A3 (en) | Device voice control | |
SG11201809812WA (en) | Method, apparatus and device for voiceprint recognition, and medium | |
EP3851972A3 (en) | Display apparatus and control methods thereof | |
EP4235648A3 (en) | Language model biasing |