JPWO2020214844A5 - - Google Patents
Download PDFInfo
- Publication number
- JPWO2020214844A5 JPWO2020214844A5 JP2021562002A JP2021562002A JPWO2020214844A5 JP WO2020214844 A5 JPWO2020214844 A5 JP WO2020214844A5 JP 2021562002 A JP2021562002 A JP 2021562002A JP 2021562002 A JP2021562002 A JP 2021562002A JP WO2020214844 A5 JPWO2020214844 A5 JP WO2020214844A5
- Authority
- JP
- Japan
- Prior art keywords
- voice activity
- pause
- determining
- audio signal
- endpoint
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962836593P | 2019-04-19 | 2019-04-19 | |
US62/836,593 | 2019-04-19 | ||
PCT/US2020/028570 WO2020214844A1 (en) | 2019-04-19 | 2020-04-16 | Identifying input for speech recognition engine |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2022529783A JP2022529783A (ja) | 2022-06-24 |
JPWO2020214844A5 true JPWO2020214844A5 (zh) | 2023-04-24 |
Family
ID=72830867
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2021562002A Pending JP2022529783A (ja) | 2019-04-19 | 2020-04-16 | 発話認識エンジンのための入力の識別 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20200335128A1 (zh) |
EP (1) | EP3956883A4 (zh) |
JP (1) | JP2022529783A (zh) |
CN (1) | CN113994424A (zh) |
WO (1) | WO2020214844A1 (zh) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DK201770427A1 (en) | 2017-05-12 | 2018-12-20 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
CN112513983A (zh) | 2018-06-21 | 2021-03-16 | 奇跃公司 | 可穿戴系统语音处理 |
JP2022522748A (ja) | 2019-03-01 | 2022-04-20 | マジック リープ, インコーポレイテッド | 発話処理エンジンのための入力の決定 |
US11328740B2 (en) | 2019-08-07 | 2022-05-10 | Magic Leap, Inc. | Voice onset detection |
US11749265B2 (en) * | 2019-10-04 | 2023-09-05 | Disney Enterprises, Inc. | Techniques for incremental computer-based natural language understanding |
EP4099318A4 (en) * | 2020-01-31 | 2023-05-10 | Sony Group Corporation | INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD |
US11917384B2 (en) | 2020-03-27 | 2024-02-27 | Magic Leap, Inc. | Method of waking a device using spoken voice commands |
US11984124B2 (en) * | 2020-11-13 | 2024-05-14 | Apple Inc. | Speculative task flow execution |
JP7331025B2 (ja) * | 2021-02-05 | 2023-08-22 | Necパーソナルコンピュータ株式会社 | 学習支援システム、学習支援方法、及びプログラム |
US20230053341A1 (en) * | 2021-08-17 | 2023-02-23 | Google Llc | Enabling natural conversations with soft endpointing for an automated assistant |
CN114898755B (zh) * | 2022-07-14 | 2023-01-17 | 科大讯飞股份有限公司 | 语音处理方法及相关装置、电子设备、存储介质 |
CN117351993B (zh) * | 2023-12-04 | 2024-02-13 | 方图智能(深圳)科技集团股份有限公司 | 一种基于音频分发的音频传输质量评价方法及系统 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB9930731D0 (en) * | 1999-12-22 | 2000-02-16 | Ibm | Voice processing apparatus |
US7607097B2 (en) * | 2003-09-25 | 2009-10-20 | International Business Machines Corporation | Translating emotion to braille, emoticons and other special symbols |
JP4906379B2 (ja) * | 2006-03-22 | 2012-03-28 | 富士通株式会社 | 音声認識装置、音声認識方法、及びコンピュータプログラム |
WO2008067413A2 (en) * | 2006-11-28 | 2008-06-05 | Attune Interactive, Inc. | Training system using an interactive prompt character |
US9583108B2 (en) * | 2011-12-08 | 2017-02-28 | Forrest S. Baker III Trust | Voice detection for automated communication system |
US10522151B2 (en) * | 2015-02-03 | 2019-12-31 | Dolby Laboratories Licensing Corporation | Conference segmentation based on conversational dynamics |
US10186254B2 (en) * | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US9740678B2 (en) * | 2015-06-25 | 2017-08-22 | Intel Corporation | Method and system of automatic speech recognition with dynamic vocabularies |
US20160379638A1 (en) * | 2015-06-26 | 2016-12-29 | Amazon Technologies, Inc. | Input speech quality matching |
US10134425B1 (en) * | 2015-06-29 | 2018-11-20 | Amazon Technologies, Inc. | Direction-based speech endpointing |
US10121471B2 (en) * | 2015-06-29 | 2018-11-06 | Amazon Technologies, Inc. | Language model speech endpointing |
US10269341B2 (en) * | 2015-10-19 | 2019-04-23 | Google Llc | Speech endpointing |
US10289205B1 (en) * | 2015-11-24 | 2019-05-14 | Google Llc | Behind the ear gesture control for a head mountable device |
US20180358021A1 (en) * | 2015-12-23 | 2018-12-13 | Intel Corporation | Biometric information for dialog system |
KR20180055661A (ko) * | 2016-11-16 | 2018-05-25 | 삼성전자주식회사 | 전자 장치 및 그 제어 방법 |
US11151997B2 (en) * | 2017-03-10 | 2021-10-19 | Nippon Telegraph And Telephone Corporation | Dialog system, dialog method, dialog apparatus and program |
US10460728B2 (en) * | 2017-06-16 | 2019-10-29 | Amazon Technologies, Inc. | Exporting dialog-driven applications to digital communication platforms |
EP3486900A1 (en) * | 2017-11-16 | 2019-05-22 | Softbank Robotics Europe | System and method for dialog session management |
EP3901740A1 (en) * | 2018-10-15 | 2021-10-27 | Orcam Technologies Ltd. | Hearing aid systems and methods |
-
2020
- 2020-04-16 US US16/850,965 patent/US20200335128A1/en active Pending
- 2020-04-16 WO PCT/US2020/028570 patent/WO2020214844A1/en active Application Filing
- 2020-04-16 JP JP2021562002A patent/JP2022529783A/ja active Pending
- 2020-04-16 CN CN202080044362.4A patent/CN113994424A/zh active Pending
- 2020-04-16 EP EP20791183.5A patent/EP3956883A4/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220165268A1 (en) | Indicator for voice-based communications | |
US10276164B2 (en) | Multi-speaker speech recognition correction system | |
US11990120B2 (en) | Non-speech input to speech processing system | |
US20200335128A1 (en) | Identifying input for speech recognition engine | |
US10800043B2 (en) | Interaction apparatus and method for determining a turn-taking behavior using multimodel information | |
US10692489B1 (en) | Non-speech input to speech processing system | |
US8762144B2 (en) | Method and apparatus for voice activity detection | |
US11443750B2 (en) | User authentication method and apparatus | |
EP3618063B1 (en) | Voice interaction system, voice interaction method and corresponding program | |
JPWO2020214844A5 (zh) | ||
JP6585733B2 (ja) | 情報処理装置 | |
WO2020140840A1 (zh) | 用于唤醒可穿戴设备的方法及装置 | |
US20230230594A1 (en) | Facial movements wake up wearable | |
WO2020244411A1 (zh) | 基于麦克风信号的语音交互唤醒电子设备、方法和介质 | |
KR20200025226A (ko) | 전자 장치 및 그 제어 방법 | |
JP7023504B2 (ja) | 処理装置、処理方法、及びプログラム | |
WO2021153101A1 (ja) | 情報処理装置、情報処理方法および情報処理プログラム | |
JP2018005122A (ja) | 検出装置、検出方法及び検出プログラム | |
KR20220134347A (ko) | 다화자 훈련 데이터셋에 기초한 음성합성 방법 및 장치 | |
JP7435641B2 (ja) | 制御装置、ロボット、制御方法およびプログラム | |
CN117836823A (zh) | 对检测到的无声语音的破译 | |
JP2022063279A (ja) | 処理装置、処理方法、及びプログラム | |
JP5949634B2 (ja) | 音声合成システム、及び音声合成方法 | |
JP7378770B2 (ja) | 評価装置、評価方法、及び評価プログラム | |
EP4207805A1 (en) | Electronic device and control method thereof |