JP2023553867A5 - - Google Patents
Info
- Publication number
- JP2023553867A5 JP2023553867A5 JP2023533713A JP2023533713A JP2023553867A5 JP 2023553867 A5 JP2023553867 A5 JP 2023553867A5 JP 2023533713 A JP2023533713 A JP 2023533713A JP 2023533713 A JP2023533713 A JP 2023533713A JP 2023553867 A5 JP2023553867 A5 JP 2023553867A5
- Authority
- JP
- Japan
- Prior art keywords
- audio
- audio feature
- speaker
- user utterance
- dataset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/115,158 | 2020-12-08 | ||
| US17/115,158 US11626104B2 (en) | 2020-12-08 | 2020-12-08 | User speech profile management |
| PCT/US2021/071617 WO2022126040A1 (en) | 2020-12-08 | 2021-09-28 | User speech profile management |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2023553867A JP2023553867A (ja) | 2023-12-26 |
| JP2023553867A5 true JP2023553867A5 (https=) | 2024-09-05 |
| JP7753363B2 JP7753363B2 (ja) | 2025-10-14 |
Family
ID=78303075
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023533713A Active JP7753363B2 (ja) | 2020-12-08 | 2021-09-28 | ユーザ発話プロファイル管理 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US11626104B2 (https=) |
| EP (1) | EP4260314A1 (https=) |
| JP (1) | JP7753363B2 (https=) |
| KR (1) | KR20230118089A (https=) |
| CN (1) | CN116583899A (https=) |
| WO (1) | WO2022126040A1 (https=) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11929077B2 (en) * | 2019-12-23 | 2024-03-12 | Dts Inc. | Multi-stage speaker enrollment in voice authentication and identification |
| US11462218B1 (en) * | 2020-04-29 | 2022-10-04 | Amazon Technologies, Inc. | Conserving battery while detecting for human voice |
| US12198677B2 (en) * | 2022-05-27 | 2025-01-14 | Tencent America LLC | Techniques for end-to-end speaker diarization with generalized neural speaker clustering |
| KR102516391B1 (ko) * | 2022-09-02 | 2023-04-03 | 주식회사 액션파워 | 음성 구간 길이를 고려하여 오디오에서 음성 구간을 검출하는 방법 |
| CN116364063B (zh) * | 2023-06-01 | 2023-09-05 | 蔚来汽车科技(安徽)有限公司 | 音素对齐方法、设备、驾驶设备和介质 |
| WO2025254947A1 (en) * | 2024-06-04 | 2025-12-11 | Qualcomm Incorporated | Speech profile management |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6424946B1 (en) | 1999-04-09 | 2002-07-23 | International Business Machines Corporation | Methods and apparatus for unknown speaker labeling using concurrent speech recognition, segmentation, classification and clustering |
| WO2005122141A1 (en) * | 2004-06-09 | 2005-12-22 | Canon Kabushiki Kaisha | Effective audio segmentation and classification |
| US7536304B2 (en) * | 2005-05-27 | 2009-05-19 | Porticus, Inc. | Method and system for bio-metric voice print authentication |
| US8630854B2 (en) * | 2010-08-31 | 2014-01-14 | Fujitsu Limited | System and method for generating videoconference transcriptions |
| GB2489489B (en) | 2011-03-30 | 2013-08-21 | Toshiba Res Europ Ltd | A speech processing system and method |
| US9898723B2 (en) * | 2012-12-19 | 2018-02-20 | Visa International Service Association | System and method for voice authentication |
| US9666204B2 (en) * | 2014-04-30 | 2017-05-30 | Qualcomm Incorporated | Voice profile management and speech signal generation |
| WO2016022588A1 (en) * | 2014-08-04 | 2016-02-11 | Flagler Llc | Voice tallying system |
| GB2525464B (en) * | 2015-01-13 | 2016-03-16 | Validsoft Uk Ltd | Authentication method |
| US10373612B2 (en) * | 2016-03-21 | 2019-08-06 | Amazon Technologies, Inc. | Anchored speech detection and speech recognition |
| JP6676009B2 (ja) | 2017-06-23 | 2020-04-08 | 日本電信電話株式会社 | 話者判定装置、話者判定情報生成方法、プログラム |
| WO2019048062A1 (en) | 2017-09-11 | 2019-03-14 | Telefonaktiebolaget Lm Ericsson (Publ) | MANAGING USER PROFILES WITH VOICE COMMAND |
| WO2019203794A1 (en) * | 2018-04-16 | 2019-10-24 | Google Llc | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
| US11398218B1 (en) * | 2018-04-26 | 2022-07-26 | United Services Automobile Association (Usaa) | Dynamic speech output configuration |
| US10991379B2 (en) * | 2018-06-22 | 2021-04-27 | Babblelabs Llc | Data driven audio enhancement |
| EP3627505B1 (en) * | 2018-09-21 | 2023-11-15 | Televic Conference NV | Real-time speaker identification with diarization |
| US11024291B2 (en) * | 2018-11-21 | 2021-06-01 | Sri International | Real-time class recognition for an audio stream |
| US11545156B2 (en) * | 2020-05-27 | 2023-01-03 | Microsoft Technology Licensing, Llc | Automated meeting minutes generation service |
-
2020
- 2020-12-08 US US17/115,158 patent/US11626104B2/en active Active
-
2021
- 2021-09-28 KR KR1020237018503A patent/KR20230118089A/ko active Pending
- 2021-09-28 EP EP21795235.7A patent/EP4260314A1/en active Pending
- 2021-09-28 JP JP2023533713A patent/JP7753363B2/ja active Active
- 2021-09-28 WO PCT/US2021/071617 patent/WO2022126040A1/en not_active Ceased
- 2021-09-28 CN CN202180080295.6A patent/CN116583899A/zh active Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2023553867A5 (https=) | ||
| US8645132B2 (en) | Truly handsfree speech recognition in high noise environments | |
| CN106663446B (zh) | 知晓用户环境的声学降噪 | |
| CN114360527B (zh) | 车载语音交互方法、装置、设备及存储介质 | |
| EP2994911B1 (en) | Adaptive audio frame processing for keyword detection | |
| JP7753363B2 (ja) | ユーザ発話プロファイル管理 | |
| US20070239454A1 (en) | Personalizing a context-free grammar using a dictation language model | |
| JP2017535809A (ja) | サウンド検出モデルを生成するためのサウンドサンプル検証 | |
| JP2020079921A (ja) | 音声インタラクション実現方法、装置、コンピュータデバイス及びプログラム | |
| CN106233376A (zh) | 用于通过话音输入激活应用程序的方法和设备 | |
| US12413440B2 (en) | Output device selection | |
| JP7819337B2 (ja) | ビデオ処理方法、装置、機器及び媒体 | |
| JP7343087B2 (ja) | 音声認識の方法、装置、およびデバイス、並びにコンピュータ可読記憶媒体 | |
| CN110070859A (zh) | 一种语音识别方法及装置 | |
| CN110503991B (zh) | 语音播报方法、装置、电子设备及存储介质 | |
| CN114385800A (zh) | 语音对话方法和装置 | |
| US20240212687A1 (en) | Supplemental content output | |
| JP2021149664A (ja) | 出力装置、出力方法及び出力プログラム | |
| WO2020228226A1 (zh) | 一种纯音乐检测方法、装置及存储介质 | |
| CN118609608A (zh) | 音频处理系统和应用中使用话音活动检测的降噪 | |
| CN113051902B (zh) | 语音数据脱敏方法、电子设备及计算机可读存储介质 | |
| CN111696550A (zh) | 语音处理方法和装置、用于语音处理的装置 | |
| CN112017662A (zh) | 控制指令确定方法、装置、电子设备和存储介质 | |
| CN112863496B (zh) | 一种语音端点检测方法以及装置 | |
| CN110289010B (zh) | 一种声音采集的方法、装置、设备和计算机存储介质 |