CN1174374C - 并发进行语音识别、说话者分段和分类的方法 - Google Patents
并发进行语音识别、说话者分段和分类的方法 Download PDFInfo
- Publication number
- CN1174374C CN1174374C CNB001183885A CN00118388A CN1174374C CN 1174374 C CN1174374 C CN 1174374C CN B001183885 A CNB001183885 A CN B001183885A CN 00118388 A CN00118388 A CN 00118388A CN 1174374 C CN1174374 C CN 1174374C
- Authority
- CN
- China
- Prior art keywords
- speaker
- giving
- label
- sound source
- debate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (19)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/345,237 US6421645B1 (en) | 1999-04-09 | 1999-06-30 | Methods and apparatus for concurrent speech recognition, speaker segmentation and speaker classification |
US09/345,237 | 1999-06-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1279462A CN1279462A (zh) | 2001-01-10 |
CN1174374C true CN1174374C (zh) | 2004-11-03 |
Family
ID=23354161
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB001183885A Expired - Fee Related CN1174374C (zh) | 1999-06-30 | 2000-06-14 | 并发进行语音识别、说话者分段和分类的方法 |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP4132590B2 (zh) |
CN (1) | CN1174374C (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030154084A1 (en) * | 2002-02-14 | 2003-08-14 | Koninklijke Philips Electronics N.V. | Method and system for person identification using video-speech matching |
US6667700B1 (en) * | 2002-10-30 | 2003-12-23 | Nbt Technology, Inc. | Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation |
US6954522B2 (en) | 2003-12-15 | 2005-10-11 | International Business Machines Corporation | Caller identifying information encoded within embedded digital information |
JP5175724B2 (ja) * | 2005-07-06 | 2013-04-03 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | エレメントのシーケンスの発生の方法及び装置 |
CN102655002B (zh) * | 2011-03-01 | 2013-11-27 | 株式会社理光 | 音频处理方法和音频处理设备 |
CN102522084B (zh) * | 2011-12-22 | 2013-09-18 | 广东威创视讯科技股份有限公司 | 一种将语音数据转换为文本文件的方法和系统 |
CN105161094A (zh) * | 2015-06-26 | 2015-12-16 | 徐信 | 一种语音音频切分手动调整切分点的系统及方法 |
CN108074574A (zh) * | 2017-11-29 | 2018-05-25 | 维沃移动通信有限公司 | 音频处理方法、装置及移动终端 |
CN111145752B (zh) * | 2020-01-03 | 2022-08-02 | 百度在线网络技术(北京)有限公司 | 智能音频装置、方法、电子设备及计算机可读介质 |
CN111931482B (zh) * | 2020-09-22 | 2021-09-24 | 思必驰科技股份有限公司 | 文本分段方法和装置 |
DE102022115111A1 (de) | 2022-04-07 | 2023-10-12 | Grundig Business Systems Gmbh | Verfahren und Vorrichtung zur Verarbeitung von Audio- und/oder Videoinformationen |
-
2000
- 2000-06-14 CN CNB001183885A patent/CN1174374C/zh not_active Expired - Fee Related
- 2000-06-23 JP JP2000188625A patent/JP4132590B2/ja not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
JP2001060098A (ja) | 2001-03-06 |
JP4132590B2 (ja) | 2008-08-13 |
CN1279462A (zh) | 2001-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6421645B1 (en) | Methods and apparatus for concurrent speech recognition, speaker segmentation and speaker classification | |
US20210183395A1 (en) | Method and system for automatically diarising a sound recording | |
US6424946B1 (en) | Methods and apparatus for unknown speaker labeling using concurrent speech recognition, segmentation, classification and clustering | |
US7337115B2 (en) | Systems and methods for providing acoustic classification | |
US6567775B1 (en) | Fusion of audio and video based speaker identification for multimedia information access | |
CN1270361A (zh) | 使用内容和扬声器信息进行音频信息检索的方法和装置 | |
CN1174374C (zh) | 并发进行语音识别、说话者分段和分类的方法 | |
Abdallah et al. | Theory and evaluation of a Bayesian music structure extractor | |
CN101136199A (zh) | 语音数据处理方法和设备 | |
CN103985381A (zh) | 一种基于参数融合优化决策的音频索引方法 | |
Vanhoucke | Confidence scoring and rejection using multi-pass speech recognition. | |
CN1758263A (zh) | 基于得分差加权融合的多模态身份识别方法 | |
CN107452403A (zh) | 一种说话人标记方法 | |
Fan et al. | Deep Hashing for Speaker Identification and Retrieval. | |
Huijbregts et al. | Speaker diarization error analysis using oracle components | |
Ntalampiras et al. | Automatic recognition of urban soundscenes | |
Huang et al. | A fast-match approach for robust, faster than real-time speaker diarization | |
Wolters et al. | Proposal-based few-shot sound event detection for speech and environmental sounds with perceivers | |
Nishida et al. | Unsupervised speaker indexing using speaker model selection based on Bayesian information criterion | |
Iqbal et al. | Stacked convolutional neural networks for general-purpose audio tagging | |
CN116524960A (zh) | 一种基于混合熵下采样和集成分类器的语音情感识别系统 | |
Wang et al. | The NPU-ASLP System for Deepfake Algorithm Recognition in ADD 2023 Challenge. | |
Hakkani-Tür et al. | An active approach to spoken language processing | |
Tsau et al. | Content/context-adaptive feature selection for environmental sound recognition | |
CN115862639A (zh) | 一种基于k—均值聚类分析的人工智能语音分析方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NEW ANST COMMUNICATION CO.,LTD. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINE CORP. Effective date: 20090911 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090911 Address after: Massachusetts, USA Patentee after: Nuance Communications Inc Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20041103 Termination date: 20170614 |
|
CF01 | Termination of patent right due to non-payment of annual fee |