CN1152366C - 声音识别系统 - Google Patents
声音识别系统 Download PDFInfo
- Publication number
- CN1152366C CN1152366C CNB011328746A CN01132874A CN1152366C CN 1152366 C CN1152366 C CN 1152366C CN B011328746 A CNB011328746 A CN B011328746A CN 01132874 A CN01132874 A CN 01132874A CN 1152366 C CN1152366 C CN 1152366C
- Authority
- CN
- China
- Prior art keywords
- sound
- input signal
- inner product
- afterpower
- parts
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 239000013598 vector Substances 0.000 claims abstract description 60
- 238000001514 detection method Methods 0.000 claims description 51
- 230000008676 import Effects 0.000 claims description 4
- 238000001228 spectrum Methods 0.000 description 28
- 238000000034 method Methods 0.000 description 20
- 238000010586 diagram Methods 0.000 description 10
- 239000011159 matrix material Substances 0.000 description 10
- 206010038743 Restlessness Diseases 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 8
- 101000685663 Homo sapiens Sodium/nucleoside cotransporter 1 Proteins 0.000 description 4
- 101000821827 Homo sapiens Sodium/nucleoside cotransporter 2 Proteins 0.000 description 4
- 102100023116 Sodium/nucleoside cotransporter 1 Human genes 0.000 description 4
- 102100021541 Sodium/nucleoside cotransporter 2 Human genes 0.000 description 4
- 230000001932 seasonal effect Effects 0.000 description 3
- GOLXNESZZPUPJE-UHFFFAOYSA-N spiromesifen Chemical compound CC1=CC(C)=CC(C)=C1C(C(O1)=O)=C(OC(=O)CC(C)(C)C)C11CCCC1 GOLXNESZZPUPJE-UHFFFAOYSA-N 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Complex Calculations (AREA)
- Machine Translation (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2000277024A JP4201470B2 (ja) | 2000-09-12 | 2000-09-12 | 音声認識システム |
| JP277024/00 | 2000-09-12 | ||
| JP277024/2000 | 2000-09-12 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1343966A CN1343966A (zh) | 2002-04-10 |
| CN1152366C true CN1152366C (zh) | 2004-06-02 |
Family
ID=18762410
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB011328746A Expired - Fee Related CN1152366C (zh) | 2000-09-12 | 2001-09-12 | 声音识别系统 |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US20020049592A1 (https=) |
| EP (1) | EP1189200B1 (https=) |
| JP (1) | JP4201470B2 (https=) |
| CN (1) | CN1152366C (https=) |
| DE (1) | DE60142729D1 (https=) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FI114358B (fi) * | 2002-05-29 | 2004-09-30 | Nokia Corp | Menetelmä digitaalisessa verkkojärjestelmässä päätelaitteen lähetyksen ohjaamiseksi |
| US20050010413A1 (en) * | 2003-05-23 | 2005-01-13 | Norsworthy Jon Byron | Voice emulation and synthesis process |
| US20050058978A1 (en) * | 2003-09-12 | 2005-03-17 | Benevento Francis A. | Individualized learning system |
| KR100717396B1 (ko) | 2006-02-09 | 2007-05-11 | 삼성전자주식회사 | 로컬 스펙트럴 정보를 이용하여 음성 인식을 위한 유성음을판단하는 방법 및 장치 |
| CN101689364B (zh) * | 2007-07-09 | 2011-11-23 | 富士通株式会社 | 声音识别装置和声音识别方法 |
| US20090030676A1 (en) * | 2007-07-26 | 2009-01-29 | Creative Technology Ltd | Method of deriving a compressed acoustic model for speech recognition |
| KR100930060B1 (ko) * | 2008-01-09 | 2009-12-08 | 성균관대학교산학협력단 | 신호 검출 방법, 장치 및 그 방법을 실행하는 프로그램이기록된 기록매체 |
| JP5385810B2 (ja) * | 2010-02-04 | 2014-01-08 | 日本電信電話株式会社 | 線形分類モデルに基づく音響モデルパラメータ学習方法とその装置、音素重み付き有限状態変換器生成方法とその装置、それらのプログラム |
| KR102238979B1 (ko) * | 2013-11-15 | 2021-04-12 | 현대모비스 주식회사 | 음성 인식을 위한 전처리 장치 및 그 방법 |
| JP7657312B2 (ja) * | 2021-12-20 | 2025-04-04 | 深▲セン▼市韶音科技有限公司 | 音声活動検出方法、システム、音声強調方法及びシステム |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4592086A (en) * | 1981-12-09 | 1986-05-27 | Nippon Electric Co., Ltd. | Continuous speech recognition system |
| JPS58143394A (ja) * | 1982-02-19 | 1983-08-25 | 株式会社日立製作所 | 音声区間の検出・分類方式 |
| DE3370423D1 (en) * | 1983-06-07 | 1987-04-23 | Ibm | Process for activity detection in a voice transmission system |
| JPS62169199A (ja) * | 1986-01-22 | 1987-07-25 | 株式会社デンソー | 音声認識装置 |
| US5276765A (en) * | 1988-03-11 | 1994-01-04 | British Telecommunications Public Limited Company | Voice activity detection |
| US5159637A (en) * | 1988-07-27 | 1992-10-27 | Fujitsu Limited | Speech word recognizing apparatus using information indicative of the relative significance of speech features |
| EP0381507A3 (en) * | 1989-02-02 | 1991-04-24 | Kabushiki Kaisha Toshiba | Silence/non-silence discrimination apparatus |
| JP3002204B2 (ja) * | 1989-03-13 | 2000-01-24 | 株式会社東芝 | 時系列信号認識装置 |
| JPH06332492A (ja) * | 1993-05-19 | 1994-12-02 | Matsushita Electric Ind Co Ltd | 音声検出方法および検出装置 |
| IN184794B (https=) * | 1993-09-14 | 2000-09-30 | British Telecomm | |
| GB2317084B (en) * | 1995-04-28 | 2000-01-19 | Northern Telecom Ltd | Methods and apparatus for distinguishing speech intervals from noise intervals in audio signals |
| US6084967A (en) * | 1997-10-29 | 2000-07-04 | Motorola, Inc. | Radio telecommunication device and method of authenticating a user with a voice authentication token |
| EP0953971A1 (en) * | 1998-05-01 | 1999-11-03 | Entropic Cambridge Research Laboratory Ltd. | Speech recognition system and method |
| US6615170B1 (en) * | 2000-03-07 | 2003-09-02 | International Business Machines Corporation | Model-based voice activity detection system and method using a log-likelihood ratio and pitch |
| US6542869B1 (en) * | 2000-05-11 | 2003-04-01 | Fuji Xerox Co., Ltd. | Method for automatic analysis of audio including music and speech |
-
2000
- 2000-09-12 JP JP2000277024A patent/JP4201470B2/ja not_active Expired - Fee Related
-
2001
- 2001-09-10 DE DE60142729T patent/DE60142729D1/de not_active Expired - Lifetime
- 2001-09-10 US US09/948,762 patent/US20020049592A1/en not_active Abandoned
- 2001-09-10 EP EP01307684A patent/EP1189200B1/en not_active Expired - Lifetime
- 2001-09-12 CN CNB011328746A patent/CN1152366C/zh not_active Expired - Fee Related
-
2004
- 2004-11-24 US US10/995,509 patent/US20050091053A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| DE60142729D1 (de) | 2010-09-16 |
| JP2002091467A (ja) | 2002-03-27 |
| US20020049592A1 (en) | 2002-04-25 |
| EP1189200A1 (en) | 2002-03-20 |
| EP1189200B1 (en) | 2010-08-04 |
| US20050091053A1 (en) | 2005-04-28 |
| JP4201470B2 (ja) | 2008-12-24 |
| CN1343966A (zh) | 2002-04-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12243532B2 (en) | Privacy mode based on speaker identifier | |
| US11996097B2 (en) | Multilingual wakeword detection | |
| US8532991B2 (en) | Speech models generated using competitive training, asymmetric training, and data boosting | |
| US12387727B1 (en) | Speech processing optimizations based on microphone array | |
| US10276149B1 (en) | Dynamic text-to-speech output | |
| US7869999B2 (en) | Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis | |
| US8019602B2 (en) | Automatic speech recognition learning using user corrections | |
| US12531063B2 (en) | Speech-processing system | |
| US20110196678A1 (en) | Speech recognition apparatus and speech recognition method | |
| US11715472B2 (en) | Speech-processing system | |
| CN1454380A (zh) | 具有多个话音识别引擎的话音识别系统和方法 | |
| CN1152366C (zh) | 声音识别系统 | |
| US11044567B1 (en) | Microphone degradation detection and compensation | |
| CN1819017A (zh) | 提取特征向量用于语音识别的方法 | |
| CN1787076A (zh) | 基于混合支持向量机的说话人识别方法 | |
| JPH09325798A (ja) | 音声認識装置 | |
| CN1198261C (zh) | 基于决策树的语音辨别方法 | |
| CN1249665C (zh) | 语音识别系统 | |
| CN1957397A (zh) | 声音识别装置和声音识别方法 | |
| US11961514B1 (en) | Streaming self-attention in a neural network | |
| RU2234746C2 (ru) | Способ дикторонезависимого распознавания звуков речи | |
| Wang et al. | Improved Mandarin speech recognition by lattice rescoring with enhanced tone models | |
| Scharenborg et al. | ASR in a human word recognition model: generating phonemic input for Shortlist |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| C19 | Lapse of patent right due to non-payment of the annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |