CN101256768A - Time frequency two-dimension converse spectrum characteristic extracting method for recognizing language species - Google Patents
- Publication number
- CN101256768A
- Authority
- CN
- China
- Prior art keywords
- frequency
- frame
- feature
- matrix
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Machine Translation (AREA)
- Complex Calculations (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101033280A CN101256768B (en) | 2008-04-03 | 2008-04-03 | Time frequency two-dimension converse spectrum characteristic extracting method for recognizing language species |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101256768A (en) | 2008-09-03 |
CN101256768B (en) | 2011-03-30 |
Family
ID=39891525
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101033280A Expired - Fee Related CN101256768B (en) | 2008-04-03 | 2008-04-03 | Time frequency two-dimension converse spectrum characteristic extracting method for recognizing language species |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101256768B (en) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FI19992351A (en) * | 1999-10-29 | 2001-04-30 | Nokia Mobile Phones Ltd | voice recognizer |
JP3699912B2 (en) * | 2001-07-26 | 2005-09-28 | 株式会社東芝 | Voice feature extraction method, apparatus, and program |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101702314B (en) * | 2009-10-13 | 2011-11-09 | 清华大学 | Method for establishing identified type language recognition model based on language pair |
CN103295583A (en) * | 2012-02-24 | 2013-09-11 | 佳能株式会社 | Method and equipment for extracting sub-band energy features of sound and monitoring system |
CN103295583B (en) * | 2012-02-24 | 2015-09-30 | 佳能株式会社 | For extracting the method for the sub belt energy feature of sound, equipment and surveillance |
CN102723081A (en) * | 2012-05-30 | 2012-10-10 | 林其灿 | Voice signal processing method, voice and voiceprint recognition method and device |
CN102723081B (en) * | 2012-05-30 | 2014-05-21 | 无锡百互科技有限公司 | Voice signal processing method, voice and voiceprint recognition method and device |
CN103021407A (en) * | 2012-12-18 | 2013-04-03 | 中国科学院声学研究所 | Method and system for recognizing speech of agglutinative language |
CN103021407B (en) * | 2012-12-18 | 2015-07-08 | 中国科学院声学研究所 | Method and system for recognizing speech of agglutinative language |
CN104992424B (en) * | 2015-07-27 | 2018-05-25 | 北京航空航天大学 | A kind of single pixel based on discrete cosine transform quickly imaging system |
CN104992424A (en) * | 2015-07-27 | 2015-10-21 | 北京航空航天大学 | Single-pixel rapid active imaging system based on discrete cosine transform |
CN105068048A (en) * | 2015-08-14 | 2015-11-18 | 南京信息工程大学 | Distributed microphone array sound source positioning method based on space sparsity |
CN106205638A (en) * | 2016-06-16 | 2016-12-07 | 清华大学 | A kind of double-deck fundamental tone feature extracting method towards audio event detection |
CN106205638B (en) * | 2016-06-16 | 2019-11-08 | 清华大学 | A kind of double-deck fundamental tone feature extracting method towards audio event detection |
CN109036458A (en) * | 2018-08-22 | 2018-12-18 | 昆明理工大学 | A kind of multilingual scene analysis method based on audio frequency characteristics parameter |
CN114067834A (en) * | 2020-07-30 | 2022-02-18 | 中国移动通信集团有限公司 | Bad preamble recognition method and device, storage medium and computer equipment |
CN112530407A (en) * | 2020-11-25 | 2021-03-19 | 北京快鱼电子股份公司 | Language identification method and system |
CN112530407B (en) * | 2020-11-25 | 2021-07-23 | 北京快鱼电子股份公司 | Language identification method and system |
CN114209325A (en) * | 2021-12-23 | 2022-03-22 | 东风柳州汽车有限公司 | Driver fatigue behavior monitoring method, device, equipment and storage medium |
CN114209325B (en) * | 2021-12-23 | 2023-06-23 | 东风柳州汽车有限公司 | Driver fatigue behavior monitoring method, device, equipment and storage medium |
CN115840877A (en) * | 2022-12-06 | 2023-03-24 | 中国科学院空间应用工程与技术中心 | Distributed stream processing method and system for MFCC extraction, storage medium and computer |
Also Published As
Publication number | Publication date |
---|---|
CN101256768B (en) | 2011-03-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101256768B (en) | Time frequency two-dimension converse spectrum characteristic extracting method for recognizing language species | |
CN106847292B (en) | Method for recognizing sound-groove and device | |
CN102968986B (en) | Overlapped voice and single voice distinguishing method based on long time characteristics and short time characteristics | |
Tiwari | MFCC and its applications in speaker recognition | |
Thomas et al. | Cross-lingual and multi-stream posterior features for low resource LVCSR systems. | |
US6370504B1 (en) | Speech recognition on MPEG/Audio encoded files | |
CN100514446C (en) | Pronunciation evaluating method based on voice identification and voice analysis | |
CN101226743A (en) | Method for recognizing speaker based on conversion of neutral and affection sound-groove model | |
CN102800316A (en) | Optimal codebook design method for voiceprint recognition system based on nerve network | |
CN102737633A (en) | Method and device for recognizing speaker based on tensor subspace analysis | |
CN102789779A (en) | Speech recognition system and recognition method thereof | |
CN104732972A (en) | HMM voiceprint recognition signing-in method and system based on grouping statistics | |
CN101887722A (en) | Rapid voiceprint authentication method | |
CN1787070B (en) | On-chip system for language learner | |
CN101546555A (en) | Constraint heteroscedasticity linear discriminant analysis method for language identification | |
CN103258537A (en) | Method utilizing characteristic combination to identify speech emotions and device thereof | |
CN106297769B (en) | A kind of distinctive feature extracting method applied to languages identification | |
Samal et al. | On the use of MFCC feature vector clustering for efficient text dependent speaker recognition | |
CN104240699A (en) | Simple and effective phrase speech recognition method | |
CN103778914A (en) | Anti-noise voice identification method and device based on signal-to-noise ratio weighing template characteristic matching | |
Al-Rawahy et al. | Text-independent speaker identification system based on the histogram of DCT-cepstrum coefficients | |
Liu et al. | Supra-Segmental Feature Based Speaker Trait Detection. | |
Bansod et al. | Speaker Recognition using Marathi (Varhadi) Language | |
Meghanani et al. | Pitch-synchronous DCT features: A pilot study on speaker identification | |
Sailaja et al. | Text independent speaker identification with finite multivariate generalized gaussian mixture model and hierarchical clustering algorithm |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2016-12-23. Address after: 100084 Zhongguancun Haidian District East Road No. 1, building 8, floor 8, A803B. Patentee after: Beijing Hua Kong Chuang Wei Information Technology Co., Ltd. Address before: 100084 Beijing 100084-82 mailbox. Patentee before: Tsinghua University |
TR01 | Transfer of patent right | Effective date of registration: 2020-03-17. Address after: 100084 Tsinghua University, Beijing, Haidian District. Patentee after: Tsinghua University. Address before: 100084 Zhongguancun Haidian District East Road No. 1, building 8, floor 8, A803B. Patentee before: Beijing Hua Kong Chuang Wei Information Technology Co., Ltd. |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 2011-03-30. Termination date: 2021-04-03 |