AU736133B2 - Speech detection in a telecommunication system - Google Patents
Speech detection in a telecommunication system
- Publication number
- AU736133B2 (application AU70453/98A)
- Authority
- AU
- Australia
- Prior art keywords
- signal
- speech
- neural network
- noise
- peak value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000001514 detection method Methods 0.000 title claims description 23
- 238000013528 artificial neural network Methods 0.000 claims description 68
- 238000000034 method Methods 0.000 claims description 48
- 210000002569 neuron Anatomy 0.000 claims description 45
- 239000000872 buffer Substances 0.000 claims description 31
- 238000005311 autocorrelation function Methods 0.000 claims description 17
- 239000013598 vector Substances 0.000 claims description 15
- 238000012549 training Methods 0.000 claims description 14
- 238000004364 calculation method Methods 0.000 claims description 11
- 238000001914 filtration Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 description 23
- 206010019133 Hangover Diseases 0.000 description 17
- 238000010295 mobile communication Methods 0.000 description 16
- 238000012546 transfer Methods 0.000 description 11
- 239000000523 sample Substances 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 4
- 230000003044 adaptive effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000012723 sample buffer Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013529 biological neural network Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000003925 brain function Effects 0.000 description 1
- 238000005314 correlation function Methods 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 239000010813 municipal solid waste Substances 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 210000004205 output neuron Anatomy 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
- Monitoring And Testing Of Exchanges (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FI971679 | 1997-04-18 | ||
FI971679A (fi) | 1997-04-18 | 1997-04-18 | Speech detection in a telecommunication system |
PCT/FI1998/000345 WO1998048407A2 (en) | 1997-04-18 | 1998-04-17 | Speech detection in a telecommunication system |
Publications (2)
Publication Number | Publication Date |
---|---|
AU7045398A (en) | 1998-11-13 |
AU736133B2 (en) | 2001-07-26 |
Family
ID=8548676
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU70453/98A Ceased AU736133B2 (en) | 1997-04-18 | 1998-04-17 | Speech detection in a telecommunication system |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP0976124A2 (fi) |
AU (1) | AU736133B2 (fi) |
CA (1) | CA2286770A1 (fi) |
FI (1) | FI971679A (fi) |
NZ (1) | NZ500272A (fi) |
WO (1) | WO1998048407A2 (fi) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10381020B2 (en) | 2017-06-16 | 2019-08-13 | Apple Inc. | Speech model-based neural network-assisted signal enhancement |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5315704A (en) * | 1989-11-28 | 1994-05-24 | Nec Corporation | Speech/voiceband data discriminator |
EP0628947A1 (en) * | 1993-06-10 | 1994-12-14 | SIP SOCIETA ITALIANA PER l'ESERCIZIO DELLE TELECOMUNICAZIONI P.A. | Method and device for speech signal pitch period estimation and classification in digital speech coders |
GB2278984A (en) * | 1993-06-11 | 1994-12-14 | Redifon Technology Limited | Speech presence detector |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5276765A (en) * | 1988-03-11 | 1994-01-04 | British Telecommunications Public Limited Company | Voice activity detection |
JP2776848B2 (ja) * | 1988-12-14 | 1998-07-16 | Hitachi, Ltd. | Noise removal method and training method for the neural network used therein |
JPH03111898A (ja) * | 1989-09-26 | 1991-05-13 | Sekisui Chem Co Ltd | Speech detection system |
1997
- 1997-04-18 FI FI971679A patent/FI971679A/fi unknown
1998
- 1998-04-17 WO PCT/FI1998/000345 patent/WO1998048407A2/en not_active Application Discontinuation
- 1998-04-17 AU AU70453/98A patent/AU736133B2/en not_active Ceased
- 1998-04-17 EP EP98917143A patent/EP0976124A2/en not_active Withdrawn
- 1998-04-17 CA CA002286770A patent/CA2286770A1/en not_active Abandoned
- 1998-04-17 NZ NZ500272A patent/NZ500272A/xx unknown
Also Published As
Publication number | Publication date |
---|---|
FI971679A (fi) | 1998-10-19 |
NZ500272A (en) | 2001-03-30 |
FI971679A0 (fi) | 1997-04-18 |
WO1998048407A2 (en) | 1998-10-29 |
CA2286770A1 (en) | 1998-10-29 |
EP0976124A2 (en) | 2000-02-02 |
WO1998048407A3 (en) | 1999-02-11 |
AU7045398A (en) | 1998-11-13 |
Similar Documents
Publication | Title |
---|---|
JP2654917B2 (ja) | Speaker-independent isolated-word speech recognition system using a neural network |
JPH0816187A (ja) | Speech recognition method in speech analysis |
CN106409310A (zh) | Audio signal classification method and apparatus |
EP2089877A1 (en) | Voice activity detection system and method |
KR20020052191A (ko) | Variable bit-rate CELP coding method for speech using speech classification |
CN102089803A (zh) | Method and discriminator for classifying different segments of a signal |
CN113889090A (zh) | Method for constructing and training a multilingual recognition model based on multi-task learning |
Rajesh Kumar et al. | Optimization-enabled deep convolutional network for the generation of normal speech from non-audible murmur based on multi-kernel-based features |
CN112825250A (zh) | Voice wake-up method, device, storage medium and program product |
AU736133B2 (en) | Speech detection in a telecommunication system |
Bäckström et al. | Voice activity detection |
Chetouani et al. | Neural predictive coding for speech discriminant feature extraction: The DFE-NPC. |
CN113782000B (zh) | Multi-task-based language identification method |
Maeran et al. | Speech recognition through phoneme segmentation and neural classification |
JP3183072B2 (ja) | Speech coding apparatus |
Wang et al. | Chip design of portable speech memopad suitable for persons with visual disabilities |
JPH04115299A (ja) | Method and apparatus for determining whether speech is present or silent |
Nisa et al. | Meta-Heuristic Application in Suppression of Noise |
CN118398010B (zh) | Interaction method for an elderly-care service robot based on speech recognition |
Vini | Voice Activity Detection Techniques-A Review |
Cooper | Speech detection using gammatone features and one-class support vector machine |
Beritelli et al. | A pattern recognition approach to robust voiced/unvoiced speech classification using fuzzy logic |
JPH04198997A (ja) | Speech recognition method |
Nielsen et al. | Prosodic features improve sentence segmentation and parsing |
Ananthapadmanabha et al. | Relative occurrences and difference of extrema for detection of transitions between broad phonetic classes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| FGA | Letters patent sealed or granted (standard patent) | |
| MK14 | Patent ceased section 143(a) (annual fees not paid) or expired | |