AU736133B2 - Speech detection in a telecommunication system - Google Patents

Speech detection in a telecommunication system Download PDF

Info

Publication number
AU736133B2
AU736133B2 AU70453/98A AU7045398A AU736133B2 AU 736133 B2 AU736133 B2 AU 736133B2 AU 70453/98 A AU70453/98 A AU 70453/98A AU 7045398 A AU7045398 A AU 7045398A AU 736133 B2 AU736133 B2 AU 736133B2
Authority
AU
Australia
Prior art keywords
signal
speech
neural network
noise
peak value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU70453/98A
Other languages
English (en)
Other versions
AU7045398A (en
Inventor
Samu Kaajas
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Networks Oy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Networks Oy filed Critical Nokia Networks Oy
Publication of AU7045398A publication Critical patent/AU7045398A/en
Application granted granted Critical
Publication of AU736133B2 publication Critical patent/AU736133B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Monitoring And Testing Of Exchanges (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
  • Complex Calculations (AREA)
AU70453/98A 1997-04-18 1998-04-17 Speech detection in a telecommunication system Ceased AU736133B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FI971679 1997-04-18
FI971679A FI971679A (fi) 1997-04-18 1997-04-18 Puheen havaitseminen tietoliikennejärjestelmässä
PCT/FI1998/000345 WO1998048407A2 (en) 1997-04-18 1998-04-17 Speech detection in a telecommunication system

Publications (2)

Publication Number Publication Date
AU7045398A AU7045398A (en) 1998-11-13
AU736133B2 true AU736133B2 (en) 2001-07-26

Family

ID=8548676

Family Applications (1)

Application Number Title Priority Date Filing Date
AU70453/98A Ceased AU736133B2 (en) 1997-04-18 1998-04-17 Speech detection in a telecommunication system

Country Status (6)

Country Link
EP (1) EP0976124A2 (fi)
AU (1) AU736133B2 (fi)
CA (1) CA2286770A1 (fi)
FI (1) FI971679A (fi)
NZ (1) NZ500272A (fi)
WO (1) WO1998048407A2 (fi)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10381020B2 (en) 2017-06-16 2019-08-13 Apple Inc. Speech model-based neural network-assisted signal enhancement

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5315704A (en) * 1989-11-28 1994-05-24 Nec Corporation Speech/voiceband data discriminator
EP0628947A1 (en) * 1993-06-10 1994-12-14 SIP SOCIETA ITALIANA PER l'ESERCIZIO DELLE TELECOMUNICAZIONI P.A. Method and device for speech signal pitch period estimation and classification in digital speech coders
GB2278984A (en) * 1993-06-11 1994-12-14 Redifon Technology Limited Speech presence detector

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection
JP2776848B2 (ja) * 1988-12-14 1998-07-16 株式会社日立製作所 雑音除去方法、それに用いるニューラルネットワークの学習方法
JPH03111898A (ja) * 1989-09-26 1991-05-13 Sekisui Chem Co Ltd 音声検出方式

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5315704A (en) * 1989-11-28 1994-05-24 Nec Corporation Speech/voiceband data discriminator
EP0628947A1 (en) * 1993-06-10 1994-12-14 SIP SOCIETA ITALIANA PER l'ESERCIZIO DELLE TELECOMUNICAZIONI P.A. Method and device for speech signal pitch period estimation and classification in digital speech coders
GB2278984A (en) * 1993-06-11 1994-12-14 Redifon Technology Limited Speech presence detector

Also Published As

Publication number Publication date
FI971679A (fi) 1998-10-19
NZ500272A (en) 2001-03-30
FI971679A0 (fi) 1997-04-18
WO1998048407A2 (en) 1998-10-29
CA2286770A1 (en) 1998-10-29
EP0976124A2 (en) 2000-02-02
WO1998048407A3 (en) 1999-02-11
AU7045398A (en) 1998-11-13

Similar Documents

Publication Publication Date Title
JP2654917B2 (ja) ニューラル・ネットワークを使用する話者独立孤立単語音声認識システム
JPH0816187A (ja) 音声分析における音声認識方法
CN106409310A (zh) 一种音频信号分类方法和装置
EP2089877A1 (en) Voice activity detection system and method
KR20020052191A (ko) 음성 분류를 이용한 음성의 가변 비트 속도 켈프 코딩 방법
CN102089803A (zh) 用以将信号的不同段分类的方法与鉴别器
CN113889090A (zh) 一种基于多任务学习的多语种识别模型的构建和训练方法
Rajesh Kumar et al. Optimization-enabled deep convolutional network for the generation of normal speech from non-audible murmur based on multi-kernel-based features
CN112825250A (zh) 语音唤醒方法、设备、存储介质及程序产品
AU736133B2 (en) Speech detection in a telecommunication system
Bäckström et al. Voice activity detection
Chetouani et al. Neural predictive coding for speech discriminant feature extraction: The DFE-NPC.
CN113782000B (zh) 一种基于多任务的语种识别方法
Maeran et al. Speech recognition through phoneme segmentation and neural classification
JP3183072B2 (ja) 音声符号化装置
Wang et al. Chip design of portable speech memopad suitable for persons with visual disabilities
JPH04115299A (ja) 音声有音無音判定方法および装置
Nisa et al. Meta-Heuristic Application in Suppression of Noise
CN118398010B (zh) 一种基于语音识别的养老服务机器人交互方法
Vini Voice Activity Detection Techniques-A Review
Cooper Speech detection using gammatone features and one-class support vector machine
Beritelli et al. A pattern recognition approach to robust voiced/unvoiced speech classification using fuzzy logic
JPH04198997A (ja) 音声認識方法
Nielsen et al. Prosodic features improve sentence segmentation and parsing
Ananthapadmanabha et al. Relative occurrences and difference of extrema for detection of transitions between broad phonetic classes

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired