CN1152366C - 声音识别系统 - Google Patents

声音识别系统 Download PDF

Info

Publication number
CN1152366C
CN1152366C CNB011328746A CN01132874A CN1152366C CN 1152366 C CN1152366 C CN 1152366C CN B011328746 A CNB011328746 A CN B011328746A CN 01132874 A CN01132874 A CN 01132874A CN 1152366 C CN1152366 C CN 1152366C
Authority
CN
China
Prior art keywords
sound
input signal
inner product
afterpower
parts
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB011328746A
Other languages
English (en)
Chinese (zh)
Other versions
CN1343966A (zh
Inventor
С
小林载
驹村光弥
����һ
外山聪一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pioneer Corp
Original Assignee
Pioneer Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pioneer Corp filed Critical Pioneer Corp
Publication of CN1343966A publication Critical patent/CN1343966A/zh
Application granted granted Critical
Publication of CN1152366C publication Critical patent/CN1152366C/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Complex Calculations (AREA)
  • Machine Translation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CNB011328746A 2000-09-12 2001-09-12 声音识别系统 Expired - Fee Related CN1152366C (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2000277024A JP4201470B2 (ja) 2000-09-12 2000-09-12 音声認識システム
JP277024/00 2000-09-12
JP277024/2000 2000-09-12

Publications (2)

Publication Number Publication Date
CN1343966A CN1343966A (zh) 2002-04-10
CN1152366C true CN1152366C (zh) 2004-06-02

Family

ID=18762410

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB011328746A Expired - Fee Related CN1152366C (zh) 2000-09-12 2001-09-12 声音识别系统

Country Status (5)

Country Link
US (2) US20020049592A1 (https=)
EP (1) EP1189200B1 (https=)
JP (1) JP4201470B2 (https=)
CN (1) CN1152366C (https=)
DE (1) DE60142729D1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI114358B (fi) * 2002-05-29 2004-09-30 Nokia Corp Menetelmä digitaalisessa verkkojärjestelmässä päätelaitteen lähetyksen ohjaamiseksi
US20050010413A1 (en) * 2003-05-23 2005-01-13 Norsworthy Jon Byron Voice emulation and synthesis process
US20050058978A1 (en) * 2003-09-12 2005-03-17 Benevento Francis A. Individualized learning system
KR100717396B1 (ko) 2006-02-09 2007-05-11 삼성전자주식회사 로컬 스펙트럴 정보를 이용하여 음성 인식을 위한 유성음을판단하는 방법 및 장치
CN101689364B (zh) * 2007-07-09 2011-11-23 富士通株式会社 声音识别装置和声音识别方法
US20090030676A1 (en) * 2007-07-26 2009-01-29 Creative Technology Ltd Method of deriving a compressed acoustic model for speech recognition
KR100930060B1 (ko) * 2008-01-09 2009-12-08 성균관대학교산학협력단 신호 검출 방법, 장치 및 그 방법을 실행하는 프로그램이기록된 기록매체
JP5385810B2 (ja) * 2010-02-04 2014-01-08 日本電信電話株式会社 線形分類モデルに基づく音響モデルパラメータ学習方法とその装置、音素重み付き有限状態変換器生成方法とその装置、それらのプログラム
KR102238979B1 (ko) * 2013-11-15 2021-04-12 현대모비스 주식회사 음성 인식을 위한 전처리 장치 및 그 방법
JP7657312B2 (ja) * 2021-12-20 2025-04-04 深▲セン▼市韶音科技有限公司 音声活動検出方法、システム、音声強調方法及びシステム

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4592086A (en) * 1981-12-09 1986-05-27 Nippon Electric Co., Ltd. Continuous speech recognition system
JPS58143394A (ja) * 1982-02-19 1983-08-25 株式会社日立製作所 音声区間の検出・分類方式
DE3370423D1 (en) * 1983-06-07 1987-04-23 Ibm Process for activity detection in a voice transmission system
JPS62169199A (ja) * 1986-01-22 1987-07-25 株式会社デンソー 音声認識装置
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection
US5159637A (en) * 1988-07-27 1992-10-27 Fujitsu Limited Speech word recognizing apparatus using information indicative of the relative significance of speech features
EP0381507A3 (en) * 1989-02-02 1991-04-24 Kabushiki Kaisha Toshiba Silence/non-silence discrimination apparatus
JP3002204B2 (ja) * 1989-03-13 2000-01-24 株式会社東芝 時系列信号認識装置
JPH06332492A (ja) * 1993-05-19 1994-12-02 Matsushita Electric Ind Co Ltd 音声検出方法および検出装置
IN184794B (https=) * 1993-09-14 2000-09-30 British Telecomm
GB2317084B (en) * 1995-04-28 2000-01-19 Northern Telecom Ltd Methods and apparatus for distinguishing speech intervals from noise intervals in audio signals
US6084967A (en) * 1997-10-29 2000-07-04 Motorola, Inc. Radio telecommunication device and method of authenticating a user with a voice authentication token
EP0953971A1 (en) * 1998-05-01 1999-11-03 Entropic Cambridge Research Laboratory Ltd. Speech recognition system and method
US6615170B1 (en) * 2000-03-07 2003-09-02 International Business Machines Corporation Model-based voice activity detection system and method using a log-likelihood ratio and pitch
US6542869B1 (en) * 2000-05-11 2003-04-01 Fuji Xerox Co., Ltd. Method for automatic analysis of audio including music and speech

Also Published As

Publication number Publication date
DE60142729D1 (de) 2010-09-16
JP2002091467A (ja) 2002-03-27
US20020049592A1 (en) 2002-04-25
EP1189200A1 (en) 2002-03-20
EP1189200B1 (en) 2010-08-04
US20050091053A1 (en) 2005-04-28
JP4201470B2 (ja) 2008-12-24
CN1343966A (zh) 2002-04-10

Similar Documents

Publication Publication Date Title
US12243532B2 (en) Privacy mode based on speaker identifier
US11996097B2 (en) Multilingual wakeword detection
US8532991B2 (en) Speech models generated using competitive training, asymmetric training, and data boosting
US12387727B1 (en) Speech processing optimizations based on microphone array
US10276149B1 (en) Dynamic text-to-speech output
US7869999B2 (en) Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis
US8019602B2 (en) Automatic speech recognition learning using user corrections
US12531063B2 (en) Speech-processing system
US20110196678A1 (en) Speech recognition apparatus and speech recognition method
US11715472B2 (en) Speech-processing system
CN1454380A (zh) 具有多个话音识别引擎的话音识别系统和方法
CN1152366C (zh) 声音识别系统
US11044567B1 (en) Microphone degradation detection and compensation
CN1819017A (zh) 提取特征向量用于语音识别的方法
CN1787076A (zh) 基于混合支持向量机的说话人识别方法
JPH09325798A (ja) 音声認識装置
CN1198261C (zh) 基于决策树的语音辨别方法
CN1249665C (zh) 语音识别系统
CN1957397A (zh) 声音识别装置和声音识别方法
US11961514B1 (en) Streaming self-attention in a neural network
RU2234746C2 (ru) Способ дикторонезависимого распознавания звуков речи
Wang et al. Improved Mandarin speech recognition by lattice rescoring with enhanced tone models
Scharenborg et al. ASR in a human word recognition model: generating phonemic input for Shortlist

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee