HK1206863A1 - Audio classification based on perceptual quality for low or medium bit rates

Audio classification based on perceptual quality for low or medium bit rates

Info

Publication number
HK1206863A1
Authority
HK
Hong Kong
Prior art keywords
low
bit rates
classification based
perceptual quality
audio classification
Prior art date
Application number
HK15107348.7A
Other languages
English (en)
Chinese (zh)
Inventor
高揚
Original Assignee
Huawei Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2012-09-18
Filing date
2015-07-31
Publication date
2016-01-15
Application filed by Huawei Tech Co Ltd
Publication of HK1206863A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/20 Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G10L2025/937 Signal energy in various frequency bands
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
HK15107348.7A 2012-09-18 2015-07-31 Audio classification based on perceptual quality for low or medium bit rates HK1206863A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261702342P 2012-09-18 2012-09-18
US14/027,052 US9589570B2 (en) 2012-09-18 2013-09-13 Audio classification based on perceptual quality for low or medium bit rates
PCT/CN2013/083794 WO2014044197A1 (en) 2012-09-18 2013-09-18 Audio classification based on perceptual quality for low or medium bit rates

Publications (1)

Publication Number Publication Date
HK1206863A1 (en) 2016-01-15

Family

ID=50275348

Family Applications (2)

Application Number Title Priority Date Filing Date
HK15107348.7A HK1206863A1 (en) 2012-09-18 2015-07-31 Audio classification based on perceptual quality for low or medium bit rates
HK18105294.2A HK1245988A1 (zh) 2012-09-18 2015-07-31 Audio classification based on perceptual quality for low or medium bit rates

Family Applications After (1)

Application Number Title Priority Date Filing Date
HK18105294.2A HK1245988A1 (zh) 2012-09-18 2015-07-31 Audio classification based on perceptual quality for low or medium bit rates

Country Status (9)

Country Link
US (3) US9589570B2 (ko)
EP (2) EP3296993B1 (ko)
JP (3) JP6148342B2 (ko)
KR (2) KR101801758B1 (ko)
BR (1) BR112015005980B1 (ko)
ES (1) ES2870487T3 (ko)
HK (2) HK1206863A1 (ko)
SG (2) SG10201706360RA (ko)
WO (1) WO2014044197A1 (ko)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3576089B1 (en) * 2012-05-23 2020-10-14 Nippon Telegraph And Telephone Corporation Encoding of an audio signal
US9589570B2 (en) * 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
EP2830065A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for decoding an encoded audio signal using a cross-over filter around a transition frequency
US9685166B2 (en) * 2014-07-26 2017-06-20 Huawei Technologies Co., Ltd. Classification between time-domain coding and frequency domain coding
EP2980794A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder using a frequency domain processor and a time domain processor
EP2980795A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
WO2023153228A1 (ja) * 2022-02-08 2023-08-17 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ Encoding device and encoding method

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU3708597A (en) * 1996-08-02 1998-02-25 Matsushita Electric Industrial Co., Ltd. Voice encoder, voice decoder, recording medium on which program for realizing voice encoding/decoding is recorded and mobile communication apparatus
US6456965B1 (en) * 1997-05-20 2002-09-24 Texas Instruments Incorporated Multi-stage pitch and mixed voicing estimation for harmonic speech coders
US6233550B1 (en) * 1997-08-29 2001-05-15 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
ATE302991T1 (de) * 1998-01-22 2005-09-15 Deutsche Telekom Ag Method for signal-controlled switching between different audio coding systems
US6496797B1 (en) * 1999-04-01 2002-12-17 Lg Electronics Inc. Apparatus and method of speech coding and decoding using multiple frames
US6298322B1 (en) * 1999-05-06 2001-10-02 Eric Lindemann Encoding and synthesis of tonal audio signals using dominant sinusoids and a vector-quantized residual tonal signal
US6604070B1 (en) * 1999-09-22 2003-08-05 Conexant Systems, Inc. System of encoding and decoding speech signals
US6782360B1 (en) * 1999-09-22 2004-08-24 Mindspeed Technologies, Inc. Gain quantization for a CELP speech coder
US6694293B2 (en) 2001-02-13 2004-02-17 Mindspeed Technologies, Inc. Speech coding system with a music classifier
US6738739B2 (en) * 2001-02-15 2004-05-18 Mindspeed Technologies, Inc. Voiced speech preprocessing employing waveform interpolation or a harmonic model
US20030028386A1 (en) * 2001-04-02 2003-02-06 Zinser Richard L. Compressed domain universal transcoder
US6917912B2 (en) * 2001-04-24 2005-07-12 Microsoft Corporation Method and apparatus for tracking pitch in audio analysis
US6871176B2 (en) * 2001-07-26 2005-03-22 Freescale Semiconductor, Inc. Phase excited linear prediction encoder
US7124075B2 (en) * 2001-10-26 2006-10-17 Dmitry Edward Terez Methods and apparatus for pitch determination
CA2388439A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for efficient frame erasure concealment in linear predictive based speech codecs
CA2392640A1 (en) * 2002-07-05 2004-01-05 Voiceage Corporation A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
KR100546758B1 (ko) * 2003-06-30 2006-01-26 한국전자통신연구원 Apparatus and method for determining transmission rate during speech transcoding
US7447630B2 (en) * 2003-11-26 2008-11-04 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement
US7783488B2 (en) * 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
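KR100964402B1 (ko) * 2006-12-14 2010-06-17 삼성전자주식회사 Method and apparatus for determining the encoding mode of an audio signal, and method and apparatus for encoding/decoding an audio signal using the same
CN101256772B (zh) 2007-03-02 2012-02-15 华为技术有限公司 Method and device for determining the category to which a non-noise audio signal belongs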
US20080249783A1 (en) * 2007-04-05 2008-10-09 Texas Instruments Incorporated Layered Code-Excited Linear Prediction Speech Encoder and Decoder Having Plural Codebook Contributions in Enhancement Layers Thereof and Methods of Layered CELP Encoding and Decoding
KR100925256B1 (ko) 2007-05-03 2009-11-05 인하대학교 산학협력단 Method for classifying speech and music in real time
US8185388B2 (en) * 2007-07-30 2012-05-22 Huawei Technologies Co., Ltd. Apparatus for improving packet loss, frame erasure, or jitter concealment
US8473283B2 (en) * 2007-11-02 2013-06-25 Soundhound, Inc. Pitch selection modules in a system for automatic transcription of sung or hummed melodies
KR101380297B1 (ko) 2008-07-11 2014-04-02 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Discriminator and method for classifying different signal segments
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
US9037474B2 (en) * 2008-09-06 2015-05-19 Huawei Technologies Co., Ltd. Method for classifying audio signal into fast signal or slow signal
CN101604525B (zh) * 2008-12-31 2011-04-06 华为技术有限公司 Pitch gain obtaining method and device, and encoder and decoder
US8185384B2 (en) * 2009-04-21 2012-05-22 Cambridge Silicon Radio Limited Signal pitch period estimation
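KR20120032444A (ko) * 2010-09-28 2012-04-05 한국전자통신연구원 Method and apparatus for decoding an audio signal using adaptive codebook update
KR101858466B1 (ko) 2010-10-25 2018-06-28 보이세지 코포레이션 Mixed time-domain/frequency-domain coding device, encoder, decoder, mixed time-domain/frequency-domain coding method, encoding method and decoding method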
MY159444A (en) * 2011-02-14 2017-01-13 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V Encoding and decoding of pulse positions of tracks of an audio signal
US9037456B2 (en) * 2011-07-26 2015-05-19 Google Technology Holdings LLC Method and apparatus for audio coding and decoding
WO2013068634A1 (en) * 2011-11-10 2013-05-16 Nokia Corporation A method and apparatus for detecting audio sampling rate
CN107293311B (zh) * 2011-12-21 2021-10-26 华为技术有限公司 Very short pitch period detection and coding
CN104254886B (zh) * 2011-12-21 2018-08-14 华为技术有限公司 Adaptive coding of the pitch period of voiced speech
US9111531B2 (en) * 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
US9589570B2 (en) * 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
US9685166B2 (en) * 2014-07-26 2017-06-20 Huawei Technologies Co., Ltd. Classification between time-domain coding and frequency domain coding

Also Published As

Publication number Publication date
KR101801758B1 (ko) 2017-11-27
JP2019174834A (ja) 2019-10-10
KR20170018091A (ko) 2017-02-15
EP2888734A4 (en) 2015-11-04
US9589570B2 (en) 2017-03-07
US10283133B2 (en) 2019-05-07
EP3296993B1 (en) 2021-03-10
ES2870487T3 (es) 2021-10-27
BR112015005980B1 (pt) 2021-06-15
US11393484B2 (en) 2022-07-19
US20170116999A1 (en) 2017-04-27
EP3296993A1 (en) 2018-03-21
JP6148342B2 (ja) 2017-06-14
HK1245988A1 (zh) 2018-08-31
JP6843188B2 (ja) 2021-03-17
KR20150055035A (ko) 2015-05-20
EP2888734A1 (en) 2015-07-01
JP6545748B2 (ja) 2019-07-17
JP2015534109A (ja) 2015-11-26
EP2888734B1 (en) 2017-11-15
SG10201706360RA (en) 2017-09-28
BR112015005980A2 (pt) 2017-07-04
US20140081629A1 (en) 2014-03-20
WO2014044197A1 (en) 2014-03-27
KR101705276B1 (ko) 2017-02-22
JP2017156767A (ja) 2017-09-07
SG11201502040YA (en) 2015-04-29
US20190237088A1 (en) 2019-08-01

Similar Documents

Publication Publication Date Title
HK1245988A1 (zh) Audio classification based on perceptual quality for low or medium bit rates
TWI562133B (en) Bit allocating method and non-transitory computer-readable recording medium
EP2939420A4 (en) USE OF QUALITY INFORMATION FOR THE ADAPTIVE STREAMING OF MEDIA CONTENT
RS64927B1 (sr) Methods for sending or receiving media data
EP2776921A4 (en) GEOGRAPHIC BARRIER BASED ON GEO-REFERENCED MEDIA
EP2581908A4 (en) PLAYING DEVICE, RECORDING MEDIA, PLAY PROCESS, PROGRAM
EP2559258A4 (en) ENHANCED READING QUALITY OF MULTIMEDIA CONTENT
EP3065952A4 (en) PRINTABLE PRINT SUPPORT
GB201122234D0 (en) Updating apparatus, updating method and recording medium
EP2701922A4 (en) RECORDING MEDIA
EP2908331A4 (en) Exposure device, exposure method, device production method, program, and recording medium
EP2810779A4 (en) Inkjet recording apparatus
IL236173A0 (en) A method of recording data
EP2752021A4 (en) SELECTIVE MULTIMEDIA CONTENT RECORDING
EP3126151A4 (en) Printable recording media
EP3024663A4 (en) Printable recording media
EP2892052A4 (en) BILLING PROCESS AND DEVICE FOR SOUND SIGNALS
EP2750389A4 (en) RECORDING MEDIUM, READING DEVICE, RECORDING DEVICE, AND RECORDING METHOD
EP2922291A4 (en) METHOD AND DEVICE FOR SENDING AND RECEIVING AUDIO DATA
EP3341210A4 (en) Printable recording media
EP2863388A4 (en) BIT ASSIGNMENT METHOD AND DEVICE FOR AUDIO SIGNAL
EP2898367A4 (en) RECORDING MEDIUM, IMAGE RECORDING APPARATUS, AND IMAGE RECORDING ASSEMBLY
EP3250394A4 (en) Printable recording media
EP3079913A4 (en) Printable recording media
EP3044009A4 (en) Printable recording media