WO2006090553A1 - 音声帯域拡張装置 - Google Patents
音声帯域拡張装置 Download PDFInfo
- Publication number
- WO2006090553A1 WO2006090553A1 PCT/JP2006/301287 JP2006301287W WO2006090553A1 WO 2006090553 A1 WO2006090553 A1 WO 2006090553A1 JP 2006301287 W JP2006301287 W JP 2006301287W WO 2006090553 A1 WO2006090553 A1 WO 2006090553A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- frequency
- audio signal
- low
- original
- Prior art date
Links
- 238000001514 detection method Methods 0.000 claims abstract description 8
- 230000005236 sound signal Effects 0.000 claims description 44
- 238000000034 method Methods 0.000 claims description 15
- 230000015572 biosynthetic process Effects 0.000 claims description 8
- 238000003786 synthesis reaction Methods 0.000 claims description 8
- 230000002194 synthesizing effect Effects 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 description 25
- 238000010586 diagram Methods 0.000 description 16
- 230000000694 effects Effects 0.000 description 8
- 238000004364 calculation method Methods 0.000 description 6
- 238000005070 sampling Methods 0.000 description 6
- 238000005311 autocorrelation function Methods 0.000 description 5
- 230000001934 delay Effects 0.000 description 4
- 238000005314 correlation function Methods 0.000 description 3
- 239000004606 Fillers/Extenders Substances 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000010363 phase shift Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
Definitions
- the present invention relates to a voice band extending device, and can be applied to, for example, a narrow band telephone or a device for wideband voice signals of exchange power.
- Patent Document 1 proposes a band expander that expands a voice band.
- Narrowband audio signal (digital signal) DC whose frequency is limited from 300 Hz to 3.4 kHz is input to Band Extender 10.
- This narrowband audio signal DC is converted to a converted original signal S whose sample frequency is increased by the sample frequency change (for example, 8 kHz to 16 kHz).
- This band extension signal V is a low-frequency band generated from a band-limited narrowband audio signal DC.
- Component signals and high-frequency component signals are provided together with the transmitted signals at the same time, making it possible to listen to the sound as if it were a wide-band signal containing these components.
- Patent Document 1 Japanese Patent Laid-Open No. 9258787
- the newly generated component signal is different in frequency component from the original signal, so the phase of the newly generated component signal and the transmitted signal is different. Since the addition and synthesis are simply performed without causing any problems, the finally created wideband audio signal may have an unnatural audio signal compared to the original wideband audio signal.
- the present invention provides an audio band extending apparatus having an extended audio generating means for generating an extended audio signal having a band that the original audio signal does not have from the original audio signal.
- a timing shift detection unit that detects a timing shift between the audio signal and the extended audio signal; an adjustment unit that adjusts the timing of the original audio signal and / or the extended audio signal according to the detected timing shift;
- a synthesis means for synthesizing the original voice signal and the extended voice signal.
- the audio band extending apparatus of the present invention although the bands are different, the original audio signal and the extended audio signal are synthesized in synchronism with each other. Voice signals can be realized.
- FIG. 1 is a block diagram showing a configuration of a voice band extending apparatus according to a first embodiment.
- FIG. 2 is a block diagram showing a configuration of a conventional band extender.
- FIG. 3 is a block diagram showing a detailed configuration of an adjustment adder according to the first embodiment.
- FIG. 4 is a block diagram showing a detailed configuration of a low-frequency adjustment adder according to the first embodiment.
- FIG. 5 is an explanatory diagram of the processing of the adjuster in the low-frequency adjustment adder of the first embodiment.
- FIG. 6 is a block diagram showing a detailed configuration of a low-frequency adjustment adder according to the second embodiment.
- FIG. 7 is a block diagram showing a detailed configuration of a low-frequency adjustment adder according to a third embodiment.
- FIG. 8 is a block diagram showing a detailed configuration of a low-frequency adjustment adder according to a fourth embodiment.
- FIG. 9 is a block diagram showing a detailed configuration of an adjustment adder according to a fifth embodiment.
- FIG. 10 is a block diagram showing a detailed configuration of a low frequency adjuster according to a fifth embodiment.
- FIG. 1 is a block diagram showing a configuration of a voice band extending apparatus 100 according to the first embodiment.
- the same reference numerals are given to the same and corresponding parts as in FIG. ing.
- a speech band extending apparatus 100 includes a sampling frequency converter 11, a low-frequency signal generator 12, a high-frequency signal generator 13, a silent part signal generator 14, and adjustment addition.
- Has vessel 20 The sampling frequency converter 11, a low-frequency signal generator 12, a high-frequency signal generator 13, a silent part signal generator 14, and adjustment addition.
- sampling frequency converter 11, low-frequency signal generator 12, high-frequency signal generator 13, and silent part signal generator 14 are the same as those described in Patent Document 1, respectively.
- synthetic low-frequency signal LS, synthetic high-frequency signal HS, and synthetic unvoiced signal for generating the band extension signal V The US generation method is not limited to the method described in Patent Document 1, and other existing methods may be applied.
- the adjustment adder 20 provided in place of the adder 15 in FIG. 2 adjusts the timing of the synthesized low-frequency signal LS, the synthesized high-frequency signal HS, and the synthesized unvoiced signal US with respect to the frequency conversion original signal S. The difference is that the timing is adjusted.
- the low-frequency adjustment adder 21 includes two zero-crossing detectors 31 and 32, a delay detector 33, an adjuster 34, and an adder circuit 35.
- each unit operates as follows each time one audio frame is input.
- the second zero-crossing detector 32 is configured such that the input to the first zero-crossing detector 31 is the synthesized low-frequency signal LS instead of the frequency conversion original signal S, and the output is the original zero-crossing information SZ. Instead, the operation is the same as that of the first zero-crossing detector 31 except that the low-frequency zero-crossing information LZ is used, and a detailed description thereof will be omitted.
- the delay detector 33 inputs the zero-crossing information SZ obtained from the frequency-converted original signal S and the low-frequency zero-crossing information LZ obtained from the synthesized low-frequency signal LS, and receives the synthesized low-frequency signal for the frequency-converted original signal S.
- the delay time of LS is calculated, and this is output to the regulator 35 as delay information LD.
- the delay time is, for example, the time difference between the zero-crossing time with the positive slope detected first in the frame together with the original zero-crossing information SZ and the low-frequency zero-crossing information LZ.
- the period calculation method in which a known autocorrelation function may be used is not limited.
- the latest end force of the delay-added signal LSI is also copied to the earliest side, and the signal waveform corresponding to one period is copied as the interpolated signal LS2 at the position on the latest side by one period LT.
- the part ES corresponding to the shortage of the waveform is combined with the delay grant signal LSI to generate the low-frequency adjustment signal LA.
- the frequency conversion original signal S is used instead of the synthesized low frequency signal LS, and the original delay information SD is used instead of the low frequency delay signal LD. Only handle positive delays. Here, only negative delays that handle only positive delays may be handled.
- the correlation calculator 41 is provided in place of the zero-crossing detectors 31 and 32 and the delay detector 33 of the first embodiment, and includes a frequency conversion original signal S and a low-frequency signal generator 13. Correlation information (low-band correlation information) LC with the generated combined low-frequency signal LS is obtained and output to the adjuster 42.
- the low-frequency adjustment adder 21 of the fourth embodiment includes a period detector 43 in addition to the correlation calculator 41, the adjuster 42, and the addition circuit 35.
- the period detector 43 since the period detector 43 is provided, the function of the correlation calculator 41 is slightly different from that of the third embodiment.
- the internal configuration of the adjustment adder 20 is different from that of the above-described embodiment.
- the extension component signal has three types of signals, that is, the low-frequency component, the high-frequency component, and the unvoiced component, but the number of types of the extension component signal is not limited to three. It may be more or less than this. For example, a plurality of types of high frequency components having different bands may be generated.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Synchronisation In Digital Transmission Systems (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/884,780 US8000976B2 (en) | 2005-02-22 | 2006-01-27 | Speech band extension device |
GB0716155A GB2439660A (en) | 2005-02-22 | 2006-01-27 | Voice band extension device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005-045995 | 2005-02-22 | ||
JP2005045995A JP4821131B2 (ja) | 2005-02-22 | 2005-02-22 | 音声帯域拡張装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006090553A1 true WO2006090553A1 (ja) | 2006-08-31 |
Family
ID=36927198
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2006/301287 WO2006090553A1 (ja) | 2005-02-22 | 2006-01-27 | 音声帯域拡張装置 |
Country Status (5)
Country | Link |
---|---|
US (1) | US8000976B2 (ja) |
JP (1) | JP4821131B2 (ja) |
CN (1) | CN100587809C (ja) |
GB (1) | GB2439660A (ja) |
WO (1) | WO2006090553A1 (ja) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BRPI0818927A2 (pt) * | 2007-11-02 | 2015-06-16 | Huawei Tech Co Ltd | Método e aparelho para a decodificação de áudio |
CN102194458B (zh) * | 2010-03-02 | 2013-02-27 | 中兴通讯股份有限公司 | 频带复制方法、装置及音频解码方法、系统 |
CN102800317B (zh) * | 2011-05-25 | 2014-09-17 | 华为技术有限公司 | 信号分类方法及设备、编解码方法及设备 |
ES2549953T3 (es) * | 2012-08-27 | 2015-11-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Aparato y método para la reproducción de una señal de audio, aparato y método para la generación de una señal de audio codificada, programa de ordenador y señal de audio codificada |
CN107402405B (zh) * | 2016-05-18 | 2019-07-19 | 中国石油化工股份有限公司 | 静相位虚源道集构建方法 |
CN106328153B (zh) * | 2016-08-24 | 2020-05-08 | 青岛歌尔声学科技有限公司 | 电子通信设备语音信号处理系统、方法和电子通信设备 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09146593A (ja) * | 1995-11-27 | 1997-06-06 | Victor Co Of Japan Ltd | 音響信号符号化方法、音響信号復号化方法、音響信号符号化装置及び音響信号復号化装置 |
JPH11126097A (ja) * | 1997-10-22 | 1999-05-11 | Victor Co Of Japan Ltd | 音声情報処理方法、音声情報処理装置、音声情報の記録媒体への記録方法 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0774564A (ja) * | 1993-06-23 | 1995-03-17 | Clarion Co Ltd | 音質改善装置 |
JP3243174B2 (ja) * | 1996-03-21 | 2002-01-07 | 株式会社日立国際電気 | 狭帯域音声信号の周波数帯域拡張回路 |
WO1999010719A1 (en) * | 1997-08-29 | 1999-03-04 | The Regents Of The University Of California | Method and apparatus for hybrid coding of speech at 4kbps |
EP1580747A3 (en) * | 1997-10-22 | 2005-11-02 | Victor Company of Japan Limited | Audio information processing method, audio information processing apparatus, and method of recording audio information on recording medium |
US7003121B1 (en) * | 1998-04-08 | 2006-02-21 | Bang & Olufsen Technology A/S | Method and an apparatus for processing an auscultation signal |
JP3654117B2 (ja) * | 2000-03-13 | 2005-06-02 | ヤマハ株式会社 | 時間軸方向における楽音波形信号の伸縮方法 |
US7610205B2 (en) * | 2002-02-12 | 2009-10-27 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
JP2004350077A (ja) * | 2003-05-23 | 2004-12-09 | Matsushita Electric Ind Co Ltd | アナログオーディオ信号送信装置および受信装置並びにアナログオーディオ信号伝送方法 |
EP1755112B1 (en) * | 2004-02-20 | 2008-05-28 | Sony Corporation | Method and apparatus for separating a sound-source signal |
-
2005
- 2005-02-22 JP JP2005045995A patent/JP4821131B2/ja active Active
-
2006
- 2006-01-27 CN CN200680005711A patent/CN100587809C/zh active Active
- 2006-01-27 GB GB0716155A patent/GB2439660A/en not_active Withdrawn
- 2006-01-27 WO PCT/JP2006/301287 patent/WO2006090553A1/ja not_active Application Discontinuation
- 2006-01-27 US US11/884,780 patent/US8000976B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09146593A (ja) * | 1995-11-27 | 1997-06-06 | Victor Co Of Japan Ltd | 音響信号符号化方法、音響信号復号化方法、音響信号符号化装置及び音響信号復号化装置 |
JPH11126097A (ja) * | 1997-10-22 | 1999-05-11 | Victor Co Of Japan Ltd | 音声情報処理方法、音声情報処理装置、音声情報の記録媒体への記録方法 |
Also Published As
Publication number | Publication date |
---|---|
GB0716155D0 (en) | 2007-09-26 |
CN101128868A (zh) | 2008-02-20 |
CN100587809C (zh) | 2010-02-03 |
JP4821131B2 (ja) | 2011-11-24 |
GB2439660A (en) | 2008-01-02 |
US8000976B2 (en) | 2011-08-16 |
US20080255831A1 (en) | 2008-10-16 |
JP2006234967A (ja) | 2006-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5925742B2 (ja) | 通信システムにおける隠蔽フレームの生成方法 | |
US8271292B2 (en) | Signal bandwidth expanding apparatus | |
JP6281336B2 (ja) | 音声復号化装置及びプログラム | |
JP5598536B2 (ja) | 帯域拡張装置および帯域拡張方法 | |
JP2002528775A (ja) | 広帯域信号の符号化における適応帯域ピッチ探索のための方法および装置 | |
US20090278573A1 (en) | Band Converted Signal Generator and Band Extender | |
JPH09258787A (ja) | 狭帯域音声信号の周波数帯域拡張回路 | |
JP4031813B2 (ja) | オーディオ信号処理装置、オーディオ信号処理方法およびその方法をコンピュータに実行させるプログラム | |
WO2006090553A1 (ja) | 音声帯域拡張装置 | |
JP2003256000A (ja) | 電話装置 | |
JP2002082685A (ja) | 音声帯域拡張装置及び音声帯域拡張方法 | |
JP3430985B2 (ja) | 合成音生成装置 | |
JP2004198485A (ja) | 音響符号化信号復号化装置及び音響符号化信号復号化プログラム | |
US20030182106A1 (en) | Method and device for changing the temporal length and/or the tone pitch of a discrete audio signal | |
JP4604864B2 (ja) | 帯域拡張装置及び不足帯域信号生成器 | |
WO2007135786A1 (ja) | 帯域外信号生成装置及び周波数帯域拡張装置 | |
JP2012208177A (ja) | 帯域拡張装置及び音声補正装置 | |
US7228271B2 (en) | Telephone apparatus | |
JP2007310296A (ja) | 帯域拡張装置及び方法 | |
JPH08305396A (ja) | 音声帯域拡大装置および音声帯域拡大方法 | |
JP3360423B2 (ja) | 音声強調装置 | |
JP2010151965A (ja) | 音声処理装置、及び、音声処理方法 | |
JP2002175099A (ja) | 雑音抑制方法および雑音抑制装置 | |
Suzuki et al. | Time-scale modification of speech signals using cross-correlation functions | |
JP2001249676A (ja) | 雑音が付加された周期波形の基本周期あるいは基本周波数の抽出方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DPE2 | Request for preliminary examination filed before expiration of 19th month from priority date (pct application filed from 20040101) | ||
ENP | Entry into the national phase |
Ref document number: 0716155 Country of ref document: GB Kind code of ref document: A Free format text: PCT FILING DATE = 20060127 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 0716155.7 Country of ref document: GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11884780 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200680005711.1 Country of ref document: CN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: GB Ref legal event code: 789A Ref document number: 0716155 Country of ref document: GB |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 06712453 Country of ref document: EP Kind code of ref document: A1 |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 6712453 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: JP |