KR100372576B1

KR100372576B1 - Method of Processing Audio Signal

Info

Publication number: KR100372576B1
Application number: KR10-2000-0028582A
Authority: KR
Inventors: 최원용; 이병철; 정상현; 최원식
Original assignee: 주식회사 코스모탄
Priority date: 2000-05-26
Filing date: 2000-05-26
Publication date: 2003-02-17
Also published as: KR20010107238A

Abstract

사용자가 설정해준 값에 따라서 약음을 강음에 비하여 상대적으로 더 높이 이득률을 적용하여 차등증폭하고, 이와 함께, 원래의 신호를 시간 축으로 변환시켜 재생시간 즉, 재생속도를 더 빠르게 혹은 느리게 가변시킬 수 있는 오디오신호 가공방법을 개시한다. 차등증폭을 위해, 샘플데이터를 단조증가함수에 대입하여 그 함수값으로 대체한다. 필요에 따라 차등증폭을 적용하는 범위를 선정하여 그 범위에 포함되는 샘플들만을 차등적으로 증폭한다. 시간축변환을 위해, 원래의 신호를 프레임단위로 읽어와서 시간축변환신호에 중첩시켜 부가하되 그 중첩되는 부분은 사용자가 지정한 재생시간조정율 α와 원신호 x(·)의 각 프레임간 시작점의 기본차이값 Sa에 따라 다르게 정해진다. 또한 중첩부분은 가중치를 적용하여 합한다. 원신호를 읽어올 때는 프레임간 시작점의 기본차이값 Sa을 기준으로 소정의 체크범위(윈도우) k_m내에서 원신호 x(·)와 시간축변환신호 y(·)간의 파형의 유사성을 기준으로 최적 상관성을 검출하고 그 위치를 동기지연 Km으로 정한다. 차등증폭은 시간축변환을 위해 읽어낸 프레임데이터에 대하여 시간축변환 전이나 후에 수행할 수 있다.Depending on the value set by the user, the attenuation is differentially amplified by applying a higher gain ratio relative to the strong sound. An audio signal processing method is disclosed. For differential amplification, we substitute sample data into the monotonic increment function and replace it with the value of that function. If necessary, select a range to apply differential amplification and differentially amplify only the samples included in the range. For time-base conversion, the original signal is read in units of frames and superimposed on the time-base conversion signal, and the overlapped part is the basic difference value between the start time adjustment rate α specified by the user and the starting point of each frame of the original signal x (·). It depends on Sa. In addition, the overlapping parts are summed by applying weights. The original signal to read a predetermined check, based on the primary differential value Sa of the inter-frame entry point when all range (window) optimal, based on the similarity of the waveform between the original signal x (·) and the time-base conversion signal y (·) in the k _m Correlation is detected and its position is determined as the synchronization delay Km. The differential amplification can be performed before or after the time base transformation on the frame data read for the time base transformation.

Description

Method of Processing Audio Signal

본 발명은 오디오신호 가공방법과 이를 위한 장치에 관한 것이다. 보다 구체적으로는, 사용자가 설정한 값에 따라 오디오신호의 재생시간과 세기를 실시간으로 동시에 가변시켜 재생하는 오디오신호의 가공방법에 관한 것이다.The present invention relates to an audio signal processing method and apparatus therefor. More specifically, the present invention relates to a method for processing an audio signal in which the reproduction time and intensity of the audio signal are simultaneously changed in real time according to a value set by a user.

음성이나 노래 등과 같이 오디오재생장치를 통해서 출력되는 재생음을 알아듣기 힘들 때가 있다. 이에 관한 원인은 여러 가지가 있을 수 있다. 원래의 신호음이 약하게 기록되어 있거나 대단히 빠른 속도로 기록되어 있는 경우도 그 원인이 될 수 있지만, 청취자의 청력이 약해 약음을 잘 알아듣지 못하거나 혹은 재생음이 외국어이어서 청해력이 기본적으로 낮은 경우도 그 원인이 될 수 있다.Sometimes it is difficult to hear the playback sound output through the audio playback device such as voice or song. There are many possible causes for this. The reason may be that the original tone is weakly recorded or recorded at a very high speed, but it is also possible that the listener's hearing is weak and he / she does not hear it well, or that the hearing is fundamentally low due to the foreign language being played. This can be

특히, 외국어 청취력 학습에 있어서, 청취력을 높여주는 방법으로서 잘 들리지 않는 약한 음의 세기를 보강해주는 것이 하나의 해결책이 될 수 있다. 일반적으로 영어와 같은 음절언어(syllabic language)는 한국어 혹은 일본어와는 달리 억양과 악센트에 의해 보다 많은 정보를 전달할 수 있는 언어적 특징을 갖고 있다. 하나의 단어에 포함된 여러 가지 발음중 그 일부를 생략하거나 약화시켜도 그 의미 전달에 큰 문제가 없기 때문에 음절언어 특히 영어 발음에는 묵음이나 약음으로 처리되는 경우가 많다. 이에 비해, 한국어나 일본어 등과 같은 비음절언어(nonsyllabic language)의 경우 그 음성을 통한 의사전달은 각 단어가 포함하고 있는 의미가 주로 담당하므로 억양이나 악센트는 큰 비중을 차지하지 않는다. 비음절언어의 발음상의 특징은 각각의 단어를 정확하게 발음하는 것이고 묵음이나 약음으로 발음되는 경우가 드물다. 한국어나 일본어와 같은 비음절언어를 모국어로 사용하는 사람들은 영어 발음 중에서 액센트가 없는 음절 부분 즉, 묵음이나 약음으로 발음되는 부분을 정확하게 청해하기란 매우 힘들다. 결국, 한국어나 일본어 발음체계에 익숙한 사람들은 영어발음을 정확하게 이해하기 위해서는 청취학습을 무수히 많이 반복하는 노력을 필요로 하였다.In particular, in learning foreign language listening ability, one solution may be to reinforce the weak sound strength, which is difficult to hear as a method of enhancing the listening ability. In general, syllabic language such as English has a linguistic characteristic that can convey more information by accent and accent, unlike Korean or Japanese. Omitting or attenuating some of the various pronunciations in a word does not have much problem in conveying its meaning, so syllable languages, especially English pronunciations, are often treated as mute or weak. In contrast, in the case of nonsyllabic languages such as Korean or Japanese, communication through the voice is mainly responsible for the meaning included in each word, so the accent or accent does not take much weight. The pronunciation feature of non-syllable languages is that each word is pronounced correctly, and it is rarely pronounced as a silent or weak sound. It is very difficult for people who use non-syllable languages such as Korean or Japanese to listen to the accented syllable parts of English pronunciations, such as silent or weak sounds. As a result, those who are familiar with Korean or Japanese pronunciation systems needed to repeat listening lessons in order to understand English pronunciation correctly.

묵음을 인위적으로 부가하는 것은 매우 어려워 이는 논외로 하더라도, 오디오신호의 재생시 약음의 세기를 강화시키는 것은 가능하다. 다만, 알려진 오디오신호의 이득제어방법들은 음의 세기에 의존하지 않고 일률적으로 동일한 증폭율을 적용하는 방법들이어서 오히려 재생음의 품질을 떨어뜨려 실제로는 채용되지 못하고 있다. 즉, 이들 방법에 따르면, 약음이나 강음은 이를 구별하지 않고 일률적으로 동일한 이득률을 적용하여 약음의 세기를 원하는 수준까지 증폭하면 강음마저도 그세기가 강화되어 강음부분은 매우 크고 찢어지는 소리로 들리는 등 재생음의 품질이 오히려 나빠지게 된다.It is very difficult to artificially add silence, so that it is possible to enhance the strength of the weak sound when the audio signal is reproduced. However, known gain control methods of audio signals are methods that apply the same amplification rate uniformly without depending on the intensity of sound, but rather are not actually employed because they degrade the quality of the reproduced sound. In other words, according to these methods, if the amplification of the weak sound level is applied to the desired level by applying the same gain rate uniformly without distinguishing the weak sound or strong sound, even the strong sound is strengthened and the strong sound part is heard as a loud and torn sound. The quality of the reproduction sound is rather worse.

한편, 원래의 정상적인 재생속도가 상대적으로 빠르게 느껴지는 경우에는 재생속도를 더 천천히 하거나, 그 반대의 경우에는 재생속도를 보다 빠르게 해주는 것이 필요한 경우가 있다. 재생속도를 가변시켜주는 기술로는 종래부터 여러 가지가 알려져있다. 그 대표적인 것으로서 1985년 Roucus와 Wilgus에 의해 오버랩가산법(overlap-addition: OLA)이 소개된 이후, 동기식 오버랩가산법(Synchronized OLA: SOLA), 파형유사성에 의거한 OLA법 즉, WSOLA법 등으로 변화 발전해오고 있다. SOLA 알고리즘은 재생속도를 가변시키기 위하여 음성데이터를 일정한 크기의 윈도우를 사용하여 일정한 간격으로 오버랩 시키면서 해당하는 블록을 잘라내고 요구된 속도변화에 대응하여 블록들을 시간축상에 재배치하여 더하는 방법으로 오디오신호를 가공한다. 여기서, 음질저하를 방지하기 위해 블록을 재배치할 때 오버랩되는 부분의 파형 유사성이 가장 큰 값에 해당하는 간격만큼 이동시켜 두 개의 블록신호를 합성하는 기술을 적용하기도 한다.On the other hand, if the original normal playback speed feels relatively fast, it may be necessary to make the playback speed slower or vice versa. Various techniques are known in the art for varying the playback speed. For example, since the overlap-addition (OLA) was introduced by Roucus and Wilgus in 1985, it changed to the synchronous overlap addition method (SOLA), the OLA method based on waveform similarity, that is, the WSOLA method. It is developing. In order to change the playback speed, the SOLA algorithm overlaps the audio data at regular intervals by using a window of a constant size, cuts out the corresponding block, and adds the audio signal by rearranging the blocks on the time axis in response to the required speed change. Processing. In order to prevent sound degradation, a technique of synthesizing two block signals by shifting the similarity of waveforms of overlapping portions by an interval corresponding to the largest value may be applied.

그런데 오디오신호의 재생속도 가변기술은 그 자체에 관해서는 가공 후에도 양호한 음질을 유지할 수 있으며, 더 나아가 음의 세기를 가변시키는 기술과 결합하여 동시적으로 처리할 수 있다면 매우 바람직할 것이다.However, the technology of changing the reproduction speed of the audio signal can maintain good sound quality even after processing, and furthermore, it is highly desirable if it can be simultaneously processed in combination with a technique of varying the sound intensity.

본 발명의 제 1의 목적은 원래의 디지털 오디오신호를 사용자가 정해준 값에 의거하여 약한 음을 강한 음에 비해 상대적으로 더 강하게 증폭하여 약음을 보다알아듣기 쉽게 가공해주는 오디오신호 가공방법을 제공하는 것이다.It is a first object of the present invention to provide an audio signal processing method that amplifies a weak sound relatively stronger than a strong sound based on a value determined by a user, thereby processing the weak sound more easily. .

본 발명의 제 2의 목적은 원래의 오디오신호를 사용자가 정해준 값에 따라 재생시간을 가변시킴과 동시에 약음의 세기가 강음에 비해 차등적으로 더 강화시킬 수 있는 오디오신호 가공방법을 제공하는 것이다.It is a second object of the present invention to provide an audio signal processing method which can vary the reproduction time according to a value determined by a user of an original audio signal, and at the same time, further enhance the strength of the weak sound compared to the strong sound.

도 1은 본 발명에 따른 오디오신호 가공방법을 실현하기 위한 장치의 구성을 도시한 블록도이다.1 is a block diagram showing the configuration of an apparatus for realizing an audio signal processing method according to the present invention.

도 2는 본 발명에 따른 오디오신호 가공방법을 개략적으로 설명하기 위한 흐름도이다.2 is a flowchart schematically illustrating an audio signal processing method according to the present invention.

도 3은 원래의 오디오신호를 사용자가 설정한 재생시간조정율 α에 따라 시간축변환처리를 수행하는 알고리즘을 도시한 흐름도이다.Fig. 3 is a flowchart showing an algorithm for performing time-base conversion processing on the original audio signal according to the reproduction time adjustment rate? Set by the user.

도 4는 원래의 오디오신호 x(·)를 사용자가 설정한 재생시간조정율 α의 크기에 대응하여 그 길이를 늘이거나 줄여 시간축변환신호 y(·)를 얻기 위한 하는 원리를 설명하기 위한 참고도이다.FIG. 4 is a reference diagram for explaining a principle of obtaining the time-axis conversion signal y (·) by increasing or decreasing the length of the original audio signal x (·) corresponding to the size of the reproduction time adjustment rate α set by the user. .

도 5는 약음을 강음보다 더 높은 증폭율을 적용하여 차등증폭하는 알고리즘을 설명하기 위한 흐름도이다.5 is a flowchart for explaining an algorithm for differentially amplifying by applying amplification higher than weak sound.

도 6은 차등증폭함수의 여러 가지 예를 도시한 그래프이다.6 is a graph illustrating various examples of the differential amplification function.

도 7은 원래의 신호와 시간축변환된 신호 및 차등증폭된 신호의 파형도를 각각 도시한다.7 shows waveform diagrams of the original signal, the time-base converted signal, and the differentially amplified signal, respectively.

** 도면의 주요부분에 대한 부호의 설명 **** Explanation of symbols for main parts of drawings **

10: 메모리10: memory

12: 재생시간조정부12: playback time adjustment unit

14: 차등증폭부14: differential amplifier

16: 필터부16: filter section

18: 버퍼부18: buffer part

20: 사용자 인터페이스부20: user interface

22: 오디오출력부22: audio output unit

24: 스피커24: speaker

상기 제 1의 목적을 달성하기 위하여, 본 발명은 시간에 대한 음의 세기를 정수 스트림으로 표현한 디지털 오디오신호를 가공하는 방법에 있어서,In order to achieve the first object, the present invention provides a method for processing a digital audio signal expressing the sound intensity over time as an integer stream,

사용자에 의해 지정된 차등증폭율 β을 사용자 인터페이스로부터 읽어들이는 단계; 및Reading the differential amplification factor β specified by the user from the user interface; And

상기 정수 스트림의 각 샘플데이터를 상기 차등증폭율 β에 대응하는 소정의 차등증폭함수 f(x_i)의 변수값으로 적용하여 증폭하되, 특히 상기 차등증폭함수 f(x_i)는 약음에서 강음으로 갈수록 증폭이득이 상대적으로 낮게 적용되는 함수이어서 약음을 강음에 비해 차등적으로 더 강화시키는 차등증폭단계를 포함하는 것을 특징으로 하는 오디오신호 가공방법을 제공한다.Each sample data of the integer stream is amplified by applying a variable value of a predetermined differential amplification function f (x _i ) corresponding to the differential amplification ratio β, and in particular, the differential amplification function f (x _i ) is weak to strong. It provides a method for processing an audio signal, characterized in that it comprises a differential amplification step that is a function of applying a relatively low amplification gain gradually differentially compared to strong sound.

바람직하게는 상기 차등증폭함수 f(x_i)는 단조증가함수중에서 선택한다.Preferably, the differential amplification function f (x _i ) is selected from the monotonically increasing function.

상기 오디오신호 가공방법은 바람직하게는 차등증폭된 샘플데이터가 패킷데이터량만큼 모이면 상기 패킷데이터에 대하여 저역통과필터링처리를 하는 단계를 더 포함한다.The audio signal processing method preferably further includes performing low pass filtering on the packet data when the differentially amplified sample data is collected by the packet data amount.

하나의 선택가능한 방안으로서, 상기 차등증폭단계는 상기 샘플데이터 x_i를문턱값 x_th와 그 크기를 비교하는 단계; 및 상기 비교의 결과, 상기 각 샘플데이터 x_i가 상기 문턱값 x_th보다 크면 상기 차등증폭함수 f(x_i)는 f(x_i) = x_i ^1/β이고, 상기 각 샘플데이터 x_i가 상기 문턱값 x_th보다 작으면 상기 차등증폭함수 f(x_i)는 f(x_i) = x_i ^β이 되어 상기 차등증폭처리를 수행하는 단계를 포함한다.In one selectable method, the differential amplifying step includes: comparing the sample data x _i with a threshold value x _th ; And as a result of the comparison, when the respective sample data x _i is greater than the threshold value x _th , the differential amplification function f (x _i ) is f (x _i ) = x _i ^{1 / β} , and each sample data x _i is If the threshold value is smaller than x _th , the differential amplification function f (x _i ) becomes f (x _i ) = x _i ^β and includes performing the differential amplification process.

다른 선택가능한 방안으로서, 상기 차등증폭단계의 상기 샘플데이터 x_i를 문턱값 x_th와 그 크기를 비교하는 단계; 및 상기 비교의 결과, 상기 각 샘플데이터 x_i가 상기 문턱값 x_th보다 큰 경우에만 상기 차등증폭함수 f(x_i)는 f(x_i) = x_i ^1/β로 하여 상기 차등증폭처리를 수행하는 단계를 포함한다.As another selectable method, the step of comparing the sample data x _i of the differential amplification step with a threshold value x _th and its magnitude; And as a result of the comparison, the differential amplification function f (x _i ) is performed as f (x _i ) = x _i ^{1 / β} only when each sample data x _i is larger than the threshold value x _th . Performing the steps.

한편, 상기 제 2의 목적을 달성하기 위하여, 본 발명은 시간에 대한 음의 세기를 정수 스트림으로 표현한 디지털 오디오신호를 입력신호로 하여 상기 디지털 오디오신호를 사용자의 지정값에 따라 그 재생시간의 조절과 차등증폭에 의한 이득조절을 병행적으로 처리하는 것을 특징으로 하는 오디오신호 가공방법을 제공한다.On the other hand, in order to achieve the second object, the present invention is to adjust the reproduction time of the digital audio signal according to the user's designated value by using the digital audio signal expressing the sound intensity with respect to time as an integer stream as an input signal The present invention provides an audio signal processing method characterized in that the gain control by the differential amplification and processing in parallel.

이 오디오신호 가공방법은 사용자에 의해 지정되는 입력값인 재생시간조정율 α와 차등증폭율 β를 사용자 인터페이스로부터 읽어들이는 단계로서, 상기 재생시간조정율 α는 오디오신호의 재생시간을 정상적인 재생시간보다 길거나 짧게 조정하기 위한 것으로서 α_min≤ α ≤ α_max의 범위 내에서 정해지며 상기 차등증폭율β는 오디오신호의 약음을 강음보다 상대적으로 더 강화시키기 위한 것으로서 β_min≤ β ≤ β_max의 범위 내에서 정해지는 설정값읽기단계;The audio signal processing method reads the reproduction time adjustment rate α and the differential amplification ratio β, which are input values specified by the user, from the user interface, wherein the reproduction time adjustment rate α is longer than the normal reproduction time. As a short adjustment, it is determined within the range of α _min ≤ α ≤ α _max , and the differential amplification ratio β is for reinforcing the weakness of the audio signal relatively more than the strong sound, and is determined within the range of β _min ≤ β ≤ β _max . Losing set value reading step;

제1 메모리에 저장되어 있는 원래의 오디오신호 x(·)를 N개의 샘플로 구성된 프레임 단위로 연속해서 읽어내되, 다음 프레임 F_m은 이전 프레임 F_m-1의 시작 위치에서 상기 N개보다 작은 수의 샘플을 건너뛴 위치의 샘플부터 읽어내므로써 인접 프레임간에 소정개수의 샘플을 중복하여 읽어내는 데이터독출단계;The original audio signal x (·) stored in the first memory is continuously read in units of N samples, and the next frame F _m is smaller than the N at the start of the previous frame F _m-1 . A data reading step of reading a predetermined number of samples in duplicate between adjacent frames by reading from a sample of a position where a sample of the skipped position is skipped;

상기 다음 프레임 F_m을 제 2 메모리에 시간축변환신호 y(·)로서 이미 기록된 상기 이전 프레임 F_m-1과 소정 샘플 수만큼 중복되도록 부가하되 그 중복부분의 샘플은 가중치를 주고 더하며 비중복부분은 그대로 더하고, 상기 중복부분의 샘플 수는 상기 재생시간조정율 α의 크기에 대응하여 증감하는 방식으로 부가함으로써 시간축 길이가 상기 재생시간조정율 α에 비례하여 변환된 신호 y(·)를 얻기 위한 시간축변환단계; 및The next frame F _{m is added} to the second memory so as to overlap the previous frame F _m-1 already recorded as the time-base conversion signal y (·) by a predetermined number of samples, and the samples of the overlapped portion are weighted, added, and not duplicated. The portion is added as it is, and the number of samples of the overlapped portion is added in such a manner as to increase or decrease in correspondence with the size of the reproduction time adjustment rate α so that the time axis for obtaining a signal y (·) whose time axis length is converted in proportion to the reproduction time adjustment rate α is obtained. Conversion step; And

각 프레임의 샘플데이터 x_i를 상기 차등증폭율 β에 대응하는 소정의 차등증폭함수 f(x_i)에 차례로 대입하여 증폭하되, 특히 상기 차등증폭함수 f(x_i)는 약음에서 강음으로 갈수록 증폭이득이 상대적으로 낮게 적용되는 함수이어서 약음을 강음에 비해 차등적으로 더 강화시키는 차등증폭단계를 포함한다.Sample data x _i of each frame is amplified by substituting a predetermined differential amplification function f (x _i ) corresponding to the differential amplification ratio β, in particular, and the differential amplification function f (x _i ) is amplified from weak to strong. The gain is a relatively low applied function, and includes a differential amplification step that differentially intensifies the weakness compared to strong sound.

필요에 따라서, 데이터독출단계에서 읽어낸 각 프레임의 샘플데이터가 정규화되지 않는 데이터인 경우에는 이를 정규화하는 정규화단계를 더 포함한다.If necessary, if the sample data of each frame read in the data reading step is data that is not normalized, further includes a normalization step of normalizing it.

또한 상기 오디오신호 가공방법은, 바람직하게는 출력버퍼를 다수개 마련하고, 상기 시간축변환단계 및 상기 차등증폭단계를 통해 가공된 신호의 다수개의 프레임을 하나의 패킷으로 묶어서 상기 복수개의 출력버퍼들 중 어느 하나에 기입하되 특히 상기 패킷데이터를 기입하는 출력버퍼는 순환적으로 교체하고, 이와 병행하여 상기 다수개의 출력버퍼에 기록된 패킷데이터는 기록순서가 앞선 출력버퍼의 패킷데이터가 먼저 스피커를 포함하는 오디오출력부로 출력되도록 제어하는 데이터출력단계를 더 포함한다.In the audio signal processing method, preferably, a plurality of output buffers are provided, and a plurality of frames of a signal processed through the time-base conversion step and the differential amplification step are bundled into one packet, and among the plurality of output buffers. In particular, the output buffer for writing the packet data is replaced in a circular manner, and the packet data recorded in the plurality of output buffers in parallel is the packet data of the output buffer prior to the recording order including the speaker first. It further includes a data output step of controlling to be output to the audio output unit.

또한, 상기 오디오신호 가공방법은 차등증폭된 샘플데이터를 소정의 데이터량마다 저역통과필터링 처리를 하여 노이즈를 제거하는 필터링단계를 더 포함할 수 있다.The audio signal processing method may further include a filtering step of removing noise by performing low pass filtering on the differentially amplified sample data for each predetermined data amount.

이하에서는 첨부한 도면을 참조하여 본 발명에 따른 오디오신호 가공방법을 보다 구체적으로 설명하기로 한다.Hereinafter, an audio signal processing method according to the present invention will be described in more detail with reference to the accompanying drawings.

도 1은 본 발명에 따른 오디오신호 가공방법을 실현하기 위한 장치의 구성을 도시한 블록도이다. 도 1의 오디오신호 가공장치는, 디지털 오디오데이터를 입력받아 이를 저장하기 위한 메모리(8)와, 이 메모리(8)로부터 오디오데이터를 읽어들여 재생시간을 가변시키는 신호가공처리를 하는 재생시간조정부(12)와 약음을 강음에 비해 상대적으로 더 큰 이득률로 증폭시키는 차등증폭부(14)를 포함한다. 재생시간조정부(12)는 재생시간조정율(α)에 의거하여 오디오신호의 재생시간을 늘이거나 줄여준다. 차등증폭부(14)는 차등증폭율(β)에 따라 차등증폭 처리를 수행한다. 재생시간조정율(α)과 차등증폭율(β)은 사용자인터페이스부(20)로부터 각각 제공된다. 즉, 사용자 인터페이스부(20)는 사용자의 조작을 모니터링하여 오디오신호의 재생시간과 약음의 강화에 대한 사용자의 설정값을 재생시간조정율(α)과 차등증폭율(β)로 변환하여 재생시간조정부(12)와 차등증폭부(14)로 각각 제공한다. 사용자인터페이스부(20)는 경우에 따라서는 약음강화처리가 적용되는 음의 세기의 문턱값(x_th)을 더 제공한다.1 is a block diagram showing the configuration of an apparatus for realizing an audio signal processing method according to the present invention. The audio signal processing apparatus of FIG. 1 includes a memory 8 for receiving digital audio data and storing the same, and a playback time adjusting unit for processing a signal processing to read audio data from the memory 8 and vary the playback time ( 12) and a differential amplifier 14 for amplifying a weak sound with a relatively larger gain than the strong sound. The reproduction time adjustment section 12 increases or decreases the reproduction time of the audio signal based on the reproduction time adjustment rate α. The differential amplifier 14 performs the differential amplification process in accordance with the differential amplification ratio β. The reproduction time adjustment rate α and the differential amplification rate β are provided from the user interface unit 20, respectively. That is, the user interface unit 20 monitors the user's operation and converts the user's setting value for the reproduction time of the audio signal and the reinforcement of the attenuation into a reproduction time adjustment rate α and a differential amplification rate β, thereby adjusting the reproduction time adjustment unit. (12) and the differential amplifier (14). In some cases, the user interface unit 20 further provides a threshold value x _th of the sound intensity to which the weakening process is applied.

메모리(10)에 제공되는 디지털 오디오데이터는 시간에 대한 음의 세기를 정수 스트림으로 표현한 것으로서, 그 최대값에 의해 정규화된 데이터인 것이 바람직하며, 비정규화 데이터인 경우에는 재생시간조정부(12) 혹은 차등증폭부(14)에서 신호 가공 전에 미리 정규화한다. 디지털 오디오 데이터의 소스는 오디오파일(예컨대 wav 혹은 avi 형식의 오디오 파일)을 저장하는 하드디스크나 시디롬이 될 수도 있고, 마이크 등과 같은 온라인 입력수단이 될 수도 있다. 마이크의 출력신호를 가공하기 위해서는 그 출력신호가 아날로그신호이므로 이를 디지털화하는 처리가 선행되어야 할 것이다.The digital audio data provided to the memory 10 expresses the sound intensity with respect to time as an integer stream, and is preferably data normalized by the maximum value. In the case of non-normalized data, the playback time adjusting section 12 or The differential amplifier 14 normalizes in advance before signal processing. The source of digital audio data may be a hard disk or a CD-ROM which stores an audio file (eg, an audio file in wav or avi format), or may be an online input means such as a microphone. In order to process the output signal of the microphone, since the output signal is an analog signal, the process of digitizing it must be preceded.

오디오신호 가공장치는 또한 재생시간 및 차등증폭 처리를 마친 데이터를 입력받아 저역통과필터링을 하여 고주파수대역의 노이즈를 제거하는 필터부(16)와 전후단의 데이터처리 속도차이를 완충시켜 출력단으로 데이터의 원활한 공급이 이루어지도록 해주는 출력버퍼부(18) 그리고 디지털 오디오신호를 아날로그 오디오신호로 변환하여 스피커(24)를 통해 음으로 출력되도록 하는 오디오출력부(22)를 더 포함한다.The audio signal processing device also receives the data after the reproduction time and the differential amplification processing, and performs a low pass filtering to remove the noise in the high frequency band, and buffers the data processing speed difference between the front and rear ends to output the data to the output stage. An output buffer unit 18 for smooth supply is further included, and an audio output unit 22 for converting a digital audio signal into an analog audio signal and outputting sound through the speaker 24.

재생시간조정부(12)와 차등증폭부(14)는 신호가공부(10)를 구성하며, 이는 필터부(16)와 함께 디지털신호처리(DSP)칩(비도시)에 소프트웨어로 구현될 수도 있고, 컴퓨터장치의 경우에도 소프트웨어로 구현된다. 오디오출력부(22)는 내부에 아날로그/디지털 변환기와 신호증폭기 등을 포함하며 컴퓨터장치에 있어서는 사운드카드가 이에 해당될 수 있다.The reproduction time adjusting unit 12 and the differential amplifying unit 14 constitute the signal processing unit 10, which may be implemented in software on a digital signal processing (DSP) chip (not shown) together with the filter unit 16. Computer devices are also implemented in software. The audio output unit 22 includes an analog / digital converter, a signal amplifier, and the like, and in the computer device, the sound card may correspond to this.

사용자는 소정의 조작수단을 통해 자신이 원하는 재생시간조정율 α과 차등증폭율 β를 설정할 수 있다. 상기 재생시간조정율 α는 오디오신호의 재생시간을 정상적인 재생시간보다 길거나 짧게 조정하기 위한 것으로서 α_min≤ α ≤ α_max의 범위 내에서 정해진다. α의 값이 1인 경우에는 원래의 정상재생속도와 동일하게 재생되며, 1 보다 적은 값이면 디지털신호의 양이 줄어들므로 재생시간은 짧아진다(즉, 재생속도가 빨라진다). 그리고, 1보다 큰 값이면 재생시간은 늘어나서 재생음이 느리게 들린다. 음성신호의 경우에는 α는 0.7이상 3이하의 범위에서 정하더라도 충분히 양호한 음질로 재생될 수 있다는 것을 실제 구현을 통해 확인할 수 있었다. 상기 차등증폭율 β는 오디오신호의 약음을 강음보다 상대적으로 더 강화시키기 위한 것으로서 β_min≤ β ≤ β_max의 범위 내에서 정해진다. 음성신호의 경우 β또한 1이상 3이하의 범위내의 값을 가지도록 하면 충분히 양호한 음질로 재생할 수 있다는 것을 실제 구현을 통해 확인할 수 있었다.The user can set the desired reproduction time adjustment rate α and the differential amplification rate β through predetermined operation means. The reproduction time adjustment rate α is for adjusting the reproduction time of the audio signal longer or shorter than the normal reproduction time, and is determined within the range of α _min ≦ α ≦ α _max . If the value of α is 1, it is reproduced at the same speed as the original normal reproduction speed. If the value is less than 1, the reproduction time is shortened (ie, the reproduction speed is increased) because the amount of the digital signal is reduced. If the value is greater than 1, the playback time increases and the playback sound is slow. In the case of the audio signal, it can be confirmed through actual implementation that α can be reproduced with sufficiently good sound quality even if it is set within the range of 0.7 or more and 3 or less. The differential amplification ratio β is for reinforcing the weak sound of the audio signal relatively more than the strong sound, and is determined within the range of β _min ≦ β ≦ β _max . In the case of the audio signal, β also has a value within the range of 1 to 3, it can be confirmed through the actual implementation that it can be reproduced with a sufficiently good sound quality.

사용자가 소정의 입력수단 예컨대 키보드나 마우스 혹은 다른 종류의 입력수단을 사용하여 재생시간조정율 α와 차등증폭율 β를 입력하면 사용자 인터페이스(20)는 그 입력값을 모니터링하여 재생시간조정부(12)와 차등증폭부(14)로 각각 제공한다(S10 단계). 오디오신호의 가공 중에 이들 값들이 제공되면 재생시간조정부(12)와 차등증폭부(14)에는 인터럽트가 걸리어 기존의 값들이 새로이 제공되는 재생시간조정율 α와 차등증폭율 β에 의해 대체된다.When the user inputs the reproduction time adjustment rate α and the differential amplification ratio β by using a predetermined input means such as a keyboard, a mouse, or another type of input means, the user interface 20 monitors the input value to monitor the input value. Provide each to the differential amplifier 14 (step S10). When these values are provided during the processing of the audio signal, the reproduction time adjustment section 12 and the differential amplifier section 14 are interrupted, and the existing values are replaced by the newly provided reproduction time adjustment rate α and the differential amplification ratio β.

메모리(8)에는 가공전의 원래의 디지털 오디오신호 x(·)가 저장되어 있는데, 이를 신호가공부(10)에서 읽어들인다(S12 단계). 신호를 읽어들이는 단위는 프레임이다. 프레임은 N개의 샘플데이터를 포함한다. 재생시간을 조정하기 위해서는 다음 프레임을 이전 프레임의 끝부분부터 읽는 것이 아니라 인접 프레임끼리는 일정부분 중복하여 읽는다. 즉, 다음 프레임 F_m은 이전 프레임 F_m-1의 시작 위치에서 상기 N개보다 작은 수의 샘플을 건너뛴 위치의 샘플부터 읽어내므로써 인접 프레임간에 소정개수의 샘플을 중복하여 읽어낸다. 이에 관한 보다 자세한 설명은 후술한다.The memory 8 stores the original digital audio signal x (·) before processing, which is read by the signal processing section 10 (step S12). The unit of reading signal is frame. The frame includes N sample data. In order to adjust the playback time, the next frame is not read from the end of the previous frame, but adjacent frames are read in duplicate. That is, the next frame F _m reads out a predetermined number of samples repeatedly between adjacent frames by reading from the sample of the position where the number of samples less than N is skipped from the start position of the previous frame F _m-1 . A more detailed description thereof will be described later.

데이터 독출단계(S12 단계)를 통해 읽어들인 각 프레임에 대하여 신호가공부(10)는 차등증폭단계(S14 단계)와 시간축변환단계(S16 단계) 즉, 재생시간변환단계를 수행한다. 차등증폭단계(S14 단계)와 시간축변환단계(S16 단계)는 도 2의 흐름도처럼 차등증폭을 먼저 처리한 다음 시간축변환을 처리하는 순서로 오디오신호를 가공할 수도 있지만, 이와는 반대의 순서도 가능하다.For each frame read through the data reading step (step S12), the signal processing unit 10 performs a differential amplification step (step S14) and a time axis conversion step (step S16), that is, a playback time conversion step. The differential amplification step (S14) and the time-base conversion step (S16 step) may process the audio signal in the order of processing the differential amplification first and then processing the time-base conversion as shown in the flowchart of FIG. 2, but the reverse order is also possible.

차등증폭단계(S14 단계)는 각 프레임의 샘플데이터 x_i를 상기 차등증폭율 α에 대응하는 소정의 차등증폭함수 f(x_i)에 차례로 대입하여 증폭하는 단계이다. 여기서, 상기 차등증폭함수 f(x_i)는 특히 약음에서 강음으로 갈수록 증폭이득이 상대적으로 낮게 적용되는 함수이다. 따라서, 이 함수를 이용하여 오디오데이터를 변환하면 약음을 강음에 비해 차등적으로 더 강화시킬 수 있다.The differential amplification step (step S14) is a step of amplifying by substituting the sample data x _i of each frame into a predetermined differential amplification function f (x _i ) corresponding to the differential amplification factor α. Here, the differential amplification function f (x _i ) is a function in which the amplification gain is relatively low, especially from weak to strong. Therefore, by converting the audio data using this function, it is possible to differentially strengthen the weak sound compared to the strong sound.

시간축변환단계(S16 단계)는 상기 다음 프레임 F_m을 메모리(물리적으로는 상기 메모리(8)의 특정 영역이 되거나 신호가공부(10)의 내부 메모리가 될 수 있음)에 시간축변환신호 y(·)로서 이미 기록된 상기 이전 프레임 F_m-1과 소정 샘플수만큼 중복되도록 부가하여 기록한다. 여기서 중복 기록되는 부분의 샘플은 가중치를 주고 더하며 비중복부분은 그대로 더한다. 또한, 중복부분의 샘플수는 재생시간조정율 α의 크기에 대응하여 증감하는 방식으로 부가한다. 따라서, 원래의 신호 x(·)는 그 시간축 길이가 상기 재생시간조정율 α에 비례하여 변환된다.In the time-base conversion step (step S16), the next frame F _m is converted into a time-base conversion signal y (· which may be physically a specific area of the memory 8 or an internal memory of the signal processing unit 10). ) Is recorded so as to overlap with the previous frame F _m-1 already recorded as a predetermined number of samples. Here, the sample of the duplicated part is weighted and added, and the non-duplicate part is added as it is. The number of samples of the overlapping portion is added in a manner of increasing and decreasing corresponding to the size of the reproduction time adjustment rate?. Thus, the original signal x (·) has its time axis length converted in proportion to the reproduction time adjustment rate α.

이와 같이 차등증폭과 시간축변환 처리에 의해 사용자가 설정한 재생시간조정율과 차등증폭율로 변환가공된 오디오데이터는 필터부(16)에 의한 필터링을 거쳐 버퍼부(18)에 기록된다(S18 단계). 차등증폭과 시간축변환 과정에서 원치 않는 노이즈가 도입될 수 있는데, 노이즈는 대개의 경우 고주파성분을 가지므로 저역통과필터링 처리를 함으로써 노이즈를 제거한다. 필터링은 프레임단위로도 할 수 있을 것이나, 효율적인 처리를 위해 하나의 패킷을 구성하는 여러 개의 프레임을 처리단위로 하는 것이 바람직하다. 필터링처리를 거친 패킷데이터는 출력버퍼(18)에 기록된다. 본 발명을 컴퓨터장치를 통해 구현하는 경우, 출력버퍼(18)는 다수개 마련하는 것이 바람직하다. 상기 시간축변환단계 및 상기 차등증폭단계를 통해 가공된 신호는 상기 다수개의 출력버퍼(18)에 기입하되 상기 패킷데이터를 기입하는 출력버퍼는 순환적으로 교체한다. 예컨대, 출력버퍼가 Buf_1, Buf_2, Buf_3, Buf_4 이렇게 4개라고 가정하면, 패킷데이터의 기입순서는 'Buf_1 -> Buf_2 -> Buf_3 -> Buf_4 -> Buf_1 ....'의 순서로 순환된다.In this way, the audio data converted into the reproduction time adjustment rate and the differential amplification ratio set by the user by the differential amplification and time-axis conversion processing are filtered by the filter unit 16 and recorded in the buffer unit 18 (step S18). . Undesired noise can be introduced during the differential amplification and time-base conversion process. Since the noise usually has a high frequency component, the low pass filtering process removes the noise. Filtering may be performed in units of frames, but for efficient processing, it is preferable to process a plurality of frames constituting one packet in units of processing. The filtered packet data is recorded in the output buffer 18. When the present invention is implemented through a computer device, it is preferable that a plurality of output buffers 18 be provided. The signals processed through the time-base conversion step and the differential amplification step are written to the plurality of output buffers 18, but the output buffers for writing the packet data are cyclically replaced. For example, assuming that there are four output buffers Buf_1, Buf_2, Buf_3, and Buf_4, the order of writing the packet data is circulated in the order of 'Buf_1-> Buf_2-> Buf_3-> Buf_4-> Buf_1 ....'.

출력버퍼(18)에 대한 패킷데이터의 기입과 병행하여, 상기 다수개의 출력버퍼에 기록된 패킷데이터는 기록순서가 앞선 출력버퍼의 패킷데이터가 먼저 오디오출력부(22)로 출력된다(S20 단계). 즉, 출력버퍼의 순환순서 역시 위의 패킷데이터의 기입순서와 동일한 순서이다. 오디오출력부(22)에 입력된 디지털데이터는 아날로그신호로 변환되고 증폭처리 등 음성출력에 필요한 후처리를 거쳐 스피커(24)로 제공된다.In parallel with the writing of the packet data to the output buffer 18, the packet data recorded in the plurality of output buffers is first outputted to the audio output unit 22 in the packet data of the output buffer prior to the recording order (step S20). . That is, the cyclic order of the output buffers is also the same as the above write order of packet data. The digital data input to the audio output unit 22 is converted into an analog signal and provided to the speaker 24 through post-processing necessary for audio output such as amplification processing.

도 3은 시간축변환단계(S16 단계)의 알고리즘을 도시한 흐름도이다. 도 4는 원래의 오디오신호 x(·)를 사용자가 설정한 재생시간조정율 ??의 크기에 대응하여 그 길이를 늘이거나 줄여 시간축변환신호 y(·)를 얻기 위한 하는 원리를 설명하기 위한 참고도이다. 도 4에서 이해를 돕기 위해, 프레임 Fm은 320개의 샘플을 포함하며, 재생시간조정율 α은 2이며, 원신호 x(·)의 각 프레임간 시작점의 기본차이값 Sa는 120(샘플)이며, 원신호 x(·)와 시간축변환신호 y(·)간의 파형의 유사성을기준으로 최적 상관성을 검출하기 위한 체크범위(윈도우) k_max를 상기 Sa값을 기준으로 ±40(샘플)인 경우를 예로 한다. 이들 값은 예시적인 값일 뿐 적용환경에 따라 다른 값으로 변경 가능함을 물론이다.3 is a flowchart showing an algorithm of a time axis conversion step (step S16). 4 is a reference diagram for explaining a principle of obtaining a time-axis conversion signal y (·) by increasing or decreasing the length of the original audio signal x (·) corresponding to the size of the playback time adjustment rate ?? set by the user. to be. For the sake of understanding in FIG. 4, the frame Fm includes 320 samples, the reproduction time adjustment rate α is 2, and the basic difference value Sa of the starting point between each frame of the original signal x (·) is 120 (sample). For example, a check range k _max for detecting an optimal correlation based on the similarity of a waveform between a signal x (·) and a time axis conversion signal y (·) is ± 40 (sample) based on the Sa value. . These values are exemplary values and can be changed to other values according to the application environment.

먼저, 제 1 메모리 즉, 메모리(8)로부터 원래의 오디오신호 x(·)의 최초 프레임 F0을 읽어와서 제 2 메모리(메모리(8) 또는 신호가공부(10)의 내부 메모리)에 시간축변환신호 y(·)로서 그대로 복사한다(S22 단계).First, the time-axis conversion signal is read from the first memory, that is, the first frame F0 of the original audio signal x (·) from the memory 8, and the second memory (memory 8 or the internal memory of the signal processing section 10). Copy as is (y) (step S22).

그런 다음에 프레임 인덱스 m의 값을 1로 설정한다(S24 단계).Then, the value of the frame index m is set to 1 (step S24).

이후에는 다음의 루프를 원신호 x(·)를 전부 변환완료 할 때까지 실행한다(S36 단계).Thereafter, the next loop is executed until the conversion of the original signal x (·) is completed (step S36).

우선, 원래의 신호 x(·)에서 다음 프레임 F1을 읽어내어 시간축변환신호 y(·)에 부가하여야 한다. 여기서, 다음 프레임 F1을 원래의 신호 x(·)로부터 읽어낼 때 의 시작위치는 가변적으로 결정된다. 그 기준은 시간축변환신호 y(·)에 복사되어 있는 이전 프레임 F0와의 동기지연(Synchronized Lag) K1에 의해 결정된다. 또한, 읽어낸 다음 프레임 F1의 시간축변환신호 y(·)로 부가하는 위치도 재생시간조정율 α의 크기에 따라 달라진다.First, the next frame F1 should be read from the original signal x (·) and added to the time axis converted signal y (·). Here, the start position when reading the next frame F1 from the original signal x (·) is variably determined. The criterion is determined by the synchronized lag K1 with the previous frame F0 copied in the time-axis conversion signal y (·). Also, the position to be read out and added as the time axis conversion signal y (·) of the frame F1 also varies depending on the magnitude of the reproduction time adjustment rate?.

이를 일반적으로 설명하면 다음과 같다. 동기지연 Km이란 일반적으로 시간축변환신호 y(·)와 최적의 상관성을 갖도록 원신호 x(·)의 다음 프레임의 시작 위치의 변동값을 의미한다. 동기지연 Km을 구하기 위해서는 우선 상기 상관성 체크범위의 하한값 mSa-40과 상한값 mSa+40 사이의 범위에 대하여 아래의 상관성을 구하는 식을 이용하여 동기지연 Km을 구한다. 동기지연 K_m은 소정의 범위 내에서 상기 다음 프레임 F_m이 상기 시간축변환신호 y(·)로서 이미 기록된 상기 이전 프레임 F_m-1과 최적 상관성을 갖도록 정하는 데 이용된다.This is generally described as follows. The synchronization delay Km generally means a change value of the start position of the next frame of the original signal x (·) so as to have an optimal correlation with the time-axis conversion signal y (·). In order to calculate the synchronization delay Km, first, the synchronization delay Km is obtained using the following equation for the range between the lower limit mSa-40 and the upper limit mSa + 40 of the correlation check range. The synchronization delay K _m is used to determine, within a predetermined range, that the next frame F _m has an optimum correlation with the previous frame F _m-1 already recorded as the time-base conversion signal y (·).

<수식 1><Equation 1>

<수식 2><Formula 2>

단, L은 인접 프레임간의 오버랩 부분의 샘플수를 의미한다.However, L means the number of samples of the overlap part between adjacent frames.

위의 두 수식을 이용하여 최적의 상관성을 갖는 동기지연 Km이 구해지면, 이 값을 이용하여 N개의 샘플을 포함하는 다음 프레임 F_m을 원래의 신호 x(·)로부터 읽어낸다. 다음 프레임 F_m의 읽기시작위치는 이전 프레임 F_m-1의 읽기시작위치에서 프레임시작간격 S_a에 동기지연 K_m을 가감한 값인 S_a±K_m(단, 0 < S_a±K_m< N)개의 샘플을 건너뛴 위치가 된다. 예컨대, 도 4를 참조하면, K₁, K₂, K₃가 각각 20, -10, 35로 정해진 경우, 두 번째, 세 번째 및 네 번째 프레임 F1, F2, F3의 읽기시작위치는 각각 140, 230, 395번째의 샘플이 된다. 물론 각 프레임의 샘플수는 항상 N개인 320개가 된다. 이와 같은 방식으로 원래의 신호 x(·)를 읽어내면 이전 프레임과 다음 프레임은 상당한 양의 샘플이 중복된다. 그리고 이 중복부분은 시간축변환신호 y(·)에서는 재생시간조정율 α의 크기에 따라 줄어들거나(α>1 인 경우) 늘어난다(α<1인 경우). 주목할 것은 각 프레임의 읽기시작점이 각 프레임간 시작점의 기본차이값 Sa와 프레임 인덱스 m의 곱에 의해 규칙적으로 변동하는 것이 아니라 수식 1과 2를 이용하여 결정된 최적상관성 Km의 크기에 따라 불규칙하게 변한다는 점이다.Using the two equations above the ground determined the Km synchronization delay with the best correlation, reads the next frame F _m, including N samples, using the values from the original signal x (·) of the. The read start position of the next frame F _m is S _a ± K _m, which is the value obtained by subtracting the synchronization delay K _m from the frame start interval S _a at the read start position of the previous frame F _m-1 , where 0 <S _a ± K _m < N) samples will be skipped. For example, referring to FIG. 4, when K ₁ , K ₂ , and K ₃ are set to 20, -10, and 35, respectively, the read start positions of the second, third, and fourth frames F1, F2, and F3 are 140, respectively. It is the 230th and 395th samples. Of course, the number of samples in each frame is always 320, which is N. When the original signal x (·) is read in this way, a significant amount of samples are duplicated in the previous frame and the next frame. This overlapping portion decreases (in the case of α> 1) or increases (in the case of α <1) in accordance with the magnitude of the reproduction time adjustment rate α in the time axis conversion signal y (·). Note that the reading start point of each frame does not vary regularly by the product of the basic difference value Sa and the frame index m of the starting point between each frame, but irregularly according to the size of the optimal correlation Km determined using Equations 1 and 2 Is the point.

이렇게 읽어낸 다음 프레임 F_m을 시간축변환신호 y(·)에 부가한다. 시간축변환신호 y(·)에 부가되는 다음 프레임 F_m의 시작위치는 프레임시작간격 S_a, 상기 재생시간조정율 α, 그리고 상기 각 프레임의 인덱스 m의 곱 mαS_a으로 정해진다. 그러므로 도 4에서는 αSa = 2x120 = 240 이므로 두 번째, 세 번째 및 네 번째 프레임 F1, F2, F3의 부가위치는 240, 480, 720이 된다. 부가시 다음 프레임 F_m의 앞부분은 이전 프레임 F_m-1의 뒷부분간에 중복부분이 발생한다. 양 프레임의 중복되는 부분은 아래 수식 3과 4를 각각 이용하여 가중치를 주고 더하고 중복되지 않는 다음 프레임 F_m의 나머지 부분은 그대로 복사한다.After this reading, the frame F _m is added to the time-axis conversion signal y (·). The start position of the next frame F _m added to the time-axis conversion signal y (·) is determined by the frame start interval S _a , the reproduction time adjustment rate α, and the product m αS _a of the index m of each frame. Therefore, in FIG. 4, since αSa = 2x120 = 240, additional positions of the second, third and fourth frames F1, F2, and F3 are 240, 480, and 720. In addition, the front part of the next frame F _m overlaps with the back part of the previous frame F _m-1 . The overlapping portions of both frames are weighted using Equations 3 and 4 below, and the remaining portions of the next non-overlapping frame F _m are copied as they are.

<수식 3><Equation 3>

<수식 4><Equation 4>

여기서, g(j)는 가중치 함수이다. 이 가중치 함수 g(j)는 알기 쉽게는 선형함수가 대표적이지만 지수함수를 적용할 수도 있을 것이다.Where g (j) is a weight function. This weight function g (j) is obviously a linear function, but an exponential function may be applied.

이와 같은 방식으로 원래의 신호 x(·)를 프레임단위로 읽어와서 시간축변환신호 y(·)에 부가해나간다. 이 과정에서 하나의 프레임을 처리할 때마다 프레임 인덱스 m의 값을 1씩 증가시킨다(S30 단계). 그리고, 프레임 인덱스 m의 크기가 하나의 패킷을 구성하는 프레임갯수 PL의 배수인지를 확인한다. 프레임 인덱스 m이 PL의 배수에 이르면 시간축변환신호 y(·)의 데이터 량이 패킷데이터량에 이른 상태이므로 이 경우 그 변환된 데이터를 저역통과필터링을 처리를 한 다음 버퍼부(18)에 기록한다(S32 및 S34 단계).In this manner, the original signal x (·) is read in units of frames and added to the time axis conversion signal y (·). In this process, each time one frame is processed, the value of the frame index m is increased by one (step S30). Then, it is checked whether the size of the frame index m is a multiple of the number of frames PL constituting one packet. When the frame index m reaches a multiple of PL, the data amount of the time-base conversion signal y (·) has reached the packet data amount. In this case, the converted data is subjected to low pass filtering and then recorded in the buffer unit 18 ( Steps S32 and S34).

그리고 원신호 x(·)의 끝까지 변환되었는지를 체크하면서 원신호 x(·)의 모든 데이터를 처리할 때까지 S26단계~S36단계로 구성되는 루프를 반복적으로 실행한다. 이와 같은 방식으로 사용자가 설정한 재생시간조정율 α에 따른 시간축 변환처리가 수행된다. 도 7의 (a)는 정규화된 원래의 신호 x(·)의 시간에 대한 파형도를 예시적으로 도시한 것이다. 이 신호를 그 길이가 2배로 늘어나도록(즉, 재생시간조정율 α가 2인 경우) 시간축변환을 하여 얻어진 신호 y(·)의 시간에 대한 파형도이다. 이들 파형을 비교해보면, 원래의 신호 x(·)는 시간축상에서 대략 2배정도 늘어났으면서도 그 형태적 특징이 거의 그대로 유지되는 것을 볼 수 있으며, 실제로 재생음을 청취해보면 양호한 음질을 느낄 수 있다.Then, the loop composed of steps S26 to S36 is repeatedly executed until all data of the original signal x (·) is processed while checking whether the signal has been converted to the end of the original signal x (·). In this manner, time axis conversion processing according to the reproduction time adjustment rate α set by the user is performed. FIG. 7A exemplarily shows a waveform diagram of time of a normalized original signal x (·). This waveform is a waveform diagram of the time of the signal y (·) obtained by time-base conversion so that its length is doubled (that is, when the reproduction time adjustment ratio? Is 2). Comparing these waveforms, it can be seen that the original signal x (·) is approximately doubled on the time axis, but its morphological characteristics remain almost intact, and it is possible to feel good sound quality by actually listening to the reproduction sound.

다음으로, 도 5는 약음을 강음보다 더 높은 증폭율을 적용하여 차등증폭하는 알고리즘을 설명하기 위한 흐름도이다.Next, FIG. 5 is a flowchart for explaining an algorithm for differentially amplifying by applying amplification higher than weak sound.

차등증폭 알고리즘은 샘플단위로 처리한다. 따라서, 이 알고리즘은 도 3의 시간축변환 알고리즘과 병행적으로 실행될 수 있는 특징이 있다. 예컨대, 시간축변환 알고리즘에 있어서, 원래의 신호 x(·)를 프레임단위로 읽어낸 다음 그 프레임의 각 샘플에 대하여 차등증폭 알고리즘을 수행한 다음 시간축변환신호 y(·)로 변환하는 것이 하나의 가능한 방안이다. 또 다른 선택 가능한 방안으로서는 시간축변환신호 y(·)가 패킷데이터량만큼 모이면 이를 저역통과필터링 처리를 하는데 이 저역통과필터링에 앞서서 시간축변환신호 y(·)에 대한 샘플단위의 차등증폭을 실행하는 방안이다. 물론 차등증폭을 저역통과필터링 후에 하는 것도 불가능한 것은 저역통과필터링은 노이즈제거라는 목적을 가지므로 차등증폭을 저역통과필터링보다 앞세우는 것이 효과적일 것이다.The differential amplification algorithm handles samples. Therefore, this algorithm has a feature that can be executed in parallel with the time-base conversion algorithm of FIG. For example, in the time-base conversion algorithm, it is possible to read the original signal x (·) in units of frames, perform a differential amplification algorithm on each sample of the frame, and then convert it into a time-base conversion signal y (·). That's the way. As another selectable method, when the time-axis conversion signal y (·) is collected by the packet data amount, low pass filtering is performed, and before the low-pass filtering, the differential amplification of the sample unit for the time-axis conversion signal y (·) is executed. That's the way. Of course, it is impossible to perform differential amplification after low pass filtering, so it is effective to put differential amplification ahead of low pass filtering because low pass filtering has the purpose of removing noise.

도 5를 참조하여 차동증폭 알고리즘을 구체적으로 설명하면 다음과 같다.The differential amplification algorithm will be described in detail with reference to FIG. 5 as follows.

차등증폭을 하는 방법은 어떤 함수를 차등증폭함수로 적용할 것인가, 그리고 샘플데이터의 값에 따라 차등증폭대상을 구별할 것인가 등에 따라서 여러 가지 다양한 형태로 변형될 수 있다. 기본적으로 차등증폭함수 f(x_i)는 음의 세기가 약한 음에서 강한 음으로 갈수록 증폭이득이 상대적으로 낮게 적용되는 함수들 중에서 선택되어야 한다. 이러한 요건을 만족시키는 함수는 단조증가함수로서 예컨대 도 6의 (a)에 예시된 바와 같이 f(x)=x인 선형함수나 혹은 f(x)=x^1/β인 지수함수가 이에 해당된다. 샘플데이터를 이들 함수의 변수로 대입하여 그 함수값을 샘플데이터 대신에 취하면 약음은 강음에 비해 차등적으로 더 강화된다.The differential amplification method can be modified in various forms depending on which function is applied as the differential amplification function and whether the differential amplification object is to be distinguished according to the value of the sample data. Basically, the differential amplification function f (x _i ) should be selected from among functions in which the amplification gain is applied relatively low from the weak sound to the strong sound. A function that satisfies this requirement is a monotonically increasing function, for example, a linear function with f (x) = x or an exponential function with f (x) = x ^{1 / β} , as illustrated in FIG. . Substituting the sample data into the variables of these functions and taking the value of the function in place of the sample data, the weakness is strengthened differentially compared to the strong sound.

도 6의 (b)는 문턱값 x_th을 기준으로 이 값보다 더 큰 값의 샘플데이터만을 선택적으로 차등증폭하고 문턱값 x_th이하의 샘플데이터는 원래의 값을 그대로 유지하는 방안을 적용할 경우를 도시한 것이다.(B) of Figure 6 if only the sample data of a larger value than a value optionally differential amplifier based on the threshold value x _th and apply the methods to the sample data below a threshold value x _th retains its original value as it is It is shown.

도 6의 (c)는 문턱값 x_th을 기준으로 이 값보다 더 큰 값의 샘플데이터는 차등증폭함수 f(x_i) = x_i ^1/β를 이용하여 함수값을 연산하고 문턱값보다 작은 값을 갖는 샘플데이터는 차등증폭함수 f(x_i) = x_i ^β를 이용하여 함수값을 각각 연산하여 원래의 샘플데이터 x_i를 대체하는 방안을 적용하는 경우를 도시한다.6 (c) shows that the sample data having a value larger than this value based on the threshold value x _th is used to calculate a function value using the differential amplification function f (x _i ) = x _i ^{1 / β} and the value smaller than the threshold value. The sample data having a value shows a case of applying a method of replacing the original sample data x _i by calculating a function value using a differential amplification function f (x _i ) = x _i ^β .

물론 샘플데이터 x_i를 문턱값 x_th과 비교하는 절차를 거치지 않고 모든 샘플데이터를 차등증폭함수로 연산하여 그 함수값으로 변환하는 방안도 가능하다.Of course, instead of going through the procedure of comparing the sample data x _i with the threshold value x _th , it is also possible to calculate all the sample data as a differential amplification function and convert it into the function value.

도 5의 흐름도는 이들 방안중 도 6의 (c)에 따른 차등증폭을 하는 경우의 실행 알고리즘의 순서도이다.5 is a flowchart of an execution algorithm in the case of performing differential amplification according to FIG. 6C among these methods.

이 흐름도를 참조하여 설명하면, 우선 처리대상 샘플 x_i의 크기를 문턱값 x_th와 비교한다(S40 단계). 이 문턱값 x_th은 사용자가 설정할 수도 있고 시스템에 의해 미리 정의해둘 수도 있다.Referring to this flowchart, first, the size of the sample to be processed x _i is compared with a threshold value x _th (step S40). This threshold x _th can be set by the user or predefined by the system.

상기 비교의 결과, 샘플데이터 x_i가 문턱값 x_th보다 클 때에는 샘플데이터 x_i는 함수값 x_i ^1/β로 대체되고, 반대로 샘플데이터 x_i가 문턱값 x_th보다 작을 때에는 샘플데이터 x_i는 함수값 x_i ^β로 대체된다(S42, S44 단계). x_i가 문턱값 x_th보다 작은 경우 차등증폭함수를 S42단계처럼 지수함수를 적용하면 작은 음값에 의한 노이즈는효과적으로 약화시킬 수 있다는 장점이 있는 반면 계산량이 많아지는 단점도 있다. 반면에 도 6의 (b)의 경우처럼 선형함수를 차등증폭함수로서 적용하면 계산량은 상대적으로 작아지는 효과가 있다. 원래의 신호 x(·)가 녹음실과 같은 소음이 없는 공간에서 기록된 것이라면 x_i가 문턱값 x_th보다 작은 경우 굳이 지수함수를 적용할 필요까지는 없을 것이다.As a result of the comparison, when the sample data x _i is larger than the threshold value x _th , the sample data x _i is replaced by the function value x _i ^{1 / β} , and conversely, when the sample data x _i is smaller than the threshold value x _th , the sample data x _{i is used.} Is replaced by the function value x _i ^β (steps S42 and S44). If x _i is smaller than the threshold x _th , applying the exponential function to the differential amplification function as in step S42 has the advantage that the noise due to a small negative value can be effectively attenuated. On the other hand, when the linear function is applied as the differential amplification function as in the case of FIG. 6 (b), the amount of calculation is relatively small. If the original signal x (·) was recorded in a noise-free room such as a recording studio, it would not be necessary to apply an exponential function if x _i was less than the threshold x _th .

매 샘플데이터를 위와 같이 처리한 다음에는 샘플인덱스 i가 패킷데이터의 샘플 개수 PS의 정수 배를 초과한지를 체크하고 (S46 단계), 처리된 샘플데이터의 수가 아직 하나의 패킷에 이르지 못하였으면 샘플인덱스 i만 1을 증가시킨 다음(S48 단계), 다음 샘플데이터에 대하여 새로이 차등증폭함수를 이용한 연산을 반복한다.After each sample data is processed as above, it is checked whether the sample index i exceeds an integer multiple of the sample number PS of the packet data (step S46), and if the number of processed sample data has not yet reached one packet, the sample index is Only i is increased by 1 (step S48), and the operation using the new differential amplification function is repeated for the next sample data.

처리된 샘플데이터의 수가 패킷데이터량에 이르면 차등증폭된 데이터집합 {x_i}를 노이즈제거를 위한 저역통과필터링 처리를 한다(S50 단계). 그리고 필터링 처리를 거친 데이터는 버퍼부(18)에 기록되고 결국 오디오출력부(22)를 통해 아날로그신호를 변환되어 스피커(24)로 출력된다(S52 단계). 물론 이 차등증폭 알고리즘이 시간축변환 알고리즘의 일부로 배치하는 경우에는 차등증폭함수를 이용한 샘플데이터의 연산단계가 편입될 뿐이다.When the number of processed sample data reaches the packet data amount, the low-pass filtering process for noise reduction is performed on the differentially amplified data set {x _i } (step S50). Then, the filtered data is recorded in the buffer unit 18, and finally, the analog signal is converted through the audio output unit 22 and output to the speaker 24 (step S52). Of course, when the differential amplification algorithm is arranged as part of the time-base transformation algorithm, the calculation step of the sample data using the differential amplification function is only incorporated.

이와 같은 처리루틴은 원신호의 모든 샘플데이터를 다 처리할 때까지 계속 적으로 반복된다(S54, S48 단계).This processing routine is repeated repeatedly until all the sample data of the original signal is processed (steps S54 and S48).

도 7의 (c)는 도 7의 (a)의 원신호 x(·)를 소정의 차등증폭율을 적용하여실제로 차등증폭하여 얻은 신호의 파형도이다. 이들 두 파형도를 대비해보면 음의 세기 즉 진폭이 큰 부분보다는 진폭이 낮은 부분이 상대적으로 더 많이 증폭되었음을 확인할 수 있다. 이는 목적한 바의 결과와 일치하고 있다.FIG. 7C is a waveform diagram of a signal obtained by actually amplifying the original signal x (·) of FIG. 7A by applying a predetermined differential amplification factor. By comparing these two waveform diagrams, it can be seen that the low amplitude parts, rather than the loud intensity parts, are relatively more amplified. This is consistent with the intended result.

이상에서는 디지털 오디오신호의 재생시간을 늘리거나 줄이기 위한 신호가공방법과 약음을 차등적으로 더 강화시키기 위한 신호가공방법을 설명하였다. 이들 두 방법은 병행적으로 실행 가능하므로 여러 가지 장점을 갖는다. 예컨대, 본 발명의 오디오신호 가공방법을 외국어 청취력 학습을 위한 장치에 적용하면, 사용자는 자신의 청취능력에 맞게 오디오신호의 재생속도를 조절할 수 있을 뿐만 아니라 잘 들리지 않는 약음도 정확하게 알아들 수 있어서 학습효과를 크게 높여줄 수 있을 것으로 기대된다.In the above, the signal processing method for increasing or decreasing the reproduction time of the digital audio signal and the signal processing method for further strengthening the weak sound have been described. Both of these methods can be implemented in parallel and therefore have several advantages. For example, if the audio signal processing method of the present invention is applied to a device for learning foreign language listening ability, the user can not only adjust the reproduction speed of the audio signal according to his or her listening ability, but also can accurately recognize the inaudible sound. The effect is expected to be greatly improved.

본 발명에 따른 오디오신호 처리방법은 컴퓨터, 오디오 카세트 테이프 레코더는 물론 그 밖의 오디오신호 재생장치에 광범위하게 응용되어 청취력이 약한 사용자의 청취력 보조수단으로 활용될 수 있을 것이다.The audio signal processing method according to the present invention may be widely applied to a computer, an audio cassette tape recorder, as well as other audio signal reproducing apparatus, and may be used as a listening power assistance means of a user with low listening power.

또한, 본 발명은 오디오신호의 시간축변환을 위한 가공은 프레임단위로 하지만 스피커가 있는 오디오출력부로는 여러 개의 프레임을 하나로 묶은 패킷단위로 제공하고, 또한 여러 개의 출력버퍼를 사용하여 오디오출력부로 데이터를 공급해주므로써 오디오신호가 스피커로 원활하게 공급된다. 그 결과 재생음은 끊어지지 않고 그 음질이 대단히 양호하게 나타난다.In addition, the present invention provides the processing for the time-base conversion of the audio signal in the frame unit, but the audio output unit with the speaker provides a plurality of frames in a single packet unit, and the data output to the audio output unit using a plurality of output buffers The audio signal is smoothly supplied to the speaker by supplying it. As a result, the reproduction sound is not cut off and the sound quality is very good.

이상에서는 본 발명의 실시예에 따라 본 발명이 설명되었지만, 본 발명의 사상을 일탈하지 않는 범위 내에서 다양한 변형이 가능함은 본 발명이 속하는 기술 분야의 당업자라면 명확히 인지할 수 있을 것이다.Although the present invention has been described above according to an embodiment of the present invention, it will be apparent to those skilled in the art that various modifications may be made without departing from the spirit of the present invention.

Claims

In the method for processing a digital audio signal expressing the intensity of the sound with respect to time as an integer stream,

A playback time adjustment rate α and a differential amplification rate β, which are input values designated by a user, are read from the user interface, wherein the reproduction time adjustment rate α is for adjusting the reproduction time of the audio signal longer or shorter than the normal reproduction time. a predetermined value reading step determined in a range of _min ≦ α ≦ α _max , wherein the differential amplification ratio β is for reinforcing a weak sound of an audio signal relatively more than a loud sound, and setting a predetermined value within a range of β _min ≦ β ≦ β _max ;

The original audio signal x (·) stored in the first memory is continuously read in units of N samples, and the next frame F _m is smaller than the N at the start of the previous frame F _m-1 . A data reading step of reading a predetermined number of samples in duplicate between adjacent frames by reading from a sample of a position where a sample of the skipped position is skipped;

The next frame F _{m is added} to the second memory so as to overlap the previous frame F _m-1 already recorded as the time-base conversion signal y (·) by a predetermined number of samples, and the samples of the overlapped portion are weighted, added, and not duplicated. The portion is added as it is, and the number of samples of the overlapped portion is added in such a manner as to increase or decrease in correspondence with the magnitude of the reproduction time adjustment rate α so that the time axis for obtaining a signal y (·) whose time axis length is converted in proportion to the reproduction time adjustment rate α is obtained. Conversion step; And

Sample data x _i of each frame is amplified by substituting a predetermined differential amplification function f (x _i ) corresponding to the differential amplification ratio β, in particular, and the differential amplification function f (x _i ) is amplified from weak to strong. Including a differential amplification step where the gain is a relatively low applied function that enhances the weakness differentially compared to strong sound,

And processing the reproduction time and gain control by differential amplification in parallel with the digital audio signal according to a user-specified value.

The audio signal processing method according to claim 1, further comprising a normalization step of normalizing sample data of each frame read in the data reading step.

The method of claim 1, wherein a plurality of output buffers are provided, and a plurality of frames of a signal processed through the time-base conversion step and the differential amplification step are bundled into one packet and written in one of the plurality of output buffers. In particular, the output buffer for writing the packet data is cyclically replaced, and in parallel, the packet data recorded in the plurality of output buffers is output such that the packet data of the output buffer prior to the recording order is first output to the audio output unit including the speaker. The audio signal processing method further comprises a data output step of controlling.

The audio signal processing method according to claim 1, further comprising a filtering step of removing noise by performing low pass filtering on the differentially amplified sample data for each predetermined data amount.

The read start position of the next frame F _m in relation to the data reading step is _a synchronous delay to the frame start interval S _a at the read start position of the previous frame F _m-1 . value of the acceleration a K _m S _a ± K _m _{_{(stage, 0 <S a ± K m}} <N) is the position skipped samples, the synchronization delay K _m is the next frame F within a predetermined range of the _m is the And an optimum correlation with the previous frame F _m-1 already recorded as the time-axis conversion signal y (·).

The start position of each frame added to the time axis conversion signal y (·) in the time axis conversion step is the frame start interval S _a , the reproduction time adjustment rate α, and the An audio signal processing method, characterized by a product mαS _a of the index m of each frame.

The audio signal processing method according to any one of claims 1 to 4, wherein the differential amplification function f (x _i ) is a monotonically increasing function.

8. The audio signal processing method according to claim 7, wherein the differential amplification function f (x _i ) is a function expressed by f (x _i ) = x _i ^{1 / β} .

The differential amplification operation using the differential amplification function f (x _i ) of the differential amplifying step is read in the data reading step prior to the time-base transform processing in the time-base converting step. An audio signal processing method, characterized in that performed for each frame.

The differential amplification operation using the differential amplification function of the differential amplification step is performed on the time axis transform signal y (·) obtained after the time axis transform processing in the time axis transform step. Audio signal processing method characterized in that.

5. The audio signal processing method according to any one of claims 1 to 4, wherein the reproduction time adjustment rate alpha has a value of 0.7 or more and 3 or less, and the differential amplification factor beta has a value of 1 or more and 3 or less.

Reading a value of the differential amplification ratio β (where 1 <β≤3) specified by the user from the user interface; And

Comparing the size of each sample data x _i of the integer stream with a threshold value x _th of a predetermined size; and

The comparison, the sample data of the sample data x _i is greater than the chin value x _th of a given size, each of the samples by replacing the value of a data x _i to x _i ^{1 / β,} a value greater than the threshold value x _th And a differential amplifying step for reinforcing the weak sound relative to the strong sound for the field.

(delete)

The method of claim 12, wherein the audio signal processing method, the comparison of the results, each of the sample data x _i is less than the threshold value x _th the sample data x _i is more steps are replaced by x _i ^{1 / β} Audio signal processing method comprising the.

The audio signal processing method according to claim 12, further comprising: performing low pass filtering on the packet data when the differentially amplified sample data is collected by the packet data amount.

(delete)