TW202241148A - Audio signal processing method and audio signal processing apparatus - Google Patents

Audio signal processing method and audio signal processing apparatus Download PDF

Info

Publication number
TW202241148A
TW202241148A TW110113007A TW110113007A TW202241148A TW 202241148 A TW202241148 A TW 202241148A TW 110113007 A TW110113007 A TW 110113007A TW 110113007 A TW110113007 A TW 110113007A TW 202241148 A TW202241148 A TW 202241148A
Authority
TW
Taiwan
Prior art keywords
audio
channel audio
center
right channel
left channel
Prior art date
Application number
TW110113007A
Other languages
Chinese (zh)
Other versions
TWI839606B (en
Inventor
李鵬
楊立顥
Original Assignee
英霸聲學科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 英霸聲學科技股份有限公司 filed Critical 英霸聲學科技股份有限公司
Priority to TW110113007A priority Critical patent/TWI839606B/en
Priority claimed from TW110113007A external-priority patent/TWI839606B/en
Priority to JP2022063937A priority patent/JP7332745B2/en
Priority to US17/716,339 priority patent/US20220329957A1/en
Publication of TW202241148A publication Critical patent/TW202241148A/en
Application granted granted Critical
Publication of TWI839606B publication Critical patent/TWI839606B/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/04Circuits for transducers, loudspeakers or microphones for correcting frequency response
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/04Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/05Generation or adaptation of centre channel in multi-channel audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Abstract

An audio signal processing method is provided. The method separates a left and a right channel audio signal to a center-left and center-right channel audio signals, and a side-left and side-right channel audio signals, and processes the center and side channel audio signals by a Head Related Transfer Functions (HRTF) so as to simulate the audio signals into several audio source positions related to a user, then synthesizes the processed audio signals into dual channel audio signals according to the user's hearing characteristics by a frequency compensation process. The invention further provides an audio signal processing apparatus to execute the audio signal processing method.

Description

音訊處理方法以及音訊處理裝置Audio processing method and audio processing device

本發明係關於一種音訊處理技術,特別是關於一種具有將雙聲道音訊模擬為多聲道音訊的音訊處理方法以及音訊處理裝置。The invention relates to an audio processing technology, in particular to an audio processing method and an audio processing device for simulating two-channel audio into multi-channel audio.

現今多媒體資訊日益發達,不管是電影、電視劇或是遊戲等都與人們的生活密不可分。也因為如此,人們在通勤時經常會戴著耳機來聆聽音樂、觀賞電影或是電視劇,也會戴著耳機來進行3D遊戲以獲得更身歷其境的音效體驗。然而,一般傳統的雙聲道耳機僅能帶給使用者兩個聲道的聆聽體驗,這樣一來當在觀賞電影或電視劇時,使用者所聽到的聲音並不會有環繞感,或是在進行遊戲時,也無法辨別來自多個方位的聲音。除此之外,每一位使用者的聽力表現都不盡相同,不同的使用者對於不同的聲音頻率也會各自有著不同的反應表現。因此,若有一種耳機能夠將雙聲道音源處理為多聲道,且能夠應不同使用者的聽力特性調整輸出聲音的頻率,必能帶給使用者更佳的聆聽體驗。Nowadays, multimedia information is increasingly developed, whether it is movies, TV dramas or games, etc. are inseparable from people's lives. Because of this, people often wear headphones to listen to music, watch movies or TV shows when commuting, and also wear headphones to play 3D games for a more immersive sound experience. However, the general traditional binaural earphones can only give the user two-channel listening experience, so that when watching movies or TV shows, the sound heard by the user will not have a sense of surround, or in When playing games, it is also impossible to distinguish sounds from multiple directions. In addition, each user's hearing performance is different, and different users will have different responses to different sound frequencies. Therefore, if there is an earphone that can process a two-channel sound source into multiple channels, and can adjust the frequency of the output sound according to the hearing characteristics of different users, it will definitely bring users a better listening experience.

本發明提供一種音訊處理方法,能夠將雙聲道音源處理為多聲道,且能因應不同使用者而補償不同頻率的音訊The present invention provides an audio processing method, which can process a two-channel audio source into multi-channel, and can compensate audio of different frequencies in response to different users

本發明另提供一種音訊處理裝置,可執行上述的音訊處理方法The present invention further provides an audio processing device capable of executing the above-mentioned audio processing method

本發明提出一種音訊處理方法,包括:將左聲道音訊分離為中央左聲道音訊以及側邊左聲道音訊;將右聲道音訊分離為中央右聲道音訊以及側邊右聲道音訊;將中央左聲道音訊以及中央右聲道音訊進行中央頭部轉移函數處理,以將中央左聲道音訊以及中央右聲道音訊模擬為相對於使用者之第一音源位置以及第二音源位置;將側邊左聲道音訊以及側邊右聲道音訊進行側邊頭部轉移函數處理,以將側邊左聲道音訊以及側邊右聲道音訊模擬為相對於使用者之第三音源位置以及第四音源位置;以及將中央頭部函數轉移以及側邊頭部轉移函數所處理後之音訊根據使用者之聽力特性進行頻率補償以合成為雙聲道音訊。The present invention proposes an audio processing method, including: separating left channel audio into center left channel audio and side left channel audio; separating right channel audio into center right channel audio and side right channel audio; subjecting the center left channel audio and the center right channel audio to the center head transfer function, so as to simulate the center left channel audio and the center right channel audio as the first sound source position and the second sound source position relative to the user; subjecting the side left channel audio and the side right channel audio to the side head transfer function to model the side left channel audio and the side right channel audio as a third source position relative to the user and The fourth sound source position; and performing frequency compensation on the audio processed by the central head function transfer and the side head transfer function according to the user's hearing characteristics to synthesize binaural audio.

本發明另提出一種音訊處理裝置,包括聲道分離單元、音訊運算單元以及音訊合成單元。聲道分離單元接收左聲道音訊以及右聲道音訊,將左聲道音訊分離為中央左聲道音訊以及側邊左聲道音訊,並將右聲道音訊分離為中央右聲道音訊以及側邊右聲道音訊。音訊運算單元將中央左聲道音訊以及中央右聲道音訊進行中央頭部轉移函數處理,以將中央左聲道音訊以及中央右聲道音訊模擬為相對於使用者之第一音源位置以及第二音源位置,且將側邊左聲道音訊以及側邊右聲道音訊進行側邊頭部轉移函數處理,以將側邊左聲道音訊以及側邊右聲道音訊模擬為相對於使用者之第三音源位置以及第四音源位置。音訊合成單元將中央頭部函數轉移以及側邊頭部轉移函數所處理後之音訊根據使用者之聽力特性進行頻率補償以合成為雙聲道音訊。The present invention further proposes an audio processing device, which includes a channel separation unit, an audio computing unit, and an audio synthesis unit. The channel separation unit receives left channel audio and right channel audio, separates left channel audio into center left channel audio and side left channel audio, and separates right channel audio into center right channel audio and side Right channel audio. The audio computing unit processes the center left channel audio and the center right channel audio through the center head transfer function, so as to simulate the center left channel audio and the center right channel audio as relative to the user's first sound source position and second The position of the sound source, and the side left channel audio and the side right channel audio are processed by the side head transfer function, so that the side left channel audio and the side right channel audio are simulated as the first relative to the user Three sound source positions and a fourth sound source position. The audio synthesis unit performs frequency compensation on the audio processed by the central head function transfer and the side head transfer function according to the user's hearing characteristics to synthesize binaural audio.

於一些實施例中,音訊處理方法包含撥放多個具有不同頻率的音訊給使用者,並因應這些不同頻率的音訊產生多個頻率響應值以獲得使用者的聽力特性,接著將這些頻率響應值與預設值進行比較,進而產生至少一個頻率響應差值,接著根據這些頻率響應差值而補償不同頻率的音訊。In some embodiments, the audio processing method includes playing multiple tones with different frequencies to the user, and generating multiple frequency response values in response to the tones with different frequencies to obtain the hearing characteristics of the user, and then converting the frequency response values Compared with the preset value, at least one frequency response difference is generated, and then audio signals of different frequencies are compensated according to the frequency response difference.

於一些實施例中,音訊處理裝置更包括音訊撥放單元以及比較單元。音訊播放單元播放多個具有不同頻率的音訊給使用者,前述的音訊運算單元因應者些具有不同頻率的音訊產生多個頻率響應值以獲得使用者之聽力特性。比較單元比較這些頻率響應值與至少一個預設值以產生至少一個頻率響應差值,前述的音訊運算單元根據這些頻率響應差值而補償具有不同頻率之音訊。In some embodiments, the audio processing device further includes an audio playback unit and a comparison unit. The audio playing unit plays a plurality of audios with different frequencies to the user, and the aforementioned audio computing unit generates a plurality of frequency response values in response to the audios with different frequencies to obtain the hearing characteristics of the user. The comparison unit compares these frequency response values with at least one preset value to generate at least one frequency response difference, and the aforementioned audio computing unit compensates the audio with different frequencies according to the frequency response difference.

綜上所述,本發明提供的音訊處理方法以及音訊處理裝置可以將左右聲道音訊處理為四個不同來源方位的音訊,並且能夠根據使用者的聽力表現而對不同頻率的音訊進行補償,使得雙聲道音訊也能夠具備環繞音效的聆聽體驗。To sum up, the audio processing method and the audio processing device provided by the present invention can process the audio from the left and right channels into audio from four different source directions, and can compensate the audio of different frequencies according to the hearing performance of the user, so that Binaural audio can also provide a surround sound listening experience.

下文將進一步揭露本發明之特徵、目的及功能。然而,以下所述者,僅為本發明之實施例,當不能以之限制本發明之範圍,即但凡依本發明申請專利範圍所作之均等變化及修飾,仍將不失為本發明之要意所在,亦不脫離本發明之精神和範圍,故應將視為本發明的進一步實施態樣。The features, purpose and functions of the present invention will be further disclosed below. However, what is described below is only an embodiment of the present invention, and should not be used to limit the scope of the present invention, that is, all equivalent changes and modifications made according to the patent scope of the present invention will still be the gist of the present invention. It does not depart from the spirit and scope of the present invention, so it should be regarded as a further embodiment of the present invention.

圖1係繪示依據本發明一實施例之音訊處理方法的流程圖。如圖1所示,本發明的音訊處理方法包括步驟101-105。步驟101:將左聲道音訊分離為中央左聲道音訊以及側邊左聲道音訊。步驟102:將右聲道音訊分離為中央右聲道音訊以及側邊右聲道音訊。步驟103:將中央左聲道音訊以及中央右聲道音訊進行中央頭部轉移函數處理,以將中央左聲道音訊以及中央右聲道音訊模擬為相對於使用者之第一音源位置以及第二音源位置。步驟104:將側邊左聲道音訊以及側邊右聲道音訊進行側邊頭部轉移函數處理,以將側邊左聲道音訊以及側邊右聲道音訊模擬為相對於使用者之第三音源位置以及第四音源位置。步驟:105:以及將中央頭部函數轉移以及側邊頭部轉移函數所處理後之音訊根據使用者之聽力特性進行頻率補償以合成為雙聲道音訊。FIG. 1 is a flowchart illustrating an audio processing method according to an embodiment of the present invention. As shown in FIG. 1 , the audio processing method of the present invention includes steps 101-105. Step 101: Separate the left channel audio into center left channel audio and side left channel audio. Step 102: Separate the right channel audio into center right channel audio and side right channel audio. Step 103: Process the center left channel audio and the center right channel audio with the center head transfer function, so as to simulate the center left channel audio and the center right channel audio as relative to the user's first sound source position and second The location of the sound source. Step 104: Perform side head transfer function processing on the side left channel audio and the side right channel audio, so as to simulate the side left channel audio and the side right channel audio as the third relative to the user The position of the sound source and the position of the fourth sound source. Step: 105: and performing frequency compensation on the audio processed by the central head function transfer and the side head transfer function according to the user's hearing characteristics to synthesize binaural audio.

圖2係繪示依據本發明一實施例之音訊處理裝置的方塊圖。圖3係繪示依據本發明一實施例之音訊來源分佈的示意圖。以下將以圖2的方塊來說明本發明的音訊處理裝置如何執行圖1的音訊處理方法。請參照圖1、圖2及圖3。如圖2所示,音訊處理裝置200包括立體聲分離單元201、等化器202、聲道分離單元203、音訊運算單元204以及音訊合成單元205。立體聲分離單元201接收立體聲音訊SA並將其分離為左聲道音訊L以及右聲道音訊R。在此實施例中,立體聲音訊SA例如是包含左聲道音訊L以及右聲道音訊R,但本發明並不以此為限,其也可包含更多聲道。等化器202接收左聲道音訊L以及右聲道音訊R,經過等化器202處理後的左聲道音訊L以及右聲道音訊R可加強其低音效果,進而產生低音效果較佳的左聲道音訊L_Eq以及右聲道音訊R_Eq。聲道分離單元203接收左聲道音訊L_Eq以及右聲道音訊R_Eq,並將左聲道音訊L_Eq分離為中央左聲道音訊Cent_L以及側邊左聲道音訊Side_L,並將右聲道音訊R_Eq分離為中央右聲道音訊Cent_R以及側邊右聲道音訊Side_R。FIG. 2 is a block diagram of an audio processing device according to an embodiment of the present invention. FIG. 3 is a schematic diagram illustrating the distribution of audio sources according to an embodiment of the present invention. Hereinafter, the blocks in FIG. 2 will be used to illustrate how the audio processing device of the present invention executes the audio processing method in FIG. 1 . Please refer to Figure 1, Figure 2 and Figure 3. As shown in FIG. 2 , the audio processing device 200 includes a stereo separation unit 201 , an equalizer 202 , a channel separation unit 203 , an audio computing unit 204 and an audio synthesis unit 205 . The stereo splitting unit 201 receives the stereo audio SA and splits it into left channel audio L and right channel audio R. In this embodiment, the stereo audio SA includes, for example, the left channel audio L and the right channel audio R, but the present invention is not limited thereto, and it may also include more channels. The equalizer 202 receives the left-channel audio L and the right-channel audio R, and the left-channel audio L and the right-channel audio R processed by the equalizer 202 can enhance their bass effects, thereby producing left-channel audio with better bass effects. channel audio L_Eq and right channel audio R_Eq. The channel separation unit 203 receives the left channel audio L_Eq and the right channel audio R_Eq, and separates the left channel audio L_Eq into the center left channel audio Cent_L and the side left channel audio Side_L, and separates the right channel audio R_Eq It is the center right channel audio Cent_R and the side right channel audio Side_R.

具體而言,中央左聲道音訊Cent_L以及中央右聲道音訊Cent_R對於使用者來說相當於來自於正前方的左右聲源,而側邊左聲道音訊Side_L以及側邊右聲道音訊Side_R則相當於來自於側邊的左右聲源。特別一提的是,上述的左聲道聲源以及右聲道聲源分別是使用者的左耳及右耳所聆聽到的音訊。音訊運算單元204會接收上述的中央左聲道音訊Cent_L、中央右聲道音訊Cent_R、側邊左聲道音訊Side_L以及側邊右聲道音訊Side_R,並將其分別進行中央頭部轉移函數以及側邊頭部轉移函數處理。頭部轉移函數(Head Related Transfer Functions, HRTF)是一種音效定位演算法,關於其定位及演算的過程是本領域技術者所知悉的,故於此不再贅述。透過頭部轉移函數的運算,中央左聲道音訊Cent_L、中央右聲道音訊Cent_R、側邊左聲道音訊Side_L以及側邊右聲道音訊Side_R會被虛擬為相對於使用者300的音源位置301-304,如圖3所示。音訊合成單元205會接收前述經過頭部轉移函數處理後的音訊Cent_LH, Cent_RH, Side_LH, Side_RH,再根據使用者的聽力特性對所接收到的音訊進行頻率補償之後合成為雙聲道音訊,如此一來,即便是透過雙聲道耳機,使用者也可以聆聽到相當於多聲道的環繞音效。Specifically, the center left channel audio Cent_L and the center right channel audio Cent_R are equivalent to the left and right sound sources from the front for the user, while the side left channel audio Side_L and the side right channel audio Side_R are Equivalent to left and right sound sources from the sides. In particular, the above-mentioned left channel sound source and right channel sound source are the audio heard by the user's left ear and right ear respectively. The audio computing unit 204 receives the above-mentioned center left channel audio Cent_L, center right channel audio Cent_R, side left channel audio Side_L, and side right channel audio Side_R, and performs center head transfer function and side Edge head transfer function processing. Head Related Transfer Functions (HRTF) is a sound localization algorithm. The process of its localization and calculation is known to those skilled in the art, so it will not be repeated here. Through the operation of the head transfer function, the center left channel audio Cent_L, the center right channel audio Cent_R, the side left channel audio Side_L, and the side right channel audio Side_R will be virtualized as the sound source position 301 relative to the user 300 -304, as shown in Figure 3. The audio synthesizing unit 205 will receive the aforementioned audio Cent_LH, Cent_RH, Side_LH, Side_RH processed by the head transfer function, and then perform frequency compensation on the received audio according to the user's hearing characteristics to synthesize a two-channel audio. Now, even through binaural headphones, users can listen to the equivalent of multi-channel surround sound.

進一步而言,音訊處理裝置200例如是雙聲道耳機,立體聲分離單元201、等化器202、聲道分離單元203、音訊運算單元204以及音訊合成單元205則是耳機內部的獨立或是整合後之元件、電路或晶片。音訊處理裝置200更包含音訊播放單元以及比較單元(圖未示),音訊播放單元會播放多個具有不同頻率的音訊給使用者,使用者聆聽後會因應這些具有不同頻率的音訊作出反饋,藉此產生多個可以代表使用者之聽力個性的頻率響應值。比較單元接著將這些頻率響應值與一個預設值做比較而產生至少一個頻率響應差值,這個頻率響應差值代表的意義在於使用者對於某些頻率的聲音可能辨識能力較弱,因此透過音訊運算單元204可以根據頻率響應差值而補償對應的音訊,如此一來,使用者在聆聽時不但能有環繞音效的體驗,對於本身辨識能力較差的頻率音訊也可以獲得聽力補償,更進一步優化聆聽的感受。特別一提的是,補償具有多種方法,本實施例中例如是補償為預設值,但本發明並不以此為限,本領域通常知識者也可依照設計或是使用者聽力特性而補償為其他的值。Furthermore, the audio processing device 200 is, for example, a two-channel earphone, and the stereo separation unit 201, the equalizer 202, the channel separation unit 203, the audio computing unit 204, and the audio synthesis unit 205 are independent or integrated in the earphone. components, circuits or chips. The audio processing device 200 further includes an audio playback unit and a comparison unit (not shown in the figure), the audio playback unit will play a plurality of audios with different frequencies to the user, and the user will give feedback in response to these audios with different frequencies after listening. This produces a number of frequency response values that can represent the hearing personality of the user. The comparing unit then compares these frequency response values with a preset value to generate at least one frequency response difference. The computing unit 204 can compensate the corresponding audio according to the frequency response difference. In this way, the user can not only have the experience of surround sound effect when listening, but also can obtain hearing compensation for the frequency audio with poor identification ability, further optimizing listening feelings. In particular, there are many methods of compensation. In this embodiment, for example, the compensation is a preset value, but the present invention is not limited thereto. Those skilled in the art can also compensate according to the design or the hearing characteristics of the user. for other values.

更具體而言,音訊運算單元204會對所接收到的中央左聲道音訊Cent_L、中央右聲道音訊Cent_R、側邊左聲道音訊Side_L以及側邊右聲道音訊Side_R進行濾波處理,在本實施例中,例如是高通濾波處理,藉此將人耳聽力感受度相對來說較不敏感的波段濾除,保留人耳聽力感受度較敏感的波段,但本發明並不以此為限,本領域通常知識者也可以根據設計或是使用者特性而進行不同波段的濾波處理。音訊運算單元204會將濾波處理後的音訊與預設的頭部轉移函數進行卷積(convolution)運算。關於卷積運算為本領域技術人員所知悉的,因此於此不在贅述。透過卷積運算之後,便能將音訊虛擬至相對於使用者而言的各種方位,藉此產生環繞音效的聆聽體驗。特別一提的是,音訊運算單元204還可以將卷積運算後的音訊乘以預設參數,此預設參數可以是任意的值,其可根據需求來優化音訊的強度。More specifically, the audio computing unit 204 performs filtering processing on the received center left channel audio Cent_L, center right channel audio Cent_R, side left channel audio Side_L, and side right channel audio Side_R. In the embodiment, for example, high-pass filtering is used to filter out bands that are relatively insensitive to human hearing sensitivity and retain bands that are more sensitive to human hearing sensitivity, but the present invention is not limited thereto. Those skilled in the art can also perform filtering in different bands according to design or user characteristics. The audio computing unit 204 performs a convolution operation on the filtered audio and a preset head transfer function. The convolution operation is known to those skilled in the art, so details are not repeated here. After the convolution operation, the audio can be virtualized to various directions relative to the user, thereby generating a listening experience of surround sound. In particular, the audio computing unit 204 can also multiply the convolved audio by a preset parameter. The preset parameter can be any value, which can optimize the intensity of the audio according to requirements.

此外,音訊合成單元205還可以接收未經頭部轉移函數處理的中央左聲道音訊Cent_L以及中央右聲道音訊Cent_R,並將所接收到未經頭部函數轉移處理的音訊與經過頭部轉移函數處理的音訊合成在一起,也就是將人耳聽力感受度較不敏感的波段與較敏感的波段合成在一起,如此一來,合成後的音訊便可涵蓋更寬的波段,進而增加最後所輸出音訊的豐富度。In addition, the audio synthesis unit 205 can also receive the center left channel audio Cent_L and the center right channel audio Cent_R without the head transfer function processing, and combine the received audio without the head function transfer processing with the head transfer function The audio processed by the function is synthesized together, that is, the band that is less sensitive to human hearing sensitivity is synthesized with the band that is more sensitive. In this way, the synthesized audio can cover a wider band, thereby increasing the final The richness of the output audio.

綜上所述,本發明提供的音訊處理方法以及音訊處理裝置是藉由音訊分離以及頭部轉移函數的運算而將原本左右聲道的音訊模擬為多聲道的音訊,且可進一步調整模擬後之音訊相對於使用者的來源方位,且能更進一步對使用者的聽力特性來補償對應的音訊頻率,因此能夠讓使用者感受到環繞音效以及聽力補償的優化感受。To sum up, the audio processing method and audio processing device provided by the present invention simulate the original left and right channel audio into multi-channel audio through audio separation and head transfer function calculation, and can further adjust the simulated The audio is relative to the source direction of the user, and can further compensate the corresponding audio frequency for the user's hearing characteristics, so that the user can experience the optimized feeling of surround sound effect and hearing compensation.

101-105:步驟 200:音訊處理裝置 201:立體聲分離單元 202:等化器 203:聲道分離單元 204:音訊運算單元 205:音訊合成單元 300:使用者 301-304:音源位置 SA:立體聲音訊 L:左聲道音訊 R:右聲道音訊 L_Eq:低音強化之左聲道音訊 R_Eq:低音強化之右聲道音訊 Cent_L:中央左聲道音訊 Cent_R:中央右聲道音訊 Side_L:側邊左聲道音訊 Side_R:側邊右聲道音訊 Cent_LH:頭部轉移函數處理後之中央左聲道音訊 Cent_RH:頭部轉移函數處理後之中央右聲道音訊 Side_LH:頭部轉移函數處理後之側邊左聲道音訊 Side_RH:頭部轉移函數處理後之側邊右聲道音訊 101-105: Steps 200: Audio processing device 201: Stereo separation unit 202: equalizer 203: channel separation unit 204: Audio computing unit 205: Audio synthesis unit 300: user 301-304: Sound source location SA: stereo audio L: left channel audio R: right channel audio L_Eq: left channel audio with bass enhancement R_Eq: Right channel audio with bass enhancement Cent_L: Center left channel audio Cent_R: Center right channel audio Side_L: Side left channel audio Side_R: Side right channel audio Cent_LH: Center left channel audio after head transfer function processing Cent_RH: Center right channel audio after head transfer function processing Side_LH: Side left channel audio after head transfer function processing Side_RH: Side right channel audio after head transfer function processing

圖1係繪示依據本發明一實施例之音訊處理方法的流程圖。FIG. 1 is a flowchart illustrating an audio processing method according to an embodiment of the present invention.

圖2係繪示依據本發明一實施例之音訊處理裝置的方塊圖。FIG. 2 is a block diagram of an audio processing device according to an embodiment of the present invention.

圖3係繪示依據本發明一實施例之音訊來源分佈的示意圖。FIG. 3 is a schematic diagram illustrating the distribution of audio sources according to an embodiment of the present invention.

none

104~105:步驟 104~105: Steps

Claims (10)

一種音訊處理方法,包括: 將一左聲道音訊分離為一中央左聲道音訊以及一側邊左聲道音訊; 將一右聲道音訊分離為一中央右聲道音訊以及一側邊右聲道音訊; 將該中央左聲道音訊以及該中央右聲道音訊進行一中央頭部轉移函數處理,以將該中央左聲道音訊以及該中央右聲道音訊模擬為相對於一使用者之一第一音源位置以及一第二音源位置; 將該側邊左聲道音訊以及該側邊右聲道音訊進行一側邊頭部轉移函數處理,以將該側邊左聲道音訊以及該側邊右聲道音訊模擬為相對於該使用者之一第三音源位置以及一第四音源位置;以及 將該中央頭部函數轉移以及該側邊頭部轉移函數所處理後之音訊根據該使用者之一聽力特性進行一頻率補償以合成為一雙聲道音訊。 An audio processing method, comprising: Split a left channel audio into a center left channel audio and a side left channel audio; Splitting a right channel audio into a center right channel audio and side right channel audio; subjecting the center left channel audio and the center right channel audio to a center head transfer function to simulate the center left channel audio and the center right channel audio as relative to a first audio source of a user position and a second sound source position; subjecting the side left channel audio and the side right channel audio to a side head transfer function to simulate the side left channel audio and the side right channel audio as relative to the user a third sound source position and a fourth sound source position; and The audio processed by the central head function transfer and the side head transfer function is subjected to frequency compensation according to the hearing characteristic of the user to synthesize a binaural audio. 根據申請專利範圍第1項之音訊處理方法,其中,根據該使用者之該聽力特性進行該頻率補償之步驟包括: 播放多個具有不同頻率之音訊給該使用者; 因應該些具有不同頻率之音訊產生多個頻率響應值以獲得該使用者之該聽力特性; 比較該些頻率響應值與至少一預設值以產生至少一頻率響應差值;以及 根據該至少一頻率響應差值而補償該些具有不同頻率之音訊。 According to the audio processing method of claim 1, the step of performing the frequency compensation according to the hearing characteristic of the user includes: Play multiple tones with different frequencies to the user; Generate multiple frequency response values in response to the audio with different frequencies to obtain the hearing characteristics of the user; comparing the frequency response values with at least one preset value to generate at least one frequency response difference; and The audio signals with different frequencies are compensated according to the at least one frequency response difference. 根據申請專利範圍第2項之音訊處理方法,其中,補償該些具有不同頻率之音訊之步驟,係將該些具有不同頻率之音訊所對應之頻率響應值補償為該預設值。According to the audio processing method of item 2 of the scope of the patent application, the step of compensating the audio with different frequencies is compensating the frequency response values corresponding to the audio with different frequencies to the preset value. 根據申請專利範圍第1項之音訊處理方法,其中,該中央頭部轉移函數以及該側邊頭部轉移函數所處理之音訊係透過一濾波以及一卷積運算。According to the audio processing method of claim 1, the audio processed by the central head transfer function and the side head transfer function is passed through a filter and a convolution operation. 根據申請專利範圍第4項之音訊處理方法,其中,透過該濾波以及該卷積運算後之音訊更進一步乘以一預設參數。According to the audio processing method of item 4 of the scope of the patent application, the audio after the filtering and the convolution operation is further multiplied by a preset parameter. 一種音訊處理裝置,包括: 一聲道分離單元,接收一左聲道音訊以及一右聲道音訊,將該左聲道音訊分離為一中央左聲道音訊以及一側邊左聲道音訊,並將該右聲道音訊分離為一中央右聲道音訊以及一側邊右聲道音訊; 一音訊運算單元,接收並將該中央左聲道音訊以及該中央右聲道音訊進行一中央頭部轉移函數處理,以將該中央左聲道音訊以及該中央右聲道音訊模擬為相對於一使用者之一第一音源位置以及一第二音源位置,且接收並將該側邊左聲道音訊以及該側邊右聲道音訊進行一側邊頭部轉移函數處理,以將該側邊左聲道音訊以及該側邊右聲道音訊模擬為相對於該使用者之一第三音源位置以及一第四音源位置;以及 一音訊合成單元,接收並將該中央頭部函數轉移以及該側邊頭部轉移函數所處理後之音訊根據該使用者之一聽力特性進行一頻率補償以合成為一雙聲道音訊。 An audio processing device, comprising: A channel separation unit, receiving a left channel audio and a right channel audio, separating the left channel audio into a central left channel audio and a side left channel audio, and separating the right channel audio is a center right channel audio and a side right channel audio; An audio computing unit, receiving and performing a center head transfer function processing on the center left channel audio and the center right channel audio, so as to simulate the center left channel audio and the center right channel audio relative to a A first sound source position and a second sound source position of the user, and receive and process the side left channel audio and the side right channel audio with a side head transfer function, so that the side left channel audio and the side right channel audio are modeled as a third source position and a fourth source position relative to the user; and An audio synthesis unit receives and performs frequency compensation on the audio processed by the central head function transfer and the side head transfer function according to the hearing characteristics of the user to synthesize a binaural audio. 根據申請專利範圍第6項之音訊處理裝置,其中該音訊運算單元具有一濾波器,將該中央左聲道音訊、該中央右聲道音訊、該側邊左聲道音訊以及該側邊右聲道音訊進行一濾波處理。The audio processing device according to item 6 of the scope of the patent application, wherein the audio computing unit has a filter, the center left channel audio, the center right channel audio, the side left channel audio and the side right sound A filtering process is performed on the channel audio. 根據申請專利範圍第7項之音訊處理裝置,其中該音訊運算單元對該些濾波後之該中央左聲道音訊以及該中央右聲道音訊與該中央頭部轉移函數進行一卷積運算。In the audio processing device according to claim 7 of the scope of the patent application, the audio computing unit performs a convolution operation on the filtered center left channel audio and the center right channel audio with the center head transfer function. 根據申請專利範圍第7項之音訊處理裝置,其中該音訊運算單元對該些濾波後之該側邊左聲道音訊以及該側邊右聲道音訊與該側邊頭部轉移函數進行一卷積運算。The audio processing device according to item 7 of the scope of the patent application, wherein the audio computing unit performs a convolution on the filtered side left channel audio and the side right channel audio with the side head transfer function operation. 根據申請專利範圍第7項之音訊處理裝置,更包括: 一音訊播放單元,播放多個具有不同頻率之音訊給該使用者,該音訊運算單元因應該些具有不同頻率之音訊產生多個頻率響應值以獲得該使用者之該聽力特性;以及 一比較單元,比較該些頻率響應值與至少一預設值以產生至少一頻率響應差值,該音訊運算單元根據該至少一頻率響應差值而補償該些具有不同頻率之音訊。 The audio processing device according to item 7 of the scope of the patent application further includes: An audio playback unit, which plays a plurality of audios with different frequencies to the user, and the audio computing unit generates a plurality of frequency response values in response to the audios with different frequencies to obtain the hearing characteristics of the user; and A comparison unit compares the frequency response values with at least one preset value to generate at least one frequency response difference, and the audio computing unit compensates the audio signals with different frequencies according to the at least one frequency response difference.
TW110113007A 2021-04-10 2021-04-10 Audio signal processing method and audio signal processing apparatus TWI839606B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
TW110113007A TWI839606B (en) 2021-04-10 Audio signal processing method and audio signal processing apparatus
JP2022063937A JP7332745B2 (en) 2021-04-10 2022-04-07 Speech processing method and speech processing device
US17/716,339 US20220329957A1 (en) 2021-04-10 2022-04-08 Audio signal processing method and audio signal processing apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110113007A TWI839606B (en) 2021-04-10 Audio signal processing method and audio signal processing apparatus

Publications (2)

Publication Number Publication Date
TW202241148A true TW202241148A (en) 2022-10-16
TWI839606B TWI839606B (en) 2024-04-21

Family

ID=

Also Published As

Publication number Publication date
JP2022161881A (en) 2022-10-21
US20220329957A1 (en) 2022-10-13
JP7332745B2 (en) 2023-08-23

Similar Documents

Publication Publication Date Title
TWI489887B (en) Virtual audio processing for loudspeaker or headphone playback
KR101368859B1 (en) Method and apparatus for reproducing a virtual sound of two channels based on individual auditory characteristic
CN113170271B (en) Method and apparatus for processing stereo signals
CN111556425B (en) Tone equalization method for virtual sound reproduction of loudspeaker
JP2006033847A (en) Sound-reproducing apparatus for providing optimum virtual sound source, and sound reproducing method
JP2020508590A (en) Apparatus and method for downmixing multi-channel audio signals
JP2003523675A (en) Multi-channel sound reproduction system for stereophonic sound signals
JP5338053B2 (en) Wavefront synthesis signal conversion apparatus and wavefront synthesis signal conversion method
US20200059750A1 (en) Sound spatialization method
JP2007202021A (en) Audio signal processing apparatus, audio signal processing system, and program
TW202241148A (en) Audio signal processing method and audio signal processing apparatus
Drossos et al. Stereo goes mobile: Spatial enhancement for short-distance loudspeaker setups
CN115278506A (en) Audio processing method and audio processing device
CN113645531A (en) Earphone virtual space sound playback method and device, storage medium and earphone
US20140056429A1 (en) Spatialization using stereo decorrelation
JP2004527961A (en) Method and circuit for listening to sound recording media with a headset
US11924628B1 (en) Virtual surround sound process for loudspeaker systems
US20240056735A1 (en) Stereo headphone psychoacoustic sound localization system and method for reconstructing stereo psychoacoustic sound signals using same
JP5590169B2 (en) Wavefront synthesis signal conversion apparatus and wavefront synthesis signal conversion method
JP2023545547A (en) Sound reproduction by multi-order HRTF between the left and right ears
Maher Single-ended spatial enhancement using a cross-coupled lattice equalizer
WO2024081957A1 (en) Binaural externalization processing
Aarts et al. NAG