EP4007310A1 - Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften - Google Patents

Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften Download PDF

Info

Publication number
EP4007310A1
EP4007310A1 EP20210629.0A EP20210629A EP4007310A1 EP 4007310 A1 EP4007310 A1 EP 4007310A1 EP 20210629 A EP20210629 A EP 20210629A EP 4007310 A1 EP4007310 A1 EP 4007310A1
Authority
EP
European Patent Office
Prior art keywords
rir
audio signal
stereo
samples
mono
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP20210629.0A
Other languages
English (en)
French (fr)
Inventor
Eftychios PAPOULIS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ask Industries GmbH
Original Assignee
Ask Industries GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ask Industries GmbH filed Critical Ask Industries GmbH
Priority to EP20210629.0A priority Critical patent/EP4007310A1/de
Publication of EP4007310A1 publication Critical patent/EP4007310A1/de
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/305Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Definitions

  • the invention relates to a method of processing an input audio signal for generating a stereo output audio signal having specific reverberation characteristics.
  • Audio signal processing generally, comprises processing of input audio signals, i.e. audio signals which are input to a digital signal processing unit ("DSP unit"), having specific audio signal properties, so as to generate output audio signals, i.e. audio signals which are output of the audio signal processing unit, having specific audio signal properties, at least partly different from the input audio signal properties.
  • audio signal processing may comprise modifying one or more properties of an input audio signal so as to obtain an output audio signal having one or more properties which are modified relative to the respective properties of the input audio signal.
  • a specific aim in audio signal processing comprises processing of an input audio signal for generating an output audio signal having specific reverberation characteristics, such as the specific reverberation characteristics of a specific acoustic environment, e.g. a specific room or venue.
  • a stereophonic real-time convolution-based artificial reverberation of a stereo input audio signal using pre-recorded stereo room impulse response (“RIR”) data from a real-life room, often requires more computing operations, such as Floating-Point Operations Per Input Sample (“FLOPIS”), and memory than available in a common DSP-unit, such as in a common DSP-unit of a vehicle.
  • RIR stereo room impulse response
  • FLOPIS Floating-Point Operations Per Input Sample
  • the required number of FLOPIS and memory size typically, depend on the size of the room and the sampling rate used during the recording of the RIR data and the play-back of the audio signal to be reverberated.
  • the length of the RIR data turns out to be very large.
  • the stereo RIR model has 2 x 96 x 10 3 samples.
  • a direct convolution in the time-domain would require 2 x 192 x 10 3 FLOPIS and 2 x 192 x 10 3 memory locations to store the 2 x 96 x 10 3 RIR samples plus the 2 x 92 x 10 3 most recent samples of the input signal.
  • the factor of two accounts for the stereo property of the audio signals.
  • the figure of 192 x 10 3 comes from the fact that each multiply-and-add operation counts as two FLOPIS.
  • UPOLS Uniform Partition Overlap-Save
  • the object of the present invention to provide an improved method of processing an input audio signal for generating a stereo output audio signal with specific reverberation characteristics, particularly specific reverberation characteristics of a specific acoustic environment, particularly with respect to the computing power and memory requirements for digital audio signal processing and with respect to the ease of implementation.
  • the object is achieved by a method of processing an input audio signal for generating a stereo output audio signal having specific reverberation characteristics according to Claim 1.
  • the Claims depending on Claim 1 refer to possible embodiments of the method of Claim 1.
  • a first aspect of the invention refers to a method of processing an input audio signal for generating a stereo output audio signal with specific reverberation characteristics.
  • the method thus, enables generating a stereo output audio signal having specific reverberation characteristics, i.e. specific reverberation characteristics, such as the specific reverberation characteristics of a specific acoustic environment, e.g. a specific room of a specific building, by processing of an input audio signal.
  • the method can be implemented by a hardware- and/or software-embodied digital signal processing unit ("DSP-unit") which is configured to perform the method.
  • the DSP unit may comprise at least one processing unit, such as a processor, and at least one memory unit.
  • the DSP unit may form part of an apparatus for processing an input audio signal so as to generate a stereo output audio signal.
  • a respective apparatus can form a vehicle audio system or a car audio system, i.e. an audio system that is to be installed or is installed in a vehicle or a car, respectively.
  • a respective apparatus can form part of a respective vehicle audio system or car audio system, respectively.
  • an input audio signal is provided.
  • the input audio signal is a mono input audio signal ("mono signal") or a stereo input audio signal ("stereo signal").
  • the input audio signal is the signal which is to be processed in accordance with the method so as to generate a stereo output audio signal of a specific reverberation.
  • the generated output audio signal is always a stereo audio signal, regardless of whether the input audio signal is mono or stereo.
  • the first step of the method can be implemented by a hardware- and/or software-embodied input audio signal providing unit which is configured to provide an input audio signal that is mono or stereo.
  • An input audio signal can be a music- and/or speech-signal, i.e. a signal comprising music- and/or speech-content.
  • pre-recorded stereo data of the Room-Impulse-Response ("RIR") of a specific acoustic environment such as a specific building or venue
  • the pre-recorded RIR data respectively comprise a defined number of RIR samples, in particular left channel samples and right channel samples.
  • the RIR data are or comprise thus, typically stereo data.
  • the RIR data comprise an equal number of left channel samples and right channel samples.
  • the RIR data can be obtained through known methods for recording the RIR of acoustic environments. The actual recording of a respective acoustic environment is typically, not a step of the method.
  • the second step of the method can be implemented by a hardware- and/or software-embodied RIR data providing unit which is configured to provide pre-recorded RIR data of a specific acoustic environment.
  • a first number of RIR samples representing a stereo part of the RIR represented by the RIR data and a second number of RIR samples representing a mono part of the RIR represented by the RIR data is determined.
  • the stereo part of the RIR comprises a number of left channel RIR samples for the left output channel and an equal number of right channel RIR samples for the right output channel.
  • the mono part of the RIR represented by the RIR data comprises a number of RIR samples to be used for both the left and the right output channel.
  • the third step of the method thus, comprises the determination of a stereo part of the RIR which is or can be represented by a first number of RIR samples and a mono part of the RIR which is or can be represented by a second number of RIR samples.
  • the stereo part of the RIR is the first part of the RIR and is followed by the mono part of the RIR which is the second part of the RIR.
  • the duration of the stereo part and of the mono part of the RIR typically, add up to the total duration of the RIR.
  • the third step of the method can be implemented by a hardware- and/or software-embodied determining unit which is configured to determine a first number of RIR samples representing a stereo part of the RIR and a second number of RIR samples representing a mono part of the RIR, whereby the stereo part of the RIR comprises a number of left channel RIR samples for the left output channel and an equal number of right channel RIR samples for the right output channel.
  • the mono part of the RIR comprises a number of RIR samples for both the left and the right output channel.
  • the samples of the RIR are subdivided into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR.
  • the first group of RIR samples typically comprises the first number of RIR samples representing the stereo part of the RIR and the second group of RIR samples typically comprises the second number of RIR samples representing the mono part of the RIR.
  • the first group of RIR samples typically, represents a (distinct) early reflections part ("ERP") of the RIR and the second group of RIR samples typically, represents a (distinct) late reflections part ("LRP") of the RIR.
  • the first group of RIR samples can comprise a period ranging between 1 ms and 150 ms, particularly between 10 ms and 100 ms, of the initial duration of the RIR.
  • the second group of RIR samples comprises the remaining duration of the RIR.
  • the fourth step of the method can be implemented by a hardware- and/or software-embodied subdividing unit which is configured to subdivide the samples of the RIR into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR.
  • the third and fourth step of the method can be combined in one step which comprises both the determining aspect as specified above in context with the third step and the subdividing aspect as specified above in context with the fourth step.
  • a hardware- and/or software-embodied determining and subdividing unit which is configured to determine a first number of RIR samples representing a stereo part of the RIR and a second number of RIR samples representing a mono part of the RIR, whereby the stereo part of the RIR comprises a number of left channel RIR samples for the left output channel and an equal number of right channel RIR samples for the right output channel and configured to subdivide the samples of the RIR into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR, can be used when the third and fourth step are combined.
  • a first signal processing rule for processing, particularly by convolving, the input audio signal with the left channel samples of the stereo part of the RIR and for processing, particularly by convolving, the input audio signal with the right channel samples of the stereo part of the RIR is applied or implemented.
  • a processed left channel audio signal part and a processed right channel audio signal part, representing the reverberation of the input audio signal - which can be a mono input audio signal or a stereo input audio signal - from the first group of samples of the RIR is obtained.
  • the input audio signal is processed, i.e.
  • the input audio signal is processed, i.e. typically convolved, with the right channel samples of the stereo part of the RIR, whereby a processed, i.e. typically convolved, right channel audio signal part is obtained.
  • the respective processed left and right channel audio signal parts represent the reverberation, i.e. specifically the artificially generated reverberation, of the input audio signal from the first group of samples of the RIR.
  • the fifth step comprises applying or implementing a second signal processing rule for processing, particularly by convolving, the input audio signal with the mono part of the RIR data.
  • a second signal processing rule for processing for processing, particularly by convolving, the input audio signal with the mono part of the RIR data.
  • a mono input audio signal is processed so as to obtain a processed mono audio signal part representing the reverberation of the mono input audio signal from the second group of samples of the RIR (the mono part of the RIR), and a stereo input audio signal is processed so as to obtain a processed mono audio signal part representing the reverberation of both the left and right channel of the stereo input audio signal from the second group of samples of the RIR (the mono part of the RIR).
  • the signal resulting from the processing of the input audio signal with the mono part of the RIR, namely with the second group of samples of the RIR is always a mono signal, regardless if the input audio signal is a mono or a stereo signal.
  • the mono input audio signal, or the stereo input audio signal after being converted into a mono input audio signal is processed with the second group of samples of the RIR, namely with the mono part of the RIR, to generate a processed mono audio signal, representing the reverberation of the input audio signal with the second group of samples of the RIR, namely with the mono part of the RIR.
  • the fifth step of the method can be implemented by a hardware- and/or software-embodied signal processing structure or unit which is configured to apply or implement a first signal processing rule for processing, particularly by convolving, the input audio signal with the left channel samples of the stereo part of the RIR data and for processing, particularly by convolving, the input audio signal with the right channel samples of the stereo part of the RIR data, thereby obtaining a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the mono or stereo input audio signal from the first group of samples of the RIR.
  • a first signal processing rule for processing particularly by convolving, the input audio signal with the left channel samples of the stereo part of the RIR data and for processing, particularly by convolving, the input audio signal with the right channel samples of the stereo part of the RIR data, thereby obtaining a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the mono or stereo input audio signal from the first group of samples of the RIR.
  • the hardware- and/or software-embodied signal processing structure or unit is configured to apply or implement a second signal processing rule for processing, particularly by convolving, the input audio signal with the mono part of the RIR data, thereby obtaining, a processed mono audio signal part representing the reverberation of the mono or stereo input audio signal from the second group of samples of the RIR.
  • the left channel audio signal part resulting from the processing of the input audio signal with the left channel samples of the stereo part of the RIR is mixed with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, whereby a left channel output signal is generated.
  • the right channel audio signal part resulting from the processing of the input audio signal with the right channel samples of the stereo part of the RIR is mixed with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, whereby a right channel output signal is generated.
  • a left channel output signal is generated; and by mixing the right channel audio signal part with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, a right channel output signal is generated.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation characteristics.
  • the sixth step of the method can be implemented by a hardware- and/or software-embodied mixing unit which is configured to mix the left channel audio signal part resulting from the processing of the input audio signal with the left channel samples of the stereo part of the RIR data with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR data, thereby generating a left channel output signal; and to mix the right channel audio signal part resulting from the processing of the input audio signal with the right channel samples of the stereo part of the RIR data with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR data, thereby generating a right channel output signal.
  • the method thus, allows for implementing a Hybrid Mono-Stereo Uniform Partition Overlap-Save (“HUPOLS”) reverberation principle.
  • the HUPOLS principle is an efficient method of convolving an input audio signal - which can be a mono or a stereo audio signal - with a real-life room stereo impulse response of large length.
  • the HUPOLS principle is based on the conventional UPOLS method and the generic principle of performing concurrent stereo convolution of the input audio signal with the early stereo part of the RIR and mono convolution of the input audio signal with the late mono part of the RIR.
  • HUPOLS thereby, significantly reduces the number of FLOPIS and the amount of memory needed, without any noticeable degradation of the stereo perception of the stereo output audio signal, by exploiting the different effect and importance that the early and the late reflections of a real-life room have on the reverberated stereo output audio signal.
  • HUPOLS can exploit the structure of a UPOLS building block to economize on the Discrete Fourier Transform operations ("DFT” operations) and Inverse Discrete Fourier Transform operations (“IDFT” operations) and to eliminate the delay needed for modelling the late mono part of the RIR.
  • DFT Discrete Fourier Transform operations
  • IDFT Inverse Discrete Fourier Transform operations
  • the method makes use of the insight of taking advantage of the differing subjective perceptions caused by the early and late parts of a pre-recorded RIR.
  • the method particularly, uses a stereo model for the early-reflections-part of the RIR to reproduce the reverberation caused by the early reflections of the room. This early part determines the spatial impression, the understanding of our position, and the sound source position within the room.
  • the method particularly, uses a mono model to reproduce the reverberation caused by the late reflections of the room. This late part determines the perception of the room size and geometry.
  • the method achieves a noticeable reduction in the required computing resources, since the processing of stereo audio signals requires twice the recourses as the processing of mono audio signals.
  • the input audio signal applied to the stereo model of the early-reflections-part of the RIR can be mono or stereo, whereas the generated output signal is always a stereo signal. If the duration of the early and the late part of the RIR is properly determined, the method allows for reducing the required computing resources at no expense in the quality of the reverberated stereo audio signal.
  • step e) can comprise applying or implementing a or the first signal processing rule for processing, particularly by convolving, the mono input audio signal with the left channel samples of the stereo part of the RIR and for processing, particularly by convolving, the mono input audio signal with the right channel samples of the stereo part of the RIR.
  • a processed left audio signal part and a processed right audio signal part representing the reverberation of the mono input audio signal from the first group of samples of the RIR data can be obtained.
  • step e) can comprise applying or implementing a or the second signal processing rule for processing, particularly by convolving, the mono input audio signal with the mono part of the RIR data, thereby obtaining a processed mono audio signal part representing the reverberation of the mono input audio signal from the second group of samples of the RIR data.
  • step f) can comprise mixing the processed mono audio signal part with the processed left audio signal part, thereby generating a or the left channel output signal, and mixing the processed mono audio signal part with the processed right audio signal part, thereby generating a or the right channel output signal.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation.
  • step e) can comprise applying or implementing a or the first signal processing rule for processing, particularly by convolving, the left channel of the stereo input audio signal with the left channel samples of the stereo part of the RIR data and for processing the right channel of the stereo input audio signal with the right channel samples of the stereo part of the RIR data, thereby obtaining a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the left and right channel of the stereo input audio signal from the left channel samples and from the right channel samples of the stereo part of the RIR.
  • step e) can comprise applying or implementing a or the second signal processing rule for processing, particularly by convolving, the mono version of the stereo input audio signal with the mono part of the RIR data, thereby obtaining a processed mono audio signal part, representing the reverberation of the mono version of the stereo input audio signal from the second group of samples of the RIR.
  • step f) can comprise mixing the left channel audio signal part resulting from the processing of the input signal with the left channel samples of the stereo part of the RIR data with the mono audio signal part resulting from the processing of the input signal with the mono part of the RIR data, thereby generating a reverberated left channel output signal; and mixing the right channel audio signal part resulting from the processing of the input signal with the right channel samples of the stereo part of the RIR data with the mono audio signal part resulting from the processing of the input signal with the mono part of the RIR data, thereby generating a reverberated right channel output signal.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation.
  • the method can comprise outputting the left channel output signal via a left output audio channel and outputting the right channel output signal via a right output audio channel.
  • Respective left and right output audio channels can be embodied through loudspeakers of an audio system, i.e. particularly a vehicle audio system or a car audio system, i.e. an audio system that is to be installed or is installed in a vehicle or a car.
  • the left and right channel of the stereo input audio signal can be pre-processed by applying a pre-processing rule for converting stereo input audio signal to mono before applying the second signal processing rule.
  • a respective pre-processing rule can be embodied via a hardware- and/or software embodied pre-processing unit which is configured to pre-process the left and right channel of stereo input audio signal for converting stereo input audio signal to mono before applying the second signal processing rule.
  • a respective pre-processing can be beneficial for the (subsequent) application or implementation of the second signal processing rule, e.g. due to reduced computational efforts for carrying out the second signal processing rule.
  • a respective pre-processing rule for converting the stereo input audio signal to mono can comprise forming the arithmetic mean between the left channel samples and the right channel samples of the stereo input audio signal.
  • a respective pre-processing rule for converting the stereo input audio signal to mono can comprise summing the left channel samples with the right channel samples and for each pair of samples that have been added together dividing the result by two.
  • the summing can particularly, comprise adding of corresponding blocks of the left and the right channel of the or a respective stereo input audio signal.
  • the summing typically, further comprises or can be followed by dividing the result of the addition by two.
  • the method typically, comprises applying a time-delay filter before application of the second signal processing rule for processing the input audio signal with the mono part of the RIR.
  • the time-delay filter can be applied by a hardware- and/or software-embodied time-delay filter unit which is configured to apply a time-delay filter before application of the second signal processing rule.
  • the time delay introduced by the time-delay filter is typically, equal to the time duration of the stereo part of the RIR data.
  • the length of the delay filter typically, corresponds to the length of the stereo part of the RIR.
  • the first and second signal processing rule can each comprise at least one filtering operation, particularly at least one convolving operation.
  • the first signal processing rule typically, comprises (exactly) two filtering operations and the second signal processing rule typically, comprises (exactly) one filtering operation.
  • the hardware- and/or software-embodied signal processing unit or structure for implementing the first and second signal processing rule can be embodied as filtering units, particularly as convolution units, configured to comprise at least one filtering operation, particularly at least one convolving operation.
  • the determination of the first number of RIR samples representing the stereo part of the RIR and the second number of RIR samples representing the mono part of the RIR can be done iteratively.
  • the determination of the first number of RIR samples representing the stereo part of the RIR and the second number of RIR samples representing the mono part of the RIR can be done experimentally, e.g. using a suitable hardware- and software-embodied signal processing structure. This determination can be an iterative process and can require the attention of a user, i.e. particularly an audio engineer.
  • a signal processing unit or structure can be used for applying or implementing both the first and the second signal processing rule.
  • the signal processing structure can comprise four hardware- and/or software-embodied signal processing blocks, particularly built as or comprising Discrete Fourier Transform blocks, particularly Fast Discrete Fourier Transform blocks. If the input audio signal is mono, the signal processing structure can comprise three hardware- and/or software-embodied signal processing blocks, particularly built as or comprising Discrete Fourier Transform blocks, particularly Fast Discrete Fourier Transform blocks.
  • the signal processing structure used for implementing the method or the respective steps of the method can thus, have a relatively simple and/or effective configuration.
  • the signal processing structure can comprise one or more first signal processing blocks for implementing the first signal processing rule, and one or more second signal processing blocks for implementing the second signal processing rule.
  • a second aspect of the invention refers to a signal processing device, comprising means, particularly a signal processing structure, for carrying out the method of the first aspect of the invention.
  • means particularly a signal processing structure
  • a third aspect of the invention refers to a computer program product comprising instructions which, when the program is executed by a computer, particularly a DSP unit, cause the computer to carry out the method of the first aspect of the invention.
  • a computer particularly a DSP unit
  • a fourth aspect of the invention refers to a computer-readable data carrier having stored thereon the computer program product of the third aspect.
  • all remarks regarding the method of the first aspect also apply to the computer-readable data carrier.
  • a fifth aspect of the invention refers to an audio processing apparatus for processing an input audio signal, comprising a signal processing device according to the second aspect.
  • a sixth aspect of the invention refers to a vehicle, particularly a car, comprising an audio processing apparatus for processing an input audio signal according to the fifth aspect.
  • Fig.1 shows a principle drawing of a digital signal processing structure 100 for implementing a method of processing a stereo input audio signal for generating a stereo output audio signal of a specific reverberation according to an exemplary embodiment of the invention.
  • the method enables generating a stereo output audio signal having a specific reverberation, i.e. specific reverberation characteristics, such as the specific reverberation characteristics of a specific acoustic environment, e.g. a specific room of a specific building, by processing of an input audio signal.
  • a specific reverberation i.e. specific reverberation characteristics, such as the specific reverberation characteristics of a specific acoustic environment, e.g. a specific room of a specific building
  • an input audio signal is provided.
  • the input audio signal is a mono input audio signal ("mono signal”) or a stereo input audio signal (“stereo signal”).
  • the first step of the method can be implemented by a hardware- and/or software-embodied input audio signal providing unit which is configured to provide an input audio signal that is mono or stereo.
  • An input audio signal can be a music- and/or speech-signal, i.e. a signal comprising music- and/or speech-content.
  • pre-recorded stereo data of a Room-Impulse-Response ("RIR") of a specific acoustic environment such as a specific building or venue
  • the pre-recorded RIR data respectively comprise a defined number of RIR samples, i.e. left channel samples and right channel samples.
  • the pre-recorded RIR data are or comprise stereo data.
  • the RIR data comprise an equal number of left channel samples and right channel samples.
  • the RIR data can be obtained through known methods for recording the RIR of acoustic environments. The actual recording of a respective acoustic environment is typically, not a step of the method.
  • the second step of the method can be implemented by a hardware- and/or software-embodied RIR data providing unit which is configured to provide pre-recorded RIR data of a specific acoustic environment.
  • a first number of RIR samples representing a stereo part of the RIR represented by the RIR data and a second number of RIR samples representing a mono part of the RIR represented by the RIR data is determined.
  • the stereo part of the RIR comprises a number of left channel samples for the left output channel and an equal number of right channel samples for the right output channel.
  • the mono part of the RIR represented by the RIR data comprises a number of RIR samples to be used for both the left and the right output channel.
  • the third step of the method thus, comprises the determination of a stereo part of the RIR which is or can be represented by a first number of RIR samples and a mono part of the RIR which is or can be represented by a second number of RIR samples.
  • the stereo part of the RIR is the first part of the RIR and is followed by the mono part of the RIR which is the second part of the RIR.
  • the duration of the stereo part and of the mono part of the RIR typically, add up to the total duration of the RIR.
  • the third step of the method can be implemented by a hardware- and/or software-embodied determining unit which is configured to determine a first number of RIR samples representing a stereo part of the RIR and a second number of RIR samples representing a mono part of the RIR, whereby the stereo part of the RIR comprises a number of left channel RIR samples for the left output channel and an equal number of right channel RIR samples for the right output channel.
  • the mono part of the RIR comprises a number of RIR samples for both the left and the right output channel.
  • the samples of the RIR are subdivided into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR.
  • the first group of RIR samples typically comprises the first number of RIR samples representing the stereo part of the RIR and the second group of RIR samples typically comprises the second number of RIR samples representing the mono part of the RIR.
  • the first group of RIR samples typically, represents a (distinct) early reflection part of the RIR ("ERP") and the second group of RIR samples typically, represents a (distinct) late reflection part of the RIR ("LRP").
  • the first group of RIR samples can comprise a period ranging between 1 ms and 150 ms, particularly between 10 ms and 100 ms, of the (initial) duration of the RIR.
  • the second group of RIR samples comprises the remaining duration of the RIR.
  • the fourth step of the method can be implemented by a hardware- and/or software-embodied subdividing unit which is configured to subdivide the samples of the RIR into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR.
  • the third and fourth step of the method can be combined in one step which comprises both the determining aspect as specified above in context with the third step and the subdividing aspect as specified above in context with the fourth step.
  • a hardware- and/or software-embodied determining and subdividing unit which is configured to determine a first number of RIR samples representing a stereo part of the RIR and a second number of RIR samples representing a mono part of the RIR, whereby the stereo part of the RIR comprises a number of left channel RIR samples for the left output channel and an equal number of right channel RIR samples for the right output channel and configured to subdivide the samples of the RIR into a first group of RIR samples representing the stereo part of the RIR and into a second group of RIR samples representing the mono part of the RIR, can be used when the third and fourth step are combined.
  • a first signal processing rule for processing, particularly by convolving, the input audio signal with the left channel samples of the stereo part of the RIR and for processing, particularly by convolving, the input audio signal with the right channel samples of the stereo part of the RIR is applied or implemented.
  • a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the input audio signal - which can be a mono input audio signal or a stereo input audio signal - from the first group of samples of the RIR is obtained.
  • the input audio signal is processed, i.e.
  • the input audio signal is processed, i.e. typically convolved, with the right channel samples of the stereo part of the RIR, whereby a processed, i.e. typically convolved, right channel audio signal part is obtained.
  • the respective processed left and right channel audio signal parts represent the reverberation, i.e. specifically the artificially generated reverberation, of the input audio signal from the first group of samples of the RIR.
  • the fifth step comprises applying or implementing a second signal processing rule for processing, particularly by convolving, the input audio signal with the mono part of the RIR.
  • a second signal processing rule for processing for processing, particularly by convolving, the input audio signal with the mono part of the RIR.
  • a mono input audio signal is processed so as to obtain a processed mono audio signal part representing the reverberation of the mono input audio signal from the second group of samples of the RIR (the mono part of the RIR), and a stereo input audio signal is processed so as to obtain a processed mono audio signal part representing the reverberation of both the left and right channel of the stereo input audio signal from the second group of samples of the RIR (the mono part of the RIR).
  • the signal resulting from the processing of the input audio signal with the mono part of the RIR, namely with the second group of samples of the RIR is always a mono signal, regardless if the input audio signal is a mono or a stereo signal.
  • the mono input audio signal, or the stereo input audio signal after being converted into a mono input audio signal is processed with the second group of samples of the RIR, namely with the mono part of the RIR, to generate a processed mono audio signal, representing the reverberation of the input audio signal with the second group of samples of the RIR, namely with the mono part of the RIR.
  • the fifth step of the method can be implemented by a hardware- and/or software-embodied signal processing structure or unit which is configured to apply a first signal processing rule for processing, particularly by convolving, the input audio signal with the left channel samples of the stereo part of the RIR data and for processing, particularly by convolving, the input audio signal with the right channel samples of the stereo part of the RIR data, thereby obtaining a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the mono or stereo input audio signal from the first group of samples of the RIR.
  • the hardware- and/or software-embodied signal processing structure or unit is configured to apply a second signal processing rule for processing, particularly by convolving, the input audio signal with the mono part of the RIR data, thereby obtaining a processed mono audio signal part representing the reverberation of the mono or stereo input audio signal from the second group of samples of the RIR.
  • the left channel audio signal part resulting from the processing of the input audio signal with the left channel samples of the stereo part of the RIR is mixed with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, whereby a left channel output signal is generated.
  • the right channel audio signal part resulting from the processing of the input audio signal with the right channel samples of the stereo part of the RIR is mixed with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, whereby a right channel output signal is generated.
  • a left channel output signal is generated; and by mixing the right channel audio signal part with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR, a right channel output signal is generated.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation characteristics.
  • the sixth step of the method can be implemented by a hardware- and/or software-embodied mixing unit which is configured to mix the left channel audio signal part resulting from the processing of the input audio signal with the left channel samples of the stereo part of the RIR data with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR data, thereby generating a left channel output signal; and to mix the right channel audio signal part resulting from the processing of the input audio signal with the right channel samples of the stereo part of the RIR data with the audio signal part resulting from the processing of the input audio signal with the mono part of the RIR data, thereby generating a right channel output signal.
  • step e) can comprise applying or implementing a or the first signal processing rule for processing, particularly by convolving, the mono input audio signal with the left channel samples of the stereo part of the RIR and for processing, particularly by convolving, the mono input audio signal with the right channel samples of the stereo part of the RIR.
  • a processed left audio signal part and a processed right audio signal part representing the reverberation of the mono input audio signal from the first group of samples of the RIR can be obtained.
  • step e) can comprise applying or implementing a or the second signal processing rule for processing, particularly by convolving, the mono input audio signal with the mono part of the RIR data, thereby obtaining a processed mono audio signal part representing the reverberation of the mono input audio signal from the second group of samples of the RIR data.
  • step f) can comprise mixing the processed mono audio signal part with the processed left audio signal part, thereby generating a or the left channel output signal, and mixing the processed mono audio signal part with the processed right audio signal part, thereby generating a or the right channel output signal.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation.
  • step e) can comprise applying or implementing a or the first signal processing rule for processing, particularly by convolving, the left channel of the stereo input audio signal with the left channel samples of the stereo part of the RIR data and for processing, particularly by convolving, the right channel of the stereo input audio signal with the right channel samples of the stereo part of the RIR data, thereby obtaining a processed left channel audio signal part and a processed right channel audio signal part representing the reverberation of the left and right channel of the stereo input audio signal from the left channel samples and from the right channel samples of the stereo part of the RIR.
  • step e) can comprise applying or implementing a or the second signal processing rule for processing, particularly by convolving, the mono version of the stereo input audio signal with the mono part of the RIR data, thereby obtaining a processed mono audio signal part representing the reverberation of the mono version of the stereo input audio signal from the second group of samples of the RIR.
  • step f) can comprise mixing the left channel audio signal part resulting from the processing of the input signal with the left channel samples of the stereo part of the RIR data with the mono audio signal part resulting from the processing of the input signal with the mono part of the RIR data, thereby generating a reverberated left channel output signal; and mixing the right channel audio signal part resulting from the processing of the input signal with the right channel samples of the stereo part of the RIR data with the mono audio signal part resulting from the processing of the input signal with the mono part of the RIR data, thereby generating a reverberated right channel output signal.
  • the generated left and right channel output signals build the stereo output audio signal having the specific reverberation.
  • the method can comprise outputting the left channel output signal via a left output audio channel and outputting the right channel output signal via a right output audio channel.
  • Respective left and right output audio channels can be embodied through loudspeakers of an audio system, i.e. particularly a vehicle audio system or a car audio system, i.e. an audio system that is to be installed or is installed in a vehicle or a car.
  • the left and right channel of the stereo input audio signal can be pre-processed by applying a pre-processing rule for converting stereo input audio signal to mono before applying the second signal processing rule.
  • a respective pre-processing rule can be embodied via a hardware- and/or software embodied pre-processing unit which is configured to pre-process the left and right channel of the stereo input audio signal for converting the stereo input audio signal to mono before applying the second signal processing rule.
  • a respective pre-processing can be beneficial for the (subsequent) application or implementation of the second signal processing rule, e.g. due to reduced computational efforts for carrying out the second signal processing rule.
  • a respective pre-processing rule for converting the stereo input audio signal to mono can comprise forming the arithmetic mean between the left channel samples and the right channel samples of the stereo input audio signal.
  • a respective pre-processing rule for converting the stereo input audio signal to mono can comprise summing the left channel samples with the right channel samples and for each pair of samples that have been added together dividing the result by two.
  • the summing can particularly, comprise adding of corresponding blocks of the left and the right channel of the or a respective stereo input audio signal.
  • the summing typically, further comprises or can be followed by dividing the result of the addition by two.
  • the method can further comprise applying a time-delay filter before application of the second signal processing rule for processing the input audio signal with the mono part of the RIR.
  • the time-delay filter can be applied by a hardware- and/or software-embodied time-delay filter unit which is configured to apply a time-delay filter before application of the second signal processing rule.
  • the time delay introduced by the time-delay filter is typically, equal to the time duration of the stereo part of the RIR data.
  • the length of the delay filter typically, corresponds to the length of the stereo part of the RIR.
  • the first and second signal processing rule can each comprise at least one filtering operation, particularly at least one convolving operation.
  • the first signal processing rule typically, comprises (exactly) two filtering operations and the second signal processing rule typically, comprises (exactly) one filtering operation.
  • the hardware- and/or software-embodied signal processing unit or structure for implementing the first and second signal processing rule can be embodied as filtering units, particularly as convolution units, configured to comprise at least one filtering operation, particularly at least one convolving operation.
  • the determination of the first number of RIR samples representing the stereo part of the RIR and the second number of RIR samples representing the mono part of the RIR can be done iteratively.
  • the determination of the first number of RIR samples representing the stereo part of the RIR and the second number of RIR samples representing the mono part of the RIR can be done experimentally, e.g. using a suitable hardware- and software-embodied signal processing structure. This determination can be an iterative process and can require the attention of a user, i.e. particularly an audio engineer.
  • the exemplary embodiments of the method comprise subdividing a pre-recorded RIR in early-stereo blocks (first group of RIR samples) and in late-mono blocks (second group of RIR samples).
  • the minimum required number of trailing zero samples are appended to the sequence h n to extend it to a length that is a multiple of B. These zero samples are placed after the last sample of the sequence.
  • the length of the sequence increases then to B[L/B], where [L/B] is the smallest integer greater than or equal to (L/B).
  • B 1 sample, no trailing zeros are appended to the RIR.
  • K L.
  • h n and h k refer to any of the left or the right channel.
  • FIG. 3 An example of the RIR partitioning into early-stereo blocks and late-mono blocks is shown in box I of Fig. 3 .
  • the Early Reflections Part (“ERP") of the RIR comprises the early-stereo blocks and the Late Reflections Part (“LRP") of the RIR comprises the late-mono blocks.
  • ERP Early Reflections Part
  • LRP Late Reflections Part
  • S 5 early-stereo blocks per channel
  • M 15 (15 late-mono blocks)
  • K 20 (a total of 20 blocks per channel comprise the RIR).
  • FIG. 1 shows a principle drawing of a digital signal processing structure 100 for implementing a method of processing a stereo input audio signal for generating a stereo output audio signal of a specific reverberation according to an exemplary embodiment of the invention
  • the digital signal processing structure 100 is divided to the upper part (what stands above block 24) and to the lower part (what stands below block 24).
  • the lower part can be referred to as the “Mono Subsystem” because the signals flowing through it are monophonic, whereas the upper part can be referred to as the “Stereo Subsystem” because the signals flowing through it are stereophonic.
  • Block 24 provides the transition from the "Stereo Subsystem” to the "Mono Subsystem” (note its two input ports and one output port).
  • Blocks 27 merge the two subsystems together and provide the transition from the "Mono Subsystem” back to the "Stereo Subsystem".
  • Blocks 15 represent the input to the structure for the left and right channel and blocks 31 the corresponding output of the structure for the left and right channel.
  • the building blocks of the "Stereo Subsystem” can be two modified UPOLS algorithms, one for the left and one for the right channel. These can be referred to as the UPOLS-left and the UPOLS-right subsystems.
  • the building block of the "Mono Subsystem” is a pruned UPOLS method that shares certain blocks with the UPOLS-left and UPOLS-right subsystems. This can be referred to as the UPOLS-mono subsystem.
  • the HUPOLS-stereo reverberator illustrated in Fig. 1 is based on the UPOLS algorithm.
  • the reverberator processes incoming samples x n frame-by-frame.
  • x n represents the value of the signal at time n ⁇ 0.
  • the signal is assumed to be zero for n ⁇ 0 and therefore no processing takes place before time zero.
  • the frame size is B samples, where B ⁇ 1.
  • Buffers 15 contain these samples. Buffer 15 on the left contains the samples of the vector x L k and buffer 15 on the right contains the samples of the vector x R k .
  • the first sample x kB (simplified notation for x L kB ) of the vector x k (simplified notation for x L k ) is located at the first (leftmost) location of buffer 15. This convention is followed for all buffers and vectors in this document, namely, the first element of a vector is placed at the first (leftmost) location of the buffer and the last element of a vector is placed at the last (rightmost) location of the buffer.
  • Transform 18 represents the size 2B Real-to-Complex Discrete Fourier Transform ("2B R-C DFT"), of the time-domain vector [ x k-1
  • x k ] [x kB-B , ..., x kB+(B-1) ] 1x2B .
  • 2B R-C DFT 2B Real-to-Complex Discrete Fourier Transform
  • the first Discrete Fourier Transform (“DFT") coefficient (this is the DC term) is X k2B and the last DFT coefficient is X k2B+(2B-1) .
  • y k ] [d kB , ..., d kB+(B-1)
  • Transform 28 represents the size 2B Complex-to-Real Inverse Discrete Fourier Transform ("2B C-R IDFT”)
  • This output has an inherent delay of (B-1) samples since a total of B input samples need to be collected to build up the block x k for the processing to start. Typically, only when an input block is complete can the output to this block be calculated.
  • the latency of the algorithm is typically, (B-1) samples.
  • Buffer 1 contains the samples of the vector e 0 (simplified notation for e L 0 ), buffer 2 the samples of the vector e 1 (simplified notation for e L 1 ), and buffer 3 the samples of the vector e S -1 (simplified notation for e L S-1 ).
  • Transform 5 represents the size 2B R-C DFT of the vector [ e 0
  • FD-LRIR Frequency-Domain Left RIR
  • a total of S buffers containing complex data comprise the FD-LRIR.
  • the content of the FD-LRIR is typically, calculated off-line and typically, stays constant throughout the streaming and processing of the data.
  • the static FD-LRIR can be stored in the processor memory. The same applies to the Frequency-Domain Right RIR (“FD-RRIR”) that appears on the right-hand side of the HUPOLS-stereo structure illustrated in Fig. 1 .
  • buffers and transforms 8 through 14 The meaning of buffers and transforms 8 through 14 is similar to that for the buffers and transforms 1 through 7 explained above.
  • Buffer 8 contains the samples of the vector l 0
  • buffer 9 the samples of the vector l 1
  • buffer 10 the samples of the vector l M-1 .
  • Transform 12 represents the size 2B R-C DFT of the vector [ l 0
  • Buffer 13 and all the buffers underneath are referred to as the Frequency-Domain Mono RIR ("FD-MRIR").
  • FD-MRIR Frequency-Domain Mono RIR
  • a total of M buffers containing complex data comprise the FD-MRIR.
  • the content of the FD-MRIR is typically, calculated off-line and typically, stays constant throughout the streaming and processing of the data.
  • the static FD-MRIR can be stored in the processor memory.
  • All S buffers under transform 18 are referred to as the Frequency-Domain Left Vector Delay Line ("FD-LVDL").
  • Buffers 19 and 20 are the first and the last buffer of FD-LVDL. Buffers not shown in Figure 1 are implied by the ellipsis 21.
  • All S buffers have the same size (B+1) and are initialised with zeros.
  • the output X k of transform 18 is calculated.
  • the last (B-1) elements of X k are implied by the complex-conjugate symmetry property of the size 2B R-C DFT. These are all discarded immediately after the output of transform 18 is calculated.
  • the remaining (B+1) samples of X k are shifted into buffer 19 and the previous elements of buffer 19 are shifted into the next buffer, namely into the buffer just below.
  • the M buffers of the FD-MVDL have all the same size (B+1) and are initialised with zeros. Buffers not shown in Fig. 1 are implied by the ellipsis 23.
  • Buffer 22 is the first buffer of the FD-MVDL. This buffer is updated with the sum of the complex vectors contained in buffers 20 of the FD-LVDL and the FD-RVDL (the Frequency-Domain Right Vector Delay Line), just before the elements of buffers 20 are updated. Block 24 is responsible for adding these two complex vectors. Apart from the way that its first buffer is fed, FD-MVDL works just like FD-LVDL and FD-RVDL.
  • the first input frame for which the initial zero values of FD-LVDL and FD-RVDL are completely removed is x L S-1 and x R S-1 .
  • the first input frame for which the initial zero values of FD-MVDL are completely removed is x L S+M-1 and x R S+M-1 .
  • the complex multiplier 25 forms the complex vector [E 0 X k2B , E 1 X k2B+1 , ..., E B X k2B+B ] 1x(B+1) .
  • each of the multipliers under multiplier 25 forms in a similar way the element-by-element complex product between the contents of its corresponding FD-LVDL buffer and FD-LRIR buffer.
  • the resulting S complex vectors (the vector products) are fed to the upper S input ports of the accumulator block 27.
  • the complex multiplier 26 forms the element-by-element complex product between the contents of its corresponding FD-MVDL buffer and FD-MRIR buffer.
  • the resulting M complex vectors are fed to the lower M input ports of the accumulator blocks 27.
  • This extension corresponds to the removal of the last (B-1) elements from the results of the transforms 18.
  • the extended vector [Y k2B+0 , Y k2B+1 , ..., Y k2B+(2B-1) ] 1x2B becomes the input of transform 28.
  • Fig. 2 shows another digital signal processing structure 100 according to an exemplary embodiment.
  • the digital signal processing structure can be deemed a HUPOLS-mono reverberator useable or used for a mono input audio signal.
  • FD-CVDL Frequency-Domain Common Vector Delay Line
  • the S buffers comprising the FD-CVDL are buffers 19, 20, and the buffers below buffer 19 and above buffer 20.
  • the first buffer of FD-MVDL is updated directly from the last buffer of FD-CVDL.
  • the vector data of FD-CVDL are used for both the left and the right channel of the ERP of the RIR.
  • the HUPOLS-stereo and HUPOLS-mono structures of Fig. 1 and Fig. 2 work in exactly the same way.
  • the HUPOLS-stereo reverberator of Fig. 1 achieves the complexity reduction by modelling the last M of the K RIR blocks with the UPOLS-mono subsystem (Mono Subsystem).
  • M 0, the UPOLS-mono subsystem, the block 24, and the lower M ports of the two blocks 27 in Fig. 1 vanish.
  • the digital signal processing structure 100 of Fig. 1 turns then into the UPOLS-stereo system, that independently processes the left and the right channel of the input audio signal with the left and the right channel of RIR.
  • This digital signal processing structure 100 which can also be denoted UPOLS-stereo system is illustrated in the exemplary embodiment of Fig. 4 .
  • HUPOLS-stereo uses ((2S+M) / (2K)) x 100% of the resources (FLOPIS and memory) required by the UPOLS-stereo system. This is a number between 50% and 100%.
  • HUPOLS-stereo uses 100% of the resources required by the UPOLS-stereo system, since the two methods become then identical.
  • HUPOLS-stereo uses 50% of the resources required by the UPOLS-stereo system. This setting corresponds to a monophonic configuration.
  • the structure of Fig. 2 turns then into the digital signal processing structure 100 illustrated in Fig. 5 which can be deemed a UPOLS-mono system.
  • This system processes the mono input audio signal with the left and the right channel of RIR.
  • HUPOLS-mono uses ((2S+M) / (2K)) x 100% of the FLOPIS and ((3S+2M) / (3K)) x 100% of the memory required by the UPOLS-mono system.
  • the last figure ranges from 66.6% to 100%.
  • FIG. 3 shows abstract models for the digital signal processing structures of Fig. 1 and Fig. 2 in accordance with an exemplary embodiment.
  • box II shows the abstract model for the structure of Fig. 1
  • box III shows the abstract model for the structure of Fig. 2 .
  • Box I of Fig. 3 shows the partitioning of the RIR into early-stereo and late-mono blocks, that was assumed for box II and box III of this figure.
  • the stereo digital signal processing structure 100 of Fig. 1 and the mono digital signal processing structure 100 of Fig. 2 uses for this configuration 62,5% of the resources required by the UPOLS-stereo system of Fig. 4 or the UPOLS-mono system of Fig. 5 .
  • blocks starting with block 1 represent the early-stereo blocks e L k , 0 ⁇ k ⁇ 5, for the left channel.
  • Blocks starting with block 2 represent the early-stereo blocks e R k , 0 ⁇ k ⁇ 5, for the right channel.
  • Blocks starting with block 3 represent the late-mono blocks l k , 0 ⁇ k ⁇ 15. These blocks were defined above.
  • Blocks starting with block 31 represent the blocks h L k+S used to define the blocks l k (see above), where 0 ⁇ k ⁇ 15.
  • Blocks starting with block 32 represent the blocks h R k+S used to define the blocks l k (see above), where 0 ⁇ k ⁇ 15.
  • block 4 represents any possible way of convolving the block's input signal with the left channel samples of ERP. It is accordingly for block 5 for the right channel samples of ERP and for block 6 for the mono samples of LRP.
  • Blocks 8 and 9 correspond to blocks 15 for the left and the right channel in the HUPOLS-stereo structure of Fig. 1 .
  • Block 89 corresponds to block 15 in the HUPOLS-mono structure of Fig. 2 .
  • Blocks 10 and 11 correspond to blocks 31 in Fig. 1 and Fig. 2 .
  • Adder 12 converts the stereo input audio signal to mono by adding the samples of the left and right channel. The division by two required for this conversion is incorporated into the definition of the late-mono blocks and is therefore omitted from the flow-graph of Fig. 3 .
  • Adders 13 and 14 mix the mono output signal of block 6 to the left channel signal (output of block 4) and to the right channel signal (output of block 5), to yield the left channel output (block 10) and the right channel output (block 11) of the model.
  • the outputs of blocks 4/5 represent the reverberation of the left/right channel of the input audio signal from the early-stereo samples of the left/right channel of RIR.
  • the output of block 6 represents the reverberation of the mono version of the input stereo signal from the late-mono samples of RIR.
  • the left and right channels of the input audio signal are the same (mono input audio signal). Apart from this, the description is the same as for the stereo configurations HUPOLS-s, (box II of Fig.3 ).
  • the HUPOLS-s structures illustrated in Fig. 1 and Fig. 3 (box II) are equivalent, in that they produce the same outputs for the same inputs. Due to this equivalence, reference is made to the digital signal processing structure 100 of Fig. 3 (box II) as the abstract model of the HUPOLS-s stereo configuration 100 of Fig. 1 .
  • the adder 12 of the abstract model corresponds to the vector summation block 24 of HUPOLS-s of Fig. 1 .
  • the SB samples delay of the abstract model is implemented in HUPOLS-s of Fig. 1 by the FD-LVDL and FD-RVDL, by the mechanism of buffers 16 and 17, and by the transforms 18.
  • Each buffer of the FD-LVDL corresponds to a delay of B samples and in this way the cascade of the S buffers yields the SB samples delay.
  • the adder 12 of the abstract model is implemented in the stereo configuration of HUPOLS-s of Fig. 1 in the frequency-domain, due to the transform 18 of HUPOLS-s, and is done after the SB samples delay, since the HUPOLS-s block 24 is placed after the FD-LVDL and FD-RVDL.
  • the adders 13 and 14 of the abstract model are implemented with the accumulation blocks 27 of the stereo configuration HUPOLS-s of Fig. 1 for the left and right channel, respectively.
  • adder 13 is achieved by adding the sum of the lower M input vectors to the sum of the upper S input vectors with the left block 27. It is accordingly for adder 14.
  • the adders 13 and 14 of the abstract model are implemented in the stereo configuration HUPOLS-s of Fig. 1 in the frequency-domain, since the left and right blocks 27 stand in-between the transforms 18 and 28.
  • the blocks 4 and 5 of the abstract model correspond to the UPOLS-left and the UPOLS-right subsystems of HUPOLS-s of Fig. 1 .
  • block 6 of the abstract model corresponds to the UPOLS-mono subsystem of Fig. 1 .
  • HUPOLS-s of Fig. 1 does not require extra memory to implement the delay filter 7 of the abstract model. Moreover, it economizes the DFT and IDFT operations that would normally be needed for implementing block 6 of the abstract model with the stand-alone UPOLS method, by taking advantage of the linearity property of the DFT and IDFT operations. It is similarly for HUPOLS-m of Fig. 2 .
  • V L k and V R k where 1 ⁇ k ⁇ K, of the K multipliers for the left and right channel (multipliers 25 and multipliers underneath) are calculated, these outputs for S+1 ⁇ k ⁇ K are replaced by their arithmetic mean ( V L k + V R k )/2, for both the left and the right channel.
  • V L k + V R k the arithmetic mean
  • V L 3 and V R 3 for V L 4 and V R 4 , etc.
  • V L K and V R K are replaced by their arithmetic mean.
  • S can vary during the simulation, e.g. can slowly decrease starting from large values, to determine when a deterioration of the stereo signal quality starts being noticeable.
  • the partitioning of RIR to ERP and LRP can be done as described above and then the HUPOLS-s reverberator of Fig. 1 can be setup as described above.
  • the digital signal processing structure 100 of Fig. 1 and Fig. 2 can form part of a hardware- and/or software-embodied digital signal processing device, comprising means, particularly a respective digital signal processing structure 100, for carrying out the method as described in context with the above embodiments.
  • a respective digital signal processing device can comprise a computer program product comprising instructions which, when the program is executed by a computer, cause the computer to carry out the method as described in context with the above embodiments.
  • a respective computer program product can be stored on a computer-readable data carrier.
  • a respective digital signal processing device can form part of an audio signal processing apparatus for processing an input audio signal.
  • a respective audio processing apparatus can be installed in a vehicle, particularly a car.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Stereophonic System (AREA)
EP20210629.0A 2020-11-30 2020-11-30 Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften Pending EP4007310A1 (de)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP20210629.0A EP4007310A1 (de) 2020-11-30 2020-11-30 Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP20210629.0A EP4007310A1 (de) 2020-11-30 2020-11-30 Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften

Publications (1)

Publication Number Publication Date
EP4007310A1 true EP4007310A1 (de) 2022-06-01

Family

ID=73646173

Family Applications (1)

Application Number Title Priority Date Filing Date
EP20210629.0A Pending EP4007310A1 (de) 2020-11-30 2020-11-30 Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften

Country Status (1)

Country Link
EP (1) EP4007310A1 (de)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460516A (zh) * 2022-09-05 2022-12-09 中国第一汽车股份有限公司 单声道转立体声的信号处理方法、装置、设备及介质

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150350801A1 (en) * 2013-01-17 2015-12-03 Koninklijke Philips N.V. Binaural audio processing
EP3402222A1 (de) * 2014-01-03 2018-11-14 Dolby Laboratories Licensing Corporation Erzeugung eines binauralen tons in reaktion auf ein mehrkanalaudiosystem mit mindestens einem rückkopplungsverzögerungsnetzwerk

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150350801A1 (en) * 2013-01-17 2015-12-03 Koninklijke Philips N.V. Binaural audio processing
EP3402222A1 (de) * 2014-01-03 2018-11-14 Dolby Laboratories Licensing Corporation Erzeugung eines binauralen tons in reaktion auf ein mehrkanalaudiosystem mit mindestens einem rückkopplungsverzögerungsnetzwerk

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FRANK WEFERS: "Partitioned convolution algorithms for real-time auralization", 13 May 2015 (2015-05-13), XP055566796, Retrieved from the Internet <URL:http://publications.rwth-aachen.de/record/466561/files/466561.pdf?subformat=pdfa> [retrieved on 20190311] *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460516A (zh) * 2022-09-05 2022-12-09 中国第一汽车股份有限公司 单声道转立体声的信号处理方法、装置、设备及介质

Similar Documents

Publication Publication Date Title
JP7183467B2 (ja) 少なくとも一つのフィードバック遅延ネットワークを使ったマルチチャネル・オーディオに応答したバイノーラル・オーディオの生成
EP2829082B1 (de) Verfahren und system zur kopfbezogenen übertragungsfunktionserzeugung durch lineares mischen von kopfbezogenen übertragungsfunktionen
US10187741B2 (en) Device and method for processing a signal in the frequency domain
KR100971700B1 (ko) 공간큐 기반의 바이노럴 스테레오 합성 장치 및 그 방법과,그를 이용한 바이노럴 스테레오 복호화 장치
KR101325644B1 (ko) 변환 영역에서의 효율적인 바이노럴 사운드 공간화 방법 및장치
JP2792311B2 (ja) 多チャンネルエコー除去方法および装置
EP3247135B1 (de) Fortschrittliche verarbeitung auf basis einer mit komplexer exponentialfunktion modulierten filterbank
JP5122879B2 (ja) 時間および周波数領域内のパーティションされた高速畳み込み
EP2939443B1 (de) System und verfahren zur variablen dekorrelation von audiosignalen
US7502816B2 (en) Signal-processing apparatus and method
US9431987B2 (en) Sound synthesis with fixed partition size convolution of audio signals
CN107039043A (zh) 信号处理的方法及装置、多人会话的方法及系统
EP4007310A1 (de) Verfahren zur verarbeitung eines eingangsaudiosignals zur erzeugung eines stereoausgangsaudiosignals mit spezifischen nachhallseigenschaften
EP2907324B1 (de) System und verfahren zur reduzierung der latenzzeit in transposerbasierten virtuellen basssystemen
JP2005065231A (ja) 信号処理装置及びその方法
EP3329485B1 (de) System und verfahren zur räumlichen verarbeitung von schallfeldsignalen
Spors et al. Efficient realization of model-based rendering for 2.5-dimensional near-field compensated higher order Ambisonics
JP2009027388A (ja) 同相成分抽出方法及び装置
KR102310859B1 (ko) 공간 효과를 갖는 사운드 공간화
EP2730026B1 (de) Filterung mit geringer verzögerung
CN101106384A (zh) 时域和频域中的分段快速卷积
JP6518661B2 (ja) 複雑さの観点から最適化された、空間効果を伴う音響空間化
US20050223050A1 (en) Efficient method and apparatus for convolution of input signals
Moir Inverting non-minimum phase FIR transfer functions with application to reverberant speech
JP2000267682A (ja) 畳み込み演算装置

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20221201

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230530

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20240430