WO2020062922A1

WO2020062922A1 - Sound effect processing method and related product

Info

Publication number: WO2020062922A1
Application number: PCT/CN2019/090380
Authority: WO
Inventors: 严锋贵
Original assignee: Oppo广东移动通信有限公司
Priority date: 2018-09-25
Filing date: 2019-06-06
Publication date: 2020-04-02
Also published as: CN109246580A; CN109246580B

Abstract

Disclosed are a sound effect processing method and a related product. The method comprises: determining a three-dimensional coordinate of each sound source from among a plurality of sound sources corresponding to an electronic device, and single-track data generated by each sound source, so as to obtain a plurality of first three-dimensional coordinates and a plurality of pieces of single-track data; determining a second three-dimensional coordinate of a target object corresponding to the electronic device; and according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate, synthesizing the plurality of pieces of single-track data to obtain target dual-track data. By means of the present application, a playing effect of audio data can be improved.

Description

Sound effect processing method and related products

This application claims the priority of a Chinese patent application filed on September 25, 2018 with the Chinese Patent Office, application number 201811118269.4, and application name "3D Sound Processing Method and Related Products", the entire contents of which are incorporated herein by reference. .

Technical field

The present application relates to the technical field of audio playback, and in particular, to a sound effect processing method and related products.

Background technique

With the diversified development of electronic device functions and its portable characteristics, more and more people like to use electronic devices for some entertainment activities. In particular, you can listen to songs and watch videos anytime, anywhere according to user needs.

In the existing audio playback methods, the sound volume is often based on the volume set by the user, and the sounding body uses a constant power corresponding to the volume set by the user to play the audio, so that the played sound meets the user's loudness requirements. However, such a playback method is too simple in form, which often brings sensory fatigue to users.

Summary of the Invention

In a first aspect, an embodiment of the present application provides a sound effect processing method, including:

Determining the three-dimensional coordinates of each of the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source to obtain a plurality of first three-dimensional coordinates and a plurality of mono data;

Determining a second three-dimensional coordinate of a target object corresponding to the electronic device;

Synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data.

In a second aspect, an embodiment of the present application provides a sound effect processing device, including:

A determining unit, configured to determine the three-dimensional coordinates of each of the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source to obtain a plurality of first three-dimensional coordinates and a plurality of mono data; The second three-dimensional coordinates of the target object corresponding to the electronic device;

A synthesizing unit is configured to synthesize the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data.

According to a third aspect, an embodiment of the present application provides an electronic device including a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be processed by the foregoing. And the program includes instructions for some or all of the steps as described in the first aspect.

In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program causes a computer to execute the program as described in the first aspect of the embodiment of the application. Describe some or all of the steps.

In a fifth aspect, an embodiment of the present application provides a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute Some or all of the steps described in the first aspect of the embodiment of the application. The computer program product can be a software installation package.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to explain the technical solutions in the embodiments of the present application more clearly, the drawings used in the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application. Those of ordinary skill in the art can obtain other drawings according to the drawings without paying creative labor.

FIG. 1A is a schematic structural diagram of an electronic device according to an embodiment of the present application; FIG.

FIG. 1B is a schematic diagram of a scene of a coordinate axis of an electronic device according to an embodiment of the present application; FIG.

2A is a schematic flowchart of a sound effect processing method according to an embodiment of the present application;

FIG. 2B is a schematic diagram of a multi-channel dual-channel data scenario according to an embodiment of the present application; FIG.

3 is a schematic structural diagram of a sound effect processing device according to an embodiment of the present application;

FIG. 4 is a schematic structural diagram of another electronic device according to an embodiment of the present application.

detailed description

In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application.

The electronic devices involved in the embodiments of the present application may include various handheld devices (such as smart phones), vehicle-mounted devices, virtual reality (VR) / augmented reality (AR) devices with wireless communication functions, and may Wearable devices, computing devices or other processing devices connected to wireless modems, and various forms of user equipment (UE), mobile stations (MS), terminal devices, R & D / test platforms, Server and so on. For ease of description, the devices mentioned above are collectively referred to as electronic devices.

Please refer to FIG. 1A, which is a schematic structural diagram of an electronic device according to an embodiment of the present application. The electronic device includes a control circuit and an input-output circuit, and the input-output circuit is connected to the control circuit.

The control circuit may include a storage and processing circuit. The storage circuit in the storage and processing circuit may be a memory, such as a hard disk drive memory, a non-volatile memory (such as a flash memory or other electronic programmable read-only memory used to form a solid-state drive, etc.), a volatile memory (such as a static Or dynamic random access memory, etc.), this embodiment is not limited. The processing circuit in the storage and processing circuit can be used to control the operation of the electronic device. The processing circuit can be implemented based on one or more microprocessors, microcontrollers, digital signal processors, baseband processors, power management units, audio codec chips, application specific integrated circuits, display driver integrated circuits, and the like.

The storage and processing circuit can be used to run software in an electronic device, for example, playing an incoming call alert ringing application, playing a short message alert ringing application, playing an alarm alert ringing application, playing a media file application, an Internet protocol voice ( Voice over Internet Protocol (VOIP) telephone calling applications, operating system functions, etc. These software can be used to perform some control operations, such as playing the call alert ring, playing the short message alert ring, playing the alarm alert ring, playing media files, making voice phone calls, and other functions in electronic devices. The examples are not limited.

Among them, the input-output circuit can be used to enable the electronic device to implement data input and output, that is, to allow the electronic device to receive data from an external device and to allow the electronic device to output data from the electronic device to the external device.

The input-output circuit may further include a sensor. The sensor may include an ambient light sensor, an infrared proximity sensor based on light and capacitance, an ultrasonic sensor, and a touch sensor (for example, a light touch sensor and / or a capacitive touch sensor, where the touch sensor may be part of a touch display screen, or Can be used independently as a touch sensor structure), acceleration sensor, gravity sensor, and other sensors. The input-output circuit may further include an audio component, and the audio component may be used to provide audio input and output functions for the electronic device. The audio component may also include a tone generator and other components for generating and detecting sound.

In the embodiment of the present application, the sensor further includes a three-axis acceleration sensor for measuring a posture and an inclination angle of the electronic device. In addition to automatically switching the horizontal and vertical display angles, it can also be used as a motion offset compensation calculation when the global positioning system (GPS) signal is not good, which can fully and accurately reflect the motion properties of the object.

Please refer to FIG. 1B, which is a schematic diagram of a scene where a three-dimensional acceleration sensor determines a coordinate axis of an electronic device. As shown in Figure 1B, the x-axis, y-axis, and z-axis are relative to the body of the electronic device. Usually, the y-axis body is upward, the x-axis body is right, and the z-axis is perpendicular to the front of the fuselage, and the center The same direction of gravity. The horizontal component, vertical component, and vertical component are generally a unit of gravity (the size is 1g (m * m / s), the direction is perpendicular to the ground downward), and the projection on each axis. That is, the horizontal component is the corresponding value on the x-axis, the vertical component is the corresponding value on the y-axis, and the vertical component is the corresponding value on the z-axis.

For example: place the electronic device on the desktop, the x-axis defaults to 0, the y-axis defaults to 0, and the z-axis defaults to 9.81; place the electronic device on the desktop with the z-axis at -9.81; tilt the electronic device to the left, x The axis is positive; tilt the electronic device to the right and the x-axis is negative; tilt the electronic device upwards and the y-axis is negative; tilt the electronic device downwards and the y-axis is positive; set the z-axis to less than -3 In this case, the touch screen of the electronic device faces downward.

The input-output circuit may also include one or more display screens. The display screen may include one or a combination of a liquid crystal display, an organic light emitting diode display, an electronic ink display, a plasma display, and a display using other display technologies. The display screen may include a touch sensor array (ie, the display screen may be a touch display screen). The touch sensor can be a capacitive touch sensor formed by a transparent array of touch sensor electrodes (such as indium tin oxide (ITO) electrodes), or it can be a touch sensor formed using other touch technologies, such as sonic touch, pressure-sensitive touch, resistance Touch, optical touch, etc. are not limited in the embodiments of the present application.

The input-output circuit may further include a communication circuit for providing an electronic device with a capability to communicate with an external device. The communication circuit may include analog and digital input-output interface circuits, and wireless communication circuits based on radio frequency signals and / or optical signals. The wireless communication circuit in the communication circuit may include a radio frequency transceiver circuit, a power amplifier circuit, a low noise amplifier, a switch, a filter, and an antenna. For example, the wireless communication circuit in the communication circuit may include a circuit for supporting near field communication (NFC) by transmitting and receiving a near field coupled electromagnetic signal. For example, the communication circuit may include a near field communication antenna and a near field communication transceiver. The communication circuit may also include a cellular phone transceiver and antenna, a wireless local area network transceiver circuit and antenna, and the like.

The input-output circuit may further include an input-output unit. The input-output unit may include a button, a joystick, a click wheel, a scroll wheel, a touch pad, a keypad, a keyboard, a camera, a light emitting diode, and other status indicators.

The electronic device may further include a battery (not shown), and the battery is used to provide power to the electronic device.

The embodiments of the present application are described in detail below.

Referring to FIG. 2A, an embodiment of the present application provides a schematic flowchart of a sound effect processing method, which is applied to an electronic device. Specifically, as shown in FIG. 2A, the method includes:

S201: Determine the three-dimensional coordinates of each sound source in the multiple sound sources corresponding to the electronic device and the mono data generated by each sound source to obtain multiple first three-dimensional coordinates and multiple mono data.

The embodiments of the present application can be applied to a virtual reality / augmented reality scene, or a three-dimensional (3D) recording scene. In the embodiment of the present application, the sound source may be a sounding body in a virtual scene, for example, an airplane in a game scene, and the sound source may be a fixed sound source or a mobile sound source. As shown in the coordinate axis of the electronic device shown in FIG. 1B, the sound source corresponding to the electronic device can use the above-mentioned coordinate axis as a reference to determine the first three-dimensional coordinate corresponding to the sound source, and when the sound source emits sound, it can obtain the source Of mono data.

The electronic device may include multiple sound sources. For example, the sound source of the game scene includes airplanes, guns, rivers, etc. The corresponding mono data is the gliding sound of the aircraft, the loading of guns, the sound of the fire, the sound of the water flowing in the river; the sound source of the game scene can also include game players, The corresponding mono data is footstep sounds, voice sounds, etc. of the game player, which is not limited here.

In a possible example, the specific implementation of step S201 may include: determining multiple reference objects corresponding to the electronic device; determining behavior information of each reference object in the multiple reference objects to obtain multiple behavior information; and according to the The plurality of behavior information determines a plurality of sound sources corresponding to the electronic device in the plurality of reference objects; determining a coordinate position corresponding to each sound source in the plurality of sound sources to obtain the plurality of first three-dimensional coordinates; The behavior information corresponding to each of the plurality of sound sources determines the mono data of the sound source to obtain the plurality of mono data.

The reference object may be an object presented on a display page of the electronic device, for example, a house on the display page, a car in which a game player rides, or a firearm held. The reference object may also be an object that is not presented on the display page, such as a nearby game player, a gun, a vehicle, or the like.

The behavior information is dynamic information of the reference object. It can be understood that different reference objects correspond to different types of behaviors. For example, guns can emit the sound of loading and firing guns, but not the sound of water flowing or talking. And each reference corresponds to a different sound when the behavior information is different. For example, the house will not emit sound under normal circumstances, but will emit a sound of bombardment when it is bombarded; the starting sound and driving sound will only occur when the car is moving. Sound; gamers produce voices when sending voices, footsteps when walking, and more. Therefore, in the possible examples described above, first determine multiple reference objects corresponding to the electronic device, and then obtain behavior information of each reference object, and determine whether the reference object is a sound source according to each behavior information, thereby improving the accuracy of determining the sound source. . Then, the coordinate position of each sound source is further determined to obtain a plurality of first three-dimensional coordinates, and then the mono data of the sound source is determined according to the behavior information corresponding to each sound source, which improves the accuracy of determining the mono data.

This application does not limit how to determine the coordinate position corresponding to the sound source. For example, if it corresponds to a game scene and the game scene corresponds to a three-dimensional map, the coordinate position corresponding to different sound sources can be determined according to the map, that is, for characters Determining the first three-dimensional coordinates at a specific location can improve the accuracy of determining the first three-dimensional coordinates, facilitate the improvement of the 3D sound effect of the target two-channel data, and allow the user to be immersed in the game and feel the game world more realistic.

This application does not limit how to determine mono data according to behavior information. If the plurality of behavior information includes target behavior information corresponding to a target sound source among the multiple sound sources, in a possible example, the Determining the mono data of the sound source according to the behavior information corresponding to each of the multiple sound sources to obtain the multiple mono data includes: determining a sound type and a playback parameter corresponding to the target behavior information; The sound type and the playback parameter generate mono data of the target sound source.

The sound type is the sound type of the target behavior information corresponding to the sound. For example, the firearm includes the sound type of loading, firing, and hitting, and the playback parameters are loudness, frequency, and tone.

It can be understood that the specific implementation of step S201 takes the target sound source as an example, determines the sound type and playback parameters of the target sound source according to the target behavior information corresponding to the target sound source, and then generates the target sound source according to the sound type and playback parameters. Mono data can further improve the accuracy of determining mono data, improve the fit of mono data to application scenarios, and improve user experience.

S202: Determine a second three-dimensional coordinate of a target object corresponding to the electronic device.

In the embodiment of the present application, the target object may be a game player corresponding to an electronic device in a game, a virtual reality or an augmented reality scene, or a target user corresponding to an electronic device in a 3D recording scene. The target object may also correspond to a three-dimensional position, that is, a second three-dimensional position. Of course, the first three-dimensional position is different from the second three-dimensional position. For the method of determining the second three-dimensional coordinate, reference may be made to the method of determining the first three-dimensional coordinate, that is, the second three-dimensional coordinate of the target object is determined using the coordinate axis shown in FIG. 1B as a reference, and details are not described herein again.

S203: Combining the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data.

Where the first three-dimensional coordinates corresponding to each sound source and the second three-dimensional coordinates of the target object are known, the mono data corresponding to each sound source can be synthesized to obtain the target two-channel data. Specifically, a plurality of first three-dimensional coordinates, a second three-dimensional coordinate, and a plurality of mono data can be input into a Head Related Transformation Function (HRTF) model to obtain target binaural data.

In specific implementation, the electronic device can filter the audio data (sound from the sound source) using HRTF filters to obtain virtual surround sound, also called surround sound or panoramic sound, to achieve a three-dimensional stereo sound effect. The corresponding name of HRTF in the time domain is Head-related Impulse Response (HRIR), or convolution of audio data with Binaural Room Impulse Response (BRIR), Binaural Room Impulse Response Consists of three parts: direct sound, early reflections, and reverb.

For example, the electronic device may be based on the spatial three-dimensional coordinate position (x, y, z) of the sound source, and the position may be any coordinate. The left and right channels are generated based on the mono data generated by the sound source. The principle of the left and right channels is based on the time difference between the sound source and the listener (X, Y, Z). And the phase pressure difference generates a two-channel sound.

In a possible example, the specific implementation of step S203 may include: determining the left ear three-dimensional coordinates and the right ear three-dimensional coordinates corresponding to the second three-dimensional coordinates; and according to the plurality of first three-dimensional coordinates, the left ear three-dimensional coordinates Determining the transmission paths between the plurality of sound sources and the target object by the coordinates and the three-dimensional coordinates of the right ear to obtain a plurality of transmission paths; determining each of the plurality of mono data according to the plurality of transmission paths The time and phase pressure of the mono data transmitted to the three-dimensional coordinates of the left ear and the three-dimensional coordinates of the right ear are obtained in multiple times and multiple phase pressures; and the multiple mono data are determined according to the multiple times. A plurality of time differences are obtained for the time difference corresponding to each mono data, and a phase pressure difference corresponding to each mono data in the plurality of mono data is determined according to the plurality of phase pressures to obtain a plurality of phase pressure differences; Determining the delay parameters corresponding to each of the plurality of mono data by the plurality of time differences and the plurality of phase pressure differences to obtain a plurality of delay parameters; and according to the plurality of delays Parameter, processing each mono data in the plurality of mono data to obtain corresponding left channel parameters and right channel parameters; corresponding to each mono data in the plurality of mono data And synthesizing the plurality of mono data to obtain the target two-channel data.

The target object corresponds to a left ear three-dimensional coordinate and a right ear three-dimensional coordinate. This application does not limit how to determine the three-dimensional coordinates of the left ear and the three-dimensional coordinates of the right ear, and may be determined according to the 3D character model of the target object, that is, according to the second three-dimensional coordinates of the target object and the right ear that are preset in the 3D character model. The correlation between the three-dimensional coordinates, the second three-dimensional coordinates and the left ear three-dimensional coordinates is determined.

The time difference and the phase pressure difference are respectively the time difference and the phase pressure difference transmitted to the left ear three-dimensional coordinates and the right ear three-dimensional coordinates. That is, the time difference and phase pressure difference between the mono data corresponding to the sound source and the left and right ears of the target object are transmitted.

It can be understood that due to the distance between the left and right ears, there is a pressure difference when the sound travels through the air. Therefore, by implementing the specific implementation of step S203, the left channel parameters and the right channel parameters of the mono data can be determined according to the delay parameters, and then synthesized to obtain the target two-channel data, which improves the playback effect of the audio data. Create immersive sensations to improve user experience.

This application does not limit how to determine the time and phase pressure. This application takes the time and phase pressure corresponding to the three-dimensional coordinates of the target sound source to the left ear as an example. The time and phase pressure corresponding to the three-dimensional coordinates of the target sound source to the left ear are transmitted. Refer to this method for the method of determining the phase pressure and the monophonic data corresponding to other sound sources other than the target sound source among multiple sound sources to the left ear three-dimensional coordinates and the right ear three-dimensional coordinates.

Specifically, if the plurality of sound sources includes a target sound source, the plurality of first three-dimensional coordinates includes a target first three-dimensional coordinate corresponding to the target sound source, and the plurality of mono data includes the target sound Target mono data corresponding to the source; in a possible example, determining, according to the plurality of transmission paths, each mono data of the plurality of mono data transmitted to the three-dimensional coordinates of the left ear Obtaining multiple time and multiple phase pressures with time and phase pressure includes: obtaining a cross-section using the target first three-dimensional coordinate and the left ear three-dimensional coordinate as an axis; determining the target first three-dimensional coordinate and the left ear three-dimensional Blocking objects between coordinates; determining a plurality of reference transmission paths for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the cross section and the occluding objects; determining the reference transmission paths according to the plurality of reference transmission paths The time and phase pressure of the target mono data transmitted to the three-dimensional coordinates of the left ear.

In the real environment, sound travels in all directions. Of course, reflection, refraction, interference, diffraction and other phenomena also occur during the propagation process. Therefore, the propagation of the target mono data can include multiple reference transmissions. path. As shown in FIG. 2B, the first three-dimensional coordinates of the target and the three-dimensional coordinates of the left ear are used as the cross-sections. Since the sound propagation direction is fixed, the propagation trajectory will also have a certain symmetry along a certain symmetry axis, and multiple transmissions can be obtained path. The propagation of sound will be dispersed and transmitted when it encounters occluded objects, so that multiple corresponding reference transmission paths are determined according to the cross-section and the occluded objects in the application scene.

It can be understood that the cross-section of a sound source data transmission is determined by using the first three-dimensional coordinates of the target and the three-dimensional coordinates of the left ear as axes, and then multiple reference transmission paths corresponding to the target mono data are determined according to the cross-section and the occluded object. The multiple reference transmission paths determine the phase pressure transmitted by the target sound source to the left ear of the target object. That is, the possible multiple reference transmission paths are determined according to the cross-sections corresponding to the first three-dimensional coordinates of the target and the three-dimensional coordinates of the left ear, and then the time and phase pressure are determined according to the multiple reference transmission paths, which improves the accuracy of determining the time and phase pressure. Sex.

In a possible example, the determining the time and phase pressure of transmitting the target mono data to the three-dimensional coordinates of the left ear according to the multiple reference transmission paths includes: determining each of the multiple reference transmission paths The sound intensity and sound pressure corresponding to a transmission path are used to obtain a plurality of sound intensity and a plurality of sound pressures; and the target mono data is determined to be transmitted to the left ear three-dimensionally according to the sound intensity and the sound pressure. Coordinate time and phase pressure.

Among them, the sound intensity refers to the sound energy of a unit area per unit time, which is perpendicular to the direction of propagation, and the unit is W / m2. Sound pressure is the increase in pressure due to the presence of sound waves, and the unit is Pa.

This application does not limit how to determine the phase pressure based on multiple sound pressures. The preset weights can be determined according to the relative distance corresponding to the corresponding reference transmission path, and then multiple sound pressures and corresponding preset weights are weighted. Get the phase pressure.

It can be understood that each sound source data has a corresponding sound pressure. Therefore, first determining the sound pressure corresponding to each reference transmission path to obtain multiple sound pressures, and then determining the phase pressure of the target mono data transmitted to the three-dimensional coordinates of the left ear based on the multiple sound pressures can improve the accuracy of determining the phase pressure.

In the sound effect processing method shown in FIG. 2A, the three-dimensional coordinates of each sound source in a plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source are determined to obtain a plurality of first three-dimensional coordinates and a plurality of sound sources. The channel data and the second three-dimensional coordinates of the target object corresponding to the electronic device are determined. Then, according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate, the plurality of mono data is synthesized to obtain the target two-channel data. In this way, after the first three-dimensional coordinates of the multiple sound sources and the second three-dimensional coordinates of the target object are determined, the target two-channel data corresponding to the multiple sound sources is generated, thereby improving the playback effect of the audio data and generating an immersive feeling. Facilitate user experience.

In a possible example, the method further includes: determining a target reverberation parameter corresponding to the target object; and processing the target two-channel data according to the target reverberation parameter to obtain reverberant two-channel data.

Among them, the target reverb parameters include input volume, low frequency cut, high frequency cut, early reflection time, spatial breadth, diffusion degree, low mixing ratio, crossover point, reverberation time, high frequency attenuation point, dry sound adjustment, and reverberation. The volume, the amount of early reflected sound, the width of the sound field, the output sound field, and the tail sound are not limited here.

It can be understood that the target reverberation parameter corresponding to the target object is determined, and then the target two-channel data is processed according to the target reverberation parameter to obtain the reverberant two-channel data. In this way, processing the target two-channel data according to the target object can further improve the playback effect of the audio data and improve the user experience.

This application does not limit how to determine the target reverberation parameter. In a possible example, determining the target reverberation parameter corresponding to the target object includes: obtaining multiple historical reverberations corresponding to the target object that are stored in advance. Playing records; acquiring listening parameters corresponding to each historical reverb playing record in the plurality of historical reverb playing records to obtain a plurality of listening parameters; and determining a target reverb parameter corresponding to the target object according to the plurality of listening parameters.

The listening parameters include audio type, playback duration, playback adjustment times, user mood parameters, and so on. It can be understood that obtaining multiple historical reverberation playback records corresponding to the target object in advance, obtaining listening parameters corresponding to each historical reverberation playback record, and then determining the target reverberation parameters based on the multiple listening parameters improves the determination of the target reverberation. The accuracy of the parameters is convenient for improving the playback effect.

This application does not limit how to determine the target reverberation parameter according to multiple listening parameters. In a possible example, determining the target reverberation parameter corresponding to the target object according to the multiple listening parameters includes: according to the Multiple listening parameters determine the evaluation value corresponding to each historical reverb play record in the multiple historical reverb play records to obtain multiple evaluation values; and use the historical reverb record corresponding to the maximum value among the multiple evaluation values as A target historical reverberation record; and a reverberation parameter corresponding to the target historical reverberation record as a target reverberation parameter corresponding to the target object.

Wherein, if the plurality of historical reverberation playback records includes a target historical reverberation playback record, taking the target historical reverberation playback record as an example, first determine an application scenario of the electronic device, and determine that the application scenario corresponds to Preset mood parameters. Then, an evaluation value corresponding to the target historical reverberation record is determined according to a difference between the preset mood parameter and the user mood parameter.

It can be understood that, in different application scenarios, the preset mood parameters corresponding to the target object are different. In this way, the preset mood parameters of the target object are determined according to the application scenario corresponding to the electronic device. Then, the evaluation value corresponding to the target historical reverberation record is determined according to the difference between the preset mood parameter and the user mood parameter. In this application, the evaluation value corresponding to the historical reverberation record is determined according to the listening parameters of each historical reverberation record, and then the historical reverberation record corresponding to the largest evaluation value is selected as the target historical reverberation record, and then according to the target historical reverberation record. The recorded listening parameters determine the target reverberation parameters, which improves the accuracy of determining the target reverberation parameters.

Please refer to FIG. 3. FIG. 3 is a schematic structural diagram of a sound effect processing device according to an embodiment of the present application. As shown in FIG. 3, the sound effect processing device 300 includes a determining unit 301 and a synthesizing unit 302, where:

The determining unit 301 is configured to determine the three-dimensional coordinates of each sound source in the multiple sound sources corresponding to the electronic device and the mono data generated by each sound source to obtain multiple first three-dimensional coordinates and multiple mono data; The second three-dimensional coordinates of the target object corresponding to the electronic device;

The synthesizing unit 302 is configured to synthesize the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target binaural data.

In a possible example, in the aspect of synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data, the determining unit 301 is further configured to determine the left ear three-dimensional coordinates and the right ear three-dimensional coordinates corresponding to the second three-dimensional coordinates; and determine the multiple according to the plurality of first three-dimensional coordinates, the left ear three-dimensional coordinates, and the right ear three-dimensional coordinates. A plurality of transmission paths are obtained from transmission paths between the sound sources and the target object; and each of the plurality of mono data is determined to be transmitted to the left ear three-dimensional coordinates according to the plurality of transmission paths. And time and phase pressure of the three-dimensional coordinate of the right ear to obtain multiple times and multiple phase pressures; determining a time difference corresponding to each of the plurality of mono data according to the plurality of times to obtain multiple Time difference, determining a phase pressure difference corresponding to each of the plurality of mono data according to the plurality of phase pressures to obtain a plurality of phase pressure differences; and according to the plurality of time differences and the plurality of phases, The bit pressure difference determines a delay parameter corresponding to each of the plurality of mono data to obtain a plurality of delay parameters; according to the plurality of delay parameters, for each of the plurality of mono data, The channel data is processed to obtain corresponding left channel parameters and right channel parameters; the synthesizing unit 302 is specifically configured to use the left channel parameters and the right channels corresponding to each of the plurality of mono data Channel parameters, synthesizing the plurality of mono data to obtain target two-channel data.

In a possible example, the plurality of sound sources includes a target sound source, the plurality of first three-dimensional coordinates includes a target first three-dimensional coordinate corresponding to the target sound source, and the plurality of mono data includes all The target mono data corresponding to the target sound source;

The time and phase pressure of each mono data transmission to the left ear three-dimensional coordinate and the right ear three-dimensional coordinate in the plurality of mono data determined according to the plurality of transmission paths to obtain a plurality of times With respect to multiple phase pressures, the determining unit 301 is specifically configured to obtain a cross-section using the target first three-dimensional coordinate and the left ear three-dimensional coordinate as an axis; determining the target first three-dimensional coordinate and the left ear three-dimensional Blocking objects between coordinates; determining a plurality of reference transmission paths for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the cross section and the occluding objects; determining the reference transmission paths according to the plurality of reference transmission paths The time and phase pressure of the target mono data transmitted to the three-dimensional coordinates of the left ear.

In a possible example, in determining the time and phase pressure for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the multiple reference transmission paths, the determining unit 301 is specifically configured to determine The sound intensity and sound pressure corresponding to each transmission path in the plurality of reference transmission paths obtain a plurality of sound intensity and a plurality of sound pressures; and determine the target mono data to be transmitted to the left according to the plurality of sound intensity. At the time of the three-dimensional coordinate of the ear, the phase pressure at which the target mono data is transmitted to the three-dimensional coordinate of the left ear is determined according to the plurality of sound pressures.

In a possible example, the three-dimensional coordinates of each sound source in the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source are used to obtain multiple first three-dimensional coordinates and multiple mono channels. In terms of data, the determining unit 301 is specifically configured to determine multiple reference objects corresponding to the electronic device; determine behavior information of each reference object in the multiple reference objects to obtain multiple behavior information; and determine based on the multiple behavior information A plurality of sound sources corresponding to the electronic device in the plurality of reference objects; determining a coordinate position corresponding to each sound source in the plurality of sound sources to obtain a plurality of first three-dimensional coordinates; and according to the plurality of sound sources, The behavior information corresponding to each sound source determines the mono data of the sound source to obtain a plurality of mono data.

In a possible example, the multiple behavior information includes target behavior information corresponding to a target sound source; and determining the mono data of the sound source according to the behavior information corresponding to each sound source in the multiple sound sources In terms of obtaining the plurality of mono data, the determining unit 301 is specifically configured to determine a sound type and a playback parameter corresponding to the target behavior information; and generate a target sound source according to the sound type and the playback parameter. Mono data.

In a possible example, the determination unit 301 is further configured to determine a target reverberation parameter corresponding to the target object; and the synthesis unit 302 is further configured to perform the target two-channel data on the target reverberation parameter according to the target reverberation parameter. Process it to get reverberant two-channel data.

In a possible example, the determining unit 301 is specifically configured to obtain multiple historical reverb play records corresponding to the target object stored in advance; obtain each historical reverb play in the multiple historical reverb play records Recording corresponding listening parameters to obtain multiple listening parameters; and determining a target reverberation parameter corresponding to the target object according to the multiple listening parameters.

In a possible example, the determining unit 301 is specifically configured to determine an evaluation value corresponding to each historical reverb play record in the multiple historical reverb play records according to the multiple listening parameters to obtain multiple evaluation values; The historical reverberation record corresponding to the maximum value among the multiple evaluation values is used as the target historical reverberation record, and the reverberation parameter corresponding to the target historical reverberation record is used as the target reverberation parameter corresponding to the target object.

Please refer to FIG. 4, which is a schematic structural diagram of an electronic device according to an embodiment of the present application. As shown in FIG. 4, the electronic device 400 includes a processor 410, a memory 420, a communication interface 430, and one or more programs 440. The one or more programs 440 are stored in the memory 420, and are configured by The processor 410 executes, and the program 440 includes instructions for performing the following steps:

Determining the three-dimensional coordinates of each of the plurality of sound sources corresponding to the electronic device 400 and the mono data generated by each sound source to obtain a plurality of first three-dimensional coordinates and a plurality of mono data;

Determining a second three-dimensional coordinate of a target object corresponding to the electronic device 400;

And synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target binaural data.

In a possible example, in terms of synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data, the program 440 The instructions in are specifically used to perform the following operations:

Determining a left ear three-dimensional coordinate and a right ear three-dimensional coordinate corresponding to the second three-dimensional coordinate;

Determining transmission paths between the plurality of sound sources and the target object according to the plurality of first three-dimensional coordinates, the left ear three-dimensional coordinates, and the right ear three-dimensional coordinates to obtain a plurality of transmission paths;

Determining, according to the plurality of transmission paths, the time and phase pressure of each of the plurality of mono data transmitted to the left ear three-dimensional coordinate and the right ear three-dimensional coordinate to obtain multiple times and multiple Phase pressure

Determining a time difference corresponding to each of the plurality of mono data according to the plurality of times to obtain a plurality of time differences, and determining each of the plurality of mono data according to the plurality of phase pressures The phase pressure difference corresponding to the channel data is used to obtain multiple phase pressure differences;

Determining, according to the plurality of time differences and the plurality of phase pressure differences, a delay parameter corresponding to each of the plurality of mono data to obtain a plurality of delay parameters;

Processing each of the plurality of mono data according to the plurality of delay parameters to obtain corresponding left channel parameters and right channel parameters;

Synthesizing the plurality of mono data according to the left channel parameter and the right channel parameter corresponding to each of the plurality of mono data to obtain the target two-channel data.

In a possible example, the plurality of sound sources includes a target sound source, the plurality of first three-dimensional coordinates includes a target first three-dimensional coordinate corresponding to the target sound source, and the plurality of mono data includes all The target mono data corresponding to the target sound source; and determining the time and phase at which each mono data of the plurality of mono data is transmitted to the left ear three-dimensional coordinate according to the plurality of transmission paths The pressure obtains multiple time and multiple phase pressure aspects, and the instructions in the program 440 are specifically used to perform the following operations:

Obtain a cross section by using the first three-dimensional coordinates of the target and the three-dimensional coordinates of the left ear as axes;

Determining an occlusion object between the target first three-dimensional coordinate and the left ear three-dimensional coordinate;

Determining a plurality of reference transmission paths for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the cross section and the occluded object;

Time and phase pressures for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the multiple reference transmission paths.

In a possible example, in determining the time and phase pressure for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the multiple reference transmission paths, the instructions in the program 440 are specifically used To do the following:

Determining a sound intensity and a sound pressure corresponding to each transmission path in the plurality of reference transmission paths to obtain a plurality of sound intensity and a plurality of sound pressures;

Determining the time at which the target mono data is transmitted to the left ear three-dimensional coordinates according to the plurality of sound intensities, and determining the time at which the target mono data is transmitted to the left ear three-dimensional coordinates according to the plurality of sound pressures Phase pressure.

In a possible example, the three-dimensional coordinates of each sound source and the mono data generated by each sound source in the plurality of sound sources corresponding to the electronic device 400 are obtained to obtain a plurality of first three-dimensional coordinates and a plurality of mono sounds. In terms of data, the instructions in the program 440 are specifically used to perform the following operations:

Determining a plurality of reference objects corresponding to the electronic device 400;

Determining behavior information of each reference object in the multiple reference objects to obtain multiple behavior information;

Determining a plurality of sound sources corresponding to the electronic device among the plurality of reference objects according to the plurality of behavior information;

Determining coordinate positions corresponding to each of the plurality of sound sources to obtain a plurality of first three-dimensional coordinates;

The mono data of the sound source is determined according to the behavior information corresponding to each of the multiple sound sources to obtain multiple mono data.

In a possible example, the multiple behavior information includes target behavior information corresponding to a target sound source; and determining the mono data of the sound source according to the behavior information corresponding to each sound source in the multiple sound sources In terms of obtaining the plurality of mono data, the instructions in the program 440 are specifically used to perform the following operations:

Determining a sound type and a playback parameter corresponding to the target behavior information;

Mono data of the target sound source is generated according to the sound type and the playback parameter.

In a possible example, the instructions in the program 440 are further used to perform the following operations:

Determining a target reverberation parameter corresponding to the target object;

Processing the target two-channel data according to the target reverberation parameter to obtain reverberant two-channel data.

In a possible example, in terms of processing the target two-channel data according to the target reverberation parameter to obtain reverberant two-channel data, the instructions in the program 440 are specifically used to perform the following operations:

Acquiring multiple historical reverberation playback records corresponding to the target object stored in advance;

Acquiring listening parameters corresponding to each historical reverb play record in the multiple historical reverb play records to obtain multiple listening parameters;

A target reverberation parameter corresponding to the target object is determined according to the plurality of listening parameters.

In a possible example, in determining the target reverberation parameter corresponding to the target object according to the multiple listening parameters, the instructions in the program 440 are specifically used to perform the following operations:

Determining, according to the plurality of listening parameters, an evaluation value corresponding to each historical reverb play record in the multiple historical reverb play records to obtain multiple evaluation values;

The historical reverberation record corresponding to the maximum value among the multiple evaluation values is used as the target historical reverberation record, and the reverberation parameter corresponding to the target historical reverberation record is used as the target reverberation parameter corresponding to the target object.

Based on the same inventive concept, the sound processing method and the electronic device provided in the embodiments of the present application are similar to the method embodiments of the present application. Therefore, the implementation of the sound processing method and the electronic device can refer to the method implementation, the sound processing method and the electronic For the beneficial effects of the device, refer to the beneficial effects of the method. For brevity description, they are not repeated here.

An embodiment of the present application further provides a computer storage medium, wherein the computer storage medium stores a computer program for causing a computer to execute a part or all of the steps of any method as described in the method embodiment, and the computer includes an electronic device. device.

An embodiment of the present application further provides a computer program product. The computer program product includes a non-transitory computer-readable storage medium storing the computer program, and the computer program is operable to cause a computer to execute a part of any method as described in the method embodiments. Or all steps. The computer program product may be a software installation package, and the computer includes an electronic device.

It should be noted that, for the foregoing method embodiments, for the sake of simple description, they are all described as a series of action combinations. However, those skilled in the art should know that this application is not limited by the described action order. Because according to the present application, certain steps may be performed in another order or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the actions and modes involved are not necessarily required for this application.

In the above embodiments, the description of each embodiment has its own emphasis. For a part that is not described in detail in one embodiment, reference may be made to related descriptions in other embodiments.

In the several embodiments provided in this application, it should be understood that the disclosed device may be implemented in other ways. For example, the device embodiments described above are only schematic, such as the division of units, which is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be combined or integrated into Another system, or some features, can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be electrical or other forms.

The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, which may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objective of the solution of this embodiment.

In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each of the units may exist separately physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or in the form of a software program mode.

When the integrated unit is implemented in the form of a software program and sold or used as an independent product, it can be stored in a computer-readable memory. Based on such an understanding, the technical solution of the present application essentially or part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, which is stored in a memory, Several instructions are included to enable a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the methods in the embodiments of the present application. The foregoing memory includes: a U disk, a read-only memory (ROM), a random access memory (RAM), a mobile hard disk, a magnetic disk, or an optical disk, and other media that can store program codes.

A person of ordinary skill in the art may understand that all or part of the steps in the various methods of the foregoing embodiments may be completed by a program instructing related hardware. The program may be stored in a computer-readable memory, and the memory may include a flash disk. , ROM, RAM, disk or disc, etc.

The embodiments of the present application have been described in detail above. Specific examples have been used in this document to explain the principles and implementation of the present application. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present application. Persons of ordinary skill in the art may change the specific implementation manner and the scope of application according to the idea of the present application. In summary, the content of this specification should not be construed as a limitation on the present application.

Claims

A sound effect processing method, comprising:

Determining the three-dimensional coordinates of each of the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source to obtain a plurality of first three-dimensional coordinates and a plurality of mono data;

Determining a second three-dimensional coordinate of a target object corresponding to the electronic device;

Synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target binaural data.
The method according to claim 1, wherein the synthesizing the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data ,include:

Determining a left ear three-dimensional coordinate and a right ear three-dimensional coordinate corresponding to the second three-dimensional coordinate;

Determining transmission paths between the plurality of sound sources and the target object according to the plurality of first three-dimensional coordinates, the left ear three-dimensional coordinates, and the right ear three-dimensional coordinates to obtain a plurality of transmission paths;

Determining the time and phase pressure for transmitting each of the plurality of mono data to the three-dimensional coordinate of the left ear and the three-dimensional coordinate of the right ear according to the plurality of transmission paths to obtain a plurality of time and multiple Phase pressure

Determining a time difference corresponding to each of the plurality of mono data according to the plurality of times, obtaining a plurality of time differences, and determining each of the plurality of mono data according to the plurality of phase pressures The phase pressure difference corresponding to the mono data to obtain multiple phase pressure differences;

Determining, according to the plurality of time differences and the plurality of phase pressure differences, a delay parameter corresponding to each of the plurality of mono data to obtain a plurality of delay parameters;

Processing each of the plurality of mono data according to the plurality of delay parameters to obtain corresponding left channel parameters and right channel parameters;

Synthesizing the plurality of mono data according to the left channel parameter and the right channel parameter corresponding to each of the plurality of mono data to obtain the target two-channel data.
The method according to claim 2, wherein the plurality of sound sources include a target sound source, the plurality of first three-dimensional coordinates include a target first three-dimensional coordinate corresponding to the target sound source, and the plurality of The mono data includes target mono data corresponding to the target sound source;

The determining the time and phase pressure of transmitting each of the plurality of mono data to the three-dimensional coordinate of the left ear according to the plurality of transmission paths includes:

Obtain the cross section by taking the first three-dimensional coordinates of the target and the three-dimensional coordinates of the left ear as axes;

Determining an occlusion object between the target first three-dimensional coordinate and the left ear three-dimensional coordinate;

Determining a plurality of reference transmission paths for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the cross section and the occluded object;

Time and phase pressures for transmitting the target mono data to the three-dimensional coordinates of the left ear according to the multiple reference transmission paths.
The method according to claim 3, wherein determining the time and phase pressure for transmitting the target mono data to the three-dimensional coordinate of the left ear according to the multiple reference transmission paths comprises:

Determining a sound intensity and a sound pressure corresponding to each of the plurality of reference transmission paths to obtain a plurality of sound intensity and a plurality of sound pressures;

Determining the time at which the target mono data is transmitted to the left ear three-dimensional coordinates according to the plurality of sound intensities, and determining the time at which the target mono data is transmitted to the left ear three-dimensional coordinates according to the plurality of sound pressures Phase pressure.
The method according to any one of claims 1-4, wherein the determining the three-dimensional coordinates of each sound source among the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source obtains multiple First three-dimensional coordinates and multiple mono data, including:

Determining multiple reference objects corresponding to the electronic device;

Determining behavior information of each reference object in the multiple reference objects to obtain multiple behavior information;

Determining a plurality of sound sources corresponding to the electronic device among the plurality of reference objects according to the plurality of behavior information;

Determining coordinate positions corresponding to each of the plurality of sound sources to obtain a plurality of first three-dimensional coordinates;

Mono data of the sound source is determined according to behavior information corresponding to each of the multiple sound sources to obtain multiple mono data.
The method according to claim 5, wherein the plurality of behavior information includes target behavior information corresponding to a target sound source among the plurality of sound sources;

The determining the mono data of the sound source according to the behavior information corresponding to each of the multiple sound sources, and obtaining the multiple mono data includes:

Determining a sound type and a playback parameter corresponding to the target behavior information;

Mono data of the target sound source is generated according to the sound type and the playback parameter.
The method according to any one of claims 1-6, further comprising:

Determining a target reverberation parameter corresponding to the target object;

And processing the target two-channel data according to the target reverberation parameter to obtain reverberated two-channel data.
The method according to claim 7, wherein the determining a target reverberation parameter corresponding to the target object comprises:

Acquiring multiple historical reverberation playback records corresponding to the target object stored in advance;

Acquiring listening parameters corresponding to each historical reverberation playback record in the multiple historical reverberation playback records to obtain multiple listening parameters;

A target reverberation parameter corresponding to the target object is determined according to the plurality of listening parameters.
The method according to claim 8, wherein determining the target reverberation parameter corresponding to the target object according to the plurality of listening parameters comprises:

Determining an evaluation value corresponding to each historical reverb play record in the multiple historical reverb play records according to the multiple listening parameters, and obtaining multiple evaluation values;

The historical reverberation record corresponding to the maximum value among the multiple evaluation values is used as the target historical reverberation record, and the reverberation parameter corresponding to the target historical reverberation record is used as the target reverberation parameter corresponding to the target object.
A sound effect processing device, comprising:

A determining unit, configured to determine the three-dimensional coordinates of each of the plurality of sound sources corresponding to the electronic device and the mono data generated by each sound source, to obtain a plurality of first three-dimensional coordinates and a plurality of mono data; The second three-dimensional coordinate of the target object corresponding to the electronic device;

A synthesizing unit is configured to synthesize the plurality of mono data according to the plurality of first three-dimensional coordinates and the second three-dimensional coordinate to obtain target two-channel data.
The device according to claim 10, wherein the determining unit is further configured to determine a left ear three-dimensional coordinate and a right ear three-dimensional coordinate corresponding to the second three-dimensional coordinate; The three-dimensional coordinates of the left ear and the three-dimensional coordinates of the right ear determine transmission paths between the plurality of sound sources and the target object to obtain a plurality of transmission paths; and determine the plurality of mono sounds according to the plurality of transmission paths. Time and phase pressure of each mono data in the channel data transmitted to the three-dimensional coordinates of the left ear and the three-dimensional coordinates of the right ear to obtain multiple times and multiple phase pressures; determine the multiple based on the multiple times A time difference corresponding to each mono data in the mono data to obtain a plurality of time differences, and determine a phase pressure difference corresponding to each mono data in the plurality of mono data according to the plurality of phase pressures, Obtaining a plurality of phase pressure differences; determining a delay parameter corresponding to each of the plurality of mono data according to the plurality of time differences and the plurality of phase pressure differences, and obtaining a plurality of delay parameters The delay parameters of said plurality of said plurality of single data channels each single channel data is processed to obtain parameters corresponding to the left channel and a right channel parameter;

The synthesizing unit is specifically configured to synthesize the plurality of mono data according to a left channel parameter and a right channel parameter corresponding to each of the plurality of mono data to obtain a target dual Channel data.
The device according to claim 11, wherein the plurality of sound sources include a target sound source, the plurality of first three-dimensional coordinates include a target first three-dimensional coordinate corresponding to the target sound source, and the plurality of The mono data includes target mono data corresponding to the target sound source; the determining unit is specifically configured to obtain a cross section by using the target first three-dimensional coordinate and the left ear three-dimensional coordinate as an axis; determining the An occlusion object between a first three-dimensional coordinate of the target and the three-dimensional coordinate of the left ear; determining a plurality of reference transmission paths for transmitting the target mono data to the three-dimensional coordinate of the left ear according to the cross section and the occluded object Determining, according to the multiple reference transmission paths, the time and phase pressure at which the target mono data is transmitted to the three-dimensional coordinates of the left ear.
The device according to claim 12, wherein the determining unit is specifically configured to determine a sound intensity and a sound pressure corresponding to each transmission path in the multiple reference transmission paths, to obtain multiple sound intensities and multiple sounds Pressure; determining the time when the target mono data is transmitted to the left ear three-dimensional coordinates according to the plurality of sound intensities, and determining the target mono data transmission to the left ear three-dimensional coordinates according to the plurality of sound pressures Phase pressure of the coordinates.
The device according to any one of claims 10-13, wherein the determining unit is specifically configured to determine a plurality of reference objects corresponding to the electronic device; determine behavior information of each reference object in the plurality of reference objects To obtain a plurality of behavior information; determine a plurality of sound sources corresponding to the electronic device in the plurality of reference objects according to the plurality of behavior information; determine a coordinate position corresponding to each sound source in the plurality of sound sources, Obtaining multiple first three-dimensional coordinates; determining mono data of the sound source according to behavior information corresponding to each of the multiple sound sources, and obtaining multiple mono data.
The device according to claim 14, wherein the plurality of behavior information includes target behavior information corresponding to a target sound source; and the determining unit is specifically configured to determine a sound type and a playback parameter corresponding to the target behavior information; Mono data of the target sound source is generated according to the sound type and the playback parameter.
The device according to any one of claims 10 to 15, wherein the determining unit is further configured to determine a target reverberation parameter corresponding to the target object;

The synthesis unit is further configured to process the target two-channel data according to the target reverberation parameter to obtain reverberated two-channel data.
The device according to claim 16, wherein the determining unit is specifically configured to obtain a plurality of historical reverb play records corresponding to the target object stored in advance; and acquire each of the plurality of historical reverb play records. A listening parameter corresponding to a historical reverberation playback record is used to obtain multiple listening parameters; and a target reverberation parameter corresponding to the target object is determined according to the multiple listening parameters.
An electronic device, comprising a processor, a memory, a communication interface, and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the processor The program includes instructions for performing the steps in the method of any one of claims 1-9.
A computer-readable storage medium, which is used to store a computer program, wherein the computer program causes a computer to execute the method according to any one of claims 1-9.
A computer program product, characterized in that the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute the computer program according to any one of claims 1-9. The method described.