CN117376633A

CN117376633A - Audio signal processing method, device, intelligent television, medium and program product

Info

Publication number: CN117376633A
Application number: CN202210757671.7A
Authority: CN
Inventors: 范伟杰
Original assignee: Guangzhou Shikun Electronic Technology Co Ltd
Current assignee: Guangzhou Shikun Electronic Technology Co Ltd
Priority date: 2022-06-30
Filing date: 2022-06-30
Publication date: 2024-01-09

Abstract

The embodiment of the application discloses an audio signal processing method, an audio signal processing device, an intelligent television, a medium and a program product, and belongs to the technical field of intelligent televisions. The method is applied to the intelligent television, and comprises the following steps: receiving a mixing system starting instruction triggered by a user when the first audio signal is played; the first audio signal is not an audio signal provided by the karaoke application server; responding to a starting instruction of the audio mixing system, and displaying a first user interface of the audio mixing system; responding to a mixing start instruction triggered by a user through a first user interface, and collecting a second audio signal of the user; mixing the first audio signal and the second audio signal to obtain a mixed audio signal; playing the mixed audio signal. Therefore, the embodiment of the application can improve the flexibility of K songs through the intelligent television.

Description

Audio signal processing method, device, intelligent television, medium and program product

Technical Field

The present disclosure relates to the field of smart televisions, and in particular, to an audio signal processing method, an audio signal processing device, a smart television, a medium, and a program product.

Background

With the development of intelligent television technology, more entertainment projects can be realized based on intelligent televisions. For example, a user may make a K song through a smart television. At present, when a user performs K songs through the intelligent television, a third-party K song application program is firstly required to be installed through the intelligent television. And then, the user needs to open a third-party Karaoke application program through the intelligent television, and select a song to be singed from a song list provided by an application server corresponding to the third-party Karaoke application program to carry out Karaoke entertainment.

That is, when using the existing smart tv to perform the K song, the user may only select songs provided by the third party K song application installed in the smart tv. Therefore, the flexibility of the existing intelligent television is poor.

Disclosure of Invention

The embodiment of the application provides an audio signal processing method, an audio signal processing device, an intelligent television, a medium and a program product, so that the flexibility of the intelligent television is improved. The technical scheme is as follows:

in a first aspect, the present application provides an audio signal processing method, where the method is applied to a smart television, the method includes:

receiving a mixing system starting instruction triggered by a user when the first audio signal is played; the mixing system is not a third party application program;

responding to the starting instruction of the audio mixing system, and displaying a first user interface of the audio mixing system;

responding to a mixing start instruction triggered by the user through the first user interface, and collecting a second audio signal of the user;

mixing the first audio signal and the second audio signal to obtain a mixed audio signal;

and playing the mixed audio signal.

Optionally, the first audio signal is a television signal;

Alternatively, before the playing the first audio signal, the method further comprises:

acquiring the first audio signal from the data stored in the intelligent television; or, acquiring the first audio signal from the electronic equipment connected with the intelligent television; alternatively, the first audio signal is obtained from a target application server.

Optionally, the displaying the first user interface of the audio mixing system includes:

suspending and displaying the first user interface on a playing interface corresponding to the first audio signal; the first user interface is smaller than a playing interface corresponding to the first audio signal.

Optionally, the method further comprises:

and stopping displaying the first user interface when the duration of not receiving the user operation through the first user interface reaches the preset duration.

Optionally, after stopping displaying the first user interface, the method further includes:

and displaying a first control at a target position of a display interface of the intelligent television, wherein the first control is used for opening the first user interface.

Optionally, the receiving the user-triggered audio mixing system start instruction includes:

receiving a mixing system starting instruction triggered by the user from a target remote controller; or,

And receiving the voice mixing system starting instruction triggered by the user through voice.

Optionally, the receiving the user-triggered audio mixing system start instruction from the target remote controller includes:

receiving a mixing system starting instruction triggering key of the target remote controller by pressing the mixing system starting instruction triggering key of the target remote controller by the user, and triggering a mixing system starting instruction sent by the target remote controller; or,

and receiving a sound mixing system starting instruction sent by the user through voice control of the target remote controller.

Optionally, the mixing the first audio signal and the second audio signal to obtain a mixed audio signal includes:

acquiring a value of at least one mixing parameter through the first user interface;

and mixing the first audio signal and the second audio signal according to the value of the at least one mixing parameter to obtain a mixed audio signal.

Optionally, the first user interface includes: and receiving control of the mixing parameter value, wherein the obtaining, through the first user interface, the value of at least one mixing parameter comprises:

receiving values of at least one mixing parameter input by a user in response to the operation of a receiving control of each mixing parameter value by the user; or,

The first user interface includes: and acquiring the mixing parameters through the first user interface by default controls of the mixing parameters, wherein the default controls comprise:

and responding to the operation of the user on the default control of the mixing parameters, and acquiring the preset value of at least one mixing parameter.

Optionally, the first user interface further includes a preset two-dimensional code, so that a terminal device scanning the preset two-dimensional code displays at least one preset animation; the method further comprises the steps of:

responding to the target animation displayed by the terminal equipment by clicking the user to trigger an animation display instruction sent by the terminal equipment, and displaying the target animation; the target animation is one of the at least one preset animation.

Optionally, the method further comprises:

and responding to the starting instruction of the sound mixing system, searching a target sound collecting device and connecting with the target sound collecting device.

Optionally, the method further comprises:

and displaying first prompt information on the first user interface, wherein the first prompt information is used for indicating the connection state of the intelligent television and the target sound acquisition device.

In a second aspect, the present application provides an audio signal processing apparatus, where the apparatus is applied to a smart tv, the apparatus includes:

The receiving module is used for receiving a sound mixing system starting instruction triggered by a user when the first audio signal is played; the mixing system is not a third party application program;

the display module is used for responding to the starting instruction of the audio mixing system and displaying a first user interface of the audio mixing system;

the acquisition module is used for responding to a mixing start instruction triggered by the user through the first user interface and acquiring a second audio signal of the user;

the processing module is used for mixing the first audio signal and the second audio signal to obtain a mixed audio signal;

and the playing module is used for playing the mixed audio signal.

In a third aspect, the present application provides a smart television, including: at least one processor, memory;

the memory stores computer-executable instructions;

the at least one processor executing computer-executable instructions stored in the memory causes the intelligent television to perform the method of any one of the first aspects.

In a fourth aspect, the present application provides a computer-readable storage medium having stored thereon computer-executable instructions which, when executed by a processor, implement the method of any of the first aspects.

In a fifth aspect, the present application provides a computer program product comprising a computer program which, when executed by a processor, implements the method of any of the first aspects.

According to the audio signal processing method, the device, the intelligent television, the medium and the program product, when the first audio signal is played, the intelligent television can display the first user interface of the audio mixing system through the opening instruction of the audio mixing system triggered by the user without switching the playing interface of the first audio signal. According to the audio mixing system of the non-third party application program, the first user interface can be displayed without interface switching, and the flexibility of the intelligent television is improved. Then, a mixing start instruction is received through the first user interface, and a second audio signal of the user is collected. Then, the smart television can mix the first audio signal and the second audio signal to obtain a mixed audio signal and play the mixed audio signal. By the method, the intelligent television does not need to rely on external Soundbar sound box equipment and is not limited by a third party application program, and the flexibility of the intelligent television is further improved.

Drawings

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

Fig. 1 is a schematic view of an application scenario of a smart tv in the related art;

fig. 2a is a schematic structural diagram of an intelligent tv provided in the present application;

fig. 2b is a schematic view of an application scenario of the smart tv provided in the present application;

fig. 3 is a flow chart of an audio signal processing method provided in the present application;

FIG. 4 is a schematic diagram of a first user interface provided herein;

FIG. 5 is a schematic illustration of another first user interface provided herein;

fig. 6 is a schematic diagram of a display interface of an intelligent television provided by the application;

fig. 7 is a flowchart of a method for mixing a first audio signal and a second audio signal provided in the present application;

fig. 8 is a schematic view of an application scenario of another smart tv provided in the present application;

fig. 9 is a schematic structural diagram of an audio signal processing device provided in the present application.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the present application more apparent, the following detailed description of the embodiments of the present application will be given with reference to the accompanying drawings.

It should be understood that the described embodiments are merely some, but not all, of the embodiments of the present application. All other embodiments, based on the embodiments herein, which would be apparent to one of ordinary skill in the art without making any inventive effort, are intended to be within the scope of the present application.

When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The implementations described in the following exemplary examples are not representative of all implementations consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present application as detailed in the accompanying claims.

In the description of this application, it should be understood that the terms "first," "second," "third," and the like are used merely to distinguish between similar objects and are not necessarily used to describe a particular order or sequence, nor should they be construed to indicate or imply relative importance. The specific meaning of the terms in this application will be understood by those of ordinary skill in the art as the case may be. Furthermore, in the description of the present application, unless otherwise indicated, "a plurality" means two or more. "and/or", describes an association relationship of an association object, and indicates that there may be three relationships, for example, a and/or B, and may indicate: a exists alone, A and B exist together, and B exists alone. The character "/" generally indicates that the context-dependent object is an "or" relationship.

Taking K songs by the smart tv as an example, fig. 1 is a schematic view of an application scenario of the smart tv in the related art. As shown in fig. 1, in the related art, a smart TV (TV) needs to be externally connected to a Soundbar speaker (a strip speaker capable of integrating the speaker functions of multiple channels into one box), so as to have a hardware base for implementing the K song function. In addition, if the user wants to perform the K song through the smart television, the third party K song application program is also required to be downloaded and installed through the smart television.

When a user performs K songs through the intelligent television, the intelligent television needs to interact with an application server corresponding to the third-party K song application program so as to acquire songs selected by the user and play the songs. As shown in fig. 1, after the smart tv is connected with the Soundbar speaker, the primary audio of the tv, that is, the audio corresponding to the song obtained by the smart tv from the K song application server, may be output to the audio processor of the Soundbar speaker through the main Chip (System on Chip, SOC) of the smart tv. On the other hand, a microphone for picking up human voice may be connected to the microphone receiver of the Soundbar speaker (microphone 1 receiver and microphone 2 receiver as shown in fig. 1). The microphone receiver can receive human voice collected by the microphone and output the human voice to the audio processor of the Soundbar loudspeaker box for receiving.

Then, the Soundbar speaker may mix the audio provided by the K song application server and the microphone voice through the audio processor, and output the mixed audio to an audio power Amplifier (AMP) of the Soundbar speaker. Then, the above-mentioned mixed sound can be played through the speaker of the Soundbar sound box.

In the existing karaoke mode, the user can only select songs provided by the application server corresponding to the third-party karaoke application program installed in the smart television, and can only perform karaoke through the smart television depending on the Soundbar speaker device connected with the smart television. Therefore, the existing intelligent television has the problem of poor flexibility.

Considering the problem of poor flexibility of the existing intelligent television, the application provides a method for processing the audio signal of the intelligent television by using a non-third party application program, and the method does not need to rely on the external sound box equipment of the intelligent television. By the method, the intelligent television is not limited by the third party application program, and flexibility of the intelligent television is improved.

The execution subject of the audio signal processing method provided by the application can be an intelligent television. It should be understood that the structure of the smart television is not limited in this application. Fig. 2a is a schematic structural diagram of an intelligent television according to the present application. As shown in fig. 2a, the smart tv 10 may include: at least one processor 11 and a memory 12.

A memory 12 for storing a program. In particular, the program may include program code including computer-operating instructions.

The memory 12 may comprise high-speed RAM memory or may further comprise non-volatile memory (non-volatile memory), such as at least one disk memory.

The processor 11 is configured to execute computer-executable instructions stored in the memory 12 to implement the audio signal processing method described in the foregoing method embodiment. The processor 11 may be a central processing unit (Central Processing Unit, abbreviated as CPU), or an application specific integrated circuit (Application Specific Integrated Circuit, abbreviated as ASIC), or one or more integrated circuits configured to implement embodiments of the present application.

Optionally, the smart television 10 may further include a communication interface 13. In a specific implementation, if the communication interface 13, the memory 12 and the processor 11 are implemented independently, the communication interface 13, the memory 12 and the processor 11 may be connected to each other through a bus and perform communication with each other. The bus may be an industry standard architecture (Industry Standard Architecture, abbreviated ISA) bus, an external device interconnect (Peripheral Component, abbreviated PCI) bus, or an extended industry standard architecture (Extended Industry Standard Architecture, abbreviated EISA) bus, among others. Buses may be divided into address buses, data buses, control buses, etc., but do not represent only one bus or one type of bus.

Alternatively, in a specific implementation, if the communication interface 13, the memory 12 and the processor 11 are implemented integrally on a chip, the communication interface 13, the memory 12 and the processor 11 may complete communication through internal interfaces.

For example, taking connection of the smart tv and two sound collection devices as an example, fig. 2b is a schematic view of an application scenario of the smart tv provided in the present application. As shown in fig. 2b, the smart tv may mix, by the mixing processor, a first audio signal acquired through the SOC of the smart tv and a second audio signal of the user received through a microphone receiver (e.g., a microphone 1 receiver, a microphone 2 receiver shown in fig. 2 b), to obtain a mixed audio signal. Then, the smart television can perform power amplification on the mixed audio signal through the audio power amplifier AMP, and play the mixed audio signal after power amplification through a loudspeaker of the smart television, so that audio signal processing is realized.

In this example, compared with the problem that in the related art, the mixing processing link for mixing through the TV SOC is longer, resulting in a longer time delay of the mixed audio signal, the present application can mix the first audio signal and the second audio signal through the mixing processor, so as to reduce the playing time delay of the mixed audio signal, and improve the user experience.

It should be understood that the structure of the smart tv shown in fig. 2b is merely provided as an example of the present application, and the present application does not limit whether the smart tv further includes other components. Further, the function of the mixing processor as shown in fig. 2b may also be integrated in the TV SOC, which is not limited in this application.

The audio signal processing method provided in the present application is described in detail below with reference to specific embodiments. The following embodiments may be combined with each other, and some embodiments may not be repeated for the same or similar concepts or processes.

Fig. 3 is a flow chart of an audio signal processing method provided in the present application. As shown in fig. 3, the method comprises the steps of:

s101, when the first audio signal is played, receiving a user-triggered sound mixing system starting instruction.

Wherein the mixing system is not a third party application. By way of example, the third party application may include: a K song application, an audio/video playing application, etc.

Alternatively, the first audio signal may be an audio signal not provided by the K song application server. In some embodiments, the first audio signal may be a television signal. It should be understood that the television signal referred to herein may be a web television signal, or a cable television signal, etc.

In some embodiments, before playing the first audio signal, the smart tv may further obtain the first audio signal from data stored in the smart tv. That is, the first audio signal may be data stored in the smart tv itself.

Alternatively, the smart television may also obtain the first audio signal from an electronic device connected to the smart television. Optionally, the smart television may be connected to the electronic device, for example, by using a wired communication or a wireless communication method.

Taking the connection of the smart television with the electronic device through wired communication as an example, the smart television can be connected with the electronic device through a universal serial bus (Universal Serial Bus, USB) to acquire a first audio signal in the electronic device. In this example, the electronic device may be, for example, a usb disk, a cell phone, a tablet computer, or the like. Or, the smart television can also be connected with the electronic device through a high-definition multimedia interface (High Definition Multimedia Interface, HDMI) to acquire a first audio signal in the electronic device. In this example, the electronic device may be, for example, a computer or the like.

Still alternatively, the smart tv may further acquire the first audio signal from the target application server. The target application server may be, for example, an internet streaming (e.g., audio playback application or video playback application) application server, or the like. In some embodiments, the smart television may also be installed with a karaoke application. In this implementation, the first audio signal may be an audio signal provided by a K song application server.

In the above example, the smart television may acquire the first audio signal to perform the K song without passing through the K song application server. Compared with the existing audio signal processing method capable of only obtaining songs in the K song application server to carry out K songs, the method improves flexibility of carrying out K songs through the intelligent television.

Optionally, the first audio signal may be an audio format signal, or may be a corresponding audio signal in a video format signal. Taking the first audio signal as an example of the audio format signal, optionally, the smart television may further display a preset picture when playing the first audio signal. It should be understood that the preset frames corresponding to the different first audio signals may be the same or different, and the preset frames may be still images or animations. In some embodiments, the smart tv may further determine, for example, a Music Video (MV) corresponding to the first audio signal according to the first audio signal through a Music listening and song recognition function, and display the MV corresponding to the first audio signal when the first audio signal is played.

Taking the first audio signal as a corresponding audio signal in the signals in the video format as an example, optionally, after the smart television acquires the signals in the video format, the smart television may parse the audio signal from the signals in the video format to be used as the first audio signal.

Optionally, taking the example that the smart tv matches with the target remote controller, and the target remote controller includes a mixing system start instruction trigger key, the smart tv may, for example, receive a mixing system start instruction sent by "the user clicks the mixing system start instruction trigger key of the target remote controller to trigger the target remote controller".

S102, responding to a starting instruction of the audio mixing system, and displaying a first user interface of the audio mixing system.

Compared with the existing smart television, because the audio mixing system is not a third party application program, when the smart television provided by the application plays the first audio signal, the playing interface of the first audio signal is not required to be switched to the interface of the third party application program. That is, the smart tv may display the first user interface of the mixing system in response to the mixing system start command while displaying the playing interface of the first audio signal.

Taking a user performing K songs through the smart television as an example, fig. 4 is a schematic diagram of a first user interface provided in the present application. As shown in fig. 4, the first user interface may include, for example: a K song start control, a mixing system closing control, etc.

S103, responding to a mixing start instruction triggered by a user through a first user interface, and collecting a second audio signal of the user.

Still taking the first user interface shown in fig. 4 as an example, optionally, a user may, for example, operate the target remote controller to send an instruction for triggering the K song start control in the first user interface to the smart television, so as to implement triggering of the mixing start instruction through the first user interface. Or taking the smart television as an example of the smart television with the touch function, a user can trigger a mixing start instruction through the first user interface by directly clicking a K song start control in the first user interface.

Optionally, the smart television may collect the second audio signal of the user through a sound collection device connected to the smart television. The connection mode of the intelligent television and the sound collecting device is not limited. For example, the sound collection device may be connected to the smart tv by a wireless connection or a wired connection. Or, the intelligent television can also be an intelligent television with a sound collection function. Under the implementation mode, the intelligent television can acquire the second audio signal of the user through the sound acquisition device of the intelligent television.

Alternatively, the sound pickup apparatus described in the above examples may be a microphone.

S104, mixing the first audio signal and the second audio signal to obtain a mixed audio signal.

Optionally, the smart television may directly superimpose the first audio signal and the second audio signal, to obtain a mixed audio signal.

Or, the smart television may further mix the first audio signal and the second audio signal according to the mixing parameter. The mixing parameters may include, for example: the volume of the first audio signal, the volume of the second audio signal, the pitch of the second audio signal, the sound effect of the mixed audio signal, etc. Taking the first user interface shown in fig. 4 as an example, at least one mixing parameter value receiving control may be further included in the first user interface. The smart television receives the values of the mixing parameters, for example, through the first user interface.

For example, taking the example that the first audio signal includes the voices of at least two roles, the smart television may further filter the voices of one of the roles in the first audio signal, and retain the voices of the other roles. Then, the smart television can mix the first audio signal with the voice of one character filtered out with the second audio signal to obtain a mixed audio signal. By the method, a user can simulate the speech of one character in the first audio signal (audio in a television program) and play with other characters through the intelligent television.

S105, playing the mixed audio signal.

After the mixed audio signal is acquired, the smart television may play the mixed audio signal through a speaker, for example. Optionally, the speaker may be a speaker of the smart tv itself, or may be a speaker connected to the smart tv by a wire or wirelessly.

In this embodiment, when the smart television plays the first audio signal, the first user interface of the audio mixing system can be displayed by the user-triggered audio mixing system start instruction without switching away the playing interface of the first audio signal. According to the audio mixing system of the non-third party application program, the first user interface can be displayed without interface switching, and the flexibility of the intelligent television is improved. Then, a mixing start instruction is received through the first user interface, and a second audio signal of the user is collected. Then, the smart television can mix the first audio signal and the second audio signal to obtain a mixed audio signal and play the mixed audio signal. By the method, the intelligent television does not need to rely on external Soundbar sound box equipment and is not limited by a third party application program, and the flexibility of the intelligent television is further improved.

The following describes in detail how the smart tv receives a user-triggered audio mixing system start instruction.

As a first possible implementation manner, the smart tv may receive a user-triggered mixing system start instruction from the target remote controller. That is, the user may trigger the mixing system start command through the target remote controller, and the target remote controller may send the mixing system start command to the smart tv.

In this implementation manner, optionally, taking the case that the target remote controller includes a mixing system start instruction trigger key as an example, the smart television may, for example, receive a mixing system start instruction trigger key of the target remote controller pressed by a user, and trigger a mixing system start instruction sent by the target remote controller.

Or the intelligent television can also receive a sound mixing system starting instruction sent by a user through the voice control target remote controller. In this example, the target remote controller may or may not include a mixing system start command trigger key, which is not limited in this application.

As a second possible implementation manner, the smart tv may receive a mixing system start instruction triggered by a user through voice. In the implementation manner, optionally, the smart television recognizes the user voice through any one of the existing voice recognition algorithms, and determines that a starting instruction of the mixing system is received when determining that the semantics of the user voice is that the mixing system is started. Or the intelligent television can also send the collected user voice to a background server corresponding to the intelligent television, and the semantics corresponding to the user voice are determined through the background server and fed back to the intelligent television.

In this embodiment, the audio mixing system start command may come from the target remote controller, or the user may directly control the smart television to receive the audio mixing system start command, so as to improve flexibility of the smart television in entering the audio mixing system, and further improve flexibility of the smart television.

The following describes in detail how the smart tv displays the first user interface of the audio mixing system:

as a possible implementation manner, the smart television may hover and display the first user interface on a playing interface corresponding to the first audio signal. The first user interface may be smaller than a playing interface corresponding to the first audio signal. Taking K songs through a smart television as an example, fig. 5 is a schematic diagram of another first user interface provided in the present application. As shown in fig. 5, the first user interface may be displayed on a playing interface corresponding to the first audio signal.

By the method, the first user interface with the smaller picture is displayed on the playing interface corresponding to the first audio signal, so that a user can view the first user interface and the playing interface corresponding to the first audio signal, and the display flexibility of the intelligent television during audio processing is improved.

As a possible implementation manner, the smart television may further stop displaying the first user interface when the duration of not receiving the user operation through the first user interface reaches the preset duration. The preset duration may be, for example, stored in the smart television in advance for the user. If the duration that the intelligent television does not receive the user operation through the first user interface reaches the preset duration, the user is not required to operate the first user interface, so that the intelligent television can stop displaying the first user interface, the display flexibility of the intelligent television is further improved, and the user experience is improved.

Further, optionally, after stopping displaying the first user interface, the smart television may further display a first control for opening the first user interface at a target position of the display interface of the smart television. By the method, the intelligent television can respond to the operation of the user on the first control, and the first user interface is displayed again. For example, the target position may be a boundary position of the display interface, so as to avoid shielding the playing interface corresponding to the first audio signal. Furthermore, the first control can also be in a semitransparent form, so that the blocking of a playing interface corresponding to the first audio signal is further avoided.

Fig. 6 is a schematic diagram of a display interface of an intelligent television provided by the application. As shown in fig. 6, the smart tv may, for example, display that a first control in the shape of a small sphere is attached to and at the boundary position of the display interface. As shown in fig. 6, in the first control, for example, a prompt content of "K song" may be displayed to prompt the user that the smart tv has currently turned on the K song mode.

The following describes in detail how the smart tv mixes the first audio signal and the second audio signal to obtain a mixed audio signal.

Fig. 7 is a flowchart of a method for mixing a first audio signal and a second audio signal provided in the present application. As shown in fig. 7, as a possible implementation manner, the step S104 may include the following steps:

s201, acquiring a value of at least one mixing parameter through a first user interface.

Optionally, the mixing parameters may include at least one of: the identification for indicating whether to open the original singing, the volume of the second audio signal, the tone of the second audio signal, the sound effect of the mixed audio signal, the volume of the first audio signal, the volume of the mixed audio signal, etc.

In some embodiments, the first user interface may include: and receiving control of the mixing parameter value. In the implementation manner, the smart television can receive the value of at least one mixing parameter input by the user in response to the operation of the receiving control of each mixing parameter value by the user. For example, taking fig. 5 as an example, if the user-triggered operation of the original singing switch control is to close the original singing, the smart television may determine an identifier for indicating whether to open the original singing is used to indicate that the original singing is not to be opened. The intelligent television can respond to the operation of a microphone volume adjustment control in the first user interface by a user to determine the volume value of the second audio signal; determining a tone of the second audio signal in response to a user operation of a tone adjustment control in the first user interface; and responding to the operation of the user on different sound effect controls in the first user interface, and determining the sound effect of the mixed audio signal. By way of example, any of the sound effects shown in fig. 5 may be, for example: electrical sound effects, three-dimensional (3 d) sound effects, subwoofer sound effects, etc.

Under the implementation mode, the intelligent television can determine the value of the mixing parameters according to the user requirements, so that the accuracy of mixing is improved, and further the user experience is improved.

In some embodiments, the first user interface may include: mixing parameters default controls. In this implementation manner, optionally, the smart television may obtain the preset value of at least one mixing parameter in response to the user's operation on the default control of the mixing parameter. Optionally, the first user interface may include at least one mixing parameter default control. In the implementation manner, the intelligent television can respond to the operation of the user on the default control of the mixing parameters, and the value of each mixing parameter is determined to be the preset value corresponding to the default control of the mixing parameter selected by the user.

Under the implementation mode, a user operates a default control of the mixing parameters, the intelligent television can determine the values of the mixing parameters as preset values, and the efficiency of determining the values of the mixing parameters through the first user interface by the intelligent television is improved.

In some embodiments, the first user interface may further include a receive control for each mixing parameter value, and a mixing parameter default control. The user can select the value of each mixing parameter through the first user interface, and can trigger the default control of the mixing parameter.

S202, according to the value of the at least one mixing parameter, mixing the first audio signal and the second audio signal to obtain a mixed audio signal.

For example, taking the value of the at least one mixing parameter to indicate that the original singing is not opened and the tone of the second audio signal is the target tone as an example, the smart television may remove the voice in the first audio signal by a preset voice removing mode to obtain the first audio signal from which the voice is removed, and adjust the voice in the second audio signal to the target tone by a preset tone adjusting mode to obtain the second audio signal with the tone adjusted. Then, the smart television can mix the first audio signal after removing the human voice and the second audio signal after adjusting the tone to obtain a mixed audio signal.

Optionally, the above preset voice removing manner may be any existing audio processing algorithm capable of removing voice in audio, which is not described herein. The preset tone adjustment manner may be any existing audio processing algorithm capable of adjusting the tone of the voice in the audio, which is not described herein.

It should be understood that, the sequence of the step of performing the step of removing the voice in the first audio signal to obtain the first audio signal after removing the voice and the step of performing the step of adjusting the voice in the second audio signal to the target tone to obtain the second audio signal after tone adjustment is not limited.

Under this realization mode, the intelligent television can realize the accompaniment (not open former singing) function of K singing according to the user's demand to and, change the function of user's voice tone, further improved the flexibility that carries out K singing through the intelligent television, improved user experience.

In some embodiments, the execution body of the step S202 may be a background server corresponding to the smart tv. In this implementation manner, the smart tv may send the obtained first audio signal, the obtained second audio signal, and the value of the mixing parameter to the background server. The background server may perform the above step S202 based on the first audio signal, the second audio signal, and the value of the mixing parameter from the smart tv, and transmit the mixing audio signal to the smart tv. Accordingly, the smart television can receive and play the mixed audio signal.

The above mixing parameters also include: the volume of the second audio signal and the sound effect of the mixed audio signal are taken as examples, and the smart television can adjust the volume of the second audio signal according to the volume of the second audio signal selected by the user. The intelligent television can determine a target sound mixing mode according to the target sound effect selected by the user, and obtain a mixed sound audio signal meeting the target sound effect.

In this embodiment, the smart tv may obtain, through the first user interface, values of each mixing parameter, so as to mix the first audio signal and the second audio signal according to the values of each mixing parameter, and obtain a mixed audio signal. By the method, the intelligent television can acquire and play the audio signals according to the user demands, and the flexibility of K songs through the intelligent television is further improved.

In some embodiments, the first user interface may further comprise: the two-dimensional code is preset, so that the terminal equipment for scanning the preset two-dimensional code can display at least one preset animation. The terminal device may be any terminal device capable of scanning and displaying information carried by the two-dimensional code, such as a mobile phone or a tablet computer. In the implementation manner, the smart television can also respond to an animation display instruction which is sent by the terminal equipment and triggered by a user by clicking the target animation displayed by the terminal equipment, so as to display the target animation. Wherein the target animation is one of the at least one preset animation.

Still taking the first user interface shown in fig. 5 as an example, the smart tv may receive a user operation through a K song mode switch control in the first user interface to determine whether to exit the K song mode. The smart television can enable a user to scan the two-dimensional code through terminal equipment such as a mobile phone by displaying the preset two-dimensional code (the interactive two-dimensional code shown in fig. 5). The terminal device can display the name of at least one preset animation or the dynamic effect of at least one preset animation according to the links borne by the two-dimensional code. The user can trigger the intelligent television to display the target animation selected by the user by clicking at least one preset displayed by the terminal equipment, so that the user experience is further improved.

For example, taking the terminal device as a mobile phone or a tablet pc as an example, fig. 8 is a schematic application scenario diagram of another smart tv provided in the present application. The user can scan the interactive two-dimensional code displayed by the smart tv through the mobile phone, so that the mobile phone can display a plurality of preset animations (animation 1 to animation 4 as shown in fig. 8). In the process of singing by the user, assuming that the user clicks the animation 4, the smart television can respond to an animation display instruction that the user triggers the terminal device to send by clicking the animation 4 displayed by the mobile phone, and display the animation 4 on a playing interface corresponding to the first audio signal.

As a possible implementation manner, after receiving the audio mixing system start command, the smart television may also search for the target sound collection device in response to the audio mixing system start command, and connect with the target sound collection device. It should be understood that the number of sound collection devices to which the smart tv can be connected is not limited in the present application. The sound collection device may be, for example, a microphone. The sound collection device can be connected with the intelligent television in a wired mode or in a wireless mode.

Taking wireless connection of the sound collection device and the smart television as an example, the target sound collection device may be a sound collection device paired with the smart television in advance. Optionally, the specific implementation manner of pairing the smart tv with the target sound collecting device may refer to any existing wireless connection matching method, which is not limited in this application.

In this implementation manner, optionally, the smart television may further display a first prompt message "used for indicating a connection state between the smart television and the target sound collection device" on the first user interface, so that the user knows the connection state between the smart television and the target sound collection device, and user experience is improved. Taking the above-mentioned target sound collection device as a microphone as an example, the first user interface as shown in fig. 5 is still taken as an example, where "microphone connection: a B "indicates that microphone a and microphone B have been connected to the smart tv. If no microphone is currently connected with the smart television, optionally, the smart television may display a first prompt message "no microphone connection" through the first user interface, for example.

In some embodiments, after the connection with the target sound collecting device is successful, the smart television may further directly display a prompt message that the connection with the sound collecting device is successful on a playing interface corresponding to the first audio signal. If the intelligent television does not search the target sound collection device or fails to connect with the target sound collection device, optionally, the intelligent television may not output any prompt information.

In this embodiment, the smart television may actively search for the target sound collection device in response to the mixing system start instruction and connect with the target sound collection device, thereby improving the efficiency of audio signal processing by the smart television. Taking K song through the intelligent television as an example, through the method, the intelligent television can respond to the starting instruction of the audio mixing system, actively search the microphone and connect with the microphone, and the efficiency of K song through the intelligent television is improved.

Taking K songs through a smart television as an example, as a possible implementation manner, the application further provides another audio signal processing method, and the method may include the following steps:

step 1, after the smart television is started, the operation of playing the first audio signal by the user can be responded. The first audio signal may be music, music play, concert, dance performance, etc. acquired by the smart tv through USB, HDMI, television signal, internet streaming media, etc.

And 2, the user can click a button marked with a K song word on the remote controller, or send a sound mixing system starting instruction to the intelligent television through the voice control remote controller, or the user can control the intelligent television to start the real-time sound mixing system through the voice.

And 3, after the intelligent television is started up, the real-time audio mixing system can suspend on a playing interface corresponding to the current first audio signal in a small window mode.

The first user interface of the real-time audio mixing system can be preset with a K song mode on/off, a microphone connection state prompt, accompanying sound/original singing switching selection, tone change and sound adjustment, K song sound effect adjustment, interactive two-dimensional codes and the like. The mobile phone can scan the two-dimension code to be connected to the real-time audio mixing system. The user can conduct interaction of the K atmosphere such as praise, cheering and the like through the mobile phone.

And 4, when the intelligent television turns on the real-time audio mixing system and selects to turn on a 'Karaoke mode', the intelligent television can actively search the bound microphone equipment. And prompting that the intelligent television is connected on the first user interface after the intelligent television finds out and completes connection.

And 5, the intelligent television mixes the human voice collected by the microphone with the sound of the first audio signal of the television system.

When the sound function is turned on, the intelligent television can call a voice removing algorithm to remove the voice of the first audio signal, so that real-time available sound melodies are provided for users. When the tone change is regulated, the intelligent television can call a voice change algorithm to regulate the tone of the voice collected by the microphone.

And 6, the smart television can control the floating window of the first user interface to automatically exit from being hidden after a certain time, or change into a semitransparent ball to be attached to the left upper side or the right upper side of the screen of the smart television.

And 7, when the Karaoke mode needs to be set or exited again, the user can call out the first user interface through a Karaoke button of the remote controller, or directly adopts voice to control the intelligent television to close the Karaoke mode.

It should be noted that, for the sake of brevity, this specification is not exhaustive of all of the alternative embodiments, and after reading this specification, one skilled in the art will appreciate that any combination of features may constitute an alternative embodiment as long as the features do not contradict each other.

Fig. 9 is a schematic structural diagram of an audio signal processing device provided in the present application. The device is applied to the intelligent television. As shown in fig. 9, the apparatus includes: the device comprises a receiving module 31, a display module 32, an acquisition module 33, a processing module 34 and a playing module 35. Wherein,

The receiving module 31 is configured to receive a mixing system start instruction triggered by a user when the first audio signal is played. Wherein the mixing system is not a third party application.

And the display module 32 is used for responding to the starting instruction of the audio mixing system and displaying a first user interface of the audio mixing system.

And the acquisition module 33 is used for responding to the mixing start instruction triggered by the user through the first user interface and acquiring a second audio signal of the user.

And the processing module 34 is configured to mix the first audio signal and the second audio signal to obtain a mixed audio signal.

A playing module 35, configured to play the mixed audio signal.

Optionally, the first audio signal is a television signal. Optionally, the processing module 34 is further configured to obtain the first audio signal from the data stored in the smart tv before the playing of the first audio signal. Or, the receiving module 31 is further configured to receive the first audio signal from an electronic device connected to the smart tv. Alternatively, the processing module 34 is further configured to obtain the first audio signal from the target application server.

Optionally, the display module 32 is specifically configured to hover and display the first user interface on a playing interface corresponding to the first audio signal. The first user interface is smaller than a playing interface corresponding to the first audio signal.

Optionally, the display module 32 is further configured to stop displaying the first user interface when a duration of not accepting the user operation through the first user interface reaches a preset duration.

Optionally, the display module 32 is further configured to display a first control at a target position of the display interface of the smart tv after stopping displaying the first user interface. The first control is used for opening the first user interface.

Optionally, the receiving module 31 is specifically configured to receive the user-triggered audio mixing system start instruction from the target remote controller; or receiving the voice mixing system starting instruction triggered by the user through voice.

Optionally, the receiving module 31 is specifically configured to receive a mixing system start instruction trigger key of the target remote controller by pressing a mixing system start instruction trigger key of the target remote controller, and trigger a mixing system start instruction sent by the target remote controller; or receiving a sound mixing system starting instruction sent by the user through voice control of the target remote controller.

Optionally, the processing module 34 is specifically configured to obtain, through the first user interface, a value of at least one mixing parameter; and mixing the first audio signal and the second audio signal according to the value of the at least one mixing parameter to obtain a mixed audio signal.

Optionally, the first user interface includes: for example, the processing module 34 is specifically configured to receive, in response to a user operation on the receiving control of each mixing parameter value, a value of at least one mixing parameter input by the user. Alternatively, the first user interface includes: for example, the mixing parameter default control is specifically configured to obtain a preset value of at least one mixing parameter in response to a user operation of the mixing parameter default control by using the processing module 34.

Optionally, the first user interface further includes: presetting a two-dimensional code so that terminal equipment for scanning the preset two-dimensional code displays at least one preset animation. And the display module 32 is further configured to respond to an animation display instruction sent by the terminal device triggered by the user by clicking the target animation displayed by the terminal device, and display the target animation. Wherein the target animation is one of the at least one preset animation.

Optionally, the processing module 34 is further configured to search for a target sound collection device in response to the mixing system start command, and connect with the target sound collection device.

Optionally, the display module 32 is further configured to display a first prompt message on the first user interface. The first prompt message is used for indicating the connection state of the intelligent television and the target sound collection device.

The audio signal processing device provided in the present application is configured to execute the foregoing embodiment of the audio signal processing method, and its implementation principle is similar to that of the technical effect, and will not be described again.

The present application also provides a computer-readable storage medium, which may include: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk or an optical disk, etc., in which program codes may be stored, and in particular, the computer-readable storage medium stores program instructions for the methods in the above embodiments.

The present application also provides a program product comprising execution instructions stored in a readable storage medium. The at least one processor of the electronic device may read the execution instructions from the readable storage medium, and execution of the execution instructions by the at least one processor causes the electronic device to implement the audio signal processing method provided by the various embodiments described above.

It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises an element.

The foregoing is merely exemplary of the present application and is not intended to limit the present application. Various modifications and changes may be made to the present application by those skilled in the art. Any modifications, equivalent substitutions, improvements, etc. which are within the spirit and principles of the present application are intended to be included within the scope of the claims of the present application.

Claims

1. An audio signal processing method, wherein the method is applied to a smart television, and the method comprises the following steps:

and playing the mixed audio signal.

2. The method of claim 1, wherein the first audio signal is a television signal;

3. The method according to claim 1 or 2, wherein the displaying the first user interface of the mixing system comprises:

4. A method according to claim 3, characterized in that the method further comprises:

5. The method of claim 4, wherein after ceasing to display the first user interface, the method further comprises:

6. The method according to claim 1 or 2, wherein the receiving a user-triggered mixing system start instruction includes:

7. The method of claim 6, wherein the receiving the user-triggered mixing system start instruction from the target remote control comprises:

8. The method according to claim 1 or 2, wherein mixing the first audio signal and the second audio signal to obtain a mixed audio signal comprises:

9. The method of claim 8, wherein the first user interface comprises: and receiving control of the mixing parameter value, wherein the obtaining, through the first user interface, the value of at least one mixing parameter comprises:

10. The method of claim 8, wherein the first user interface further comprises a preset two-dimensional code to cause a terminal device scanning the preset two-dimensional code to display at least one preset animation; the method further comprises the steps of:

11. The method according to claim 1 or 2, characterized in that the method further comprises:

12. The method of claim 11, wherein the method further comprises:

13. An audio signal processing apparatus, wherein the apparatus is applied to a smart television, the apparatus comprising:

and the playing module is used for playing the mixed audio signal.

14. An intelligent television, characterized by comprising: at least one processor, memory;

the memory stores computer-executable instructions;

the at least one processor executing computer-executable instructions stored in the memory causes the intelligent television to perform the method of any one of claims 1-12.

15. A computer readable storage medium having stored thereon computer executable instructions which, when executed by a processor, implement the method of any of claims 1-12.

16. A computer program product comprising a computer program, characterized in that the computer program, when executed by a processor, implements the method of any of claims 1-12.