WO2022237463A1 - Livestreaming background sound processing method and apparatus, device, medium, and program product - Google Patents

Livestreaming background sound processing method and apparatus, device, medium, and program product Download PDF

Info

Publication number
WO2022237463A1
WO2022237463A1 PCT/CN2022/087482 CN2022087482W WO2022237463A1 WO 2022237463 A1 WO2022237463 A1 WO 2022237463A1 CN 2022087482 W CN2022087482 W CN 2022087482W WO 2022237463 A1 WO2022237463 A1 WO 2022237463A1
Authority
WO
WIPO (PCT)
Prior art keywords
background sound
audio
live broadcast
code stream
live
Prior art date
Application number
PCT/CN2022/087482
Other languages
French (fr)
Chinese (zh)
Inventor
陈映宜
Original Assignee
北京字节跳动网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字节跳动网络技术有限公司 filed Critical 北京字节跳动网络技术有限公司
Publication of WO2022237463A1 publication Critical patent/WO2022237463A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data

Definitions

  • Embodiments of the present disclosure relate to the technical field of broadcast television and Internet live broadcast, and in particular to a method, device, equipment, medium and program product for processing live background sound.
  • Live broadcast technology enables people to break through the limitations of space and obtain various information on the live broadcast site in real time.
  • webcasting has also become a platform for people to show themselves, and has gained great social popularity.
  • Mobile live broadcasting lowers the technical threshold of live broadcasting, allowing ordinary people without professional skills to simply broadcast live broadcasting.
  • a large number of peripheral auxiliary products have been derived around mobile live broadcast devices such as mobile phones.
  • a high-quality mobile live broadcast requires hardware equipment including: desktop computers, sound cards, several intermediate converters/connectors, mobile phones, earphones, microphones, etc. Real-time addition of a live broadcast effect.
  • the existing technology has the technical problem of only relying on complex professional equipment and a team of professional technicians to add background sound when facing high-quality live broadcast requirements. This undoubtedly raised the technical threshold for live broadcasting and increased the cost of live broadcasting for anchors.
  • Embodiments of the present disclosure provide a live broadcast background sound processing method, device, equipment, media, and program products to solve the existing technology that can only rely on complex professional equipment and professional technical personnel teams when faced with high-quality live broadcast requirements technical issues to add background sound.
  • the embodiment of the present disclosure provides a live broadcast background sound processing method, which is applied to a user end, and the user end includes: a main controller and an audio processor, and the main controller is installed with a live broadcast application and a music playback application,
  • the method includes:
  • the background sound operation controls are displayed on the display screen
  • the audio processor concatenates and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application, and the second audio code stream includes The device receives the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
  • an embodiment of the present disclosure provides a live broadcast background sound processing device, including: a main control module and an audio processing module, where a live broadcast application and a music playback application are installed in the main control module; wherein,
  • the main control module is used to display background sound operation controls on the display screen when preset display conditions are met;
  • the main control module is further configured to send the corresponding first audio code stream in the music playing application to the audio processing module in response to the user's operation instruction on the background sound operation control;
  • the audio processing module is configured to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live application, and the second audio code stream includes Receive the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
  • an electronic device including:
  • processors at least one processor and memory
  • the memory stores a computer program
  • the at least one processor executes the computer program stored in the memory, so that the at least one processor executes the live broadcast background sound processing method described in the above first aspect and various possible designs of the first aspect.
  • an embodiment of the present disclosure provides a live streaming all-in-one machine, including any possible electronic device provided in the third aspect.
  • the embodiments of the present disclosure provide a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the processor executes the computer program, the above first aspect and each of the first aspect are realized.
  • a possible design of the described live background sound processing method is provided.
  • an embodiment of the present disclosure provides a computer program product, including a computer program, which, when executed by a processor, implements the live broadcast background sound processing method described in the above first aspect and various possible designs of the first aspect .
  • the embodiments of the present disclosure provide a computer program.
  • the computer program When the computer program is executed by a processor, it implements the live broadcast background sound processing method described in the first aspect and various possible designs of the first aspect.
  • the live broadcast background sound processing method, device, equipment, medium and program product provided by the embodiments of the present disclosure, the method first displays the background sound operation control on the display screen when the preset display condition is met, and then responds to the user's operation on the background sound
  • the operation command of the control sends the corresponding first audio code stream in the music player application to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input of the live broadcast application Audio stream.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • the embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can conveniently and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
  • FIG. 1 is a schematic structural diagram of a live broadcast equipment kit used by an existing indoor anchor provided by an embodiment of the present disclosure
  • FIG. 2 is a first schematic flow diagram of a method for processing live background sound provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a background audio code stream processed by a client provided by an embodiment of the present disclosure
  • FIG. 4 is a second schematic flow diagram of a method for processing live background sound provided by an embodiment of the present disclosure
  • 5a-5c are schematic diagrams of a user terminal display screen split-screen display background sound operation area provided by an embodiment of the present disclosure
  • FIG. 6 is a third schematic flowchart of a method for processing live broadcast background sound provided by an embodiment of the present disclosure
  • FIGS. 7a-7e are schematic diagrams of displaying background sound operation controls in the form of a floating window according to an embodiment of the present disclosure
  • FIG. 8 is a structural block diagram of a live broadcast background sound processing device provided by an embodiment of the present disclosure.
  • FIG. 9 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
  • live broadcast technology is an efficient means for people to receive live broadcast information in different geographical spaces.
  • the live webcast that has emerged in recent years has added a lot of fun to people's entertainment life and has been sought after by people.
  • a large number of anchors have joined the ranks of the webcast.
  • Live broadcasting through mobile devices is favored by many anchors for its simplicity.
  • the anchor needs to add live background music when performing live broadcast.
  • FIG. 1 is a schematic structural diagram of a live broadcast equipment kit used by an existing indoor anchor provided by an embodiment of the present disclosure.
  • the live broadcast equipment kit includes: a mobile terminal 101 , an intermediate adapter 102 , an intermediate converter 103 , a sound card 104 , and a desktop computer 105 .
  • the live application is installed in the mobile terminal 101 .
  • the intermediate adapter 102 is used to solve the problem that the charging interface of the mobile terminal 101 cannot be charged because the charging interface is occupied by the data line when the mobile terminal 101 is live broadcasting for a long time.
  • the intermediate converter 103 is used to transfer the special effect data in the desktop computer 105 to the mobile terminal 101, and after connecting the earphones, it can provide the customer with an ear return function, and it can also solve the problem that the sound card 104 can only output a single channel and cannot receive live broadcast in reverse A problem with the audio data output by the app.
  • the sound card 104 is connected with the microphone to convert the audio analog signal into a digital signal, and input it into the desktop computer 105 for various sound effect processing.
  • the inventive concept of the present application aims at:
  • the anchor can quickly and easily add the required audio to the live audio stream through the screen during the live broadcast, that is, set the live background music.
  • the live broadcast does not require multiple devices, and only one small device can complete the background sound processing of the live broadcast, and does not require a team of professional and technical personnel to participate in operation and maintenance.
  • the anchor can select and add audio such as background music in real time during the live broadcast.
  • FIG. 2 is a first schematic flowchart of a method for processing live background sound provided by an embodiment of the present disclosure.
  • the method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function.
  • the device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated
  • the live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
  • the client includes: a main controller and an audio processor, and a live broadcast application and a music playing application are installed on the main controller.
  • the preset display conditions include: detection of an instruction to turn on the camera device, detection of a live broadcast terminal moving in a preset motion mode (such as shaking, circular motion, flipping motion, etc.), receiving a preset voice command and At least one of an opening instruction corresponding to a preset button or a switch control is received.
  • a preset motion mode such as shaking, circular motion, flipping motion, etc.
  • the type of the display screen can be a touch screen or a non-touch screen.
  • the user clicks the shortcut control displayed on the live broadcast interface to display the background sound operation control.
  • the display mode of the background sound operation control includes: split screen, and/or, floating window display.
  • the user terminal includes at least one display screen.
  • the user terminal can be connected to multiple external display screens through an extension interface, and the extension interface includes a wireless interface and a wired interface.
  • the background sound operation control is displayed on the display screen where the non-live broadcast interface is located.
  • S202 Send the corresponding first audio code stream in the music playing application to the audio processor in response to the user's operation instruction on the background sound operation control.
  • the operation instructions for the background sound operation control include: pause, play, previous song, next song, playlist, list or playlist loop playback, sequential play, music search, lyrics search, list search Wait for at least one of the operations.
  • the background sound operation control can be used as a display medium of the music playing application, and the operation interface of the music playing application is directly copied and projected onto the preset display area of the background sound operation control.
  • the anchor can directly search for the background music needed, play, pause, etc., like operating a music player application, or select a playlist in the preset song list, and then choose a playback method, such as: play in order of playlist, random Play, list loop playback, single loop playback, etc.
  • anchors usually need another music playback device to play music in the live broadcast environment, so that the microphone can collect the music sound in the live broadcast environment, and then add it to the input audio stream of the live broadcast application.
  • a desktop computer 105 is used to start a music player application, and the audio stream generated by it is input to an external independent sound card 104, and the microphone is not directly connected to the external sound card 104.
  • the mobile phone or mobile terminal 101 is connected to the sound card 104 instead, so that the sound card 104 synthesizes the audio code stream generated by the music player application and the sound code stream collected by the microphone.
  • This embodiment breaks the technical barrier that two applications cannot be installed on the same device, and installs the live broadcast application and the music player application on the same terminal, but the opening and operation methods of the music player application during the live broadcast , but in a different way than when not live.
  • Call the music playback application through the background sound operation control so that the original music playback application needs to occupy peripheral resources such as headphone jacks and speakers when it is started, and instead occupies the audio processor resources in the terminal device corresponding to the background sound operation control.
  • Processor resources can be physical hardware devices such as IO pin interfaces corresponding to audio processing chips, or virtual resources such as virtual sound cards, virtual headphones, and virtual speakers.
  • virtual resources need to be implemented by the main controller through a special interface program or application module, which can be selected by those skilled in the art according to the actual situation, which is not limited in this application.
  • auxiliary live broadcast equipment such as desktop computers, external independent sound cards, intermediate converters, intermediate adapters, etc. are eliminated, making it easy to add background music during live broadcasting. It is fast and reduces the cost of equipment, and does not need to rely on the support of a professional technical team.
  • the anchor can easily live broadcast by one person.
  • the audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine the input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • a preset synthesis algorithm is used to synthesize the first audio data stream and the second audio code stream into an input audio code stream
  • the live application receives the input audio code stream and encodes the input audio code stream to determine the output audio code stream;
  • the output audio code stream is sent from the network interface to the network server, so that the network server sends the output audio code stream to the live viewing terminal used by each audience.
  • the live broadcast application encodes the input audio stream, transmits it to the live broadcast platform server through the network, and then sends it to each viewer by the live broadcast platform service.
  • FIG. 3 is a schematic diagram of processing a background audio code stream at a user terminal according to an embodiment of the present disclosure.
  • the client terminal 300 includes a main controller 31 and an audio processor 32 , wherein a live broadcast application 311 and a music playing application 312 are installed in the main controller 31 .
  • the user opens the background sound operation control by clicking the shortcut key, thereby calling the music playback application 312, and the user selects the audio played as the background music from the music playback application 312 by operating the background sound operation control
  • the music player application 312 generates a continuous audio code stream, that is, the first audio data stream, and inputs it into the audio processor 32 .
  • Audio processor 32 is also connected (as USB (Universal Serial Bus, Universal Serial Bus) audio input interface, 3.5mm interface microphone audio input interface, Canon head interface microphone audio input interface) with each audio collection interface of client 300, with The audio of the anchor received by each audio collection device or the audio of the environment where the anchor is located is the second audio stream.
  • USB Universal Serial Bus
  • 3.5mm interface microphone audio input interface 3.5mm interface microphone audio input interface
  • Canon head interface microphone audio input interface as USB (Universal Serial Bus, Universal Serial Bus) audio input interface
  • 3.5mm interface microphone audio input interface 3.5mm interface microphone audio input interface
  • Canon head interface microphone audio input interface Canon head interface microphone audio input interface
  • the audio processor 32 inputs the received first audio code stream and the second audio code stream into a preset synthesis algorithm model to determine a synthesized audio code stream as an input audio code stream of the live application 311 .
  • the background sound operation control is displayed on the display screen, and then in response to the user's operation instruction on the background sound operation control, the corresponding The first audio code stream is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • FIG. 4 is a second schematic flowchart of a method for processing live background sound provided by an embodiment of the present disclosure.
  • the method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function.
  • the device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated
  • the live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
  • the enabling instruction includes: clicking the shortcut key button or control for enabling the background sound on the live broadcast interface.
  • the user after the user starts the live broadcast application on the user terminal, the user clicks to start live broadcast, and the live broadcast application starts the camera on the user terminal. And when the user terminal detects that the camera is turned on, the split-screen control is displayed on the edge of the display screen.
  • the background sound activation control includes a floating window control for split-screen display, that is, a split-screen control, and the main controller at the user end can detect the activation instruction when the user clicks the split-screen control.
  • S402 Simultaneously display background sound operation controls on the live broadcast interface according to the preset display mode indicated by the activation instruction.
  • the preset display mode includes: a split-screen display mode.
  • At least one display screen of the user terminal is divided into a live interface area and a background sound operation area.
  • the boundaries of the live interface area and the background sound operation area include: straight line boundaries and/or curved boundaries.
  • FIG. 5a-5c are schematic diagrams of a display screen of a user terminal for displaying a background sound operation area in split screens according to an embodiment of the present disclosure.
  • the display screen will be divided into a live interface area 510 and a background sound operation area 520.
  • the content displayed in the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
  • the user terminal 500 maps and corresponds each control in the background sound operation area 520 to the operation interface of the music playing application through the background control algorithm.
  • the background sound operation area 520 includes:
  • the song list sub-control 521 is used to switch the song list
  • the menu sub-control 523 is used to display more functional options of the music player application
  • the operation sub-control 524 is used to implement operations such as play, pause, stop, previous song, and next song.
  • the background sound operation area 520 may also include information controls such as playback progress, volume control, song title, singer, lyricist/musician, and the like. It should be noted that those skilled in the art can select the layout and implementation style of the background sound operation area 520 according to the actual situation, which is not limited in this application.
  • the split screen can also be a left and right split screen, as shown in Figure 5b, after the split screen, the ratio of the live broadcast interface area 510 and the background sound operation area 520 to the display screen can be adjusted according to actual needs. For example: the background sound operation area 520 occupies 30%-50%.
  • the boundary between the live broadcast interface area 510 and the background sound operation area 520 after screen splitting can also be any geometric shape surrounded by straight lines and curves. As shown in FIG. 5 c , there may be multiple background sound operation areas 520 , and each sub-control is distributed in multiple background sound operation areas 520 .
  • the user's operation instructions on the background sound operation controls include: various operation instructions of the user in the background sound operation area 520 , and operation instructions on the boundary of the background sound operation area 520 .
  • the user's various operating instructions in the background sound operation area 520 including: pause, play, previous song, next song, playlist, list or playlist cycle play, sequential play, music search, lyrics search, ranking list At least one of the search.
  • the specific function of these operation instructions is to select the audio data or audio code stream as the background sound of the live broadcast in the music playing application.
  • the operation instructions on the border of the background sound operation area 520 include: the user changes the live interface area and the border adjacent to the background sound operation area in a preset manner to change the display state of the background sound operation control, and the display state includes: on or off , expand or shrink the background sound operation area.
  • the user can close the background sound operation area 520 or change the background sound operation by clicking at least one point on the boundary or clicking at least one straight line segment on the boundary.
  • the proportional size of the area 520 is a predefined range of the background sound operation area 520.
  • the user presses and holds at least one point on the boundary, and then slides up and down or left and right to change the proportion of the background sound operation area 520 .
  • the user presses a point on the boundary, and then slides up and down or left and right to change the proportion of the background sound operation area 520 .
  • this kind of boundary has both straight lines and curves.
  • Different operation methods can be set. For example, when the user presses the straight line segment of the boundary and slides in the direction perpendicular to the straight line segment, the Expand or shrink the background sound operation area 520 in the direction of .
  • the background sound operation area 520 is expanded or reduced according to the corresponding direction.
  • the proportion of the background sound operation area 520 can also be changed.
  • the corresponding first audio code stream generated when the audio data is played in the music playing application is sent to the audio processor.
  • S405 The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine the input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • the background sound operation control is displayed on the display screen in a split-screen form, and then in response to the user's operation instruction on the background sound operation control, the The corresponding first audio code stream in the music playing application is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • the embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can easily and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
  • FIG. 6 is a third schematic flowchart of a method for processing live broadcast background sound provided by an embodiment of the present disclosure.
  • the method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function.
  • the device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated
  • the live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
  • S602 According to the opening instruction, simultaneously display the background sound operation controls on the live broadcast interface in the preset display mode of the floating window.
  • the background sound operation control is superimposed and displayed on the live broadcast interface in the form of a floating window.
  • FIG. 7a-7e are schematic diagrams of displaying background sound operation controls in the form of a floating window according to an embodiment of the present disclosure.
  • the floating window 710 which is the background sound operation control, is superimposed and displayed on the live broadcast interface.
  • the floating window 710 includes: a song icon 711 , a search control 712 , a menu control 713 , a play mode control 714 and a zoom control 715 .
  • the song icon 711 is used to display the cover image of the last played song.
  • the search control 712 is used to search for audio-related information such as songs, lyrics, singers, albums, playlists, rankings, etc., so that the user can filter out the background music that needs to be played during the live broadcast.
  • S603 Send the corresponding first audio code stream in the music playing application to the audio processor in response to the user's operation instruction on the background sound operation control.
  • the operation instruction refers to the user's operation on each sub-control on the floating window, specifically, as shown in Figures 7a-7d:
  • the floating window 710 only displays the song icon 711, so as to reduce the visual impact of the floating window 710 on the live interface.
  • the floating window 710 will change to the form shown in Fig.
  • song list 1 begins to play the song in this song list, as shown in Figure 7c, this song can be selected from the song list at random, also can be the song of preset position in this song list, as the first The song can also be the song played last time in the playlist.
  • the search When the user enters the search keyword/word on the search control 712 and instructs to perform the search (such as pressing Enter, or clicking the "magnifying glass button"), as shown in Figure 7e, the search will be launched below the floating window 710 In the result list, if the user clicks on any search result, the song corresponding to the search result will be switched and played.
  • the main controller of the user terminal After the floating window 710 starts to play the song, the main controller of the user terminal sends the corresponding first audio data stream in the music playing application to the audio processor.
  • the floating window can also be designed to display the same operation interface as the music player application, so that the host does not need to change the usage habits of the music player application, making it easier to use and improving user experience.
  • the audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine an input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • the background sound operation control is displayed on the display screen in the form of a floating window, and then in response to the user's operation instruction on the background sound operation control, the The corresponding first audio code stream in the music playing application is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application.
  • the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
  • the embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can conveniently and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
  • FIG. 8 is a structural block diagram of an apparatus 800 for processing live background sound provided in an embodiment of the present disclosure.
  • the device includes:
  • Main control module 801 and audio processing module 802, live application and music playing application are installed in the main control module 801;
  • the main control module 801 is used to display background sound operation controls on the display screen when the preset display conditions are met;
  • the main control module 801 is also configured to send the corresponding first audio code stream in the music playing application to the audio processing module 802 in response to the user's operation instruction on the background sound operation control;
  • the audio processing module 802 is configured to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application.
  • the second audio code stream includes the sound from the host received through the audio collection device signal, and/or, sound signal in a live environment.
  • the main control module 801 is configured to:
  • the background sound operation controls are simultaneously displayed on the live broadcast interface.
  • the preset display mode includes: at least one mode of split screen and floating window.
  • the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
  • the main control module 801 when the preset display mode is split screen, is configured to divide at least one display screen of the user terminal into a live interface area and a background sound operation area;
  • the main control module 801 is also used to obtain the user to change the adjacent boundary of the live interface area and the background sound operation area in a preset manner, so as to change the display state of the background sound operation control.
  • the display state includes: open or closed state, enlargement or reduction Background sound operation area.
  • the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curved segment on the boundary, and sliding the boundary along a preset direction and/or a preset path.
  • the boundary includes a straight line boundary and/or a curved boundary.
  • the operation instructions include: pause, play, previous song, next song, playlist, list or playlist loop play, sequential play, music search, lyrics search, list search at least one.
  • the audio processing module 802 is configured to use a preset synthesis algorithm to synthesize the first audio data stream and the second audio code stream into an input audio code stream;
  • the main control module 801 is also used to receive the input audio code stream through the live broadcast application, and encode the input audio code stream to determine the output audio code stream; send the output audio code stream from the network interface to the network server, so that The network server sends the output audio code stream to the live viewing terminal used by each audience.
  • the device provided in this embodiment can be used to execute any one of the above method embodiments, and its implementation principles and technical effects are similar, so this embodiment will not repeat them here.
  • the electronic device 900 may be a terminal device or a server.
  • the terminal equipment may include but not limited to mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (Personal Digital Assistant, PDA for short), tablet computers (Portable Android Device, PAD for short), portable multimedia players (Portable Media Player, PMP for short), mobile terminals such as vehicle-mounted terminals (such as vehicle-mounted navigation terminals), and fixed terminals such as digital TV (Television), desktop computers, etc.
  • PDA Personal Digital Assistant
  • PMP Portable Multimedia Player
  • mobile terminals such as vehicle-mounted terminals (such as vehicle-mounted navigation terminals)
  • fixed terminals such as digital TV (Television), desktop computers, etc.
  • the electronic device shown in FIG. 9 is only an example, and should not limit the functions and application scope of the embodiments of the present disclosure.
  • an electronic device 900 may include a processing device (such as a central processing unit, a graphics processing unit, etc.) 908 loads the programs in the random access memory (Random Access Memory, RAM for short) 903 to execute various appropriate actions and processes.
  • a processing device such as a central processing unit, a graphics processing unit, etc.
  • RAM Random Access Memory
  • various programs and data necessary for the operation of the electronic device 900 are also stored.
  • the processing device 901, ROM 902, and RAM 903 are connected to each other through a bus 904.
  • An input/output (Input/Output, I/O for short) interface 905 is also connected to the bus 904 .
  • an input device 909 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; ), a speaker, a vibrator, etc.
  • a storage device 908 including, for example, a magnetic tape, a hard disk, etc.
  • the communication means 909 may allow the electronic device 900 to perform wireless or wired communication with other devices to exchange data. While FIG. 9 shows electronic device 900 having various means, it is to be understood that implementing or having all of the means shown is not a requirement. More or fewer means may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product, which includes a computer program carried on a computer-readable medium, where the computer program includes program codes for executing the methods shown in the flowcharts.
  • the computer program may be downloaded and installed from a network via communication means 909, or from storage means 908, or from ROM 902.
  • the processing device 901 the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
  • the above-mentioned computer-readable medium in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two.
  • a computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof.
  • Computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable Read Only Memory (Erasable Programmable Read Only Memory, referred to as EPROM or flash memory), optical fiber, portable compact disk read only memory (Compact Disc Read Only Memory, referred to as CD-ROM), optical storage device, magnetic storage device, or any of the above the right combination.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device .
  • the program code contained on the computer readable medium can be transmitted by any appropriate medium, including but not limited to: electric wire, optical cable, RF (Radio Frequency, radio frequency), etc., or any suitable combination of the above.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device is made to execute the methods shown in the above-mentioned embodiments.
  • Computer program code for carrying out the operations of the present disclosure can be written in one or more programming languages, or combinations thereof, including object-oriented programming languages—such as Java, Smalltalk, C++, and conventional Procedural Programming Language - such as "C" or a similar programming language.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or it can be connected to an external A computer (connected via the Internet, eg, using an Internet service provider).
  • LAN Local Area Network
  • WAN Wide Area Network
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of the unit does not constitute a limitation of the unit itself under certain circumstances, for example, the first obtaining unit may also be described as "a unit for obtaining at least two Internet Protocol addresses".
  • exemplary types of hardware logic components include: Field Programmable Gate Array (Field Programmable Gate Array, FPGA for short), Application Specific Integrated Circuit (ASIC for short), Application Specific Standard Products ( Application Specific Standard Parts (ASSP for short), System on Chip (SOC for short), Complex Programmable Logic Device (CPLD for short), etc.
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • a machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • Embodiments of the present disclosure also provide a computer program product, including a computer program, which implements the methods in the foregoing embodiments when the computer program is executed by a processor.
  • An embodiment of the present disclosure also provides a live broadcast integrated machine or a live broadcast integrated device, including the electronic device corresponding to FIG. 9 .
  • the control circuit of the live broadcast all-in-one machine includes: a main control module and an audio processing module, wherein the main control module is installed with a live broadcast application and an audio playback application, and the audio processing module is used for the audio played in the audio playback application.
  • the data code stream is synthesized into the host audio collected by the audio collection device (such as a microphone) to form the input audio of the live application.
  • a live broadcast background sound processing method which is applied to a user end, and the user end includes: a main controller and an audio processor, and the main controller installs There are live broadcast applications and music playback applications, and the methods include:
  • the background sound operation controls are displayed on the display screen
  • the audio processor concatenates and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application, and the second audio code stream includes The device receives the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
  • displaying the background sound operation control on the display screen includes:
  • the background sound operation control is simultaneously displayed on the live broadcast interface.
  • the preset display manner includes: at least one of split screen and floating window.
  • the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
  • the simultaneously displaying the background sound operation control on the live broadcast interface includes:
  • the operating instructions include:
  • the user changes the live broadcast interface area and the border adjacent to the background sound operation area in a preset manner to change the display state of the background sound operation control, and the display state includes: open or closed state, enlarged or Reduce the background sound operation area.
  • the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curve segment on the boundary, sliding all stated boundary.
  • the boundary includes a straight line boundary and/or a curved boundary.
  • the operation instructions include: pause, play, previous song, next song, song list, list or song list loop play, sequential play, music search, lyrics search, ranking list At least one of the single searches.
  • the audio processor concatenates the first audio data stream and the second audio code stream to determine the input audio code stream of the live application, including:
  • the live application receives the input audio code stream, and encodes the input audio code stream to determine the output audio code stream;
  • the output audio code stream is sent from the network interface to the network server, so that the network server sends the output audio code stream to the live viewing terminal used by each audience.
  • a live broadcast background sound processing device including:
  • the main control module and the audio processing module, the live application and the music playing application are installed in the main control module;
  • the main control module is used to display background sound operation controls on the display screen when the preset display conditions are met;
  • the main control module is also used to send the corresponding first audio code stream in the music playing application to the audio processing module in response to the user's operation instruction on the background sound operation control;
  • the audio processing module is used to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application.
  • the second audio code stream includes the sound signal sent by the anchor received through the audio collection device , and/or, the sound signal in the live environment.
  • the main control module is configured to:
  • the background sound operation controls are simultaneously displayed on the live broadcast interface.
  • the preset display manner includes: at least one of split screen and floating window.
  • the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
  • the main control module when the preset display mode is split screen, is configured to divide at least one display screen of the client into a live interface area and a background sound operation area;
  • the main control module is also used to obtain the user to change the adjacent boundary of the live interface area and the background sound operation area according to the preset method, so as to change the display state of the background sound operation control.
  • the display state includes: open or close state, expand or reduce the background sound operation area.
  • the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curved segment on the boundary, and sliding the boundary along a preset direction and/or a preset path.
  • the boundary includes a straight line boundary and/or a curved boundary.
  • the operation instructions include: pause, play, previous song, next song, playlist, list or playlist loop play, sequential play, music search, lyrics search, list search at least one of the
  • the audio processing module is configured to use a preset synthesis algorithm to synthesize the first audio data stream and the second audio code stream into an input audio code stream;
  • the main control module is also used to receive the input audio code stream through the live broadcast application, and encode the input audio code stream to determine the output audio code stream; send the output audio code stream from the network interface to the network server, so that the network The server sends the output audio code stream to the live viewing terminals used by each viewer.
  • an electronic device including:
  • processors at least one processor and memory
  • the memory stores the computer program
  • At least one processor executes the computer program stored in the memory, so that the at least one processor executes the live broadcast background sound processing method of the above first aspect and various possible designs of the first aspect.
  • a live streaming all-in-one machine including: electronic devices of various possible designs in the third aspect.
  • a computer-readable storage medium in which a computer program is stored, and when the processor executes the computer program, the above first aspect and The first aspect is various possible designs of live background sound processing methods.
  • a computer program product including a computer program.
  • the processor executes the computer program, the live background sound processing of various possible designs of the above first aspect can be realized. method.
  • a computer program is provided.
  • the processor executes the computer program, various possible designs of the live background sound processing method in the above first aspect are realized.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Embodiments of the present disclosure provide a livestreaming background sound processing method and apparatus, a device, a medium, and a program product. The method comprises: first displaying a background sound operation control on a display screen when a preset display condition is satisfied; then sending an audio processor a corresponding first audio code stream in a music playing application in response to an operation instruction of a user on the background sound operation control; and finally, the audio processor performing convergence and synthesis on a first audio data stream and a second audio code stream to determine an input audio code stream of the livestreaming application, the second audio code stream comprising a sound signal sent by a livestreamer and/or a sound signal in a livestreaming environment received by a client by means of an audio acquisition apparatus. The embodiments of the present disclosure solve the technical problem in the prior art that when facing a high-quality livestreaming requirement, the addition of background sound can only depend on a complex professional device and a professional technician team. The technical effect that background sound can be conveniently and quickly added by a livestreamer on a livestreaming terminal is achieved.

Description

直播背景音处理方法、装置、设备、介质及程序产品Live broadcast background sound processing method, device, equipment, medium and program product
相关申请交叉引用Related Application Cross Reference
本申请要求于2021年5月13日提交中国专利局、申请号为202110522604.2、发明名称为“直播背景音处理方法、装置、设备、介质及程序产品”的中国专利申请的优先权,其全部内容通过引用并入本文。This application claims the priority of the Chinese patent application submitted to the China Patent Office on May 13, 2021, with the application number 202110522604.2, and the title of the invention is "Live background sound processing method, device, equipment, medium and program product", the entire content of which Incorporated herein by reference.
技术领域technical field
本公开实施例涉及广播电视及互联网直播技术领域,尤其涉及一种直播背景音处理方法、装置、设备、介质及程序产品。Embodiments of the present disclosure relate to the technical field of broadcast television and Internet live broadcast, and in particular to a method, device, equipment, medium and program product for processing live background sound.
背景技术Background technique
随着社会信息化的不断深入发展,人们对于信息获取的及时性需求也不断提高。直播技术能够使得人们突破空间的限制,实时获取到直播现场的各类信息。而近年来兴起的网络直播,也成为了人们展示自我的一个平台,获得了极大的社会热度。手机直播更是降低了直播的技术门槛,让不具备专业技术的普通人也能简单地进行直播。随着观众对于直播内容及质量的要求不断提高,围绕着手机这类移动直播设备,衍生出了大量的周边辅助产品。With the continuous and in-depth development of social informatization, people's demand for the timeliness of information acquisition is also increasing. Live broadcast technology enables people to break through the limitations of space and obtain various information on the live broadcast site in real time. In recent years, webcasting has also become a platform for people to show themselves, and has gained great social popularity. Mobile live broadcasting lowers the technical threshold of live broadcasting, allowing ordinary people without professional skills to simply broadcast live broadcasting. As the audience's requirements for live broadcast content and quality continue to increase, a large number of peripheral auxiliary products have been derived around mobile live broadcast devices such as mobile phones.
目前,一个高质量的手机直播,所需要的硬件设备包括:台式电脑、声卡、若干中间转换器/衔接器、手机、耳机、麦克风等,除了主播外,还需要配备专门的技术人员来负责各种直播效果的实时添加。At present, a high-quality mobile live broadcast requires hardware equipment including: desktop computers, sound cards, several intermediate converters/connectors, mobile phones, earphones, microphones, etc. Real-time addition of a live broadcast effect.
即现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。这无疑又提高了直播的技术门槛,增加了主播们的直播成本。That is to say, the existing technology has the technical problem of only relying on complex professional equipment and a team of professional technicians to add background sound when facing high-quality live broadcast requirements. This undoubtedly raised the technical threshold for live broadcasting and increased the cost of live broadcasting for anchors.
发明内容Contents of the invention
本公开实施例提供一种直播背景音处理方法、装置、设备、介质及程序产品,以解决现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。Embodiments of the present disclosure provide a live broadcast background sound processing method, device, equipment, media, and program products to solve the existing technology that can only rely on complex professional equipment and professional technical personnel teams when faced with high-quality live broadcast requirements technical issues to add background sound.
第一方面,本公开实施例提供一种直播背景音处理方法,应用于用户端,所述用户端包括:主控制器以及音频处理器,所述主控制器安装有直播应用以及音乐播放应用,该方法包括:In the first aspect, the embodiment of the present disclosure provides a live broadcast background sound processing method, which is applied to a user end, and the user end includes: a main controller and an audio processor, and the main controller is installed with a live broadcast application and a music playback application, The method includes:
在满足预设显示条件时,在显示屏上显示背景音操作控件;When the preset display conditions are met, the background sound operation controls are displayed on the display screen;
响应于用户对所述背景音操作控件的操作指令,将所述音乐播放应用中对应的第一音频码流发送到所述音频处理器中;Responding to the user's operation instruction on the background sound operation control, sending the corresponding first audio code stream in the music playing application to the audio processor;
所述音频处理器将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,所述第二音频码流包括所述用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application, and the second audio code stream includes The device receives the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
第二方面,本公开实施例提供一种直播背景音处理装置,包括:主控模块以及音频处理模块,所述主控模块中安装有直播应用以及音乐播放应用;其中,In a second aspect, an embodiment of the present disclosure provides a live broadcast background sound processing device, including: a main control module and an audio processing module, where a live broadcast application and a music playback application are installed in the main control module; wherein,
所述主控模块,用于在满足预设显示条件时,在显示屏上显示背景音操作控件;The main control module is used to display background sound operation controls on the display screen when preset display conditions are met;
所述主控模块,还用于响应于用户对所述背景音操作控件的操作指令,将所述音乐播放应用中对应的第一音频码流发送到所述音频处理模块中;The main control module is further configured to send the corresponding first audio code stream in the music playing application to the audio processing module in response to the user's operation instruction on the background sound operation control;
所述音频处理模块,用于将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,所述第二音频码流包括通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processing module is configured to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live application, and the second audio code stream includes Receive the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
第三方面,本公开实施例提供一种电子设备,包括:In a third aspect, an embodiment of the present disclosure provides an electronic device, including:
至少一个处理器和存储器;at least one processor and memory;
所述存储器存储计算机程序;the memory stores a computer program;
所述至少一个处理器执行所述存储器存储的计算机程序,使得所述至少一个处理器执行如上第一方面以及第一方面各种可能的设计所述的直播背景音处理方法。The at least one processor executes the computer program stored in the memory, so that the at least one processor executes the live broadcast background sound processing method described in the above first aspect and various possible designs of the first aspect.
第四方面,本公开实施例提供一种直播一体机,包括第三方面所提供的任意一种可能的电子设备。In a fourth aspect, an embodiment of the present disclosure provides a live streaming all-in-one machine, including any possible electronic device provided in the third aspect.
第五方面,本公开实施例提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,当处理器执行所述计算机程序时,实现如上第一方面以及第一方面各种可能的设计所述的直播背景音处理方法。In the fifth aspect, the embodiments of the present disclosure provide a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the processor executes the computer program, the above first aspect and each of the first aspect are realized. A possible design of the described live background sound processing method.
第六方面,本公开实施例提供一种计算机程序产品,包括计算机程序,该计算机程序被处理器执行时,实现如上第一方面以及第一方面各种可能的设计所述的直播背景音处理方法。In a sixth aspect, an embodiment of the present disclosure provides a computer program product, including a computer program, which, when executed by a processor, implements the live broadcast background sound processing method described in the above first aspect and various possible designs of the first aspect .
第七方面,本公开实施例提供一种计算机程序,该计算机程序被处理器执行时,实现如上第一方面以及第一方面各种可能的设计所述的直播背景音处理方法。In the seventh aspect, the embodiments of the present disclosure provide a computer program. When the computer program is executed by a processor, it implements the live broadcast background sound processing method described in the first aspect and various possible designs of the first aspect.
本公开实施例提供的直播背景音处理方法、装置、设备、介质及程序产品,该方法首先在满足预设显示条件时,在显示屏上显示背景音操作控件,然后响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中,最后音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。本公开实施例解决了现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。达到了主播在直播终端上即可便捷快速地在直播音频流中添加背景音的技术效果,降低了直播的成本和技术门槛,提高了用户的使用体验感。The live broadcast background sound processing method, device, equipment, medium and program product provided by the embodiments of the present disclosure, the method first displays the background sound operation control on the display screen when the preset display condition is met, and then responds to the user's operation on the background sound The operation command of the control sends the corresponding first audio code stream in the music player application to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input of the live broadcast application Audio stream. The second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment. The embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can conveniently and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
附图说明Description of drawings
为了更清楚地说明本公开实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本公开的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present disclosure or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description These are some embodiments of the present disclosure. Those skilled in the art can also obtain other drawings based on these drawings without any creative effort.
图1为本公开实施例提供的现有室内主播使用的直播设备套件结构示意图;FIG. 1 is a schematic structural diagram of a live broadcast equipment kit used by an existing indoor anchor provided by an embodiment of the present disclosure;
图2为本公开实施例提供的直播背景音处理方法的流程示意图一;FIG. 2 is a first schematic flow diagram of a method for processing live background sound provided by an embodiment of the present disclosure;
图3为本公开实施例提供的用户端处理背景音频码流的示意图;FIG. 3 is a schematic diagram of a background audio code stream processed by a client provided by an embodiment of the present disclosure;
图4为本公开实施例提供的直播背景音处理方法的流程示意图二;FIG. 4 is a second schematic flow diagram of a method for processing live background sound provided by an embodiment of the present disclosure;
图5a-5c为本公开实施例提供的一种用户端的显示屏分屏显示背景音操作区的示意图;5a-5c are schematic diagrams of a user terminal display screen split-screen display background sound operation area provided by an embodiment of the present disclosure;
图6为本公开实施例提供的直播背景音处理方法的流程示意图三;FIG. 6 is a third schematic flowchart of a method for processing live broadcast background sound provided by an embodiment of the present disclosure;
图7a-7e为本公开实施例提供的一种以悬浮窗的形式显示背景音操作控件的示意图;7a-7e are schematic diagrams of displaying background sound operation controls in the form of a floating window according to an embodiment of the present disclosure;
图8为本公开实施例提供的直播背景音处理装置的结构框图;FIG. 8 is a structural block diagram of a live broadcast background sound processing device provided by an embodiment of the present disclosure;
图9为本公开实施例提供的电子设备的结构示意图。FIG. 9 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
为使本公开实施例的目的、技术方案和优点更加清楚,下面将结合本公开实施例中的附图,对本公开实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本公开一部分实施例,而不是全部的实施例。基于本公开中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,包括但不限于对多个实施例的组合,都属于本公开保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below in conjunction with the drawings in the embodiments of the present disclosure. Obviously, the described embodiments It is a part of the embodiments of the present disclosure, but not all of them. Based on the embodiments in the present disclosure, all other embodiments obtained by persons of ordinary skill in the art without creative work, including but not limited to combinations of multiple embodiments, fall within the protection scope of the present disclosure.
本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”、“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本公开的实施例例如能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and not necessarily Used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are, for example, capable of practice in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a sequence of steps or elements is not necessarily limited to the expressly listed instead, may include other steps or elements not explicitly listed or inherent to the process, method, product or apparatus.
随着移动互联网技术的不断发展,人们的日常生活已经高度信息化,而直播技术是解决人们在不同地理空间上异地接收直播地信息的一种高效手段。近年来兴起的网络直播,更是给人们的娱乐生活增加了很多乐趣,收到了人们的追捧。大量主播纷纷加入了网络直播的行列。而通过移动设备来进行直播,以其简便性受到了很多主播的青睐。但是,随着观众对于直播质量要求的不断提高,主播在进行直播的时候,需要添加直播背景音乐。With the continuous development of mobile Internet technology, people's daily life has become highly informatized, and live broadcast technology is an efficient means for people to receive live broadcast information in different geographical spaces. The live webcast that has emerged in recent years has added a lot of fun to people's entertainment life and has been sought after by people. A large number of anchors have joined the ranks of the webcast. Live broadcasting through mobile devices is favored by many anchors for its simplicity. However, as the audience's requirements for live broadcast quality continue to improve, the anchor needs to add live background music when performing live broadcast.
面对这种需求,对于大多数主播来说,只能够通过购置一套专业的直播设备,包括:台式电脑、专业声卡、若干个中间转换器/衔接器等专业直播辅助设备来实现直播中的直播背景音处理需求。Faced with this demand, for most anchors, it is only possible to purchase a set of professional live broadcast equipment, including: desktop computers, professional sound cards, several intermediate converters/connectors and other professional live broadcast auxiliary equipment to realize live broadcast. Live background sound processing requirements.
图1为本公开实施例提供的现有室内主播使用的直播设备套件结构示意图。如图1所示,直播设备套件包括:移动终端101、中间衔接器102、中间转换器103、声卡104、台式电脑105。FIG. 1 is a schematic structural diagram of a live broadcast equipment kit used by an existing indoor anchor provided by an embodiment of the present disclosure. As shown in FIG. 1 , the live broadcast equipment kit includes: a mobile terminal 101 , an intermediate adapter 102 , an intermediate converter 103 , a sound card 104 , and a desktop computer 105 .
其中,移动终端101中安装有直播应用。Wherein, the live application is installed in the mobile terminal 101 .
中间衔接器102用于解决移动终端101在长时间直播时,充电接口被数据线占用而无法充电的问题。The intermediate adapter 102 is used to solve the problem that the charging interface of the mobile terminal 101 cannot be charged because the charging interface is occupied by the data line when the mobile terminal 101 is live broadcasting for a long time.
中间转换器103用于将台式电脑105中的特效数据转接到移动终端101中,并且连接耳机后可以给客户提供耳返功能,还能够解决声卡104只能单路输出而不能反向接收直播应用输出的音频数据的问题。The intermediate converter 103 is used to transfer the special effect data in the desktop computer 105 to the mobile terminal 101, and after connecting the earphones, it can provide the customer with an ear return function, and it can also solve the problem that the sound card 104 can only output a single channel and cannot receive live broadcast in reverse A problem with the audio data output by the app.
声卡104与麦克风连接,实现将音频模拟信号转换为数字信号,输入台式电脑105中进行各种音效处理。The sound card 104 is connected with the microphone to convert the audio analog signal into a digital signal, and input it into the desktop computer 105 for various sound effect processing.
但是,图1所示的方式造成了直播成本的提高,需要配置多个辅助设备,而绝大多数主播不具备专业技术,无法自己去配置这一套设备,因此还需要配备专门的技术人员团队来构建、运营和维护这套专业直播设备,这些都使得本已通过手机等移动终端降低了的直播门槛,又被提高了起来。However, the method shown in Figure 1 increases the cost of live broadcasting and requires the configuration of multiple auxiliary equipment, and most anchors do not have professional skills and cannot configure this set of equipment by themselves, so a dedicated team of technical personnel is also required To build, operate and maintain this set of professional live broadcast equipment, these have raised the live broadcast threshold that has been lowered through mobile terminals such as mobile phones.
并且,由于连接了多个设备,使得整套直播设备的稳定性容易受到各种因素的影响,任何一个设备的接插件插口出现问题,或者线路由于某种原因断路,都会使得系统无法工作。Moreover, due to the connection of multiple devices, the stability of the entire set of live broadcast equipment is easily affected by various factors. If there is a problem with the connector socket of any device, or the line is broken for some reason, the system will not work.
基于上述技术问题,本申请的发明构思旨在:Based on the above-mentioned technical problems, the inventive concept of the present application aims at:
通过一个用户端直接让主播在直播的时候在通过屏幕快速简便地将所需要的音频添加到直播的音频流中即设置直播背景音乐。使得直播无需多个设备,仅用一个小型设备就能完成直播背景音处理,也无需专业技术人员团队参与运维,主播一人即可在直播时实时选择并添加音频如背景音乐。Through a client, the anchor can quickly and easily add the required audio to the live audio stream through the screen during the live broadcast, that is, set the live background music. The live broadcast does not require multiple devices, and only one small device can complete the background sound processing of the live broadcast, and does not require a team of professional and technical personnel to participate in operation and maintenance. The anchor can select and add audio such as background music in real time during the live broadcast.
下面结合附图对本申请提供的直播背景音处理方法进行详细介绍。The live broadcast background sound processing method provided by the present application will be described in detail below with reference to the accompanying drawings.
参考图2,图2为本公开实施例提供的直播背景音处理方法的流程示意图一。本实施例的方法应用在用户端中,用户端的实现形式包括具有直播功能的设备,该具有直播功能的设备包括但不限于:直播一体式终端、移动终端和计算机设备,用户可以通过该直播一体式终端或移动终端直接完成直播,无需使用其它配套设备来对直播的音频或视频进行额外的处理。Referring to FIG. 2 , FIG. 2 is a first schematic flowchart of a method for processing live background sound provided by an embodiment of the present disclosure. The method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function. The device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated The live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
该直播背景音处理方法包括:The live broadcast background sound processing method includes:
S201:在满足预设显示条件时,在显示屏上显示背景音操作控件。S201: When the preset display condition is met, display the background sound operation control on the display screen.
在本实施例中,用户端包括:主控制器以及音频处理器,在主控制器上安装有直播应用以及音乐播放应用。In this embodiment, the client includes: a main controller and an audio processor, and a live broadcast application and a music playing application are installed on the main controller.
在本步骤中,预设显示条件包括:检测到摄像设备开启指令、检测到直播终端按预设运动方式进行运动(如摇一摇、圆周运动、翻转运动等)、接收到预设语音指令和接收到预设按钮或开关控件对应的开启指令中的至少一种。In this step, the preset display conditions include: detection of an instruction to turn on the camera device, detection of a live broadcast terminal moving in a preset motion mode (such as shaking, circular motion, flipping motion, etc.), receiving a preset voice command and At least one of an opening instruction corresponding to a preset button or a switch control is received.
值得说明的是,显示屏的类型可以是触控屏或非触控屏。It should be noted that the type of the display screen can be a touch screen or a non-touch screen.
具体的,例如,用户通过点击显示在直播界面上的快捷控件,来显示背景音操作控件。Specifically, for example, the user clicks the shortcut control displayed on the live broadcast interface to display the background sound operation control.
在一种可能的设计中背景音操作控件的显示方式包括:分屏,和/或,悬浮窗显示。In a possible design, the display mode of the background sound operation control includes: split screen, and/or, floating window display.
在本实施例中,用户端上包括至少一个显示屏。在一种可能的设计中,用户端可以通过扩展接口连接多个外接显示屏,扩展接口包括无线接口和有线接口。In this embodiment, the user terminal includes at least one display screen. In a possible design, the user terminal can be connected to multiple external display screens through an extension interface, and the extension interface includes a wireless interface and a wired interface.
在一种可能的设计中,当用户端的显示屏数量大于1个时,背景音操作控件显示在非直播界面所在的显示屏上。In a possible design, when the number of display screens on the user terminal is greater than one, the background sound operation control is displayed on the display screen where the non-live broadcast interface is located.
S202:响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中。S202: Send the corresponding first audio code stream in the music playing application to the audio processor in response to the user's operation instruction on the background sound operation control.
在本步骤中,对背景音操作控件的操作指令包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索等操作中的至少一个。In this step, the operation instructions for the background sound operation control include: pause, play, previous song, next song, playlist, list or playlist loop playback, sequential play, music search, lyrics search, list search Wait for at least one of the operations.
具体的,背景音操作控件可以作为音乐播放应用的显示媒介,直接将音乐播放应用的操作界面直接复制投影到背景音操作控件的预设显示区域上。这样主播就可以直接像操作音乐播放应用一样,搜索需要的背景音乐,进行播放、暂停等操作,或者在预设歌单中选择播放 列表,然后选择播放方式,如:按播放列表顺序播放,随机播放,列表循环播放,单曲循环播放等。Specifically, the background sound operation control can be used as a display medium of the music playing application, and the operation interface of the music playing application is directly copied and projected onto the preset display area of the background sound operation control. In this way, the anchor can directly search for the background music needed, play, pause, etc., like operating a music player application, or select a playlist in the preset song list, and then choose a playback method, such as: play in order of playlist, random Play, list loop playback, single loop playback, etc.
需要说明的是,在现有技术中,当一个终端上安装有多个应用程序,且若存在多个应用程序同时开启时都需要使用相同的接口或外接设备的情况下,这些应用程序就会产生冲突。如在手机直播时,直播应用需要占用耳机接口或扬声器,此时若想在手机终端上打开音频播放应用,由于音频播放应用也需要用到扬声器或耳机接口,这就使得两个应用产生了冲突,若用户执意打开音频播放应用则直播应用将被迫关闭。It should be noted that, in the prior art, when multiple applications are installed on a terminal, and if there are multiple applications that need to use the same interface or external device when they are opened at the same time, these applications will conflict. For example, during a live broadcast on a mobile phone, the live broadcast application needs to occupy the headphone jack or the speaker. At this time, if you want to open the audio playback application on the mobile terminal, since the audio playback application also needs to use the speaker or the headphone jack, this creates a conflict between the two applications. , if the user insists on opening the audio playback application, the live broadcast application will be forced to close.
现有技术的这个问题就使得主播无法直接在手机或其它移动终端上直接打开音频播放应用来进行背景音乐的播放。而主播们解决此问题,一般是需要另一个音乐播放设备在直播环境中播放音乐,使得麦克风采集到直播环境中的音乐声,从而将其加入到直播应用的输入音频码流中。This problem in the prior art makes it impossible for the anchor to directly open the audio playback application on the mobile phone or other mobile terminals to play the background music. To solve this problem, anchors usually need another music playback device to play music in the live broadcast environment, so that the microphone can collect the music sound in the live broadcast environment, and then add it to the input audio stream of the live broadcast application.
或者是如图1所示通过外接声卡104和中间转换器103的形式,利用台式电脑105开启音乐播放应用,其产生的音频码流输入到外置独立的声卡104当中,而麦克风也不是直接与手机或移动终端101连接,而是与声卡104连接,以便于声卡104将音乐播放应用产生的音频码流与麦克风采集到的声音码流进行合成。Or as shown in Figure 1, through the form of an external sound card 104 and an intermediate converter 103, a desktop computer 105 is used to start a music player application, and the audio stream generated by it is input to an external independent sound card 104, and the microphone is not directly connected to the external sound card 104. The mobile phone or mobile terminal 101 is connected to the sound card 104 instead, so that the sound card 104 synthesizes the audio code stream generated by the music player application and the sound code stream collected by the microphone.
显然上述现有技术是通过将直播应用与音乐播放应用分置于两个不同设备中,再附加中间连接设备,才能解决两个应用同时开启时的资源占用矛盾。但是这样也就大大增加了直播成本和直播技术门槛。Apparently, in the prior art mentioned above, the contradiction of resource occupation when the two applications are started at the same time can be solved by separating the live broadcast application and the music playback application into two different devices, and then adding an intermediate connection device. But this will greatly increase the cost of live broadcast and the threshold of live broadcast technology.
而本实施例却是一反常态,打破了两个应用不能安装在同一设备上的技术壁垒,将直播应用与音乐播放应用一同安装在同一个终端上,但是在直播时音乐播放应用的开启和操作方式,却与在非直播时的方式不同。通过背景音操作控件来调用音乐播放应用,让原本音乐播放应用开启时需要占用耳机接口、扬声器等外设资源,转为占用背景音操作控件所对应的终端设备中的音频处理器资源,该音频处理器资源可以是实体的硬件设备如音频处理芯片对应的IO管脚接口,也可以是虚拟声卡、虚拟耳机、虚拟扬声器等虚拟资源。This embodiment, on the other hand, breaks the technical barrier that two applications cannot be installed on the same device, and installs the live broadcast application and the music player application on the same terminal, but the opening and operation methods of the music player application during the live broadcast , but in a different way than when not live. Call the music playback application through the background sound operation control, so that the original music playback application needs to occupy peripheral resources such as headphone jacks and speakers when it is started, and instead occupies the audio processor resources in the terminal device corresponding to the background sound operation control. Processor resources can be physical hardware devices such as IO pin interfaces corresponding to audio processing chips, or virtual resources such as virtual sound cards, virtual headphones, and virtual speakers.
需要说明的是,虚拟资源需要由主控制器通过专门的接口程序或应用模块来实现,本领域技术人员可以根据实际情况进行选择,本申请不做限定。It should be noted that the virtual resources need to be implemented by the main controller through a special interface program or application module, which can be selected by those skilled in the art according to the actual situation, which is not limited in this application.
通过可视化的背景音操作控件以及音频处理器的结合,得以实现省去了台式电脑、外置独立声卡、中间转换器、中间衔接器等等外部辅助直播设备,既使得直播时添加背景音乐的简单快捷,又降低了设备成本,还无需依靠专业的技术人员团队支持,主播一人便可轻松直播。Through the combination of visual background sound operation controls and audio processors, external auxiliary live broadcast equipment such as desktop computers, external independent sound cards, intermediate converters, intermediate adapters, etc. are eliminated, making it easy to add background music during live broadcasting. It is fast and reduces the cost of equipment, and does not need to rely on the support of a professional technical team. The anchor can easily live broadcast by one person.
S203:音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。S203: The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine the input audio code stream of the live application.
在本步骤中,第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。In this step, the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
在本实施例中,利用预设合成算法,将第一音频数据流与第二音频码流合成为输入音频码流;In this embodiment, a preset synthesis algorithm is used to synthesize the first audio data stream and the second audio code stream into an input audio code stream;
直播应用接收输入音频码流,并对输入音频码流进行编码,以确定输出音频码流;The live application receives the input audio code stream and encodes the input audio code stream to determine the output audio code stream;
将输出音频码流从网络接口中发送到网络服务器中,以使网络服务器将输出音频码流发送给各个观众所使用的直播观看终端。The output audio code stream is sent from the network interface to the network server, so that the network server sends the output audio code stream to the live viewing terminal used by each audience.
即直播应用将输入音频码流编码后,通过网络传输给直播平台服务器,再由直播平台服务发送给各个观众。That is, the live broadcast application encodes the input audio stream, transmits it to the live broadcast platform server through the network, and then sends it to each viewer by the live broadcast platform service.
图3为本公开实施例提供的用户端处理背景音频码流的示意图。如图3所示,用户端300中包括主控制器31和音频处理器32,其中,主控制器31中安装了直播应用311和音乐播放应用312。在开启了直播应用后,用户通过点击快捷键打开背景音操作控件,以此调用音乐播放应用312,并且,用户通过操作背景音操作控件,从音乐播放应用312中选择作为背景音乐播放的音频,音乐播放应用312产生连续的音频码流即第一音频数据流,输入到音频处理器32中。FIG. 3 is a schematic diagram of processing a background audio code stream at a user terminal according to an embodiment of the present disclosure. As shown in FIG. 3 , the client terminal 300 includes a main controller 31 and an audio processor 32 , wherein a live broadcast application 311 and a music playing application 312 are installed in the main controller 31 . After opening the live broadcast application, the user opens the background sound operation control by clicking the shortcut key, thereby calling the music playback application 312, and the user selects the audio played as the background music from the music playback application 312 by operating the background sound operation control, The music player application 312 generates a continuous audio code stream, that is, the first audio data stream, and inputs it into the audio processor 32 .
音频处理器32还与用户端300的各个音频采集接口连接(如USB(Universal Serial Bus,通用串行总线)音频输入接口,3.5mm接口麦克风音频输入接口,卡农头接口麦克风音频输入接口),以接收各个音频采集设备所接收到的主播音频或主播所处环境的音频即第二音频码流。 Audio processor 32 is also connected (as USB (Universal Serial Bus, Universal Serial Bus) audio input interface, 3.5mm interface microphone audio input interface, Canon head interface microphone audio input interface) with each audio collection interface of client 300, with The audio of the anchor received by each audio collection device or the audio of the environment where the anchor is located is the second audio stream.
音频处理器32将所接收到的第一音频码流和第二音频码流,输入到预设合成算法模型中,以确定合成音频码流,作为直播应用311的输入音频码流。The audio processor 32 inputs the received first audio code stream and the second audio code stream into a preset synthesis algorithm model to determine a synthesized audio code stream as an input audio code stream of the live application 311 .
需要说明的是,本领域技术人员可以根据实际需要选择具体的预设合成算法模型,本申请不做限定。It should be noted that those skilled in the art can select a specific preset synthesis algorithm model according to actual needs, which is not limited in this application.
本公开实施例提供的直播背景音处理方法,首先在满足预设显示条件时,在显示屏上显示背景音操作控件,然后响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中,最后音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。本公开实施例通过,解决了现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。达到了主播在直播终端上即可便捷快速地在直播音频流中添加背景音的技术效果,降低了直播的成本和技术门槛,提高了用户的使用体验感。In the live broadcast background sound processing method provided by the embodiments of the present disclosure, firstly, when the preset display conditions are met, the background sound operation control is displayed on the display screen, and then in response to the user's operation instruction on the background sound operation control, the corresponding The first audio code stream is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application. The second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment. The embodiment of the present disclosure solves the technical problem in the prior art that it can only rely on complex professional equipment and a team of professional technicians to add background sound when facing high-quality live broadcast requirements. It achieves the technical effect that the host can conveniently and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
为了进一步说明S201中的分屏显示和悬浮窗显示背景音操作控件,下面以两个具体的实施例来进行分别说明。In order to further illustrate the split-screen display in S201 and the background sound operation controls displayed in the floating window, the following two specific embodiments are used to describe them respectively.
参考图4,图4为本公开实施例提供的直播背景音处理方法的流程示意图二。本实施例的方法应用在用户端中,用户端的实现形式包括具有直播功能的设备,该具有直播功能的设备包括但不限于:直播一体式终端、移动终端和计算机设备,用户可以通过该直播一体式终端或移动终端直接完成直播,无需使用其它配套设备来对直播的音频或视频进行额外的处理。Referring to FIG. 4 , FIG. 4 is a second schematic flowchart of a method for processing live background sound provided by an embodiment of the present disclosure. The method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function. The device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated The live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
该直播背景音处理方法包括:The live broadcast background sound processing method includes:
S401:获取所述用户发出的开启指令。S401: Obtain an opening instruction issued by the user.
在本步骤中,开启指令包括:点击直播界面上的背景音开启的快捷键按钮或控件。In this step, the enabling instruction includes: clicking the shortcut key button or control for enabling the background sound on the live broadcast interface.
在本实施例中,用户在启动用户端上的直播应用后,用户点击开始直播,则直播应用开启用户端上的摄像头。而当用户端检测到摄像头被打开后,则在显示屏的边缘显示分屏控件。In this embodiment, after the user starts the live broadcast application on the user terminal, the user clicks to start live broadcast, and the live broadcast application starts the camera on the user terminal. And when the user terminal detects that the camera is turned on, the split-screen control is displayed on the edge of the display screen.
在一种可能的设计中,背景音开启控件包括用于分屏显示的悬浮窗控件即分屏控件,则用户点击分屏控件即可被用户端的主控制器检测到开启指令。In a possible design, the background sound activation control includes a floating window control for split-screen display, that is, a split-screen control, and the main controller at the user end can detect the activation instruction when the user clicks the split-screen control.
S402:根据开启指令所指示的预设显示方式,在直播界面上同时显示背景音操作控件。S402: Simultaneously display background sound operation controls on the live broadcast interface according to the preset display mode indicated by the activation instruction.
在本实施例中,预设显示方式包括:分屏显示方式。In this embodiment, the preset display mode includes: a split-screen display mode.
具体的,将所述用户端的至少一个显示屏分割为直播界面区以及背景音操作区。Specifically, at least one display screen of the user terminal is divided into a live interface area and a background sound operation area.
需要说明的是,直播界面区以及背景音操作区的边界包括:直线边界和/或曲线边界。It should be noted that the boundaries of the live interface area and the background sound operation area include: straight line boundaries and/or curved boundaries.
图5a-5c为本公开实施例提供的一种用户端的显示屏分屏显示背景音操作区的示意图。如图5a所示,用户在用户端500的显示屏上点击“分屏控件”后,显示屏就会分为直播界面区510和背景音操作区520。5a-5c are schematic diagrams of a display screen of a user terminal for displaying a background sound operation area in split screens according to an embodiment of the present disclosure. As shown in FIG. 5 a , after the user clicks “split screen control” on the display screen of the user terminal 500, the display screen will be divided into a live interface area 510 and a background sound operation area 520.
需要说明的是,背景音操作控件即背景音操作区520所显示的内容包括:所述音乐播放应用的操作界面的全部或部分内容。It should be noted that the content displayed in the background sound operation control, that is, the background sound operation area 520 includes: all or part of the content of the operation interface of the music playing application.
用户端500通过后台控制算法,将背景音操作区520的各个控件与音乐播放应用的操作界面进行映射对应。The user terminal 500 maps and corresponds each control in the background sound operation area 520 to the operation interface of the music playing application through the background control algorithm.
如图5a所示,背景音操作区520中包括:As shown in Figure 5a, the background sound operation area 520 includes:
歌单子控件521,用于切换歌单列表;The song list sub-control 521 is used to switch the song list;
搜索子控件522,用于搜索歌曲; Search sub-control 522 for searching songs;
菜单子控件523,用于显示音乐播放应用的更多功能选项;The menu sub-control 523 is used to display more functional options of the music player application;
操作子控件524,用于实现播放、暂停、停止、上一曲、下一曲等操作。The operation sub-control 524 is used to implement operations such as play, pause, stop, previous song, and next song.
当然可以理解的是,背景音操作区520还可以包括播放进度,音量控制,歌名,演唱者,作词/曲人等等信息控件。需要说明的是,本领域技术人员可以根据实际情况选择背景音操作区520的布局及实现样式,本申请不做限定。Of course, it can be understood that the background sound operation area 520 may also include information controls such as playback progress, volume control, song title, singer, lyricist/musician, and the like. It should be noted that those skilled in the art can select the layout and implementation style of the background sound operation area 520 according to the actual situation, which is not limited in this application.
在一种可能的设计中,分屏也可以是左右分屏,如图5b所示,分屏后直播界面区510和背景音操作区520所占显示屏的比例,可以根据实际需要进行调整,如:背景音操作区520占30%~50%。In a possible design, the split screen can also be a left and right split screen, as shown in Figure 5b, after the split screen, the ratio of the live broadcast interface area 510 and the background sound operation area 520 to the display screen can be adjusted according to actual needs. For example: the background sound operation area 520 occupies 30%-50%.
需要说明的是,分屏后直播界面区510和背景音操作区520的边界也可以是直线与曲线所围成的任意几何形状。如图5c所示,背景音操作区520可以有多个,各个子控件分布在多个背景音操作区520中。It should be noted that the boundary between the live broadcast interface area 510 and the background sound operation area 520 after screen splitting can also be any geometric shape surrounded by straight lines and curves. As shown in FIG. 5 c , there may be multiple background sound operation areas 520 , and each sub-control is distributed in multiple background sound operation areas 520 .
S403:获取用户对背景音操作控件的操作指令。S403: Obtain the user's operation instruction on the background sound operation control.
在本实施例中,用户对背景音操作控件的操作指令包括:用户在背景音操作区520中的各类操作指令,以及对背景音操作区520边界的操作指令。In this embodiment, the user's operation instructions on the background sound operation controls include: various operation instructions of the user in the background sound operation area 520 , and operation instructions on the boundary of the background sound operation area 520 .
用户在背景音操作区520中的各类操作指令,包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索中的至少一个。这些操作指令的具体作用,是在音乐播放应用中选择作为直播背景音的音频数据或音频码流。The user's various operating instructions in the background sound operation area 520, including: pause, play, previous song, next song, playlist, list or playlist cycle play, sequential play, music search, lyrics search, ranking list At least one of the search. The specific function of these operation instructions is to select the audio data or audio code stream as the background sound of the live broadcast in the music playing application.
对背景音操作区520边界的操作指令,包括:用户按预设方式改变直播界面区以及背景音操作区相邻的边界,以改变背景音操作控件的显示状态,显示状态包括:开启或关闭状态、扩大或缩小背景音操作区。The operation instructions on the border of the background sound operation area 520 include: the user changes the live interface area and the border adjacent to the background sound operation area in a preset manner to change the display state of the background sound operation control, and the display state includes: on or off , expand or shrink the background sound operation area.
具体的,如图5a和5b所示的背景音操作区520,用户可以通过点击边界上至少一点或点击所述边界上至少一个直线段,来实现收起背景音操作区520或改变背景音操作区520所占的比例大小。Specifically, in the background sound operation area 520 shown in Figures 5a and 5b, the user can close the background sound operation area 520 or change the background sound operation by clicking at least one point on the boundary or clicking at least one straight line segment on the boundary. The proportional size of the area 520.
在一种可能的设计中,用户按住边界上的至少一点不放,然后进行上下或左右滑动,即可实现改变背景音操作区520所占的比例大小。如图5a中,用户按住边界上的一点,然后进行上下或左右滑动,即可实现改变背景音操作区520所占的比例大小。In a possible design, the user presses and holds at least one point on the boundary, and then slides up and down or left and right to change the proportion of the background sound operation area 520 . As shown in FIG. 5 a , the user presses a point on the boundary, and then slides up and down or left and right to change the proportion of the background sound operation area 520 .
这里需要说明的是,左右滑动时,也可以理解为用户手指沿着边界所指示的路径进行滑动,从左到右滑动时,代表扩大背景音操作区520所占的比例大小;从右到左滑动时,代表缩小背景音操作区520所占的比例大小。What needs to be explained here is that when sliding left and right, it can also be understood that the user's finger slides along the path indicated by the boundary. When sliding from left to right, it means expanding the proportion of the background sound operation area 520; from right to left When sliding, it means reducing the proportion of the background sound operation area 520 .
如图5c,这种边界既有直线,又有曲线的情况,可以设定不同的操作方式,如当用户按住边界的直线段,以垂直于直线段的方向滑动,则在垂直于直线段的方向扩大或缩小背景音操作区520。As shown in Figure 5c, this kind of boundary has both straight lines and curves. Different operation methods can be set. For example, when the user presses the straight line segment of the boundary and slides in the direction perpendicular to the straight line segment, the Expand or shrink the background sound operation area 520 in the direction of .
当用户按住曲线边界,并在曲线上被按住的点所在的切线方向,或者切线的垂直方向上滑动时,按照对应方向来扩大或缩小背景音操作区520。When the user presses the boundary of the curve and slides in the direction of the tangent of the pressed point on the curve, or in the vertical direction of the tangent, the background sound operation area 520 is expanded or reduced according to the corresponding direction.
当用户沿曲线边界或直线边界滑动时,也可实现改变背景音操作区520所占的比例大小。When the user slides along the curved boundary or the straight boundary, the proportion of the background sound operation area 520 can also be changed.
S404:将音乐播放应用中对应的第一音频码流发送到音频处理器中。S404: Send the corresponding first audio code stream in the music playing application to the audio processor.
用户在背景音操作区520中选中并开始播放某个音频时,音乐播放应用中对应的将该音频数据播放时产生的第一音频码流发送到音频处理器中。When the user selects and starts to play a certain audio in the background sound operation area 520, the corresponding first audio code stream generated when the audio data is played in the music playing application is sent to the audio processor.
S405:音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。S405: The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine the input audio code stream of the live application.
在本步骤中,第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。In this step, the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
本步骤的具体原理及名词解释参考S203,在此不再赘述。Please refer to S203 for the specific principle and terminology explanation of this step, which will not be repeated here.
本公开实施例提供的直播背景音处理方法,首先在满足预设显示条件时,在显示屏上以分屏的形式显示背景音操作控件,然后响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中,最后音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。本公开实施例解决了现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。达到了主播在直播终端上即可便捷快速地在直播音频流中添加背景音的技术效果,降低了直播的成本和技术门槛,提高了用户的使用体验感。In the live broadcast background sound processing method provided by the embodiments of the present disclosure, first, when the preset display conditions are met, the background sound operation control is displayed on the display screen in a split-screen form, and then in response to the user's operation instruction on the background sound operation control, the The corresponding first audio code stream in the music playing application is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application. The second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment. The embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can easily and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
图6为本公开实施例提供的直播背景音处理方法的流程示意图三。本实施例的方法应用在用户端中,用户端的实现形式包括具有直播功能的设备,该具有直播功能的设备包括但不限于:直播一体式终端、移动终端和计算机设备,用户可以通过该直播一体式终端或移动终端直接完成直播,无需使用其它配套设备来对直播的音频或视频进行额外的处理。FIG. 6 is a third schematic flowchart of a method for processing live broadcast background sound provided by an embodiment of the present disclosure. The method of this embodiment is applied to the user end, and the implementation form of the user end includes a device with a live broadcast function. The device with a live broadcast function includes but is not limited to: a live broadcast integrated terminal, a mobile terminal, and a computer device. Users can use the live broadcast integrated The live broadcast can be directly completed by a mobile terminal or a mobile terminal, without using other supporting equipment to perform additional processing on the live audio or video.
该直播背景音处理方法包括:The live broadcast background sound processing method includes:
S601:获取所述用户发出的开启指令。S601: Obtain an opening instruction issued by the user.
本步骤的具体原理及名词解释参考S401,在此不再赘述。Please refer to S401 for the specific principles and terminology explanation of this step, which will not be repeated here.
S602:根据开启指令,以悬浮窗的预设显示方式,在直播界面上同时显示背景音操作控件。S602: According to the opening instruction, simultaneously display the background sound operation controls on the live broadcast interface in the preset display mode of the floating window.
在本实施例中,背景音操作控件以悬浮窗的形式叠加在直播界面上显示。In this embodiment, the background sound operation control is superimposed and displayed on the live broadcast interface in the form of a floating window.
图7a-7e为本公开实施例提供的一种以悬浮窗的形式显示背景音操作控件的示意图。如图7a所示,用户在用户端700上点击了“背景音乐”快捷键后,在直播界面上叠加显示背景音操作控件即悬浮窗710。7a-7e are schematic diagrams of displaying background sound operation controls in the form of a floating window according to an embodiment of the present disclosure. As shown in FIG. 7 a , after the user clicks the shortcut key of “Background Music” on the client terminal 700 , the floating window 710 , which is the background sound operation control, is superimposed and displayed on the live broadcast interface.
悬浮窗710中包括:歌曲图标711、搜索控件712、菜单控件713、播放方式控件714以及缩放控件715。The floating window 710 includes: a song icon 711 , a search control 712 , a menu control 713 , a play mode control 714 and a zoom control 715 .
在一种可能的设计中,歌曲图标711用于显示最近一次播放的歌曲的封面图片。In a possible design, the song icon 711 is used to display the cover image of the last played song.
搜索控件712,用于搜索歌曲、歌词、歌手、专辑、歌单、排行榜等音频相关信息,以供用户筛选出直播时需要播放的背景音乐。The search control 712 is used to search for audio-related information such as songs, lyrics, singers, albums, playlists, rankings, etc., so that the user can filter out the background music that needs to be played during the live broadcast.
S603:响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中。S603: Send the corresponding first audio code stream in the music playing application to the audio processor in response to the user's operation instruction on the background sound operation control.
在本实施例中,操作指令是指用户对悬浮窗上各个子控件的操作,具体的,如图7a-7d所示:In this embodiment, the operation instruction refers to the user's operation on each sub-control on the floating window, specifically, as shown in Figures 7a-7d:
当用户在图7a所示的悬浮窗710上点击了缩放控件715时,悬浮窗710就只显示歌曲图标711,以减少悬浮窗710对直播界面的视觉影响。When the user clicks the zoom control 715 on the floating window 710 shown in FIG. 7a, the floating window 710 only displays the song icon 711, so as to reduce the visual impact of the floating window 710 on the live interface.
当用户在图7a所示的悬浮窗710上点击了菜单控件713时,悬浮窗710将转变为图7b所示的形式,即在下方显示歌单列表,用户再选中了某个歌单后,如歌单1,则开始播放该歌单中的歌曲,如图7c所示,该歌曲可以是随机从歌单中选择的,也可以是该歌单中预设位置的歌曲,如第一首歌曲,也可以是该歌单最近一次所播放的歌曲。When the user clicks the menu control 713 on the floating window 710 shown in Fig. 7a, the floating window 710 will change to the form shown in Fig. Such as song list 1, then begin to play the song in this song list, as shown in Figure 7c, this song can be selected from the song list at random, also can be the song of preset position in this song list, as the first The song can also be the song played last time in the playlist.
此时,若用户再次点击菜单控件713时,歌单列表将会被收起,如图7d所示。At this time, if the user clicks the menu control 713 again, the play list will be collapsed, as shown in FIG. 7d.
当用户在搜索控件712上输入了搜索关键字/词,并指示执行搜索后(如按回车,或点击“放大镜状按钮”),如图7e所示,在悬浮窗710的下方将展开搜索结果列表,用户点击任意一个搜索结果,则切换播放该搜索结果所对应的歌曲。When the user enters the search keyword/word on the search control 712 and instructs to perform the search (such as pressing Enter, or clicking the "magnifying glass button"), as shown in Figure 7e, the search will be launched below the floating window 710 In the result list, if the user clicks on any search result, the song corresponding to the search result will be switched and played.
当悬浮窗710开始播放歌曲后,用户端的主控制器将音乐播放应用中对应的第一音频数据流发送给音频处理器。After the floating window 710 starts to play the song, the main controller of the user terminal sends the corresponding first audio data stream in the music playing application to the audio processor.
还需要说明的是,悬浮窗也可以设计为显示与音乐播放应用相同的操作界面,以使得主播不用改变对音乐播放应用的使用习惯,使用起来更加容易上手,提高用户体验。It should also be noted that the floating window can also be designed to display the same operation interface as the music player application, so that the host does not need to change the usage habits of the music player application, making it easier to use and improving user experience.
S604、音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。S604. The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream, so as to determine an input audio code stream of the live application.
在本步骤中,第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。In this step, the second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment.
本步骤的具体原理及名词解释参考S203,在此不再赘述。Please refer to S203 for the specific principle and terminology explanation of this step, which will not be repeated here.
本公开实施例提供的直播背景音处理方法,首先在满足预设显示条件时,在显示屏上以悬浮窗的形式显示背景音操作控件,然后响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理器中,最后音频处理器将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流。第二音频码流包括用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。本公开实施例解决了现有技术在面对高质量的直播需求时,存在只能够依赖复杂的专业设备和专业技术人员团队来添加背景音的技术问题。达到了主播在直播终端上即可便捷快速地在直播音频流中添加背景音的技术效果,降低了直播的成本和技术门槛,提高了用户的使用体验感。In the live broadcast background sound processing method provided by the embodiments of the present disclosure, first, when the preset display conditions are met, the background sound operation control is displayed on the display screen in the form of a floating window, and then in response to the user's operation instruction on the background sound operation control, the The corresponding first audio code stream in the music playing application is sent to the audio processor, and finally the audio processor merges and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live application. The second audio code stream includes the sound signal sent by the anchor received by the client through the audio collection device, and/or the sound signal in the live broadcast environment. The embodiments of the present disclosure solve the technical problem in the prior art that the background sound can only be added by relying on complex professional equipment and a team of professional technicians in the face of high-quality live broadcast requirements. It achieves the technical effect that the host can conveniently and quickly add background sound to the live audio stream on the live broadcast terminal, reduces the cost and technical threshold of live broadcast, and improves the user experience.
对应于上文实施例的直播背景音处理方法,图8为本公开实施例提供的直播背景音处理装置800的结构框图。为了便于说明,仅示出了与本公开实施例相关的部分。参照图8,装置包括:Corresponding to the method for processing live background sound in the above embodiments, FIG. 8 is a structural block diagram of an apparatus 800 for processing live background sound provided in an embodiment of the present disclosure. For ease of description, only the parts related to the embodiments of the present disclosure are shown. Referring to Figure 8, the device includes:
主控模块801以及音频处理模块802,主控模块801中安装有直播应用以及音乐播放应用; Main control module 801 and audio processing module 802, live application and music playing application are installed in the main control module 801;
其中,主控模块801,用于在满足预设显示条件时,在显示屏上显示背景音操作控件;Wherein, the main control module 801 is used to display background sound operation controls on the display screen when the preset display conditions are met;
主控模块801,还用于响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理模块802中;The main control module 801 is also configured to send the corresponding first audio code stream in the music playing application to the audio processing module 802 in response to the user's operation instruction on the background sound operation control;
音频处理模块802,用于将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流,第二音频码流包括通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processing module 802 is configured to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application. The second audio code stream includes the sound from the host received through the audio collection device signal, and/or, sound signal in a live environment.
在本公开的一个实施例中,主控模块801,用于:In one embodiment of the present disclosure, the main control module 801 is configured to:
获取用户发出的开启指令;Obtain the opening instruction issued by the user;
根据开启指令所指示的预设显示方式,在直播界面上同时显示背景音操作控件。According to the preset display mode indicated by the activation instruction, the background sound operation controls are simultaneously displayed on the live broadcast interface.
在本公开的一个实施例中,预设显示方式包括:分屏以及悬浮窗中的至少一种方式。In an embodiment of the present disclosure, the preset display mode includes: at least one mode of split screen and floating window.
在本公开的一个实施例中,背景音操作控件所显示的内容包括:音乐播放应用的操作界面的全部或部分内容。In an embodiment of the present disclosure, the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
在本公开的一个实施例中,当预设显示方式为分屏时,主控模块801,用于将用户端的至少一个显示屏分割为直播界面区以及背景音操作区;In one embodiment of the present disclosure, when the preset display mode is split screen, the main control module 801 is configured to divide at least one display screen of the user terminal into a live interface area and a background sound operation area;
主控模块801,还用于获取用户按预设方式改变直播界面区以及背景音操作区相邻的边界,以改变背景音操作控件的显示状态,显示状态包括:开启或关闭状态、扩大或缩小背景音操作区。The main control module 801 is also used to obtain the user to change the adjacent boundary of the live interface area and the background sound operation area in a preset manner, so as to change the display state of the background sound operation control. The display state includes: open or closed state, enlargement or reduction Background sound operation area.
在本公开的一个实施例中,预设方式包括:点击边界上至少一点、点击边界上至少一个直线或曲线段、按预设方向和/或预设路径滑动边界。In an embodiment of the present disclosure, the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curved segment on the boundary, and sliding the boundary along a preset direction and/or a preset path.
在本公开的一个实施例中,边界包括直线边界和/或曲线边界。In one embodiment of the present disclosure, the boundary includes a straight line boundary and/or a curved boundary.
在本公开的一个实施例中,操作指令包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索中的至少一个。In an embodiment of the present disclosure, the operation instructions include: pause, play, previous song, next song, playlist, list or playlist loop play, sequential play, music search, lyrics search, list search at least one.
在本公开的一个实施例中,音频处理模块802,用于利用预设合成算法,将第一音频数据流与第二音频码流合成为输入音频码流;In one embodiment of the present disclosure, the audio processing module 802 is configured to use a preset synthesis algorithm to synthesize the first audio data stream and the second audio code stream into an input audio code stream;
主控模块801,还用于通过直播应用接收输入音频码流,并对输入音频码流进行编码,以确定输出音频码流;将输出音频码流从网络接口中发送到网络服务器中,以使网络服务器将输出音频码流发送给各个观众所使用的直播观看终端。The main control module 801 is also used to receive the input audio code stream through the live broadcast application, and encode the input audio code stream to determine the output audio code stream; send the output audio code stream from the network interface to the network server, so that The network server sends the output audio code stream to the live viewing terminal used by each audience.
本实施例提供的装置,可用于执行上述任意一个方法实施例,其实现原理和技术效果类似,本实施例此处不再赘述。The device provided in this embodiment can be used to execute any one of the above method embodiments, and its implementation principles and technical effects are similar, so this embodiment will not repeat them here.
参考图9,其示出了适于用来实现本公开实施例的电子设备的结构示意图,该电子设备900可以为终端设备或服务器。其中,终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、个人数字助理(Personal Digital Assistant,简称PDA)、平板电脑(Portable Android Device,简称PAD)、便携式多媒体播放器(Portable Media Player,简称PMP)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV(Television)、台式计算机等等的固定终端。图9示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Referring to FIG. 9 , it shows a schematic structural diagram of an electronic device suitable for implementing the embodiments of the present disclosure. The electronic device 900 may be a terminal device or a server. Among them, the terminal equipment may include but not limited to mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (Personal Digital Assistant, PDA for short), tablet computers (Portable Android Device, PAD for short), portable multimedia players (Portable Media Player, PMP for short), mobile terminals such as vehicle-mounted terminals (such as vehicle-mounted navigation terminals), and fixed terminals such as digital TV (Television), desktop computers, etc. The electronic device shown in FIG. 9 is only an example, and should not limit the functions and application scope of the embodiments of the present disclosure.
如图9所示,电子设备900可以包括处理装置(例如中央处理器、图形处理器等)901,其可以根据存储在只读存储器(Read Only Memory,简称ROM)902中的程序或者从存储装 置908加载到随机访问存储器(Random Access Memory,简称RAM)903中的程序而执行各种适当的动作和处理。在RAM 903中,还存储有电子设备900操作所需的各种程序和数据。处理装置901、ROM 902以及RAM 903通过总线904彼此相连。输入/输出(Input/Output,简称I/O)接口905也连接至总线904。As shown in Figure 9, an electronic device 900 may include a processing device (such as a central processing unit, a graphics processing unit, etc.) 908 loads the programs in the random access memory (Random Access Memory, RAM for short) 903 to execute various appropriate actions and processes. In the RAM 903, various programs and data necessary for the operation of the electronic device 900 are also stored. The processing device 901, ROM 902, and RAM 903 are connected to each other through a bus 904. An input/output (Input/Output, I/O for short) interface 905 is also connected to the bus 904 .
通常,以下装置可以连接至I/O接口905:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置909;包括例如液晶显示器(Liquid Crystal Display,简称LCD)、扬声器、振动器等的输出装置907;包括例如磁带、硬盘等的存储装置908;以及通信装置909。通信装置909可以允许电子设备900与其他设备进行无线或有线通信以交换数据。虽然图9示出了具有各种装置的电子设备900,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Generally, the following devices can be connected to the I/O interface 905: an input device 909 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; ), a speaker, a vibrator, etc.; a storage device 908 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 909. The communication means 909 may allow the electronic device 900 to perform wireless or wired communication with other devices to exchange data. While FIG. 9 shows electronic device 900 having various means, it is to be understood that implementing or having all of the means shown is not a requirement. More or fewer means may alternatively be implemented or provided.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置909从网络上被下载和安装,或者从存储装置908被安装,或者从ROM 902被安装。在该计算机程序被处理装置901执行时,执行本公开实施例的方法中限定的上述功能。In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts can be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product, which includes a computer program carried on a computer-readable medium, where the computer program includes program codes for executing the methods shown in the flowcharts. In such an embodiment, the computer program may be downloaded and installed from a network via communication means 909, or from storage means 908, or from ROM 902. When the computer program is executed by the processing device 901, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,简称EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read Only Memory,简称CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(Radio Frequency,射频)等等,或者上述的任意合适的组合。It should be noted that the above-mentioned computer-readable medium in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two. A computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable Read Only Memory (Erasable Programmable Read Only Memory, referred to as EPROM or flash memory), optical fiber, portable compact disk read only memory (Compact Disc Read Only Memory, referred to as CD-ROM), optical storage device, magnetic storage device, or any of the above the right combination. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. In the present disclosure, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device . The program code contained on the computer readable medium can be transmitted by any appropriate medium, including but not limited to: electric wire, optical cable, RF (Radio Frequency, radio frequency), etc., or any suitable combination of the above.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备执行上述实施例所示的方法。The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device is made to execute the methods shown in the above-mentioned embodiments.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用 户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(Local Area Network,简称LAN)或广域网(Wide Area Network,简称WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for carrying out the operations of the present disclosure can be written in one or more programming languages, or combinations thereof, including object-oriented programming languages—such as Java, Smalltalk, C++, and conventional Procedural Programming Language - such as "C" or a similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In cases involving a remote computer, the remote computer can be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or it can be connected to an external A computer (connected via the Internet, eg, using an Internet service provider).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. It should also be noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定,例如,第一获取单元还可以被描述为“获取至少两个网际协议地址的单元”。The units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of the unit does not constitute a limitation of the unit itself under certain circumstances, for example, the first obtaining unit may also be described as "a unit for obtaining at least two Internet Protocol addresses".
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(Field Programmable Gate Array,简称FPGA)、专用集成电路(Application Specific Integrated Circuit,简称ASIC)、专用标准产品(Application Specific Standard Parts,简称ASSP)、片上系统(System on Chip,简称SOC)、复杂可编程逻辑设备(Complex Programmable Logic Device,简称CPLD)等等。The functions described herein above may be performed at least in part by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Array (Field Programmable Gate Array, FPGA for short), Application Specific Integrated Circuit (ASIC for short), Application Specific Standard Products ( Application Specific Standard Parts (ASSP for short), System on Chip (SOC for short), Complex Programmable Logic Device (CPLD for short), etc.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device. A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
本公开实施例中还提供了一种计算机程序产品,包括计算机程序,该计算机程序被处理器执行时实现上述各实施例中的方法。Embodiments of the present disclosure also provide a computer program product, including a computer program, which implements the methods in the foregoing embodiments when the computer program is executed by a processor.
本公开实施例中还提供了一种直播一体机或直播一体式设备,包括图9所对应的电子设备。还需要说明的是,该直播一体机的控制电路包括:主控模块和音频处理模块,其中,主控模块上安装了直播应用以及音频播放应用,音频处理模块用于音频播放应用中播放的音频数据码流合成到音频采集设备(如麦克风)采集到的主播音频中,以形成直播应用的输入音频。An embodiment of the present disclosure also provides a live broadcast integrated machine or a live broadcast integrated device, including the electronic device corresponding to FIG. 9 . It should also be noted that the control circuit of the live broadcast all-in-one machine includes: a main control module and an audio processing module, wherein the main control module is installed with a live broadcast application and an audio playback application, and the audio processing module is used for the audio played in the audio playback application. The data code stream is synthesized into the host audio collected by the audio collection device (such as a microphone) to form the input audio of the live application.
第一方面,根据本公开的一个或多个实施例,提供了一种直播背景音处理方法,应用于用户端,所述用户端包括:主控制器以及音频处理器,所述主控制器安装有直播应用以及音乐播放应用,所述方法包括:In the first aspect, according to one or more embodiments of the present disclosure, there is provided a live broadcast background sound processing method, which is applied to a user end, and the user end includes: a main controller and an audio processor, and the main controller installs There are live broadcast applications and music playback applications, and the methods include:
在满足预设显示条件时,在显示屏上显示背景音操作控件;When the preset display conditions are met, the background sound operation controls are displayed on the display screen;
响应于用户对所述背景音操作控件的操作指令,将所述音乐播放应用中对应的第一音频码流发送到所述音频处理器中;Responding to the user's operation instruction on the background sound operation control, sending the corresponding first audio code stream in the music playing application to the audio processor;
所述音频处理器将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,所述第二音频码流包括所述用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application, and the second audio code stream includes The device receives the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
根据本公开的一个或多个实施例,所述在满足预设显示条件时,在显示屏上显示背景音操作控件,包括:According to one or more embodiments of the present disclosure, when the preset display condition is met, displaying the background sound operation control on the display screen includes:
获取所述用户发出的开启指令;Obtain an opening instruction issued by the user;
根据所述开启指令所指示的预设显示方式,在直播界面上同时显示所述背景音操作控件。According to the preset display mode indicated by the opening instruction, the background sound operation control is simultaneously displayed on the live broadcast interface.
根据本公开的一个或多个实施例,所述预设显示方式包括:分屏以及悬浮窗中的至少一种方式。According to one or more embodiments of the present disclosure, the preset display manner includes: at least one of split screen and floating window.
根据本公开的一个或多个实施例,所述背景音操作控件所显示的内容包括:所述音乐播放应用的操作界面的全部或部分内容。According to one or more embodiments of the present disclosure, the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
根据本公开的一个或多个实施例,当所述预设显示方式为所述分屏时,所述在直播界面上同时显示所述背景音操作控件,包括:According to one or more embodiments of the present disclosure, when the preset display mode is the split screen, the simultaneously displaying the background sound operation control on the live broadcast interface includes:
将所述用户端的至少一个显示屏分割为直播界面区以及背景音操作区;Divide at least one display screen of the user terminal into a live interface area and a background sound operation area;
所述操作指令包括:The operating instructions include:
所述用户按预设方式改变所述直播界面区以及所述背景音操作区相邻的边界,以改变所述背景音操作控件的显示状态,所述显示状态包括:开启或关闭状态、扩大或缩小所述背景音操作区。The user changes the live broadcast interface area and the border adjacent to the background sound operation area in a preset manner to change the display state of the background sound operation control, and the display state includes: open or closed state, enlarged or Reduce the background sound operation area.
根据本公开的一个或多个实施例,所述预设方式包括:点击所述边界上至少一点、点击所述边界上至少一个直线或曲线段、按预设方向和/或预设路径滑动所述边界。According to one or more embodiments of the present disclosure, the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curve segment on the boundary, sliding all stated boundary.
根据本公开的一个或多个实施例,所述边界包括直线边界和/或曲线边界。According to one or more embodiments of the present disclosure, the boundary includes a straight line boundary and/or a curved boundary.
根据本公开的一个或多个实施例,所述操作指令包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索中的至少一个。According to one or more embodiments of the present disclosure, the operation instructions include: pause, play, previous song, next song, song list, list or song list loop play, sequential play, music search, lyrics search, ranking list At least one of the single searches.
根据本公开的一个或多个实施例,所述音频处理器将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,包括:According to one or more embodiments of the present disclosure, the audio processor concatenates the first audio data stream and the second audio code stream to determine the input audio code stream of the live application, including:
利用预设合成算法,将所述第一音频数据流与第二音频码流合成为所述输入音频码流;Synthesizing the first audio data stream and the second audio code stream into the input audio code stream by using a preset synthesis algorithm;
所述直播应用接收所述输入音频码流,并对所述输入音频码流进行编码,以确定输出音频码流;The live application receives the input audio code stream, and encodes the input audio code stream to determine the output audio code stream;
将所述输出音频码流从网络接口中发送到网络服务器中,以使所述网络服务器将所述输出音频码流发送给各个观众所使用的直播观看终端。The output audio code stream is sent from the network interface to the network server, so that the network server sends the output audio code stream to the live viewing terminal used by each audience.
第二方面,根据本公开的一个或多个实施例,提供了一种直播背景音处理装置,包括:In the second aspect, according to one or more embodiments of the present disclosure, a live broadcast background sound processing device is provided, including:
主控模块以及音频处理模块,主控模块中安装有直播应用以及音乐播放应用;The main control module and the audio processing module, the live application and the music playing application are installed in the main control module;
其中,主控模块,用于在满足预设显示条件时,在显示屏上显示背景音操作控件;Wherein, the main control module is used to display background sound operation controls on the display screen when the preset display conditions are met;
主控模块,还用于响应于用户对背景音操作控件的操作指令,将音乐播放应用中对应的第一音频码流发送到音频处理模块中;The main control module is also used to send the corresponding first audio code stream in the music playing application to the audio processing module in response to the user's operation instruction on the background sound operation control;
音频处理模块,用于将第一音频数据流与第二音频码流进行汇流合成,以确定直播应用的输入音频码流,第二音频码流包括通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processing module is used to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application. The second audio code stream includes the sound signal sent by the anchor received through the audio collection device , and/or, the sound signal in the live environment.
根据本公开的一个或多个实施例,主控模块,用于:According to one or more embodiments of the present disclosure, the main control module is configured to:
获取用户发出的开启指令;Obtain the opening instruction issued by the user;
根据开启指令所指示的预设显示方式,在直播界面上同时显示背景音操作控件。According to the preset display mode indicated by the activation instruction, the background sound operation controls are simultaneously displayed on the live broadcast interface.
根据本公开的一个或多个实施例,预设显示方式包括:分屏以及悬浮窗中的至少一种方式。According to one or more embodiments of the present disclosure, the preset display manner includes: at least one of split screen and floating window.
根据本公开的一个或多个实施例,背景音操作控件所显示的内容包括:音乐播放应用的操作界面的全部或部分内容。According to one or more embodiments of the present disclosure, the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
根据本公开的一个或多个实施例,当预设显示方式为分屏时,主控模块,用于将用户端的至少一个显示屏分割为直播界面区以及背景音操作区;According to one or more embodiments of the present disclosure, when the preset display mode is split screen, the main control module is configured to divide at least one display screen of the client into a live interface area and a background sound operation area;
主控模块,还用于获取用户按预设方式改变直播界面区以及背景音操作区相邻的边界,以改变背景音操作控件的显示状态,显示状态包括:开启或关闭状态、扩大或缩小背景音操作区。The main control module is also used to obtain the user to change the adjacent boundary of the live interface area and the background sound operation area according to the preset method, so as to change the display state of the background sound operation control. The display state includes: open or close state, expand or reduce the background sound operation area.
根据本公开的一个或多个实施例,预设方式包括:点击边界上至少一点、点击边界上至少一个直线或曲线段、按预设方向和/或预设路径滑动边界。According to one or more embodiments of the present disclosure, the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curved segment on the boundary, and sliding the boundary along a preset direction and/or a preset path.
根据本公开的一个或多个实施例,边界包括直线边界和/或曲线边界。According to one or more embodiments of the present disclosure, the boundary includes a straight line boundary and/or a curved boundary.
根据本公开的一个或多个实施例,操作指令包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索中的至少一个。According to one or more embodiments of the present disclosure, the operation instructions include: pause, play, previous song, next song, playlist, list or playlist loop play, sequential play, music search, lyrics search, list search at least one of the
根据本公开的一个或多个实施例,音频处理模块,用于利用预设合成算法,将第一音频数据流与第二音频码流合成为输入音频码流;According to one or more embodiments of the present disclosure, the audio processing module is configured to use a preset synthesis algorithm to synthesize the first audio data stream and the second audio code stream into an input audio code stream;
主控模块,还用于通过直播应用接收输入音频码流,并对输入音频码流进行编码,以确定输出音频码流;将输出音频码流从网络接口中发送到网络服务器中,以使网络服务器将输出音频码流发送给各个观众所使用的直播观看终端。The main control module is also used to receive the input audio code stream through the live broadcast application, and encode the input audio code stream to determine the output audio code stream; send the output audio code stream from the network interface to the network server, so that the network The server sends the output audio code stream to the live viewing terminals used by each viewer.
第三方面,根据本公开的一个或多个实施例,提供了一种电子设备,包括:In a third aspect, according to one or more embodiments of the present disclosure, an electronic device is provided, including:
至少一个处理器和存储器;at least one processor and memory;
存储器存储计算机程序;the memory stores the computer program;
至少一个处理器执行存储器存储的计算机程序,使得至少一个处理器执行如上第一方面以及第一方面各种可能的设计的直播背景音处理方法。At least one processor executes the computer program stored in the memory, so that the at least one processor executes the live broadcast background sound processing method of the above first aspect and various possible designs of the first aspect.
第四方面,根据本公开的一个或多个实施例,提供了一种直播一体机,包括:第三方面各种可能的设计的电子设备。In a fourth aspect, according to one or more embodiments of the present disclosure, there is provided a live streaming all-in-one machine, including: electronic devices of various possible designs in the third aspect.
第五方面,根据本公开的一个或多个实施例,提供了一种计算机可读存储介质,计算机可读存储介质中存储有计算机程序,当处理器执行计算机程序时,实现如上第一方面以及第一方面各种可能的设计的直播背景音处理方法。In the fifth aspect, according to one or more embodiments of the present disclosure, there is provided a computer-readable storage medium, in which a computer program is stored, and when the processor executes the computer program, the above first aspect and The first aspect is various possible designs of live background sound processing methods.
第六方面,根据本公开的一个或多个实施例,提供了一种计算机程序产品,包括计算机程序,当处理器执行计算机程序时,实现如上第一方面各种可能的设计的直播背景音处理方法。In the sixth aspect, according to one or more embodiments of the present disclosure, a computer program product is provided, including a computer program. When the processor executes the computer program, the live background sound processing of various possible designs of the above first aspect can be realized. method.
第七方面,根据本公开的一个或多个实施例,提供了一种计算机程序,当处理器执行计算机程序时,实现如上第一方面各种可能的设计的直播背景音处理方法。In the seventh aspect, according to one or more embodiments of the present disclosure, a computer program is provided. When the processor executes the computer program, various possible designs of the live background sound processing method in the above first aspect are realized.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a preferred embodiment of the present disclosure and an illustration of the applied technical principles. Those skilled in the art should understand that the disclosure scope involved in this disclosure is not limited to the technical solution formed by the specific combination of the above-mentioned technical features, but also covers the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of equivalent features. For example, a technical solution formed by replacing the above-mentioned features with (but not limited to) technical features with similar functions disclosed in this disclosure.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。In addition, while operations are depicted in a particular order, this should not be understood as requiring that the operations be performed in the particular order shown or performed in sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, while the above discussion contains several specific implementation details, these should not be construed as limitations on the scope of the disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims.

Claims (15)

  1. 一种直播背景音处理方法,其特征在于,应用于用户端,所述用户端包括:主控制器以及音频处理器,所述主控制器安装有直播应用以及音乐播放应用,所述方法包括:A live broadcast background sound processing method, characterized in that it is applied to a user end, the user end includes: a main controller and an audio processor, the main controller is installed with a live broadcast application and a music playback application, and the method includes:
    在满足预设显示条件时,在显示屏上显示背景音操作控件;When the preset display conditions are met, the background sound operation controls are displayed on the display screen;
    响应于用户对所述背景音操作控件的操作指令,将所述音乐播放应用中对应的第一音频码流发送到所述音频处理器中;Responding to the user's operation instruction on the background sound operation control, sending the corresponding first audio code stream in the music playing application to the audio processor;
    所述音频处理器将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,所述第二音频码流包括所述用户端通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processor concatenates and synthesizes the first audio data stream and the second audio code stream to determine the input audio code stream of the live broadcast application, and the second audio code stream includes The device receives the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
  2. 根据权利要求1所述的直播背景音处理方法,其特征在于,所述在满足预设显示条件时,在显示屏上显示背景音操作控件,包括:The method for processing live background sound according to claim 1, wherein the displaying the background sound operation control on the display screen when the preset display condition is satisfied comprises:
    获取所述用户发出的开启指令;Obtain an opening instruction issued by the user;
    根据所述开启指令所指示的预设显示方式,在直播界面上同时显示所述背景音操作控件。According to the preset display mode indicated by the opening instruction, the background sound operation control is simultaneously displayed on the live broadcast interface.
  3. 根据权利要求2所述的直播背景音处理方法,其特征在于,所述预设显示方式包括:分屏以及悬浮窗中的至少一种方式。The method for processing live background sound according to claim 2, wherein the preset display mode includes at least one of split screen and floating window.
  4. 根据权利要求1-3中任一项所述的直播背景音处理方法,其特征在于,所述背景音操作控件所显示的内容包括:所述音乐播放应用的操作界面的全部或部分内容。The method for processing live background sound according to any one of claims 1-3, wherein the content displayed by the background sound operation control includes: all or part of the content of the operation interface of the music playing application.
  5. 根据权利要求3所述的直播背景音处理方法,其特征在于,当所述预设显示方式为所述分屏时,所述在直播界面上同时显示所述背景音操作控件,包括:The live broadcast background sound processing method according to claim 3, wherein when the preset display mode is the split screen, the simultaneous display of the background sound operation controls on the live broadcast interface includes:
    将所述用户端的至少一个显示屏分割为直播界面区以及背景音操作区;Divide at least one display screen of the user terminal into a live interface area and a background sound operation area;
    所述操作指令包括:The operating instructions include:
    所述用户按预设方式改变所述直播界面区以及所述背景音操作区相邻的边界,以改变所述背景音操作控件的显示状态,所述显示状态包括:开启或关闭状态、扩大或缩小所述背景音操作区。The user changes the live broadcast interface area and the border adjacent to the background sound operation area in a preset manner to change the display state of the background sound operation control, and the display state includes: open or closed state, enlarged or Reduce the background sound operation area.
  6. 根据权利要求5所述的直播背景音处理方法,其特征在于,所述预设方式包括:点击所述边界上至少一点、点击所述边界上至少一个直线或曲线段、按预设方向和/或预设路径滑动所述边界。The live broadcast background sound processing method according to claim 5, wherein the preset method includes: clicking at least one point on the boundary, clicking at least one straight line or curve segment on the boundary, pressing a preset direction and/or Or slide the boundary along a preset path.
  7. 根据权利要求5或6所述的直播背景音处理方法,其特征在于,所述边界包括直线边界和/或曲线边界。The live broadcast background sound processing method according to claim 5 or 6, wherein the boundary includes a straight line boundary and/or a curved boundary.
  8. 根据权利要求1-7中任一项所述的直播背景音处理方法,其特征在于,所述操作指令包括:暂停、播放、上一曲、下一曲、歌单、列表或歌单循环播放、顺序播放、音乐搜索、歌词搜索、排行榜单搜索中的至少一个。The live broadcast background sound processing method according to any one of claims 1-7, wherein the operation instructions include: pause, play, previous song, next song, song list, list or song list loop playback , sequence play, music search, lyrics search, and list search at least one.
  9. 根据权利要求1-8中任一项所述的直播背景音处理方法,其特征在于,所述音频处理器将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,包括:The live broadcast background sound processing method according to any one of claims 1-8, wherein the audio processor combines and synthesizes the first audio data stream and the second audio code stream to determine the The input audio code stream of the live application, including:
    利用预设合成算法,将所述第一音频数据流与第二音频码流合成为所述输入音频码流;Synthesizing the first audio data stream and the second audio code stream into the input audio code stream by using a preset synthesis algorithm;
    所述直播应用接收所述输入音频码流,并对所述输入音频码流进行编码,以确定输 出音频码流;The live application receives the input audio code stream, and encodes the input audio code stream to determine the output audio code stream;
    将所述输出音频码流从网络接口中发送到网络服务器中,以使所述网络服务器将所述输出音频码流发送给各个观众所使用的直播观看终端。The output audio code stream is sent from the network interface to the network server, so that the network server sends the output audio code stream to the live viewing terminal used by each audience.
  10. 一种直播背景音处理装置,其特征在于,包括:主控模块以及音频处理模块,所述主控模块中安装有直播应用以及音乐播放应用;其中,A live broadcast background sound processing device, characterized in that it includes: a main control module and an audio processing module, and a live broadcast application and a music playback application are installed in the main control module; wherein,
    所述主控模块,用于在满足预设显示条件时,在显示屏上显示背景音操作控件;The main control module is used to display background sound operation controls on the display screen when preset display conditions are met;
    所述主控模块,还用于响应于用户对所述背景音操作控件的操作指令,将所述音乐播放应用中对应的第一音频码流发送到所述音频处理模块中;The main control module is further configured to send the corresponding first audio code stream in the music playing application to the audio processing module in response to the user's operation instruction on the background sound operation control;
    所述音频处理模块,用于将所述第一音频数据流与第二音频码流进行汇流合成,以确定所述直播应用的输入音频码流,所述第二音频码流包括通过音频采集装置接收到主播所发出的声音信号,和/或,直播环境中的声音信号。The audio processing module is configured to concatenate and synthesize the first audio data stream and the second audio code stream to determine the input audio code stream of the live application, and the second audio code stream includes Receive the sound signal from the anchor, and/or the sound signal in the live broadcast environment.
  11. 一种电子设备,其特征在于,包括:An electronic device, characterized in that it comprises:
    至少一个处理器和存储器;at least one processor and memory;
    所述存储器用于存储计算机程序;The memory is used to store computer programs;
    所述至少一个处理器执行所述存储器存储的所述计算机程序,使得所述至少一个处理器执行如权利要求1-9任一项所述的直播背景音处理方法。The at least one processor executes the computer program stored in the memory, so that the at least one processor executes the live broadcast background sound processing method according to any one of claims 1-9.
  12. 一种直播一体机,其特征在于,包括:权利要求11所述的电子设备。An all-in-one live broadcasting machine, comprising: the electronic device according to claim 11.
  13. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有计算机程序,当处理器执行所述计算机程序时,实现如权利要求1-9任一项所述的直播背景音处理方法。A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and when the processor executes the computer program, the live broadcast background according to any one of claims 1-9 is realized sound processing method.
  14. 一种计算机程序产品,其特征在于,包括计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1-9任一项所述的直播背景音处理方法。A computer program product, characterized in that it includes a computer program, and is characterized in that, when the computer program is executed by a processor, the method for processing live background sound according to any one of claims 1-9 is realized.
  15. 一种计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1-9任一项所述的直播背景音处理方法。A computer program, characterized in that, when the computer program is executed by a processor, the method for processing live background sound according to any one of claims 1-9 is implemented.
PCT/CN2022/087482 2021-05-13 2022-04-18 Livestreaming background sound processing method and apparatus, device, medium, and program product WO2022237463A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110522604.2A CN113132794A (en) 2021-05-13 2021-05-13 Live background sound processing method, device, equipment, medium and program product
CN202110522604.2 2021-05-13

Publications (1)

Publication Number Publication Date
WO2022237463A1 true WO2022237463A1 (en) 2022-11-17

Family

ID=76781767

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/087482 WO2022237463A1 (en) 2021-05-13 2022-04-18 Livestreaming background sound processing method and apparatus, device, medium, and program product

Country Status (2)

Country Link
CN (1) CN113132794A (en)
WO (1) WO2022237463A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113132794A (en) * 2021-05-13 2021-07-16 北京字节跳动网络技术有限公司 Live background sound processing method, device, equipment, medium and program product

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040125965A1 (en) * 2002-12-27 2004-07-01 William Alberth Method and apparatus for providing background audio during a communication session
CN105872253A (en) * 2016-05-31 2016-08-17 腾讯科技(深圳)有限公司 Live broadcast sound processing method and mobile terminal
CN106302087A (en) * 2015-05-19 2017-01-04 深圳市腾讯计算机系统有限公司 Instant communication method, Apparatus and system
CN106331736A (en) * 2016-08-24 2017-01-11 武汉斗鱼网络科技有限公司 Live client speech processing system and processing method thereof
CN106531177A (en) * 2016-12-07 2017-03-22 腾讯科技(深圳)有限公司 Audio treatment method, a mobile terminal and system
CN107027050A (en) * 2017-04-13 2017-08-08 广州华多网络科技有限公司 Auxiliary live audio/video processing method and device
CN107948704A (en) * 2017-12-29 2018-04-20 北京安云世纪科技有限公司 For to voice data into Mobile state synthetic method, system and mobile terminal
CN108024135A (en) * 2017-12-13 2018-05-11 广州虎牙信息科技有限公司 Direct broadcasting room live picture exhibition method, storage device and computer equipment
CN108235047A (en) * 2018-01-30 2018-06-29 广州华多网络科技有限公司 A kind of audio frequency playing method of direct broadcasting room and main broadcaster's terminal device
CN109767777A (en) * 2019-01-31 2019-05-17 迅雷计算机(深圳)有限公司 A kind of sound mixing method that software is broadcast live
CN112015505A (en) * 2020-08-13 2020-12-01 北京字节跳动网络技术有限公司 Mode switching method and device and electronic equipment
CN112423009A (en) * 2020-11-09 2021-02-26 珠海格力电器股份有限公司 Method and equipment for controlling live broadcast audio
CN113132794A (en) * 2021-05-13 2021-07-16 北京字节跳动网络技术有限公司 Live background sound processing method, device, equipment, medium and program product

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112269898A (en) * 2020-10-30 2021-01-26 维沃移动通信有限公司 Background music obtaining method and device, electronic equipment and readable storage medium

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040125965A1 (en) * 2002-12-27 2004-07-01 William Alberth Method and apparatus for providing background audio during a communication session
CN106302087A (en) * 2015-05-19 2017-01-04 深圳市腾讯计算机系统有限公司 Instant communication method, Apparatus and system
CN105872253A (en) * 2016-05-31 2016-08-17 腾讯科技(深圳)有限公司 Live broadcast sound processing method and mobile terminal
CN106331736A (en) * 2016-08-24 2017-01-11 武汉斗鱼网络科技有限公司 Live client speech processing system and processing method thereof
CN106531177A (en) * 2016-12-07 2017-03-22 腾讯科技(深圳)有限公司 Audio treatment method, a mobile terminal and system
CN107027050A (en) * 2017-04-13 2017-08-08 广州华多网络科技有限公司 Auxiliary live audio/video processing method and device
CN108024135A (en) * 2017-12-13 2018-05-11 广州虎牙信息科技有限公司 Direct broadcasting room live picture exhibition method, storage device and computer equipment
CN107948704A (en) * 2017-12-29 2018-04-20 北京安云世纪科技有限公司 For to voice data into Mobile state synthetic method, system and mobile terminal
CN108235047A (en) * 2018-01-30 2018-06-29 广州华多网络科技有限公司 A kind of audio frequency playing method of direct broadcasting room and main broadcaster's terminal device
CN109767777A (en) * 2019-01-31 2019-05-17 迅雷计算机(深圳)有限公司 A kind of sound mixing method that software is broadcast live
CN112015505A (en) * 2020-08-13 2020-12-01 北京字节跳动网络技术有限公司 Mode switching method and device and electronic equipment
CN112423009A (en) * 2020-11-09 2021-02-26 珠海格力电器股份有限公司 Method and equipment for controlling live broadcast audio
CN113132794A (en) * 2021-05-13 2021-07-16 北京字节跳动网络技术有限公司 Live background sound processing method, device, equipment, medium and program product

Also Published As

Publication number Publication date
CN113132794A (en) 2021-07-16

Similar Documents

Publication Publication Date Title
WO2021073315A1 (en) Video file generation method and device, terminal and storage medium
CN102064857B (en) Method and apparatus for remote controlling bluetooth device
JP6702451B2 (en) In-vehicle device, control method for in-vehicle device, and control program
WO2022237464A1 (en) Audio synthesis method and apparatus, and device, medium and program product
WO2022007722A1 (en) Display method and apparatus, and device and storage medium
WO2022007724A1 (en) Video processing method and apparatus, and device and storage medium
JP2013198085A (en) Information processing device, information processing method, information processing program and terminal device
EP4124052A1 (en) Video production method and apparatus, and device and storage medium
WO2022001655A1 (en) Video playback control method and apparatus, electronic device, and storage medium
WO2023051293A1 (en) Audio processing method and apparatus, and electronic device and storage medium
WO2021083145A1 (en) Video processing method and device, terminal, and storage medium
WO2021083146A1 (en) Video processing method and apparatus, and terminal and storage medium
RU2453899C1 (en) Apparatus and method for audio-visual search and browse interface, machine-readable medium
US11886484B2 (en) Music playing method and apparatus based on user interaction, and device and storage medium
WO2023185647A1 (en) Media content display method and apparatus, and device, storage medium and program product
WO2022160603A1 (en) Song recommendation method and apparatus, electronic device, and storage medium
KR20160017461A (en) Device for controlling play and method thereof
JP2021536079A (en) Control method of display device and display device by it
WO2024078516A1 (en) Media content display method and apparatus, device, and storage medium
WO2024094130A1 (en) Content sharing method and apparatus, and device, computer-readable storage medium and product
JP2023538943A (en) Audio data processing methods, devices, equipment and storage media
WO2022237463A1 (en) Livestreaming background sound processing method and apparatus, device, medium, and program product
JP2018042254A (en) Terminal device
WO2024037480A1 (en) Interaction method and apparatus, electronic device, and storage medium
WO2024032635A1 (en) Media content acquisition method and apparatus, and device, readable storage medium and product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22806433

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 02.04.2024)