WO2020052665A1 - Live broadcast interaction method and apparatus, and storage medium - Google Patents

Info

Publication number
WO2020052665A1
Authority
WO
WIPO (PCT)
Prior art keywords
target user
live broadcast
special effect
instruction
information
Prior art date
Application number
PCT/CN2019/105771
Other languages
French (fr)
Chinese (zh)
Inventor
牛冰峰
Original Assignee
咪咕音乐有限公司
咪咕文化科技有限公司
中国移动通信集团有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 咪咕音乐有限公司, 咪咕文化科技有限公司, 中国移动通信集团有限公司

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508 Management of client data or end-user data
    • H04N21/4532 Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/478 Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788 Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Definitions

  • the present invention relates to, but is not limited to, the field of network communications, and in particular, to a live broadcast interactive method, device, and storage medium.
  • Fig. 1 is a schematic diagram showing the effect of a gift presented by a live app in related technology.
  • The gift is often displayed as a chat dialog in the chat window of the live broadcast room, or as a special effect animation in the live broadcast interface.
  • When displayed in the chat window, the gift may be covered by frequent chat content and gradually disappear, which shortens its presentation time; moreover, the display effect of the gift has little relation to the content currently being broadcast.
  • The effect of presenting gifts during live video playback is therefore relatively monotonous, which reduces the interaction between the user and the anchor.
  • embodiments of the present invention are expected to provide a method, device and storage medium for live broadcast interaction, at least to solve the problem that it is difficult to effectively improve the interaction effect between the user and the anchor in the related art.
  • an embodiment of the present invention provides a live broadcast interactive method, where the method includes:
  • after receiving a first instruction, determining an acoustic characteristic parameter corresponding to the voice information of the target user, wherein the first instruction represents a gift giving instruction triggered by the target user;
  • adjusting, according to the acoustic characteristic parameter, the track information of the anchor of the live broadcast room that the target user is currently watching.
  • An embodiment of the present invention further provides a live broadcast interactive device, where the device includes an analysis module and an adjustment module;
  • the analysis module is configured to determine an acoustic characteristic parameter corresponding to the voice information of the target user after receiving the first instruction, wherein the first instruction represents a gift giving instruction triggered by the target user;
  • the adjustment module is configured to adjust, according to the acoustic characteristic parameter, the track information of the anchor of the live broadcast room that the target user is currently watching.
  • An embodiment of the present invention further provides a live broadcast interactive device, including a memory, a processor, and an executable program stored on the memory and executable by the processor.
  • When the processor runs the executable program, it performs the steps of the live broadcast interactive method provided by the embodiment of the present invention.
  • an embodiment of the present invention further provides a storage medium on which an executable program is stored.
  • When the executable program is executed by a processor, the steps of the live broadcast interactive method provided by the embodiment of the present invention are implemented.
  • With the live broadcast interactive method, device, and storage medium provided by the embodiments of the present invention, after the first instruction is received, voice information of the target user is collected and parsed to determine the acoustic characteristic parameter corresponding to that voice information, and the track information of the anchor of the live broadcast room that the target user is currently watching is adjusted according to the acoustic characteristic parameter.
  • Changing the audio track information of the anchor through the acoustic characteristic parameters specific to the target user not only enhances the presentation effect of the gift, but also increases the interaction between the user and the anchor and enhances both the fun of the live broadcast and the user stickiness of the network platform, which greatly improves the user experience.
  • FIG. 1 is a schematic diagram showing the effect of a gift presented by a live APP in related technologies.
  • FIG. 2 is a schematic flowchart of an implementation of a live broadcast interactive method according to an embodiment of the present invention.
  • FIG. 3 is a schematic flowchart of an implementation of another live broadcast interactive method according to an embodiment of the present invention.
  • FIGS. 4A to 4C are schematic diagrams of display effects of collecting voice information of a target user according to an embodiment of the present invention.
  • FIG. 5 is a functional structure diagram of a live broadcast interactive device according to an embodiment of the present invention.
  • FIG. 6 is a functional structure diagram of another live broadcast interactive device according to an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of a hardware structure of a live broadcast interactive device according to an embodiment of the present invention.
  • FIG. 2 is a schematic flowchart of a live broadcast interactive method provided by an embodiment of the present invention. The live broadcast interactive method may be applied to a server or a terminal device. As shown in FIG. 2, the implementation process of the live broadcast interactive method in the embodiment of the present invention may include the following steps:
  • Step 201 After receiving a first instruction, collect voice information of a target user, where the first instruction is used to represent a gift giving instruction triggered by the target user.
  • the first instruction may refer to a gift giving instruction triggered by a target user in a designated area of the live broadcast app; wherein the gift giving instruction may be triggered by inputting a click operation or a sliding operation in the designated area.
  • the target user can flexibly choose multiple gift giving methods according to their own needs.
  • the gift giving mode includes at least one of the following: a normal gift giving mode and a special effect gift giving mode.
  • Collecting the voice information of the target user may be implemented as follows: parsing the special effect identifier from the first instruction; entering the corresponding special effect gift giving mode according to the special effect identifier; and, in the special effect gift giving mode, collecting voice information of the target user by calling an audio collection device.
  • If a normal identifier is parsed from the first instruction, the normal gift giving mode is entered in response to the first instruction. In the normal gift giving mode, the gift given by the target user is displayed in an existing manner, such as a chat dialog or a special effect animation, which is not repeated here.
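  • The mode selection in step 201 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `GiftInstruction` fields, the identifier strings, and the function names are all assumptions introduced here, since the source does not define the instruction format.

```python
from dataclasses import dataclass
from enum import Enum


class GiftMode(Enum):
    NORMAL = "normal"
    SPECIAL_EFFECT = "special_effect"


@dataclass(frozen=True)
class GiftInstruction:
    """Hypothetical shape of the 'first instruction'; all field names are assumptions."""
    user_id: str
    room_id: str
    identifier: str  # assumed to carry "normal" or "special_effect"


def select_gift_mode(instruction: GiftInstruction) -> GiftMode:
    """Parse the identifier carried by the instruction and pick the gift giving mode."""
    try:
        return GiftMode(instruction.identifier)
    except ValueError:
        raise ValueError(f"unknown gift identifier: {instruction.identifier!r}") from None


def needs_voice_capture(instruction: GiftInstruction) -> bool:
    """Voice is collected only in the special effect gift giving mode."""
    return select_gift_mode(instruction) is GiftMode.SPECIAL_EFFECT
```

  • In this sketch, only an instruction whose identifier maps to `GiftMode.SPECIAL_EFFECT` would lead the caller to invoke an audio collection device; a normal identifier falls through to the existing display path.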
  • Step 202 Parse the voice information of the target user, and determine the acoustic characteristic parameters corresponding to the voice information of the target user.
  • the acoustic characteristic parameters include acoustic spectrum information.
  • The embodiment of the present invention uses the acoustic spectrum information as the main basis for distinguishing tone color. That is, to change the anchor's sound through the special effect voice in the embodiment of the present invention, the acoustic spectrum information of the target user must first be extracted from the collected voice information of the target user.
  • The method further includes: associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
  • The association between the acoustic spectrum information of the target user and the user account of the target user is saved to the acoustic spectrum information database of the server, so that the accuracy of the extracted acoustic spectrum information can be further determined through the user account of the target user.
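  • The association between a user account and its acoustic spectrum information can be illustrated with a small in-memory store. This is only a sketch: `SpectrumStore` and its methods are hypothetical names, and a plain dictionary stands in for the server-side acoustic spectrum information database, whose schema the source does not describe.

```python
from typing import Dict, List, Optional


class SpectrumStore:
    """In-memory stand-in (assumption) for the server's acoustic spectrum information database."""

    def __init__(self) -> None:
        self._by_account: Dict[str, List[float]] = {}

    def associate(self, account: str, spectrum: List[float]) -> None:
        """Save the association between a user account and its spectrum (stored as a copy)."""
        self._by_account[account] = list(spectrum)

    def lookup(self, account: str) -> Optional[List[float]]:
        """Return a copy of the stored spectrum, or None if the account has none."""
        stored = self._by_account.get(account)
        return None if stored is None else list(stored)
```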
  • Step 203 Adjust, according to the acoustic characteristic parameters, the track information of the anchor of the live room that the target user is currently watching.
  • When the acoustic characteristic parameter is acoustic spectrum information, adjusting the track information of the anchor of the live broadcast room that the target user is currently watching according to the acoustic characteristic parameter in step 203 may be implemented as follows:
  • Parse the special effect duration from the first instruction, and adjust, according to the special effect duration and the acoustic spectrum information, the track information of the anchor of the live broadcast room that the target user is currently watching.
  • The live broadcast room corresponding to the first instruction, that is, the live broadcast room that the target user is currently watching, is first determined according to the live broadcast room identifier carried in the first instruction, and the anchor of that live broadcast room is determined. The track information of the anchor is then adjusted according to the determined acoustic spectrum information and the special effect duration.
  • The set time period can be chosen according to the actual situation, which is not specifically limited here.
  • the method further includes:
  • a special effect use range is parsed from the first instruction, wherein the special effect use range corresponds to a type of gift to be given; and adjusted audio track information is output according to the special effect use range.
  • The special effect use range includes at least one of the following: all users in the live broadcast room that the target user is currently watching, or some of those users; the latter may be, for example, only the target user who triggered the gift giving instruction.
  • In the embodiment of the present invention, the acoustic characteristic parameters corresponding to the target user's voice information are determined, and the track information of the anchor of the live broadcast room that the target user is currently watching is adjusted accordingly. This changes the audio track information of the anchor through the voice effect of the target user, enhances the presentation effect of the gift and the interaction between the user and the anchor, and enhances both the fun of the live broadcast and the user stickiness of the network platform.
  • FIG. 3 is a schematic flowchart of another live broadcast interactive method provided by an embodiment of the present invention.
  • the live broadcast interactive method may be applied to a server (such as a live broadcast server) or a terminal device.
  • the implementation process of the live broadcast interactive method may include the following steps:
  • Step 301 After receiving the first instruction, collect voice information of the target user.
  • the first instruction is used to represent a gift giving instruction triggered by the target user.
  • The first instruction may refer to the gift giving instruction triggered by the target user through a designated area of the live broadcast app; a click operation or a sliding operation can be input in the designated area to trigger the gift giving instruction.
  • the target user can flexibly choose a variety of gift giving methods according to their own needs.
  • the gift giving mode includes at least one of the following: a normal gift giving mode and a special effect gift giving mode.
  • When the gift giving mode selected by the target user is the normal gift giving mode, the first instruction carries a normal identifier. In this case, the normal identifier is parsed from the first instruction, and the normal gift giving mode is entered in response to the first instruction.
  • In the normal gift giving mode, the gift given by the target user is displayed in the existing manner shown in FIG. 1, such as a chat dialog or a special effect animation, which is not repeated here.
  • When the gift giving mode selected by the target user is the special effect gift giving mode, the first instruction carries a special effect identifier. In this case, the special effect identifier, that is, a voice special effect identifier, is parsed from the first instruction, and the special effect gift giving mode is entered in response to the first instruction.
  • the terminal device will call the audio collection device and pop up a prompt message to guide the target user to input audio information.
  • FIGS. 4A to 4C are schematic diagrams of display effects of collecting voice information of a target user according to an embodiment of the present invention.
  • The target user clicks a virtual button that triggers gift giving, and the live broadcast app pops up the gift giving mode selection interface shown in FIG. 4B.
  • According to the gift giving mode selected by the target user on the selection interface, such as the normal gift giving mode or the special effect gift giving mode, a corresponding operation interface pops up.
  • the live broadcast app will pop up a voice input guidance interface as shown in FIG. 4C, such as "Please record a voice through the microphone" to guide the target user to input a voice.
  • Step 302 Parse the voice information of the target user, and determine the acoustic spectrum information corresponding to the voice information of the target user.
  • Sound is a mechanical wave (acoustic wave) with a certain oscillation frequency.
  • An acoustic wave has physical parameters such as oscillation frequency, amplitude, and waveform. It is precisely because these physical parameters differ that sounds produce various auditory effects.
  • Sound has four different forms of expression: tone (pitch), volume, tone color (timbre), and tone type.
  • Tone is the form of expression related to the oscillation frequency of the acoustic wave: the higher the oscillation frequency, the higher the tone; the lower the oscillation frequency, the lower the tone. Volume is the form of expression related to the amplitude of the acoustic wave.
  • Volume increases with the amplitude of the acoustic wave: the larger the amplitude, the larger the volume; conversely, the smaller the amplitude, the smaller the volume.
  • Tone color refers to the sensory characteristics of a sound, and the voices of different users are distinguished by tone color. For example, even when two sopranos sing the same song, their tone colors are not the same.
  • Timbre is determined by the waveform of the acoustic wave.
  • The simplest waveform is a sine wave; the waveform of alternating current, for example, is a standard sine wave.
  • The user's voice, the sounds of various musical instruments, and the various sounds in nature often have more complex waveforms. It is precisely these different waveform shapes that determine the timbre of different sounds.
  • The tone color of a sound can be represented by a waveform (the time-domain representation of the sound) or by a sound spectrum (the frequency-domain representation of the sound); each segment of waveform has corresponding sound spectrum information.
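  • The relationship between a waveform (time domain) and its spectrum (frequency domain) can be illustrated with a direct discrete Fourier transform. The source does not say how the spectrum is computed, so the DFT here is an assumed, generic technique, and the function names and the 64-sample test tone are illustrative only.

```python
import cmath
import math
from typing import List


def magnitude_spectrum(samples: List[float]) -> List[float]:
    """Return |X[k]| for k = 0..N/2 using a direct (O(N^2)) discrete Fourier transform."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        total = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(samples)
        )
        spectrum.append(abs(total))
    return spectrum


def dominant_bin(samples: List[float]) -> int:
    """Index of the frequency bin with the largest magnitude."""
    mags = magnitude_spectrum(samples)
    return max(range(len(mags)), key=mags.__getitem__)


# A pure sine completing 5 cycles over 64 samples: its energy sits in bin 5.
wave = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]
print(dominant_bin(wave))
```

  • A pure sine concentrates its energy in a single bin, whereas a voice or an instrument spreads energy over many bins; that distribution is what the text refers to as the spectrum distinguishing one tone color from another.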
  • The embodiment of the present invention uses the acoustic spectrum information as the main basis for distinguishing tone color. That is, to change the anchor's sound through the special effect voice in the embodiment of the present invention, the acoustic spectrum information of the target user must first be extracted from the collected voice information of the target user.
  • For example, 20 basic acoustic spectra can be combined into more than 400 acoustic spectrum combinations, and the voice of the target user can be simulated with these combinations, so only the 20 basic acoustic spectra need to be collected.
  • The method further includes: associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
  • The target user may choose different special effect durations or special effect use ranges when giving a voice effect gift.
  • Step 303 Parse the duration of the special effect from the first instruction.
  • Step 304 Adjust, according to the duration of the special effect and the acoustic spectrum information, the track information of the anchor of the live room that the target user is currently watching.
  • The live broadcast room corresponding to the first instruction, that is, the live broadcast room that the target user is currently watching, is first determined according to the live broadcast room identifier carried in the first instruction, and the anchor of that live broadcast room is determined. The track information of the anchor is then adjusted according to the determined acoustic spectrum information and the special effect duration.
  • That is, the track information of the anchor of the live broadcast room that the target user is currently watching is adjusted within a set time period.
  • The set time period can be chosen according to the actual situation, which is not specifically limited here.
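  • Steps 303 and 304 can be sketched as a time-limited adjustment. The source does not specify how the anchor's track is modified, so this sketch assumes the simplest possible form: per-band gains (derived in some unspecified way from the user's acoustic spectrum) that scale the anchor's per-band energies while the effect is active. `VoiceEffect`, `adjust_frame`, and all field names are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class VoiceEffect:
    start_s: float           # time at which the effect begins, in seconds (assumed)
    duration_s: float        # special effect duration parsed from the first instruction
    band_gains: List[float]  # assumed per-band gains derived from the user's spectrum

    def is_active(self, t: float) -> bool:
        """True while t lies in the half-open window [start, start + duration)."""
        return self.start_s <= t < self.start_s + self.duration_s


def adjust_frame(band_energies: List[float], t: float, effect: VoiceEffect) -> List[float]:
    """Scale the anchor's per-band energies while the effect is active; pass through otherwise."""
    if not effect.is_active(t):
        return list(band_energies)
    if len(band_energies) != len(effect.band_gains):
        raise ValueError("band count mismatch between frame and effect")
    return [e * g for e, g in zip(band_energies, effect.band_gains)]
```

  • Once the window has elapsed, frames pass through unchanged, which corresponds to the anchor's sound returning to normal after the set time period.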
  • Step 305 Parse the special effect use range from the first instruction, and output adjusted track information according to the special effect use range.
  • the special effect use range corresponds to the type of gift being given.
  • The special effect use range includes at least one of the following: all users in the live broadcast room that the target user is currently watching, or some of those users; the latter may be, for example, only the target user who triggered the gift giving instruction.
  • For example, the voice effect gift is divided into two types: "A" and "B".
  • The use range of the "A" type voice effect gift is all users in the live broadcast room; that is, when the target user chooses to send an "A" type voice effect gift, all users in the live broadcast room can hear the voice effect. When the target user chooses to send a "B" type voice effect gift, only the gift giver, that is, the target user, can hear the voice effect, and other users cannot.
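  • The mapping from gift type to special effect use range in step 305 can be sketched as a lookup followed by a filter. The "A" and "B" types come from the example above; the range labels, the `effect_recipients` function, and the user identifiers are assumptions made for illustration.

```python
from typing import Iterable, List

# Gift types "A" and "B" follow the example in the text; the range labels are assumed names.
USE_RANGE_BY_GIFT_TYPE = {"A": "all_users", "B": "giver_only"}


def effect_recipients(gift_type: str, giver: str, room_users: Iterable[str]) -> List[str]:
    """Return the users who should receive the adjusted audio track."""
    use_range = USE_RANGE_BY_GIFT_TYPE.get(gift_type)
    if use_range is None:
        raise ValueError(f"unknown gift type: {gift_type!r}")
    users = list(room_users)
    if use_range == "all_users":
        return users
    # "giver_only": only the target user who triggered the gift hears the effect.
    return [u for u in users if u == giver]
```

  • Users outside the returned list would continue to receive the anchor's unadjusted audio track.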
  • In the embodiment of the present invention, the target user may choose to give the anchor a voice special effect gift.
  • The target user inputs a piece of voice information through a voice input device (such as a microphone); the voice information is parsed to determine the corresponding acoustic spectrum information; and the anchor's audio track information within the next set time period is adjusted according to that acoustic spectrum information.
  • This changes the anchor's sound through the voice special effect gift, which enhances the presentation effect of the gift and the interaction between the user and the anchor, and enhances both the fun of the live broadcast and the user stickiness of the network platform.
  • FIG. 5 is a functional structure diagram of a live broadcast interactive device provided by an embodiment of the present invention.
  • the live broadcast interactive device includes: a collection module 51, an analysis module 52, and an adjustment module 53. The functions of the above program modules are described in detail below.
  • the collection module 51 is configured to collect voice information of a target user after receiving a first instruction, wherein the first instruction represents a gift giving instruction triggered by the target user;
  • the analysis module 52 is configured to parse the voice information of the target user, and determine an acoustic characteristic parameter corresponding to the voice information of the target user;
  • the adjustment module 53 is configured to adjust, according to the acoustic characteristic parameter, the track information of the anchor of the live broadcast room that the target user is currently watching.
  • Collecting the voice information of the target user may be implemented as follows: parsing the special effect identifier from the first instruction; entering the corresponding special effect gift giving mode according to the special effect identifier; and, in the special effect gift giving mode, collecting voice information of the target user by calling an audio collection device.
  • the acoustic characteristic parameter may include acoustic spectrum information.
  • FIG. 6 is a functional structure diagram of another live broadcast interactive device according to an embodiment of the present invention. As shown in FIG. 6, the live broadcast interactive device may further include:
  • the association module 54 is configured to, after the analysis module 52 determines the acoustic characteristic parameters corresponding to the voice information of the target user, associate the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
  • The adjustment may be implemented in the following manner: parsing the special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the track information of the anchor of the live broadcast room that the target user is currently watching.
  • The analysis module 52 is further configured to parse the special effect use range from the first instruction after the adjustment module 53 adjusts the track information of the anchor of the live broadcast room that the target user is currently watching, wherein the special effect use range corresponds to the type of gift given;
  • the live broadcast interactive device further includes:
  • the output module 55 is configured to output adjusted audio track information according to the special effect use range.
  • It should be noted that, when the live broadcast interactive device provided in the foregoing embodiment implements live broadcast interaction between the user and the anchor, the above division of program modules is used only as an example.
  • In practical applications, the above processing may be allocated to different program modules as needed; that is, the internal structure of the live broadcast interactive device may be divided into different program modules to complete all or part of the processing described above.
  • the live broadcast interactive device and the live broadcast interactive method embodiments provided by the foregoing embodiments belong to the same concept, and the implementation process thereof is detailed in the method embodiment, and details are not described herein again.
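  • The division into collection, analysis, and adjustment modules can be sketched as three injected callables chained in order. This is an illustration of the module boundaries only: `LiveInteractionPipeline`, its constructor parameters, and the toy callables in the usage example are hypothetical, and the real modules would wrap audio capture, spectrum extraction, and track processing.

```python
from typing import Callable, List, Sequence

Samples = Sequence[float]
Spectrum = List[float]


class LiveInteractionPipeline:
    """Chains stand-ins for the collection (51), analysis (52), and adjustment (53) modules."""

    def __init__(
        self,
        collect: Callable[[], Samples],
        analyze: Callable[[Samples], Spectrum],
        adjust: Callable[[Samples, Spectrum], List[float]],
    ) -> None:
        self._collect = collect
        self._analyze = analyze
        self._adjust = adjust

    def handle_gift(self, anchor_track: Samples) -> List[float]:
        """Run collection, then analysis, then adjustment of the anchor's track."""
        voice = self._collect()
        spectrum = self._analyze(voice)
        return self._adjust(anchor_track, spectrum)


# Toy callables (assumptions) that only demonstrate the data flow between modules.
pipeline = LiveInteractionPipeline(
    collect=lambda: [1.0, 3.0],
    analyze=lambda voice: [sum(voice) / len(voice)],
    adjust=lambda track, spectrum: [x * spectrum[0] for x in track],
)
adjusted = pipeline.handle_gift([0.5, 1.0])
```

  • Injecting the modules as callables mirrors the note above that the processing may be allocated to different program modules as needed.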
  • In practical applications, the collection module 51, the analysis module 52, the adjustment module 53, and the association module 54 in the live broadcast interactive device may each be implemented by a central processing unit (CPU), a microprocessor unit (MPU), a digital signal processor (DSP), or a field programmable gate array (FPGA) located on a server or a terminal device; the output module 55 may be implemented by a communication module (including a basic communication suite, an operating system, a communication module, standardized interfaces and protocols, and the like) and a transceiver antenna.
  • an embodiment of the present invention further provides a hardware structure of a live broadcast interactive device.
  • a live broadcast interactive device that implements an embodiment of the present invention will now be described with reference to the drawings.
  • The live broadcast interactive device may be implemented in various forms, such as a server (for example, a live broadcast server), a terminal device (for example, a desktop computer, a notebook computer, or a smart phone), and other types of computer equipment.
  • The hardware structure of the live broadcast interactive device according to the embodiment of the present invention is further described below. It can be understood that FIG. 7 shows only an exemplary structure of the live broadcast interactive device rather than the entire structure, and some or all of the structures shown in FIG. 7 may be implemented as required.
  • FIG. 7 is a schematic diagram of a hardware structure of a live broadcast interactive device according to an embodiment of the present invention. In actual applications, it can be applied to the foregoing various servers or terminal devices running application programs.
  • The live broadcast interactive device 700 shown in FIG. 7 includes: at least one processor 701, a memory 702, a user interface 703, and at least one network interface 704.
  • the various components in the live broadcast interactive device 700 are coupled together through a bus system 705. It can be understood that the bus system 705 is configured to implement connection and communication between these components.
  • the bus system 705 includes a power bus, a control bus, and a status signal bus in addition to the data bus. However, for the sake of clarity, various buses are marked as the bus system 705 in FIG. 7.
  • the user interface 703 may include a display, a keyboard, a mouse, a trackball, a click wheel, keys, buttons, a touch panel, or a touch screen.
  • the memory 702 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory.
  • the memory 702 in the embodiment of the present invention is configured to store various types of data to support the operation of the live broadcast interactive device 700. Examples of these data include: any computer program for operating on the live broadcast interactive device 700, such as an executable program 7021 and an operating system 7022. A program that implements the live broadcast interactive method of the embodiment of the present invention may be included in the executable program 7021.
  • the live broadcast interaction method disclosed in the embodiment of the present invention may be applied to the processor 701, or implemented by the processor 701.
  • the processor 701 may be an integrated circuit chip and has a signal processing capability. In the implementation process, each step of the foregoing live broadcast interactive method may be completed by an integrated logic circuit of hardware in the processor 701 or an instruction in the form of software.
  • the foregoing processor 701 may be a general-purpose processor, a DSP, or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and the like.
  • the processor 701 may implement or execute various live broadcast interactive methods, steps, and logic block diagrams provided in the embodiments of the present invention.
  • a general-purpose processor may be a microprocessor or any conventional processor.
  • the steps can be directly performed by a hardware decoding processor, or can be performed by a combination of hardware and software modules in the decoding processor.
  • the software module may be located in a storage medium.
  • the storage medium is located in the memory 702.
  • the processor 701 reads the information in the memory 702 and completes the steps of the live broadcast interactive method provided by the embodiment of the present invention in combination with its hardware.
  • the live broadcast interactive device 700 includes a memory 702, a processor 701, and an executable program 7021 stored on the memory 702 and executable by the processor 701.
  • when the processor 701 runs the executable program 7021, the following is implemented: after receiving a first instruction, collecting voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user; parsing the voice information of the target user to determine an acoustic feature parameter corresponding to the voice information; and adjusting, according to the acoustic feature parameter, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  • when the processor 701 runs the executable program 7021, the following is implemented: parsing a special effect identifier from the first instruction; entering the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collecting the voice information of the target user by calling an audio collection device.
  • when the processor 701 runs the executable program 7021, the following is implemented: the acoustic feature parameter includes acoustic spectrum information; and, after the acoustic feature parameter corresponding to the voice information of the target user is determined, associating the acoustic spectrum information corresponding to the voice information with the user account of the target user.
  • when the processor 701 runs the executable program 7021, the following is implemented: parsing a special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  • when the processor 701 runs the executable program 7021, the following is implemented: after adjusting the audio track information of the anchor of the live broadcast room that the target user is currently watching, parsing a special effect use range from the first instruction, wherein the special effect use range corresponds to the type of gift given; and outputting the adjusted audio track information according to the special effect use range.
  • an embodiment of the present invention further provides a storage medium.
  • the storage medium may be a storage medium such as an optical disk, a flash memory, or a magnetic disk, and may be a non-transitory storage medium.
  • an executable program 7021 is stored on the storage medium; when the executable program 7021 is executed by the processor 701, the following is implemented: after receiving a first instruction, collecting voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user; parsing the voice information of the target user to determine an acoustic feature parameter corresponding to the voice information; and adjusting, according to the acoustic feature parameter, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  • when the executable program 7021 is executed by the processor 701, the following is implemented: parsing a special effect identifier from the first instruction; entering the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collecting the voice information of the target user by calling an audio collection device.
  • when the executable program 7021 is executed by the processor 701, the following is implemented: the acoustic feature parameter includes acoustic spectrum information; and, after the acoustic feature parameter corresponding to the voice information of the target user is determined, associating the acoustic spectrum information corresponding to the voice information with the user account of the target user.
  • when the executable program 7021 is executed by the processor 701, the following is implemented: parsing a special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  • when the executable program 7021 is executed by the processor 701, the following is implemented: after adjusting the audio track information of the anchor of the live broadcast room that the target user is currently watching, parsing a special effect use range from the first instruction, wherein the special effect use range corresponds to the type of gift given; and outputting the adjusted audio track information according to the special effect use range.
  • the live broadcast interaction method changes the audio track information of the anchor according to the acoustic feature parameters specific to the target user, which not only enhances the presentation of the gift given but also increases the interaction between the user and the anchor, making the live broadcast more entertaining and improving user stickiness on the network platform.
  • the embodiments of the present invention may be provided as a method, a system, or an executable program product. Therefore, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of an executable program product implemented on one or more computer-usable storage media (including, but not limited to, magnetic disk storage, optical storage, and the like) containing computer-usable program code.
  • These executable program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to work in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction device, and the instruction device implements the functions specified in one or more flowcharts and / or one or more blocks of the block diagrams.
  • executable program instructions can also be loaded onto a computer or other programmable data processing device, such that a series of steps are performed on the computer or other programmable data processing device to produce a computer-implemented process executed on the computer or other programmable data processing device.
  • the instructions provide steps for implementing the functions specified in one or more flowcharts and / or one or more blocks of the block diagrams.

Abstract

Disclosed is a live broadcast interaction method. The method comprises: after a first instruction is received, determining an acoustic feature parameter corresponding to voice information of a target user, wherein the first instruction is used for characterizing a gift-giving instruction triggered by the target user; and adjusting, according to the acoustic feature parameter, audio track information of an anchor in a live broadcast room that is currently viewed by the target user. Further disclosed are a live broadcast interaction apparatus and a storage medium.

Description

Live broadcast interaction method, apparatus, and storage medium
Cross-reference to related applications
This application is based on, and claims priority to, Chinese patent application No. 201811063369.1, filed on September 12, 2018, the entire contents of which are incorporated herein by reference.
Technical field
The present invention relates to, but is not limited to, the field of network communications, and in particular to a live broadcast interaction method, apparatus, and storage medium.
Background
With the rapid development of Internet technology, live streaming has become increasingly popular, and users can watch live programs from their favorite anchors anytime and anywhere through a live broadcast application (APP). While watching a live video, a user can give gifts to a favorite anchor through the network platform in order to interact with the anchor.
FIG. 1 is a schematic diagram of how a live broadcast APP in the related art displays a gift. As shown in FIG. 1, a gift is usually displayed as a chat message in the chat window of the live broadcast room, or as a special effect animation in the live broadcast interface. However, while it is displayed, the gift may be covered by frequent chat content and gradually disappear, so the gift is presented only briefly. In addition, the display of the gift has little connection to the content currently being broadcast in the live broadcast room. As a result, gifts given during live video playback are presented in a rather uniform way, which weakens the interaction between the user and the anchor.
Summary of the invention
In view of this, embodiments of the present invention are intended to provide a live broadcast interaction method, apparatus, and storage medium, at least to solve the problem in the related art that it is difficult to effectively improve the interaction between a user and an anchor.
The technical solutions of the embodiments of the present invention are implemented as follows:
In a first aspect, an embodiment of the present invention provides a live broadcast interaction method, the method including:
after receiving a first instruction, determining an acoustic feature parameter corresponding to voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user; and
adjusting, according to the acoustic feature parameter, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In a second aspect, an embodiment of the present invention further provides a live broadcast interaction apparatus, the apparatus including a parsing module and an adjustment module, wherein:
the parsing module is configured to determine, after a first instruction is received, an acoustic feature parameter corresponding to voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user; and
the adjustment module is configured to adjust, according to the acoustic feature parameter, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In a third aspect, an embodiment of the present invention further provides a live broadcast interaction apparatus, including a memory, a processor, and an executable program stored on the memory and executable by the processor, wherein the processor, when running the executable program, performs the steps of the live broadcast interaction method provided by the embodiments of the present invention.
In a fourth aspect, an embodiment of the present invention further provides a storage medium on which an executable program is stored, wherein the executable program, when executed by a processor, implements the steps of the live broadcast interaction method provided by the embodiments of the present invention.
With the live broadcast interaction method, apparatus, and storage medium provided by the embodiments of the present invention, after a first instruction is received, voice information of a target user is collected and parsed to determine the acoustic feature parameter corresponding to that voice information, and the audio track information of the anchor of the live broadcast room that the target user is currently watching is adjusted according to the acoustic feature parameter. Changing the anchor's audio track information according to the acoustic feature parameters specific to the target user not only enhances the presentation of the gift given but also increases the interaction between the user and the anchor, makes the live broadcast more entertaining, and improves user stickiness on the network platform, thereby greatly improving the user experience.
Brief description of the drawings
FIG. 1 is a schematic diagram of how a live broadcast APP in the related art displays a gift;
FIG. 2 is a schematic flowchart of a live broadcast interaction method provided by an embodiment of the present invention;
FIG. 3 is a schematic flowchart of another live broadcast interaction method provided by an embodiment of the present invention;
FIG. 4A to FIG. 4C are schematic diagrams of the display when collecting voice information of a target user according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of the functional structure of a live broadcast interaction apparatus provided by an embodiment of the present invention;
FIG. 6 is a schematic diagram of the functional structure of another live broadcast interaction apparatus provided by an embodiment of the present invention;
FIG. 7 is a schematic diagram of the hardware structure of a live broadcast interaction apparatus provided by an embodiment of the present invention.
Detailed description
To provide a fuller understanding of the features and technical content of the embodiments of the present invention, their implementation is described in detail below with reference to the accompanying drawings. The drawings are for reference and illustration only and are not intended to limit the present invention. Those skilled in the art should understand that the technical solutions described in the embodiments of the present invention may be combined arbitrarily, provided they do not conflict.
FIG. 2 is a schematic flowchart of a live broadcast interaction method provided by an embodiment of the present invention. The method may be applied to a server or a terminal device. As shown in FIG. 2, the method may include the following steps:
Step 201: After receiving a first instruction, collect voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user.
In some embodiments, the first instruction may be a gift-giving instruction that the target user triggers in a designated area of the live broadcast APP, for example by a tap or a swipe in that area. The target user can flexibly choose among several gift-giving modes according to their needs.
In some embodiments, the gift-giving modes include at least one of the following: a normal gift-giving mode and a special effect gift-giving mode.
In some embodiments, collecting the voice information of the target user in step 201 may be implemented as follows: parsing a special effect identifier from the first instruction; entering the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collecting the voice information of the target user by calling an audio collection device.
It should be noted that when a normal identifier is parsed from the first instruction, the normal gift-giving mode is entered in response to the first instruction. In the normal gift-giving mode, the gift given by the target user is displayed in the existing manner, for example as a chat message or as a special effect animation, which is not described again here.
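The mode selection in step 201 can be sketched as follows. This is only an illustration under stated assumptions, not the implementation of the embodiment: the field name `effect_flag`, the values `"normal"` and `"voice_effect"`, and the `record_voice` callback (standing in for the audio collection device) are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical identifier values; the text only says that the instruction
# carries either a normal identifier or a special effect identifier.
NORMAL = "normal"
VOICE_EFFECT = "voice_effect"


@dataclass
class GiftInstruction:
    """Assumed shape of the first instruction (gift-giving instruction)."""
    user_id: str
    room_id: str
    effect_flag: str  # NORMAL or VOICE_EFFECT


def handle_gift_instruction(
    instr: GiftInstruction,
    record_voice: Callable[[], bytes],
) -> Optional[bytes]:
    """Return recorded voice data in special effect mode, else None."""
    if instr.effect_flag == VOICE_EFFECT:
        # Special effect mode: call the audio collection device.
        return record_voice()
    if instr.effect_flag == NORMAL:
        # Normal mode: the gift is shown as a chat message or animation,
        # so no voice is collected.
        return None
    raise ValueError(f"unknown effect flag: {instr.effect_flag!r}")
```

Passing the recorder in as a callback keeps the dispatch logic independent of any particular microphone API.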
Step 202: Parse the voice information of the target user and determine the acoustic feature parameter corresponding to the voice information of the target user.
In some embodiments, the acoustic feature parameter includes acoustic spectrum information.
It should be pointed out that sounds of the same timbre may have many different waveforms, but their acoustic spectrum information is usually the same. The embodiments of the present invention therefore use acoustic spectrum information as the main basis for distinguishing timbres. In other words, to change the anchor's voice through a voice special effect, the acoustic spectrum information of the target user must first be extracted from the collected voice information of the target user.
Here, after the acoustic feature parameter corresponding to the voice information of the target user is determined, the method further includes: associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
In actual implementation, after the acoustic spectrum information of the target user is determined, the association between that acoustic spectrum information and the user account of the target user is saved to an acoustic spectrum information library on the server, so that the accuracy of the extracted acoustic spectrum information can be further verified through the user account of the target user.
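The association between a user account and its acoustic spectrum information can be pictured as a keyed store. The sketch below is a minimal in-memory stand-in for the server-side acoustic spectrum information library; the class name `SpectrumLibrary`, its methods, and the representation of a spectrum as a list of floats are assumptions made for illustration.

```python
from typing import Dict, List, Optional


class SpectrumLibrary:
    """In-memory stand-in for the server's acoustic spectrum library."""

    def __init__(self) -> None:
        self._by_account: Dict[str, List[float]] = {}

    def associate(self, account: str, spectrum: List[float]) -> None:
        # Store a copy so later changes by the caller do not alter the record.
        self._by_account[account] = list(spectrum)

    def lookup(self, account: str) -> Optional[List[float]]:
        # Return a copy, or None when the account has no stored spectrum.
        spectrum = self._by_account.get(account)
        return None if spectrum is None else list(spectrum)
```

A real deployment would presumably persist these records in a database; the dictionary only shows the account-to-spectrum relationship.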
Step 203: Adjust, according to the acoustic feature parameter, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, when the acoustic feature parameter is acoustic spectrum information, the adjustment in step 203 of the audio track information of the anchor of the live broadcast room that the target user is currently watching may be implemented as follows:
parsing a special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
It should be pointed out that, after the first instruction is received and the acoustic spectrum information corresponding to the voice information of the target user is determined, the live broadcast room corresponding to the first instruction, that is, the live broadcast room that the target user is currently watching, is first determined according to the live broadcast room identifier carried in the first instruction, and the anchor of that live broadcast room is determined. The audio track information of that anchor is then adjusted according to the determined acoustic spectrum information and the special effect duration.
Here, the audio track information of the anchor of the live broadcast room that the target user is currently watching may be adjusted within a set time period. The set time period may be chosen according to the actual situation and is not specifically limited here.
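As a rough illustration of the lookup described above, the following sketch resolves the anchor from the live broadcast room identifier and derives the time window in which the effect applies. The names `EffectRequest`, `resolve_effect_window`, and `anchors_by_room`, and the use of seconds as the time unit, are assumptions; the source does not specify how the instruction encodes these values.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class EffectRequest:
    """Assumed fields carried by the first instruction."""
    room_id: str        # live broadcast room identifier
    duration_s: float   # special effect duration, in seconds


def resolve_effect_window(
    request: EffectRequest,
    anchors_by_room: Dict[str, str],
    now_s: float,
) -> Tuple[str, float, float]:
    """Return (anchor id, window start, window end) for the request."""
    if request.duration_s <= 0:
        raise ValueError("special effect duration must be positive")
    if request.room_id not in anchors_by_room:
        raise KeyError(f"unknown live broadcast room: {request.room_id}")
    anchor_id = anchors_by_room[request.room_id]
    # The effect starts now and lasts for the requested duration.
    return anchor_id, now_s, now_s + request.duration_s
```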
In some embodiments, after the audio track information of the anchor of the live broadcast room that the target user is currently watching is adjusted, the method further includes:
parsing a special effect use range from the first instruction, wherein the special effect use range corresponds to the type of gift given; and outputting the adjusted audio track information according to the special effect use range.
Here, the special effect use range includes at least one of the following: all users in the live broadcast room that the target user is currently watching, or some of those users; the latter may be, for example, the target user who triggered the gift-giving instruction.
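The effect of the use range on output can be sketched as a choice of which viewers receive the adjusted audio track. The constants `ALL_USERS` and `SENDER_ONLY` and the function `select_listeners` are hypothetical names; the text states only that the range covers either all users in the room or some of them, such as the gift sender.

```python
from typing import Iterable, List

# Hypothetical encodings of the special effect use range.
ALL_USERS = "all"
SENDER_ONLY = "sender_only"


def select_listeners(
    use_range: str,
    room_users: Iterable[str],
    sender: str,
) -> List[str]:
    """Return the users who should hear the adjusted audio track."""
    if use_range == ALL_USERS:
        return list(room_users)
    if use_range == SENDER_ONLY:
        # "Some users" is illustrated here by the gift sender alone.
        return [sender]
    raise ValueError(f"unknown use range: {use_range!r}")
```

Users outside the returned list would keep receiving the anchor's unmodified audio.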
With the technical solution of the embodiment of the present invention, the voice information of the target user is parsed to determine the corresponding acoustic feature parameter, and the audio track information of the anchor of the live broadcast room that the target user is currently watching is adjusted accordingly. The target user's voice special effect gift thus changes the anchor's audio track information, which enhances the presentation of the gift and the interaction between the user and the anchor, makes the live broadcast more entertaining, and improves user stickiness on the network platform.
In some embodiments, referring to FIG. 3, FIG. 3 is a schematic flowchart of another live broadcast interaction method provided by an embodiment of the present invention. The method may be applied to a server (such as a live broadcast server) or a terminal device. As shown in FIG. 3, the method may include the following steps:
Step 301: After receiving a first instruction, collect voice information of a target user.
In some embodiments, the first instruction represents a gift-giving instruction triggered by the target user. In actual implementation, the first instruction may be a gift-giving instruction that the target user triggers in a designated area of the live broadcast APP, for example by a tap or a swipe in that area. The target user can flexibly choose among several gift-giving modes according to their needs.
In some embodiments, the gift-giving modes include at least one of the following: a normal gift-giving mode and a special effect gift-giving mode.
For example, when the target user selects the normal gift-giving mode, the first instruction carries a normal identifier. The normal identifier is parsed from the first instruction, and the normal gift-giving mode is entered in response to the first instruction. In this mode, the gift given by the target user is displayed in the existing manner shown in FIG. 1, for example as a chat message or as a special effect animation, which is not described again here. When the target user selects the special effect gift-giving mode, the first instruction carries a special effect identifier. The special effect identifier, that is, a voice special effect identifier, is parsed from the first instruction, and the special effect gift-giving mode is entered in response to the first instruction. In this mode, the terminal device calls the audio collection device and displays a prompt that guides the target user to input audio information.
FIG. 4A to FIG. 4C are schematic diagrams of the display when collecting voice information of a target user according to an embodiment of the present invention. As shown in FIG. 4A, the target user first taps the virtual button that triggers "Give a gift". The live broadcast APP then displays the gift-giving mode selection interface shown in FIG. 4B and, depending on the mode the target user selects there (the normal gift-giving mode or the special effect gift-giving mode), displays the corresponding operation interface. For example, when the target user selects the special effect gift-giving mode in FIG. 4B, the live broadcast APP displays the voice input guidance interface shown in FIG. 4C, with a prompt such as "Please record a voice clip through the microphone", to guide the target user to input voice.
Step 302: Parse the voice information of the target user and determine the acoustic spectrum information corresponding to the voice information of the target user.
Here, sound is a wave (an acoustic wave) with a certain oscillation frequency, and a wave has physical parameters such as oscillation frequency, amplitude, and waveform. It is these physical parameters that give sounds their different auditory effects. Classified by the sound characteristics of musical instruments, sound has four forms of expression: pitch, volume, timbre, and tone pattern. Pitch is associated with the oscillation frequency of the wave: the higher the frequency, the higher the pitch, and the lower the frequency, the lower the pitch. Volume is associated with the oscillation amplitude of the wave: the larger the amplitude, the louder the sound, and the smaller the amplitude, the quieter the sound.
Timbre refers to the perceived quality of a sound, and it is timbre that distinguishes the voices of different users. Two sopranos singing the same song still have different timbres. Timbre is determined by the waveform of the sound wave. The simplest waveform is a sine wave; alternating current, for example, has a standard sine waveform. A user's voice, the sounds of musical instruments, and the various sounds in nature, however, usually have more complex waveforms, and it is these differently shaped waveforms that determine the timbre of each sound. Besides being represented by its waveform (the time-domain representation of the sound), the timbre of a sound can also be represented by its spectrum (the frequency-domain representation of the sound). Applying a Fourier transform to the waveform of a sound yields the sound spectrum information corresponding to that segment of the waveform.
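The step from waveform to spectrum can be illustrated with a direct discrete Fourier transform. The sketch below is a generic textbook DFT written with the Python standard library, not the extraction procedure of the embodiment; the function name `magnitude_spectrum` and the test tone are chosen for illustration only.

```python
import cmath
import math
from typing import List


def magnitude_spectrum(samples: List[float]) -> List[float]:
    """Return |X[k]| for k = 0..N//2 using a direct O(N^2) DFT."""
    n = len(samples)
    if n == 0:
        return []
    spectrum = []
    for k in range(n // 2 + 1):
        acc = 0j
        for t, x in enumerate(samples):
            # Correlate the signal with a complex sinusoid of frequency k.
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        spectrum.append(abs(acc))
    return spectrum


# A sine wave completing 4 cycles over 32 samples: its energy lands in bin 4.
N = 32
tone = [math.sin(2 * math.pi * 4 * t / N) for t in range(N)]
mags = magnitude_spectrum(tone)
peak_bin = max(range(len(mags)), key=mags.__getitem__)
```

For a pure tone the spectrum has a single peak. A voice produces many peaks of differing heights, and that pattern is what the text treats as the signature of a timbre. Production code would normally use a fast Fourier transform rather than this quadratic loop.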
需要指出的是,由于同一种音色的声音可能有多种不同的波形,但是同一种音色的声学频谱信息往往是相同的,因而,本发明实施例采用声学频谱信息作为区分声音音色不同的主要依据。也就是说,本发明实施例为了实现通过特效语音改变主播声音的效果,需要预先从采集到的目标用户的语音信息中提取该目标用户的声学频谱信息。It should be pointed out that since the sound of the same tone color may have multiple different waveforms, the acoustic spectrum information of the same tone color is often the same. Therefore, the embodiment of the present invention uses the acoustic spectrum information as the main basis for distinguishing the sound tone color. . That is, in order to implement the effect of changing the anchor sound through the special effect voice in the embodiment of the present invention, it is necessary to extract the acoustic spectrum information of the target user from the collected voice information of the target user in advance.
这里,在采集声学频谱信息时,只需要采集20个基础声学频谱即可,通过这20个基础声学频谱可以组合成400多种声学频谱组合,从而可以通过这400多种声学频谱组合来模拟目标用户的声音。Here, when collecting acoustic spectrum information, only 20 basic acoustic spectrums need to be collected. The 20 basic acoustic spectrums can be combined into more than 400 acoustic spectrum combinations, so that the target can be simulated by these more than 400 acoustic spectrum combinations. The voice of the user.
Here, after the acoustic characteristic parameters corresponding to the voice information of the target user are determined, the method further includes: associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
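A minimal sketch of such an association, assuming an in-memory mapping keyed by account identifier, might look as follows. The class `VoiceProfileStore` and its method names are hypothetical; the embodiments do not specify a storage mechanism.

```python
from dataclasses import dataclass, field

@dataclass
class VoiceProfileStore:
    """Hypothetical in-memory store linking user accounts to acoustic spectra."""
    _profiles: dict = field(default_factory=dict)

    def associate(self, account_id, spectrum):
        # Copy the spectrum so later changes by the caller do not alter the profile.
        self._profiles[account_id] = list(spectrum)

    def lookup(self, account_id):
        # Returns None when no spectrum has been associated with the account.
        return self._profiles.get(account_id)

store = VoiceProfileStore()
store.associate("user-001", [0.5, 0.25, 0.0])
```

Keeping the spectrum with the account would let a later gift reuse it without recording the user's voice again, although the text does not say whether the embodiments do so.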
It should be noted that, when giving a voice special effect gift, the target user may select a different special effect duration or special effect usage scope as needed.
Step 303: Parse the special effect duration from the first instruction.
Step 304: According to the special effect duration and the acoustic spectrum information, adjust the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, after the first instruction is received and the acoustic spectrum information corresponding to the voice information of the target user is determined, the live broadcast room corresponding to the first instruction, that is, the live broadcast room that the target user is currently watching, is first determined according to the live broadcast room identifier carried in the first instruction, and the anchor of that live broadcast room is determined. The audio track information of that anchor is then adjusted according to the determined acoustic spectrum information and the special effect duration. Here, the anchor's audio track information within a set time period may be adjusted. The set time period can be chosen according to the actual situation and is not specifically limited here.
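The parsing and timing part of steps 303 and 304 can be sketched as below, under stated assumptions: the first instruction is taken to be a JSON object, and the field names `room_id`, `user_id`, and `duration_s` are invented for the example. The spectral adjustment of the audio itself is deliberately left out.

```python
import json
from dataclasses import dataclass

@dataclass(frozen=True)
class GiftInstruction:
    room_id: str       # live broadcast room identifier carried in the instruction
    user_id: str       # target user who triggered the gift
    duration_s: float  # special effect duration, in seconds

def parse_instruction(raw):
    """Parse a hypothetical JSON-encoded first instruction."""
    data = json.loads(raw)
    return GiftInstruction(room_id=data["room_id"],
                           user_id=data["user_id"],
                           duration_s=float(data["duration_s"]))

def effect_window(instruction, now_s):
    """Return the (start, end) interval during which the anchor's track is adjusted."""
    return (now_s, now_s + instruction.duration_s)

def should_adjust(window, timestamp_s):
    """True while an audio frame's timestamp falls inside the effect window."""
    start, end = window
    return start <= timestamp_s < end
```

A caller would check `should_adjust` for each audio frame of the anchor's stream in the identified room and apply the voice effect only to frames inside the window.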
Step 305: Parse the special effect usage scope from the first instruction, and output the adjusted audio track information according to the special effect usage scope.
Here, the special effect usage scope corresponds to the type of gift given. The special effect usage scope includes at least one of the following: all users in the live broadcast room that the target user is currently watching, or some of those users; the latter may be, for example, the target user who triggered the gift-giving instruction.
For example, voice special effect gifts are divided into two types, "A" and "B". The usage scope of a type "A" gift covers all users in the live broadcast room: when the target user sends a type "A" voice special effect gift, every user in the room hears the voice special effect. When the target user sends a type "B" voice special effect gift, only the gift giver, that is, the target user, hears the voice special effect; other users do not.
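The "A"/"B" example above can be sketched as a small routing function. The function name `recipients_for` and the use of string identifiers are assumptions made for the illustration.

```python
def recipients_for(gift_type, sender_id, room_user_ids):
    """Return the set of users who should receive the adjusted audio track."""
    if gift_type == "A":
        # Type "A": every user in the live broadcast room hears the effect.
        return set(room_user_ids)
    if gift_type == "B":
        # Type "B": only the gift giver (the target user) hears the effect.
        return {sender_id}
    raise ValueError(f"unknown gift type: {gift_type!r}")

room = ["u1", "u2", "u3"]
everyone = recipients_for("A", "u1", room)
only_sender = recipients_for("B", "u1", room)
```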
With the technical solution of the embodiments of the present invention, while the target user watches a live video in a live broadcast APP, the target user may choose to give the anchor a voice special effect gift when the target user is satisfied with the anchor's content, that is, when the degree of satisfaction exceeds a set threshold. The target user inputs a piece of voice information through a voice input device (such as a microphone); the voice information is analyzed to determine the corresponding acoustic spectrum information; and the anchor's audio track information within the following set time period is adjusted according to that acoustic spectrum information. The voice special effect gift thereby changes the anchor's voice, which enhances the presentation of the gift and the interaction between the user and the anchor, makes the live broadcast more entertaining, and improves user stickiness on the network platform.
To implement the above live broadcast interaction method, an embodiment of the present invention further provides a live broadcast interaction apparatus, which can be applied in a server or a terminal device. FIG. 5 is a schematic diagram of the functional structure of a live broadcast interaction apparatus provided by an embodiment of the present invention. As shown in FIG. 5, the live broadcast interaction apparatus includes a collection module 51, an analysis module 52, and an adjustment module 53. The functions of these program modules are described in detail below.
The collection module 51 is configured to collect voice information of a target user after a first instruction is received, where the first instruction represents a gift-giving instruction triggered by the target user;
The analysis module 52 is configured to parse the voice information of the target user and determine the acoustic characteristic parameters corresponding to the voice information of the target user;
The adjustment module 53 is configured to adjust, according to the acoustic characteristic parameters, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, the collection module 51 may collect the voice information of the target user as follows: parse a special effect identifier from the first instruction; enter the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collect the voice information of the target user by invoking an audio collection device.
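The three collection steps can be sketched as follows, assuming the first instruction is available as a dictionary and the audio collection device is represented by a callable. The identifier `voice_fx`, the mode name, and the `effect_id` field are hypothetical.

```python
# Hypothetical mapping from special effect identifiers to gift-giving modes.
EFFECT_MODES = {"voice_fx": "voice_effect_gift_mode"}

def collect_voice(instruction, record_audio):
    """Parse the effect identifier, enter the matching mode, then record audio.

    `record_audio` stands in for the audio collection device (e.g. a microphone
    wrapper) and is only invoked when a matching mode exists.
    """
    effect_id = instruction["effect_id"]  # step 1: parse the identifier
    mode = EFFECT_MODES.get(effect_id)    # step 2: select the gift-giving mode
    if mode is None:
        return None                       # not a voice effect gift: nothing to record
    return record_audio()                 # step 3: invoke the audio collection device
```

Passing the recorder in as a callable keeps the sketch independent of any particular audio API.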
Here, the acoustic characteristic parameters may include acoustic spectrum information.
In some embodiments, FIG. 6 is a schematic diagram of the functional structure of another live broadcast interaction apparatus provided by an embodiment of the present invention. As shown in FIG. 6, the live broadcast interaction apparatus may further include:
an association module 54, configured to associate the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user after the analysis module 52 determines the acoustic characteristic parameters corresponding to the voice information of the target user.
The adjustment module 53 may adjust, according to the acoustic characteristic parameters, the audio track information of the anchor of the live broadcast room that the target user is currently watching as follows: parse the special effect duration from the first instruction; and adjust that audio track information according to the special effect duration and the acoustic spectrum information.
In some embodiments, the analysis module 52 is further configured to parse the special effect usage scope from the first instruction after the adjustment module 53 adjusts the audio track information of the anchor of the live broadcast room that the target user is currently watching, where the special effect usage scope corresponds to the type of gift given;
The live broadcast interaction apparatus further includes:
an output module 55, configured to output the adjusted audio track information according to the special effect usage scope.
It should be noted that, when the live broadcast interaction apparatus provided by the above embodiments implements live interaction between a user and an anchor, the division into the program modules described above is only an example. In practical applications, the processing may be allocated to different program modules as needed; that is, the internal structure of the live broadcast interaction apparatus may be divided into different program modules to complete all or part of the processing described above. In addition, the live broadcast interaction apparatus provided by the above embodiments and the embodiments of the live broadcast interaction method belong to the same concept; for the specific implementation process, see the method embodiments, which are not repeated here.
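Because the module division is stated to be only an example, one way to sketch it is to treat each module as an interchangeable callable. The class `LiveInteractionDevice`, its field names, and the `handle` method are assumptions for illustration; the numeric comments map the fields to modules 51 to 55.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class LiveInteractionDevice:
    collect: Callable[[dict], List[float]]                     # collection module 51
    analyze: Callable[[List[float]], List[float]]              # analysis module 52
    adjust: Callable[[List[float], List[float]], List[float]]  # adjustment module 53
    associate: Callable[[str, List[float]], None]              # association module 54
    output: Callable[[List[float], dict], List[float]]         # output module 55

    def handle(self, instruction, anchor_track):
        """Run one gift-giving instruction through the module chain."""
        voice = self.collect(instruction)
        params = self.analyze(voice)
        self.associate(instruction["user_id"], params)
        adjusted = self.adjust(anchor_track, params)
        return self.output(adjusted, instruction)
```

Swapping any callable for another implementation changes the module division without touching `handle`, which mirrors the flexibility described in the paragraph above.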
In practical applications, the collection module 51, the analysis module 52, the adjustment module 53, and the association module 54 of the live broadcast interaction apparatus may each be implemented by a central processing unit (CPU), a microprocessor unit (MPU), a digital signal processor (DSP), a field programmable gate array (FPGA), or the like located on a server or a terminal device. The output module 55 may be implemented in practice by a communication module (including a basic communication suite, an operating system, a communication module, standardized interfaces and protocols, and so on) together with a transceiver antenna.
To implement the above live broadcast interaction method, an embodiment of the present invention further provides a hardware structure of a live broadcast interaction apparatus. A live broadcast interaction apparatus implementing an embodiment of the present invention will now be described with reference to the drawings. The apparatus may be implemented in various forms, for example as a server (such as a live broadcast server), a terminal device (such as a desktop computer, a notebook computer, or a smartphone), or another type of computer device. The hardware structure of the live broadcast interaction apparatus is further described below. It can be understood that FIG. 7 shows only an exemplary structure of the apparatus rather than its entire structure, and that part or all of the structure shown in FIG. 7 may be implemented as needed.
Referring to FIG. 7, FIG. 7 is a schematic diagram of the hardware structure of a live broadcast interaction apparatus provided by an embodiment of the present invention, which in practice can be applied to the aforementioned servers or terminal devices that run application programs. The live broadcast interaction apparatus 700 shown in FIG. 7 includes at least one processor 701, a memory 702, a user interface 703, and at least one network interface 704. The components of the live broadcast interaction apparatus 700 are coupled together through a bus system 705. It can be understood that the bus system 705 is used to implement connection and communication between these components. In addition to a data bus, the bus system 705 includes a power bus, a control bus, and a status signal bus. For clarity, however, all of these buses are labeled as the bus system 705 in FIG. 7.
The user interface 703 may include a display, a keyboard, a mouse, a trackball, a click wheel, keys, buttons, a touch pad, a touch screen, or the like.
It can be understood that the memory 702 may be a volatile memory or a non-volatile memory, and may also include both volatile and non-volatile memory.
The memory 702 in the embodiments of the present invention is configured to store various types of data to support the operation of the live broadcast interaction apparatus 700. Examples of such data include any computer program to be run on the live broadcast interaction apparatus 700, such as an executable program 7021 and an operating system 7022. A program implementing the live broadcast interaction method of the embodiments of the present invention may be contained in the executable program 7021.
The live broadcast interaction method disclosed in the embodiments of the present invention may be applied in the processor 701, or implemented by the processor 701. The processor 701 may be an integrated circuit chip with signal processing capability. During implementation, the steps of the live broadcast interaction method may be completed by integrated logic circuits of hardware in the processor 701 or by instructions in the form of software. The processor 701 may be a general-purpose processor, a DSP, another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The processor 701 may implement or execute the live broadcast interaction methods, steps, and logic block diagrams provided in the embodiments of the present invention. The general-purpose processor may be a microprocessor or any conventional processor. The steps of the live broadcast interaction method provided by the embodiments of the present invention may be executed directly by a hardware decoding processor, or by a combination of hardware and software modules in a decoding processor. The software modules may be located in a storage medium, and the storage medium is located in the memory 702. The processor 701 reads the information in the memory 702 and completes the steps of the live broadcast interaction method provided by the embodiments of the present invention in combination with its hardware.
In some embodiments, the live broadcast interaction apparatus 700 includes a memory 702, a processor 701, and an executable program 7021 stored in the memory 702 and runnable by the processor 701. When running the executable program 7021, the processor 701 implements the following: after a first instruction is received, collecting voice information of a target user, where the first instruction represents a gift-giving instruction triggered by the target user; parsing the voice information of the target user and determining the acoustic characteristic parameters corresponding to the voice information of the target user; and adjusting, according to the acoustic characteristic parameters, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, when running the executable program 7021, the processor 701 implements the following: parsing a special effect identifier from the first instruction; entering the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collecting the voice information of the target user by invoking an audio collection device.
In some embodiments, when running the executable program 7021, the processor 701 implements the following: the acoustic characteristic parameters include acoustic spectrum information; after the acoustic characteristic parameters corresponding to the voice information of the target user are determined, associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
In some embodiments, when running the executable program 7021, the processor 701 implements the following: parsing the special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, when running the executable program 7021, the processor 701 implements the following: after the audio track information of the anchor of the live broadcast room that the target user is currently watching is adjusted, parsing the special effect usage scope from the first instruction, where the special effect usage scope corresponds to the type of gift given; and outputting the adjusted audio track information according to the special effect usage scope.
In an exemplary embodiment, an embodiment of the present invention further provides a storage medium. The storage medium may be an optical disc, a flash memory, a magnetic disk, or the like, and is optionally a non-transitory storage medium. In some embodiments, an executable program 7021 is stored on the storage medium, and when executed by the processor 701, the executable program 7021 implements the following: after a first instruction is received, collecting voice information of a target user, where the first instruction represents a gift-giving instruction triggered by the target user; parsing the voice information of the target user and determining the acoustic characteristic parameters corresponding to the voice information of the target user; and adjusting, according to the acoustic characteristic parameters, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, when executed by the processor 701, the executable program 7021 implements the following: parsing a special effect identifier from the first instruction; entering the corresponding special effect gift-giving mode according to the special effect identifier; and, in the special effect gift-giving mode, collecting the voice information of the target user by invoking an audio collection device.
In some embodiments, when executed by the processor 701, the executable program 7021 implements the following: the acoustic characteristic parameters include acoustic spectrum information; after the acoustic characteristic parameters corresponding to the voice information of the target user are determined, associating the acoustic spectrum information corresponding to the voice information of the target user with the user account of the target user.
In some embodiments, when executed by the processor 701, the executable program 7021 implements the following: parsing the special effect duration from the first instruction; and adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
In some embodiments, when executed by the processor 701, the executable program 7021 implements the following: after the audio track information of the anchor of the live broadcast room that the target user is currently watching is adjusted, parsing the special effect usage scope from the first instruction, where the special effect usage scope corresponds to the type of gift given; and outputting the adjusted audio track information according to the special effect usage scope.
In summary, the live broadcast interaction method provided by the embodiments of the present invention changes the anchor's audio track information by means of acoustic characteristic parameters specific to the target user. This not only enhances the presentation of the gift given, but also strengthens the interaction between the user and the anchor, makes the live broadcast more entertaining, and improves user stickiness on the network platform.
Those skilled in the art should understand that the embodiments of the present invention may be provided as a method, a system, or an executable program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of an executable program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, optical storage, and the like) containing computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and executable program products according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks therein, can be implemented by executable program instructions. These executable program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These executable program instructions may also be stored in a computer-readable memory capable of directing a computer or another programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These executable program instructions may also be loaded onto a computer or another programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing. The instructions executed on the computer or other programmable device thus provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
The above are only preferred embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (14)

  1. A live broadcast interaction method, the method comprising:
    after receiving a first instruction, determining acoustic characteristic parameters corresponding to voice information of a target user, wherein the first instruction represents a gift-giving instruction triggered by the target user;
    adjusting, according to the acoustic characteristic parameters, audio track information of an anchor of a live broadcast room that the target user is currently watching.
  2. The live broadcast interaction method according to claim 1, the method further comprising:
    collecting the voice information of the target user;
    correspondingly, the determining acoustic characteristic parameters corresponding to voice information of a target user comprises:
    parsing the voice information of the target user to obtain the acoustic characteristic parameters corresponding to the voice information of the target user.
  3. The live broadcast interaction method according to claim 1, wherein the method further comprises:
    parsing a special effect identifier from the first instruction;
    entering a corresponding special effect gift-giving mode according to the special effect identifier;
    in the special effect gift-giving mode, collecting the voice information of the target user by invoking an audio collection device.
  4. The live broadcast interaction method according to claim 1, wherein the acoustic characteristic parameters comprise acoustic spectrum information;
    after the determining of the acoustic characteristic parameters corresponding to the voice information of the target user, the method further comprises:
    associating the acoustic spectrum information corresponding to the voice information of the target user with a user account of the target user.
  5. The live broadcast interaction method according to claim 4, wherein the adjusting, according to the acoustic characteristic parameters, audio track information of an anchor of a live broadcast room that the target user is currently watching comprises:
    parsing a special effect duration from the first instruction;
    adjusting, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  6. The live broadcast interaction method according to claim 1, wherein after the adjusting of the audio track information of the anchor of the live broadcast room that the target user is currently watching, the method further comprises:
    parsing a special effect usage scope from the first instruction, wherein the special effect usage scope corresponds to the type of gift given;
    outputting the adjusted audio track information according to the special effect usage scope.
  7. 一种直播互动装置,所述装置包括:解析模块和调整模块;其中,A live broadcast interactive device, the device includes: an analysis module and an adjustment module;
    所述解析模块,配置为在接收到第一指令后,确定所述目标用户的语音信息对应的声学特征参数;其中,所述第一指令用于表征由所述目标用户触发的礼物赠送指令;The analysis module is configured to determine an acoustic characteristic parameter corresponding to the voice information of the target user after receiving the first instruction; wherein the first instruction is used to characterize a gift gift instruction triggered by the target user;
    所述调整模块,配置为根据所述声学特征参数,调整所述目标用户当前正在观看的直播间主播的音轨信息。The adjusting module is configured to adjust, according to the acoustic characteristic parameter, the track information of the anchor of the live room that the target user is currently watching.
  8. 根据权利要求7所述的直播互动装置,其中,所述装置还包括采集模块;The live broadcast interactive device according to claim 7, wherein the device further comprises an acquisition module;
    所述采集模块,配置为采集所述目标用户的语音信息;The acquisition module is configured to collect voice information of the target user;
    所述解析模块,还配置为解析所述目标用户的语音信息,得到所述目标用户的语音信息对应的声学特征参数。The analysis module is further configured to parse the voice information of the target user to obtain acoustic characteristic parameters corresponding to the voice information of the target user.
  9. 根据权利要求7所述的直播互动装置,其中,所述装置还包括:The live broadcast interactive device according to claim 7, wherein the device further comprises:
    采集模块,配置为:从所述第一指令中解析出特效标识;An acquisition module configured to parse a special effect identifier from the first instruction;
    根据所述特效标识,选择进入相应的特效礼物赠送模式;Selecting to enter a corresponding special effect gift giving mode according to the special effect identification;
    在所述特效礼物赠送模式下,通过调用音频采集设备采集所述目标用户的语音信息。In the special-effect gift-giving mode, voice information of the target user is collected by calling an audio collection device.
  10. The live broadcast interaction apparatus according to claim 7, wherein the acoustic characteristic parameter comprises acoustic spectrum information, and the apparatus further comprises:
    an association module configured to associate, after the parsing module determines the acoustic characteristic parameter corresponding to the voice information of the target user, the acoustic spectrum information corresponding to the voice information of the target user with a user account of the target user.
  11. The live broadcast interaction apparatus according to claim 10, wherein the adjustment module is further configured to:
    parse a special effect duration from the first instruction; and
    adjust, according to the special effect duration and the acoustic spectrum information, the audio track information of the anchor of the live broadcast room that the target user is currently watching.
  12. The live broadcast interaction apparatus according to claim 7, wherein:
    the parsing module is further configured to parse a special effect usage scope from the first instruction after the adjustment module adjusts the audio track information of the anchor of the live broadcast room that the target user is currently watching, wherein the special effect usage scope corresponds to the type of gift given; and
    the apparatus further comprises:
    an output module configured to output the adjusted audio track information according to the special effect usage scope.
  13. A live broadcast interaction apparatus, comprising a memory, a processor, and an executable program stored in the memory and runnable by the processor, wherein the processor, when running the executable program, performs the steps of the live broadcast interaction method according to any one of claims 1 to 6.
  14. A storage medium storing an executable program, wherein the executable program, when executed by a processor, implements the steps of the live broadcast interaction method according to any one of claims 1 to 6.
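
The module split recited in claims 7 to 11 can be pictured with a minimal Python sketch. It is illustrative only and is not the applicant's implementation: the class names, the `GiftInstruction` fields, and the `handle_gift` helper are hypothetical; the "acoustic characteristic parameter" is reduced to a single dominant frequency; and the adjustment is a crude nearest-neighbour resampling that stands in for whatever voice conversion a real system would use.

```python
import cmath
import math
from dataclasses import dataclass


@dataclass
class GiftInstruction:
    """Hypothetical shape of the 'first instruction'; field names are assumptions."""
    user_account: str
    effect_id: str
    duration_s: float  # special effect duration (claim 11)


def magnitude_spectrum(samples):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo signals)."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(samples)))
        for k in range(n // 2 + 1)
    ]


def dominant_frequency(samples, sample_rate):
    """Frequency in Hz of the strongest non-DC bin."""
    mags = magnitude_spectrum(samples)
    peak_bin = max(range(1, len(mags)), key=lambda k: mags[k])
    return peak_bin * sample_rate / len(samples)


class ParsingModule:
    def parse_voice(self, voice, sample_rate):
        # Assumption: one number stands in for the acoustic spectrum information.
        return {"dominant_hz": dominant_frequency(voice, sample_rate)}


class AssociationModule:
    def __init__(self):
        self.by_account = {}

    def associate(self, account, params):
        # Claim 10: tie the spectrum information to the user account.
        self.by_account[account] = params


class AdjustmentModule:
    def adjust(self, track, sample_rate, params, duration_s):
        """Shift the pitch of the first duration_s seconds toward the user's pitch."""
        n_effect = min(len(track), int(duration_s * sample_rate))
        if n_effect < 2:
            return list(track)
        head, tail = track[:n_effect], track[n_effect:]
        ratio = params["dominant_hz"] / dominant_frequency(head, sample_rate)
        # Reading the samples faster raises the pitch and shortens the segment.
        out_len = int(n_effect / ratio)
        shifted = [head[min(n_effect - 1, int(i * ratio))] for i in range(out_len)]
        return shifted + tail


def handle_gift(instr, voice, track, sample_rate, registry):
    """On receipt of the gift instruction: parse, associate, then adjust."""
    params = ParsingModule().parse_voice(voice, sample_rate)
    registry.associate(instr.user_account, params)
    return AdjustmentModule().adjust(track, sample_rate, params, instr.duration_s)
```

Called with a short recording of the user's voice and a buffer of the anchor's audio, `handle_gift` returns a track whose first `duration_s` seconds have been pitch-shifted and whose remainder is untouched. The special effect usage scope and output module of claim 12 are left out of the sketch.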
PCT/CN2019/105771 2018-09-12 2019-09-12 Live broadcast interaction method and apparatus, and storage medium WO2020052665A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811063369.1 2018-09-12
CN201811063369.1A CN109348274B (en) 2018-09-12 2018-09-12 Live broadcast interaction method and device and storage medium

Publications (1)

Publication Number Publication Date
WO2020052665A1 (en) 2020-03-19

Family

ID=65305258

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/105771 WO2020052665A1 (en) 2018-09-12 2019-09-12 Live broadcast interaction method and apparatus, and storage medium

Country Status (2)

Country Link
CN (1) CN109348274B (en)
WO (1) WO2020052665A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109348274B (en) * 2018-09-12 2021-03-23 咪咕音乐有限公司 Live broadcast interaction method and device and storage medium
CN110119264B (en) * 2019-05-21 2023-03-31 北京达佳互联信息技术有限公司 Sound effect adjusting method, device and storage medium
CN111988655A (en) * 2019-05-22 2020-11-24 西安诺瓦星云科技股份有限公司 Program playing method and device and program playing system
CN110989910A (en) * 2019-11-28 2020-04-10 广州虎牙科技有限公司 Interaction method, system, device, electronic equipment and storage medium
CN111314788A (en) * 2020-03-13 2020-06-19 广州华多网络科技有限公司 Voice password returning method and presenting method, device and equipment for voice gift
CN112533053B (en) * 2020-11-30 2022-08-23 北京达佳互联信息技术有限公司 Live broadcast interaction method and device, electronic equipment and storage medium
CN113014477A (en) * 2021-03-18 2021-06-22 广州市百果园信息技术有限公司 Gift processing method, device and equipment of voice platform and storage medium
CN113596596A (en) * 2021-07-27 2021-11-02 百果园技术(新加坡)有限公司 Gift rewarding system, method, device and medium for live broadcast application
CN113613033A (en) * 2021-08-03 2021-11-05 广州繁星互娱信息科技有限公司 Method, device, electronic equipment and medium for live broadcast interaction between audience and anchor

Citations (6)

Publication number Priority date Publication date Assignee Title
US20130173253A1 (en) * 2012-01-02 2013-07-04 International Business Machines Corporation Speech effects
CN106331736A (en) * 2016-08-24 2017-01-11 武汉斗鱼网络科技有限公司 Live client speech processing system and processing method thereof
CN106375864A (en) * 2016-08-25 2017-02-01 广州华多网络科技有限公司 Virtual good distribution control method and apparatus, and mobile terminal
CN106507207A (en) * 2016-10-31 2017-03-15 北京小米移动软件有限公司 Interactive method and device in live application
CN107093421A (en) * 2017-04-20 2017-08-25 深圳易方数码科技股份有限公司 A kind of speech simulation method and apparatus
CN109348274A (en) * 2018-09-12 2019-02-15 咪咕音乐有限公司 A kind of living broadcast interactive method, apparatus and storage medium

Family Cites Families (14)

Publication number Priority date Publication date Assignee Title
JP5479823B2 (en) * 2009-08-31 2014-04-23 ローランド株式会社 Effect device
CN102881283B (en) * 2011-07-13 2014-05-28 三星电子(中国)研发中心 Method and system for processing voice
CN105488135B (en) * 2015-11-25 2019-11-15 广州酷狗计算机科技有限公司 Live content classification method and device
CN105872838A (en) * 2016-04-28 2016-08-17 徐文波 Sending method and device of special media effects of real-time videos
CN107483986A (en) * 2017-06-30 2017-12-15 武汉斗鱼网络科技有限公司 A kind of method and system of gifts
CN107277637A (en) * 2017-08-18 2017-10-20 上海东方明珠新媒体股份有限公司 Medium living broadcast shopping interactive device, medium living broadcast shopping interactive system and method
CN107396177B (en) * 2017-08-28 2020-06-02 北京小米移动软件有限公司 Video playing method, device and storage medium
CN107481735A (en) * 2017-08-28 2017-12-15 中国移动通信集团公司 A kind of method, server and the computer-readable recording medium of transducing audio sounding
CN107682729A (en) * 2017-09-08 2018-02-09 广州华多网络科技有限公司 It is a kind of based on live interactive approach and live broadcast system, electronic equipment
CN107818792A (en) * 2017-10-25 2018-03-20 北京奇虎科技有限公司 Audio conversion method and device
CN107767879A (en) * 2017-10-25 2018-03-06 北京奇虎科技有限公司 Audio conversion method and device based on tone color
CN108040285B (en) * 2017-11-15 2019-12-06 上海掌门科技有限公司 Video live broadcast picture adjusting method, computer equipment and storage medium
CN107959882B (en) * 2017-12-12 2019-12-13 广东小天才科技有限公司 Voice conversion method, device, terminal and medium based on video watching record
CN108198566B (en) * 2018-01-24 2021-07-20 咪咕文化科技有限公司 Information processing method and device, electronic device and storage medium

Also Published As

Publication number Publication date
CN109348274B (en) 2021-03-23
CN109348274A (en) 2019-02-15

Similar Documents

Publication Publication Date Title
WO2020052665A1 (en) Live broadcast interaction method and apparatus, and storage medium
WO2019112858A1 (en) Streaming radio with personalized content integration
US11887619B2 (en) Method and apparatus for detecting similarity between multimedia information, electronic device, and storage medium
CN104123938A (en) Voice control system, electronic device and voice control method
US20170060520A1 (en) Systems and methods for dynamically editable social media
US11511200B2 (en) Game playing method and system based on a multimedia file
US20150317699A1 (en) Method, apparatus, device and system for inserting audio advertisement
CN104866275B (en) Method and device for acquiring image information
CN104980773A (en) Streaming media processing method and device, terminal and server
CN108322791B (en) Voice evaluation method and device
CN107450874B (en) Multimedia data double-screen playing method and system
CN110718239A (en) Audio processing method and device, electronic equipment and storage medium
CN110602553B (en) Audio processing method, device, equipment and storage medium in media file playing
US8660845B1 (en) Automatic separation of audio data
US20190377540A1 (en) Calibrating audio output device with playback of adjusted audio
CN105635418A (en) Method and device for cutting bell
CN109410972B (en) Method, device and storage medium for generating sound effect parameters
CN110503979B (en) Audio output effect monitoring method, device, medium and electronic equipment
WO2023005193A1 (en) Subtitle display method and device
CN109147819A (en) Audio-frequency information processing method, device and storage medium
EP3909046B1 (en) Determining a light effect based on a degree of speech in media content
WO2016197471A1 (en) Multimedia content sending, generating, transmitting and playing method, and corresponding device
Lorho Perceptual evaluation of mobile multimedia loudspeakers
CN111739510A (en) Information processing method, information processing apparatus, vehicle, and computer storage medium
CN111506765A (en) Method and device for controlling music playing rhythm, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19859510

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19859510

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 13/10/2021)