CN109120947A - A kind of the voice private chat method and client of direct broadcasting room - Google Patents

A kind of the voice private chat method and client of direct broadcasting room Download PDF

Info

Publication number
CN109120947A
CN109120947A CN201811031975.5A CN201811031975A CN109120947A CN 109120947 A CN109120947 A CN 109120947A CN 201811031975 A CN201811031975 A CN 201811031975A CN 109120947 A CN109120947 A CN 109120947A
Authority
CN
China
Prior art keywords
voice
private chat
voice messaging
chat
private
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811031975.5A
Other languages
Chinese (zh)
Inventor
潘璠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Beijing Youku Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Youku Technology Co Ltd filed Critical Beijing Youku Technology Co Ltd
Priority to CN201811031975.5A priority Critical patent/CN109120947A/en
Publication of CN109120947A publication Critical patent/CN109120947A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/239Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests
    • H04N21/2393Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests involving handling client requests
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Abstract

The application embodiment discloses the voice private chat method and client of a kind of direct broadcasting room, wherein the described method includes: initiating the private chat request for being directed toward target user to voice server;The private chat voice messaging of the initiator is acquired, and the private chat voice messaging of acquisition is uploaded to the voice server, so that the voice server provides the private chat voice messaging to the target user by private chat channel;The private chat voice messaging provided by the target user that the voice server is sent by the private chat channel is provided, and receives the group chat voice messaging for the other users for being in same live streaming group with the initiator that the voice server is sent;After private chat voice messaging that the target user provides and the group chat voice messaging are synthesized one voice flow, the voice flow of synthesis is played.Technical solution provided by the present application can be improved the convenience linked up with other users.

Description

A kind of the voice private chat method and client of direct broadcasting room
Technical field
This application involves Internet technical field, in particular to the voice private chat method and client of a kind of direct broadcasting room.
Background technique
With the rise of net cast, a large amount of net cast platform is emerged.In net cast platform, it can divide Between multiple net casts, usually presided over by main broadcaster between these net casts.Live content can be pushed to direct broadcast service by main broadcaster Device, then the user in net cast can download from direct broadcast server and watch the live content between the net cast.
Currently, user is in watching video live broadcast, if it is desired to individually linked up with other users, usually can with think The target user to be linked up jumps to an idle direct broadcasting room together, can be linked up by voice in the direct broadcasting room. Further, it is also possible to by way of text chat, by sending private chat text to target user, to realize with single user's It links up.
However, on the one hand will affect the net cast that user is currently viewed by way of jumping to idle direct broadcasting room On the other hand content will increase the operation complexity of user, in addition, equally will increase the behaviour of user in the way of communication text Make complexity.Therefore, when currently individually being linked up in direct broadcasting room with other users, operation can be faced and not enough easily asked Topic.
Summary of the invention
The purpose of the application embodiment is to provide the voice private chat method and client of a kind of direct broadcasting room, can be improved with The convenience that other users are linked up.
To achieve the above object, the application embodiment provides a kind of voice private chat method of direct broadcasting room, the method packet It includes: the private chat request for being directed toward target user is initiated to voice server, so that the voice server is requested in the private chat Initiator and the target user between establish private chat channel;The private chat voice messaging of the initiator is acquired, and will acquisition The private chat voice messaging be uploaded to the voice server so that the voice server by the private chat channel to The target user provides the private chat voice messaging;Receive the voice server by the private chat channel send by institute The private chat voice messaging of target user's offer is stated, and receive that the voice server sends is in the initiator with always Broadcast the group chat voice messaging of the other users of group;The private chat voice messaging that the target user is provided and group chat voice letter After breath synthesizes one voice flow, the voice flow of synthesis is played.
To achieve the above object, the application embodiment also provides a kind of client, and the client includes: private chat request Unit is initiated, the private chat request for initiating to be directed toward target user to voice server, so that the voice server is in institute It states and establishes private chat channel between the initiator and the target user of private chat request;Private chat voice collecting unit, for acquiring The private chat voice messaging of initiator is stated, and the private chat voice messaging of acquisition is uploaded to the voice server, so that The voice server provides the private chat voice messaging to the target user by the private chat channel;Voice messaging receives Unit, the private chat voice provided by the target user sent for receiving the voice server by the private chat channel Information, and receive the group chat voice for the other users that same live streaming group is in the initiator that the voice server is sent Information;Voice flow synthesis unit, private chat voice messaging and the group chat voice messaging for providing the target user close After one voice flow, the voice flow of synthesis is played.
To achieve the above object, the application embodiment also provides a kind of client, the client include processor and Memory, the memory is for storing computer program, when the computer program is executed by the processor, realizes above-mentioned Method.
Therefore technical solution provided by the present application, when user direct broadcasting room watch be broadcast live when, if necessary to and target User is individually linked up, then can initiate to request for the private chat of the target user to voice server.Voice server After receiving private chat request, private chat channel can be established between the user and target user.The private chat channel can be used for Transmit the private chat voice messaging between the two users.Meanwhile voice server can also receive each use in same live streaming group The group chat voice messaging at family, the group chat voice messaging and private chat voice messaging be received and dispatched by different channels, therefore that This will not influence each other, to guarantee the privacy of private chat voice messaging.The initiator of private chat request can pass through local record Sound equipment acquires private chat voice messaging, and uploads the private chat voice messaging to voice server, so that voice server is logical The private chat channel established before crossing sends the private chat voice messaging to target user.Similarly, target user can also be to the hair It plays side and sends private chat voice messaging.Initiator can receive voice after receiving the private chat voice messaging of target user simultaneously Server mentions the group chat voice messaging of other users in the same live streaming group sent.The client of initiator can will receive Private chat voice messaging and group chat voice messaging are synthesized, to export both voice messagings by the same loudspeaker.This Sample, user is when carrying out the communication of independent voice with other users, additionally it is possible to hear other voice messagings in direct broadcasting room, not only not Will affect viewing live content, the complexity of user's operation can also be simplified by way of voice-enabled chat, thus improve with The convenience that other users are linked up.
Detailed description of the invention
It, below will be to embodiment in order to illustrate more clearly of the application embodiment or technical solution in the prior art Or attached drawing needed to be used in the description of the prior art is briefly described, it should be apparent that, the accompanying drawings in the following description is only It is some embodiments as described in this application, for those of ordinary skill in the art, in not making the creative labor property Under the premise of, it is also possible to obtain other drawings based on these drawings.
Fig. 1 is the live broadcast system schematic diagram that voice connects wheat in the application embodiment;
Fig. 2 is the voice private chat method and step figure of direct broadcasting room in the application embodiment;
Fig. 3 is the schematic diagram of chat interface in the application embodiment;
Fig. 4 is the functional block diagram of client in the application embodiment;
Fig. 5 is the structural schematic diagram of client in the application embodiment.
Specific embodiment
In order to make those skilled in the art better understand the technical solutions in the application, below in conjunction with the application reality The attached drawing in mode is applied, the technical solution in the application embodiment is clearly and completely described, it is clear that described Embodiment is only a part of embodiment of the application, rather than whole embodiments.Based on the embodiment party in the application Formula, every other embodiment obtained by those of ordinary skill in the art without making creative efforts, is all answered When the range for belonging to the application protection.
The application provides a kind of voice private chat method of direct broadcasting room, and this method can be applied in system as shown in Figure 1. Referring to Fig. 1, video living transmission system may include voice server, direct broadcast server and client.Wherein, the client It can be the terminal device that user uses, in the terminal device, can have net cast software, and the terminal is set The standby microphone that can above have the voice messaging for including user.In addition, the client can also refer to that the terminal is set The net cast software of standby middle operation.The net cast software can call the microphone on the terminal device, to include The voice messaging of user.The voice server can be used for receiving the voice messaging for the user that each client uploads, and can These voice messagings according to preset stream media protocol, are converted to voice flow.The direct broadcast server, then can receive master The live content that the terminal device broadcast is sent, and the live content can be converted to live streaming audio/video flow.
Referring to Fig. 2, the voice private chat method of direct broadcasting room provided by the present application may comprise steps of.
S1: the private chat request for being directed toward target user is initiated to voice server, so that the voice server is described Private chat channel is established between the initiator and the target user of private chat request.
In the present embodiment, it when user watches live content in direct broadcasting room, can be used with the part in the direct broadcasting room Same live streaming group is added in family.Different user in same live streaming group can carry out mutual communication by way of voice.Tool Body, user can open the function that voice in group connects wheat.In the case that voice connects the unlatching of wheat function in organizing, the Mike of user Wind can acquire the voice messaging of user in real time.The voice messaging of acquisition can be uploaded to voice service by the client of user Device.In voice server, voice messaging can be converted to the voice flow of user according to preset stream media protocol.This is pre- If stream media protocol for example can be HLS (HTTP Live Streaming, HTTP live stream) agreement.Certainly, this is preset Stream media protocol can also be in the light of actual conditions modified.For example, the preset stream media protocol can also be WebRTC (Web Real-Time Communication, page real time communication) agreement.Subsequent, the user that unlatching voice connects wheat function can To listen to the voice messaging of other users in same live streaming group.At this point, the client of user can initiate number to voice server According to acquisition request.The user identifier of the user can be carried in the data acquisition request.In this way, voice server is receiving this After data acquisition request, user identifier wherein included can be identified.By the user identifier, voice server can be determined Live streaming group locating for the user identifier, then can by the live streaming group except the user identifier characterization voice flow in addition to other The voice flow of user is supplied to the client of the user.On the one hand it can enable the user to hear other in same live streaming group On the other hand the Instant audio messages of user also avoid the user from the voice messaging of uppick itself.In this way, user is from voice The voice messaging of other users can be used as group chat voice messaging in the same live streaming group obtained at server.The group chat voice Information can be obtained by the user in same live streaming group.
In the present embodiment, user carries out group in the live content of viewing direct broadcasting room and with the user of same live streaming group When merely, it can choose with some user in live streaming group or individually linked up with the other users outside live streaming group.At this point, with The client at family can initiate the private chat request for being directed toward target user to voice server, may include request in private chat request The user identifier of initiator and the user identifier of the target user.In this way, voice server is receiving private chat request Afterwards, the user identifier of both sides can be therefrom extracted, and based on the user identifier extracted, between initiator and target user Establish private chat channel.The private chat channel can be the channel for being used for transmission private chat voice messaging, the letter transmitted in the private chat channel Breath is independent of each other with the information in group chat channel.
S3: acquiring the private chat voice messaging of the initiator, and the private chat voice messaging of acquisition is uploaded to described Voice server, so that the voice server provides the private chat voice to the target user by the private chat channel Information.
In the present embodiment, after establishing private chat channel, in the client of two parties, private chat can occur The prompt information of state.As shown in figure 3, may include the control of group chat and private chat in the operation interface of user, when private chat frequency After road is established, private chat control can be activated.User can choose the private chat control of the activation in early operation interface.So when When private chat control is selected, the voice messaging of user's typing can be used as private chat voice messaging, rather than group chat voice messaging. Certainly, in practical applications, private chat control can mutually be bound with target user.In this way, as shown in figure 3, when user need with it is more When a user carries out private chat simultaneously, can occur multiple private chat controls, and each private chat control in the interface of the user In can show the mark (Zhang San, Li Si) of other side, consequently facilitating user distinguishes the object of private chat.Correspondingly, when it In one or more private chat control it is selected when, the private chat voice messaging of user will be sent to these selected private chat controls At the corresponding target user of part.
In the present embodiment, when the initiator of private chat request is under private chat state, the microphone of initiator can be with The private chat voice messaging of initiator is acquired in the way of in step S1, and the private chat voice messaging of acquisition is uploaded to institute State voice server.In the private chat voice messaging, the user identifier of the target user can be carried, so as to inform language Which user sound server, current private chat voice messaging should be sent to.
In the present embodiment, after voice server receives the private chat voice messaging, the mesh wherein carried can be identified The user identifier of user is marked, so as to the private chat channel for establishing the private chat voice messaging before, is sent to target use Family.Similarly, target user can reply private chat voice messaging to it after the private chat voice messaging of uppick initiator.This Sample, voice server can also receive the private chat voice messaging of target user's offer.
In one embodiment, the client of user, can be to private after acquiring the private chat voice messaging of user Merely voice messaging carries out some optimization processings, so that the private chat voice messaging for being uploaded to voice server has higher sound quality. Firstly, client can all remove the sound in private chat voice messaging in addition to voice, so as to reduce environmental noise pair The influence of voice.Specifically, client can identify the audio frequency characteristics in the private chat voice messaging.The audio frequency characteristics can wrap The audio frequency characteristics for characterizing voice are included, can also include the audio frequency characteristics for characterizing environmental noise.Typically, voice is past It is past to have fixed frequency separation.For example, male sound may be typically located between 64~523Hz, female's sound is usually located at 160~ Between 1200Hz.So, this corresponding relationship of voice and fixed frequency separation, can be used as standard voice feature.
In the present embodiment, it when the audio frequency characteristics for including in the private chat voice messaging of identification acquisition, can will be in The private chat voice messaging of time-domain is converted to frequency domain, and the voice messaging in frequency domain can be and carry out according to frequency Distribution, and each Frequency point can correspond to certain signal strength.At this point it is possible to be identified from the voice messaging of frequency domain Signal strength reaches the corresponding target frequency of information of specified intensity threshold value.The specified intensity threshold value can be set to human ear can The intensity of sound obviously heard.In this way, the voice messaging of frequency domain can according to the specified intensity threshold value, be divided into it is multiple from Scattered voice segments, the intensity of voice messaging reaches the specified intensity threshold value in these voice segments.Voice in these voice segments Information can have respective target frequency.These target frequencies can be as the audio for including in the private chat voice messaging Feature.It is then possible to calculate the frequency difference between target frequency frequency corresponding with standard voice feature.Specifically, The center frequency value of the frequency separation of male voice and female voice can be determined respectively.Then, it when calculating frequency difference, can first determine Current target frequency and which center frequency value are closer, it is then possible to calculate current target frequency and immediate frequency Frequency difference between rate central value.The frequency difference can be as between current audio frequency characteristics and standard voice feature Difference value.
In the present embodiment, if the difference value is more than or equal to specified threshold, then it represents that current audio frequency characteristics Differ larger with standard voice feature, current audio frequency characteristics are likely to be environmental noise.Therefore, in this case, may be used To remove the corresponding information of the audio frequency characteristics from the private chat voice messaging, to filter in the private chat voice messaging Component environment noise.Wherein, above-mentioned difference value can refer to the absolute value being calculated.The specified threshold can be according to reality Border situation flexible setting.
In one embodiment, it is contemplated that after handling in a manner mentioned above private chat voice messaging, due to Environmental noise is eliminated, then there may be the mute of big section between voice adjacent in private chat voice messaging.From For the auditory effect of human ear, that people can be allowed to generate is uncomfortable for big section mute, while people can also be allowed to generate the illusion of communication disruption.Mirror In this, can big section it is mute in be properly added the lower noise signal of some intensity, to eliminate above-mentioned problem.Specifically Ground can identify target language segment in the private chat voice messaging, and the intensity value of any information is equal in the target language segment Lower than specified intensity threshold value.Wherein, it is lower than the specified intensity threshold value, shows for the angle of human ear, in the target language segment Voice messaging can not be gone out by ear recognition, therefore, the target language segment be mute section.At this point it is possible to identify this mute section Lasting duration show the target voice if the duration of the target language segment is more than or equal to specified duration threshold value The duration of Duan Chixu is too long, at this point it is possible to add specified noise signal in the target language segment.The specified noise signal Can be sound of the wind, sound of sea wave etc. will not allow human ear to generate uncomfortable white noise (White Noise).
In one embodiment, private chat voice messaging is carried out to handle it according to above-mentioned removal environmental noise the step of Afterwards, it is more likely that the part signal in the initial position of normal voice and/or final position can be removed, so as to cause normal language Imperfect or normal voice the starting and/or termination of sound are excessively lofty.In consideration of it, can by the way of signal fitting, It is suitably the starting of voice and final position addition a part fitting information, to solve the problem above-mentioned.It specifically, can be with Initial position and the final position of voice are identified in the private chat voice messaging.Typically, there is language in voice messaging The place of sound, the waveform that raising and lowering can all occur in the intensity of information can by the identification to information strength in voice messaging To identify initial position and the final position of voice.At this point it is possible to according to the information waveform for the initial position identified and end The information waveform that stop bit is set generates corresponding voice fitting information.Voice fitting information and the information of corresponding position splice it Afterwards, continuous waveform can be formed.In this way, adding the voice to match respectively at the initial position and the final position It is fitted information, the starting of voice and termination can be enabled more smooth, lofty feeling will not be generated.
In one embodiment, in the private chat voice messaging of the microphone acquisition of user, there may be echo signal, In order to enhance the audio experience of user, the echo signal in the private chat voice messaging can be identified, and from the private chat voice The echo signal is removed in information.Specifically, convergence algorithm can be carried out to input signal by sef-adapting filter, made It obtains and matches by the shock response that sef-adapting filter obtains with true echo path, so that it is corresponding to obtain echo path The estimated value of echo signal.It is then possible to the private chat voice messaging be subtracted to the estimated value of the echo signal, thus from the private Merely echo signal is removed in voice messaging.
In one embodiment, user might have other people at one's side and speaking in typing private chat voice messaging, from And lead to the sound in the voice messaging of typing there are others.In order to avoid other people sound causes to do to the sound of user It disturbs, client is after collecting the private chat voice messaging of user, other people language that can will include in the private chat voice messaging Message breath removal.Specifically, present embodiment can remove other people voice messaging by the method for Application on Voiceprint Recognition.The use It family can be in advance in the client by a certain number of voice messagings of typing, so that client saves the vocal print of the user Feature.In this way, after client collects the private chat voice messaging of user, can identify the private chat language between net cast The vocal print feature for including in message breath, and the vocal print feature that will identify that is compared with the vocal print feature of the user. If the vocal print feature identified and the vocal print feature of the user are inconsistent, the vocal print feature that can be will identify that Corresponding information is removed from the private chat voice messaging.Above-mentioned vocal print feature can be and utilize special Application on Voiceprint Recognition group The sound wave spectrum that part obtains after analyzing voice messaging.The generation of human language be Body Languages maincenter and vocal organs it Between a complicated physiology physical process, tongue that people uses in speech, tooth, larynx, lung, nasal cavity is in terms of size and form Everyone is widely different, so the sound wave spectrum of different people is all variant, so that the vocal print feature between different user It can also be different.Therefore, it is possible to remove the voice messaging of other users by vocal print feature.
S5: the private chat language provided by the target user that the voice server is sent by the private chat channel is received Message breath, and receive the group chat language for the other users that same live streaming group is in the initiator that the voice server is sent Message breath.
In the present embodiment, voice server again may be by the private chat channel, the private that target user is provided Merely voice messaging is sent to the initiator of private chat request.In addition, voice server can also be by group chat channel, it will be with the hair The group chat voice messaging for playing other users of the side in same live streaming group is sent to the initiator together.In this way, in initiator Private chat voice messaging and group chat voice messaging can be locally provided simultaneously with.
S7: private chat voice messaging that the target user provides and the group chat voice messaging are synthesized into one voice flow Afterwards, the voice flow of synthesis is played.
In the present embodiment, due to initiator's local reception to two kinds of voice messagings, in order to listen to simultaneously this two Kind of voice messaging needs private chat voice messaging that the target user provides and the group chat voice messaging synthesizing one language Sound stream, and the voice flow after synthesis is played by loudspeaker.In this way, both believing comprising group chat voice in voice flow in post synthesis Breath, and include private chat voice messaging, in addition, in practical applications, the live content of direct broadcasting room can also be added in voice flow Voice messaging so that user when carrying out private chat, will not miss other useful informations in direct broadcasting room.
In one embodiment, due to participating in the voice flow that the user of private chat plays including a large amount of voice messaging, In order to guarantee that user can not hear private chat voice messaging, the client of user can automatically believe the group chat voice in voice flow The volume of breath is adjusted.Specifically, client can identify the volume of the private chat voice messaging, and according to the institute identified Volume is stated, the volume of the group chat voice messaging is adjusted.Wherein, private chat voice messaging and group chat voice messaging initially all may be used To be played out according to preset volume, at this point, referring to if the volume of the private chat voice messaging identified is more than or equal to Determine volume threshold, shows that the user for participating in private chat at this time is illustrating an important content.At this point, in order to not hear the user Private chat voice messaging, client can be automatically by the volume adjustment of the group chat voice messaging to lower first volume.So Afterwards, when the volume of the group chat voice messaging is in first volume, if the sound of the private chat voice messaging identified Amount is less than the specified volume threshold, then shows that the user for participating in private chat has completed the elaboration of thing, at this point it is possible to will be described The volume adjustment of group chat voice messaging extremely second volume higher than the first above-mentioned volume.For example, second volume can be it Volume when preceding group chat voice messaging normal play.Above-mentioned specified volume threshold, can be the sound than people when normally speaking The more slightly lower volume value of magnitude.In this way, can suitably turn down group chat voice messaging when thering is user to speak in private chat channel Volume, to guarantee that the private chat voice messaging of user in private chat channel can not heard.In the sound according to private chat voice messaging Amount, after automatically adjusting to the volume of group chat voice messaging, can will the private chat voice messaging and adjust volume after Group chat voice messaging merge into a track, and the information after track is merged passes through loudspeaker as the voice flow after synthesis It plays.
Referring to Fig. 4, the application also provides a kind of client, the client includes:
Private chat request initiating cell, the private chat request for initiating to be directed toward target user to voice server, so that institute It states voice server and establishes private chat channel between the initiator and the target user that the private chat is requested;
Private chat voice collecting unit, for acquiring the private chat voice messaging of the initiator, and by the private chat of acquisition Voice messaging is uploaded to the voice server, so that the voice server is used by the private chat channel to the target Family provides the private chat voice messaging;
Voice messaging receiving unit, for receive the voice server by the private chat channel send by the mesh The private chat voice messaging that user provides is marked, and receive that the voice server sends is in same live streaming group with the initiator Other users group chat voice messaging;
Voice flow synthesis unit, private chat voice messaging and the group chat voice messaging for providing the target user After synthesizing one voice flow, the voice flow of synthesis is played.
In one embodiment, the client further include:
Difference value determination unit, for identification audio frequency characteristics in the private chat voice messaging, and the determining audio spy Difference value between sign and standard voice feature;
Voice messaging removal unit, if being more than or equal to specified threshold for the difference value, by the audio frequency characteristics Corresponding information is removed from the private chat voice messaging.
In one embodiment, the client further include:
Vocal print feature recognition unit, the vocal print feature for including in the private chat voice messaging for identification, and will identify that The vocal print feature be compared with the vocal print feature of the initiator;
Voiceprint removal unit, if the vocal print feature for identifying and the vocal print feature of the initiator are different It causes, the corresponding information of the vocal print feature that will identify that is removed from the private chat voice messaging.
In one embodiment, the voice flow synthesis unit includes:
Group chat speech volume adjustment module, the volume of the private chat voice messaging for identification, and according to the institute identified Volume is stated, the volume of the group chat voice messaging is adjusted;
Track merging module, for merging into the group chat voice messaging after the private chat voice messaging and adjusting volume One track, and the information after track is merged is as the voice flow after synthesis.
Referring to Fig. 5, the application also provides a kind of client, the client includes memory and processor, described to deposit Reservoir when the computer program is executed by the processor, realizes the language of above-mentioned direct broadcasting room for storing computer program Sound private chat method.
In the present embodiment, the memory may include the physical unit for storing information, usually by information It is stored again with the media using the methods of electricity, magnetic or optics after digitlization.Memory described in present embodiment again may be used To include: to store the device of information, such as RAM, ROM in the way of electric energy;The device of information is stored in the way of magnetic energy, it is such as hard Disk, floppy disk, tape, core memory, magnetic bubble memory, USB flash disk;Using the device of optical mode storage information, such as CD or DVD. Certainly, there are also memories of other modes, such as quantum memory, graphene memory etc..
In the present embodiment, the processor can be implemented in any suitable manner.For example, the processor can be with Take such as microprocessor or processor and storage can by (micro-) processor execute computer readable program code (such as Software or firmware) computer-readable medium, logic gate, switch, specific integrated circuit (Application Specific Integrated Circuit, ASIC), programmable logic controller (PLC) and the form etc. for being embedded in microcontroller.
The concrete function that the device that this specification embodiment provides, memory and processor are realized, can be with this theory Aforementioned embodiments in bright book contrast explanation, and can reach the technical effect of aforementioned embodiments, just no longer superfluous here It states.
Therefore technical solution provided by the present application, when user direct broadcasting room watch be broadcast live when, if necessary to and target User is individually linked up, then can initiate to request for the private chat of the target user to voice server.Voice server After receiving private chat request, private chat channel can be established between the user and target user.The private chat channel can be used for Transmit the private chat voice messaging between the two users.Meanwhile voice server can also receive each use in same live streaming group The group chat voice messaging at family, the group chat voice messaging and private chat voice messaging be received and dispatched by different channels, therefore that This will not influence each other, to guarantee the privacy of private chat voice messaging.The initiator of private chat request can pass through local record Sound equipment acquires private chat voice messaging, and uploads the private chat voice messaging to voice server, so that voice server is logical The private chat channel established before crossing sends the private chat voice messaging to target user.Similarly, target user can also be to the hair It plays side and sends private chat voice messaging.Initiator can receive voice after receiving the private chat voice messaging of target user simultaneously Server mentions the group chat voice messaging of other users in the same live streaming group sent.The client of initiator can will receive Private chat voice messaging and group chat voice messaging are synthesized, to export both voice messagings by the same loudspeaker.This Sample, user is when carrying out the communication of independent voice with other users, additionally it is possible to hear other voice messagings in direct broadcasting room, not only not Will affect viewing live content, the complexity of user's operation can also be simplified by way of voice-enabled chat, thus improve with The convenience that other users are linked up.
In the 1990s, the improvement of a technology can be distinguished clearly be on hardware improvement (for example, Improvement to circuit structures such as diode, transistor, switches) or software on improvement (improvement for method flow).So And with the development of technology, the improvement of current many method flows can be considered as directly improving for hardware circuit. Designer nearly all obtains corresponding hardware circuit by the way that improved method flow to be programmed into hardware circuit.Cause This, it cannot be said that the improvement of a method flow cannot be realized with hardware entities module.For example, programmable logic device (Programmable Logic Device, PLD) (such as field programmable gate array (Field Programmable Gate Array, FPGA)) it is exactly such a integrated circuit, logic function determines device programming by user.By designer Voluntarily programming comes a digital display circuit " integrated " on a piece of PLD, designs and makes without asking chip maker Dedicated IC chip.Moreover, nowadays, substitution manually makes IC chip, this programming is also used instead mostly " is patrolled Volume compiler (logic compiler) " software realizes that software compiler used is similar when it writes with program development, And the source code before compiling also write by handy specific programming language, this is referred to as hardware description language (Hardware Description Language, HDL), and HDL is also not only a kind of, but there are many kind, such as ABEL (Advanced Boolean Expression Language)、AHDL(Altera Hardware Description Language)、Confluence、CUPL(Cornell University Programming Language)、HDCal、JHDL (Java Hardware Description Language)、Lava、Lola、MyHDL、PALASM、RHDL(Ruby Hardware Description Language) etc., VHDL (Very-High-Speed is most generally used at present Integrated Circuit Hardware Description Language) and Verilog.Those skilled in the art also answer This understands, it is only necessary to method flow slightly programming in logic and is programmed into integrated circuit with above-mentioned several hardware description languages, The hardware circuit for realizing the logical method process can be readily available.
It is also known in the art that other than realizing server in a manner of pure computer readable program code, it is complete Entirely can by by method and step carry out programming in logic come so that server with logic gate, switch, specific integrated circuit, programmable Logic controller realizes identical function with the form for being embedded in microcontroller etc..Therefore this server is considered one kind Hardware component, and the structure that the unit for realizing various functions for including in it can also be considered as in hardware component.Or Even, can will be considered as realizing the unit of various functions either the software module of implementation method can be Hardware Subdivision again Structure in part.
As seen through the above description of the embodiments, those skilled in the art can be understood that the application can It realizes by means of software and necessary general hardware platform.Based on this understanding, the technical solution essence of the application On in other words the part that contributes to existing technology can be embodied in the form of software products, the computer software product It can store in storage medium, such as ROM/RAM, magnetic disk, CD, including some instructions are used so that a computer equipment (can be personal computer, server or the network equipment etc.) executes each embodiment of the application or embodiment Method described in certain parts.
Each embodiment in this specification is described in a progressive manner, same and similar between each embodiment Part may refer to each other, what each embodiment stressed is the difference with other embodiments.In particular, needle For the embodiment of client, the introduction control for being referred to the embodiment of preceding method is explained.
The application can describe in the general context of computer-executable instructions executed by a computer, such as program Module.Generally, program module includes routines performing specific tasks or implementing specific abstract data types, programs, objects, group Part, data structure etc..The application can also be practiced in a distributed computing environment, in these distributed computing environments, by Task is executed by the connected remote processing devices of communication network.In a distributed computing environment, program module can be with In the local and remote computer storage media including storage equipment.
Although depicting the application by embodiment, it will be appreciated by the skilled addressee that there are many deformations by the application With variation without departing from spirit herein, it is desirable to which the attached claims include these deformations and change without departing from the application Spirit.

Claims (14)

1. a kind of voice private chat method of direct broadcasting room, which is characterized in that the described method includes:
The private chat request for being directed toward target user is initiated to voice server, so that the voice server is requested in the private chat Initiator and the target user between establish private chat channel;
The private chat voice messaging of the initiator is acquired, and the private chat voice messaging of acquisition is uploaded to the voice service Device, so that the voice server provides the private chat voice messaging to the target user by the private chat channel;
The private chat voice messaging provided by the target user that the voice server is sent by the private chat channel is provided, And receive the group chat voice messaging for the other users that same live streaming group is in the initiator that the voice server is sent;
After private chat voice messaging that the target user provides and the group chat voice messaging are synthesized one voice flow, play The voice flow of synthesis.
2. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator, The method also includes:
It identifies the audio frequency characteristics in the private chat voice messaging, and determines the difference between the audio frequency characteristics and standard voice feature Different value;
If the difference value is more than or equal to specified threshold, the corresponding information of the audio frequency characteristics is believed from the private chat voice It is removed in breath.
3. according to the method described in claim 2, it is characterized in that, identify the audio frequency characteristics in the private chat voice messaging, and Determine that the difference value between the audio frequency characteristics and standard voice feature includes:
The private chat voice messaging in time-domain is converted to frequency domain, and identifies letter from the voice messaging of frequency domain Number intensity reaches the corresponding target frequency of information of specified intensity threshold value, and the target frequency that will identify that is as the private The audio frequency characteristics for merely including in voice messaging;
The frequency difference between the target frequency and standard voice frequency is calculated, and using the frequency difference as the audio Difference value between feature and standard voice feature.
4. according to the method described in claim 2, it is characterized in that, by the corresponding information of the audio frequency characteristics from the private chat After being removed in voice messaging, the method also includes:
Target language segment is identified in the private chat voice messaging, the intensity value of any information is below in the target language segment Specified intensity threshold value;
If the duration of the target language segment is more than or equal to specified duration threshold value, added in the target language segment specified Noise signal.
5. according to the method described in claim 2, it is characterized in that, by the corresponding information of the audio frequency characteristics from the private chat After being removed in voice messaging, the method also includes:
Identify initial position and the final position of voice in the private chat voice messaging, and in the initial position and described The voice fitting information to match is added at final position respectively.
6. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator, The method also includes:
It identifies the echo signal in the private chat voice messaging, and removes the echo signal from the private chat voice messaging It removes.
7. the method according to claim 1, wherein after the private chat voice messaging for acquiring the initiator, The method also includes:
The vocal print feature that identifies the vocal print feature for including in the private chat voice messaging, and will identify that and the initiator Vocal print feature be compared;
If the vocal print feature identified and the vocal print feature of the initiator are inconsistent, the vocal print feature that will identify that Corresponding information is removed from the private chat voice messaging.
8. the method according to claim 1, wherein the private chat voice messaging that the target user is provided and institute It states group chat voice messaging and synthesizes one voice flow and include:
It identifies the volume of the private chat voice messaging, and according to the volume identified, adjusts the group chat voice messaging Volume;
Group chat voice messaging after the private chat voice messaging and adjusting volume is merged into a track, and track is merged Information afterwards is as the voice flow after synthesis.
9. according to the method described in claim 8, it is characterized in that, adjusting the group chat language according to the volume identified Message breath volume include:
If the volume of the private chat voice messaging identified is more than or equal to specified volume threshold, the group chat voice is believed The volume adjustment of breath is to the first volume;
When the volume of the group chat voice messaging is in first volume, if the sound of the private chat voice messaging identified Amount is less than the specified volume threshold, by the volume adjustment of the group chat voice messaging to the second volume;Wherein, first sound Amount is less than second volume.
10. a kind of client, which is characterized in that the client includes:
Private chat request initiating cell, the private chat request for initiating to be directed toward target user to voice server, so that institute's predicate Sound server establishes private chat channel between the initiator that the private chat is requested and the target user;
Private chat voice collecting unit, for acquiring the private chat voice messaging of the initiator, and by the private chat voice of acquisition Information is uploaded to the voice server, so that the voice server is mentioned by the private chat channel to the target user For the private chat voice messaging;
Voice messaging receiving unit is used by what the private chat channel was sent by the target for receiving the voice server The private chat voice messaging that family provides, and receive that the voice server sends with the initiator be in same live streaming group its The group chat voice messaging of his user;
Voice flow synthesis unit, private chat voice messaging and group chat voice messaging synthesis for providing the target user After one voice flow, the voice flow of synthesis is played.
11. client according to claim 10, which is characterized in that the client further include:
Difference value determination unit, audio frequency characteristics in the private chat voice messaging for identification, and determine the audio frequency characteristics with Difference value between standard voice feature;
Voice messaging removal unit, it is if being more than or equal to specified threshold for the difference value, the audio frequency characteristics are corresponding Information removed from the private chat voice messaging.
12. client according to claim 10, which is characterized in that the client further include:
Vocal print feature recognition unit, the vocal print feature for including in the private chat voice messaging for identification, and the institute that will identify that Vocal print feature is stated to be compared with the vocal print feature of the initiator;
Voiceprint removal unit, if the vocal print feature for identifying and the vocal print feature of the initiator are inconsistent, The corresponding information of the vocal print feature that will identify that is removed from the private chat voice messaging.
13. client according to claim 10, which is characterized in that the voice flow synthesis unit includes:
Group chat speech volume adjustment module, the volume of the private chat voice messaging for identification, and according to the sound identified Amount, adjusts the volume of the group chat voice messaging;
Track merging module, for the group chat voice messaging after the private chat voice messaging and adjusting volume to be merged into one Track, and the information after track is merged is as the voice flow after synthesis.
14. a kind of client, which is characterized in that the client includes processor and memory, and the memory is for storing Computer program when the computer program is executed by the processor, is realized such as any claim in claim 1 to 9 The method.
CN201811031975.5A 2018-09-05 2018-09-05 A kind of the voice private chat method and client of direct broadcasting room Pending CN109120947A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811031975.5A CN109120947A (en) 2018-09-05 2018-09-05 A kind of the voice private chat method and client of direct broadcasting room

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811031975.5A CN109120947A (en) 2018-09-05 2018-09-05 A kind of the voice private chat method and client of direct broadcasting room

Publications (1)

Publication Number Publication Date
CN109120947A true CN109120947A (en) 2019-01-01

Family

ID=64858526

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811031975.5A Pending CN109120947A (en) 2018-09-05 2018-09-05 A kind of the voice private chat method and client of direct broadcasting room

Country Status (1)

Country Link
CN (1) CN109120947A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110996180A (en) * 2019-12-17 2020-04-10 李昱颉 Network live broadcast chatting method, system and server
US10791224B1 (en) 2019-08-20 2020-09-29 Motorola Solutions, Inc. Chat call within group call
CN112968826A (en) * 2020-02-05 2021-06-15 北京字节跳动网络技术有限公司 Voice interaction method and device and electronic equipment
CN113542783A (en) * 2021-07-13 2021-10-22 北京字节跳动网络技术有限公司 Audio processing method, live broadcast equipment and live broadcast system
CN114071177A (en) * 2021-11-16 2022-02-18 网易(杭州)网络有限公司 Virtual gift sending method and device and terminal equipment
CN115002553A (en) * 2022-04-29 2022-09-02 当趣网络科技(杭州)有限公司 Method and system for chatting while watching based on same movie and television video

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101841422A (en) * 2010-05-07 2010-09-22 无锡中星微电子有限公司 Method and system for realizing private chat in voice conference
US20110154044A1 (en) * 2009-12-18 2011-06-23 Compugroup Holding Ag Computer implemented method for sending a message to a recipient user, receiving a message by a recipient user, a computer readable storage medium and a computer system
CN103236263A (en) * 2013-03-27 2013-08-07 东莞宇龙通信科技有限公司 Method, system and mobile terminal for improving communicating quality
CN103365538A (en) * 2013-04-08 2013-10-23 广州华多网络科技有限公司 Instant communication control method and instant communication control device
CN103489448A (en) * 2013-09-03 2014-01-01 广州日滨科技发展有限公司 Processing method and system of voice data
CN104580763A (en) * 2013-10-23 2015-04-29 深圳市潮流网络技术有限公司 Method and device for realizing private chat in telephone conference
CN105323536A (en) * 2014-07-30 2016-02-10 三亚中兴软件有限责任公司 Attendee private chat method and device in television conference
CN106533924A (en) * 2016-12-19 2017-03-22 广州华多网络科技有限公司 Instant messaging method and device
CN106790043A (en) * 2016-12-17 2017-05-31 北京小米移动软件有限公司 The method and device of message is sent in live application
CN107066199A (en) * 2017-04-13 2017-08-18 网易(杭州)网络有限公司 The exchange method and device sent for message
CN107945815A (en) * 2017-11-27 2018-04-20 歌尔科技有限公司 Voice signal noise-reduction method and equipment
CN108462882A (en) * 2017-02-22 2018-08-28 杨绍辉 A kind of interactive approach of network direct broadcasting system

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110154044A1 (en) * 2009-12-18 2011-06-23 Compugroup Holding Ag Computer implemented method for sending a message to a recipient user, receiving a message by a recipient user, a computer readable storage medium and a computer system
CN101841422A (en) * 2010-05-07 2010-09-22 无锡中星微电子有限公司 Method and system for realizing private chat in voice conference
CN103236263A (en) * 2013-03-27 2013-08-07 东莞宇龙通信科技有限公司 Method, system and mobile terminal for improving communicating quality
CN103365538A (en) * 2013-04-08 2013-10-23 广州华多网络科技有限公司 Instant communication control method and instant communication control device
CN103489448A (en) * 2013-09-03 2014-01-01 广州日滨科技发展有限公司 Processing method and system of voice data
CN104580763A (en) * 2013-10-23 2015-04-29 深圳市潮流网络技术有限公司 Method and device for realizing private chat in telephone conference
CN105323536A (en) * 2014-07-30 2016-02-10 三亚中兴软件有限责任公司 Attendee private chat method and device in television conference
CN106790043A (en) * 2016-12-17 2017-05-31 北京小米移动软件有限公司 The method and device of message is sent in live application
CN106533924A (en) * 2016-12-19 2017-03-22 广州华多网络科技有限公司 Instant messaging method and device
CN108462882A (en) * 2017-02-22 2018-08-28 杨绍辉 A kind of interactive approach of network direct broadcasting system
CN107066199A (en) * 2017-04-13 2017-08-18 网易(杭州)网络有限公司 The exchange method and device sent for message
CN107945815A (en) * 2017-11-27 2018-04-20 歌尔科技有限公司 Voice signal noise-reduction method and equipment

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10791224B1 (en) 2019-08-20 2020-09-29 Motorola Solutions, Inc. Chat call within group call
CN110996180A (en) * 2019-12-17 2020-04-10 李昱颉 Network live broadcast chatting method, system and server
CN110996180B (en) * 2019-12-17 2021-05-28 李昱颉 Network live broadcast chatting method, system and server
CN112968826A (en) * 2020-02-05 2021-06-15 北京字节跳动网络技术有限公司 Voice interaction method and device and electronic equipment
CN112968826B (en) * 2020-02-05 2023-08-08 北京字节跳动网络技术有限公司 Voice interaction method and device and electronic equipment
CN113542783A (en) * 2021-07-13 2021-10-22 北京字节跳动网络技术有限公司 Audio processing method, live broadcast equipment and live broadcast system
WO2023284436A1 (en) * 2021-07-13 2023-01-19 北京字节跳动网络技术有限公司 Audio processing method, live broadcast device, and live broadcast system
CN114071177A (en) * 2021-11-16 2022-02-18 网易(杭州)网络有限公司 Virtual gift sending method and device and terminal equipment
CN114071177B (en) * 2021-11-16 2023-09-26 网易(杭州)网络有限公司 Virtual gift sending method and device and terminal equipment
CN115002553A (en) * 2022-04-29 2022-09-02 当趣网络科技(杭州)有限公司 Method and system for chatting while watching based on same movie and television video

Similar Documents

Publication Publication Date Title
CN109120947A (en) A kind of the voice private chat method and client of direct broadcasting room
CN106162413B (en) The Headphone device of specific environment sound prompting mode
CN109005419A (en) A kind of processing method and client of voice messaging
EP3282669A2 (en) Private communications in virtual meetings
US9942673B2 (en) Method and arrangement for fitting a hearing system
CN109104616A (en) A kind of voice of direct broadcasting room connects wheat method and client
US8249233B2 (en) Apparatus and system for representation of voices of participants to a conference call
US10586131B2 (en) Multimedia conferencing system for determining participant engagement
CN109951743A (en) Barrage information processing method, system and computer equipment
TW201820315A (en) Improved audio headset device
US20210280191A1 (en) Lip language recognition method and mobile terminal
CN108965904A (en) A kind of volume adjusting method and client of direct broadcasting room
US11115444B2 (en) Private communications in virtual meetings
WO2019071808A1 (en) Video image display method, apparatus and system, terminal device, and storage medium
WO2023098332A1 (en) Audio processing method, apparatus and device, medium, and program product
US9967668B2 (en) Binaural recording system and earpiece set
US20200211540A1 (en) Context-based speech synthesis
WO2023029829A1 (en) Audio processing method and apparatus, user terminal, and computer readable medium
CN109451329A (en) Mixed audio processing method and device
TWI811692B (en) Method and apparatus and telephony system for acoustic scene conversion
US20160057527A1 (en) Binaural recording system and earpiece set
EP2216975A1 (en) Telecommunication device
US20220122630A1 (en) Real-time augmented hearing platform
CN112788489B (en) Control method and device and electronic equipment
CN111696566B (en) Voice processing method, device and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200512

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after: Alibaba (China) Co.,Ltd.

Address before: 100102 No. 4 Building, Wangjing Dongyuan District, Chaoyang District, Beijing

Applicant before: BEIJING YOUKU TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20190101

RJ01 Rejection of invention patent application after publication