CN108763288A - Barrage interception method and related device - Google Patents

Barrage interception method and related device

Info

Publication number
CN108763288A
Authority
CN
China
Prior art keywords
barrage information
barrage
gram
similarity
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810333585.7A
Other languages
Chinese (zh)
Inventor
刘兵
陈少杰
张文明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Douyu Network Technology Co Ltd
Original Assignee
Wuhan Douyu Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Douyu Network Technology Co Ltd
Priority to CN201810333585.7A
Publication of CN108763288A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45 Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/454 Content or additional data filtering, e.g. blocking advertisements
    • H04N21/4545 Input to filtering algorithms, e.g. filtering a region of the image
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/488 Data services, e.g. news ticker
    • H04N21/4884 Data services, e.g. news ticker for displaying subtitles

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiments of the present application disclose a barrage interception method and a related device, used to promptly and effectively intercept a class of highly similar barrages sent within a short time, thereby maintaining the barrage environment of a live streaming platform. The method of the embodiments includes: receiving first barrage information; computing the N-Gram set of the first barrage information, where N is an integer greater than 1; computing a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; and, if the similarity is greater than a first preset value, intercepting the first barrage information.

Description

Barrage interception method and related device
Technical field
This application relates to the field of barrage interception, and in particular to a barrage interception method and a related device.
Background
With the development of the Internet, people's online entertainment has become increasingly rich, and live video streaming has been widely adopted. People can watch live video over the Internet anytime and anywhere on a variety of clients such as computers and mobile phones. At present the vast majority of live streaming websites use barrages, which greatly strengthen the interaction between viewers and streamers and among viewers.
As the live streaming industry has developed, the means of intercepting barrages have become richer, but the forms in which users send spam barrages keep changing as well. People who send barrages for profit need to expand their influence, so within a short time they send a large number of barrages whose core content is identical but whose surface text differs slightly (the small differences are introduced to evade certain content interception models). These spam barrages seriously degrade the experience of other users watching the live stream.
Summary of the invention
The embodiments of the present application provide a barrage interception method and a related device, used to promptly and effectively intercept a class of highly similar barrages sent within a short time, thereby maintaining the barrage environment of a live streaming platform.
A first aspect of the embodiments of the present application provides a barrage interception method, which specifically includes: receiving first barrage information; computing the N-Gram set of the first barrage information, where N is an integer greater than 1; computing a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; and, if the similarity is greater than a first preset value, intercepting the first barrage information.
In a possible design, in a first implementation of the first aspect of the embodiments of the present application, computing the similarity from the N-Gram set of the first barrage information and the N-Gram set of the second barrage information includes:
computing the similarity according to the following formula:
$$\mathrm{SIM}(S,T)=\frac{2\,\mathrm{NUM}\left(G_N(S)\cap G_N(T)\right)}{\mathrm{NUM}\left(G_N(S)\right)+\mathrm{NUM}\left(G_N(T)\right)}$$
where SIM(S, T) is the similarity; G_N(S) is the N-Gram set of the first barrage information; G_N(T) is the N-Gram set of the second barrage information; NUM(G_N(S)) is the number of elements in the N-Gram set of the first barrage information; NUM(G_N(T)) is the number of elements in the N-Gram set of the second barrage information; and NUM(G_N(S) ∩ G_N(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
In a possible design, in a second implementation of the first aspect of the embodiments of the present application, before intercepting the first barrage information, the method further includes: obtaining the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value.
In a possible design, in a third implementation of the first aspect of the embodiments of the present application, intercepting the first barrage information if the similarity is greater than the first preset value includes: if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value, intercepting the first barrage information.
A second aspect of the embodiments of the present application provides a barrage interception device, which specifically includes: a receiving unit, configured to receive first barrage information; a first computing unit, configured to compute the N-Gram set of the first barrage information, where N is an integer greater than 1; a second computing unit, configured to compute a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; and an interception unit, configured to intercept the first barrage information when the similarity is greater than a first preset value.
In a possible design, in a first implementation of the second aspect of the embodiments of the present application, the second computing unit is further specifically configured to:
compute the similarity according to the following formula:
$$\mathrm{SIM}(S,T)=\frac{2\,\mathrm{NUM}\left(G_N(S)\cap G_N(T)\right)}{\mathrm{NUM}\left(G_N(S)\right)+\mathrm{NUM}\left(G_N(T)\right)}$$
where SIM(S, T) is the similarity; G_N(S) is the N-Gram set of the first barrage information; G_N(T) is the N-Gram set of the second barrage information; NUM(G_N(S)) is the number of elements in the N-Gram set of the first barrage information; NUM(G_N(T)) is the number of elements in the N-Gram set of the second barrage information; and NUM(G_N(S) ∩ G_N(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
In a possible design, in a second implementation of the second aspect of the embodiments of the present application, the device further includes: an acquiring unit, configured to obtain the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value.
In a possible design, in a third implementation of the second aspect of the embodiments of the present application, the interception unit is specifically configured to: intercept the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value.
Another aspect of the present application provides a computer-readable storage medium storing instructions which, when run on a computer, cause the computer to perform the methods described in the above aspects.
Another aspect of the present application provides a computer program product containing instructions which, when run on a computer, cause the computer to perform the methods described in the above aspects.
As can be seen from the above technical solutions, the embodiments of the present application have the following advantages. The barrage interception device receives first barrage information; it then computes the N-Gram set of the first barrage information, where N is an integer greater than 1; it computes a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; and, if the similarity is greater than a first preset value, it intercepts the first barrage information. The embodiments of the present application can thus promptly and effectively intercept a class of highly similar barrages sent within a short time, maintaining the barrage environment of a live streaming platform.
Brief description of the drawings
Fig. 1 is a schematic flowchart of a barrage interception method provided in an embodiment of the present invention;
Fig. 2 is another schematic flowchart of a barrage interception method provided in an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a barrage interception device provided in an embodiment of the present invention;
Fig. 4 is another schematic structural diagram of a barrage interception device provided in an embodiment of the present invention;
Fig. 5 is a schematic diagram of the hardware structure of a barrage interception device in an embodiment of the present invention;
Fig. 6 is another schematic diagram of the hardware structure of a barrage interception device in an embodiment of the present invention.
Detailed description
The embodiments of the present application provide a barrage interception method and a related device, used to promptly and effectively intercept a class of highly similar barrages sent within a short time, thereby maintaining the barrage environment of a live streaming platform.
The terms "first", "second", "third", "fourth" and the like (if present) in the description, the claims, and the above drawings of the present application are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments described here can be implemented in an order other than the one illustrated or described here. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion; for example, a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
The barrage interception method in the embodiments of the present application computes the similarity between barrages based on N-gram, and can promptly and effectively intercept a class of highly similar barrages sent within a short time, maintaining the barrage environment of a live streaming platform.
It should be noted that the barrage interception method in the embodiments of the present application can be applied to the field of live video streaming and can also be applied to other fields; the specific field is not limited here, and the embodiments of the present application are described using live video streaming as an example.
Some terms used in the embodiments of the present application are explained below:
N-gram: N-gram is a language model commonly used in large-vocabulary continuous speech recognition; for Chinese it is called the Chinese language model (CLM). Using the collocation information between adjacent words in context, when continuous pinyin without spaces, strokes, or digits representing letters or strokes must be converted into a Chinese character string (i.e. a sentence), the Chinese language model can compute the sentence with the maximum probability. Conversion to Chinese characters is therefore automatic and the user does not need to choose manually, which avoids the duplicate-code problem in which many Chinese characters correspond to the same pinyin (or stroke string or digit string). The model is based on the assumption that the occurrence of the N-th word depends only on the preceding N-1 words and is independent of any other word, so that the probability of a whole sentence is the product of the occurrence probabilities of its words. These probabilities can be obtained by directly counting, in a corpus, the number of times N words occur together. The most commonly used are the binary Bi-Gram and the ternary Tri-Gram.
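The assumption stated above can be written compactly. The following is the standard textbook formulation rather than anything given in the application; the symbols w_i (the i-th word), m (the sentence length), and C(·) (a count of occurrences in the corpus) are notation introduced here for illustration:

```latex
P(w_1,\dots,w_m) \approx \prod_{i=1}^{m} P\left(w_i \mid w_{i-N+1},\dots,w_{i-1}\right),
\qquad
P\left(w_i \mid w_{i-N+1},\dots,w_{i-1}\right) \approx
\frac{C\left(w_{i-N+1},\dots,w_i\right)}{C\left(w_{i-N+1},\dots,w_{i-1}\right)}
```

For N = 2 (Bi-Gram) each word is conditioned on the single preceding word; for N = 3 (Tri-Gram) it is conditioned on the two preceding words.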
Similarity: the degree to which the text content of two barrages is alike.
Barrage: a comment posted by a user while watching a live stream or video, which scrolls across the screen over the video.
Referring to Fig. 1, Fig. 1 is a schematic flowchart of a barrage interception method provided by the embodiments of the present application, which specifically includes:
101. The barrage interception device receives first barrage information.
In this embodiment, when a user watching a live video stream wants to send a barrage, the barrage is not displayed on the screen directly; it is first processed by the barrage interception device. The barrage interception device receives the first barrage information sent by the user, for example first barrage information whose text content is S = "plating cool one frightens (history breast ice story) someone driving".
102. The barrage interception device computes the similarity between the first barrage information and second barrage information.
In this embodiment, after receiving the first barrage information, the barrage interception device computes the similarity between the first barrage information and the second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration.
Computing the similarity between the first barrage information and the second barrage information includes:
computing the N-Gram set of the first barrage information, where N is an integer greater than 1;
computing the similarity from the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
Computing the similarity from the N-Gram set of the first barrage information and the N-Gram set of the second barrage information includes:
computing the similarity according to the following formula:
$$\mathrm{SIM}(S,T)=\frac{2\,\mathrm{NUM}\left(G_N(S)\cap G_N(T)\right)}{\mathrm{NUM}\left(G_N(S)\right)+\mathrm{NUM}\left(G_N(T)\right)}$$
where SIM(S, T) is the similarity;
G_N(S) is the N-Gram set of the first barrage information;
G_N(T) is the N-Gram set of the second barrage information;
NUM(G_N(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(G_N(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(G_N(S) ∩ G_N(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
Suppose the text content of the first barrage information is S = "plating cool one frightens (history breast ice story) someone driving" and the text content of the second barrage information is T = "plating cool one frightens (history breast ice story) someone driving 6". The similarity is computed according to the above formula with N = 2. To avoid the case in which a barrage of length 1 cannot be processed, a "begin" marker and an "end" marker are added before and after the text, respectively. Each quoted element below is one two-character 2-Gram of the original text, as rendered in translation. Then:
G_N(S) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving", "vehicle is whole" };
G_N(T) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving", "vehicle 6", "6 eventually" };
G_N(S) ∩ G_N(T) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving" };
From the above: NUM(G_N(S)) = 16; NUM(G_N(T)) = 17; NUM(G_N(S) ∩ G_N(T)) = 15;
Therefore the similarity is SIM(S, T) = 2 × 15 / (16 + 17) = 30/33 ≈ 0.909.
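The worked example can be reproduced with a short Python sketch. It assumes the similarity is twice the overlap divided by the sum of the two set sizes, which is the form consistent with the counts 16, 17, 15 and the value 0.909 quoted in this embodiment. The function names, the marker characters, and the ASCII stand-in strings are illustrative choices, since the original-language barrage text is not available in this translation:

```python
BEGIN = "\u0002"  # hypothetical "begin" marker; any character absent from real text works
END = "\u0003"    # hypothetical "end" marker


def ngram_set(text, n=2):
    """Return the deduplicated set of character n-grams of the padded text."""
    padded = BEGIN + text + END
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(s, t, n=2):
    """SIM(S, T) = 2 * NUM(G(S) & G(T)) / (NUM(G(S)) + NUM(G(T)))."""
    gs, gt = ngram_set(s, n), ngram_set(t, n)
    total = len(gs) + len(gt)
    return 2 * len(gs & gt) / total if total else 0.0


# Stand-ins: S has 15 distinct characters, T is S with "6" appended,
# mirroring the shape of the example rather than its actual text.
S = "abcdefghijklmno"
T = S + "6"

print(len(ngram_set(S)), len(ngram_set(T)), len(ngram_set(S) & ngram_set(T)))  # 16 17 15
print(round(similarity(S, T), 3))  # 0.909
```

Because the markers are added before the n-grams are taken, a one-character barrage still yields two 2-Grams, which is the purpose of the padding described above.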
103. If the similarity is greater than the first preset value, the barrage interception device intercepts the first barrage information.
In this embodiment, when the similarity computed above is greater than the first preset value, the barrage interception device intercepts the first barrage information. For example, when the first preset value is 0.8, the computed value 0.909 is greater than 0.8, so the barrage interception device intercepts the first barrage information, and the text "plating cool one frightens (history breast ice story) someone driving" sent by the user is not displayed on the screen.
If the computed similarity is less than the first preset value, the barrage interception device does not intercept the first barrage information, and the first barrage information can be displayed among the barrages on the screen.
In the embodiments of the present application, the barrage interception device receives first barrage information and then computes the similarity between the first barrage information and second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; if the similarity is greater than the first preset value, the device intercepts the first barrage information. The embodiments of the present application can thus promptly and effectively intercept a class of highly similar barrages sent within a short time, maintaining the barrage environment of a live streaming platform.
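Steps 101 to 103 can be sketched as a small sliding-window filter. This is a minimal illustration under stated assumptions, not the applicant's implementation: the class name `WindowInterceptor`, the 30-second default window, the marker characters, and the choice to remember every received barrage (intercepted or not) are all assumptions, and timestamps are assumed to arrive in non-decreasing order.

```python
from collections import deque

BEGIN, END = "\u0002", "\u0003"  # hypothetical begin/end markers


def ngram_set(text, n=2):
    padded = BEGIN + text + END
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(gs, gt):
    total = len(gs) + len(gt)
    return 2 * len(gs & gt) / total if total else 0.0


class WindowInterceptor:
    """Intercepts a barrage that is too similar to one received within the window."""

    def __init__(self, window_seconds=30.0, threshold=0.8):
        self.window_seconds = window_seconds  # the "preset duration"
        self.threshold = threshold            # the "first preset value"
        self.recent = deque()                 # (timestamp, 2-Gram set), oldest first

    def should_intercept(self, text, timestamp):
        # Forget barrages whose publication time is no longer within the window.
        while self.recent and timestamp - self.recent[0][0] >= self.window_seconds:
            self.recent.popleft()
        grams = ngram_set(text)
        hit = any(similarity(grams, old) > self.threshold for _, old in self.recent)
        self.recent.append((timestamp, grams))
        return hit


interceptor = WindowInterceptor()
print(interceptor.should_intercept("abcdefghijklmno", 0.0))     # False: nothing to compare with
print(interceptor.should_intercept("abcdefghijklmno6", 5.0))    # True: 0.909 > 0.8
print(interceptor.should_intercept("abcdefghijklmno6", 100.0))  # False: earlier barrages expired
```

Storing the 2-Gram sets rather than the raw text means each earlier barrage is tokenized only once, however many later barrages it is compared with.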
Referring to Fig. 2, Fig. 2 is another schematic flowchart of a barrage interception method provided by the embodiments of the present application, which specifically includes:
201. The barrage interception device receives first barrage information.
In this embodiment, when a user watching a live video stream wants to send a barrage, the barrage is not displayed on the screen directly; it is first processed by the barrage interception device. The barrage interception device receives the first barrage information sent by the user, for example first barrage information whose text content is S = "plating cool one frightens (history breast ice story) someone driving".
202. The barrage interception device computes the similarity between the first barrage information and second barrage information.
In this embodiment, after receiving the first barrage information, the barrage interception device computes the similarity between the first barrage information and the second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration.
Computing the similarity between the first barrage information and the second barrage information includes:
computing the N-Gram set of the first barrage information, where N is an integer greater than 1;
computing the similarity from the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
Computing the similarity from the N-Gram set of the first barrage information and the N-Gram set of the second barrage information includes:
computing the similarity according to the following formula:
$$\mathrm{SIM}(S,T)=\frac{2\,\mathrm{NUM}\left(G_N(S)\cap G_N(T)\right)}{\mathrm{NUM}\left(G_N(S)\right)+\mathrm{NUM}\left(G_N(T)\right)}$$
where SIM(S, T) is the similarity;
G_N(S) is the N-Gram set of the first barrage information;
G_N(T) is the N-Gram set of the second barrage information;
NUM(G_N(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(G_N(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(G_N(S) ∩ G_N(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
Suppose the text content of the first barrage information is S = "plating cool one frightens (history breast ice story) someone driving" and the text content of the second barrage information is T = "plating cool one frightens (history breast ice story) someone driving 6". The similarity is computed according to the above formula with N = 2. To avoid the case in which a barrage of length 1 cannot be processed, a "begin" marker and an "end" marker are added before and after the text, respectively. Each quoted element below is one two-character 2-Gram of the original text, as rendered in translation. Then:
G_N(S) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving", "vehicle is whole" };
G_N(T) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving", "vehicle 6", "6 eventually" };
G_N(S) ∩ G_N(T) = { "plating of beginning", "plating is cool", "cool one", "one frightens", "frightening (", "(history", "history breast", "newborn ice", "ice event", "story", "thing)", ") have", "someone", "people opens", "driving" };
From the above: NUM(G_N(S)) = 16; NUM(G_N(T)) = 17; NUM(G_N(S) ∩ G_N(T)) = 15;
Therefore the similarity is SIM(S, T) = 2 × 15 / (16 + 17) = 30/33 ≈ 0.909.
203. The barrage interception device obtains the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value.
In this embodiment, after computing the similarity between the first barrage information and the second barrage information, the barrage interception device clusters the barrages using the similarity between them; that is, it obtains the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value, for example the number of pieces of barrage information whose similarity within 30 seconds is greater than 0.8.
204. If the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value, the barrage interception device intercepts the first barrage information.
In this embodiment, after clustering the barrage texts using the similarity between barrages, the barrage interception device intercepts barrage classes that are sent in large volume within a short time. For example, a barrage is intercepted when the number of barrages within 30 seconds whose similarity to it is greater than 0.8 exceeds 15.
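Steps 203 and 204 add a count condition on top of the similarity threshold. A minimal sketch follows, using the example values from this embodiment (30 seconds, 0.8, and 15); the function name `should_intercept`, the parameter names, the marker characters, and the convention that the caller passes in a shared deque of recent barrages are assumptions made for illustration.

```python
from collections import deque

BEGIN, END = "\u0002", "\u0003"  # hypothetical begin/end markers


def ngram_set(text, n=2):
    padded = BEGIN + text + END
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(gs, gt):
    total = len(gs) + len(gt)
    return 2 * len(gs & gt) / total if total else 0.0


def should_intercept(text, timestamp, recent,
                     window_seconds=30.0,  # preset duration
                     threshold=0.8,        # first preset value
                     max_similar=15):      # second preset value
    """Return True when more than max_similar recent barrages resemble text."""
    # Keep only barrages published within the window (timestamps non-decreasing).
    while recent and timestamp - recent[0][0] >= window_seconds:
        recent.popleft()
    grams = ngram_set(text)
    similar_count = sum(1 for _, old in recent if similarity(grams, old) > threshold)
    recent.append((timestamp, grams))
    return similar_count > max_similar


# Twenty near-identical barrages, one every half second.
recent = deque()
results = [should_intercept("abcdefghijklmno" + str(i % 10), i * 0.5, recent)
           for i in range(20)]
print(results.index(True))  # 16: the 17th barrage is the first with more than 15 similar ones
```

Compared with the single-threshold flow of Fig. 1, this lets an occasional coincidental repeat through and intercepts only once a similar class has built up within the window.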
It can be seen from this example that the contents of the two barrages are very similar and can be regarded as different expressions of the same barrage. In addition, because the deduplicated 2-Gram format is used, barrages that contain internal repetition, such as "teacher's wechat NG 8018 teacher's wechat NG 8018 teacher's wechat NG 8018", are also recognized well.
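The effect of deduplication on repeated text can be checked directly. In the sketch below, "NG8018" is a shortened stand-in for the repeated phrase in the example, and the function names and marker characters are illustrative assumptions:

```python
BEGIN, END = "\u0002", "\u0003"  # hypothetical begin/end markers


def ngram_set(text, n=2):
    padded = BEGIN + text + END
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(s, t):
    gs, gt = ngram_set(s), ngram_set(t)
    total = len(gs) + len(gt)
    return 2 * len(gs & gt) / total if total else 0.0


single = "NG8018"
tripled = single * 3  # "NG8018NG8018NG8018"

# Repetition adds only the 2-Gram that joins one copy to the next ("8N"),
# because every other 2-Gram is already in the set.
print(len(ngram_set(single)), len(ngram_set(tripled)))  # 7 8
print(round(similarity(single, tripled), 3))            # 0.933
```

Because the 2-Grams are held in a set, repeating a phrase any number of times adds only the junction 2-Gram, so the repeated barrage stays above a 0.8 threshold against the single copy.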
Computing the similarity between all barrages on the whole platform would be costly and would not meet the real-time requirement of barrage interception. In this solution, only the similarities between the barrages within a short time slice are computed, and the barrages are clustered on the basis of the resulting similarity matrix to identify the spam barrages among them. Moreover, this solution does not require barrages to be labeled manually, which avoids the difficulty that inconsistent labeling standards among different people would bring to model training.
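The application does not say which clustering algorithm is applied to the similarity matrix. The sketch below assumes the simplest option, single-link grouping: two barrages fall into the same class whenever their similarity exceeds the threshold, directly or through a chain of such pairs. The function names, the marker characters, and the sample texts are illustrative.

```python
BEGIN, END = "\u0002", "\u0003"  # hypothetical begin/end markers


def ngram_set(text, n=2):
    padded = BEGIN + text + END
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def similarity(gs, gt):
    total = len(gs) + len(gt)
    return 2 * len(gs & gt) / total if total else 0.0


def similarity_matrix(texts):
    """Pairwise similarities of all barrages in one time slice."""
    grams = [ngram_set(t) for t in texts]
    return [[similarity(a, b) for b in grams] for a in grams]


def cluster(matrix, threshold=0.8):
    """Group indices linked by similarity > threshold (assumed single-link rule)."""
    parent = list(range(len(matrix)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(matrix)):
        for j in range(i + 1, len(matrix)):
            if matrix[i][j] > threshold:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(len(matrix)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


slice_texts = ["abcdefghijklmno", "abcdefghijklmno6", "abcdefghijklmno7",
               "hello everyone", "nice stream!"]
print(cluster(similarity_matrix(slice_texts)))  # [[0, 1, 2], [3], [4]]
```

A class whose size exceeds the second preset value would then be treated as spam; the pairwise cost grows quadratically with the number of barrages, which is why the comparison is restricted to a short time slice.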
If the computed similarity is less than the first preset value, and/or the number of pieces of barrage information within the preset duration is not greater than the second preset value, the barrage interception device does not intercept the first barrage information, and the first barrage information can be displayed among the barrages on the screen.
In the embodiments of the present application, the barrage interception device receives first barrage information and then computes the similarity between the first barrage information and second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration. The barrage interception device obtains the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value; if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than the second preset value, the barrage interception device intercepts the first barrage information. The embodiments of the present application can thus promptly and effectively intercept a class of highly similar barrages sent within a short time, maintaining the barrage environment of a live streaming platform.
The embodiments of the present application are described above from the perspective of the barrage interception method; they are described below from the perspective of the barrage interception device.
Referring to Fig. 3, Fig. 3 is a schematic diagram of one embodiment of a barrage interception device provided in an embodiment of the present invention, which specifically includes:
a receiving unit 301, configured to receive first barrage information;
a first computing unit 302, configured to compute the N-Gram set of the first barrage information, where N is an integer greater than 1;
a second computing unit 303, configured to compute a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration;
an interception unit 304, configured to intercept the first barrage information when the similarity is greater than a first preset value.
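The division into units 301 to 304 can be mirrored in code as one method per unit. This is only a toy model of the structure in Fig. 3: the class and method names, the marker characters, and the default threshold of 0.8 are assumptions, and time-window handling is left out to keep the mapping to the four units visible.

```python
BEGIN, END = "\u0002", "\u0003"  # hypothetical begin/end markers


class BarrageInterceptionDevice:
    """Toy model of the device in Fig. 3, with one method per unit."""

    def __init__(self, first_preset_value=0.8, n=2):
        self.first_preset_value = first_preset_value
        self.n = n  # N must be an integer greater than 1

    def receive(self, raw):
        # Receiving unit 301: accept the first barrage information.
        return str(raw)

    def ngram_set(self, text):
        # First computing unit 302: deduplicated N-Gram set of the padded text.
        padded = BEGIN + text + END
        return {padded[i:i + self.n] for i in range(len(padded) - self.n + 1)}

    def similarity(self, first_set, second_set):
        # Second computing unit 303: 2 * overlap / (sum of the set sizes).
        total = len(first_set) + len(second_set)
        return 2 * len(first_set & second_set) / total if total else 0.0

    def intercept(self, sim):
        # Interception unit 304: intercept when above the first preset value.
        return sim > self.first_preset_value

    def process(self, first_raw, second_text):
        first = self.receive(first_raw)
        sim = self.similarity(self.ngram_set(first), self.ngram_set(second_text))
        return self.intercept(sim)


device = BarrageInterceptionDevice()
print(device.process("abcdefghijklmno6", "abcdefghijklmno"))  # True
print(device.process("hello everyone", "nice stream!"))       # False
```

Keeping the threshold check in its own method matches the separation between the second computing unit and the interception unit, so the interception rule can change without touching the similarity computation.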
Referring to Fig. 4, Fig. 4 is a schematic diagram of another embodiment of a barrage interception device provided in an embodiment of the present invention, which specifically includes:
a receiving unit 401, configured to receive first barrage information;
a first computing unit 402, configured to compute the N-Gram set of the first barrage information, where N is an integer greater than 1;
a second computing unit 403, configured to compute a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration;
The second computing unit 403 is further specifically configured to:
compute the similarity according to the following formula:
$$\mathrm{SIM}(S,T)=\frac{2\,\mathrm{NUM}\left(G_N(S)\cap G_N(T)\right)}{\mathrm{NUM}\left(G_N(S)\right)+\mathrm{NUM}\left(G_N(T)\right)}$$
where SIM(S, T) is the similarity;
G_N(S) is the N-Gram set of the first barrage information;
G_N(T) is the N-Gram set of the second barrage information;
NUM(G_N(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(G_N(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(G_N(S) ∩ G_N(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
an acquiring unit 404, configured to obtain the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value;
an interception unit 405, configured to intercept the first barrage information when the similarity is greater than the first preset value.
The interception unit 405 is specifically configured to:
intercept the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value.
In the embodiments of the present application, the receiving unit 401 receives first barrage information; the first computing unit 402 computes the N-Gram set of the first barrage information, where N is an integer greater than 1; the second computing unit 403 computes a similarity from the N-Gram set of the first barrage information and the N-Gram set of second barrage information, where the second barrage information is barrage information that has already been obtained, and the interval between the publication times of the second barrage information and the first barrage information is less than a preset duration; the acquiring unit 404 obtains the number of pieces of barrage information whose similarity within the preset duration is greater than the first preset value; and, if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than the second preset value, the interception unit 405 intercepts the first barrage information. The embodiments of the present application can thus promptly and effectively intercept a class of highly similar barrages sent within a short time, maintaining the barrage environment of a live streaming platform.
Referring to Fig. 5, Fig. 5 is a schematic diagram of an embodiment of the barrage interception device provided by an embodiment of the present invention.
As shown in Fig. 5, an embodiment of the present invention provides a barrage interception device including a memory 510, a processor 520, and a computer program 511 that is stored in the memory 510 and executable on the processor 520. When executing the computer program 511, the processor 520 implements the following steps: receiving the first barrage information; calculating the N-Gram set of the first barrage information, where N is an integer greater than 1; calculating the similarity according to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information, where the second barrage information is barrage information that has already been obtained and the interval between the publishing times of the second barrage information and the first barrage information is less than the preset duration; and, if the similarity is greater than the first preset value, intercepting the first barrage information.
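The step of calculating the N-Gram set can be illustrated as follows. This is a minimal sketch that assumes character-level N-Grams; the function name `ngram_set` is hypothetical and this passage does not state the granularity used by the patent.

```python
def ngram_set(text: str, n: int = 2) -> set:
    """Return the set of character N-Grams of `text`, with N greater than 1."""
    if n <= 1:
        raise ValueError("N must be an integer greater than 1")
    # Slide a window of length n over the text; the set drops repeated N-Grams.
    return {text[i:i + n] for i in range(len(text) - n + 1)}


print(sorted(ngram_set("abcd", 3)))    # ['abc', 'bcd']
print(sorted(ngram_set("666666", 2)))  # ['66']
```

Because the result is a set, a message made of one repeated character collapses to a single element, and a message shorter than N yields an empty set.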
Optionally, calculating the similarity according to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information includes:
calculating according to the following formula:
where SIM(S, T) is the similarity;
GN(S) is the N-Gram set of the first barrage information;
GN(T) is the N-Gram set of the second barrage information;
NUM(GN(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(GN(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(GN(S)∩GN(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
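As a worked illustration of these quantities, take N = 2, S = "hello" and T = "hallo". The three counts follow directly from the definitions above. The final line assumes a Jaccard-style combination of the counts, which is one plausible reading and is not confirmed by this text, since the formula is not reproduced here.

```latex
\begin{align*}
G_2(S) &= \{\text{he}, \text{el}, \text{ll}, \text{lo}\}, & \mathrm{NUM}(G_2(S)) &= 4,\\
G_2(T) &= \{\text{ha}, \text{al}, \text{ll}, \text{lo}\}, & \mathrm{NUM}(G_2(T)) &= 4,\\
G_2(S) \cap G_2(T) &= \{\text{ll}, \text{lo}\}, & \mathrm{NUM}(G_2(S) \cap G_2(T)) &= 2,
\end{align*}
\[
\mathrm{SIM}(S, T) = \frac{2}{4 + 4 - 2} = \frac{1}{3} \approx 0.33
\quad \text{(assumed Jaccard-style ratio)}
\]
```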
Optionally, before intercepting the first barrage information, the method further includes:
obtaining the number of pieces of barrage information whose similarity exceeds the first preset value within the preset duration.
Intercepting the first barrage information if the similarity is greater than the first preset value then includes:
intercepting the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than the second preset value.
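The two-threshold decision can be sketched as a stateless function. This is an interpretation rather than the patent's own code: it assumes the similarities of the first barrage information against each recent message have already been computed, and the names `should_intercept`, `first_preset` and `second_preset` are illustrative.

```python
def should_intercept(similarities, first_preset, second_preset):
    """Decide interception from per-message similarities within the preset duration."""
    # Count the recent messages whose similarity exceeds the first preset value.
    similar_count = sum(1 for sim in similarities if sim > first_preset)
    # Intercept only when that count also exceeds the second preset value.
    return similar_count > second_preset


print(should_intercept([0.9, 0.8, 0.1], first_preset=0.6, second_preset=1))  # True
print(should_intercept([0.9, 0.1], first_preset=0.6, second_preset=1))       # False
```

Requiring a count above the second preset value means a single coincidental match does not trigger interception; only a burst of similar messages does.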
In a specific implementation, when the processor 520 executes the computer program 511, any of the embodiments corresponding to Fig. 1 or Fig. 2 may be implemented.
Based on the method described in the embodiments of the present invention, those skilled in the art can understand the specific implementation of the barrage interception device of this embodiment and its variations, so how the device implements the method is not described in detail here. Any device that those skilled in the art use to implement the method in the embodiments of the present invention falls within the scope of protection of the present invention.
Referring to Fig. 6, Fig. 6 is a schematic diagram of an embodiment of a computer-readable storage medium provided by an embodiment of the present invention.
As shown in Fig. 6, this embodiment provides a computer-readable storage medium 600 storing a computer program 611. When executed by a processor, the computer program 611 implements the following steps: receiving the first barrage information; calculating the N-Gram set of the first barrage information, where N is an integer greater than 1; calculating the similarity according to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information, where the second barrage information is barrage information that has already been obtained and the interval between the publishing times of the second barrage information and the first barrage information is less than the preset duration; and, if the similarity is greater than the first preset value, intercepting the first barrage information.
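The time condition on the second barrage information (publish interval less than the preset duration) can be maintained with a time-ordered queue. The sketch below is an assumption about one way to do this; the function name `prune_expired` and the `(publish time, text)` tuple layout are hypothetical.

```python
from collections import deque


def prune_expired(history: deque, now: float, preset_duration: float) -> None:
    """Drop entries whose interval to `now` is not less than `preset_duration`.

    Assumes `history` holds (publish time, text) tuples ordered oldest first.
    """
    while history and now - history[0][0] >= preset_duration:
        history.popleft()


history = deque([(0.0, "a"), (4.0, "b"), (9.0, "c")])
prune_expired(history, now=12.0, preset_duration=5.0)
print(list(history))  # [(9.0, 'c')]
```

Whatever remains in the queue after pruning is the candidate set of second barrage information to compare against the newly received message.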
Optionally, calculating the similarity according to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information includes:
calculating according to the following formula:
where SIM(S, T) is the similarity;
GN(S) is the N-Gram set of the first barrage information;
GN(T) is the N-Gram set of the second barrage information;
NUM(GN(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(GN(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(GN(S)∩GN(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
Optionally, before intercepting the first barrage information, the method further includes:
obtaining the number of pieces of barrage information whose similarity exceeds the first preset value within the preset duration.
Intercepting the first barrage information if the similarity is greater than the first preset value then includes:
intercepting the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than the second preset value.
The above embodiments may be implemented wholly or partly in software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented wholly or partly in the form of a computer program product.
The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions according to the embodiments of the present invention are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable device. The computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another; for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wired means (such as coaxial cable, optical fiber, or digital subscriber line (DSL)) or wireless means (such as infrared, radio, or microwave). The computer-readable storage medium may be any usable medium that a computer can use for storage, or a data storage device such as a server or data center that integrates one or more usable media. The usable medium may be a magnetic medium (for example, a floppy disk, hard disk, or magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid-state disk (SSD)).
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, devices, and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, devices, and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in actual implementation. For instance, multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or of another form.
Units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the steps of the methods of the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The above embodiments are intended only to illustrate the technical solutions of the present application, not to limit them. Although the application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions recorded in the foregoing embodiments or make equivalent replacements of some of their technical features, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (10)

1. A barrage interception method, characterized by comprising:
receiving first barrage information;
calculating an N-Gram set of the first barrage information, where N is an integer greater than 1;
calculating a similarity according to the N-Gram set of the first barrage information and an N-Gram set of second barrage information, wherein the second barrage information is barrage information that has already been obtained, and the interval between the publishing times of the second barrage information and the first barrage information is less than a preset duration; and
if the similarity is greater than a first preset value, intercepting the first barrage information.
2. The method according to claim 1, characterized in that calculating the similarity according to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information comprises:
calculating according to the following formula:
where SIM(S, T) is the similarity;
GN(S) is the N-Gram set of the first barrage information;
GN(T) is the N-Gram set of the second barrage information;
NUM(GN(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(GN(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(GN(S)∩GN(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
3. The method according to claim 1, characterized in that, before intercepting the first barrage information, the method further comprises:
obtaining the number of pieces of barrage information whose similarity exceeds the first preset value within the preset duration.
4. The method according to any one of claims 1 to 3, characterized in that intercepting the first barrage information if the similarity is greater than the first preset value comprises:
intercepting the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value.
5. A barrage interception device, characterized by comprising:
a receiving unit, configured to receive first barrage information;
a first computing unit, configured to calculate an N-Gram set of the first barrage information, where N is an integer greater than 1;
a second computing unit, configured to calculate a similarity according to the N-Gram set of the first barrage information and an N-Gram set of second barrage information, wherein the second barrage information is barrage information that has already been obtained, and the interval between the publishing times of the second barrage information and the first barrage information is less than a preset duration; and
an interception unit, configured to intercept the first barrage information when the similarity is greater than a first preset value.
6. The device according to claim 5, characterized in that the second computing unit is specifically further configured to:
calculate according to the following formula:
where SIM(S, T) is the similarity;
GN(S) is the N-Gram set of the first barrage information;
GN(T) is the N-Gram set of the second barrage information;
NUM(GN(S)) is the number of elements in the N-Gram set of the first barrage information;
NUM(GN(T)) is the number of elements in the N-Gram set of the second barrage information;
NUM(GN(S)∩GN(T)) is the number of elements common to the N-Gram set of the first barrage information and the N-Gram set of the second barrage information.
7. The device according to claim 5, characterized in that the device further comprises:
an acquiring unit, configured to obtain the number of pieces of barrage information whose similarity exceeds the first preset value within the preset duration.
8. The device according to any one of claims 5 to 7, characterized in that the interception unit is specifically configured to:
intercept the first barrage information if the similarity is greater than the first preset value and the number of pieces of barrage information is greater than a second preset value.
9. A computer-readable storage medium comprising instructions which, when run on a computer, cause the computer to perform the method according to any one of claims 1 to 4.
10. A computer program product comprising instructions which, when run on a computer, cause the computer to perform the method according to any one of claims 1 to 4.
CN201810333585.7A 2018-04-13 2018-04-13 A kind of barrage hold-up interception method and its relevant device Pending CN108763288A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810333585.7A CN108763288A (en) 2018-04-13 2018-04-13 A kind of barrage hold-up interception method and its relevant device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810333585.7A CN108763288A (en) 2018-04-13 2018-04-13 A kind of barrage hold-up interception method and its relevant device

Publications (1)

Publication Number Publication Date
CN108763288A true CN108763288A (en) 2018-11-06

Family

ID=64010699

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810333585.7A Pending CN108763288A (en) 2018-04-13 2018-04-13 A kind of barrage hold-up interception method and its relevant device

Country Status (1)

Country Link
CN (1) CN108763288A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103218423A (en) * 2013-04-02 2013-07-24 中国科学院信息工程研究所 Data inquiry method and device
CN105357586A (en) * 2015-09-28 2016-02-24 北京奇艺世纪科技有限公司 Video bullet screen filtering method and device
CN105592331A (en) * 2015-12-16 2016-05-18 广州华多网络科技有限公司 Method for processing barrage messages, related equipment, and system
CN106341703A (en) * 2016-08-30 2017-01-18 乐视控股(北京)有限公司 Bullet screen processing method and device
CN106878823A (en) * 2016-12-29 2017-06-20 武汉斗鱼网络科技有限公司 It is a kind of to filter word barrage and be converted to the method and system of voice barrage

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110139134A (en) * 2019-05-10 2019-08-16 韶关市启之信息技术有限公司 A kind of personalization barrage intelligently pushing method and system
CN110139134B (en) * 2019-05-10 2021-12-10 青岛民航凯亚系统集成有限公司 Intelligent personalized bullet screen pushing method and system
CN115170372A (en) * 2022-09-06 2022-10-11 江西兴智教育科技有限公司 Interactive education platform system and method based on Internet

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181106
