CN102982041A - Method used for detecting burst information of interactive platform and device - Google Patents

Method used for detecting burst information of interactive platform and device Download PDF

Info

Publication number
CN102982041A
CN102982041A CN2011102627023A CN201110262702A CN102982041A CN 102982041 A CN102982041 A CN 102982041A CN 2011102627023 A CN2011102627023 A CN 2011102627023A CN 201110262702 A CN201110262702 A CN 201110262702A CN 102982041 A CN102982041 A CN 102982041A
Authority
CN
China
Prior art keywords
information
burst mode
news
outburst
publisher
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011102627023A
Other languages
Chinese (zh)
Other versions
CN102982041B (en
Inventor
李彦宏
舒迅
帅帅
尹佳
陈楚洁
周天
方勇
王波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201110262702.3A priority Critical patent/CN102982041B/en
Publication of CN102982041A publication Critical patent/CN102982041A/en
Application granted granted Critical
Publication of CN102982041B publication Critical patent/CN102982041B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention aims at providing a method used for detecting burst information of an interactive platform and a device. A detection device determines a burst mode by obtaining pieces of released information of the interactive platform, and further determines the bust information from the pieces of released information according to the burst mode. Compared with the prior art, the method used for detecting the burst information of the interactive platform and the device can detect whether a burst phenomena exists in the interactive platform timely and correctly so as to enable effective processing of the burst information to be possible, and therefore a user can effectively obtain information of the interactive platform, and the objects of interaction and communication are achieved.

Description

A kind of method and apparatus for detection of outburst information in the interaction platform
Technical field
The present invention relates to networking technology area, relate in particular to a kind of technology for detection of outburst information in the interaction platform.
Background technology
Along with the development of network technology, increasing user carries out the interchange of information by the network interdynamic platform, and then has reached the purpose of message fast propagation, but a kind of phenomenon that breaks out information in the network interdynamic platform also occurs thereupon.The phenomenon of this information outburst can't normally make a speech other normal users by in an organized way, constantly repeatedly send same or similar meaningless content within the short time, and normal the speech meeting is very fast is flooded by a large amount of meaningless outburst information.This information outburst phenomenon has had a strong impact on the normal order in the interaction platform, the normal interchange between the normal issue that has hindered information and reception and the network user.
Therefore, how effectively to detect outburst information in the interaction platform, become one of present problem demanding prompt solution.
Summary of the invention
The purpose of this invention is to provide a kind of method and apparatus that detects outburst information in the interaction platform.
According to an aspect of the present invention, provide a kind of computer implemented method for detection of outburst information in the interaction platform, wherein, the method may further comprise the steps:
A obtains a plurality of releasing news in the interaction platform;
B obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news;
C determines burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns;
D determines the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.
According to a further aspect in the invention, also provide a kind of equipment for detection of outburst information in the interaction platform, wherein, this equipment comprises:
Information acquisition device, interaction platform is a plurality of to release news for obtaining;
Mass-sending pattern deriving means is used for according to described a plurality of releasing news, and obtains and described a plurality of corresponding one or more mass-sending patterns that release news;
The burst mode deriving means is used for determining burst mode by carrying out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns;
Determine device, be used for according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.
Compared with prior art, the present invention is by judging a plurality of burst mode that release news in the interaction platform, and then definite outburst information corresponding with this burst mode, can detect timely and accurately thus and whether produce the outburst phenomenon in the interaction platform, so that outburst information effectively is treated as possibility, can effectively obtain the information of interaction platform and the purpose of carrying out interactive communication thereby reach the user.
Description of drawings
By reading the detailed description that non-limiting example is done of doing with reference to the following drawings, it is more obvious that other features, objects and advantages of the present invention will become:
Fig. 1 illustrates according to the equipment synoptic diagram of one aspect of the invention for detection of outburst information in the interaction platform;
Fig. 2 illustrates according to the method flow diagram of one aspect of the invention for detection of outburst information in the interaction platform.
Same or analogous Reference numeral represents same or analogous parts in the accompanying drawing.
Embodiment
Below in conjunction with accompanying drawing the present invention is described in further detail.
Fig. 1 illustrates according to the equipment synoptic diagram of one aspect of the invention for detection of outburst information in the interaction platform.Checkout equipment 1 comprises information acquisition device 11, mass-sending pattern deriving means 12, burst mode deriving means 13 and definite device 14.At this, checkout equipment 1 includes but not limited to the cloud that computing machine, network host, single network server, a plurality of webserver collection or a plurality of server consist of.At this, cloud is by consisting of based on a large amount of computing machines of cloud computing (Cloud Computing) or the webserver, and wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine that is comprised of the loosely-coupled computing machine collection of a group.
Particularly, information acquisition device 11 obtains a plurality of releasing news in the interaction platform.More specifically, information acquisition device 11 is in the predetermined time interval or obtain continuously a plurality of releasing news in the specific column of interaction platform or interaction platform, the submission request that releases news of for example submitting to by subscriber equipment by the real-time listening user, to obtain releasing news of user's input, the perhaps communication mode by agreement periodically in the predetermined time interval, such as communication protocols such as http, https, from interaction platform, extract up-to-date a plurality of releasing news.For example, checkout equipment 1 is the webserver of forum, the user releases news by subscriber equipment one section text message conduct of webpage inputting interface input by this forum, then, subscriber equipment releases news this and is packaged into http request and is submitted to the information acquisition device 11 of checkout equipment 1 by http communication protocol as posting of this forum, and then, information acquisition device 11 is by the real-time listening user message, receive and also to resolve this http request, obtain releasing news wherein.For another example, information acquisition device 11 extracted a plurality of releasing news up-to-date in the interaction platform periodically every five minutes.At this, described interaction platform includes but not limited to community, forum, blog, microblogging, in the shopping website to the comment of commodity, news analysis, message interactive etc.Those skilled in the art will be understood that and above-mentionedly obtain a plurality of modes that release news only for for example; other existing or may occur from now on obtain a plurality of modes that release news as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Subsequently, mass-sending pattern deriving means 12 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news.Particularly, a plurality of the releasing news that mass-sending pattern deriving means 12 obtains according to information acquisition device 11, analyse and compare each other by for example those being released news, judging whether those a plurality of releasing news have same or analogous issue feature, and then obtain and those a plurality of corresponding one or more mass-sending patterns that release news.Wherein, described mass-sending pattern means a plurality of information release models that release news with same or similar issue feature by information publisher's issue, for example in a certain forum with regard to a certain much-talked-about topic, a plurality of information publisher's issues have the information release model of the model of a plurality of same keyword, perhaps in a certain forum, by the information release model of the identical model of content of a plurality of information publishers' issues.For example, information acquisition device 11 obtains 100 and releases news in the tennis column of forum, mass-sending pattern deriving means 12 releases news these 100 and analyses and compares each other, all have keyword " Li Na ", " winning the championship " to obtain these 100 90 titles that release news in releasing news, then can obtain accordingly and these 90 the corresponding mass-sending patterns that release news.Again for example, 100 of obtaining in Li Yuchun's column of forum of information acquisition device 11 release news, mass-sending pattern deriving means 12 releases news these 100 and analyses and compares each other, judge to obtain wherein have 80 Chinese characters in the title that releases news all identical, and then can obtain and these 80 the corresponding mass-sending patterns that release news.Those skilled in the art will be understood that the above-mentioned mode of mass-sending pattern of obtaining is only for giving an example; other existing or modes of obtaining the mass-sending pattern that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Then, burst mode deriving means 13 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns.Particularly, burst mode deriving means 13 will be mass-sended one or more mass-sending patterns that pattern deriving means 12 obtains and carry out matching inquiry in the burst mode storehouse, for example will these one or more mass-sending patterns and this burst mode storehouse in burst mode mate, perhaps compare each other analysis by a plurality of the releasing news that will have the mass-sending pattern feature, to extract its issue feature, and then those issue features are mated with a plurality of burst mode in the burst mode storehouse, and coupling obtains the one or more burst mode corresponding with this (a bit) mass-sending pattern accordingly.Wherein, described burst mode includes but not limited to: the character numerical value of a plurality of title contents that release news is identical, the a plurality of Chinese character numbers of content when only keeping Chinese character that release news are identical, information publisher's account content is identical when only keeping Chinese character, a plurality of title contents that release news are verse, and a plurality of contents that release news are the lyrics etc.At this, described burst mode storehouse is used for the storage burst mode.For example, mass-sending pattern deriving means 12 obtain with the tennis column in keyword be 90 corresponding mass-sending patterns that release news that " Li Na " " wins the championship "; Then, burst mode deriving means 13 releases news those and compares each other analysis, with extract its all issue be characterized as and all contain keyword " Li Na " in those titles that release news and " win the championship ", and these 90 in releasing news 80 release news as containing the return information of " RE ", and then those are issued features in the burst mode storehouse, carry out matching inquiry, coupling obtains the burst mode corresponding with this issue feature.Again for example, mass-sending pattern deriving means 12 obtains 80 corresponding mass-sending patterns that release news, and this mass-sending pattern is that the release news Chinese word number average of title is identical; Then, burst mode deriving means 13 releases news those and compares each other analysis, obtain the title Chinese character that releases news and be " I descry bright moonlight before bed; be suspected to be frost on the ground ", and then the coupling acquisition is that title content is the burst mode of identical verse with these 80 the corresponding burst mode that release news in the burst mode storehouse.Those skilled in the art will be understood that the above-mentioned mode of burst mode of obtaining is only for giving an example; other existing or modes of obtaining burst mode that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.Particularly, determine the burst mode that device 14 obtains according to burst mode deriving means 13, determine the release news information corresponding with this (a bit) burst mode with as outburst information, wherein, described outburst information means to have and meets releasing news of burst mode feature.For example, the burst mode that burst mode deriving means 13 obtains is that title content is the burst mode of identical verse, determines device 14 according to this burst mode, extracts its corresponding a plurality of releasing news, and those are released news as outburst information.Those skilled in the art will be understood that the mode of above-mentioned definite outburst information is only for giving an example; the mode of other existing or definite outburst information that may occur from now on is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Preferably, be to work continuously between information acquisition device 11, mass-sending pattern deriving means 12, burst mode deriving means 13 and the definite device 14.Particularly, information acquisition device 11 obtains a plurality of releasing news in the interaction platform; Subsequently, mass-sending pattern deriving means 12 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; Then, burst mode deriving means 13 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; Then, determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.At this, it will be understood by those skilled in the art that " continuing " refers to that each device obtains, mass-sends obtaining and breaking out determining of information of the obtaining of pattern, burst mode according to what the mode of operation of setting or adjust in real time required to release news respectively, until information acquisition device 11 stops in a long time to a plurality of obtaining of releasing news in the interaction platform.
Preferably, described burst mode include but not limited to following at least each:
-title lining up mode;
-Subscriber Queue pattern;
-content lining up mode.
Particularly, the title lining up mode includes but not limited to: 1) number of characters of a plurality of title contents that release news is identical; 2) in a plurality of title contents that release news with identical special character prefix; 3) ratio of identical characters number and total character surpasses default proportion threshold value in a plurality of title contents that release news; 4) a plurality of title contents that release news all do not comprise Chinese character.For example, four title contents that release news are:
I descry bright moonlight before a bed
B is suspected to be frost on the ground
The c prestige bright moon of raising the head
D bows and thinks the native place
These four the title content numbers of words that release news are identical, and then these four release news and belong to title lining up mode in the burst mode, i.e. " number of characters of a plurality of title contents that release news is identical ".Again for example, five title contents that release news are:
a?Fighting!
b?My?friends!
c?Fighting!
d?My?brothers!
e?Never?give?up!
These five title contents that release news all do not comprise Chinese character, and then these five release news and belong to title lining up mode in the burst mode, i.e. " a plurality of title contents that release news all do not comprise Chinese character ".Those skilled in the art will be understood that above-mentioned title lining up mode only for giving an example, and other title lining up modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
The Subscriber Queue pattern includes but not limited to: 1) information publisher's account content same or similar 2) rear its account contents such as meaningless character in removing information publisher's account content, numeral are same or similar; 3) Chinese character in information publisher's account content is same or similar; 4) Chinese character in rear its account contents such as the meaningless character in removing information publisher's account content, numeral is same or similar.For example, five information publisher's account contents that release news are:
1)@of legion of waterborne troops 1
2)@of legion of waterborne troops 2
3)@of legion of waterborne troops 3
4) waterborne troops's Jun Tuan ﹠amp; 5
5) waterborne troops's Jun Tuan ﹠amp; 6
With the meaningless character " " in these five information publisher's account contents, "; " and numeral " 1 ", " 2 ", " 3 ", " 5 ", " 6 " remove, the Chinese character homogeneous phase that keeps is all " legion of waterborne troops ", then these five release news and belong to Subscriber Queue pattern in the burst mode.Those skilled in the art will be understood that above-mentioned Subscriber Queue pattern only for giving an example, and other Subscriber Queue patterns existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
The content lining up mode includes but not limited to: 1) the character numerical value of a plurality of contents that release news is identical; 2) the Chinese character number of a plurality of contents that release news is identical; 3) a plurality of contents that release news all do not comprise Chinese character.For example, four contents that release news are:
1) the invincible ## of my army of #
2) the invincible % of my army of %
3) the invincible@of my army of@
4) ﹠amp; The invincible ﹠amp of my army;
When these four contents that release news only keep Chinese character, its Chinese character homogeneous phase with, then these four release news and belong to content lining up mode in the burst mode.Wherein, described meaningless character means the symbol with Chinese meaning, such as space character, " ", " # " etc.Those skilled in the art will be understood that the foregoing lining up mode only for giving an example, and other content lining up modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
Those skilled in the art will be understood that above-mentioned every burst mode not only can be used for separately obtaining of outburst information, can also be in conjunction with being used for obtaining of outburst information.Those skilled in the art will be understood that above-mentioned burst mode only for giving an example, and other burst mode existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
In a preferred embodiment (with reference to Fig. 1), described burst mode deriving means 13 in conjunction with the auxiliary regular that presets, determine described burst mode also by carry out matching inquiry in described burst mode storehouse.Referring to Fig. 1 the preferred embodiment is described in detail, wherein, information acquisition device 11 obtains a plurality of releasing news in the interaction platform; Mass-sending pattern deriving means 12 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; Determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process of information acquisition device 11 among the described embodiment of Fig. 1, mass-sending pattern deriving means 12 and definite device 14, does not give unnecessary details and do not do.
Particularly, burst mode deriving means 13 will be mass-sended one or more mass-sending patterns that pattern deriving means 12 obtains and carry out matching inquiry in the burst mode storehouse, and according to the auxiliary regular that presets, for example whether a plurality of information issue frequencys that release news issue the frequency greater than the information that presets, and then definite burst mode.For example, mass-sending pattern deriving means 12 is mass-sending pattern of extraction from 20 of Man U's column release news, and this mass-sending pattern is identical for the Chinese character number of words of those title contents that release news; Then, burst mode deriving means 13 should the mass-sending pattern carry out matching inquiry in the burst mode storehouse, obtaining the burst mode corresponding with this mass-sending pattern is title formation burst mode, and obtain the information issue frequency of this Man U's column according to this 20 information issuing times that release news, and this information issue frequency is issued frequency threshold value less than the information that presets, and then judges that this mass-sending pattern is not real title formation burst mode.Those skilled in the art will be understood that the above-mentioned mode of burst mode of obtaining is only for giving an example; other existing or modes of obtaining burst mode that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
At this, burst mode deriving means 13 can also judge exactly whether the mass-sending pattern is carried out the burst mode that matching inquiry obtains in the burst mode storehouse correct according to those auxiliary regulars, greatly improve the accuracy of determining burst mode, realize effectively outburst information being determined, and then reduce the False Rate that the mistake that will normally release news is decided to be outburst information.
Preferably, burst mode deriving means 13 in conjunction with based on but be not limited to following at least each the described auxiliary regular that presets, determine described burst mode:
-described a plurality of issuing time that release news;
-with described a plurality of corresponding information publishers' that release news relevant information.
Particularly, based on described a plurality of issuing time that release news, can determine to include but not limited to: the information issue frequency of an information publisher's the information issue frequency, whole interaction platform, the information issue frequency of a certain plate in the interaction platform.For example, burst mode deriving means 13 carries out matching inquiry by mass-sending pattern in the burst mode storehouse, with determine with this (etc.) the corresponding burst mode of mass-sending pattern, but this burst mode deriving means 13 is according to the corresponding a plurality of information issuing times that release news that are positioned at same column of this mass-sending pattern, the average information issue frequency of determined these a plurality of place columns that release news is less than default information issue frequency threshold value, and then burst mode deriving means 13 judges that this mass-sending pattern is not burst mode.。
Described a plurality of relevant information that releases news corresponding information publisher includes but not limited to: information publisher's hour of log-on, information publisher whether in blacklist, information publisher's user credit degree etc.For example, burst mode deriving means 13 is by carrying out matching inquiry in the burst mode storehouse, determining the burst mode corresponding with the mass-sending pattern, but should the corresponding a plurality of publisher's user profile degree height that release news of mass-sending pattern, judge that then this mass-sending pattern is not burst mode.。
Those skilled in the art will be understood that based on above-mentioned two auxiliary regulars that preset not only can be used for separately the auxiliary burst mode of determining, can also be in conjunction with being used for the auxiliary burst mode of determining.Those skilled in the art will be understood that the mode of above-mentioned definite burst mode is only for giving an example; the mode of other definite burst mode existing or that may occur from now on is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
More preferably, burst mode deriving means 13 in conjunction with based on the described auxiliary regular that presets of described a plurality of corresponding information publishers' that release news relevant information, determine described burst mode, wherein, described information publisher's relevant information include but not limited to following at least each:
-information publisher's historical behavior record;
-information publisher's hour of log-on;
-information publisher's IP address;
The quantity that-information publisher released news within the unit interval.
Particularly, information publisher's relevant information comprises information publisher's historical behavior record, wherein, information publisher's historical behavior record includes but not limited to: release news content, information publisher's history interocclusal record, information publisher's when releasing news historical online hours etc. of information publisher's history.For example, burst mode deriving means 13 carries out matching inquiry with the information publisher's account that releases news in the historical behavior database, releasing news to be normally with the history that obtains this information publisher releases news, and then judges that this information publisher's user credit degree is higher.Wherein, described historical behavior database is used for storage information publisher's historical behavior record, includes but not limited to relational database, memory storage, harddisk memory etc.
Information publisher's relevant information comprises information publisher's hour of log-on.Burst mode deriving means 13 is according to information publisher's hour of log-on, and for example information publisher's hour of log-on is before 2 years of current time, to judge that then this information publisher's user credit degree is higher.
Information publisher's relevant information comprises information publisher's IP address, based on information publisher's IP address, can determine to include but not limited to: whether this IP address has historical outburst delivering, this IP address to comprise the quantity etc. of information publisher's account.For example, burst mode deriving means 13 is according to information publisher's IP address, in address database, carry out matching inquiry, do not had and have in a large number the historical record that releasing news of similar features sent to obtain this IP address, and then judged that this information publisher's user credit degree is higher.Wherein, address database is used for storage and once issued the IP address and the corresponding history thereof that release news and release news.
Information publisher's relevant information comprises the quantity that the information publisher releases news within the unit interval.For example, the quantity that burst mode deriving means 13 releases news within the unit interval according to the information publisher, compare with predetermined information issue frequency threshold value, issue frequency threshold value when the quantity that this information publisher releases news less than this information within the unit interval, judge that then this information publisher's user credit degree is higher.
Those skilled in the art will be understood that the relevant information based on above-mentioned four information publishers not only can be used for separately the auxiliary burst mode of determining, can also be in conjunction with being used for the auxiliary burst mode of determining.Those skilled in the art will be understood that above-mentioned information publisher's relevant information is only for giving an example; other information publishers' existing or that may occur from now on relevant information is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
More preferably, described information publisher's relevant information comprises information publisher's historical behavior record, and wherein, this checkout equipment 1 also comprises record updating device (not shown), the record updating device upgrades described information publisher's historical behavior record according to described outburst information.Particularly, information publisher's relevant information comprises information publisher's historical behavior record, the record updating device is according to the outburst information corresponding with burst mode of determining that device 14 is determined, break out the time of information, information publisher's the information such as online hours with the outburst information content of extracting these accounts that break out the information publisher of information, these information publishers' issues, these information publishers' issue, in such as the historical behavior database, add these information publishers' historical behavior record.For example, the record updating device is according to 80 outburst information in Li Yuchun's column of determining that device 14 obtains, these 80 outburst information are analyzed, to extract information publisher's account of these outburst information, and the outburst information content of the corresponding issue of these information publisher's accounts, these persons of releasing news issue each corresponding time of outburst information, the online hours of these persons of releasing news when this interaction platform generation information outburst, then, the record updating device adds the corresponding historical behavior record of this information publisher's account in the historical behavior database according to information publisher's account.Those skilled in the art will be understood that the mode of above-mentioned renewal historical behavior record is only for giving an example; the mode of other renewal historical behaviors existing or that may occur from now on records is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
In another preferred embodiment (with reference to Fig. 1), checkout equipment 1 also comprises the pretreatment unit (not shown), and pretreatment unit carries out pre-service to described a plurality of releasing news, and obtains the pre-service result; Wherein, mass-sending pattern deriving means 12 obtains described one or more mass-sending pattern also according to described pre-service result.Referring to Fig. 1 the preferred embodiment is described in detail, wherein, information acquisition device 11 obtains a plurality of releasing news in the interaction platform; Burst mode deriving means 13 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; Determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process of information acquisition device 11 among the described embodiment of Fig. 1, burst mode deriving means 13 and definite device 14, does not give unnecessary details and do not do.
Particularly, pretreatment unit carries out pre-service to a plurality of the releasing news that information acquisition device 11 obtains, this pretreated mode includes but not limited to: remove the meaningless character in a plurality of the releasing news, the numeral in a plurality of the releasing news of removal etc., to obtain the pre-service result; Then, mass-sending pattern deriving means 12 obtains one or more mass-sending patterns also according to those pre-service result.For example, a plurality of contents that release news are:
1) the invincible ## of my army of #
2) the invincible % of my army of %
3) the invincible@of my army of@
4) ﹠amp; The invincible ﹠amp of my army;
Pretreatment unit with in these four contents that release news without meaning character " # ", " % ", " ", "; " remove, and keep Chinese character with as with result:
My army of a is invincible
My army of b is invincible
My army of c is invincible
My army of d is invincible
Then, mass-sending pattern deriving means 12 is also according to this pre-service result, with pre-service as a result a, b, c, d compare each other analysis, to obtain the as a result equal identical mass-sending pattern of a, b, c, d content of pre-service.Those skilled in the art will be understood that above-mentioned pretreated mode only for giving an example, and other pretreated modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
In another preferred embodiment (with reference to Fig. 1), checkout equipment 1 also comprises the after-treatment device (not shown), and after-treatment device carries out corresponding aftertreatment according to described outburst information to described interaction platform.Referring to Fig. 1 the preferred embodiment is described in detail, wherein, information acquisition device 11 obtains a plurality of releasing news in the interaction platform; Mass-sending pattern deriving means 12 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; Burst mode deriving means 13 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; Determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.Its detailed process is with aforementioned identical with reference to the performed process of information acquisition device 11 among the described embodiment of Fig. 1, mass-sending pattern deriving means 12, burst mode deriving means 13 and definite device 14, for simplicity's sake, be contained in this with way of reference, do not give unnecessary details and do not do.
Particularly, after-treatment device is according to the outburst information of determining that device 14 obtains, a column to interaction platform or interaction platform carries out corresponding aftertreatment, the information that for example will break out is all deleted, or by stop to a column of this interaction platform or this interaction platform domain name mapping, stop the modes such as this interaction platform server operation, a column of this interaction platform or interaction platform is closed.For example, determine the outburst information of a certain forum that device 14 obtains, namely determine this forum's generation information outburst phenomenon, then after-treatment device is all deleted those outburst information, and perhaps after-treatment device stops the domain name mapping of a column of this interaction platform.Those skilled in the art will be understood that the mode of above-mentioned aftertreatment only for giving an example, and the mode of other aftertreatments existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and is contained in this at this with way of reference.
Preferably, described aftertreatment include but not limited to following at least each:
The described a plurality of outburst information of-deletion;
-forbid that the information publisher of described a plurality of outburst information releases news.
Particularly, the mode of aftertreatment includes but not limited to: 1) after-treatment device will determine that the outburst information that device 14 obtains all deletes; 2) after-treatment device is according to the outburst information of determining that device 14 obtains, extracting the corresponding information publisher's account of those outburst information, and by closing those information publisher's accounts, thereby forbid that those information publishers release news.Those skilled in the art will be understood that above-mentioned two post processing modes not only can be used for separately the aftertreatment of outburst information, can also be in conjunction with the aftertreatment that is used for outburst information.Those skilled in the art will be understood that the mode of above-mentioned aftertreatment only for giving an example, and the mode of other aftertreatments existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and is contained in this at this with way of reference.
At this, described aftertreatment can realize in time outburst information and information publisher thereof being processed, farthest reduce outburst information to the negative effect of normal users, so that normal users can effectively be obtained the information of interaction platform and carry out interactive communication, and safeguard the normal operation order of interaction platform, further, promote user's experience.
In another preferred embodiment (with reference to Fig. 1), checkout equipment 1 also comprises pattern base updating device (not shown), and the pattern base updating device upgrades described burst mode storehouse according to described outburst information.Referring to Fig. 1 the preferred embodiment is described in detail, wherein, information acquisition device 11 obtains a plurality of releasing news in the interaction platform; Mass-sending pattern deriving means 12 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; Burst mode deriving means 13 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; Determine device 14 according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.Its detailed process is with aforementioned identical with reference to the performed process of information acquisition device 11 among the described embodiment of Fig. 1, mass-sending pattern deriving means 12, burst mode deriving means 13 and definite device 14, for simplicity's sake, be contained in this with way of reference, do not give unnecessary details and do not do.
The pattern base updating device is according to the outburst information of determining that device 14 obtains, those outburst information are analysed and compared each other, to extract the whole same or analogous issue feature that has between those outburst information, and in the burst mode storehouse, carry out matching inquiry, when arbitrary same or analogous issue feature when the match is successful in the burst mode storehouse, then will issue feature and be added to this burst mode storehouse as new burst mode.For example, determine that device 14 obtains 80 outburst information, the title of these 80 outburst information is " my army is invincible, and with whom would fight for mastery "; Then, this title that releases news is carried out matching inquiry for the burst mode of " I army invincible with whom would fight for mastery " to the pattern base updating device in the burst mode storehouse and the match is successful, and then the pattern base updating device title that will release news is added to this burst mode storehouse for this burst mode of " my army is invincible, and with whom would fight for mastery ".Those skilled in the art will be understood that the mode in above-mentioned renewal burst mode storehouse is only for giving an example; the mode in other renewal burst mode existing or that may occur from now on storehouses is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Fig. 2 illustrates according to the method flow diagram of one aspect of the invention for detection of outburst information in the interaction platform.At this, checkout equipment 1 includes but not limited to the cloud that computing machine, network host, single network server, a plurality of webserver collection or a plurality of server consist of.At this, cloud is by consisting of based on a large amount of computing machines of cloud computing (Cloud Computing) or the webserver, and wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine that is comprised of the loosely-coupled computing machine collection of a group.
Particularly, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform.More specifically, in step S1, checkout equipment 1 is in the predetermined time interval or obtain continuously a plurality of releasing news in the specific column of interaction platform or interaction platform, the submission request that releases news of for example submitting to by subscriber equipment by the real-time listening user, to obtain releasing news of user's input, perhaps in the predetermined time interval periodically by the communication mode of agreement, such as communication protocols such as http, https, from interaction platform, extract up-to-date a plurality of releasing news.For example, checkout equipment 1 is the webserver of forum, the user releases news by subscriber equipment one section text message conduct of webpage inputting interface input by this forum, then, subscriber equipment releases news this and is packaged into the http request and is submitted to checkout equipment 1 by http communication protocol as posting of this forum, then, in step S1, checkout equipment 1 receives and resolves this http request by the real-time listening user message, obtains releasing news wherein.For another example, in step S1, checkout equipment 1 extracted a plurality of releasing news up-to-date in the interaction platform periodically every five minutes.At this, described interaction platform includes but not limited to community, forum, blog, microblogging, in the shopping website to the comment of commodity, news analysis, message interactive etc.Those skilled in the art will be understood that and above-mentionedly obtain a plurality of modes that release news only for for example; other existing or may occur from now on obtain a plurality of modes that release news as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Subsequently, in step S2, checkout equipment 1 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news.Particularly, in step S2, checkout equipment 1 is according to its a plurality of releasing news of obtaining in step S1, analyse and compare each other by for example those being released news, judging whether those a plurality of releasing news have same or analogous issue feature, and then obtain and those a plurality of corresponding one or more mass-sending patterns that release news.Wherein, described mass-sending pattern means a plurality of information release models that release news with same or similar issue feature by information publisher's issue, for example in a certain forum with regard to a certain much-talked-about topic, a plurality of information publisher's issues have the information release model of the model of a plurality of same keyword, perhaps in a certain forum, by the information release model of the identical model of content of a plurality of information publishers' issues.For example, in step S1, checkout equipment 1 obtains 100 and releases news in the tennis column of forum, in step S2, checkout equipment 1 releases news these 100 and analyses and compares each other, all have keyword " Li Na ", " winning the championship " to obtain these 100 90 titles that release news in releasing news, then can obtain accordingly and these 90 the corresponding mass-sending patterns that release news.Again for example, in step S1,100 of obtaining in Li Yuchun's column of forum of checkout equipment 1 release news, in step S2, checkout equipment 1 releases news these 100 and analyses and compares each other, judge to obtain wherein have 80 Chinese characters in the title that releases news all identical, and then can obtain and these 80 the corresponding mass-sending patterns that release news.Those skilled in the art will be understood that the above-mentioned mode of mass-sending pattern of obtaining is only for giving an example; other existing or modes of obtaining the mass-sending pattern that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Then, in step S3, checkout equipment 1 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns.Particularly, in step S3, checkout equipment 1 carries out matching inquiry with one or more mass-sending patterns that it obtains in the burst mode storehouse in step S2, for example will these one or more mass-sending patterns and this burst mode storehouse in burst mode mate, perhaps compare each other analysis by a plurality of the releasing news that will have the mass-sending pattern feature, to extract its issue feature, and then those issue features are mated with a plurality of burst mode in the burst mode storehouse, and coupling obtains the one or more burst mode corresponding with this (a bit) mass-sending pattern accordingly.Wherein, described burst mode includes but not limited to: the character numerical value of a plurality of title contents that release news is identical, the a plurality of Chinese character numbers of content when only keeping Chinese character that release news are identical, information publisher's account content is identical when only keeping Chinese character, a plurality of title contents that release news are verse, and a plurality of contents that release news are the lyrics etc.At this, described burst mode storehouse is used for the storage burst mode.For example, in step S2, checkout equipment 1 obtain with the tennis column in keyword be 90 corresponding mass-sending patterns that release news that " Li Na " " wins the championship "; Then, in step S3, checkout equipment 1 releases news those and compares each other analysis, with extract its all issue be characterized as and all contain keyword " Li Na " in those titles that release news and " win the championship ", and these 90 in releasing news 80 release news as containing the return information of " RE ", and then those are issued features in the burst mode storehouse, carry out matching inquiry, coupling obtains the burst mode corresponding with this issue feature.Again for example, in step S2, checkout equipment 1 obtains 80 corresponding mass-sending patterns that release news, and this mass-sending pattern is that the release news Chinese word number average of title is identical; Then, in step S3, checkout equipment 1 releases news those and compares each other analysis, obtain the title Chinese character that releases news and be " I descry bright moonlight before bed; be suspected to be frost on the ground ", and then the coupling acquisition is that title content is the burst mode of identical verse with these 80 the corresponding burst mode that release news in the burst mode storehouse.Those skilled in the art will be understood that the above-mentioned mode of burst mode of obtaining is only for giving an example; other existing or modes of obtaining burst mode that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
In step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.Particularly, in step S4, checkout equipment 1 is according to its burst mode of obtaining in step S3, determines that the release news information corresponding with this (a bit) burst mode is with as outburst information, wherein, described outburst information means to have and meets releasing news of burst mode feature.For example, in step S3, the burst mode that checkout equipment 1 obtains is that title content is the burst mode of identical verse, in step S4, checkout equipment 1 extracts its corresponding a plurality of releasing news according to this burst mode, and those are released news as outburst information.Those skilled in the art will be understood that the mode of above-mentioned definite outburst information is only for giving an example; the mode of other existing or definite outburst information that may occur from now on is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
Preferably, checkout equipment 1 is to work continuously between step S1, step S2, step S3 and step S4.Particularly, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform; Subsequently, in step S2, checkout equipment 1 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; Then, in step S3, checkout equipment 1 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; Then, in step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.At this, it will be understood by those skilled in the art that " continuing " refers to that checkout equipment 1 obtains, mass-sends obtaining and breaking out determining of information of the obtaining of pattern, burst mode according to what the mode of operation of setting or adjust in real time required to release news respectively in each step, until checkout equipment 1 stops in a long time to a plurality of obtaining of releasing news in the interaction platform.
Preferably, described burst mode include but not limited to following at least each:
-title lining up mode;
-Subscriber Queue pattern;
-content lining up mode.
Particularly, the title lining up mode includes but not limited to: 1) number of characters of a plurality of title contents that release news is identical; 2) in a plurality of title contents that release news with identical special character prefix; 3) ratio of identical characters number and total character surpasses default proportion threshold value in a plurality of title contents that release news; 4) a plurality of title contents that release news all do not comprise Chinese character.For example, four title contents that release news are:
I descry bright moonlight before a bed
B is suspected to be frost on the ground
The c prestige bright moon of raising the head
D bows and thinks the native place
These four the title content numbers of words that release news are identical, and then these four release news and belong to title lining up mode in the burst mode, i.e. " number of characters of a plurality of title contents that release news is identical ".Again for example, five title contents that release news are:
a?Fighting!
b?My?friends!
c?Fighting!
d?My?brothers!
e?Never?give?up!
These five title contents that release news all do not comprise Chinese character, and then these five release news and belong to title lining up mode in the burst mode, i.e. " a plurality of title contents that release news all do not comprise Chinese character ".Those skilled in the art will be understood that above-mentioned title lining up mode only for giving an example, and other title lining up modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
The Subscriber Queue pattern includes but not limited to: 1) information publisher's account content same or similar 2) rear its account contents such as meaningless character in removing information publisher's account content, numeral are same or similar; 3) Chinese character in information publisher's account content is same or similar; 4) Chinese character in rear its account contents such as the meaningless character in removing information publisher's account content, numeral is same or similar.For example, five information publisher's account contents that release news are:
1)@of legion of waterborne troops 1
2)@of legion of waterborne troops 2
3)@of legion of waterborne troops 3
4) waterborne troops's Jun Tuan ﹠amp; 5
5) waterborne troops's Jun Tuan ﹠amp; 6
With the meaningless character " " in these five information publisher's account contents, "; " and numeral " 1 ", " 2 ", " 3 ", " 5 ", " 6 " remove, the Chinese character homogeneous phase that keeps is all " legion of waterborne troops ", then these five release news and belong to Subscriber Queue pattern in the burst mode.Those skilled in the art will be understood that above-mentioned Subscriber Queue pattern only for giving an example, and other Subscriber Queue patterns existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
The content lining up mode includes but not limited to: 1) the character numerical value of a plurality of contents that release news is identical; 2) the Chinese character number of a plurality of contents that release news is identical; 3) a plurality of contents that release news all do not comprise Chinese character.For example, four contents that release news are:
1) the invincible ## of my army of #
2) the invincible % of my army of %
3) the invincible@of my army of@
4) ﹠amp; The invincible ﹠amp of my army;
When these four contents that release news only keep Chinese character, its Chinese character homogeneous phase with, then these four release news and belong to content lining up mode in the burst mode.Wherein, described meaningless character means the symbol with Chinese meaning, such as space character, " ", " # " etc.Those skilled in the art will be understood that the foregoing lining up mode only for giving an example, and other content lining up modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
Those skilled in the art will be understood that above-mentioned every burst mode not only can be used for separately obtaining of outburst information, can also be in conjunction with being used for obtaining of outburst information.Those skilled in the art will be understood that above-mentioned burst mode only for giving an example, and other burst mode existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
In a preferred embodiment (with reference to Fig. 2), in step S3, checkout equipment 1 in conjunction with the auxiliary regular that presets, is determined described burst mode also by carry out matching inquiry in described burst mode storehouse.Referring to Fig. 2 the preferred embodiment is described in detail, wherein, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform; In step S2, checkout equipment 1 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; In step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process in step S1, step S2 and step S4 of checkout equipment 1 among the described embodiment of Fig. 2, does not give unnecessary details and do not do.
Particularly, in step S3, checkout equipment 1 carries out matching inquiry with one or more mass-sending patterns that it obtains in the burst mode storehouse in step S2, and according to the auxiliary regular that presets, for example whether a plurality of information issue frequencys that release news issue the frequency greater than the information that presets, and then definite burst mode.For example, in step S2, checkout equipment 1 is mass-sending pattern of extraction from 20 of Man U's column release news, and this mass-sending pattern is identical for the Chinese character number of words of those title contents that release news; Then, in step S3, checkout equipment 1 should the mass-sending pattern carry out matching inquiry in the burst mode storehouse, obtaining the burst mode corresponding with this mass-sending pattern is title formation burst mode, and obtain the information issue frequency of this Man U's column according to this 20 information issuing times that release news, and this information issue frequency is issued frequency threshold value less than the information that presets, and then judges that this mass-sending pattern is not real title formation burst mode.Those skilled in the art will be understood that the above-mentioned mode of burst mode of obtaining is only for giving an example; other existing or modes of obtaining burst mode that may occur from now on are as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
At this, in step S3, checkout equipment 1 can also judge exactly whether the mass-sending pattern is carried out the burst mode that matching inquiry obtains in the burst mode storehouse correct according to those auxiliary regulars, greatly improve the accuracy of determining burst mode, realize effectively outburst information being determined, and then reduce the False Rate that the mistake that will normally release news is decided to be outburst information.
Preferably, in step S3, checkout equipment 1 in conjunction with based on but be not limited to following at least each the described auxiliary regular that presets, determine described burst mode:
-described a plurality of issuing time that release news;
-with described a plurality of corresponding information publishers' that release news relevant information.
Particularly, based on described a plurality of issuing time that release news, can determine to include but not limited to: the information issue frequency of an information publisher's the information issue frequency, whole interaction platform, the information issue frequency of a certain plate in the interaction platform.For example, in step S3, checkout equipment 1 carries out matching inquiry by mass-sending pattern in the burst mode storehouse, with determine with this (etc.) the corresponding burst mode of mass-sending pattern, but this checkout equipment 1 is according to the corresponding a plurality of information issuing times that release news that are positioned at same column of this mass-sending pattern, the average information issue frequency of determined these a plurality of place columns that release news is less than default information issue frequency threshold value, and then checkout equipment 1 judges that this mass-sending pattern is not burst mode.。
Described a plurality of relevant information that releases news corresponding information publisher includes but not limited to: information publisher's hour of log-on, information publisher whether in blacklist, information publisher's user credit degree etc.For example, in step S3, checkout equipment 1 is by carrying out matching inquiry, to determine the burst mode corresponding with the mass-sending pattern in the burst mode storehouse, but should the mass-sending pattern corresponding a plurality of publisher's user profile degree that release news are high, judge that then this mass-sending pattern is not burst mode.。
Those skilled in the art will be understood that based on above-mentioned two auxiliary regulars that preset not only can be used for separately the auxiliary burst mode of determining, can also be in conjunction with being used for the auxiliary burst mode of determining.Those skilled in the art will be understood that the mode of above-mentioned definite burst mode is only for giving an example; the mode of other definite burst mode existing or that may occur from now on is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
More preferably, in step S3, checkout equipment 1 in conjunction with based on the described auxiliary regular that presets of described a plurality of corresponding information publishers' that release news relevant information, determine described burst mode, wherein, described information publisher's relevant information include but not limited to following at least each:
-information publisher's historical behavior record;
-information publisher's hour of log-on;
-information publisher's IP address;
The quantity that-information publisher released news within the unit interval.
Particularly, information publisher's relevant information comprises information publisher's historical behavior record, wherein, information publisher's historical behavior record includes but not limited to: release news content, information publisher's history interocclusal record, information publisher's when releasing news historical online hours etc. of information publisher's history.For example, in step S3, checkout equipment 1 carries out matching inquiry with the information publisher's account that releases news in the historical behavior database, releasing news to be normally with the history that obtains this information publisher releases news, and then judges that this information publisher's user credit degree is higher.Wherein, described historical behavior database is used for storage information publisher's historical behavior record, includes but not limited to relational database, memory storage, harddisk memory etc.
Information publisher's relevant information comprises information publisher's hour of log-on.In step S3, checkout equipment 1 is according to information publisher's hour of log-on, and for example information publisher's hour of log-on is before 2 years of current time, to judge that then this information publisher's user credit degree is higher.
Information publisher's relevant information comprises information publisher's IP address, based on information publisher's IP address, can determine to include but not limited to: whether this IP address has historical outburst delivering, this IP address to comprise the quantity etc. of information publisher's account.For example, in step S3, checkout equipment 1 carries out matching inquiry according to information publisher's IP address in address database, do not had and had in a large number the historical record that releasing news of similar features sent to obtain this IP address, and then judged that this information publisher's user credit degree was higher.Wherein, address database is used for storage and once issued the IP address and the corresponding history thereof that release news and release news.
Information publisher's relevant information comprises the quantity that the information publisher releases news within the unit interval.For example, in step S3, the quantity that checkout equipment 1 releases news within the unit interval according to the information publisher, compare with predetermined information issue frequency threshold value, issue frequency threshold value when the quantity that this information publisher releases news less than this information within the unit interval, judge that then this information publisher's user credit degree is higher.
Those skilled in the art will be understood that the relevant information based on above-mentioned four information publishers not only can be used for separately the auxiliary burst mode of determining, can also be in conjunction with being used for the auxiliary burst mode of determining.Those skilled in the art will be understood that above-mentioned information publisher's relevant information is only for giving an example; other information publishers' existing or that may occur from now on relevant information is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
More preferably, described information publisher's relevant information comprises information publisher's historical behavior record, wherein, this process also comprises step S5 (not shown), in step S5, checkout equipment 1 upgrades described information publisher's historical behavior record according to described outburst information.Particularly, information publisher's relevant information comprises information publisher's historical behavior record, in step S5, checkout equipment 1 is according to its outburst information corresponding with burst mode of determining in step S4, break out the time of information, information publisher's the information such as online hours with the outburst information content of extracting these accounts that break out the information publisher of information, these information publishers' issues, these information publishers' issue, in such as the historical behavior database, add these information publishers' historical behavior record.For example, in step S5, checkout equipment 1 is according to its 80 outburst information in Li Yuchun's column of obtaining in step S4, these 80 outburst information are analyzed, to extract information publisher's account of these outburst information, and the outburst information content of the corresponding issue of these information publisher's accounts, these persons of releasing news issue each corresponding time of outburst information, the online hours of these persons of releasing news when this interaction platform generation information outburst, then, checkout equipment 1 adds the corresponding historical behavior record of this information publisher's account in the historical behavior database according to information publisher's account.Those skilled in the art will be understood that the mode of above-mentioned renewal historical behavior record is only for giving an example; the mode of other renewal historical behaviors existing or that may occur from now on records is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
In another preferred embodiment (with reference to Fig. 2), this process also comprises step S6 (not shown), and in step S6,1 pair of described a plurality of releasing news of checkout equipment is carried out pre-service, obtains the pre-service result; Wherein, in step S2, checkout equipment 1 obtains described one or more mass-sending pattern also according to described pre-service result.Referring to Fig. 2 the preferred embodiment is described in detail, wherein, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform; In step S3, checkout equipment 1 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; In step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process in step S1, step S3 and step S4 of checkout equipment 1 among the described embodiment of Fig. 2, does not give unnecessary details and do not do.
Particularly, in step S6, checkout equipment 1 carries out pre-service to a plurality of the releasing news that it obtains in step S1, this pretreated mode includes but not limited to: remove the meaningless character in a plurality of the releasing news, the numeral in a plurality of the releasing news of removal etc., to obtain the pre-service result; Then, in step S2, checkout equipment 1 obtains one or more mass-sending patterns also according to those pre-service result.For example, a plurality of contents that release news are:
1) the invincible ## of my army of #
2) the invincible % of my army of %
3) the invincible@of my army of@
4) ﹠amp; The invincible ﹠amp of my army;
In step S6, checkout equipment 1 with in these four contents that release news without meaning character " # ", " % ", " ", "; " remove, and keep Chinese character with as with result:
My army of a is invincible
My army of b is invincible
My army of c is invincible
My army of d is invincible
Then, in step S2, checkout equipment 1 is also according to this pre-service result, with pre-service as a result a, b, c, d compare each other analysis, to obtain the as a result equal identical mass-sending pattern of a, b, c, d content of pre-service.Those skilled in the art will be understood that above-mentioned pretreated mode only for giving an example, and other pretreated modes existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and are contained in this at this with way of reference.
In another preferred embodiment (with reference to Fig. 2), this process also comprises step S7 (not shown), and in step S7, checkout equipment 1 carries out corresponding aftertreatment according to described outburst information to described interaction platform.Referring to Fig. 2 the preferred embodiment is described in detail, wherein, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform; In step S2, checkout equipment 1 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; In step S3, checkout equipment 1 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; In step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process in step S1, step S2, step S3 and step S4 of checkout equipment 1 among the described embodiment of Fig. 2, does not give unnecessary details and do not do.
Particularly, in step S7, checkout equipment 1 is according to its outburst information of obtaining in step S4, a column to interaction platform or interaction platform carries out corresponding aftertreatment, the information that for example will break out is all deleted, or by stop to a column of this interaction platform or this interaction platform domain name mapping, stop the modes such as this interaction platform server operation, a column of this interaction platform or interaction platform is closed.For example, in step S4, the outburst information of a certain forum that checkout equipment 1 obtains, namely determine this forum's generation information outburst phenomenon, then in step S7, checkout equipment 1 is all deleted those outburst information, and perhaps in step S7, checkout equipment 1 stops the domain name mapping of a column of this interaction platform.Those skilled in the art will be understood that the mode of above-mentioned aftertreatment only for giving an example, and the mode of other aftertreatments existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and is contained in this at this with way of reference.
Preferably, described aftertreatment include but not limited to following at least each:
The described a plurality of outburst information of-deletion;
-forbid that the information publisher of described a plurality of outburst information releases news.
Particularly, the mode of aftertreatment includes but not limited to: 1) in step S7, checkout equipment 1 is all deleted the outburst information that it obtains in step S4; 2) in step S7, checkout equipment 1 is according to its outburst information of obtaining in step S4, extracting the corresponding information publisher's account of those outburst information, and by closing those information publisher's accounts, thereby forbid that those information publishers release news.Those skilled in the art will be understood that above-mentioned two post processing modes not only can be used for separately the aftertreatment of outburst information, can also be in conjunction with the aftertreatment that is used for outburst information.Those skilled in the art will be understood that the mode of above-mentioned aftertreatment only for giving an example, and the mode of other aftertreatments existing or that may occur from now on also should be included in the protection domain of the present invention as applicable to the present invention, and is contained in this at this with way of reference.
At this, described aftertreatment can realize in time outburst information and information publisher thereof being processed, farthest reduce outburst information to the negative effect of normal users, so that normal users can effectively be obtained the information of interaction platform and carry out interactive communication, and safeguard the normal operation order of interaction platform, further, promote user's experience.
In another preferred embodiment (with reference to Fig. 2), this process also comprises step S8 (not shown), and in step S8, checkout equipment 1 upgrades described burst mode storehouse according to described outburst information.Referring to Fig. 2 the preferred embodiment is described in detail, wherein, in step S1, checkout equipment 1 obtains a plurality of releasing news in the interaction platform; In step S2, checkout equipment 1 obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news; In step S3, checkout equipment 1 is determined burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns; In step S4, checkout equipment 1 is determined the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.Its detailed process for simplicity's sake, is contained in this with way of reference with aforementioned identical with reference to the performed process in step S1, step S2, step S3 and step S4 of checkout equipment 1 among the described embodiment of Fig. 2, does not give unnecessary details and do not do.
In step S8, checkout equipment 1 is according to its outburst information of obtaining in step S4, those outburst information are analysed and compared each other, to extract the whole same or analogous issue feature that has between those outburst information, and in the burst mode storehouse, carry out matching inquiry, when arbitrary same or analogous issue feature when the match is successful in the burst mode storehouse, then will issue feature and be added to this burst mode storehouse as new burst mode.For example, in step S4, checkout equipment 1 obtains 80 outburst information, and the title of these 80 outburst information is " my army is invincible, and with whom would fight for mastery "; Then, in step S8, this title that releases news is carried out matching inquiry for the burst mode of " I army invincible with whom would fight for mastery " in the burst mode storehouse with checkout equipment 1 and the match is successful, and then checkout equipment 1 title that will release news is added to this burst mode storehouse for this burst mode of " my army is invincible, and with whom would fight for mastery ".Those skilled in the art will be understood that the mode in above-mentioned renewal burst mode storehouse is only for giving an example; the mode in other renewal burst mode existing or that may occur from now on storehouses is as applicable to the present invention; also should be included in the protection domain of the present invention, and be contained in this at this with way of reference.
To those skilled in the art, obviously the invention is not restricted to the details of above-mentioned example embodiment, and in the situation that does not deviate from spirit of the present invention or essential characteristic, can realize the present invention with other concrete form.Therefore, no matter from which point, all should regard embodiment as exemplary, and be nonrestrictive, scope of the present invention is limited by claims rather than above-mentioned explanation, therefore is intended to be included in the present invention dropping on the implication that is equal to important document of claim and all changes in the scope.Any Reference numeral in the claim should be considered as limit related claim.In addition, obviously other unit or step do not got rid of in " comprising " word, and odd number is not got rid of plural number.A plurality of unit of stating in the device claim or device also can be realized by software or hardware by a unit or device.The first, the second word such as grade is used for representing title, and does not represent any specific order.

Claims (20)

  1. One kind computer implemented for detection of in the interaction platform outburst information method, the method may further comprise the steps:
    A obtains a plurality of releasing news in the interaction platform;
    B obtains and described a plurality of corresponding one or more mass-sending patterns that release news according to described a plurality of releasing news;
    C determines burst mode by carry out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns;
    D determines the outburst information corresponding with described burst mode according to described burst mode from described a plurality of releasing news.
  2. 2. method according to claim 1, wherein, described burst mode comprise following at least each:
    -title lining up mode;
    -Subscriber Queue pattern;
    -content lining up mode.
  3. 3. method according to claim 1 and 2, wherein, described step c also comprises:
    -by in described burst mode storehouse, carrying out matching inquiry, in conjunction with the auxiliary regular that presets, determine described burst mode.
  4. 4. method according to claim 3, wherein, in conjunction with based on following at least each the described auxiliary regular that presets, determine described burst mode:
    -described a plurality of issuing time that release news;
    -with described a plurality of corresponding information publishers' that release news relevant information.
  5. 5. method according to claim 4, in conjunction with based on the described auxiliary regular that presets of described a plurality of corresponding information publishers' that release news relevant information, determine described burst mode, wherein, described information publisher's relevant information comprise following at least each:
    -information publisher's historical behavior record;
    -information publisher's hour of log-on;
    -information publisher's IP address;
    The quantity that-information publisher released news within the unit interval.
  6. 6. method according to claim 5, described information publisher's relevant information comprises information publisher's historical behavior record, wherein, the method also comprises:
    -according to described outburst information, upgrade described information publisher's historical behavior record.
  7. 7. each described method in 6 according to claim 1, wherein, the method also comprises:
    -described a plurality of releasing news carried out pre-service, obtain the pre-service result;
    Wherein, described step b also comprises:
    -according to described pre-service result, obtain described one or more mass-sending pattern.
  8. 8. each described method in 7 according to claim 1, wherein, the method also comprises:
    -according to described outburst information, described interaction platform is carried out corresponding aftertreatment.
  9. 9. method according to claim 8, wherein, described aftertreatment comprise following at least each:
    The described a plurality of outburst information of-deletion;
    -forbid that the information publisher of described a plurality of outburst information releases news.
  10. 10. each described method in 9 according to claim 1, wherein, the method also comprises:
    -according to described outburst information, upgrade described burst mode storehouse.
  11. 11. the equipment for detection of outburst information in the interaction platform, this equipment comprises:
    Information acquisition device, interaction platform is a plurality of to release news for obtaining;
    Mass-sending pattern deriving means is used for according to described a plurality of releasing news, and obtains and described a plurality of corresponding one or more mass-sending patterns that release news;
    The burst mode deriving means is used for determining burst mode by carrying out matching inquiry in the burst mode storehouse from described one or more mass-sending patterns;
    Determine device, be used for according to described burst mode, from described a plurality of releasing news, determine the outburst information corresponding with described burst mode.
  12. 12. equipment according to claim 11, wherein, described burst mode comprise following at least each:
    -title lining up mode;
    -Subscriber Queue pattern;
    -content lining up mode.
  13. 13. according to claim 11 or 12 described equipment, wherein, described burst mode deriving means also is used for by carrying out matching inquiry in described burst mode storehouse, in conjunction with the auxiliary regular that presets, determines described burst mode.
  14. 14. equipment according to claim 13 wherein, in conjunction with based on following at least each the described auxiliary regular that presets, is determined described burst mode:
    -described a plurality of issuing time that release news;
    -with described a plurality of corresponding information publishers' that release news relevant information.
  15. 15. equipment according to claim 14, in conjunction with based on the described auxiliary regular that presets of described a plurality of corresponding information publishers' that release news relevant information, determine described burst mode, wherein, described information publisher's relevant information comprise following at least each:
    -information publisher's historical behavior record;
    -information publisher's hour of log-on;
    -information publisher's IP address;
    The quantity that-information publisher released news within the unit interval.
  16. 16. equipment according to claim 15, described information publisher's relevant information comprise information publisher's historical behavior record, wherein, this equipment also comprises:
    The record updating device is used for according to described outburst information, upgrades described information publisher's historical behavior record.
  17. 17. each described equipment in 16 according to claim 11, wherein, this equipment also comprises:
    Pretreatment unit is used for described a plurality of releasing news carried out pre-service, obtains the pre-service result;
    Wherein, described mass-sending pattern deriving means also is used for according to described pre-service result, obtains described one or more mass-sending pattern.
  18. 18. each described equipment in 17 according to claim 11, wherein, this equipment also comprises:
    After-treatment device is used for according to described outburst information, and described interaction platform is carried out corresponding aftertreatment.
  19. 19. equipment according to claim 18, wherein, described aftertreatment comprise following at least each:
    The described a plurality of outburst information of-deletion;
    -forbid that the information publisher of described a plurality of outburst information releases news.
  20. 20. each described equipment in 19 according to claim 11, wherein, this equipment also comprises:
    The pattern base updating device is used for according to described outburst information, upgrades described burst mode storehouse.
CN201110262702.3A 2011-09-06 2011-09-06 It is a kind of to be used to detect the method and apparatus that information is broken out in interaction platform Active CN102982041B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110262702.3A CN102982041B (en) 2011-09-06 2011-09-06 It is a kind of to be used to detect the method and apparatus that information is broken out in interaction platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110262702.3A CN102982041B (en) 2011-09-06 2011-09-06 It is a kind of to be used to detect the method and apparatus that information is broken out in interaction platform

Publications (2)

Publication Number Publication Date
CN102982041A true CN102982041A (en) 2013-03-20
CN102982041B CN102982041B (en) 2018-05-08

Family

ID=47856078

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110262702.3A Active CN102982041B (en) 2011-09-06 2011-09-06 It is a kind of to be used to detect the method and apparatus that information is broken out in interaction platform

Country Status (1)

Country Link
CN (1) CN102982041B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104731816A (en) * 2013-12-23 2015-06-24 阿里巴巴集团控股有限公司 Method and device for processing abnormal business data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020133619A1 (en) * 2001-03-07 2002-09-19 Broadcom Corporation Pointer based binary search engine and method for use in network devices
CN101350957A (en) * 2008-07-28 2009-01-21 杨沁沁 Method and equipment for shielding rubbish short message
CN101510879A (en) * 2009-03-26 2009-08-19 腾讯科技(深圳)有限公司 Method and apparatus for filtering rubbish contents
CN101697620A (en) * 2009-10-30 2010-04-21 中兴通讯股份有限公司 Method and system for determining spam messages

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020133619A1 (en) * 2001-03-07 2002-09-19 Broadcom Corporation Pointer based binary search engine and method for use in network devices
CN101350957A (en) * 2008-07-28 2009-01-21 杨沁沁 Method and equipment for shielding rubbish short message
CN101510879A (en) * 2009-03-26 2009-08-19 腾讯科技(深圳)有限公司 Method and apparatus for filtering rubbish contents
CN101697620A (en) * 2009-10-30 2010-04-21 中兴通讯股份有限公司 Method and system for determining spam messages

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104731816A (en) * 2013-12-23 2015-06-24 阿里巴巴集团控股有限公司 Method and device for processing abnormal business data

Also Published As

Publication number Publication date
CN102982041B (en) 2018-05-08

Similar Documents

Publication Publication Date Title
CN102279875B (en) Method and device for identifying fishing website
JP6114403B2 (en) Method and apparatus for providing input candidate item corresponding to input character string
CN105095211B (en) The acquisition methods and device of multi-medium data
CN103336766A (en) Short text garbage identification and modeling method and device
CN103064838A (en) Data searching method and device
CN106294314A (en) Topics Crawling method and device
US9692771B2 (en) System and method for estimating typicality of names and textual data
CN103679012A (en) Clustering method and device of portable execute (PE) files
CN103177204A (en) Password information tip method and device
CN104967587A (en) Method for identifying malicious account numbers, and apparatus thereof
CN112532624B (en) Black chain detection method and device, electronic equipment and readable storage medium
CN110717801A (en) Commodity information pushing method and device
CN102682011B (en) Method, device and system for establishing domain description name information sheet and searching
CN103778122A (en) Searching method and system
CN108182234B (en) Regular expression screening method and device
CN102982048A (en) Method and device for assessing junk information mining rule
CN108683649A (en) A kind of malice domain name detection method based on text feature
CN109672586A (en) A kind of DPI service traffics recognition methods, device and computer readable storage medium
CN106257449A (en) A kind of information determines method and apparatus
CN105653941A (en) Heuristic detection method and system for phishing website
CN102737017B (en) Method and apparatus for extracting page theme
CN111027065B (en) Leucavirus identification method and device, electronic equipment and storage medium
CN102982041A (en) Method used for detecting burst information of interactive platform and device
CN107360197A (en) A kind of phishing analysis method and device based on DNS daily records
CN104331396A (en) Intelligent advertisement identifying method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant