CN107169092A - Method and system for intelligently identifying and handling sensitive content in interaction - Google Patents

Method and system for intelligently identifying and handling sensitive content in interaction

Info

Publication number
CN107169092A
Authority
CN
China
Prior art keywords
sensitive content
automaton
search tree
interaction
sensitive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710334441.9A
Other languages
Chinese (zh)
Inventor
杜洪博
樊磊
王军
方骏达
汪铁丰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Storm Sports (beijing) Co Ltd
Original Assignee
Storm Sports (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Storm Sports (beijing) Co Ltd
Priority to CN201710334441.9A
Publication of CN107169092A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374 Thesaurus
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation or dialogue systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention discloses a method for intelligently identifying and handling sensitive content in interaction, comprising: compiling preset sensitive content into an automaton, building an automaton search tree, and storing the automaton search tree on a remote network request service platform; parsing interaction content to obtain interaction characters, reading the interaction characters character by character into the automaton search tree through a remote service protocol, and traversing the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content; obtaining the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, performing the corresponding processing operation on the client; updating the automaton search tree; storing the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform; and, when the number of times the interaction characters occur during interaction is greater than or equal to a sensitive count, feeding the interaction characters back to a management port. The present invention manages sensitive vocabulary in a unified manner and improves the efficiency of handling it.

Description

Method and system for intelligently identifying and handling sensitive content in interaction
Technical field
The present invention relates to the technical field of network interaction management, and more particularly to a method and system for intelligently identifying and handling sensitive content in interaction.
Background
With the development of network technology and the release of various social networking applications (APPs) and social platforms, social chat has gradually become a common way for people to exchange information. However, the participants in social interaction are varied and chat content is diverse and voluminous. To ensure chat quality, a social system needs to shield uncivil or illegal sensitive vocabulary and advertising (that is, social sensitive content, sometimes called "yellow version" vocabulary).
At present, the prior art identifies whether a chat word is sensitive by comparing it against a preset sensitive vocabulary database; if a sensitive word is found, the ID of the user is blocked. However, if the user replaces a sensitive word in the lexicon with another form, for example writing "8" as "eight", the shielding can be bypassed and its intended purpose is not achieved. Moreover, network vocabulary is updated so quickly that existing systems are not good enough at automatically identifying newly emerging sensitive content and cannot intelligently and promptly add newly emerging sensitive vocabulary to the database.
Furthermore, combining multiple Chinese and English words can yield many different meanings. With so many combined forms, matching and comparing against an existing sensitive lexicon involves a large amount of work and is not very accurate; it also requires a long matching time and executes inefficiently, so it cannot comprehensively and accurately curb the use and spread of existing sensitive vocabulary. In addition, the appeal process after a user is blocked by mistake is very complicated, which gives users a very poor experience.
Therefore, providing a sensitive content handling scheme for interaction that is intelligent, accurate, easy to operate, and efficient is an urgent problem to be solved in this field.
Summary of the invention
In view of this, the present invention provides a method and system for intelligently identifying and handling sensitive content in interaction, solving the technical problem that the prior art cannot intelligently handle sensitive content in interaction in a centralized manner.
To solve the above technical problem, the present invention proposes a method for intelligently identifying and handling sensitive content in interaction, comprising:
Receiving preset sensitive content, and compiling the preset sensitive content into an automaton according to the mechanism of the AC automaton; sorting the automaton by the byte order of a preset keyword encoding, building the automaton into an automaton search tree according to the byte-order sorting, and storing the automaton search tree on a remote network request service platform;
Receiving interaction content of a client, parsing the interaction content to obtain interaction characters, reading the interaction characters character by character into the automaton search tree through a remote service protocol, and traversing the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content;
Searching the historical interaction records of the client to obtain the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, performing the corresponding processing operation on the client;
Updating the automaton search tree according to the sensitive content, based on the current byte-order sorting;
Storing the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform;
Counting the number of times the interaction characters occur during interaction and, when that number is greater than or equal to a sensitive count, feeding the interaction characters back to a management port;
Receiving a sensitive content instruction from the management port, and updating the interaction characters into the automaton search tree according to the sensitive content instruction.
Further, the sensitive content processing strategy is:
When the number of occurrences of the sensitive content is less than or equal to a set number of times, shielding the sensitive content in the interaction interface and sending a warning message to the client;
When the number of occurrences of the sensitive content is greater than the set number of times and less than or equal to a set count threshold, shielding the sensitive content in the interaction interface and blocking the client for a preset time;
When the number of occurrences of the sensitive content is greater than the set count threshold, shielding the sensitive content in the interaction interface and blocking the client.
Further, the set number of times is one to three, and the set count threshold is three to five.
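The three-tier strategy above can be sketched as a small decision function. This is only an illustrative sketch: the names `Action` and `choose_action` are hypothetical, and the defaults 3 and 5 are simply values picked from the ranges stated above, not values fixed by the specification.

```python
from enum import Enum


class Action(Enum):
    # Hypothetical labels for the three tiers of the strategy.
    MASK_AND_WARN = "shield content and warn the client"
    MASK_AND_SUSPEND = "shield content and block the client for a preset time"
    MASK_AND_BLOCK = "shield content and block the client"


def choose_action(occurrences, set_count=3, count_threshold=5):
    """Map an occurrence count to a tier (defaults are assumed values)."""
    if occurrences <= set_count:
        return Action.MASK_AND_WARN
    if occurrences <= count_threshold:
        return Action.MASK_AND_SUSPEND
    return Action.MASK_AND_BLOCK


tiers = [choose_action(n) for n in (1, 4, 6)]
```

In every tier the sensitive content itself is shielded in the interaction interface; only the consequence for the client escalates.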
Further, sorting the automaton by the byte order of the preset keyword encoding and building the automaton into an automaton search tree according to the byte-order sorting comprises:
Obtaining an extended automaton corresponding to the automaton according to a preset automaton expansion strategy;
Sorting the automaton and its extended automaton by the byte order of the preset keyword encoding, and building the automaton into an automaton search tree according to the byte-order sorting.
In another aspect, the present invention also provides a system for intelligently identifying and handling sensitive content in interaction, comprising: an automaton search tree creation module, a sensitive content acquisition module, a sensitive content processing module, an automaton search tree update module, and a sensitive content processing strategy storage module; wherein,
The automaton search tree creation module is connected to the sensitive content acquisition module and is configured to receive preset sensitive content and compile it into an automaton according to the mechanism of the AC automaton; to sort the automaton by the byte order of a preset keyword encoding and build the automaton into an automaton search tree according to the byte-order sorting; and to store the automaton search tree on a remote network request service platform;
The sensitive content acquisition module is connected to the automaton search tree creation module and the sensitive content processing module and is configured to receive interaction content of a client, parse the interaction content to obtain interaction characters, read the interaction characters character by character into the automaton search tree through a remote service protocol, and traverse the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content;
The sensitive content processing module is connected to the sensitive content acquisition module and the automaton search tree update module and is configured to search the historical interaction records of the client, obtain the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, perform the corresponding processing operation on the client;
The automaton search tree update module is connected to the sensitive content processing module and the sensitive content processing strategy storage module and is configured to update the automaton search tree according to the sensitive content, based on the current byte-order sorting;
The sensitive content processing strategy storage module is connected to the automaton search tree update module and is configured to store the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform;
The system further comprises a preset sensitive content update module, connected to the automaton search tree creation module and configured to count the number of times the interaction characters occur during interaction and, when that number is greater than or equal to a sensitive count, to feed the interaction characters back to a management port;
and to receive a sensitive content instruction from the management port and update the interaction characters into the automaton search tree according to the sensitive content instruction.
Further, the sensitive content processing module comprises: a sensitive content occurrence counting unit, a first sensitive content processing unit, a second sensitive content processing unit, and a third sensitive content processing unit; wherein,
The sensitive content occurrence counting unit is connected to the sensitive content acquisition module, the automaton search tree update module, and the first sensitive content processing unit and is configured to search the historical interaction records of the client and obtain the number of times the sensitive content has occurred;
The first sensitive content processing unit is connected to the sensitive content occurrence counting unit and the second sensitive content processing unit and is configured, when the number of occurrences of the sensitive content is less than or equal to a set number of times, to shield the sensitive content in the interaction interface and send a warning message to the client;
The second sensitive content processing unit is connected to the first sensitive content processing unit and the third sensitive content processing unit and is configured, when the number of occurrences of the sensitive content is greater than the set number of times and less than or equal to a set count threshold, to shield the sensitive content in the interaction interface and block the client for a preset time;
The third sensitive content processing unit is connected to the second sensitive content processing unit and is configured, when the number of occurrences of the sensitive content is greater than the set count threshold, to shield the sensitive content in the interaction interface and block the client.
Further, the set number of times is one to three, and the set count threshold is three to five.
Further, the automaton search tree creation module comprises: an automaton acquisition unit and an automaton search tree creation unit; wherein,
The automaton acquisition unit is connected to the sensitive content acquisition module and the automaton search tree creation unit and is configured to compile the preset sensitive content into an automaton according to the mechanism of the AC automaton, and to obtain an extended automaton corresponding to the automaton according to a preset automaton expansion strategy;
The automaton search tree creation unit is connected to the automaton acquisition unit and is configured to sort the automaton and its extended automaton by the byte order of the preset keyword encoding, build the automaton into an automaton search tree according to the byte-order sorting, and store the automaton search tree on the remote network request service platform.
Compared with the prior art, the method and system for intelligently identifying and handling sensitive content in interaction of the present invention achieve the following beneficial effects:
(1) The method and system of the present invention use an AC automaton to build a search tree for searching and handling sensitive vocabulary, and can comprehensively search for and handle various sensitive words and their variants. By setting up an RPC sensitive vocabulary search service system that can be called externally, there is no need to build a sensitive vocabulary processing module for each interaction system, and sensitive vocabulary can be managed in a unified manner, which improves the efficiency of handling it.
(2) The method and system of the present invention use an AC automaton to build a search tree for searching and handling sensitive vocabulary, and provide a tiered sensitive content processing scheme together with an update strategy for the automaton search tree and newly emerging sensitive vocabulary data, which ensures that sensitive content is handled promptly and accurately.
Of course, a product implementing the present invention does not necessarily need to achieve all of the above technical effects at the same time.
Other features and advantages of the present invention will become apparent from the following detailed description of exemplary embodiments of the present invention with reference to the accompanying drawings.
Brief description of the drawings
The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the present invention and, together with their description, serve to explain the principles of the present invention.
Fig. 1 is a schematic flowchart of the method for intelligently identifying and handling sensitive content in interaction described in Embodiment 1 of the present invention;
Fig. 2 is a schematic flowchart of the method for intelligently identifying and handling sensitive content in interaction described in Embodiment 2 of the present invention;
Fig. 3 is a schematic structural diagram of the system for intelligently identifying and handling sensitive content in interaction described in Embodiment 3 of the present invention;
Fig. 4 is a schematic structural diagram of the system for intelligently identifying and handling sensitive content in interaction described in Embodiment 4 of the present invention.
Detailed description
Various exemplary embodiments of the present invention are now described in detail with reference to the accompanying drawings. It should be noted that, unless specifically stated otherwise, the relative arrangement of the components and steps, the numerical expressions, and the numerical values set forth in these embodiments do not limit the scope of the present invention.
The following description of at least one exemplary embodiment is merely illustrative and in no way limits the present invention or its application or use.
Techniques, methods, and apparatus known to a person of ordinary skill in the relevant art may not be discussed in detail but, where appropriate, should be regarded as part of the specification.
In all examples shown and discussed here, any specific value should be interpreted as merely exemplary and not as a limitation. Other examples of the exemplary embodiments may therefore have different values.
It should be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item has been defined in one drawing, it need not be discussed further in subsequent drawings.
Embodiment 1
Fig. 1 is a schematic flowchart of the method for intelligently identifying and handling sensitive content in interaction described in this embodiment. This embodiment provides a method for centralized, intelligent handling of sensitive content in interaction, comprising the following steps:
Step 101: receive preset sensitive content and compile it into an automaton according to the mechanism of the AC automaton; sort the automaton by the byte order of a preset keyword encoding, build the automaton into an automaton search tree according to the byte-order sorting, and store the automaton search tree on a remote network request service platform.
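One possible reading of the ordering part of step 101 is that the keywords are sorted by the byte sequence of their encoding before the search tree is built. The sketch below illustrates only that reading; the function name `sort_by_encoded_bytes`, the choice of UTF-8, and the sample keywords are all assumptions, since the specification does not name an encoding.

```python
def sort_by_encoded_bytes(keywords, encoding="utf-8"):
    """Return the distinct keywords ordered by the bytes of their encoding."""
    return sorted(set(keywords), key=lambda word: word.encode(encoding))


# Hypothetical sample keywords; non-ASCII words sort after ASCII ones in UTF-8.
ordered = sort_by_encoded_bytes(["spam", "ad", "广告", "scam", "ad"])
```

A stable, deterministic order of this kind means that the same keyword set always produces the same tree layout, which is convenient when the tree is stored on a shared platform and later updated.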
The AC automaton (Aho-Corasick) is a string matching algorithm based on automaton theory. As shown in Fig. 1, its basic working principle is as follows. Feature strings (such as a virus signature library or filter keywords) are first compiled into an automaton. Starting from state 0, the content to be matched is read in character by character. Each time a character is read, the algorithm checks whether the current state has a transition arrow for that character; if so, it jumps to the next state that the transition points to; if not, it jumps back to state 0. Some states are marked as matching states, and entering such a state means that a match has succeeded. Once the interaction content of a user is arranged in order in the character form of the keyword encoding, it reflects the word combinations of that interaction content.
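A minimal sketch of an Aho-Corasick matcher follows, to make the state-transition description concrete. It is not taken from the specification: the class name `AhoCorasick` and its attributes are hypothetical, and it uses the textbook failure links (falling back to the longest matching suffix state) rather than always returning to state 0, so that overlapping keywords are not missed.

```python
from collections import deque


class AhoCorasick:
    def __init__(self, keywords):
        self.goto = [{}]    # per-state transitions: character -> next state
        self.fail = [0]     # per-state failure link
        self.output = [[]]  # keywords recognised on entering each state
        for word in keywords:
            self._insert(word)
        self._build_failure_links()

    def _insert(self, word):
        if not word:
            return
        state = 0
        for ch in word:
            if ch not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.output.append([])
                self.goto[state][ch] = len(self.goto) - 1
            state = self.goto[state][ch]
        if word not in self.output[state]:
            self.output[state].append(word)  # mark a matching state

    def _build_failure_links(self):
        queue = deque(self.goto[0].values())  # depth-1 states fail to state 0
        while queue:
            state = queue.popleft()
            for ch, nxt in self.goto[state].items():
                queue.append(nxt)
                fallback = self.fail[state]
                while fallback and ch not in self.goto[fallback]:
                    fallback = self.fail[fallback]
                self.fail[nxt] = self.goto[fallback].get(ch, 0)
                # Inherit keywords that end at the failure state.
                self.output[nxt] = self.output[nxt] + self.output[self.fail[nxt]]

    def search(self, text):
        """Return (start_index, keyword) for every occurrence in text."""
        state, hits = 0, []
        for index, ch in enumerate(text):
            while state and ch not in self.goto[state]:
                state = self.fail[state]
            state = self.goto[state].get(ch, 0)
            for word in self.output[state]:
                hits.append((index - len(word) + 1, word))
        return hits


matcher = AhoCorasick(["he", "she", "his", "hers"])
hits = matcher.search("ushers")
```

The text is scanned once, character by character, regardless of how many keywords are loaded, which is the property that makes this structure attractive for a large sensitive word library.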
The sensitive word library is stored and managed on the remote network request service platform. Connecting each interaction service platform through the remote network request service platform helps ensure that sensitive vocabulary standards and network information are consistent.
Step 102: receive interaction content of a client, parse the interaction content to obtain interaction characters, read the interaction characters character by character into the automaton search tree through a remote service protocol, and traverse the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content.
If no sensitive content is obtained after traversing the automaton search tree, the interaction is not interfered with; the interaction content of the client is only recorded for later use.
Step 103: search the historical interaction records of the client to obtain the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, perform the corresponding processing operation on the client.
Optionally, the interaction records of the client within a recent period are searched; this period may be from 1 month to 1 year. Because network language is updated very quickly, leaving the period of the interaction records unrestricted may cause large errors in sensitive content processing.
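Restricting the count to a recent period can be sketched as a simple time-window filter. The record layout (a list of `(timestamp, word)` tuples per client), the function name `count_recent_hits`, and the 30-day default are assumptions made for illustration only.

```python
from datetime import datetime, timedelta


def count_recent_hits(history, now, window=timedelta(days=30)):
    """Count sensitive hits whose timestamp falls inside [now - window, now].

    history: iterable of (timestamp, sensitive_word) tuples for one client
    (a hypothetical record layout).
    """
    cutoff = now - window
    return sum(1 for stamp, _word in history if cutoff <= stamp <= now)


now = datetime(2017, 5, 12)
history = [
    (datetime(2017, 5, 1), "x"),
    (datetime(2017, 4, 20), "x"),
    (datetime(2016, 1, 1), "x"),
]
recent = count_recent_hits(history, now)
```

The resulting count is what the processing strategy of step 103 would be applied to; older records simply fall outside the window.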
Step 104: update the automaton search tree according to the sensitive content, based on the current byte-order sorting.
The sensitive content obtained in this search may become a base automaton string for later sensitive content searches; updating the automaton search tree with the sensitive content found during the interaction itself improves it further.
Step 105: store the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform.
Storing the processing strategy corresponding to the sensitive content provides a basis for handling the same or similar sensitive vocabulary on other interaction platforms; the sensitive content processing strategies of the individual interaction platforms can even be combined intelligently to obtain a processing standard for that sensitive content.
Step 106: count the number of times the interaction characters occur during interaction and, when that number is greater than or equal to a sensitive count, feed the interaction characters back to a management port; receive a sensitive content instruction from the management port, and update the interaction characters into the automaton search tree according to the sensitive content instruction.
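The counting half of step 106 can be sketched with a frequency counter that flags a token once its count reaches the sensitive count. The class name `CandidateTracker` is hypothetical, and reporting each token only once is an assumption added here to avoid repeated feedback; the specification only states the threshold condition.

```python
from collections import Counter


class CandidateTracker:
    def __init__(self, sensitive_count):
        self.sensitive_count = sensitive_count
        self.counts = Counter()
        self.reported = set()  # tokens already sent to the management port

    def observe(self, token):
        """Count one occurrence; return True when the token should be reported."""
        self.counts[token] += 1
        due = self.counts[token] >= self.sensitive_count
        if due and token not in self.reported:
            self.reported.add(token)
            return True
        return False


tracker = CandidateTracker(sensitive_count=3)
flags = [tracker.observe("newword") for _ in range(4)]
```

A token flagged this way is only a candidate: under the step as described, it enters the automaton search tree after the management port returns a sensitive content instruction.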
Embodiment 2
Fig. 2 is a schematic flowchart of the method for intelligently identifying and handling sensitive content in interaction described in this embodiment. This embodiment is a preferred embodiment provided on the basis of Embodiment 1 above, and the method comprises the following steps:
Step 201: receive preset sensitive content and compile it into an automaton according to the mechanism of the AC automaton; obtain an extended automaton corresponding to the automaton according to a preset automaton expansion strategy.
Optionally, the automaton expansion strategy may cover: the Chinese form of a word, its pinyin, its pinyin initials, its English form, its English initials, substitute words, and so on.
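The substitute-word part of such an expansion strategy can be sketched by generating every variant of a keyword from a substitution table. The table `SUBSTITUTES` below is a hypothetical stand-in (pinyin or initial-letter expansion would need a real mapping that is not given in the specification), and the function name `expand_keyword` is likewise an assumption.

```python
from itertools import product

# Hypothetical substitution table: each character maps to its accepted forms.
SUBSTITUTES = {"8": ["8", "eight"], "4": ["4", "four"]}


def expand_keyword(keyword, substitutes=SUBSTITUTES):
    """Return all variants of keyword obtained by swapping in substitutes."""
    options = [substitutes.get(ch, [ch]) for ch in keyword]
    return sorted({"".join(parts) for parts in product(*options)})


variants = expand_keyword("a8")
```

Each variant would then be compiled alongside the original keyword, so that a user who rewrites one character still reaches a matching state. The number of variants grows multiplicatively with the number of substitutable characters, so a real table would need to be kept small.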
Step 202: sort the automaton and its extended automaton by the byte order of a preset keyword encoding, build the automaton into an automaton search tree according to the byte-order sorting, and store the automaton search tree on a remote network request service platform.
Step 203: receive interaction content of a client, parse the interaction content to obtain interaction characters, read the interaction characters character by character into the automaton search tree through a remote service protocol, and traverse the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content.
Step 204: search the historical interaction records of the client to obtain the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, perform the corresponding processing operation on the client.
Step 205: when the number of occurrences of the sensitive content is less than or equal to a set number of times, shield the sensitive content in the interaction interface and send a warning message to the client; when the number of occurrences is greater than the set number of times and less than or equal to a set count threshold, shield the sensitive content in the interaction interface and block the client for a preset time; when the number of occurrences is greater than the set count threshold, shield the sensitive content in the interaction interface and block the client.
Optionally, the set number of times is one to three, and the set count threshold is three to five.
Step 206: update the automaton search tree according to the sensitive content, based on the current byte-order sorting.
Step 207: store the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform.
Step 208: count the number of times the interaction characters occur during interaction and, when that number is greater than or equal to a sensitive count, feed the interaction characters back to a management port; receive a sensitive content instruction from the management port, and update the interaction characters into the automaton search tree according to the sensitive content instruction.
Embodiment 3
Fig. 3 is a schematic structural diagram of the system for intelligently identifying and handling sensitive content in interaction described in this embodiment. The system described in this embodiment is used to implement the method for intelligently identifying and handling sensitive content in interaction described in the above embodiments, and comprises: an automaton search tree creation module 301, a sensitive content acquisition module 302, a sensitive content processing module 303, an automaton search tree update module 304, and a sensitive content processing strategy storage module 305.
The automaton search tree creation module 301 is connected to the sensitive content acquisition module 302 and is configured to receive preset sensitive content and compile it into an automaton according to the mechanism of the AC automaton; to sort the automaton by the byte order of a preset keyword encoding and build the automaton into an automaton search tree according to the byte-order sorting; and to store the automaton search tree on a remote network request service platform.
The sensitive content acquisition module 302 is connected to the automaton search tree creation module 301 and the sensitive content processing module 303 and is configured to receive interaction content of a client, parse the interaction content to obtain interaction characters, read the interaction characters character by character into the automaton search tree through a remote service protocol, and traverse the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content.
The sensitive content processing module 303 is connected to the sensitive content acquisition module 302 and the automaton search tree update module 304 and is configured to search the historical interaction records of the client, obtain the number of times the sensitive content has occurred and, in combination with a preset sensitive content processing strategy, perform the corresponding processing operation on the client.
The automaton search tree update module 304 is connected to the sensitive content processing module 303 and the sensitive content processing strategy storage module 305 and is configured to update the automaton search tree according to the sensitive content, based on the current byte-order sorting.
The sensitive content processing strategy storage module 305 is connected to the automaton search tree update module 304 and is configured to store the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform.
The system further comprises a preset sensitive content update module 306, connected to the automaton search tree creation module 301 and configured to count the number of times the interaction characters occur during interaction and, when that number is greater than or equal to a sensitive count, to feed the interaction characters back to a management port; and to receive a sensitive content instruction from the management port and update the interaction characters into the automaton search tree according to the sensitive content instruction.
Embodiment 4
Figure 4 is a schematic structural diagram of the system for intelligently recognizing and handling sensitive content during interaction described in this embodiment. This embodiment is a preferred embodiment provided on the basis of Embodiment 3 above. The system comprises: an automaton search tree creation module 401, a sensitive content acquisition module 402, a sensitive content processing module 403, an automaton search tree update module 404, and a sensitive content processing strategy storage module 405.
The automaton search tree creation module 401 further comprises an automaton acquisition unit 411 and an automaton search tree creation unit 412.
The automaton acquisition unit 411 is connected to the sensitive content acquisition module 402 and the automaton search tree creation unit 412. It compiles the preset sensitive content into automata according to the mechanism of the AC (Aho-Corasick) automaton, and obtains the extended automata corresponding to those automata according to a preset automaton expansion strategy.
The automaton search tree creation unit 412 is connected to the automaton acquisition unit 411. It arranges the automata and their extended automata in the character order of the preset keyword encoding, builds them into an automaton search tree according to that character order, and stores the automaton search tree on the remote network request service platform.
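The expansion and ordering steps described for units 411 and 412 can be illustrated with a minimal Python sketch. The text does not specify the expansion strategy, so the separator-insertion rule used here, along with the names `SEPARATORS`, `expand_keyword`, and `build_ordered_keywords`, are assumptions made purely for illustration.

```python
# Hypothetical expansion strategy: the source does not define one.
# Assumed idea: users may insert filler characters between the letters of a keyword.
SEPARATORS = ("", " ", "*")


def expand_keyword(word):
    """Return the keyword and its variants with a separator between every character."""
    variants = set()
    for sep in SEPARATORS:
        variants.add(sep.join(word))
    return variants


def build_ordered_keywords(preset_words):
    """Expand every preset keyword and arrange the result in character (code point) order."""
    expanded = set()
    for word in preset_words:
        expanded |= expand_keyword(word.lower())
    return sorted(expanded)
```

The ordered list returned by `build_ordered_keywords` would then be the input from which the search tree is built; a real deployment would likely use a richer, language-specific expansion rule.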
The sensitive content acquisition module 402 is connected to the automaton search tree creation module 401 and the sensitive content processing module 403. It receives the interaction content from the client, parses the interaction content to obtain interaction characters, reads the interaction characters one by one into the automaton search tree through the remote service protocol, and traverses the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content.
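The character-by-character traversal described above corresponds to the standard Aho-Corasick matching procedure. The following is a minimal sketch of that general algorithm, not the applicant's implementation; the class name `ACAutomaton` and its methods are hypothetical, and the remote service layer is omitted.

```python
from collections import deque


class ACAutomaton:
    """Minimal Aho-Corasick automaton: a keyword trie plus failure links."""

    def __init__(self, words):
        self.goto = [{}]   # goto[state][char] -> next state
        self.fail = [0]    # failure link of each state
        self.out = [[]]    # keywords recognized at each state
        for word in sorted(words):  # insert keywords in character order
            self._insert(word)
        self._build_failure_links()

    def _insert(self, word):
        state = 0
        for ch in word:
            if ch not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.out.append([])
                self.goto[state][ch] = len(self.goto) - 1
            state = self.goto[state][ch]
        self.out[state].append(word)

    def _build_failure_links(self):
        # Breadth-first traversal; depth-1 states keep the root as failure link.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for ch, nxt in self.goto[state].items():
                queue.append(nxt)
                f = self.fail[state]
                while f and ch not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(ch, 0)
                # Inherit keywords that end at the failure state (proper suffixes).
                self.out[nxt] = self.out[nxt] + self.out[self.fail[nxt]]

    def search(self, text):
        """Read text one character at a time; return (start_index, keyword) hits."""
        hits, state = [], 0
        for i, ch in enumerate(text):
            while state and ch not in self.goto[state]:
                state = self.fail[state]
            state = self.goto[state].get(ch, 0)
            for word in self.out[state]:
                hits.append((i - len(word) + 1, word))
        return hits
```

Because every input character is consumed once and failure links avoid rescanning, the scan time grows with the length of the message and the number of hits rather than with the number of keywords, which is the usual reason for choosing this structure for large word lists.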
The sensitive content processing module 403 is connected to the sensitive content acquisition module 402 and the automaton search tree update module 404. It searches the client's historical interaction records, obtains the number of times the sensitive content has appeared and, in combination with the preset sensitive content processing strategy, performs the corresponding processing operation on the client.
The sensitive content processing module 403 further comprises: a sensitive content occurrence counting unit 431, a first sensitive content processing unit 432, a second sensitive content processing unit 433, and a third sensitive content processing unit 434.
The sensitive content occurrence counting unit 431 is connected to the sensitive content acquisition module 402, the automaton search tree update module 404, and the first sensitive content processing unit 432. It searches the client's historical interaction records to obtain the number of times the sensitive content has appeared.
The first sensitive content processing unit 432 is connected to the sensitive content occurrence counting unit 431 and the second sensitive content processing unit 433. When the number of occurrences of the sensitive content is less than or equal to the set count, it masks the sensitive content in the interactive interface and sends a warning message to the client.
The second sensitive content processing unit 433 is connected to the first sensitive content processing unit 432 and the third sensitive content processing unit 434. When the number of occurrences of the sensitive content is greater than the set count and less than or equal to the set count threshold, it masks the sensitive content in the interactive interface and blocks the client for a preset time.
The third sensitive content processing unit 434 is connected to the second sensitive content processing unit 433. When the number of occurrences of the sensitive content is greater than the set count threshold, it masks the sensitive content in the interactive interface and blocks the client.
Optionally, the set count is one to three, and the set count threshold is three to five.
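The three-tier strategy handled by units 432 to 434 reduces to two comparisons on the occurrence count. A minimal sketch follows; the function name, the action labels, and the default values 3 and 5 (chosen from within the stated ranges) are illustrative assumptions.

```python
def choose_action(occurrences, set_count=3, count_threshold=5):
    """Map a client's sensitive-content occurrence count to a processing tier.

    set_count and count_threshold are assumed defaults picked from the
    ranges given in the text (one to three, and three to five).
    """
    if occurrences <= set_count:
        # Tier 1: mask the content and warn the client.
        return "mask_and_warn"
    if occurrences <= count_threshold:
        # Tier 2: mask the content and block the client for a preset time.
        return "mask_and_block_temporarily"
    # Tier 3: mask the content and block the client.
    return "mask_and_block"
```

In every tier the sensitive content itself is masked in the interactive interface; only the action taken against the client escalates.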
The automaton search tree update module 404 is connected to the sensitive content processing module 403 and the sensitive content processing strategy storage module 405. It updates the automaton search tree according to the sensitive content, based on the current character order.
The sensitive content processing strategy storage module 405 is connected to the automaton search tree update module 404. It stores the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform.
The system further comprises a preset sensitive content update module 406, which is connected to the automaton search tree creation module 401. It counts the number of times each interaction character appears during the interaction process and, when that number is greater than or equal to the sensitive count threshold, feeds the interaction character back to the management port. It then receives the sensitive content instruction from the management port and, according to that instruction, updates the interaction character into the automaton search tree.
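The feedback loop of the preset sensitive content update module can be sketched as a frequency count followed by an administrator decision. In this sketch, splitting messages on whitespace, the default threshold of 3, and the names `find_candidates` and `apply_instruction` are all assumptions; the text does not say how interaction characters are tokenized or how the management port replies.

```python
from collections import Counter


def find_candidates(messages, known_words, sensitive_count=3):
    """Return tokens seen at least sensitive_count times that are not yet listed."""
    counts = Counter()
    for message in messages:
        counts.update(message.split())  # assumed tokenization: whitespace
    return sorted(
        token
        for token, n in counts.items()
        if n >= sensitive_count and token not in known_words
    )


def apply_instruction(known_words, token, is_sensitive):
    """Apply the management port's decision; return the updated keyword set."""
    updated = set(known_words)
    if is_sensitive:
        updated.add(token)  # the search tree would then be rebuilt from this set
    return updated
```

Candidates returned by `find_candidates` would be shown to an administrator, and only tokens confirmed as sensitive are added before the search tree is rebuilt.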
Through the above embodiments, the method and system of the present invention for intelligently recognizing and handling sensitive content during interaction achieve the following beneficial effects:
(1) The method and system of the present invention use an AC automaton to build a search tree for searching and handling objectionable vocabulary, so that various objectionable words and their variants can be searched and handled comprehensively. An objectionable-vocabulary search service system is set up that external callers can invoke through RPC, so there is no need to build a separate objectionable-vocabulary processing module for each interactive system. The objectionable vocabulary can also be managed in a unified way, which improves the efficiency of handling it.
(2) The method and system of the present invention use an AC automaton to build a search tree for searching and handling objectionable vocabulary, and set up a tiered sensitive content processing scheme together with a strategy for updating the automaton search tree with newly emerging objectionable vocabulary data, which ensures the timeliness and accuracy of sensitive content handling.
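Effect (1) relies on exposing the word search as a remotely callable service. The text names RPC but not a specific protocol, so the sketch below uses XML-RPC from the Python standard library purely as a stand-in. The word list, host, port, and function names are hypothetical, and a plain substring scan replaces the automaton to keep the example short.

```python
from xmlrpc.server import SimpleXMLRPCServer

# Placeholder word list; a real service would load the stored search tree.
SENSITIVE_WORDS = ["badword", "spam"]


def find_sensitive(text):
    """Return [position, word] pairs for every listed word found in text."""
    hits = []
    lowered = text.lower()
    for word in SENSITIVE_WORDS:
        start = lowered.find(word)
        while start != -1:
            hits.append([start, word])
            start = lowered.find(word, start + 1)
    return sorted(hits)


def make_server(host="127.0.0.1", port=8000):
    """Create an XML-RPC server exposing find_sensitive (assumed host and port)."""
    server = SimpleXMLRPCServer((host, port), logRequests=False)
    server.register_function(find_sensitive, "find_sensitive")
    return server
```

Calling `make_server().serve_forever()` would start the service, and any interactive system could then query it with `xmlrpc.client.ServerProxy("http://127.0.0.1:8000").find_sensitive(message)` instead of embedding its own filtering module.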
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, an apparatus, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
Although some specific embodiments of the present invention have been described in detail by way of example, those skilled in the art should understand that the above examples are for illustration only and are not intended to limit the scope of the invention. Those skilled in the art should also understand that the above embodiments can be modified without departing from the scope and spirit of the invention. The scope of the invention is defined by the appended claims.

Claims (8)

1. A method for intelligently recognizing and handling sensitive content during interaction, characterized by comprising:
receiving preset sensitive content, and compiling the preset sensitive content into automata according to the mechanism of the AC automaton; arranging the automata in the character order of the preset keyword encoding, building the automata into an automaton search tree according to the character order, and storing the automaton search tree on a remote network request service platform;
receiving interaction content from a client, parsing the interaction content to obtain interaction characters, reading the interaction characters one by one into the automaton search tree through a remote service protocol, and traversing the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content;
searching the historical interaction records of the client, obtaining the number of times the sensitive content has appeared, and, in combination with a preset sensitive content processing strategy, performing the corresponding processing operation on the client;
updating the automaton search tree according to the sensitive content, based on the current character order;
storing the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform;
counting the number of times the interaction characters appear during the interaction process, and, when the number of times the interaction characters appear during the interaction process is greater than or equal to a sensitive count threshold, feeding the interaction characters back to a management port;
receiving a sensitive content instruction from the management port, and updating the interaction characters into the automaton search tree according to the sensitive content instruction.
2. The method for intelligently recognizing and handling sensitive content during interaction according to claim 1, characterized in that the sensitive content processing strategy further comprises:
when the number of occurrences of the sensitive content is less than or equal to a set count, masking the sensitive content in the interactive interface and sending a warning message to the client;
when the number of occurrences of the sensitive content is greater than the set count and less than or equal to a set count threshold, masking the sensitive content in the interactive interface and blocking the client for a preset time;
when the number of occurrences of the sensitive content is greater than the set count threshold, masking the sensitive content in the interactive interface and blocking the client.
3. The method for intelligently recognizing and handling sensitive content during interaction according to claim 2, characterized in that the set count is one to three, and the set count threshold is three to five.
4. The method for intelligently recognizing and handling sensitive content during interaction according to claim 1, characterized in that arranging the automata in the character order of the preset keyword encoding and building the automata into an automaton search tree according to the character order further comprises:
obtaining the extended automata corresponding to the automata according to a preset automaton expansion strategy;
arranging the automata and their extended automata in the character order of the preset keyword encoding, and building the automata into an automaton search tree according to the character order.
5. A system for intelligently recognizing and handling sensitive content during interaction, characterized by comprising: an automaton search tree creation module, a sensitive content acquisition module, a sensitive content processing module, an automaton search tree update module, and a sensitive content processing strategy storage module; wherein,
the automaton search tree creation module is connected to the sensitive content acquisition module, and is configured to receive preset sensitive content and compile the preset sensitive content into automata according to the mechanism of the AC automaton; to arrange the automata in the character order of the preset keyword encoding and build the automata into an automaton search tree according to the character order; and to store the automaton search tree on a remote network request service platform;
the sensitive content acquisition module is connected to the automaton search tree creation module and the sensitive content processing module, and is configured to receive interaction content from a client, parse the interaction content to obtain interaction characters, read the interaction characters one by one into the automaton search tree through a remote service protocol, and traverse the automaton search tree with the interaction characters to obtain the sensitive content in the interaction content;
the sensitive content processing module is connected to the sensitive content acquisition module and the automaton search tree update module, and is configured to search the historical interaction records of the client, obtain the number of times the sensitive content has appeared, and, in combination with a preset sensitive content processing strategy, perform the corresponding processing operation on the client;
the automaton search tree update module is connected to the sensitive content processing module and the sensitive content processing strategy storage module, and is configured to update the automaton search tree according to the sensitive content, based on the current character order;
the sensitive content processing strategy storage module is connected to the automaton search tree update module, and is configured to store the updated automaton search tree and the processing strategy corresponding to the sensitive content on the remote network request service platform;
the system further comprising a preset sensitive content update module, which is connected to the automaton search tree creation module and is configured to count the number of times the interaction characters appear during the interaction process and, when the number of times the interaction characters appear during the interaction process is greater than or equal to a sensitive count threshold, feed the interaction characters back to a management port;
and to receive a sensitive content instruction from the management port and update the interaction characters into the automaton search tree according to the sensitive content instruction.
6. The system for intelligently recognizing and handling sensitive content during interaction according to claim 5, characterized in that the sensitive content processing module further comprises: a sensitive content occurrence counting unit, a first sensitive content processing unit, a second sensitive content processing unit, and a third sensitive content processing unit; wherein,
the sensitive content occurrence counting unit is connected to the sensitive content acquisition module, the automaton search tree update module, and the first sensitive content processing unit, and is configured to search the historical interaction records of the client and obtain the number of times the sensitive content has appeared;
the first sensitive content processing unit is connected to the sensitive content occurrence counting unit and the second sensitive content processing unit, and is configured, when the number of occurrences of the sensitive content is less than or equal to a set count, to mask the sensitive content in the interactive interface and send a warning message to the client;
the second sensitive content processing unit is connected to the first sensitive content processing unit and the third sensitive content processing unit, and is configured, when the number of occurrences of the sensitive content is greater than the set count and less than or equal to a set count threshold, to mask the sensitive content in the interactive interface and block the client for a preset time;
the third sensitive content processing unit is connected to the second sensitive content processing unit, and is configured, when the number of occurrences of the sensitive content is greater than the set count threshold, to mask the sensitive content in the interactive interface and block the client.
7. The system for intelligently recognizing and handling sensitive content during interaction according to claim 6, characterized in that the set count is one to three, and the set count threshold is three to five.
8. The system for intelligently recognizing and handling sensitive content during interaction according to claim 5, characterized in that the automaton search tree creation module further comprises: an automaton acquisition unit and an automaton search tree creation unit; wherein,
the automaton acquisition unit is connected to the sensitive content acquisition module and the automaton search tree creation unit, and is configured to compile the preset sensitive content into automata according to the mechanism of the AC automaton and to obtain the extended automata corresponding to the automata according to a preset automaton expansion strategy;
the automaton search tree creation unit is connected to the automaton acquisition unit, and is configured to arrange the automata and their extended automata in the character order of the preset keyword encoding, build the automata into an automaton search tree according to the character order, and store the automaton search tree on the remote network request service platform.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710334441.9A CN107169092A (en) 2017-05-12 2017-05-12 Intelligent Recognition and the method and system of sensitive content are handled in interaction

Publications (1)

Publication Number Publication Date
CN107169092A true CN107169092A (en) 2017-09-15

Family

ID=59815958

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710334441.9A Pending CN107169092A (en) 2017-05-12 2017-05-12 Intelligent Recognition and the method and system of sensitive content are handled in interaction

Country Status (1)

Country Link
CN (1) CN107169092A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170915