CN102693236A

CN102693236A - Bad information filtering method based on content understanding

Info

Publication number: CN102693236A
Application number: CN2011100712318A
Authority: CN
Inventors: 宦奕奕
Original assignee: SUZHOU STYLE INFORMATION TECHNOLOGY CO LTD
Current assignee: SUZHOU STYLE INFORMATION TECHNOLOGY CO LTD
Priority date: 2011-03-24
Filing date: 2011-03-24
Publication date: 2012-09-26

Abstract

The invention relates to a method for filtering bad information based on content understanding. The method comprises the following steps: first, preprocessing the content of a network information source and extracting the explicit and implicit features that reflect the content or help to distinguish it, so that bad information content is effectively expressed as feature items; matching bad information templates against the content to be processed according to matching rules and methods; filtering the information source according to the matching result; and finally returning the processed result to the user of the Web page. The method can thus filter bad information from network content accurately and effectively, based on the context of the text content and the various features of the image content, providing the user with a clean network environment. Its application prospects are broad.

Description

Bad information filtering method based on content understanding

Technical field

The present invention relates to an information filtering method, and in particular to a bad information filtering method based on content understanding.

Background technology

With the development of Internet technology, the volume of online content of widely varying quality has expanded sharply in recent years, and network information security problems have become increasingly prominent, seriously harming the social climate. As a result, the demand from both society and individuals for information filtering grows stronger by the day. However, the bad information filtering software and systems currently in use suffer from missed detections and false positives, and they filter slowly. The content-analysis-based method proposed by the present invention not only filters bad information accurately and effectively, providing the user with a clean network environment, but also filters quickly, and its application prospects are broad.

Summary of the invention

The object of the present invention is to solve the above problems in the prior art by providing a bad information filtering method based on content understanding.

The object of the present invention is achieved through the following technical solution:

A bad information filtering method based on content understanding, comprising the following steps:

Step 1: preprocess the content of the network information source and extract from it the explicit and implicit features that reflect the content or help to distinguish it, so that bad information content is effectively expressed as feature items;

Step 2: match the bad information templates against the content to be processed according to the matching rules and methods;

Step 3: filter the information source according to the matching result;

Step 4: return the processed result to the user of the Web page.
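The four steps above can be sketched in code as follows. This is a minimal illustration rather than the patented implementation: the patent does not specify the feature representation, the template format, or the matching rule, so the `Template` class, the weighted term-count score, the `threshold` value, and the `[blocked: ...]` output are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    """Hypothetical bad information template: a label plus weighted feature terms."""
    label: str
    weights: dict  # feature term -> weight (assumed format, not from the patent)


def preprocess(raw: str) -> dict:
    """Step 1: reduce the content to feature items (here: lowercase term counts)."""
    features = {}
    for token in raw.lower().split():
        token = token.strip(".,!?;:\"'()")
        if token:
            features[token] = features.get(token, 0) + 1
    return features


def match(features: dict, templates: list, threshold: float = 2.0):
    """Step 2: score the features against each template; return the best label or None."""
    best_label, best_score = None, 0.0
    for template in templates:
        score = sum(w * features.get(term, 0) for term, w in template.weights.items())
        if score > best_score:
            best_label, best_score = template.label, score
    return best_label if best_score >= threshold else None


def filter_source(raw: str, templates: list) -> str:
    """Steps 3 and 4: block matched content, otherwise return it unchanged."""
    label = match(preprocess(raw), templates)
    return f"[blocked: {label}]" if label else raw


# Illustrative template only; real templates would be built from labelled samples.
TEMPLATES = [Template("spam", {"free": 1.0, "winner": 1.5, "prize": 1.0})]
```

Calling `filter_source("You are a WINNER! Claim your free prize.", TEMPLATES)` returns a blocked marker under these assumptions, while unrelated text is returned unchanged.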

In the above bad information filtering method based on content understanding, the network information source comprises text content and image content.

Further, in the above bad information filtering method based on content understanding, the text content is filtered according to its context and text elements: bad information is found by analyzing and understanding the semantics of the text.
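One simple way to illustrate why context matters is a windowed check: a suspicious term is flagged only when no benign context word appears near it. The sketch below is an assumption-laden stand-in for the semantic analysis the patent describes; the two lexicons, the window size, and the function name `flag_in_context` are hypothetical.

```python
import re

# Hypothetical lexicons, for illustration only.
SUSPICIOUS = {"kill", "shoot"}
BENIGN_CONTEXT = {"process", "thread", "photo", "film", "basketball"}


def flag_in_context(text: str, window: int = 3) -> list:
    """Return suspicious terms that have no benign context word within `window` tokens."""
    tokens = re.findall(r"[a-z']+", text.lower())
    flagged = []
    for i, token in enumerate(tokens):
        if token not in SUSPICIOUS:
            continue
        # Look at up to `window` tokens on each side of the suspicious term.
        neighbours = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
        if not BENIGN_CONTEXT.intersection(neighbours):
            flagged.append(token)
    return flagged
```

A plain keyword filter would flag both a system administration sentence and a threat; the windowed check lets the first one through because "process" appears next to the suspicious term.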

Further, in the above bad information filtering method based on content understanding, the image content is filtered by using as indices the color, texture, shape and contour of the image, the spatial relationships among these features, and the image semantics, and by matching on the degree of similarity between images.
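Similarity matching on the color feature alone can be sketched with a quantized RGB histogram and histogram intersection. This is only an illustration under stated assumptions: the patent does not name a similarity measure, and texture, shape, contour, spatial relationships and semantics are not covered here. Images are assumed to be lists of `(r, g, b)` tuples, and the bin count and `threshold` are arbitrary.

```python
def color_histogram(pixels, bins_per_channel=4):
    """Quantize RGB pixels (0-255 per channel) into a normalized histogram."""
    step = 256 // bins_per_channel
    hist = [0.0] * (bins_per_channel ** 3)
    for r, g, b in pixels:
        idx = ((r // step) * bins_per_channel + (g // step)) * bins_per_channel + (b // step)
        hist[idx] += 1.0
    total = len(pixels)
    return [h / total for h in hist] if total else hist


def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1.0 means identical color distributions."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def matches_template(pixels, template_pixels, threshold=0.8):
    """Treat an image as matching when its color distribution is close to the template's."""
    similarity = histogram_intersection(
        color_histogram(pixels), color_histogram(template_pixels)
    )
    return similarity >= threshold
```

A production system would combine several feature indices rather than rely on color alone, since very different images can share a color distribution.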

Further, in the above bad information filtering method based on content understanding, the bad information in step 2 comprises obscene and pornographic content, reactionary and violent content, and spam.

Still further, in the above bad information filtering method based on content understanding, the preprocessing removes irrelevant information from the network information source, retains the useful information, and separates and quantizes its descriptive features; the explicit and implicit information that reflects the content or helps to distinguish its characteristics is then extracted, so that bad information can be effectively expressed as feature items.
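The preprocessing described above can be sketched for a Web page as follows, assuming that "irrelevant information" means markup, scripts, styles and stopwords, and that "quantizing" means counting terms. Both readings are assumptions, as are the `STOPWORDS` list and the function name `extract_features`; only explicit (surface) features are produced, and implicit features would need further analysis.

```python
from collections import Counter
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style> elements."""

    def __init__(self):
        super().__init__()
        self._skip = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


STOPWORDS = {"the", "a", "an", "and", "of", "to"}  # hypothetical, deliberately tiny


def extract_features(html: str) -> Counter:
    """Remove markup and stopwords, then quantize what remains as term counts."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    words = " ".join(parser.parts).lower().split()
    cleaned = (w.strip(".,!?;:\"'()") for w in words)
    return Counter(w for w in cleaned if w and w not in STOPWORDS)
```

The resulting counts are one possible form of the "feature items" that a later matching stage could compare against templates.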

The main advantage of the technical solution of the present invention is that it can filter bad information out of network content accurately and effectively, based on the context of the text content and the various features of the image content, providing the user with a clean network environment. Its application prospects are broad.

The objects, advantages and features of the present invention are explained through the following non-limiting description of a preferred embodiment. The embodiment is merely a typical example of applying the technical solution of the present invention; all technical solutions formed by equivalent replacement or equivalent transformation fall within the scope of protection claimed by the present invention.

Embodiment

The bad information filtering method based on content understanding is characterized in that it comprises the following steps. First, the content of the network information source is preprocessed, and the explicit and implicit features that reflect the content or help to distinguish it are extracted, so that bad information content is effectively expressed as feature items. Specifically, the network information source comprises text content and image content.

Next, the bad information templates are matched against the content to be processed according to the matching rules and methods. Specifically, the bad information comprises obscene and pornographic content, reactionary and violent content, and spam.

Then, the information source is filtered according to the matching result. Finally, the processed result is returned to the user of the Web page.

In an actual implementation of the present invention, the text content is filtered according to its context and text elements: bad information is found by analyzing and understanding the semantics of the text. The image content, in turn, is filtered by using as indices the color, texture, shape and contour of the image, the spatial relationships among these features, and the image semantics, and by matching on the degree of similarity between images. To achieve a better filtering effect, the preprocessing removes irrelevant information from the network information source, retains the useful information, and separates and quantizes its descriptive features; the explicit and implicit information that reflects the content or helps to distinguish its characteristics is then extracted, so that bad information can be effectively expressed as feature items.

As the above description shows, the present invention can filter bad information out of network content accurately and effectively, based on the context of the text content and the various features of the image content, providing the user with a clean network environment. Its application prospects are broad.

Claims

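1. A bad information filtering method based on content understanding, characterized in that it comprises the following steps:

Step 1: preprocess the content of the network information source and extract from it the explicit and implicit features that reflect the content or help to distinguish it, so that bad information content is effectively expressed as feature items;

Step 2: match the bad information templates against the content to be processed according to the matching rules and methods;

Step 3: filter the information source according to the matching result;

Step 4: return the processed result to the user of the Web page.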

2. The bad information filtering method based on content understanding according to claim 1, characterized in that the network information source comprises text content and image content.

3. The bad information filtering method based on content understanding according to claim 2, characterized in that the text content is filtered according to its context and text elements, bad information being found by analyzing and understanding the semantics of the text.

4. The bad information filtering method based on content understanding according to claim 2, characterized in that the image content is filtered by using as indices the color, texture, shape and contour of the image, the spatial relationships among these features, and the image semantics, and by matching on the degree of similarity between images.

5. The bad information filtering method based on content understanding according to claim 1, characterized in that the bad information in step 2 comprises obscene and pornographic content, reactionary and violent content, and spam.

6. The bad information filtering method based on content understanding according to claim 1, characterized in that the preprocessing removes irrelevant information from the network information source, retains the useful information, and separates and quantizes its descriptive features; the explicit and implicit information that reflects the content or helps to distinguish its characteristics is then extracted, so that bad information can be effectively expressed as feature items.