CN103198066A - Word list based information search method and search system - Google Patents

Word list based information search method and search system Download PDF

Info

Publication number
CN103198066A
CN103198066A CN 201210002697 CN201210002697A CN103198066A CN 103198066 A CN103198066 A CN 103198066A CN 201210002697 CN201210002697 CN 201210002697 CN 201210002697 A CN201210002697 A CN 201210002697A CN 103198066 A CN103198066 A CN 103198066A
Authority
CN
China
Prior art keywords
descriptor
safe class
module
security strategy
vocabulary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201210002697
Other languages
Chinese (zh)
Inventor
王沁泉
王佳强
杨娜
胡文翠
潘树燊
文勖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN 201210002697 priority Critical patent/CN103198066A/en
Publication of CN103198066A publication Critical patent/CN103198066A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a word list based information search method and search system, wherein theme words in a word list have respective security levels, and each security level correspondingly have a security policy. The method comprises the following steps: a search request containing a key word is received by the search system; an inquiry is carried out by the search system according to the key word to obtain a theme word matched with the key word; and the theme word is used by the search system to search, the corresponding security policy is determined according to the theme word matched with the key word, and search results are obtained according to the security policy. The invention ensures the controllability of the safety of the search results and the flexibility of the search results.

Description

A kind of information search method and search system based on vocabulary
Technical field
The present invention relates to communication technical field, particularly relate to a kind of information search method based on vocabulary and search system.
Background technology
Development along with real-time information network and content issue instrument and multimedia equipment, various information are more and more, search is as a kind of important means of obtaining information, make the user in abundant information resources, can find needed information fast, and become indispensable important tool of information age.
There is following several frequently seen information security issue at present on the internet: sensitive information, Pornograph, social controversial event, prohibited items information, gambling category information, swindle content, illegal advertising message etc.These unsound information make the security of information retrieval be subjected to challenge, and for the Search Results that guarantees to export meets the requirements, security strategy has been taked in search.
Present search system, adopt with a kind of security strategy for different themes word in the vocabulary is unified, be vocabulary only correspondence a kind of security strategy is set, security strategy for example comprises, do not return the sensitive word Search Results, directly return the sensitive word Search Results or return filtration after the sensitive word Search Results.Therefore, if the safe class of the security strategy of formulating is higher, can filter whole sensitive informations, not return the sensitive word Search Results, can also all filter out by the sensitive information that safe class is lower like this; If the safe class of the security strategy of formulating is lower, then can in Search Results, return the higher sensitive word Search Results of all or part of safe class.
This has not more and more satisfied existing search need, can't carry out security control to Search Results according to different search needs.
Summary of the invention
The object of the present invention is to provide a kind of information search method based on vocabulary and search system, in order to the problem that solve to adopt Search Results controllability that same security strategy causes and dirigibility to guarantee.
For this reason, the embodiment of the invention adopts following technical scheme:
The embodiment of the invention provides a kind of information search method based on vocabulary, and the descriptor in the described vocabulary has safe class separately, and each safe class correspondence is provided with corresponding security strategy; Described method comprises:
Search system receives the searching request that includes keyword;
Described search system is inquired about in described vocabulary according to described keyword, the descriptor that obtains mating;
Described search system is searched for according to the descriptor that matches, and according to the safe class of the descriptor correspondence that matches, determines this safe class corresponding security strategy, obtains the Search Results of described descriptor according to this security strategy.
The embodiment of the invention provides a kind of information search system based on vocabulary, and the descriptor in the described vocabulary has safe class separately; Described search system comprises: memory module, receiver module, enquiry module, security module and output module, wherein,
Memory module is for the corresponding relation of storage security grade and security strategy;
Receiver module is used for receiving the searching request that includes keyword;
Enquiry module is used for inquiring about the descriptor that obtains mating at described vocabulary according to the keyword that described receiver module receives;
Security module, for the safe class of the descriptor that matches according to described enquiry module, and described memory module stored relation, determine this safe class corresponding security strategy;
Search module is used for searching for according to the descriptor that matches, and obtains the Search Results of described descriptor according to the security strategy that described security module is determined.
Compared with prior art, embodiments of the invention have following advantage:
In the embodiments of the invention, descriptor in the vocabulary has safe class separately, each safe class correspondence is provided with corresponding security strategy, when the user initiates to search for, search system is inquired about in vocabulary according to keyword, the descriptor that obtains mating, and search for according to the descriptor that matches, safe class according to the descriptor correspondence that matches, determine this safe class corresponding security strategy, and obtain the Search Results of descriptor according to this security strategy, thereby make the descriptor of different safety class can carry out different security strategies, obtain different Search Results, guaranteed the controllability of Search Results aspect security requirement, and the dirigibility of Search Results.
Description of drawings
The information search method schematic flow sheet based on vocabulary that Fig. 1 provides for the embodiment of the invention;
The structured flowchart based on the information search system of vocabulary that Fig. 2 provides for the embodiment of the invention.
Embodiment
Below in conjunction with the accompanying drawing among the present invention, the technical scheme among the present invention is carried out clear, complete description, obviously, described embodiment is a part of embodiment of the present invention, rather than whole embodiment.Based on the embodiment among the present invention, the every other embodiment that those of ordinary skills obtain under the prerequisite of not making creative work belongs to the scope of protection of the invention.
In the embodiment of the invention, each descriptor in the vocabulary has safe class separately, and namely safe class is as an attribute of descriptor and exist.Each safe class correspondence is provided with corresponding security strategy.Safe class can be divided according to actual needs, for example, sensitive information, Pornograph, social controversial event, part prohibited items, gambling category information, swindle content, illegal advertising message etc. are to social harm degree difference, when setting up vocabulary, be each the descriptor setting safe class separately in the vocabulary.For example, the safe class that will be referred to swindle the descriptor of content is set to the highest, and the descriptor safe class that will be referred to illegal advertising message is set to take second place.
Security strategy typically refers to the strategy that presents of Search Results, and for example, common security strategy has at present: return whole Search Results, returning part Search Results or do not return Search Results.Function by adopting security strategy can realize filtering sensitive information does not repeat them here.
In the embodiment of the invention, can be according to the security control needs, for the different safety class of descriptor in the vocabulary arranges corresponding security strategy.For example, for the security strategy of Search Results is not returned in the highest safe class setting, for medium safe class arranges the security strategy of returning part Search Results, return the security strategy of whole Search Results for minimum safe class setting.During specific implementation, the mapping relations table of safe class and the security strategy of descriptor in the vocabulary can be set, can find the safe class corresponding security strategy by this mapping relations table.Search system can be upgraded the safe class of descriptor in the vocabulary and the mapping relations table of security strategy according to time or needs.For example, in important festivals or hold in important political activity, social activities, the competitive sports equal time section, can the descriptor of medium safe class is corresponding with the security strategy of all not returning Search Results, realize high security control.When red-letter day later or behind the activity end, can be only that the safe class that comprises contents such as staying sensitive information, social controversial event, part prohibited items is higher descriptor is corresponding with whole security strategies of not returning Search Results.Again for example, originally minimum safe grade corresponding security strategy is for all returning Search Results, according to actual needs at present, need be adjusted into part and return Search Results, at this situation, only need in the mapping relations table described minimum safe grade corresponding security strategy is adjusted into part and return Search Results and get final product, need not to revise vocabulary.
Based on above setting, the flow process based on the information search method of vocabulary that Fig. 1 shows that the embodiment of the invention provides, as shown in the figure, this flow process can comprise:
Step 11, search system receives the searching request that includes keyword.
After search system receives searching request, can carry out word segmentation processing according to the information of carrying in the searching request usually, to determine keyword.The word segmentation processing mode can adopt existing mode to realize that the embodiment of the invention does not limit the word segmentation processing mode.
Step 12, search system is inquired about in vocabulary according to keyword, the descriptor that obtains mating.
Concrete, the descriptor that obtains the coupling of keyword can adopt existing mode to realize, for example, realizes by the accurate matching logic of vocabulary.
Step 13, search system is searched for according to the descriptor that matches, and according to the safe class of the descriptor correspondence that matches, determines this safe class corresponding security strategy, obtains the Search Results of this descriptor according to this security strategy.
Concrete, after the descriptor that search system obtains mating, safe class according to this this descriptor, be stored in the safe class of local descriptor and the mapping relations table of security strategy by inquiry, obtain corresponding security strategy, and adopt this security strategy to obtain the Search Results of this descriptor.
For the document category file, Search Results comprises information such as the author, title of document; For the web page class file, Search Results comprises the URL of this webpage.
By above description as can be seen, in the embodiments of the invention, descriptor in the vocabulary has safe class separately, each safe class correspondence is provided with corresponding security strategy, when the user initiates to search for, search system is inquired about in vocabulary according to keyword, the descriptor that obtains mating, search for according to the descriptor that matches, and according to the safe class of the descriptor correspondence that matches, determine this safe class corresponding security strategy, obtain the Search Results of descriptor according to this security strategy, thereby make the descriptor of different safety class can carry out different security strategies, return different Search Results, guaranteed the controllability of Search Results aspect security requirement, and the dirigibility of Search Results.In addition, when security strategy changes, only need to adjust the corresponding relation of security strategy or renewal security strategy and safe class, need not to revise vocabulary, thereby reduced the work of manual maintenance vocabulary, reduced the complicacy that vocabulary is safeguarded.
The embodiment of the invention also provides a kind of information search system based on vocabulary, and the descriptor in the vocabulary has safe class separately, and as shown in Figure 2, this search system comprises:
Memory module 21 is for the corresponding relation of storage security grade and security strategy.
Receiver module 22 is used for receiving the searching request that includes keyword.
Enquiry module 23 is used for inquiring about the descriptor that obtains mating at vocabulary according to the keyword that receiver module 22 receives.
Security module 24, for the safe class of the descriptor that matches according to enquiry module 23, and memory module 21 stored relation, determine this safe class corresponding security strategy.
Search module 25 is used for searching for according to the descriptor that matches, and obtains the Search Results of described descriptor according to the security strategy that described security module 24 is determined.
Concrete, the memory module 21 concrete Storage Mapping relation tables that are used for, the mapping relations table comprises: the mapping relations of the safe class of descriptor and security strategy in the vocabulary.
Security module 24 specifically is used for, and determines the safe class of the descriptor that described enquiry module 23 matches, and searches the mapping relations table of storage in the memory module 21 according to this safe class, obtains corresponding security strategy.
Concrete, the embodiment of the invention can also comprise based on the information search system of vocabulary: update module 26 is used for upgrading the safe class of descriptor and the corresponding relation of security strategy according to time or needs.During specific implementation, update module 26 can updated stored module 21 in the mapping relations table of storage.
Security module is the important ring in the search system, the for example microblogging search of numerous business, the search of Qzone community waits all and provides the platform of realizing and detecting for security module, the present invention mainly is by inquire about descriptor in vocabulary, descriptor according to coupling is searched for, carry out the descriptor corresponding security strategy, obtain corresponding Search Results, thereby realize the security control of vocabulary.
Information search method of the present invention and search system, descriptor by different safety class is carried out different security strategies, obtain different Search Results, make the security of Search Results controlled, easy control, not only improve search system dirigibility and controllability greatly, also reduced the work of manual maintenance vocabulary simultaneously.
It will be appreciated by those skilled in the art that the module in the device among the embodiment can be distributed in the device of embodiment according to the embodiment description, also can carry out respective change and be arranged in the one or more devices that are different from present embodiment.The module of above-described embodiment can be merged into a module, also can further split into a plurality of submodules.
Through the above description of the embodiments, those skilled in the art can be well understood to the present invention and can realize by the mode that software adds essential general hardware platform, can certainly pass through hardware, but the former is better embodiment under a lot of situation.Based on such understanding, the part that technical scheme of the present invention contributes to prior art in essence in other words can embody with the form of software product, this computer software product is stored in the storage medium, comprise that some instructions are with so that a station terminal equipment (can be mobile phone, personal computer, server, the perhaps network equipment etc.) carry out the described method of each embodiment of the present invention.
The above only is preferred implementation of the present invention; should be pointed out that for those skilled in the art, under the prerequisite that does not break away from the principle of the invention; can also make some improvements and modifications, these improvements and modifications also should be looked protection scope of the present invention.

Claims (6)

1. A kind of information search method based on vocabulary is characterized in that the descriptor in the described vocabulary has safe class separately, and each safe class correspondence is provided with corresponding security strategy; Described method comprises:
Search system receives the searching request that includes keyword;
Described search system is inquired about in described vocabulary according to described keyword, the descriptor that obtains mating;
Described search system is searched for according to the descriptor that matches, and according to the safe class of the descriptor correspondence that matches, determines this safe class corresponding security strategy, obtains the Search Results of described descriptor according to this security strategy.
2. The method of claim 1 is characterized in that, described search system is determined this safe class corresponding security strategy according to the safe class of the descriptor correspondence that matches, and specifically comprises:
Search system is stored in local mapping relations table by searching, and determines the safe class corresponding security strategy, and described mapping relations table comprises: the mapping relations of the safe class of descriptor and security strategy in the vocabulary.
3. The method of claim 1 is characterized in that, also comprises: search system is upgraded the corresponding relation of described safe class and described security strategy.
4. A kind of information search system based on vocabulary is characterized in that the descriptor in the described vocabulary has safe class separately; Described search system comprises: memory module, receiver module, enquiry module, security module and output module, wherein,
Memory module is for the corresponding relation of storage security grade and security strategy;
Receiver module is used for receiving the searching request that includes keyword;
Enquiry module is used for inquiring about the descriptor that obtains mating at described vocabulary according to the keyword that described receiver module receives;
Security module, for the safe class of the descriptor that matches according to described enquiry module, and described memory module stored relation, determine this safe class corresponding security strategy;
Search module is used for searching for according to the descriptor that matches, and obtains the Search Results of described descriptor according to the security strategy that described security module is determined.
5. Search system as claimed in claim 4 is characterized in that, described memory module specifically is used for, the Storage Mapping relation table, and described mapping relations table comprises: the mapping relations of the safe class of descriptor and security strategy in the vocabulary;
Described security module specifically is used for, and determines the safe class of the descriptor that described enquiry module matches, and searches the mapping relations table of storing in the described memory module according to this safe class, obtains corresponding security strategy.
6. Search system as claimed in claim 4 is characterized in that, also comprises:
Update module be used for to be upgraded safe class that described memory module stores and the corresponding relation of security strategy.
CN 201210002697 2012-01-06 2012-01-06 Word list based information search method and search system Pending CN103198066A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201210002697 CN103198066A (en) 2012-01-06 2012-01-06 Word list based information search method and search system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201210002697 CN103198066A (en) 2012-01-06 2012-01-06 Word list based information search method and search system

Publications (1)

Publication Number Publication Date
CN103198066A true CN103198066A (en) 2013-07-10

Family

ID=48720635

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201210002697 Pending CN103198066A (en) 2012-01-06 2012-01-06 Word list based information search method and search system

Country Status (1)

Country Link
CN (1) CN103198066A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103631908A (en) * 2013-11-26 2014-03-12 百度在线网络技术(北京)有限公司 Pornographic information processing method, information processing method, information processing client terminal and information processing server
CN105787029A (en) * 2016-02-25 2016-07-20 浪潮软件集团有限公司 SOLR-based key word recognition method
CN109189984A (en) * 2018-08-15 2019-01-11 百度在线网络技术(北京)有限公司 For showing the method and device of information
CN110020153A (en) * 2017-11-30 2019-07-16 北京搜狗科技发展有限公司 A kind of searching method and device
CN111177518A (en) * 2019-12-18 2020-05-19 深圳市任子行科技开发有限公司 Webpage purification method, system and computer readable storage medium
CN111708938A (en) * 2020-05-27 2020-09-25 北京百度网讯科技有限公司 Method, apparatus, electronic device, and storage medium for information processing
CN113407671A (en) * 2017-06-01 2021-09-17 互动解决方案公司 Data information storage device for search

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103631908A (en) * 2013-11-26 2014-03-12 百度在线网络技术(北京)有限公司 Pornographic information processing method, information processing method, information processing client terminal and information processing server
CN103631908B (en) * 2013-11-26 2017-07-25 百度在线网络技术(北京)有限公司 The processing method of pornography, the processing method of information, client and server
CN105787029A (en) * 2016-02-25 2016-07-20 浪潮软件集团有限公司 SOLR-based key word recognition method
CN113407671A (en) * 2017-06-01 2021-09-17 互动解决方案公司 Data information storage device for search
CN110020153A (en) * 2017-11-30 2019-07-16 北京搜狗科技发展有限公司 A kind of searching method and device
CN109189984A (en) * 2018-08-15 2019-01-11 百度在线网络技术(北京)有限公司 For showing the method and device of information
CN109189984B (en) * 2018-08-15 2022-04-19 百度在线网络技术(北京)有限公司 Method and device for displaying information
CN111177518A (en) * 2019-12-18 2020-05-19 深圳市任子行科技开发有限公司 Webpage purification method, system and computer readable storage medium
CN111708938A (en) * 2020-05-27 2020-09-25 北京百度网讯科技有限公司 Method, apparatus, electronic device, and storage medium for information processing
CN111708938B (en) * 2020-05-27 2023-04-07 北京百度网讯科技有限公司 Method, apparatus, electronic device, and storage medium for information processing

Similar Documents

Publication Publication Date Title
CN103198066A (en) Word list based information search method and search system
CN107660284B (en) Search improvement based on machine learning
US10180967B2 (en) Performing application searches
US11580168B2 (en) Method and system for providing context based query suggestions
CN102110170B (en) System with information distribution and search functions and information distribution method
US20110289015A1 (en) Mobile device recommendations
CN107145496A (en) The method for being matched image with content item based on keyword
CN101996195A (en) Searching method and device of voice information in audio files and equipment
CN104516910A (en) Method and system for recommending content in client-side server environment
CN102279894A (en) Method for searching, integrating and providing comment information based on semantics and searching system
CN102750629B (en) Schedule association method and device
CN101149758A (en) Searching system and searching method
CN104699737A (en) Method and system for managing a search
CN102426591A (en) Method and device for operating corpus used for inputting contents
US20170168695A1 (en) Graphical User Interface for Generating Structured Search Queries
CN107463592B (en) Method, device and data processing system for matching a content item with an image
US10747824B2 (en) Building a data query engine that leverages expert data preparation operations
CN101216837A (en) Method and system for displaying search result based on matching user personalized configuration
CN102682036A (en) Non-editing based method and system for searching media assets
CN103049495A (en) Method, device and equipment for providing searching advice corresponding to inquiring sequence
CN106503251A (en) Searching method and searcher
CN107145497A (en) The method of the image of metadata selected and content matching based on image and content
US11537672B2 (en) Method and system for filtering content
CN103076894A (en) Method and equipment for building input entries for object identity information according to object identity information
CN106484694A (en) Full-text search method based on distributed data base and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131017

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20131017

Address after: A Tencent Building in Shenzhen Nanshan District City, Guangdong streets in Guangdong province science and technology 518057 16

Applicant after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518057 Zhenxing Road, SEG Science Park 2 East Room 403

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

C05 Deemed withdrawal (patent law before 1993)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130710