CN101968807A - Content retrieval method and device - Google Patents
Content retrieval method and device Download PDFInfo
- Publication number
- CN101968807A CN101968807A CN 201010517829 CN201010517829A CN101968807A CN 101968807 A CN101968807 A CN 101968807A CN 201010517829 CN201010517829 CN 201010517829 CN 201010517829 A CN201010517829 A CN 201010517829A CN 101968807 A CN101968807 A CN 101968807A
- Authority
- CN
- China
- Prior art keywords
- user
- issue
- web page
- content
- keyword
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The invention discloses a content retrieval method and a content retrieval device, which are used for realizing the content retrieval of information and improving information retrieval efficiency and accuracy. The method comprises the following steps of: acquiring a structure tag corresponding to a keyword input by a user; acquiring a release user identifier (ID) embedded into the current webpage; and retrieving information in the structure tag, which is matched with the input keyword, in a range of release information corresponding to the release user ID. The invention also discloses a device for implementing the method.
Description
Technical field
The present invention relates to the computing machine and the communications field, particularly relate to a kind of method and device of content retrieval.
Background technology
Computing machine and Internet technology are widely used, and resource sharing is its principal feature.How searching the information that self needs from huge information resources, is the problem that the user generally is concerned about.
Tens million of websites all need the retrieval of internal information of standing in the internet, but limit because of technology, and present most of websites all can only provide simple full text keyword matching retrieval, and the key word of importing according to the user comes search information.Yet for the webpage magnanimity information that expands day by day, the full-text search of being undertaken by key word is at the needs that all can not satisfy the user aspect search speed and the Search Results.Therefore, those skilled in the art have proposed the solution of structuring search.
Present structuring search information database structure or pre-designed, perhaps can add voluntarily but not according to versatility by the user, the result who causes is exactly: when range of search is whole network, when scope is big, may retrieve a lot of useless information, and Useful Information may need could to screen from the information that retrieves through the long period, and recall precision is lower, can't adapt to the diversified development of information and the needs of all types of user.And the user must be familiar with various taxonomic hierarchies and could comparatively fast retrieve information needed when using all kinds of different web sites.
The prior art structuring search that proposed to make a summary replaces original classified search, can improve the accuracy of search, but present content retrieval scope is bigger, is further improved.
Summary of the invention
The embodiment of the invention provides a kind of method of content retrieval, is used to realize the content retrieval to information, improves the efficient and the accuracy of information retrieval.
A kind of method of content retrieval may further comprise the steps:
Obtain the pairing structure label of keyword of user's input;
The issue user identifier ID that embeds in the acquisition current web page;
In the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.
A kind of device of content retrieval comprises:
Acquisition module is used to obtain the pairing structure label of keyword that the user imports;
Execution module is used for obtaining the issue user ID that current web page embeds;
Search module is used in the scope that releases news of this issue user ID correspondence under this structure label of retrieval the information with the keyword coupling of described input.
The embodiment of the invention obtains the pairing structure label of keyword of user's input; The issue user identifier ID that embeds in the acquisition current web page; In the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.Thereby realized in specialized range, passing through the structured content retrieving information, retrieved targetedly, made retrieving more quick, improved the efficient and the accuracy of information retrieval.
Description of drawings
Fig. 1 is the synoptic diagram of individual layer content structure in the embodiment of the invention;
Fig. 2 A and Fig. 2 B are the synoptic diagram of multilevel content structure in the embodiment of the invention;
Fig. 3 is the primary structure figure of content search apparatus in the embodiment of the invention;
Fig. 4 A is the detailed structure view of content search apparatus in the embodiment of the invention;
Fig. 4 B is the detailed structure view that has the content search apparatus of installed module in the embodiment of the invention;
Fig. 5 is the main method process flow diagram of content retrieval in the embodiment of the invention;
Fig. 6 is the detailed method process flow diagram of content retrieval in the embodiment of the invention;
Fig. 7 is the detailed method process flow diagram of content retrieval in the time of will issuing in the webpage that user ID embeds issue in the embodiment of the invention;
Fig. 8 is the detailed method process flow diagram of content retrieval will issue user ID in the embodiment of the invention and embed click in the incident of index button the time.
Embodiment
The embodiment of the invention obtains the pairing structure label of keyword of user's input; The issue user identifier ID that embeds in the acquisition current web page; In the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.Thereby realized in specialized range, passing through the structured content retrieving information, retrieved targetedly, made retrieving accurately quick more, the efficient and the accuracy that have improved information retrieval.
The structure that comprises the path (comprising link) of pointing to storage file in the embodiment of the invention all belongs to content structure.Structure in the content structure is said from the division angle and is comprised sorting item and structure item, comprises structure label and structure content (being the keyword that the user imports) from content.Content structure as shown in Figure 1, a sorting item and a structure item can navigate to a structural unit, the sign speech of sorting item and structure item is the structure label, " () " under the structure label is used for the user and imports keyword.Sorting item and structure item have constituted the content structure of one deck two dimension.Under the sorting item a plurality of structure items can be arranged.The sign speech of sorting item, i.e. structure label such as news, bulletin, knowledge, product, service, Yellow Page, human communication, forum, program request and download etc.The sign speech of the structure item under the news category item, i.e. structure label such as main body, behavior, time etc.The all right layering of content structure, each structure label is a structure label of corresponding last layer all, and a structure label of last layer can corresponding a plurality of structure labels of one deck down.A kind of tree structure of whole contents similar, each node (being sub-content structure) all is made up of a component category and structure item.The a plurality of structure labels of a structure label of last layer in can a sub-content structure of corresponding one deck down, shown in Fig. 2 A, the structure label in also can a plurality of sub-content structure of corresponding one deck down is shown in Fig. 2 B.The original state of content structure is that all sub-content structures comprise identical structure label, and the sub-content structure of following one deck is inherited the sub-content structure of last layer.
Referring to Fig. 3, the device that is used for content retrieval in the embodiment of the invention comprises acquisition module 101, execution module 102 and search module 103.Wherein, this device can be positioned at client-side.
Referring to Fig. 4 A, described device also comprises release module 104 and processing module 105.
Referring to Fig. 4 B, described device also comprises installed module 106.
Installed module 106 is used for and will be linked to the sign embedded web page of content structure.Webpage can select whether to need to install the sign that is linked to content structure as required, install if select, then installed module 106 will be linked in the sign embedded web page of content structure, after triggering clicks to the incident of sign of content structure, to user's output content structure, the user imports keyword in this content structure, wherein, trigger the incident of the sign that clicks to content structure to the sign of content structure by user clicks on links, index button under the click on content structure, by this content structure, in the scope of the pairing structure label of this keyword, reach search information in the pairing scope that releases news of issue user ID that execution module 102 obtains.Execution module 102 can obtain the issue user ID when triggering clicks to the incident of sign of content structure, perhaps when triggering the incident of clicking index button, obtain issue user ID, the issue user ID that perhaps when user one enters the Web page, obtains to embed in the webpage.
Introduce the method for content retrieval below.
Referring to Fig. 5, the main method flow process of content retrieval is as follows:
Step 501: the pairing structure label of keyword that obtains user's input.
Step 502: the distribution indicator that obtains to embed in the current web page accords with ID.
Step 503: in the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.
Referring to Fig. 6, the detailed method flow process of content retrieval is as follows:
When the user need be in this web search information, execution in step 601.
Step 601: the incident that triggers the sign that clicks to content structure.Wherein, click sign by the user and trigger the incident of clicking sign, wherein, the described sign that is linked to content structure that is designated.
Step 602: obtain the issue user ID.Execution module 102 obtains the issue user ID by receiving the sign that clicks to content structure.
Perhaps processing module 105 will be issued in the user ID embedded web page in advance, execution module 102 obtains the issue user ID that embeds in the described webpage when the user enters the Web page, perhaps processing module 105 will be issued in the incident of user ID embedding click index button, when the user clicks index button, be equivalent to trigger the incident of clicking index button, then execution module 102 obtains the issue user ID.
Step 603: the keyword that obtains input.
Step 604: the pairing structure label of keyword that obtains input.
Step 605: trigger the incident of clicking index button.Wherein, click index button by the user and trigger the incident of clicking index button.
Step 606: the information of in the pairing scope that releases news of this issue user ID, mating with keyword under this structure label of retrieval.
Referring to Fig. 7, will issue user ID in advance and embed in the webpage of issuing.The detailed method flow process of content retrieval is as follows:
When the user enters this webpage, execution in step 701.
Step 701: by the corresponding relation acquisition issue user ID of banner with the issue user ID of this webpage of issue.
Step 702: the keyword that obtains input.
Step 703: the pairing structure label of keyword that obtains input.
Step 704: trigger the incident of clicking index button.
Step 705: the information of in the pairing scope that releases news of this issue user ID, mating with keyword under this structure label of retrieval.
Referring to Fig. 8, will issue user ID in advance and embed in the incident of clicking index button.The detailed method flow process of content retrieval is as follows:
When the user need be in this web search information, execution in step 801.
Step 801: the incident that triggers the sign that clicks to content structure.Click sign by the user and trigger the incident of clicking sign, the wherein said sign that is linked to content structure that is designated is to user's output content structure.
Step 802: the keyword that obtains input.
Step 803: the pairing structure label of keyword that obtains input.
Step 804: trigger the incident of clicking index button.Click index button by the user and trigger the incident of clicking index button.
Step 805: obtain the issue user ID.
Step 806: the information of in the pairing scope that releases news of this issue user ID, mating with keyword under this structure label of retrieval.
The embodiment of the invention obtains the pairing structure label of keyword of user's input; The distribution indicator that obtains to embed in the current web page accords with ID; In the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.Thereby realized in specialized range retrieving targetedly, made retrieving more quick, improved the efficient and the accuracy of information retrieval according to the structured content retrieving information.And each website of the method can be general, and the user needn't could comparatively fast retrieve information needed after being familiar with the different taxonomic hierarchies in each website.Not only for the website provides a structured content searching database separately, also can be for different web sites provide Universal Database, thus improve the retrieval and the service efficiency of site information.The user can obtain the issue user ID that embeds in the described webpage when entering the Web page; Perhaps, when arriving the sign of content structure, user clicks on links obtains to embed the issue user ID; Perhaps, when clicking index button, the user obtains the issue user ID that embeds in the described webpage.The mode that obtains the issue user ID is versatile and flexible, is convenient to the user and selects.And, release module 104 information releasing can leave in the general database of various information, this database can be positioned on the server, can be the webserver etc., retrieve, the information in the Universal Database is brought in constant renewal in for the user, quantity of information is than horn of plenty, can be for user search to more information, and can deposit same issue user ID in this database and be distributed on information in the different web sites, different issue user ID information releasing also can be deposited.The retrieval of content structure in the embodiment of the invention is applicable to the general retrieval of most websites, when the user retrieves at home Web site's page, by the issue user ID that obtains to embed in the webpage, can retrieve this issue ID and be distributed on relevant information in the different web sites.
Those skilled in the art should understand that embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt complete hardware embodiment, complete software implementation example or in conjunction with the form of the embodiment of software and hardware aspect.And the present invention can adopt the form that goes up the computer program of implementing in one or more computer-usable storage medium (including but not limited to magnetic disk memory and optical memory etc.) that wherein include computer usable program code.
The present invention is that reference is described according to the process flow diagram and/or the block scheme of method, equipment (system) and the computer program of the embodiment of the invention.Should understand can be by the flow process in each flow process in computer program instructions realization flow figure and/or the block scheme and/or square frame and process flow diagram and/or the block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out by the processor of computing machine or other programmable data processing device produce to be used for the device of the function that is implemented in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.
These computer program instructions also can be stored in energy vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work, make the instruction that is stored in this computer-readable memory produce the manufacture that comprises command device, this command device is implemented in the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
These computer program instructions also can be loaded on computing machine or other programmable data processing device, make on computing machine or other programmable devices and to carry out the sequence of operations step producing computer implemented processing, thereby the instruction of carrying out on computing machine or other programmable devices is provided for being implemented in the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.
Claims (10)
1. the method for a content retrieval is characterized in that, may further comprise the steps:
Obtain the pairing structure label of keyword of user's input;
The issue user identifier ID that embeds in the acquisition current web page;
In the scope that releases news of this issue user ID correspondence under this structure label of retrieval with the information of the keyword of described input coupling.
2. the method for claim 1, it is characterized in that, the step that obtains the pairing structure label of keyword of user's input comprises: to user's output content structure, and obtain the pairing structure label of keyword of user's input by content structure when obtaining to click the incident of sign; Wherein, the described sign that is linked to content structure that embeds in the webpage that is designated.
3. the method for claim 1 is characterized in that, the issue user ID that embeds in the described current web page is for to be embedded in the current web page when publishing web page.
4. as claim 1 or 3 described methods, it is characterized in that the step that obtains the issue user ID that embeds in the current web page comprises: when the user enters the Web page, obtain the issue user ID that embeds in the described webpage; Perhaps, when arriving the sign of content structure, user clicks on links obtains the ID of embedding; Perhaps, when clicking index button, the user obtains the issue user ID of embedding.
5. method as claimed in claim 4 is characterized in that, the step of the issue user ID that embeds in the acquisition current web page comprises:
By the corresponding relation acquisition issue user ID of banner with the issue user ID of this webpage of issue; Perhaps
When triggering clicks to the sign of content structure or clicks the incident of index button, obtain the issue user ID by this incident.
6. the device of a content retrieval is characterized in that, comprising:
Acquisition module is used to obtain the pairing structure label of keyword that the user imports;
Execution module is used for obtaining the issue user ID that current web page embeds;
Search module is used in the scope that releases news of this issue user ID correspondence under this structure label of retrieval the information with the keyword coupling of described input.
7. device as claimed in claim 6 is characterized in that, to user's output content structure, and obtains the pairing structure label of keyword of user's input by content structure when described acquisition module also is used to obtain to click the incident of sign; Wherein, the described sign that is linked to content structure that embeds in the webpage that is designated.
8. device as claimed in claim 6 is characterized in that, the issue user ID that embeds in the described current web page is for to be embedded in the current web page when publishing web page.
9. as claim 6 or 8 described devices, it is characterized in that described execution module also is used for obtaining the issue user ID that described webpage embeds when the user enters the Web page; Perhaps, when arriving the sign of content structure, user clicks on links obtains the issue user ID of embedding; Perhaps, when clicking index button, the user obtains the issue user ID of embedding.
10. device as claimed in claim 9 is characterized in that, described execution module also is used for by the corresponding relation acquisition issue user ID of banner with the issue user ID of this webpage of issue; When perhaps triggering sign that clicks to content structure or the incident of clicking index button, obtain the issue user ID by this incident.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010517829 CN101968807A (en) | 2010-10-15 | 2010-10-15 | Content retrieval method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010517829 CN101968807A (en) | 2010-10-15 | 2010-10-15 | Content retrieval method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101968807A true CN101968807A (en) | 2011-02-09 |
Family
ID=43547964
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010517829 Pending CN101968807A (en) | 2010-10-15 | 2010-10-15 | Content retrieval method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101968807A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1462003A (en) * | 2002-05-28 | 2003-12-17 | 百度在线网络技术(北京)有限公司 | Method of issuring information and queuing by bid using searching engine |
CN101079051A (en) * | 2007-06-11 | 2007-11-28 | 周广宇 | Method for structural information issue and search at network environment |
US20080082526A1 (en) * | 2006-09-28 | 2008-04-03 | Takuya Kanawa | Method, apparatus, and computer program product for searching structured document |
EP1926030A2 (en) * | 2006-11-23 | 2008-05-28 | Samsung Electronics Co., Ltd. | Apparatus and method for optimized index search |
CN101517574A (en) * | 2006-07-25 | 2009-08-26 | 韩国电子通信研究院 | Illegal contents auto-searching system and method using access/search application on internet |
WO2009133667A1 (en) * | 2008-04-30 | 2009-11-05 | パナソニック株式会社 | Device for displaying result of similar image search and method for displaying result of similar image search |
CN101655862A (en) * | 2009-08-11 | 2010-02-24 | 华天清 | Method and device for searching information object |
US20100250560A1 (en) * | 2007-12-05 | 2010-09-30 | S. Grants Co., Ltd. | Bit string merge sort device, method, and program |
CN101866347A (en) * | 2005-10-23 | 2010-10-20 | 谷歌公司 | Method, system that structural data is searched for and method, the system that makes data item structured and can search for |
-
2010
- 2010-10-15 CN CN 201010517829 patent/CN101968807A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1462003A (en) * | 2002-05-28 | 2003-12-17 | 百度在线网络技术(北京)有限公司 | Method of issuring information and queuing by bid using searching engine |
CN101866347A (en) * | 2005-10-23 | 2010-10-20 | 谷歌公司 | Method, system that structural data is searched for and method, the system that makes data item structured and can search for |
CN101517574A (en) * | 2006-07-25 | 2009-08-26 | 韩国电子通信研究院 | Illegal contents auto-searching system and method using access/search application on internet |
US20080082526A1 (en) * | 2006-09-28 | 2008-04-03 | Takuya Kanawa | Method, apparatus, and computer program product for searching structured document |
EP1926030A2 (en) * | 2006-11-23 | 2008-05-28 | Samsung Electronics Co., Ltd. | Apparatus and method for optimized index search |
CN101079051A (en) * | 2007-06-11 | 2007-11-28 | 周广宇 | Method for structural information issue and search at network environment |
US20100250560A1 (en) * | 2007-12-05 | 2010-09-30 | S. Grants Co., Ltd. | Bit string merge sort device, method, and program |
WO2009133667A1 (en) * | 2008-04-30 | 2009-11-05 | パナソニック株式会社 | Device for displaying result of similar image search and method for displaying result of similar image search |
CN101655862A (en) * | 2009-08-11 | 2010-02-24 | 华天清 | Method and device for searching information object |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3161678B1 (en) | Deep links for native applications | |
US10713324B2 (en) | Search results for native applications | |
CN103221951B (en) | Predictive query suggestion caching | |
CN109145078B (en) | The application page of the machine application is indexed | |
US20120246558A1 (en) | Social bookmarking of resources exposed in web pages | |
CN101655862A (en) | Method and device for searching information object | |
CN101288075A (en) | Simultaneously spawning multiple searches across multiple providers | |
CN102231152B (en) | Searching method for precisely inquiring based on IP (Internet Protocol) address of mobile terminal | |
CN102117331B (en) | Video search method and system | |
CN103970800A (en) | Method and system for extracting and processing webpage related keywords | |
CN105007314A (en) | Big data processing system oriented to mass reading data of readers | |
CN102214182A (en) | Accurate query searching method according to internet protocol (IP) address | |
CN102508884A (en) | Method and device for acquiring hotpot events and real-time comments | |
CN111159590A (en) | Serial connection method and device based on front-end and back-end service call links | |
Cox et al. | SISSVoc: A Linked Data API for access to SKOS vocabularies | |
CN105095383A (en) | Information issuance method, information search method and relevant device | |
CN103324764A (en) | Web implementation of multi-condition random keyword multi-field fuzzy query method | |
Srinivas et al. | Web service architecture for a meta search engine | |
CN105574185A (en) | Method and device for providing clustering type intelligent summaries | |
CN102222067A (en) | Searching method for accurately querying information according to IP (Internet Protocol) address of keyword | |
CN101968807A (en) | Content retrieval method and device | |
Gupta et al. | Exploringhidden'parts of the web: the hidden web | |
CN103377215A (en) | Information promotion method and system | |
US20130226900A1 (en) | Method and system for non-ephemeral search | |
Ding et al. | On-tourism: semantic e-tourism portal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20110209 |