CN1858733B - Information searching system and searching method - Google Patents

Info

Publication number
CN1858733B
Authority
CN
China
Prior art keywords
retrieval
user
search
search engine
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200510117147XA
Other languages
Chinese (zh)
Other versions
CN1858733A (en)
Inventor
王伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CN200510117147XA
Priority to PCT/CN2006/002804 (WO2007051397A1)
Publication of CN1858733A
Application granted
Publication of CN1858733B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/335 Filtering based on additional data, e.g. user or group profiles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This invention provides an information retrieval system comprising a search engine, a content index database provided to the search engine, a user feature database, and a content analysis system. An information retrieval method is also provided, comprising: retrieving according to the search keywords entered by a user to obtain original retrieval results; obtaining the characteristic behavior information corresponding to the user according to the user ID and the current time, the characteristic behavior information including at least one characteristic behavior keyword; retrieving again among the original retrieval results according to the characteristic behavior keyword; and displaying the secondary retrieval results to the user. The information retrieval system filters a user's search according to the user's different characteristic behaviors, improving the accuracy and efficiency with which the user finds relevant information.

Description

Information retrieval system and search method
Technical field
The present invention relates to the technical field of information retrieval, and in particular to an information retrieval system and retrieval method.
Background art
A search engine is a system that collects website and web page data, builds a database, and provides a query service. By working principle, search engines fall into two basic classes: full-text search engines (Full Text Search Engine) and classified directories (Directory).
The database of a full-text search engine is built by software known as a "web robot (Spider)" or "crawler", which automatically retrieves large amounts of web page content by following links across the network and then analyzes and organizes it according to fixed rules. Google and Baidu are typical full-text search engine systems. A query to a full-text search engine is usually described as searching "all websites", as with Google's full-text search (http://www.google.com/intl/zh-CN/).
A classified directory, by contrast, builds its database by compiling website data manually; examples include the directories of Yahoo China and of the domestic portals Sohu, Sina, and Netease. Some navigation sites can also be regarded as primitive classified directories, such as "website home" (http://www.hao123.com/). A query to a classified directory is usually described as searching the "classified directory" or searching "classified websites", as with "Sina search" (http://dir.sina.com.cn/) and "Yahoo China search" (http://cn.search.yahoo.com/dirsrch/).
Full-text search engines and classified directories each have their strengths in use. Because a full-text search engine relies on software, its database is very large, but its query results are often not accurate enough. A classified directory relies on people to collect and organize websites, so it can provide more accurate query results, but the content it collects is very limited. To combine the strengths of both, many search engines now offer both types of query. Integrating the two types has also produced other search services, which are also called search engines here for convenience. They are mainly of the following two types:
1. Meta search engine (META Search Engine). A search engine of this type generally has no web robot or database of its own. Its search results are produced by calling, controlling, and optimizing the search results of several other independent search engines and displaying them together on one interface in a unified format. Although a meta search engine has no "web robot" or "crawler" and no independent index database, it does have its own distinctive meta-search techniques for submitting retrieval requests, proxying retrieval interfaces, and displaying retrieval results. For example, the "metaFisher meta search engine" (http://www.hsfz.net/fish/) calls and integrates the data of several search engines such as Google, Yahoo, AlltheWeb, Baidu, and OpenFind.
2. Integrated search engine (All-in-One Search Page). An integrated search engine uses network technology to link many independent search engines on one web page. At query time the user clicks or selects the search engines and enters the query once; several search engines are queried simultaneously, and each search engine shows its results on a separate page, as with "internet Swiss Army Knife" (http://free.okey.net/%7Efree/searchl.htm).
The working principle of a search engine is as follows. The "web robot" or "crawler" of a full-text search engine is a piece of network software that traverses the Web space. It can scan the websites within a certain IP address range and follow links from one web page to another and from one website to another to collect web page data. To keep the collected data up to date, it also revisits pages it has already fetched. The pages collected by the web robot or crawler are then analyzed by other programs, which perform a large amount of computation according to a relevance algorithm to build the web page index, which is added to the content index database. The full-text search engine that users normally see is in fact the search interface of a search engine system. When a keyword is entered, the search engine finds in its huge content index database the index entries of all related pages that match the keyword and presents them according to certain ranking rules. Different search engines have different content index databases and different ranking rules, so the same keyword yields different search results on different search engines.
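The index-then-query behaviour described above can be illustrated with a minimal sketch. It assumes the crawler has already produced a mapping from URL to page text; the names `build_index`, `search`, and the sample pages are hypothetical and are not taken from the patent.

```python
from collections import defaultdict

def build_index(pages):
    """Map each lowercase word to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

def search(index, keyword):
    """Return the URLs indexed under a keyword, sorted for stable output."""
    return sorted(index.get(keyword.lower(), set()))

# Hypothetical crawled pages (URL -> extracted text), for illustration only.
PAGES = {
    "http://example.com/a": "computer game reviews",
    "http://example.com/b": "classical music news",
    "http://example.com/c": "game music soundtrack",
}
INDEX = build_index(PAGES)
print(search(INDEX, "game"))
```

A real engine would add a relevance ranking on top of this lookup; the sketch only shows why two engines with different indexes return different results for the same keyword.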
A conventional search engine today uses software to visit websites automatically, follow the hypertext links in them in turn, extract each file it encounters, and index each file in a large database by so-called "keywords" for later retrieval.
Specifically, this extraction reduces each file: all semantic and syntactic information is stripped away, leaving only the substantive content words the file contains. These content words may appear in the file itself or only in the description section of its Hypertext Markup Language (HTML) source. In either case, the engine creates an entry, that is, a file record, for each such file. Each file's content words are indexed in a searchable data structure, with a link pointing back to the file record. A file record usually includes: a. a network address, that is, a URL (uniform resource locator), through which a web browser can access the corresponding file; b. the distinct content words in the file and, in some engines, the position of each content word relative to the other content words of the file; c. a short abstract of the file, usually just a few lines or the first few lines of the file; d. possibly the description of the file given in its HTML description section.
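As a rough illustration of items a to d, a file record could be modelled as below. This is only a sketch: the class name `FileRecord`, the helper `make_record`, and the choice of the first five words as the abstract are assumptions, not details given in the text.

```python
from dataclasses import dataclass, field

@dataclass
class FileRecord:
    url: str                                            # (a) network address
    content_words: dict = field(default_factory=dict)   # (b) word -> positions in the file
    abstract: str = ""                                  # (c) short excerpt of the file
    description: str = ""                               # (d) HTML description section, if any

def make_record(url, text, description="", abstract_words=5):
    """Build a record from plain text; the abstract is simply the first few words."""
    words = text.split()
    positions = {}
    for i, word in enumerate(words):
        positions.setdefault(word.lower(), []).append(i)
    return FileRecord(url, positions, " ".join(words[:abstract_words]), description)

record = make_record(
    "http://example.com/a",
    "Game news and game reviews for computer game fans",
)
print(record.content_words["game"], record.abstract)
```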
When using a search engine, the user submits a keyword-based query. The search engine tries to find files containing as many of the keywords as possible and, when requested, restricts the search according to operators or other conditions (for example logical operations such as AND/OR/NOT). For each file found, the engine retrieves its file record and presents these records to the user, ordered by the number of keyword matches in that file relative to the other files found.
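A minimal sketch of such a keyword query follows, assuming documents are plain strings, that only AND and OR are supported, and that the score is a raw count of keyword occurrences. The function names and sample documents are hypothetical.

```python
def matches(word_set, keywords, mode="and"):
    """AND requires every keyword to be present; OR requires at least one."""
    hits = [k in word_set for k in keywords]
    return all(hits) if mode == "and" else any(hits)

def keyword_search(docs, keywords, mode="and"):
    """Return matching document ids, most keyword occurrences first."""
    keywords = [k.lower() for k in keywords]
    scored = []
    for doc_id, text in docs.items():
        words = text.lower().split()
        if matches(set(words), keywords, mode):
            score = sum(words.count(k) for k in keywords)
            scored.append((score, doc_id))
    # Higher score first; ties are broken by document id for determinism.
    return [doc_id for score, doc_id in sorted(scored, key=lambda p: (-p[0], p[1]))]

DOCS = {
    "doc1": "game game music",
    "doc2": "music only",
    "doc3": "game review",
}
print(keyword_search(DOCS, ["game", "music"], "and"))
print(keyword_search(DOCS, ["game", "music"], "or"))
```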
At present, a search engine simply responds to the keyword query the user provides. However, a user may have different behavioral habits at different times and therefore different needs, so the information the user hopes to retrieve may differ. Existing search methods do not take this into account and do not classify the search engine's results accordingly.
Summary of the invention
In view of this, the main purpose of the present invention is to provide a system and method for searching based on time-dependent user characteristic behavior, so that a user's search can be filtered according to the different characteristic behaviors the user exhibits in different time periods. As a result, different users obtain different results for the same keyword, and the same user obtains different results for the same keyword in different time periods, which improves the accuracy and efficiency with which users find relevant information.
The invention provides an information retrieval system comprising a search engine (12) and a content index database (11) provided for the search engine to search, and further comprising:
A user feature database (14), which stores the characteristic behavior information that a user has in different time periods;
A content analysis system (13), configured to obtain the search keyword entered at the user terminal and, at the same time, the user ID; to query the user feature database (14) with the obtained user ID and the current search time to obtain the characteristic behavior information matching that user ID and that search time; to send the search keyword to the search engine (12) and store the retrieval results returned by the search engine (12); to retrieve and sort the stored retrieval results again according to the obtained characteristic behavior information; and to send the re-sorted retrieval results to the user terminal for display. The content analysis system comprises:
A data transmit-receive unit (131), configured to interact with the user terminal, receive the search keyword entered at the user terminal and send it to the search engine interface (132), and send the user ID to the time series analysis unit (133);
A search engine interface (132), configured to send the search keyword received from the data transmit-receive unit (131) to the search engine (12), and to receive the search results of the search engine (12) and send them to the retrieval data storage unit (135);
A retrieval data storage unit (135), configured to store the search results of the search engine (12) received from the search engine interface (132) and provide them to the retrieval analysis unit (134);
A time series analysis unit (133), configured to receive the user ID sent by the data transmit-receive unit (131), determine the current search time, query the user feature database (14) accordingly to obtain the characteristic behavior information corresponding to that user ID and the current search time, and provide it to the retrieval analysis unit (134);
A retrieval analysis unit (134), configured to receive the characteristic behavior information sent by the time series analysis unit (133), perform secondary retrieval filtering and/or sorting on the search results stored in the retrieval data storage unit (135) accordingly, and send the filtered and/or sorted retrieval results to the data transmit-receive unit (131) to be returned to the user terminal.
Wherein the user feature database (14) comprises:
A time period information table, which stores the time period numbers corresponding to the different time periods;
A characteristic behavior table, which stores the characteristic behavior keywords and/or characteristic behavior subordinate keywords corresponding to the user's different characteristic behavior numbers;
A matching table, which stores the characteristic behavior numbers corresponding to the user's different time period numbers.
Wherein the user feature database (14) further comprises a personal user information table, which stores the user's personal information.
The present invention also provides an information retrieval method in which the characteristic behavior information corresponding to a user ID in different time periods is stored in advance, the method further comprising the following steps:
A. The data transmit-receive unit (131) obtains the search keyword entered by the user and, at the same time, the user ID; it sends the search keyword entered at the user terminal to the search engine interface (132) and sends the user ID to the time series analysis unit (133);
The search engine interface (132) sends the search keyword received from the data transmit-receive unit (131) to the search engine (12); the search engine (12) searches the content index database (11) with the search keyword to obtain the original retrieval results and sends them to the search engine interface (132); the search engine interface (132) sends the received original retrieval results to the retrieval data storage unit (135) for storage;
B. The time series analysis unit (133) queries the user feature database (14) with the obtained user ID and the current search time, retrieves the characteristic behavior information corresponding to that user ID and that search time, and provides it to the retrieval analysis unit (134);
C. The retrieval analysis unit (134) receives the characteristic behavior information sent by the time series analysis unit (133) and, according to it, retrieves again among the original retrieval results of the search engine (12) stored in the retrieval data storage unit (135); it sends the retrieval results that contain the characteristic behavior information to the data transmit-receive unit (131), which displays the received retrieval results to the user with priority.
Wherein the step of obtaining the user ID comprises: receiving the user ID entered by the user through the user terminal; or receiving the user ID entered when the user logs in to the system.
Wherein the step of obtaining the current search time comprises: obtaining the current search time from the local server or from any computer device on the network.
Wherein different characteristic behavior information is assigned different priorities, and the second retrieval in step C further comprises: retrieving again among the original retrieval results returned by the search engine according to each piece of characteristic behavior information separately; and sorting the corresponding re-retrieved results according to the priority of the characteristic behavior information.
Wherein the characteristic behavior information comprises characteristic behavior keywords and/or characteristic behavior subordinate keywords.
As can be seen from the above, the scheme provided by the invention uses the user's personalized characteristic behavior for the corresponding time to perform secondary screening and filtering on the original result records that the search engine found for the keyword the user entered, and displays with priority the file records the user is really interested in, which improves the accuracy and efficiency with which users find relevant information.
Brief description of the drawings
Fig. 1 is a block diagram of the information retrieval system of the present invention.
Fig. 2 is a block diagram of the user feature database.
Fig. 3 is a block diagram of the content analysis system.
Fig. 4 is a flowchart of retrieval according to the present invention.
Detailed description of the embodiments
The present invention takes into account that a user has different characteristic behavior information in different time periods. Therefore, after the search engine obtains retrieval results, the results are processed according to the user's characteristic behavior information for the current time period, and the results that match that information are displayed to the user first. This improves the precision of search engine retrieval and brings the results offered to the user closer to the user's needs.
The present invention is described in detail below with reference to the accompanying drawings.
Fig. 1 shows the information retrieval system of the present invention, which comprises a content analysis system 13, a user feature database 14, a search engine 12, and a content index database 11, wherein:
The content analysis system 13 receives the user ID and the entered search keyword sent by the user terminal, obtains the current time from the local server, and queries the user feature database 14 accordingly to find the user's characteristic behavior for that period. It then retrieves again among, and filters, the pages returned by the search engine 12, so that the retrieved pages are presented to the user in the order of priority of the characteristic behavior preferences the user exhibits in that time period.
The user feature database 14 stores the user's characteristic behavior information, in particular the characteristic behavior information the user has in different time periods. This database is explained in detail later and is not described further here.
The search engine 12 is a text- and keyword-based search tool. After searching the existing content index database 11, it returns a list of pointers to the required files together with the file titles and, usually, some descriptive text excerpted from the files.
The content index database 11 is built by an automatic software program (such as a "crawler") that visits websites automatically, follows the hypertext links in them in turn, and extracts each file it encounters by so-called "keywords". The extracted data is stored in this database for the search engine 12 to access.
Fig. 2 shows an embodiment of the user feature database 14. The characteristic behavior information a user has in different time periods can be stored using, but is not limited to, the following tables. The personal user information table, time period information table, characteristic behavior table, and matching table are described in detail below.
The personal user information table stores the user's personal information, which may be the information the user entered at registration. Table 1 below shows a user information table:
User number | User name | Gender | ......
U001 | Zhang San | Male | ......
...... | ...... | ...... | ......
Table 1
The time period information table stores the time period numbers corresponding to the different time periods. Numbering the time periods makes database retrieval easier and also makes the definition of time periods more flexible. Table 2 below shows a time period information table:
Time period number | Time period
T001 | 0:00-1:00
...... | ......
Table 2
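The mapping from a clock time to a time period number can be sketched as follows. Table 2 only shows the row T001 for 0:00-1:00, so the sketch assumes, purely for illustration, that the remaining rows continue in one-hour steps up to T024; the function name `period_number` is hypothetical.

```python
from datetime import time

def period_number(t: time) -> str:
    """Map a clock time to a period number, assuming hourly periods:
    0:00-1:00 -> T001, 1:00-2:00 -> T002, ..., 23:00-24:00 -> T024."""
    return f"T{t.hour + 1:03d}"

print(period_number(time(0, 30)))
```

Because the patent notes that numbering makes the definition of time periods flexible, a real table could just as well use unequal periods; the lookup would then scan the stored ranges instead of computing the number arithmetically.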
The characteristic behavior table stores the characteristic behavior numbers corresponding to the user's different characteristic behavior keywords. A characteristic behavior keyword may also have subordinate keywords; all of these are characteristic behavior information. Table 3 below shows a characteristic behavior table:
User number | Characteristic behavior number | Characteristic behavior keyword | Characteristic behavior subordinate keywords | ......
U001 | C001 | Recreation | Electronic game, computer game ... | ......
U001 | C002 | Music | Classical, philharmonic ... | ......
...... | ...... | ...... | ...... | ......
Table 3
The matching table stores the characteristic behavior numbers corresponding to the user's different time period numbers. This table links Table 1, Table 2, and Table 3, that is, it establishes the relation between time periods and characteristic behavior keywords and subordinate keywords. Table 4 below shows a matching table:
User number | Time period number | Characteristic behavior number | Characteristic priority | ......
U001 | T001 | C001 | 9 | ......
U001 | T001 | C002 | 8 | ......
...... | ...... | ...... | ...... | ......
Table 4
Table 4 also includes a characteristic priority column, which identifies the priority of the user's different characteristic behaviors within a given time period. The example in Table 4 means that, for user U001 in time period T001, characteristic behavior C001 has priority 9, which is higher than the priority 8 of characteristic behavior C002; in other words, user U001 is more inclined to exhibit characteristic behavior C001 in time period T001.
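The lookup that joins the matching table to the characteristic behavior table might look like the sketch below. It holds the sample rows of Table 3 and Table 4 in memory; the in-memory layout and the names `BEHAVIORS`, `MATCHES`, and `behaviors_for` are assumptions made for illustration, since the patent does not prescribe a storage format.

```python
# Table 3 sample rows: (user number, behavior number) -> (keyword, subordinate keywords)
BEHAVIORS = {
    ("U001", "C001"): ("recreation", ["electronic game", "computer game"]),
    ("U001", "C002"): ("music", ["classical", "philharmonic"]),
}

# Table 4 sample rows: (user number, time period number, behavior number, priority)
MATCHES = [
    ("U001", "T001", "C001", 9),
    ("U001", "T001", "C002", 8),
]

def behaviors_for(user_id, period):
    """Return (priority, keywords) pairs for a user and period, highest priority first."""
    rows = [(prio, code) for u, p, code, prio in MATCHES if u == user_id and p == period]
    rows.sort(reverse=True)
    result = []
    for prio, code in rows:
        keyword, subordinates = BEHAVIORS[(user_id, code)]
        result.append((prio, [keyword, *subordinates]))
    return result

print(behaviors_for("U001", "T001"))
```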
The data stored in the user feature database 14 may be provided by a system that collects user service behavior characteristics. For an implementation of such a system, see the applicant's invention application "System and method for collecting user service behavior characteristics".
Fig. 3 shows a block diagram of the content analysis system 13, which comprises a data transmit-receive unit 131, a search engine interface 132, a time series analysis unit 133, a retrieval analysis unit 134, and a retrieval data storage unit 135, wherein:
The data transmit-receive unit 131 interacts with the user terminal. It receives the search keyword the user enters through the user terminal and sends it to the search engine interface 132, and it sends the obtained user ID to the time series analysis unit 133.
The search engine interface 132 interacts with the search engine 12. It sends the search keyword received from the data transmit-receive unit 131 to the search engine 12, and it receives the search results of the search engine 12 and sends them to the retrieval data storage unit 135.
The retrieval data storage unit 135 stores the search results of the search engine 12 received from the search engine interface 132 and provides them to the retrieval analysis unit 134 for analysis.
The time series analysis unit 133 receives the user ID sent by the data transmit-receive unit 131, obtains the current search time, queries the user feature database 14 accordingly to obtain the characteristic behavior keyword information corresponding to that user ID and search time, and provides it to the retrieval analysis unit 134. The characteristic behavior keyword information may include, but is not limited to, characteristic behavior keywords and characteristic behavior subordinate keywords.
The retrieval analysis unit 134 receives the characteristic behavior keyword information sent by the time series analysis unit 133, performs secondary retrieval filtering and/or sorting on the search results stored in the retrieval data storage unit 135 accordingly, and sends the filtered and/or sorted retrieval results to the data transmit-receive unit 131, which returns them to the user terminal for display to the user.
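The interaction of the five units can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the patented implementation: the search engine, the feature lookup, and the clock are injected as plain callables, results are plain strings, the second retrieval is a substring match, and the class and method names are hypothetical.

```python
class ContentAnalysisSystem:
    """Toy model of units 131-135; all collaborators are injected callables."""

    def __init__(self, search_engine, feature_lookup, clock):
        self.search_engine = search_engine    # stands in for search engine 12
        self.feature_lookup = feature_lookup  # stands in for user feature database 14
        self.clock = clock                    # source of the current search time
        self.store = {}                       # retrieval data storage unit 135

    def handle_query(self, user_id, keyword):
        # Units 131 and 132: forward the keyword and store the original results.
        self.store[user_id] = self.search_engine(keyword)
        # Unit 133: look up the characteristic keywords for this user and time.
        features = self.feature_lookup(user_id, self.clock())
        # Unit 134: second retrieval; results containing a characteristic keyword go first.
        results = self.store[user_id]
        preferred = [r for r in results if any(f in r for f in features)]
        others = [r for r in results if r not in preferred]
        return preferred + others

ORIGINAL = ["recreation park map", "computer game recreation", "recreation and music"]
system = ContentAnalysisSystem(
    search_engine=lambda kw: list(ORIGINAL),
    feature_lookup=lambda uid, hour: ["computer game", "music"] if (uid, hour) == ("U001", 0) else [],
    clock=lambda: 0,
)
RESULT = system.handle_query("U001", "recreation")
print(RESULT)
```

A user with no stored characteristic behavior simply receives the original results in their original order, which matches the behaviour of a conventional search engine.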
With reference to Fig. 3 and to the flowchart in Fig. 4 showing how the information retrieval system of the present invention performs retrieval, the retrieval method of the present invention is described in detail below. It comprises the following steps:
Step 401: The user enters search keywords into the search engine provided at the user terminal according to the information to be queried. The input may include Boolean operators between consecutive keywords (for example "and" or "or") or other operators the search engine recognizes.
In this example, suppose the user enters the search keyword "recreation" at a user terminal to request related information.
Step 402: This information is sent over the network to the content analysis system 13, where the data transmit-receive unit 131 obtains the user's query keyword. The data transmit-receive unit 131 also obtains the user's ID, which the user may enter through the user terminal or which may be recorded when the user logs in to the information retrieval system of the present invention.
Step 403: The data transmit-receive unit 131 sends the obtained keyword to the search engine interface 132 and sends the user ID to the time series analysis unit 133.
In this example, the data transmit-receive unit 131 sends the keyword "recreation" entered by the user to the search engine interface 132 and sends the obtained user ID U001 to the time series analysis unit 133.
Step 404: The search engine interface 132 sends the obtained query keyword to the search engine 12. The search engine 12 searches the content index database 11 for related information according to the keyword and returns the retrieval results to the search engine interface 132, which sends them to the retrieval data storage unit 135 for storage.
Step 405: The time series analysis unit 133 uses the obtained user ID and the current time information to find the matching characteristic behavior data in the user feature database 14 and sends it to the retrieval analysis unit 134. The time information may be provided by the local server hosting the content analysis system or by any computer device on the network; the local server is preferred here.
In this example, the time information yields the corresponding time period number T001. Using the user ID U001 and the time period number, the user's behavior preferences and priorities at that moment are retrieved from Table 4 of the user feature database 14: (C001, 9), (C002, 8), ... Table 3 then gives the user's characteristic behavior keywords and subordinate keywords at that moment: recreation, electronic game, computer game, ...; music, classical, philharmonic, ... These characteristic behavior keywords and their priorities are sent to the retrieval analysis unit 134.
Step 406: The retrieval analysis unit 134 uses the user ID to fetch from the retrieval data storage unit 135 the retrieval results (such as page information) already found for this user. It then uses the received characteristic behavior keywords and their priorities to perform a secondary retrieval and re-sort the results, so that the page information truly relevant to the user is displayed first.
In this example, when the secondary retrieval and sorting are performed, the retrieval first uses the higher-priority characteristic behavior keywords (recreation, electronic game, computer game, ...) and lists the resulting file information first. It then uses the lower-priority characteristic behavior keywords (music, classical, philharmonic, ...) and lists the resulting file information next. Original retrieval results that contain none of the characteristic behavior keywords are listed last. The retrieval process for these keywords is not described in detail here, since these techniques are included in every text retrieval system.
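The three-tier ordering of step 406 can be sketched as a stable sort on a tier number. The function name `rerank`, the substring matching, and the sample results are assumptions for illustration. In the demo the main keyword "recreation" is left out of the high-priority group, because every original result already contains the search keyword and would otherwise land in the first tier.

```python
def rerank(results, feature_groups):
    """Order results by the highest-priority keyword group they match.

    feature_groups is a list of (priority, keywords) pairs; results that match
    no group go last. sorted() is stable, so the original search engine order
    is preserved within each tier.
    """
    groups = sorted(feature_groups, key=lambda g: -g[0])

    def tier(text):
        for i, (_, keywords) in enumerate(groups):
            if any(k in text for k in keywords):
                return i
        return len(groups)

    return sorted(results, key=tier)

# Hypothetical original results for the search keyword "recreation".
results = [
    "travel recreation tips",
    "classical music and recreation",
    "computer game recreation site",
]
groups = [
    (8, ["music", "classical", "philharmonic"]),
    (9, ["electronic game", "computer game"]),  # main keyword omitted, see lead-in
]
print(rerank(results, groups))
```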
Step 407: The retrieval analysis unit 134 sends the retrieval results of the secondary retrieval and sorting to the data transmit-receive unit 131, which sends these results (such as page information) to the user terminal for display to the user.
The retrieval scheme above can be used in almost any information retrieval system to increase the search accuracy of its search engine, whether or not the engine is a conventional one. The invention also improves the accuracy of retrieving information from large databases regardless of the language of the text, for example Chinese, English, French, or German.
The above are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (8)

1. An information retrieval system, comprising a search engine (12) and a content index database (11) provided for the search engine to search, characterized in that it further comprises:
a user feature database (14), which stores the characteristic behavior information that a user exhibits in different time periods;
a content analysis system (13), configured to: obtain the search keyword entered at the user terminal and, at the same time, obtain the user ID; query the user feature database (14) according to the obtained user ID and the current search time to obtain the characteristic behavior information matching said user ID and said current search time; send the search keyword to the search engine (12) and store the retrieval result information found by the search engine (12); re-retrieve and sort the stored retrieval result information according to the obtained characteristic behavior information; and send the re-retrieved and sorted retrieval results to the user terminal for display; the content analysis system (13) comprising:
a data transmit-receive unit (131), configured to interact with the user terminal, receive the search keyword entered at the user terminal and send it to the search engine interface (132), and send the user ID to the time series analysis unit (133);
a search engine interface (132), configured to send the search keyword received from the data transmit-receive unit (131) to the search engine (12), and to receive the search results of the search engine (12) and send them to the retrieve data storage unit (135);
a retrieve data storage unit (135), configured to store the search results of the search engine (12) received from the search engine interface (132) and to provide them to the retrieval analysis unit (134);
a time series analysis unit (133), configured to receive the user ID sent by the data transmit-receive unit (131), determine the current search time, query the user feature database (14) accordingly to obtain the characteristic behavior information corresponding to said user ID and the current search time, and provide it to the retrieval analysis unit (134);
a retrieval analysis unit (134), configured to receive the characteristic behavior information sent by the time series analysis unit (133), perform secondary-retrieval filtering and/or sorting on said search results stored in the retrieve data storage unit (135) accordingly, and send the filtered and/or sorted retrieval results to the data transmit-receive unit (131) for return to the user terminal.
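The data flow among the units of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class and method names, the dictionary standing in for the user feature database (14), the use of the hour of day as the time period, the substring matching rule, and the `fake_engine` function are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a result is plain text; the search engine is any callable.
SearchEngine = Callable[[str], List[str]]
# Hypothetical stand-in for the user feature database (14):
# (user ID, hour of day) -> characteristic behavior keywords.
FeatureDB = Dict[Tuple[str, int], List[str]]


@dataclass
class ContentAnalysisSystem:
    search_engine: SearchEngine
    feature_db: FeatureDB
    stored_results: List[str] = field(default_factory=list)  # role of unit 135

    def time_analysis(self, user_id: str, now: datetime) -> List[str]:
        """Role of unit 133: keywords for this user at the current search time."""
        return self.feature_db.get((user_id, now.hour), [])

    def search_engine_interface(self, keyword: str) -> None:
        """Role of unit 132: forward the keyword and store the raw results."""
        self.stored_results = list(self.search_engine(keyword))

    def retrieval_analysis(self, keywords: List[str],
                           filter_only: bool = False) -> List[str]:
        """Role of unit 134: filter and/or sort the stored results."""
        def matches(result: str) -> bool:
            return any(k in result for k in keywords)

        if filter_only:
            return [r for r in self.stored_results if matches(r)]
        # sorted() is stable, so the engine's order is kept within each group.
        return sorted(self.stored_results, key=lambda r: not matches(r))

    def handle_request(self, user_id: str, keyword: str,
                       now: datetime) -> List[str]:
        """Role of unit 131: receive the request, return reordered results."""
        self.search_engine_interface(keyword)
        keywords = self.time_analysis(user_id, now)
        return self.retrieval_analysis(keywords)


def fake_engine(keyword: str) -> List[str]:
    # Hypothetical engine returning a fixed result list for illustration.
    return [f"{keyword} news", f"{keyword} stock prices", f"{keyword} recipes"]


system = ContentAnalysisSystem(fake_engine, {("alice", 9): ["stock"]})
morning = system.handle_request("alice", "apple", datetime(2005, 11, 1, 9, 30))
evening = system.handle_request("alice", "apple", datetime(2005, 11, 1, 20, 0))
```

In this sketch the morning request moves the result containing "stock" to the front, while the evening request, for which no characteristic behavior is stored, keeps the search engine's original order.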
2. The system according to claim 1, characterized in that said user feature database (14) comprises:
a time period information table, configured to store the time period numbers corresponding to the different time periods;
a characteristic behavior table, configured to store, for each of the user's characteristic behavior numbers, the keywords of the corresponding characteristic behavior and/or the subordinate keyword information of that characteristic behavior;
a matching table, configured to store the characteristic behavior numbers corresponding to the user's different time period numbers.
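One possible layout of the three tables in claim 2 is sketched below with Python's standard `sqlite3` module. The table names, column names, hour-based period boundaries, and sample rows are hypothetical; the claim only states what each table stores, not its schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Time period information table: a period number for each time period.
    CREATE TABLE time_period (period_no INTEGER PRIMARY KEY,
                              start_hour INTEGER, end_hour INTEGER);
    -- Characteristic behavior table: keywords for each behavior number.
    CREATE TABLE feature_behavior (behavior_no INTEGER, keyword TEXT);
    -- Matching table: behavior number for each user and period number.
    CREATE TABLE matching (user_id TEXT, period_no INTEGER, behavior_no INTEGER);

    INSERT INTO time_period VALUES (1, 8, 12), (2, 18, 23);
    INSERT INTO feature_behavior VALUES (10, 'stock'), (10, 'finance'), (20, 'film');
    INSERT INTO matching VALUES ('alice', 1, 10), ('alice', 2, 20);
""")


def keywords_for(user_id: str, hour: int) -> list:
    """Return the keywords matching a user ID and an hour of the day."""
    rows = conn.execute(
        """
        SELECT f.keyword
        FROM time_period AS t
        JOIN matching AS m ON m.period_no = t.period_no
        JOIN feature_behavior AS f ON f.behavior_no = m.behavior_no
        WHERE m.user_id = ? AND t.start_hour <= ? AND ? < t.end_hour
        ORDER BY f.keyword
        """,
        (user_id, hour, hour),
    ).fetchall()
    return [row[0] for row in rows]
```

Calling `keywords_for("alice", 9)` resolves the hour to period 1, the period to behavior 10, and the behavior to its keywords; an hour outside every stored period yields an empty list.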
3. The system according to claim 2, characterized in that said user feature database (14) further comprises: a personal user information table, configured to store the user's personal information.
4. An information retrieval method, characterized in that the characteristic behavior information corresponding to a user ID in different time periods is stored in advance, the method further comprising the following steps:
A. the data transmit-receive unit (131) obtains the search keyword entered by the user and, at the same time, obtains the user ID, sends the search keyword entered at the user terminal to the search engine interface (132), and sends the user ID to the time series analysis unit (133);
the search engine interface (132) sends the search keyword received from the data transmit-receive unit (131) to the search engine (12); the search engine (12) searches the content index database (11) according to the search keyword to obtain the original retrieval results and sends them to the search engine interface (132); and the search engine interface (132) sends the received original retrieval results to the retrieve data storage unit (135) for storage;
B. the time series analysis unit (133) queries the user feature database (14) according to the obtained user ID and the current search time, retrieves the characteristic behavior information corresponding to said user ID and said current search time, and provides it to the retrieval analysis unit (134);
C. the retrieval analysis unit (134) receives the characteristic behavior information sent by the time series analysis unit (133), re-retrieves, according to said characteristic behavior information, the original retrieval results found by the search engine (12) and stored in the retrieve data storage unit (135), and sends the retrieval results that contain said characteristic behavior information to the data transmit-receive unit (131); the data transmit-receive unit (131) displays the received retrieval results to the user preferentially.
5. The method according to claim 4, characterized in that the step of obtaining the user ID comprises: receiving the user ID entered by the user through the user terminal; or
receiving the user ID entered when the user logs in to the system.
6. The method according to claim 4, characterized in that the step of obtaining the current search time comprises: obtaining the current search time provided by the local server or by any computer device on the network.
7. The method according to claim 4, characterized in that different characteristic behavior information is assigned different priorities, and the re-retrieval in step C further comprises:
re-retrieving the original retrieval results found by the search engine according to each item of said different characteristic behavior information respectively;
sorting the corresponding re-retrieved results according to the priorities of said characteristic behavior information.
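The priority-based ordering of claim 7 might look like the following Python sketch. It assumes that each item of characteristic behavior information is a single keyword, that a smaller number means a higher priority, and that a result matching several keywords is placed in its highest-priority group; none of these details is specified by the claim, and the function name `rerank_by_priority` is hypothetical.

```python
from typing import Dict, List


def rerank_by_priority(results: List[str], priorities: Dict[str, int]) -> List[str]:
    """Order results by the best-priority keyword each one contains.

    `priorities` maps a characteristic behavior keyword to a priority,
    where a smaller number means a higher priority (an assumption).
    """
    # Rank given to results that match no keyword: after every matching group.
    lowest = max(priorities.values(), default=0) + 1

    def rank(result: str) -> int:
        hits = [p for kw, p in priorities.items() if kw in result]
        return min(hits, default=lowest)

    # Stable sort: ties keep the search engine's original order.
    return sorted(results, key=rank)


original = ["apple recipes", "apple film review", "apple stock prices", "apple news"]
reranked = rerank_by_priority(original, {"stock": 1, "film": 2})
```

Here the "stock" result comes first, the "film" result second, and the two non-matching results follow in the order the search engine returned them.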
8. The method according to claim 4, characterized in that said characteristic behavior information comprises: characteristic behavior keywords and/or characteristic behavior subordinate keywords.
CN200510117147XA 2005-11-01 2005-11-01 Information searching system and searching method Expired - Fee Related CN1858733B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN200510117147XA CN1858733B (en) 2005-11-01 2005-11-01 Information searching system and searching method
PCT/CN2006/002804 WO2007051397A1 (en) 2005-11-01 2006-10-20 An information retrieval system and information retrieval method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200510117147XA CN1858733B (en) 2005-11-01 2005-11-01 Information searching system and searching method

Publications (2)

Publication Number Publication Date
CN1858733A CN1858733A (en) 2006-11-08
CN1858733B true CN1858733B (en) 2012-04-04

Family

ID=37297642

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200510117147XA Expired - Fee Related CN1858733B (en) 2005-11-01 2005-11-01 Information searching system and searching method

Country Status (2)

Country Link
CN (1) CN1858733B (en)
WO (1) WO2007051397A1 (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100555283C (en) * 2006-12-12 2009-10-28 北京搜狗科技发展有限公司 A kind of directly at the dissemination method and the system of user's relevant information
CN101374044B (en) * 2007-08-21 2010-12-15 中国电信股份有限公司 Method and system for making business engine to obtain user identification
CN101996200B (en) * 2009-08-19 2014-03-12 华为技术有限公司 Method and device for searching file
US20110225139A1 (en) * 2010-03-11 2011-09-15 Microsoft Corporation User role based customizable semantic search
CN102207942A (en) * 2010-03-29 2011-10-05 上海博泰悦臻电子设备制造有限公司 Identification information matching-based search method and device
CN102207943A (en) * 2010-03-29 2011-10-05 上海博泰悦臻电子设备制造有限公司 Identification information matching-based search method and device
CN102253936B (en) * 2010-05-18 2013-07-24 阿里巴巴集团控股有限公司 Method for recording access of user to merchandise information, search method and server
TWI547888B (en) * 2010-08-27 2016-09-01 Alibaba Group Holding Ltd A method of recording user information and a search method and a server
CN101916295B (en) * 2010-08-27 2011-12-14 董方 Internet search system and method based on point-to-point network
CN101996246B (en) * 2010-11-09 2012-11-14 中国电信股份有限公司 Method and system for instant indexing
CN102117332A (en) * 2011-03-10 2011-07-06 辜进荣 Given time-based searching method
CN102184224A (en) * 2011-05-09 2011-09-14 李郁文 System and method for screening search results
CN102902695A (en) * 2011-07-29 2013-01-30 上海博泰悦臻电子设备制造有限公司 Navigation system as well as interest point searching method and device
CN102270243A (en) * 2011-08-25 2011-12-07 北京思博途信息技术有限公司 Information search method and system
CN102385636A (en) * 2011-12-22 2012-03-21 陈伟 Intelligent searching method and device
CN103368986B (en) 2012-03-27 2017-04-26 阿里巴巴集团控股有限公司 Information recommendation method and information recommendation device
CN102663048B (en) * 2012-03-29 2017-04-12 天津奇思科技有限公司 Method and device for providing search result
CN102779193B (en) * 2012-07-16 2015-05-13 哈尔滨工业大学 Self-adaptive personalized information retrieval system and method
CN103577049B (en) * 2012-07-24 2019-04-12 百度在线网络技术(北京)有限公司 A kind of method, apparatus and equipment for suggesting object for providing downloading
CN102880633A (en) * 2012-07-27 2013-01-16 四川长虹电器股份有限公司 Content pushing method based on characteristic word
CN103324675A (en) * 2013-05-24 2013-09-25 崔吉平 Internet individuation accurate information search and algorithm
CN103970848B (en) * 2014-05-01 2016-05-11 刘莎 A kind of universal internet information data digging method
CN104036003B (en) * 2014-06-16 2018-12-14 北京奇虎科技有限公司 search result integration method and device
CN104765867A (en) * 2015-04-23 2015-07-08 宁波市科技信息研究院 Collaborative manufacturing information sharing system
CN105045883B (en) * 2015-07-21 2020-12-25 惠州Tcl移动通信有限公司 Mobile terminal and searching method thereof
CN107885889A (en) * 2017-12-13 2018-04-06 聚好看科技股份有限公司 Feedback method, methods of exhibiting and the device of search result
CN108073726B (en) * 2018-01-29 2019-07-16 百度在线网络技术(北京)有限公司 Method, apparatus, storage medium and the terminal device of information retrieval push
CN109271577A (en) * 2018-09-13 2019-01-25 江苏站企动网络科技有限公司 A kind of network-based information retrieval method
CN110502692B (en) * 2019-07-10 2023-02-03 平安普惠企业管理有限公司 Information retrieval method, device, equipment and storage medium based on search engine
CN111143460A (en) * 2019-12-30 2020-05-12 智慧神州(北京)科技有限公司 Big data-based economic field data retrieval method and device and processor
CN111444377A (en) * 2020-04-15 2020-07-24 厦门快商通科技股份有限公司 Voiceprint identification authentication method, device and equipment
CN111914142B (en) * 2020-07-30 2023-07-04 重庆电子工程职业学院 Time-division memory information retrieval system
CN112104910B (en) * 2020-08-05 2023-02-03 苏宁智能终端有限公司 Video searching method, device and system
CN112445830B (en) * 2020-11-26 2024-05-14 湖南智慧政务区块链科技有限公司 Data analysis system based on block chain technology
CN116186078A (en) * 2023-03-15 2023-05-30 中国华能集团有限公司北京招标分公司 Data retrieval method and system
CN116578677B (en) * 2023-07-14 2023-09-15 高密市中医院 Retrieval system and method for medical examination information

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1319815A (en) * 1999-09-22 2001-10-31 Lg电子株式会社 Multimedia search and browse method using multimedia user simple document information structure
CN1460373A (en) * 2001-04-03 2003-12-03 皇家菲利浦电子有限公司 Method and apparatus for generating recommendations based on user preferences and environmental characteristics
WO2004090755A2 (en) * 2003-03-31 2004-10-21 Google Inc. System and method for providing preferred language ordering of search results

Also Published As

Publication number Publication date
WO2007051397A1 (en) 2007-05-10
CN1858733A (en) 2006-11-08

Similar Documents

Publication Publication Date Title
CN1858733B (en) Information searching system and searching method
JP5632124B2 (en) Rating method, search result sorting method, rating system, and search result sorting system
CN101882149B (en) Reorder and improve the dependency of Search Results
US6718365B1 (en) Method, system, and program for ordering search results using an importance weighting
US7499965B1 (en) Software agent for locating and analyzing virtual communities on the world wide web
US8166013B2 (en) Method and system for crawling, mapping and extracting information associated with a business using heuristic and semantic analysis
US7383299B1 (en) System and method for providing service for searching web site addresses
KR101361182B1 (en) Systems for and methods of finding relevant documents by analyzing tags
US7020679B2 (en) Two-level internet search service system
US7302646B2 (en) Information rearrangement method, information processing apparatus and information processing system, and storage medium and program transmission apparatus therefor
KR100645608B1 (en) Server of providing information search service using visited uniform resource locator log, and method thereof
US8166028B1 (en) Method, system, and graphical user interface for improved searching via user-specified annotations
US20010049674A1 (en) Methods and systems for enabling efficient employment recruiting
US20070271255A1 (en) Reverse search-engine
US8990193B1 (en) Method, system, and graphical user interface for improved search result displays via user-specified annotations
US8180751B2 (en) Using an encyclopedia to build user profiles
WO2001009747A2 (en) Apparatus and methods for collaboratively searching knowledge databases
CN1703696A (en) Data store for knowledge-based data mining system
JP4875911B2 (en) Content identification method and apparatus
EP1975816A1 (en) Electronic document retrieval system
JP4430598B2 (en) Information sharing system and information sharing method
CN101661490B (en) Search engine, client thereof and method for searching page
KR100671077B1 (en) Server, Method and System for Providing Information Search Service by Using Sheaf of Pages
CA2713932A1 (en) Automated boolean expression generation for computerized search and indexing
JP2000348061A (en) Semi-structured document information integrating retrieval device, semi-structured document information extracting device, its method and recording medium for storing its program

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120404