CN103678376B - Searching system and searching method - Google Patents

Searching system and searching method

Info

Publication number
CN103678376B
Authority
CN
China
Prior art keywords
keyword
webpage
user
search
noun
Prior art date
Legal status
Expired - Fee Related
Application number
CN201210344271.XA
Other languages
Chinese (zh)
Other versions
CN103678376A (en)
Inventor
张旭东
Current Assignee
Guangzhou Ziyou Network Technology Co ltd
Original Assignee
Guangzhou Zi Swim Network Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Guangzhou Zi Swim Network Technology Co Ltd
Priority to CN201210344271.XA
Publication of CN103678376A
Application granted
Publication of CN103678376B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques

Abstract

The invention provides a search method. The method includes: receiving a first keyword entered by a first user through a terminal device; selecting R related keywords from the webpages returned for the first keyword; selecting, as associated users, one or more other users who have previously used the first keyword for a web search; and presenting the search history of each associated user to the first user through the terminal device. The invention further provides a search system. With the system and method, a user can quickly find the webpages he or she needs with the help of the search histories of other users who have similar search objectives.

Description

Search system and method
Technical field
The present invention relates to the field of Internet technology, and in particular to a search system and method.
Background
The development of computer network technology has greatly increased the convenience with which people obtain information. Computer networks store massive amounts of information, and search engines are widely used to help people find the information they need.
Traditional search engines depend largely on the keyword entered by the user and return search results related to that keyword. However, because the volume of data in computer networks is enormous, a keyword search usually returns a very large number of webpages, sometimes even millions, and a considerable portion of them, although they contain the keyword the user entered, may be unrelated to the information the user wants. Filtering the required webpages out of so many results is therefore time-consuming and laborious for the user.
Summary of the invention
In view of the above, it is necessary to provide a search system and method that help a user quickly find the webpages he or she needs.
The search system includes: a keyword receiving module, for receiving a first keyword entered by a first user through a terminal device; a related keyword analysis module, for selecting R related keywords from the webpages returned for the first keyword; an associated user analysis module, for selecting one or more associated users from among other users who have previously used the first keyword to perform a web search; and a display module, for presenting the search history of each associated user to the first user through the terminal device.
The search method includes: receiving a first keyword entered by a first user through a terminal device; selecting R related keywords from the webpages returned for the first keyword; selecting one or more associated users from among other users who have previously used the first keyword to perform a web search; and presenting the search history of each associated user to the first user through the terminal device.
With the search system and method provided by the present invention, a user can quickly find the webpages he or she needs by drawing on the search histories of other users who have the same search purpose.
Brief description of the drawings
Fig. 1 is a diagram of the application environment of a preferred embodiment of the search system of the present invention.
Fig. 2 is a functional block diagram of a preferred embodiment of the search system of the present invention.
Fig. 3 is a flowchart of a preferred embodiment of the search method of the present invention.
Fig. 4 is a schematic diagram of the search histories of associated users.
Description of main reference numerals
The following detailed description further illustrates the present invention with reference to the above drawings.
Detailed description
Fig. 1 shows the application environment of a preferred embodiment of the search system of the present invention. The search system 10 runs on an application server 1. The application server 1 is communicatively connected through a network to multiple terminal devices 2 and a webpage server 3. The network may be the Internet, an intranet, or the like. Each terminal device 2 may be an electronic terminal device such as a personal computer, a tablet computer, a PDA (personal digital assistant), or a smartphone.
The webpage server 3 provides a browsing service for network information. After receiving a webpage request sent by the application server 1, the webpage server 3 sends the requested webpage to the corresponding terminal device 2 through the application server 1. In other embodiments of the present invention, the webpage server 3 and the application server 1 may also be combined into a single web server.
Fig. 2 is a functional block diagram of a preferred embodiment of the search system of the present invention.
The search system 10 of the present invention includes multiple functional modules composed of program code (described below). Its function is to receive a first keyword entered by a first user, derive several related keywords from that first keyword, use those related keywords to find one or more associated users who have the same search purpose, and present their search histories to the first user, helping the first user quickly find the webpages he or she needs. Having the same search purpose means having previously used the first keyword to perform a web search. A search history may include the keywords used and the webpages browsed in connection with those keywords.
The program code of the search system 10 is stored in a storage unit 20 of the application server 1 and is executed by a control unit 30 of the application server 1 to realize its functions. The storage unit 20 may be a storage device such as a smart media card, a secure digital card, or a flash card. The control unit 30 may be a central processing unit or the like.
In this embodiment, the functional modules of the search system 10 formed by program code include a keyword receiving module 100, a related keyword analysis module 101, an associated user analysis module 102, a display module 103, and a storage module 104. The functions of modules 100 to 104 are described below with reference to Fig. 3.
Fig. 3 is a flowchart of a preferred embodiment of the search method of the present invention. Depending on requirements, the order of the steps in the flowchart may be changed, and some steps may be omitted.
Step S01: the keyword receiving module 100 receives a first keyword that the first user enters into a search engine through the browser of a terminal device 2.
Step S02: the related keyword analysis module 101 obtains N webpages from the search results returned for the first keyword. It should be understood that, after the first keyword is entered, the search engine returns all webpages that contain the first keyword, and the related keyword analysis module 101 selects N of them. The N selected webpages may be the first N of all the returned webpages or may be selected according to a preset rule.
Step S03: the related keyword analysis module 101 applies a weighting algorithm to all nouns and noun phrases in the N webpages and calculates a weight for each noun or noun phrase. A noun is a word that denotes a person or thing, such as "computer", "user", or "network"; a noun phrase is a phrase made up of several nouns, or of a noun and its modifiers, such as "computer network" or "authorized user". The weighting algorithm used in the preferred embodiment is TF-IDF (term frequency-inverse document frequency). TF-IDF is a weighting technique used in information retrieval and data mining to assess how important a noun or noun phrase is to one webpage among the N webpages. The importance of a noun or noun phrase increases in proportion to the number of times it appears in a given webpage, but decreases in inverse proportion to the frequency with which it appears across the N webpages. For example, if a webpage contains 100 nouns or noun phrases in total and the noun "computer" appears 3 times, the term frequency (TF) of "computer" in that webpage is 3/100 = 0.03. If "computer" appears in 1,000 webpages and the total number of webpages N is 10,000,000, its inverse document frequency (IDF) is log10(10,000,000/1,000) = 4, so the weight of "computer" is 0.03 × 4 = 0.12.
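The weighting in step S03 can be illustrated with a short Python sketch. It is a minimal example under stated assumptions, not the patented implementation: the function names `term_frequency`, `inverse_document_frequency`, and `tf_idf_weights` are hypothetical, each page is assumed to be already reduced to a list of nouns and noun phrases, and a base-10 logarithm is assumed because it reproduces the value 4 in the worked example.

```python
import math
from collections import Counter


def term_frequency(term, page_terms):
    """Share of the page's nouns/noun phrases that are `term`."""
    if not page_terms:
        return 0.0
    return page_terms.count(term) / len(page_terms)


def inverse_document_frequency(pages_with_term, total_pages):
    """Base-10 log of (total pages / pages containing the term)."""
    return math.log10(total_pages / pages_with_term)


def tf_idf_weights(pages):
    """Return one {term: weight} dict per page for a list of tokenised pages."""
    doc_freq = Counter()
    for terms in pages:
        doc_freq.update(set(terms))  # count each term once per page
    return [
        {
            term: term_frequency(term, terms)
            * inverse_document_frequency(doc_freq[term], len(pages))
            for term in set(terms)
        }
        for terms in pages
    ]


# Worked example from the text: "computer" is 3 of the 100 terms on a page
# and appears in 1,000 of 10,000,000 pages.
example_page = ["computer"] * 3 + ["other"] * 97
example_weight = term_frequency("computer", example_page) * inverse_document_frequency(
    1_000, 10_000_000
)
print(round(example_weight, 2))  # 0.12
```

Note that a term appearing in every one of the N pages receives an IDF of 0 under this formula, so it can never be selected as a related keyword on weight alone.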
Other embodiments of the present invention may instead use a TF (term frequency) weighting algorithm alone, that is, without considering how frequently a noun or noun phrase appears across the N webpages. Other embodiments may also use a Boolean weighting algorithm, which here means randomly extracting several nouns or noun phrases from a webpage and calculating how frequently each appears in that webpage.
Other embodiments of the present invention may also skip calculating the weights of all nouns or noun phrases in the webpages and instead use a social bookmark tagging method: obtain the tags that each user has applied to a given webpage and calculate how frequently each tag is used. For example, user b applied the tag "computer" when bookmarking webpage a, user c applied the tag "data processing equipment" when bookmarking webpage a, and so on; the related keyword analysis module 101 then calculates the frequency with which each tag is used and takes it as the weight of that tag.
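The tag-based alternative can be sketched in a few lines of Python. This is only an illustration: the `(user, page, tag)` record layout and the function name `tag_weights` are assumptions, and "frequency" is interpreted here as each tag's share of all tags applied to the same webpage, which the text does not specify.

```python
from collections import Counter, defaultdict


def tag_weights(bookmarks):
    """Map each page to {tag: relative frequency} from (user, page, tag) records."""
    counts = defaultdict(Counter)
    for _user, page, tag in bookmarks:
        counts[page][tag] += 1
    return {
        page: {tag: n / sum(tags.values()) for tag, n in tags.items()}
        for page, tags in counts.items()
    }


bookmarks = [
    ("user b", "webpage a", "computer"),
    ("user c", "webpage a", "data processing equipment"),
    ("user d", "webpage a", "computer"),  # hypothetical third user
]
weights = tag_weights(bookmarks)
```

With these records, "computer" accounts for two of the three tags on webpage a and "data processing equipment" for one, so "computer" would receive the larger weight.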
Step S04: the related keyword analysis module 101 sorts all nouns or noun phrases in the N webpages by weight and, based on this ordering, selects the R nouns or noun phrases with the highest weights as related keywords. When the nouns or noun phrases are sorted by weight in descending order, the related keyword analysis module 101 selects the first R; when they are sorted in ascending order, it selects the last R.
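A minimal Python sketch of the selection in step S04 follows. It assumes the weights have already been merged into a single `{term: weight}` dictionary; how weights from the N webpages are combined into one value per term is not stated in the text, and the function name `select_related_keywords` and the sample weights are hypothetical.

```python
import heapq


def select_related_keywords(weights, r):
    """Return the r terms with the highest weights, highest first."""
    return heapq.nlargest(r, weights, key=weights.get)


weights = {"computer": 0.12, "network": 0.30, "user": 0.05, "server": 0.20}
top_two = select_related_keywords(weights, 2)

# Equivalent selection from an ascending sort: take the last R entries.
ascending = sorted(weights, key=weights.get)
assert set(ascending[-2:]) == set(top_two)
```

Using `heapq.nlargest` avoids sorting the whole vocabulary when R is small compared with the number of candidate terms; a full sort followed by slicing gives the same result.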
Step S05: the associated user analysis module 102 looks up other users who have previously used the first keyword to perform a web search. For example, if the first keyword entered by the first user in step S01 above is "computer", the associated user analysis module 102 looks up other users who have previously searched with "computer" as the keyword.
Step S06: the associated user analysis module 102 selects one of the other users who have previously searched with the first keyword and obtains the webpages that the selected user browsed among the search results returned for the first keyword. The other users who have previously searched with the first keyword can be regarded as having the same search purpose as the first user. For example, suppose the search results returned when the selected user searched with the keyword "computer" include webpage a, webpage b, webpage c, webpage d, and webpage e, and this user clicked on and browsed webpage a and webpage c; the associated user analysis module 102 then obtains webpage a and webpage c.
Step S07: the associated user analysis module 102 obtains all nouns or noun phrases in the webpages browsed by the selected user, intersects them with the R related keywords, counts the number S of keywords in the intersection, and calculates the evaluation value V of the selected user, where V = S/R. For example, suppose the R related keywords are Keyword 1, Keyword 2, Keyword 3, Keyword 4, and Keyword 5, five in total, and the nouns or noun phrases in the webpages browsed by the selected user are Keyword 6, Keyword 1, Keyword 8, Keyword 3, Keyword 4, Keyword 7, Keyword 9, and Keyword 10. The intersection is then Keyword 1, Keyword 3, and Keyword 4, so S = 3 and the evaluation value of the selected user is V = 3/5.
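The evaluation value in step S07 is a set intersection followed by a division, as the following Python sketch shows. The function name `evaluation_value` is hypothetical, and the sketch assumes the R related keywords are distinct so that R equals the size of the keyword set.

```python
def evaluation_value(related_keywords, browsed_terms):
    """Return V = S / R, where S is the size of the intersection."""
    related = set(related_keywords)
    if not related:
        raise ValueError("at least one related keyword is required")
    s = len(related & set(browsed_terms))
    return s / len(related)


# Worked example from the text.
related = [f"Keyword {i}" for i in (1, 2, 3, 4, 5)]
browsed = [f"Keyword {i}" for i in (6, 1, 8, 3, 4, 7, 9, 10)]
v = evaluation_value(related, browsed)
print(v)  # 0.6
```

V always lies between 0 and 1: it is 0 when none of the related keywords appear in the browsed webpages and 1 when all of them do.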
Step S08: the associated user analysis module 102 determines whether any of the other users has not yet had an evaluation value calculated. If so, the flow returns to step S06. Otherwise, if evaluation values have been calculated for all of the other users, the flow proceeds to step S09.
Step S09: the associated user analysis module 102 selects the one or more other users with relatively high evaluation values as associated users, and the display module 103 presents the search history of each associated user in the browser of the first user's terminal device 2. The search history includes the other keywords, besides the first keyword, that each associated user used during a search session, and the webpages browsed during that session.
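The loop over steps S06 to S08 and the selection in step S09 can be sketched together in Python. This is an illustration only: the function name `select_associated_users`, the `max_users` cut-off, and the sample data are assumptions, since the text says only that one or more users with relatively high evaluation values are chosen.

```python
def select_associated_users(related_keywords, browsed_terms_by_user, max_users=3):
    """Score every candidate user with V = S / R and keep the best max_users."""
    related = set(related_keywords)
    if not related:
        raise ValueError("at least one related keyword is required")
    scores = {
        user: len(related & set(terms)) / len(related)
        for user, terms in browsed_terms_by_user.items()
    }
    # Python's sort is stable, so users with equal scores keep their input order.
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:max_users]


related = ["k1", "k2", "k3", "k4", "k5"]
browsed = {
    "a": ["k1", "k3", "k4", "k6"],        # S = 3
    "b": ["k1", "k2", "k3", "k4", "k9"],  # S = 4
    "c": ["k7"],                          # S = 0
    "d": ["k2"],                          # S = 1
}
chosen = select_associated_users(related, browsed, max_users=2)
```

Here user b (V = 0.8) and user a (V = 0.6) would be selected. A threshold on V could be used instead of a fixed count; the text leaves that choice open.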
Fig. 4 is a schematic diagram of the search histories of associated users a, b, and c. When the first user searches with the first keyword, associated users a, b, and c, who previously searched with the first keyword and have relatively high evaluation values, can be found. As Fig. 4 shows, when associated user a searched with the first keyword, user a browsed webpage A, webpage B, and webpage C among the webpages returned for the first keyword. In the same search session, associated user a also searched with a second keyword and browsed webpage D among the webpages returned for the second keyword. Similarly, associated user b browsed webpage E and webpage F among the webpages returned for the first keyword, and in the same search session also searched with a third keyword and browsed webpage G and webpage H among the webpages returned for the third keyword. Associated user c did not browse any of the webpages returned for the first keyword, but in the same search session browsed webpage I, webpage J, and webpage K among the webpages returned for a fourth keyword. The webpages browsed by the associated users (webpage A, webpage B, and so on) can be presented in the browser of the first user's terminal device 2 as webpage snapshots or webpage titles. In addition, the first webpage, second webpage, third webpage, fourth webpage, and so on shown in Fig. 4 are the webpages returned by the search engine for the first keyword.
From the search history of each associated user, the first user can learn which other keywords can be used for retrieval. For example, if the first keyword used by the first user is "computer", then from the search histories of the associated users the first user can learn that searches can also be made with the second keyword "computer", the third keyword "data processing equipment", and so on. The webpages that the associated users browsed may also contain important information. In addition, the search history may include the length of time each webpage was browsed; the longer a webpage was browsed, the more important it is taken to be. The search history therefore helps the first user quickly find the webpages he or she needs.
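One way to represent such a search history, including the viewing time of each webpage, is sketched below in Python. The class and function names (`PageVisit`, `KeywordSearch`, `suggested_keywords`, `pages_by_viewing_time`) are hypothetical, and the viewing times are made-up values; only the structure of associated user a's history follows Fig. 4.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PageVisit:
    title: str
    seconds_viewed: float


@dataclass
class KeywordSearch:
    keyword: str
    visits: List[PageVisit] = field(default_factory=list)


def suggested_keywords(history, first_keyword):
    """Keywords in the history other than the one the first user already used."""
    return [s.keyword for s in history if s.keyword != first_keyword]


def pages_by_viewing_time(history):
    """Titles of all visited pages, longest-viewed first."""
    visits = [v for s in history for v in s.visits]
    ranked = sorted(visits, key=lambda v: v.seconds_viewed, reverse=True)
    return [v.title for v in ranked]


# Associated user a from Fig. 4; the viewing times are invented for illustration.
history_a = [
    KeywordSearch(
        "first keyword",
        [PageVisit("webpage A", 40), PageVisit("webpage B", 95), PageVisit("webpage C", 10)],
    ),
    KeywordSearch("second keyword", [PageVisit("webpage D", 120)]),
]
```

With this data, `suggested_keywords(history_a, "first keyword")` yields the second keyword, and `pages_by_viewing_time(history_a)` lists webpage D first because it was viewed longest.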
The method may also include a further step in which the storage module 104 records the keywords used and the webpages browsed by the first user in this search into the storage unit 20 of the application server 1, so as to assist other users in their later searches.
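The recording step, together with the lookup it enables in step S05, can be sketched as a small in-memory store. This is an assumption-laden illustration: the class name `SearchHistoryStore` and its methods are hypothetical, and a real system would persist the data rather than keep it in dictionaries.

```python
from collections import defaultdict


class SearchHistoryStore:
    """Keeps, per user, the keywords used and the pages browsed for each keyword."""

    def __init__(self):
        self._pages = defaultdict(lambda: defaultdict(list))  # user -> keyword -> pages
        self._users_by_keyword = defaultdict(set)             # keyword -> users

    def record(self, user, keyword, pages_viewed):
        """Store one search: the keyword and any pages browsed from its results."""
        self._pages[user][keyword].extend(pages_viewed)
        self._users_by_keyword[keyword].add(user)

    def users_who_searched(self, keyword, exclude=None):
        """Other users who have used `keyword`, as needed for step S05."""
        users = self._users_by_keyword.get(keyword, set())
        return sorted(u for u in users if u != exclude)

    def history_of(self, user):
        """Copy of the user's history as {keyword: [pages]}."""
        return {k: list(p) for k, p in self._pages.get(user, {}).items()}


store = SearchHistoryStore()
store.record("a", "computer", ["webpage A", "webpage B", "webpage C"])
store.record("a", "second keyword", ["webpage D"])
store.record("c", "computer", [])  # searched but browsed nothing
store.record("first user", "computer", ["webpage A"])
others = store.users_who_searched("computer", exclude="first user")
```

Indexing users by keyword makes the step S05 lookup a single dictionary access, at the cost of storing each keyword twice.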
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solution of the present invention and not to limit it. Although the present invention has been described in detail with reference to preferred embodiments, those of ordinary skill in the art will understand that the technical solution of the present invention may be modified or equivalently replaced without departing from the spirit and scope of the technical solution of the present invention.

Claims (4)

1. A search system, characterized in that the system comprises:
a keyword receiving module, for receiving a first keyword entered by a first user through a terminal device;
a related keyword analysis module, for selecting R related keywords from the webpages returned for the first keyword;
an associated user analysis module, for selecting one or more associated users from among other users who have previously used the first keyword to perform a web search, wherein the associated users are selected by the following method:
obtaining the webpages that each of the other users browsed among the search results returned for the first keyword;
obtaining all nouns or noun phrases in the browsed webpages, intersecting them with the R related keywords, and counting the number S of keywords in the intersection;
calculating an evaluation value V for each of the other users, where V = S/R; and
selecting the one or more other users with relatively high evaluation values as associated users; and
a display module, for presenting the search history of each associated user to the first user through the terminal device.
2. The search system as claimed in claim 1, characterized in that the R related keywords are selected by the following method:
obtaining N webpages from the search results returned for the first keyword;
applying a weighting algorithm to all nouns or noun phrases in the N webpages to calculate a weight for each noun or noun phrase; and
sorting all nouns or noun phrases in the N webpages by weight and, based on this ordering, selecting the R nouns or noun phrases with relatively high weights as related keywords.
3. A search method, characterized in that the method comprises:
a keyword receiving step: receiving a first keyword entered by a first user through a terminal device;
a related keyword analysis step: selecting R related keywords from the webpages returned for the first keyword;
an associated user analysis step: selecting one or more associated users from among other users who have previously used the first keyword to perform a web search, wherein the associated user analysis step comprises:
obtaining the webpages that each of the other users browsed among the search results returned for the first keyword;
obtaining all nouns or noun phrases in the browsed webpages, intersecting them with the R related keywords, and counting the number S of keywords in the intersection;
calculating an evaluation value V for each of the other users, where V = S/R; and
selecting the one or more other users with relatively high evaluation values as associated users; and
a display step: presenting the search history of each associated user to the first user through the terminal device.
4. The search method as claimed in claim 3, characterized in that the related keyword analysis step comprises:
obtaining N webpages from the search results returned for the first keyword;
applying a weighting algorithm to all nouns or noun phrases in the N webpages to calculate a weight for each noun or noun phrase; and
sorting all nouns or noun phrases in the N webpages by weight and, based on this ordering, selecting the R nouns or noun phrases with relatively high weights as related keywords.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210344271.XA CN103678376B (en) 2012-09-17 2012-09-17 Searching system and searching method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210344271.XA CN103678376B (en) 2012-09-17 2012-09-17 Searching system and searching method

Publications (2)

Publication Number Publication Date
CN103678376A (en) 2014-03-26
CN103678376B (en) 2017-02-08

Family

ID=50315961

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210344271.XA Expired - Fee Related CN103678376B (en) 2012-09-17 2012-09-17 Searching system and searching method

Country Status (1)

Country Link
CN (1) CN103678376B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110163719A (en) * 2019-04-15 2019-08-23 深圳壹账通智能科技有限公司 Information-pushing method, device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1808423A (en) * 2005-01-17 2006-07-26 佳能信息技术(北京)有限公司 Webpage search display method and its client device
CN1889079A (en) * 2006-07-27 2007-01-03 唐晨辉 User cooperative searching engine
CN101004749A (en) * 2006-12-26 2007-07-25 朱莉君 Method of constructing communication platform for Internet users
CN102567409A (en) * 2010-12-31 2012-07-11 珠海博睿科技有限公司 Method and device for providing retrieval associated word

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8412706B2 (en) * 2004-09-15 2013-04-02 Within3, Inc. Social network analysis
US20070100798A1 (en) * 2005-10-31 2007-05-03 Shyam Kapur Community built result sets and methods of using the same

Also Published As

Publication number Publication date
CN103678376A (en) 2014-03-26

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160918

Address after: 518000 Guangdong Province, Shenzhen New District of Longhua City, Dalang street, Hua Sheng Lu Yong Jingxuan commercial building 1608

Applicant after: Jinyang Shenzhen sea Network Intelligent Technology Co.,Ltd.

Address before: 518109 Guangdong city of Shenzhen province Baoan District Longhua Town Industrial Zone tabulaeformis tenth East Ring Road No. 2 two

Applicant before: HONG FU JIN PRECISION INDUSTRY (SHENZHEN) Co.,Ltd.

Applicant before: HON HAI PRECISION INDUSTRY Co.,Ltd.

C41 Transfer of patent application or patent right or utility model
CB02 Change of applicant information

Inventor after: Zhang Xudong

Inventor before: Li Zhongyi

Inventor before: Ye Jianfa

Inventor before: Liu Yuecen

Inventor before: Lu Junqi

COR Change of bibliographic data
TA01 Transfer of patent application right

Effective date of registration: 20161214

Address after: 510000 Guangdong, Guangzhou province Jiang Yan Road, No. 31, No. two, accounting for the construction of the floor of the self - made 271, No. 33, No.

Applicant after: Guangzhou Ziyou Network Technology Co.,Ltd.

Address before: 518000 Guangdong Province, Shenzhen New District of Longhua City, Dalang street, Hua Sheng Lu Yong Jingxuan commercial building 1608

Applicant before: Jinyang Shenzhen sea Network Intelligent Technology Co.,Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170208

Termination date: 20210917
