CN101271464A - Search method of internet search engine - Google Patents

Search method of internet search engine Download PDF

Info

Publication number
CN101271464A
CN101271464A CNA2007101780759A CN200710178075A CN101271464A CN 101271464 A CN101271464 A CN 101271464A CN A2007101780759 A CNA2007101780759 A CN A2007101780759A CN 200710178075 A CN200710178075 A CN 200710178075A CN 101271464 A CN101271464 A CN 101271464A
Authority
CN
China
Prior art keywords
product
data
substring
search engine
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2007101780759A
Other languages
Chinese (zh)
Other versions
CN100557610C (en
Inventor
王双
吴爱华
苗宇枫
谌谦
李建锋
徐光美
吴柏林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing The9 livable Property Co.,Ltd.
Guangdong Fanzai Wireless RFID Public Technology Support Co.,Ltd.
Original Assignee
BEIJING NINETOWNS INTERNET TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING NINETOWNS INTERNET TECHNOLOGY Co Ltd filed Critical BEIJING NINETOWNS INTERNET TECHNOLOGY Co Ltd
Priority to CNB2007101780759A priority Critical patent/CN100557610C/en
Publication of CN101271464A publication Critical patent/CN101271464A/en
Application granted granted Critical
Publication of CN100557610C publication Critical patent/CN100557610C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The present invention provides a search method by an Internet search engine. The method implements the search of product data by a search system consisting of a downloader, a product database server, a product webpage data server, a word segmentation device, an indexer, an index database and a query device. The steps of the method includes obtaining the product webpage data by the downloader; processing the product phrases according to the data of the product database by the word segmentation device and the indexer; obtaining the product webpage with relevant data included and establishing the data index; inputting the user query by the query device, processing the product phrases according to the data of the product database server to obtain the relevant data and generating the query result. The search method is mainly used in the product search engine system of B2B vertical search.

Description

A kind of searching method of internet search engine
Technical field
The present invention relates to computer networking technology, particularly relate to a kind of searching method of internet search engine.
Background technology
The development of search engine technique is the information digitalization of formation and the inevitable outcome of data networkization along with the continuous progress of electronic technology.An outstanding search engine can in time provide needed information to the user, and to accomplish this point just need one fast, high-quality, searching method is supported efficiently.The Google search engine relies on its Page Rank mechanism and convergence algorithm to be in the leading position in this field always.The search engine of Google company is the doctoral candidate SergeyBrin of Stanford University and the prototype system that Lawrence Page realizes at first, has developed into one of search engine best on the internet now.The architecture of Google is similar to traditional search engine, and it is handled with different being in the ordering of webpage having been carried out based on authority's value of traditional search engine maximum, makes most important webpage appear at result's foremost.Google goes out the Page Rank value of webpage by PageRank unit algorithm computation, thereby determines the appearance position of webpage in result set, the high more webpage of Page Rank value, and the position that occurs in the result is forward more.
With respect to general search engines such as Google, Baidu, professional B2B (Business to Business) the B2B information that vertical search provided more precisely, more professional and have more the degree of depth; And with respect to traditional B2B portal website, professional B2B vertical search can provide more comprehensively, more objective, the information content of diversification more.In view of this, professional B2B vertical search be subjected to enterprise customer's favor just day by day, and this has also been established the foundation place that ecommerce B2B technology mode is imbued with vitality as the product of internet fast development, segmentation.
Similar with traditional search engines, the results page that need return based on the vertical search engine of B2B E-commerce is the information that the user is concerned about.And obviously be not suitable for the demand of B2B specialty vertical search for the name arranging technology that traditional search engines is searched in E-business applications, the rank as a result of searched page should not go to consider to be linked to this page hyperlink quantity and mostly are specialized informations that those and enterprise commerce are closely connected.Above-mentioned application demand is arranged just, be suggested based on the vertical search engine method of product quality algorithm and be used for the quality of comprehensive evaluation Search Results, and carry out rank in results page, to improve the user search quality and to help the offshore purchase merchant to screen the high-quality supplier fast, accurately.
Needing to intersperse among on the internet product and the related data on each independent website based on the product search service of internet collects, unified retrieval service is provided, therefore the search function of product data is very important for whole business, search method should satisfy the such particular requirement of retrieval of product data, has very high performance again.
Product search service based on the internet is a kind of vertical search service, the field of vertical search is had very strong limited, need collect at the relevant data in field as far as possible comprehensively, and provide the retrieval of concentrating the degree of depth, and general internet universal search method does not have such characteristics, so both are different to the requirement of searching method.Generally, the searching method in vertical search field will to have especially concern ability at the data in field.Specific to the internet product searching service, such requirement is embodied in two aspects: first aspect is will treat with a certain discrimination product data in retrieval, and is limited with the embodiment field; Second aspect is the complicated query of wanting to handle about product, to satisfy the requirement of comprehensive precise search.In addition, the data that the internet hunt service needed is faced are magnanimity, and need deal with a large amount of concurrent search, therefore whole search system performance are had very high requirement.
Vertical search engine be relative universal search engine contain much information, inquire about new search engine service pattern inaccurate, that the degree of depth is not enough etc. puts forward, by information that certain value is arranged and the related service that provides at a certain specific area, a certain specific crowd or a certain particular demands.Its characteristics are exactly " special, smart, dark ", and have the industry color, the magnanimity information disordering of the universal search engine of comparing, and vertical search engine then seems absorbed more, concrete and gos deep into.Can briefly become is the industry-specific division of labor of searching engine field.Numerous professional websites, industry website stand-alone service are in the success of internet, and the general layout that has exactly proved the internet should be many-sided.The character of universal search engine has determined its precision information requirement that can not satisfy special dimension, special population service.Market demand diversification has determined the service mode of search engine segmentation will occur, provides accurate more industry service mode at different industries.
Summary of the invention
The object of the present invention is to provide a kind of characteristics that adapt to internet vertical product search, realize high-performance, the searching method of high-precision search engine.
For achieving the above object, on the internet provided by the invention based on the searching method of knowledge base by containing by downloader, the product knowledge database server, the product web page data server, participle device and index, index data base, the search system that requestor is formed, carry out the search of product data, the step of this method comprises for the product original web page information on the internet, obtain the product web page data by downloader, according to the data in the product knowledge database server by participle device and index treatment product webpage and create data directory and write index data base, corresponding product data webpage writes the product web page data server, by requestor input user inquiring and according to generated query result after the data processing in the product knowledge database server, describedly comprise the steps: by participle device and index treatment product web data according to the data in the product knowledge database server
A. obtain the original web page text,
B. according to the product data in the product knowledge database server web page text is carried out the longest coupling of forward,
C. whether coupling is successful among the determining step b,
If d. the match is successful, then occurrence is carried out the data phrase and handle and to obtain the substring line ordering of going forward side by side,
E. each substring in the steps d is calculated correlativity, and sets up index and write index data base,
The described processing by requestor input user inquiring according to the data in the product knowledge database server comprises the steps:
F. read in the query string of user's input,
G. query string is carried out the forward maximum match, generates the occurrence set,
H. occurrence set carrying out product phrase is handled, is generated substring set and ordering, generate effective substring sequence according to each substring correlativity,
I. each substring in effective substring sequence is obtained the web data set that matches successively, and according to correlativity size ordering output.
The searching method of internet search engine of the present invention, the product data in the described product knowledge database server comprise product attribute data and product business data.
The searching method of internet search engine of the present invention, described according to the data in the product knowledge database server by increasing following steps in described step h in the requestor input user inquiring treatment step and the step I:
J. each element in effective substring sequence is determined the true centric speech,
If k. have preposition in the true centric speech, determine that then centre word is a most left preposition speech before, if there is not preposition in the true centric speech, determine that then centre word is last speech,
L. centre word and true centric speech are generated the substring sequence of expansion as the effective substring sequence in the described step I.
The searching method of internet search engine of the present invention, described phrase are treated to according to the inner structure of product speech and carry out the multiple-segmentation processing.
The searching method of internet search engine of the present invention, described ordering are order or inverted order.
The searching method of internet search engine of the present invention, described substring index order method are more important than short substring according to long substring, the position by by the substring substring important method that keeps left than the position draw.
The searching method of internet search engine of the present invention, substring correlativity size is R (t) * I (n) described in the described step e in described participle device and the index treatment product web data treatment step.
The searching method of internet search engine of the present invention, described is R (t) * I (n) according to the data in the product knowledge database server by the described substring correlativity among the described step h in the requestor input user inquiring treatment step.
The present invention is for realizing the searching method of product vertical search function in the internet product searching service, utilizes in the product knowledge database server and comprehensive data such as online product information quality, is fit to the particular requirement of internet product retrieval; Adopt two stage retrieval and search framework, had higher search and search efficiency; When index foundation and dynamic response retrieval, all adopted product phrase treatment technology, can handle long complexity retrieval string, method of the present invention is applied to internet B2B E-commerce vertical search, the data that analysis-by-synthesis user submits to and the network download device is gathered, the product quality grade point of the data that calculating is collected, and in view of the above Search Results is sorted, make most important webpage appear at result's foremost, the accuracy and the search quality of search have been improved, make search engine more help the user and use, obtain satisfied result for retrieval.
Be elaborated with reference to accompanying drawing below in conjunction with embodiment, so that purpose of the present invention, feature and advantage are had deep understanding.
Description of drawings
Fig. 1 is the related system principle synoptic diagram of searching method of internet search engine of the present invention;
Fig. 2 carries out the method flow diagram that the product phrase is handled during for the participle device of the searching method of internet search engine of the present invention and index work;
Fig. 3 is the method for work process flow diagram during for the requestor dynamic response of the searching method of internet search engine of the present invention;
Fig. 4 is the web data correlativity synoptic diagram of a specific embodiment of the searching method of internet search engine of the present invention;
Fig. 5 carries out the method flow diagram that correlativity is judged for the participle device and the index of the searching method of internet search engine of the present invention;
Fig. 6 is the requestor dynamic response work detailed method process flow diagram of the searching method of internet search engine of the present invention.
Embodiment
With embodiment technical scheme is elaborated below.
With reference to Fig. 1, the related system of the searching method of internet search engine of the present invention is by downloader, the product knowledge database server, and search engine, the product web page data server is formed.Downloader is responsible for the work of obtaining of info web; The product knowledge database server provides search engine needed product speech, product attribute speech, product classification speech and other needed relevant product information data; Search engine is further by the index creation module, and index data base is inquired about input processing and result-generation module and formed.Index creation module in the search engine comprises participle device and index, participle device and index use together, the web page contents that they are responsible for obtaining carries out the processing of product phrase and carries out index automatically, and the position and the frequency computation part weights that in webpage, occur by speech, deposit product phrase result in index data base then, whole webpage obtains work and indexing service upgrades whole index data base and product web page data server after finishing; Requestor at first carries out the product phrase to the information of user's input to be handled, and retrieve the record that all comprise term, by calculating webpage weight and rank to the query note row set computing of going forward side by side of sorting, as union, intersection operation, the summary info that extracts each webpage at last from the product web page data server feeds back to inquiring user.
Table 1:
OEM?1GB?MP3?Player Detailed?Product?Description Features:1)Supports?animated?menu,synchronized?lyric2)FM stereo?recording?and?voice?mixing3)Supports?MP3,WAV,ASF and?WMA?music?format?files4)Firmware?upgrade?function5)Wear resistant?mirror?surface6)Supports?8EQ?modes:pop,rock,class, natural,DBB,Jazz,soft,user7)OSD?Multi-language:supports Chinese?GB,Chinese?Big5and?English?display8)A-B?repeat, repeat?1,repeat?all9)Driverless?installation?for?Windows Me/2000/XP10)Memory:built-in?128/256/512MB?flash memory11)Digital?voice?recording:128M-8hrs/256M-16hrs/ 512M-32hrs12)Playtime:8hrs13)Recording?fbrmat:PCM/MS ADPCM/IM?ADPCM14)Backlight:7colors?15)Power?supply:1× AAA?battery(alkaline?battery)16)Data?retention:10years17) Power?output:5mW+5mW(32Ω)18)Frequency?response: 20Hz-20KHz19)SNR?analog?output:>90dB20)Weight:30g21) Dimensions(L×W?×H):90×29×21mmInner?packing:1pc/gift bo×Dimensions:190×140×55mmOuter?packing:Dimensions: 395×297×290mmConveyance:Qty/20′FCL:14,800pcsQty/40′ FCL:29,700pcsQty/40′HQ:33,800pcs
Table 2:
TX?International?Group?Co.,Ltd Our?company?is?specialized?production?and?trading?MP3?player/DVD play?games?manufacturer.Specialized?production?world?name?brand MP3MP4player.Produces?specially?the?MP4?player?and?PSP. Welcome?to?discuss?and?make?arrangements?the?cooperation?item! TEL:86-0755-21137563 Mobile?phone:086-13006621355 MSN:cootv2008@hotmail.com E-mail:cootv@163.com?cootv2008@gmail.com websites:http://www.cootv.com
Table 3:
Memory?Stick?Pro?Duo?1GB/2GB(High?Speed) Description: Sony?Memory?Stick?Pro?Duo?1GB/2GB(HIGH?SPEED)Product?Origin: Japan?Detailed?Product?Description?With?a?massive?total?storage?capacity of?1GB/2GB(940MB/1.85GB?available),it?is?perfect?for?storing photographs,video?and?digital?music?files.Its?high?speed?feature?allows?it?to read?and?write?data?at?speeds?up?to?80Mbps?on?enabled?devices,and?it?is backwards?compatible?with?regular?Memory?Stick?PRO?Duo?devices.With the?included?MSAC-M2adapter,the?MSX-M2GN?is?backwards?compatible with?most?products?that?support?full-sized?Memory?Stick?PRO?media. Designed?especially?for?professional-quality?compact?digital?cameras?and portable?digital?music?devices,it′...
Further, the omnibearing information data that relates in the internet product data is provided in the product knowledge database server, particularly, it can use various about the descriptive data of product and the each side information data of the enterprise of release product on the internet, to satisfy the demand about the high-quality retrieval of product data webpage.Participle device and the index web data that search gets according to downloader generates corresponding index database data, and the knowledge data about product is write the index database data with higher weights (being the product know-how correlation of data) for the importance that embodies product data, degree of correlation according to good as calculated product know-how data when said process during requestor dynamic response user search generates result for retrieval, the efficient of such method and system design raising system and search engine.The present invention has used product phrase treatment technology when static search library of generation and dynamic response user inquiring, and the system that makes can handle long, complicated query, demand comprehensive to satisfy, accurate retrieval.
With reference to Fig. 1, participle device and index (index creation module) are to web pages downloaded data organization index, except general in-line arrangement, fall the index structure such as row, this module has also been carried out the emphasis processing according to the information data of the product description in the product knowledge database server to the product description data in the webpage, with the importance of embodiment product data, and avoid when the dynamic response user inquiring, carrying out complex calculations.Requestor (inquiry input processing/result-generation module) dynamic response user's inquiry, input string to inquiry is handled it, the generated query result returns the user, and the foundation of result for retrieval is the index structure in the index database, reflection be to be the correlativity of core webpage with product data.In webpage is described the correlativity of product data, the generation of final ranking results also with reference to comprehensive information such as the newness degree that whether contains the time of product issue in product picture, the webpage in the company information of release product in the product knowledge database server, the webpage, makes the result who is obtained can really reflect product retrieval result's quality comprehensively.Wherein, company information comprises the static information data that every evaluation of indexes such as scale, history, organizational structure are formed, the active degree data of commercial activity on the internets such as the product issue of enterprise, and client, expert's data such as evaluation are formed.Said system and method are to take as the leading factor with the correlativity of product data in the webpage, take into account data evaluation correlativity to product data issue enterprise, and other every correlation of data, realize search engine system and searching method efficient, comprehensive, accurate retrieval.
The searching method of search engine of the present invention has all adopted product phrase disposal route and adopted the method for the product know-how data fusion being advanced index database in participle device and index (index creation module) in participle device and index (index creation module) and requestor (inquiry input processing/result-generation module) in order to make effective processing to the complicated query input string.
The standard method of the calculating of search engine retrieving correlativity at present is the correlativity of calculating between inquiry input and the document, and presses the height output ranking results of correlativity.Correlativity between inquiry input and the document is in fact by each vocabulary item of forming the inquiry input and the correlation calculations between the document, the significance level of vocabulary in document during promptly inquiry is imported, weight in other words.Therefore, which type of weight which type of vocabulary being provided with is very crucial problem in the searching system of search engine of the present invention.The standard meter algorithm of the weight of certain vocabulary item in document is the TF/IDF computing method at present, this be with reference to vocabulary in document frequency of occurrence and the weight of the distribution situation in entire document set method is set, but such method does not embody the importance of product data, can not fully adapt to the needs based on product search engine. retrieves method based on the internet.In the searching algorithm of searching method of the present invention, except adopting the TF/IDF method to be provided with the weight, utilize the product know-how data to general vocabulary, product data have been adopted special weight setting, to embody its singularity.
In the scope of the searching system in search engine method, the product know-how data show as every attribute speech that product speech and this product are had, and the special processing of product data is also shown as setting to product speech weight.The basic foundation that lexical data is weighed with respect to the correlativity of one piece of document (being webpage) is a frequency of occurrence, but at product speech data, then only the frequency of occurrence of measurement itself is not enough.Product has the attribute data of various aspects such as size dimension, electrical equipment index, thus product speech correlation of data should with the consideration of uniting of its attribute speech correlation of data.Because product speech data have leading role, so attribute speech correlation of data is subordinated to product speech correlation of data.A product speech t iCorrelativity R (t to one piece of webpage i) be defined as:
R(t i)=W 0·f(t i)+W 1·f(s i1)+…+W n·f(s in)
In the following formula, s I1S InBe product speech t iThe attribute speech, f is the importance function based on the vocabulary frequency of occurrence, W 0, W 1W nBe every weight, can regulate setting.
Under product speech data or a lot of situations of lexical data relevant with product is not a single speech, but compound word or even phrase, has inner structure, if these inner structures are not handled it, then be difficult to handle long complex query, can not provide comprehensive result for retrieval, so the present invention adopted product phrase disposal route, when index creation stage and dynamic response inquiry, all carried out the product phrase and handle.
The product phrase is handled product speech or the inquiry input refer to having inner structure and is carried out multiple-segmentation, and with the cutting substring that produces according to the significance level differentiating and processing.The principle of the significance level ordering of the substring that cutting produces is that the substring of length is more important than short substring, and the substring of keeping right in the position is more important than the substring that the position keeps left.Creating the index stage, the significance level of substring influences its correlativity to document, the ordering that the significance level influence retrieval of the substring that produces in the cutting of dynamic corresponding stage produces.Product phrase disposal route mainly finishes the simple cutting of text input string/file, based on phrase/subphrase identification, the subphrase importance ranking of stem reduction, and the centre word of each cutting string identification.
Participate in participle device of the present invention and index (index creation module) the method for work process flow diagram of Fig. 2, this method is at first read in web data, identification has the long products speech data of inner structure then, carry out cutting and ordering then, at last cutting substring and other lexical item data are set up index and weight is set.
With reference to Fig. 3, what the product phrase processing in the requestor dynamic response stage of the searching method of internet search engine of the present invention was different with index (index creation module) method of work with the participle device is the identification that also comprises the centre word data, the centre word data are meant the part that the inquiry input string is become branch to modify by modification property, or play the composition of difference effect, has " mp3 " functional eyeglasses as referring at " MP3Glass ", wherein centre word is " Glass ", but centre word should be " mp3 " in " mp3player ".
Referring to table 1, table 1 is the original web page that an internet relates to MP3 player product information data, contains the following message data in this webpage: " MP3Player ", " Size ", " Multi-Language ", " Memory ", " Power Supply ", " Data Retention ".
Referring to table 2, table 2 is the original web page of design MP3 player product information data on another internet, contains the following message data in this webpage: " MP3Player ".
If the description of following every attribute about " MP3Player " is arranged: " MP3Player ": " Size " in the product knowledge database server, " Multi-Language ", " Play Time ", " Memory ", " Power Supply ", " Data Retention ".All comprise " MP3Player " in the shown webpage in above-mentioned two tables, but according to the data in the knowledge base server, contain more detailed explanation in first webpage about " MP3Player " every attribute, therefore in the index creation stage, in the correlation calculations of " MP3Player " this vocabulary item and these two webpages, the correlativity height of previous webpage, then a correlativity is little, as represented among Fig. 4 (the correlativity size is represented with the thickness of arrow among the figure).
Just like the web data described in the table 3, contain in the webpage 3 " 1GB " for another example.If user's retrieval is input as " 1gbmp3player ", then handle through the product phrase, after the centre word identification, " Mp3player " is identified as centre word, afterwards with respect to the ordering of the retrieval of this retrieval and above-mentioned three webpages output webpage will become for: as the described webpage 1 of table 1, as the described webpage 2 of table 2, as the described webpage 3 of table 3.
Participle device and index with reference to the searching method of the internet search engine of the present invention of Fig. 5 carry out the method flow diagram that correlativity is judged; Its ordering to substring is more important than short substring according to long substring, the substring of keeping right in the position carries out than the substring important method that the position keeps left, to correlation calculations be: R (t) * I (n) according to computing formula, wherein R (t) is an aforementioned formula, I (n) is the ordering attenuation function, ordering is more little by its value of back more, and n is the sequence number in the ordering.
Requestor dynamic response work detailed method process flow diagram with reference to the searching method of the internet search engine of the present invention of Fig. 6.The process of wherein definite true centric speech and centre word can still be omitted the searching method based on processing of data phrase and the realization of correlativity basic fundamental means that these steps still can realize search engine of the present invention so that searching method of the present invention is more accurate.Above-mentioned steps has enlarged retrieval substring sequence by determining the judgement and the increase method of centre word, has just improved the hunting zone to data in the product web page data server.

Claims (8)

1. the searching method of an internet search engine, this method is by containing by downloader, the product knowledge database server, the product web page data server, participle device and index, index data base, the search system that requestor is formed, carry out the search of product data, it is characterized in that: the step of this method comprises for the product original web page information on the internet, obtain the product web page data by downloader, according to the data in the product knowledge database server by participle device and index treatment product webpage and create data directory and write index data base, corresponding product data webpage writes the product web page data server, by requestor input user inquiring and according to generated query result after the data processing in the product knowledge database server, describedly comprise the steps: by participle device and index treatment product web data according to the data in the product knowledge database server
A. obtain the original web page text,
B. according to the product data in the product knowledge database server web page text is carried out the longest coupling of forward,
C. whether coupling is successful among the determining step b,
If d. the match is successful, then occurrence is carried out the data phrase and handle and to obtain the substring line ordering of going forward side by side,
E. each substring in the steps d is calculated correlativity, and sets up index and write index data base,
The described processing by requestor input user inquiring according to the data in the product knowledge database server comprises the steps:
F. read in the query string of user's input,
G. query string is carried out the forward maximum match, generates the occurrence set,
H. occurrence set carrying out product phrase is handled, is generated substring set and ordering, generate effective substring sequence according to each substring correlativity,
I. each substring in effective substring sequence is obtained the web data set that matches successively, and according to correlativity size ordering output.
2. according to the searching method of the described internet search engine of claim 1, it is characterized in that the product data in the described product knowledge database server comprise product attribute data and product business data.
3. according to the searching method of the described internet search engine of claim 2, it is characterized in that, described according to the data in the product knowledge database server by increasing following steps in described step h in the requestor input user inquiring treatment step and the step I:
J. each element in effective substring sequence is determined the true centric speech,
If k. have preposition in the true centric speech, determine that then centre word is a most left preposition speech before, if there is not preposition in the true centric speech, determine that then centre word is last speech,
L. centre word and true centric speech are generated the substring sequence of expansion as the effective substring sequence in the described step I.
4. according to the searching method of the described internet search engine of claim 3, it is characterized in that described phrase is treated to according to the inner structure of product speech and carries out the multiple-segmentation processing.
5. according to the searching method of the described internet search engine of claim 4, it is characterized in that described ordering is order or inverted order.
6. according to the searching method of the described internet search engine of claim 5, it is characterized in that described substring index order method is more important than short substring according to long substring, the position by by the substring substring important method that keeps left than the position draw.
7. according to the searching method of the described internet search engine of the arbitrary claim of claim 1 to 6, it is characterized in that substring correlativity size is R (t) * I (n) described in the described step e in described participle device and the index treatment product web data treatment step.
8. according to the searching method of the described internet search engine of the arbitrary claim of claim 1 to 6, it is characterized in that described is R (t) * I (n) according to the data in the product knowledge database server by the described substring correlativity among the described step h in the requestor input user inquiring treatment step.
CNB2007101780759A 2007-11-26 2007-11-26 A kind of searching method of internet search engine Expired - Fee Related CN100557610C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007101780759A CN100557610C (en) 2007-11-26 2007-11-26 A kind of searching method of internet search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007101780759A CN100557610C (en) 2007-11-26 2007-11-26 A kind of searching method of internet search engine

Publications (2)

Publication Number Publication Date
CN101271464A true CN101271464A (en) 2008-09-24
CN100557610C CN100557610C (en) 2009-11-04

Family

ID=40005438

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007101780759A Expired - Fee Related CN100557610C (en) 2007-11-26 2007-11-26 A kind of searching method of internet search engine

Country Status (1)

Country Link
CN (1) CN100557610C (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101814098A (en) * 2010-05-11 2010-08-25 天津大学 Method for obtaining software security defects based on vertical search and semantic annotation
CN103049577A (en) * 2013-01-09 2013-04-17 广东欧珀移动通信有限公司 Method for querying data connection type of different digital products by mobile terminals
WO2013127319A1 (en) * 2012-02-28 2013-09-06 Tencent Technology (Shenzhen) Company Limited Method and apparatusfor text searching on a touchterminal
CN103927342A (en) * 2014-03-28 2014-07-16 苏州中炎工贸有限公司 Vertical search engine system on basis of big data
CN103995845A (en) * 2014-05-06 2014-08-20 百度在线网络技术(北京)有限公司 Information search method and device
CN103995846A (en) * 2014-05-06 2014-08-20 百度在线网络技术(北京)有限公司 Application message searching method and device
CN105912584A (en) * 2016-04-01 2016-08-31 南京奥灵克物联网科技有限公司 Data index system based on webpage information data
CN108595400A (en) * 2018-04-20 2018-09-28 广东电网有限责任公司 A kind of work report generation method based on artificial intelligence
CN109241360A (en) * 2018-08-21 2019-01-18 阿里巴巴集团控股有限公司 The matching process and device and electronic equipment of combining characters string
CN111104485A (en) * 2019-12-24 2020-05-05 上海风秩科技有限公司 Method and device for determining product text, computer equipment and medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115470323B (en) * 2022-10-31 2023-03-10 中建电子商务有限责任公司 Method for improving searching precision of building industry based on word segmentation technology

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101814098A (en) * 2010-05-11 2010-08-25 天津大学 Method for obtaining software security defects based on vertical search and semantic annotation
CN101814098B (en) * 2010-05-11 2012-05-02 天津大学 Method for obtaining software security defects based on vertical search and semantic annotation
WO2013127319A1 (en) * 2012-02-28 2013-09-06 Tencent Technology (Shenzhen) Company Limited Method and apparatusfor text searching on a touchterminal
CN103049577A (en) * 2013-01-09 2013-04-17 广东欧珀移动通信有限公司 Method for querying data connection type of different digital products by mobile terminals
CN103927342A (en) * 2014-03-28 2014-07-16 苏州中炎工贸有限公司 Vertical search engine system on basis of big data
CN103995846A (en) * 2014-05-06 2014-08-20 百度在线网络技术(北京)有限公司 Application message searching method and device
CN103995845A (en) * 2014-05-06 2014-08-20 百度在线网络技术(北京)有限公司 Information search method and device
CN103995846B (en) * 2014-05-06 2017-04-05 百度在线网络技术(北京)有限公司 The searching method and its device of application message
CN105912584A (en) * 2016-04-01 2016-08-31 南京奥灵克物联网科技有限公司 Data index system based on webpage information data
CN105912584B (en) * 2016-04-01 2020-07-31 南京奥灵克物联网科技有限公司 Data indexing system based on webpage information data
CN108595400A (en) * 2018-04-20 2018-09-28 广东电网有限责任公司 A kind of work report generation method based on artificial intelligence
CN109241360A (en) * 2018-08-21 2019-01-18 阿里巴巴集团控股有限公司 The matching process and device and electronic equipment of combining characters string
CN109241360B (en) * 2018-08-21 2021-08-20 创新先进技术有限公司 Matching method and device of combined character strings and electronic equipment
CN111104485A (en) * 2019-12-24 2020-05-05 上海风秩科技有限公司 Method and device for determining product text, computer equipment and medium

Also Published As

Publication number Publication date
CN100557610C (en) 2009-11-04

Similar Documents

Publication Publication Date Title
CN100557610C (en) A kind of searching method of internet search engine
US20230214905A1 (en) Recommendations based on branding
CN102419779B (en) Method and device for personalized searching of commodities sequenced based on attributes
US7685084B2 (en) Term expansion using associative matching of labeled term pairs
CN105765573B (en) Improvements in website traffic optimization
US8352331B2 (en) Relationship discovery engine
Khraim The impact of search engine optimization on online advertisement: The case of companies using E-Marketing in Jordan
US20180060921A1 (en) Augmenting visible content of ad creatives based on documents associated with linked to destinations
EP3564828A1 (en) Method of data query based on evaluation and device
KR101936362B1 (en) Generating an advertising campaign
CN102446326B (en) A kind of method of information pushing, system and equipment
CN104679771A (en) Individual data searching method and device
Thomaidou et al. Multiword keyword recommendation system for online advertising
CN104636334A (en) Keyword recommending method and device
US8977625B2 (en) Inference indexing
EP2188712A2 (en) Recommendation systems and methods
CN103309886A (en) Trading-platform-based structural information searching method and device
García-Moya et al. Storing and analysing voice of the market data in the corporate data warehouse
Khraim The impact of search engine optimization dimensions on companies using online advertisement in Jordan
CN112269816B (en) Government affair appointment correlation retrieval method
US20070143255A1 (en) Method and system for delivering internet content to mobile devices
CN113486226A (en) Method and system for search result annotation
CA3233355A1 (en) System and method for improving e-commerce
JP2006146446A (en) Retrieval optimization system and method for web site
WO2016020930A1 (en) An integrated system for a virtual bookstore

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: NINETOWNS INTERNET TECHNOLOGY GROUP COMPANY LIMITE

Free format text: FORMER OWNER: BEIJING JIUCHENG YIJU TENANCY CO., LTD.

Effective date: 20120417

C41 Transfer of patent application or patent right or utility model
C56 Change in the name or address of the patentee

Owner name: BEIJING JIUCHENG YIJU TENANCY CO., LTD.

Free format text: FORMER NAME: BEIJING NINETOWNS INTERNET TECHNOLOGY CO., LTD.

COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100070 FENGTAI, BEIJING TO: 100020 CHAOYANG, BEIJING

CP01 Change in the name or title of a patent holder

Address after: 100070, Beijing, Fengtai District, South Fourth Ring Road, No. 7, 188 District, 14 floor

Patentee after: Beijing The9 livable Property Co.,Ltd.

Address before: 100070, Beijing, Fengtai District, South Fourth Ring Road, No. 7, 188 District, 14 floor

Patentee before: BEIJING NINETOWNS INTERNET TECHNOLOGY Co.,Ltd.

TR01 Transfer of patent right

Effective date of registration: 20120417

Address after: 100020 Beijing City, Chaoyang District Road No. 20, building 1, 22 storey International Building Report

Patentee after: Guangdong Fanzai Wireless RFID Public Technology Support Co.,Ltd.

Address before: 100070, Beijing, Fengtai District, South Fourth Ring Road, No. 7, 188 District, 14 floor

Patentee before: Beijing The9 livable Property Co.,Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20091104

Termination date: 20151126

CF01 Termination of patent right due to non-payment of annual fee