CN102332025A - Intelligent vertical search method and system - Google Patents

Intelligent vertical search method and system Download PDF

Info

Publication number
CN102332025A
CN102332025A CN201110300772A CN201110300772A CN102332025A CN 102332025 A CN102332025 A CN 102332025A CN 201110300772 A CN201110300772 A CN 201110300772A CN 201110300772 A CN201110300772 A CN 201110300772A CN 102332025 A CN102332025 A CN 102332025A
Authority
CN
China
Prior art keywords
clauses
subclauses
user
chained address
safety
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201110300772A
Other languages
Chinese (zh)
Other versions
CN102332025B (en
Inventor
赵轶
许小东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
3600 Technology Group Co Ltd
Original Assignee
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qizhi Software Beijing Co Ltd filed Critical Qizhi Software Beijing Co Ltd
Priority to CN201110300772.3A priority Critical patent/CN102332025B/en
Priority to CN201410328560.XA priority patent/CN104063514B/en
Publication of CN102332025A publication Critical patent/CN102332025A/en
Application granted granted Critical
Publication of CN102332025B publication Critical patent/CN102332025B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an intelligent vertical search method and an intelligent vertical search system, and relates to the technical field of networks. The method comprises the following step of: according to matching results of query words of a user and items which have been classified in the industry in a classified database, acquiring items related with the query words in various classes, displaying the items of the various classes and related information according to classes, wherein the items which have been classified in the industry are acquired by the following steps of: acquiring all items of all webpages of the industry and related information; performing weighted analysis on morphological matching results of key words of various items and key words corresponding to various classes of the industry to determine the classes to which the various items belong; and/or determining the classes to which the various items belong according to the similarity of the appearance frequency rate of various key words of unclassified items and the appearance probability of various key words of the classified items, which is acquired through statistics in advance. By the invention, information related to focuses of users can be displayed more accurately, comprehensively and quickly.

Description

A kind of intelligent uprightness searching method and system
Technical field
The application relates to networking technology area, particularly relates to a kind of intelligent uprightness searching method and system.
Background technology
Development along with Internet technology; Information on the internet presents volatile development, if this causes the user to want in network, to search out the information of oneself needs, must go to obtain the information that oneself needs through search; Search is that numerous netizens obtain information, arrives the main channel of appointed website.
In the prior art; A kind of is universal search, initiatively sends " spider " program through search engine, and the site information of whole internet is retrieved; The spider reptile is after grasping a page; This page is analyzed warehouse-in, and read all connections of the page, these are connected grasp analysis more then.Do not stop to grasp through the periodicity of thousands of spiders; Realized index to all info webs of whole internet; And directly return to the user with relevancy ranking according to the keyword coupling information that all are relevant with this keyword, and its advantage is that the hunting zone is wide, that return through keyword search is large and complete result but shortcoming is the user; The service of carrying out category filter by industrial characteristic cannot be provided; The demonstration of can't classifying, however which content be want in this search of user to search out can't judge that the user need take long to and in the result, look for the information of own needs.
A kind of in addition is vertical search, the deep search of resource in its station, and the user adds corresponding data on its website; System checks to these data and filters that search engine carries out Chinese word segmentation with these data and sets up index, supplies other user inquirings; Be of the search service of certain website to its station internal information; Information in its website is directly various information to be added in each classification of this website by user in its station, and its advantage is that classification is clear, but that shortcoming is the hunting zone is narrower; The result who provides is comprehensive inadequately, and the user needs can recognize comprehensive information through being used just with other search engines.
Summary of the invention
The application's technical matters to be solved provides a kind of vertical intelligent uprightness searching method and system, solves the user can't obtain relevant information comprehensively, accurately and rapidly when search information shortcoming.
In order to address the above problem, the application discloses a kind of intelligent uprightness searching method, comprising:
Obtain the query word of importing in the industry of user side selection;
According to the result of classified each entries match of the sector in said query word and the taxonomy database, obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage;
Wherein, classified each clauses and subclauses of said the sector obtain through following steps:
Obtain all clauses and subclauses and the relevant information of said all webpages of industry;
The morphology matching result of the keyword in each clauses and subclauses and the sector corresponding keyword of all categories is carried out weighted analysis confirm classification under each clauses and subclauses;
And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
Preferably, described query word comprises:
With the keyword of user's input as query word;
Perhaps, one in the suggestion speech that the input speech of the user being selected according to this user returns as query word; Wherein, described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
Preferably, also comprise following safety detection step:
Steps A, for the chained address at said each clauses and subclauses place to be classified that obtains, check whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses;
And/or, step B, the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database;
And/or, step C, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.
Preferably, check through safety inspection engine and wooden horse killing engine whether safe process is carried out through following steps in said chained address:
Step P1, for the chained address at said each the clauses and subclauses place to be classified that obtains, the inspection of safety inspection engine is submitted in this chained address, and whether it exists in the safe class storehouse;
Step P2 if exist and safety, then classifies to said clauses and subclauses;
Step P3 if exist but dangerous, then sends warning message, and filters out the related data of this chained address;
Step P4 if do not exist, then through this chained address of wooden horse killing engine inspection, judges that whether this link safety, if safety deposits this chained address in the safe class storehouse, and changes step P1 over to.
Preferably, also comprise following safety detection step:
Through put on record information and/or website real name Verification System of the chained address icp that comprises said clauses and subclauses said clauses and subclauses are carried out safety detection.
Preferably, when all clauses and subclauses of obtaining said all webpages of industry and relevant information, comprise:
Automatically grasp step, be used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry;
Replenish the typing step, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.
Preferably,, category comprises when giving user side with the clauses and subclauses of each said classification and correlated information exhibition:
The user behavior inlet that clauses and subclauses are relevant directly offers user side.
Preferably, also comprise:
The intelligent correction step, the query word of importing for user error through the intelligent correction engine carries out error correction.
Preferably; The occurrence number ratio of each keyword i of said clauses and subclauses and the difference of the probability of occurrence of each keyword i of class entry; Through the logarithm weighting; Calculate the corresponding classification c deviate G (c) of clauses and subclauses, the more little then similarity of deviate is high more, gets minimum deviate place classification and confirms to classify under the clauses and subclauses.
Disclosed herein as well is a kind of intelligent uprightness searching system accordingly, comprising:
Search engine is used for obtaining the query word that a industry that user side selects is imported; According to the result of classified each entries match of the sector in said query word and the taxonomy database, obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage;
Taxonomy database is used to store every profession and trade grouped data;
Data acquisition module is used to obtain all clauses and subclauses and the relevant information of said all webpages of industry;
Sorter is used for morphology matching result with the keyword of each clauses and subclauses and the sector corresponding keyword of all categories and carries out weighted analysis and confirm classification under each clauses and subclauses; And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
Preferably, also comprise:
Safety check module is used for the chained address for said each clauses and subclauses place to be classified that obtains, and checks whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses;
And/or the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database;
And/or, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.
Preferably, also comprise:
The keyword suggestion engine is used for returning the suggestion speech according to user's input speech; Described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
Preferably, also comprise:
The intelligent correction engine is used for carrying out error correction for the query word of user error input.
Preferably, described data acquisition module comprises:
The data grabber is used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry;
Replenish the typing module, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.
Preferably, also comprise:
Interface provides module, is used for that the user behavior inlet that clauses and subclauses are relevant directly offers user side when category is given user side with the clauses and subclauses of each said classification and correlated information exhibition.
Compared with prior art, the application comprises following advantage:
The application is through being the search starting point with the trade classification; Obtain the clauses and subclauses of all websites of every profession and trade in the network, and carry out weighted analysis through morphology matching result and confirm classification under each clauses and subclauses and/or confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance with the keyword in each clauses and subclauses and the sector corresponding keyword of all categories; After all clauses and subclauses of an industry are classified; When the user inquires about; Search engine can be according to the result of classified each entries match of the sector in said query word and the taxonomy database; Obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; The application has realized the automatic classification and the screening function of data through setting up automatic classification system, can represent the information relevant with user's focus more accurately, more comprehensively, more efficiently.
Description of drawings
Fig. 1 is the schematic flow sheet of a kind of intelligent uprightness searching method of the application;
Fig. 2 is the validity synoptic diagram of the statistics classification of a kind of intelligent uprightness searching method of the application;
Fig. 3 is the best classification process synoptic diagram of a kind of intelligent uprightness searching method of the application;
Fig. 4 is the suggestion speech validity synoptic diagram of a kind of intelligent uprightness searching method of the application;
Fig. 5 is a safety inspection schematic flow sheet before the preferred clauses and subclauses classification of a kind of intelligent uprightness searching method of the application;
Fig. 6 is the structural representation of a kind of intelligent uprightness searching of the application system;
Fig. 7 is the structural representation of a kind of intelligent uprightness searching optimum system choosing of the application.
Embodiment
For above-mentioned purpose, the feature and advantage that make the application can be more obviously understandable, the application is done further detailed explanation below in conjunction with accompanying drawing and embodiment.
With reference to Fig. 1, show the schematic flow sheet of a kind of intelligent uprightness searching method of the application, comprising:
Step 110 is obtained the query word of importing in the industry of user side selection.
Step 120 according to the result of classified each entries match of the sector in said query word and the taxonomy database, is obtained the clauses and subclauses relevant with query word of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage.Wherein, business tine does not contain non-main business information such as news, advertisement, question and answer, and does not comprise new line on the webpage, advertisement or icp (Internet Content Provider, the Web content service provider) information in zones such as upper left or below such as put on record.
Wherein, classified each clauses and subclauses of said the sector obtain through following steps:
Step 210 is obtained all clauses and subclauses and the relevant information of said all webpages of industry;
Step 220 is carried out weighted analysis with the morphology matching result of the keyword in each clauses and subclauses and the sector corresponding keyword of all categories and is confirmed classification under each clauses and subclauses;
And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
The application can onlinely further classify through step 210 and step 220 pair every profession and trade down in advance.Such as for education sector; Can it be divided into juvenile education; Secondary education; Higher education; Big type of vocational educations etc. can be divided into child, children again and rise little, one grade, second grade, three grades, senior class, five grades, six grades, from-primary-to-junior-middle-school etc. type in juvenile education, secondary education can be divided into the junior one, the junior two, the junior three, senior middle school's preparatory course, high by one, Senior Two, Senior Three, preparatory course etc. type again; Higher education can be divided into again that English Test (Band 4 and 6), new ideas, English are relevant, rare foreign languages, the English of preparing for the postgraduate qualifying examination, the mathematics of preparing for the postgraduate qualifying examination, the politics of preparing for the postgraduate qualifying examination, the specialized course for preparing for the postgraduate qualifying examination, the TOEFL, IELTS etc. type, and vocational education can be divided into classes such as TOEIC, professional rare foreign languages, financial accounting, self-study examination, computing machine, driving school, construction work, economy and trade/finance, medicine, working graduate student, human resources, civil servant again.Wherein, Each type all can contain corresponding a plurality of keyword, rises little, keywords such as a grade, second grade, three grades, senior class, five grades, six grades, from-primary-to-junior-middle-school, elementary English, mathematics, Chinese language, preschool class, speciality class such as contain child, children in the juvenile education class.
Again for example, for the recreation industry; Can be divided into type of play, recreation subject matter and game picture etc. type, type of play can be divided into again that role playing, turn-based, action game, FPS shooting, TPS shooting match, car are competed for speed, sports, music and dance, fistfight fight, strategy etc. type; The recreation subject matter can be divided into swordsman, fantasy, magical, science fiction, cartoon, history etc. type again; Game picture can be divided into 3D, 2D, 2.5D etc. types again.Wherein each class all can contain corresponding a plurality of keyword.
At a minute time-like; At first obtain all clauses and subclauses of all websites of the sector in the network by industry through step 210; Described clauses and subclauses are or multinomial business tine in the ecommerce webpage; Wherein business tine does not contain non-main business information such as news, advertisement, question and answer; And do not comprise new line on the webpage, advertisement or the icp information in zones such as upper left or below such as put on record, such as education sector is respectively educated the course content and the title thereof of type website, such as " English of preparing for the postgraduate qualifying examination spurt class " and content thereof.
Wherein, when all clauses and subclauses of obtaining said all webpages of industry and relevant information, comprise:
Automatically grasp step, be used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry.
Wherein, grasping step process automatically can carry out through following steps:
Step m1 searches for the chained address of all websites of the sector automatically, generates and grasps tabulation, and write down each extracting time and seized condition.
Step m2 adopts distributed deployment, according to Regional Distribution, different web sites is carried out periodicity grasp.If website data has change, the notification interface that the application provides can be called to adopt proactive notification mechanism in the website, realizes the real-time update to this website data.Only need visit the Notify Address that the application provides when website data changes, the extracting behavior can be triggered in this address.
Step m3 carries out safety inspection and validity check to grasping the data of coming.Whether safety inspection is submitted to this address of safety check module inspection with the address has wooden horse or virus to exist, and whether includes this address thereby return prompting.Validity check will check whether this address can normally open, and not exist or other mistakes if this address is returned, and this extracting will can not included this address.
Step m4 submits to sorter with data and carries out classification processing, and capture program continues to grasp new data.After this grasps end, the data grabber will restart to grasp, thereby and judge whether file has renewal to determine whether to skip and grasp next bar record.
Replenish the typing step, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.Can be used for replenishing and grasp the content that step does not grab automatically.
Common web data are unstructured datas, if use the mode of traditional extracting web page then need carry out structured analysis to data, in this process, because the Intelligent Recognition of system is limited in one's ability, will cause losing of partial information.In order to address this problem; The application combines the characteristics of problem, has set up a kind of cooperative mechanism with the data source provider, through formulating and provide the industry data format standard by the application; Fill in data content by the data source provider according to this standard, thereby reached the effect of getting twice the result with half the effort.Can set up format standard through following method: selected needs carry out the industry of vertical search, such as education, recreation; Analyze the data characteristics of the sector, formulate format standard.With the recreation is example, when people mention recreation, can expect game name, type of play, and game profile, relevant informations such as recreation address are carried out induction-arrangement to these information, finally form standard interface.The game interface field that forms is following:
Field Explanation of field
Game name Be no more than 50 words
Game characteristic Be no more than 50 words
Mission Description Be no more than 500 words
Game state Test or issue
Type of play Client recreation/web game etc.
Recreation operator To online game
The recreation start address To online game
The game download address The download link address is provided
How to begin Be no more than 500 words
Operating guidance Be no more than 500 words
Game picture The picture chained address is provided
The registered address The recreation registered address is provided
Different according to the rank of data volume, the application has formulated full dose interface and variable interface, and according to these interface generation corresponding interface data, what the application realized being correlated with calls by the data source provider.The characteristics of full dose structure can disposablely be returned all data, and the variable interface then is each returning part data.The application advises using the data of xml form to transmit.When the data that provide when a mechanism are fewer (size of general whole xml file is in 50Mb), can directly use the full dose data-interface; When data volume is bigger, generally the variable interface need be provided, thereby obtains data in batches.Promptly address is with the situation of change of every data of formal description of id, and the another one address can obtain corresponding data through id.Form is following:
<id?do=’insert’>1000</id>
<id?do=’update’>1001</id>
<id?do=’delete’>1002</id>
<id?do=’insert’>1003</id>
When the value of do is insert, show the newly-increased record that is recorded as of this id correspondence;
When the value of do is update, show the amendment record that is recorded as of this id correspondence;
When do ground value is delete, show the deletion record that is recorded as of this id correspondence.
Because all different from the data qualification information that each data source is obtained, when data exhibiting, for convenient the screening of user's ability, the application need reclassify according to unified standard various data.Promptly get into step 220, through the rule match classification: the morphology matching result of the keyword in each clauses and subclauses and the sector corresponding keyword of all categories is carried out weighted analysis confirm classification under each clauses and subclauses;
And/or, through statistics match classifying method: confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
For the rule match classification:
For example following title:
" class's 3 years junior middle schools level physical a surname Feng Shi system top-student class in autumn "
Through this title, the classified information that the application can obtain is: information such as autumn class, the junior three, physics.
At first, can find out that three classified informations that the application obtains all have corresponding word in title, obtain this classified information so the application can set up three rules.
Rule one: if contain " class in autumn " in the title then be categorized as " class in autumn ";
Rule two: if contain " junior middle school three grades " in the title then be categorized as " junior three ";
Rule three: if contain " physics " in the title then be categorized as " physics ".
In this way, similar following title classified information can be obtained.
" autumn, class was improved towards high system in class's 3 years junior middle schools level physical east "
" 3 years level physical top-student class of senior middle school of class in autumn "
" autumn, class's junior middle school one grade mathematics improved class "
" autumn, class's three grades English in junior middle school improved class "
The rule match method is based upon on the morphology matching mechanisms, and which classification it belongs to according to the common keyword decision clauses and subclauses that occur in clauses and subclauses and all kinds of keyword, through weighted analysis, finally confirms criteria for classification, and its formula is following:
P 1=x 1r 1+x 2r 2+x 3r 3+...+x nr n
P wherein iBe the classification results that draws through single rule, for certain classification P iThe highest, then these clauses and subclauses just belong to such; x iBe the morphology coefficient; r iBe the morphology matching result.The morphology matching result refers to the number of times that this speech occurs in this coupling.The morphology coefficient refers to the weight that this morphology matching result is stood in all morphology matching results.Weight is high more then should to be worth more near 1, and weight is low more then should to be worth more near 0.The value of this value is through manually being provided with.Then the morphology coefficient is higher relatively such as in title, matching the result, in description or content, matches the result, and then the morphology coefficient is lower.
Table one is the coefficient that coupling English divides time-like to use, and is categorized as example with " a high English Band training seminar " coupling " English ":
The morphology coupling The morphology matching result The morphology coefficient
Contain " English " in the title 1 0.8
Contain " level Four " in the title 1 0.7
Contain " IELTS " in the title 0 0.7
Contain " English " in the description 3 0.3
Contain " level Four " in the description 1 0.2
Table one
Can calculate batch mark of mixing the English classification is: 2.6
Calculate the mark of other classification of coupling then, check that at last the highest then these clauses and subclauses of mark of which classification belong to this classification.
Classify for statistics:
The application at first put in order the associative key of influential classification; Then classified data are added up; Check that these keywords exist or non-existent relation in each clauses and subclauses; Next with these keywords those still non-classified clauses and subclauses are analyzed, checked the relation that these keywords exist in clauses and subclauses, contrast the occurrence number ratio of each keyword of probability and the unfiled clauses and subclauses of classified each keyword at last; If two close, can think that then these unfiled clauses and subclauses belong to such.
According to the analysis of front, the application sets up following formula:
min G ( c ) = &Sigma; i = - 1 m log 2 ( 1 + | T ci - t ci | )
Wherein, c is classification, and G (c) is the classification deviate, and 1 is constant, and effective for guaranteeing the log value, i is a keyword, T CiBe the keyword of the class entry probability of occurrence that has counted, t CiFor treating class entry keyword occurrence number ratio.G (c) is more little, explains that similarity is high more, then judges these clauses and subclauses and belongs to the c classification.Wherein, class entry keyword probability of occurrence equals the geometric mean of the keyword occurrence number ratio of all clauses and subclauses; Treat the number of times that number of times/all keywords occur that class entry keyword occurrence number ratio=keyword occurs in clauses and subclauses in clauses and subclauses.
Promptly confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
Eigenwert for example: English, mathematics, Chinese language, the clauses and subclauses of prior classification are carried out characteristic speech probability of occurrence statistics, obtain like following table two:
Figure BDA0000096934640000112
Table two
Following the application carries out the eigenwert statistics to four unfiled clauses and subclauses and obtains table three:
Figure BDA0000096934640000113
Figure BDA0000096934640000121
Table three
Calculate like table four classification results according to aforementioned formula at last:
Figure BDA0000096934640000122
Table four
With reference to Fig. 2, be that this statistics classification is along with the continuous increase of statistics and the corresponding diagram of classification quality.Can find out that along with the continuous increase of statistics, the accuracy rate of sample classification method constantly improves, sample size is big more, and its classification accuracy is more near 1, so this statistics sorting technique has sufficient validity.
In practical application,, be the best classification process synoptic diagram of the application with reference to Fig. 3.
For the convenience and the load that lowers system of the computing of system, the morphology matching result that the application at first adopts aforesaid matched rule classification to be about to keyword and the sector in each clauses and subclauses corresponding keyword of all categories carries out weighted analysis and confirms the affiliated classification of each clauses and subclauses; When in threshold time; Can't certain clauses and subclauses be carried out the branch time-like through the matched rule classification, then promptly confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance through the statistics classification.
After the classification of every profession and trade is accomplished in to network; When the user uses the application's system; Query word for the user; Search engine can obtain the clauses and subclauses relevant with query word of all categories according to the result of classified each entries match of the sector in said query word and the taxonomy database, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification.
Wherein, described query word comprises:
With the keyword of user's input as query word.
Perhaps, one in the suggestion speech that the input speech of the user being selected according to this user returns as query word; Wherein, described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
For the suggestion speech, because the user is in when search, the keyword of choosing is very general speech to a great extent, and a lot of meanings are arranged, can corresponding various webpages, and in fact the user possibly only want to look for specific contents.Such as user search " English "; The webpage that can mate this speech has countless webpages; And in fact the user possibly want to search for is " Expert English language training by qualified teachers " perhaps contents such as " English exams ", so for can better the match user demand, the application analyzes the keyword of user's input through intelligence suggestion dictionary; Thereby provide a suggestion speech to let the user select to search for again; Reached the refinement user's request, thereby can understand user view more accurately, for the user provides Search Results more accurately.
Further, can set up intelligence suggestion dictionary through following steps, the input speech of being imported according to the user by the keyword suggestion engine then returns respectively advises speech.
Step n1, the input speech of statistics of user's input concerns with corresponding result's click.Such as user search English, clicked the course of Expert English language training by qualified teachers, the search mobile phone has been clicked the webpage of buying mobile phone.
Step n2 sorts according to the result who adds up, and clicks title to the corresponding hot topic of each input speech and all carries out word segmentation processing.
Step n3 according to word segmentation result, extracts and the relevant click speech of input speech, produces input speech and the corresponding relation of clicking speech.Such as the corresponding Expert English language training by qualified teachers of English, English Band, English is prepared for the postgraduate qualifying examination or the like.The user is in input during keyword, clicks the prompting of speech to the user, and speech screens if the user has selected click, good refinement user search purpose then, thus for the user Search Results is provided more accurately.
With reference to Fig. 4, be to suggestion speech and the continuous six days contrast of Search Results clicking rate of not having the suggestion speech are arranged.As can be seen from the figure, the clicking rate as a result that the suggestion speech is arranged proves that apparently higher than the clicking rate of not having the suggestion speech the application's suggestion speech structural scheme has validity.
In addition, the application can carry out error correction through the intelligent correction engine for the query word of user's input, and such as user's input " test English ", the intelligent correction engine can be corrected as it " English of preparing for the postgraduate qualifying examination ".
In addition, the application comprises when category is given user side with the clauses and subclauses of each said classification and correlated information exhibition: the user behavior inlet that clauses and subclauses are relevant directly offers user side.Such as recreation download is arranged, course has registration etc., when representing, directly these interfaces is offered the user.
Preferably, the application's the process that represents is:
Step q1 gets access to the query word of user search.
Step q2 inserts the intelligent correction engine with query word and inquires about.This engine can carry out the obfuscation participle with the user input query speech, through the natural participle dictionary that has generated, and standard participle dictionary, modes such as phonetic error correction dictionary produce a discernible participle text of search engine.
Step q3, search engine insert the text in advance and retrieve in the compound full-text index through the taxonomy database generation, and Search Results is carried out intelligent sequencing, return to sort through search engine then and optimize good result.Wherein, the intelligent sequencing mode is handled according to the temperature and the degree of correlation of participle text generating.Because in advance the industry data having been carried out format analyzes; Directly obtained the address that the user possibly use in the interface through definition; When the foreground represents, directly the user behavior inlet is offered user's (such as recreation download is immediately arranged, course has registration, audiovisual etc.); Realize that the user clicks download, the webpage that need not get into the other side more just can directly be downloaded.The user clicks registration, need not get into the INTRODUCTION OF THE COURSE STRUCTURE page again and just can directly register, and finally realizes the through page of wanting of a key.
Through said process, can realize the query word to the user is carried out intelligent correction, the query aim with the user is precisely represented, and the through user behavior inlet of a key.
In addition, the application also comprises following safety detection step:
Steps A, for the chained address at said each clauses and subclauses place to be classified that obtains, check whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses.
With reference to Fig. 5, check through safety inspection engine and wooden horse killing engine whether safe process is carried out through following steps in said chained address:
Step P1, for the chained address at said each the clauses and subclauses place to be classified that obtains, the inspection of safety inspection engine is submitted in this chained address, and whether it exists in the safe class storehouse;
Step P2 if exist and safety, then classifies to said clauses and subclauses;
Step P3 if exist but dangerous, then sends warning message, and filters out the related data of this chained address;
Step P4 if do not exist, then through this chained address of wooden horse killing engine inspection, judges that whether this link safety, if safety deposits this chained address in the safe class storehouse, and changes step P1 over to.
And/or, step B, the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database.Each address of traversal that the safe class storehouse of safety inspection engine can not stopped; In case finding has non-safety information in the link, immediately this address is made as dangerously, the notification data engine should link related data and delete simultaneously.
And/or, step C, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.Writing down all represents to the user and the safety inspection engine is inserted with these addresses in the chained address clicked; Carry out safety inspection; If it is dangerous to be checked through the address; Point out the user " it is dangerous to be checked through this network address, whether continues visit " immediately, the notification service end just should link the related data deletion simultaneously.
Preferably, the application uses steps A and step B and step C simultaneously, sets up network address safety dynamic chek mechanism.Such as using 360 wooden horse cloud killing engines,, all have been obtained connection investigated and set up the safe class storehouse immediately through uninterrupted circular test and click follow-up investigations technology.Use triple security mechanisms, data are checked.The first heavy security mechanism: data get into when treating class library for treating that class entry checks whether safety of web page address for the first time, and related with the safe class database data, in case pinpoint the problems deleting this record immediately; The second heavy security mechanism: will carry out uninterrupted circular test after data are classified; Triple security mechanisms: when the user clicked the corresponding network address of Search Results, system will submit to this address to judge whether safety of this address to safety inspection mechanism level.
The application also comprises following safety detection step:
Through put on record information and/or website real name Verification System of the chained address icp that comprises said clauses and subclauses said clauses and subclauses are carried out safety detection.
In reality, also possibly exist some normal in form, do not have wooden horse and any other virus but in fact non-honest webpage, such as some fishing webpages.The application can be through said clauses and subclauses place put on record situation such as the basic condition that information can query web, the situation of website owner and compare with actual information and to confirm whether safety of said clauses and subclauses of the ICP of website of chained address; Whether the chained address that also can verify said clauses and subclauses place through the website real name Verification System of closely carrying out the real name authentication safety; If safety then deposits said clauses and subclauses and relevant information thereof in taxonomy database.
Through above-mentioned safety inspection process, can ensure the safety of webpage in user's use.
With reference to Fig. 6, show the structural representation of a kind of intelligent uprightness searching of the application system.
Search engine 310 is used for obtaining the query word that a industry that user side selects is imported; According to the result of classified each entries match of the sector in said query word and the taxonomy database, obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage.
Wherein, business tine does not contain the webpage of non-main business information such as news, advertisement, question and answer, and does not comprise new line on the webpage, advertisement or the icp information in zones such as upper left or below such as put on record.
Taxonomy database 320 is used to store every profession and trade grouped data.
Data acquisition module 410 is used to obtain all clauses and subclauses and the relevant information of said all webpages of industry.
Sorter 420 is used for morphology matching result with the keyword of each clauses and subclauses and the sector corresponding keyword of all categories and carries out weighted analysis and confirm classification under each clauses and subclauses; And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
Further also comprise: safety check module; Be used for chained address for said each the clauses and subclauses place to be classified that obtains; Check whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses;
And/or the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database;
And/or, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.
Further, also comprise: the keyword suggestion engine is used for returning the suggestion speech according to user's input speech; Described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
Further, also comprise: the intelligent correction engine is used for carrying out error correction for the query word of user error input.Wherein the intelligent correction engine can be included in the search engine.
Further, described data acquisition module comprises:
The data grabber is used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry;
Replenish the typing module, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.
Further, also comprise: interface provides module, is used for that the user behavior inlet that clauses and subclauses are relevant directly offers user side when category is given user side with the clauses and subclauses of each said classification and correlated information exhibition.
With reference to Fig. 7, show the structural representation of a kind of intelligent uprightness searching optimum system choosing of the application.
The user is through the relevant input speech of browser input; The keyword suggestion engine can return the suggestion speech relevant with importing speech according to wherein intelligence suggestion dictionary, and the user can select suggestion speech that oneself input speech or system give as query word; After the user has confirmed query word; Search engine returns to user's result for retrieval relevant with query word according to the class entry in the query word combining classification database and relevant information to be showed; Category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification when wherein showing, also can directly represent to the user by the user behavior interface that clauses and subclauses are relevant simultaneously; Wherein search engine also can carry out intelligent correction to user's query word through its intelligent correction engine.
Wherein, in network, grasp the web data through the data grabber, also can be through replenishing the data that typing module supplementary data grabber does not grab; Said data are transferred to the classification that sorter carries out various clauses and subclauses and relevant information thereof; Sorter can deposit classified data in the taxonomy database in then.
At this simultaneously; Safety check module can be carried out the first heavily safety inspection when data are obtained; Promptly in the chained address process of data grabber extracting data, carry out the first heavily safety inspection in the chained address process of the additional typing module supplementary data in family, the chained address of safety just can be put into sorter and classify; In taxonomy database, carry out the second heavily safety inspection, promptly safety check module constantly travels through the chained address of the data of inspection in the safety database, checks its whether safety, and the data of safety just can keep; When the user clicks the chained address of certain clauses and subclauses through browser; The safety inspection engine can carry out the triple safe inspection; The webpage of promptly user being clicked carries out the actual time safety inspection; For unsafe chained address, the safety inspection engine can be pointed out user's potential risk, and reporting system is with the related data deletion of this chained address.
The application has reduced the risk that the user poisons through search through setting up network address safety dynamic chek mechanism.Grasp the typing management platform through setting up, solved the single problem of Data Source, realized obtaining by all kinds of means data, make data more comprehensively with abundant.Through setting up automatic classification system, realized the automatic classification and the screening function of data.Through setting up keyword intelligence suggestion model, refinement the user search demand, realized offering user's Search Results more accurately.Through setting up the industry Data Mining Mechanism, realize that a key of general utility functions is through.Reached and let user security search, the user does not worry harmful network address among the result; Cover, station search just can be obtained whole industry information comprehensively; Precisely the result returns the real valuable results to the user, rather than lets the user in the magnanimity result, oneself select; Quick through: as to the user immediate access to be provided, to select and need not get into the page again.
For system embodiment, because it is similar basically with method embodiment, so description is fairly simple, relevant part gets final product referring to the part explanation of method embodiment.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and what each embodiment stressed all is and the difference of other embodiment that identical similar part is mutually referring to getting final product between each embodiment.
More than to a kind of vertical intelligent uprightness searching method and system that the application provided; Carried out detailed introduction; Used concrete example among this paper the application's principle and embodiment are set forth, the explanation of above embodiment just is used to help to understand the application's method and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to the application's thought, the part that on embodiment and range of application, all can change, in sum, this description should not be construed as the restriction to the application.

Claims (15)

1. an intelligent uprightness searching method is characterized in that, comprising:
Obtain the query word of importing in the industry of user side selection;
According to the result of classified each entries match of the sector in said query word and the taxonomy database, obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage;
Wherein, classified each clauses and subclauses of said the sector obtain through following steps:
Obtain all clauses and subclauses and the relevant information of said all webpages of industry;
The morphology matching result of the keyword in each clauses and subclauses and the sector corresponding keyword of all categories is carried out weighted analysis confirm classification under each clauses and subclauses;
And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
2. method according to claim 1 is characterized in that, described query word comprises:
With the keyword of user's input as query word;
Perhaps, one in the suggestion speech that the input speech of the user being selected according to this user returns as query word; Wherein, described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
3. method according to claim 1 is characterized in that, also comprises following safety detection step:
Steps A, for the chained address at said each clauses and subclauses place to be classified that obtains, check whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses;
And/or, step B, the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database;
And/or, step C, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.
4. method according to claim 3 is characterized in that, checks through safety inspection engine and wooden horse killing engine whether safe process is carried out through following steps in said chained address:
Step P1, for the chained address at said each the clauses and subclauses place to be classified that obtains, the inspection of safety inspection engine is submitted in this chained address, and whether it exists in the safe class storehouse;
Step P2 if exist and safety, then classifies to said clauses and subclauses;
Step P3 if exist but dangerous, then sends warning message, and filters out the related data of this chained address;
Step P4 if do not exist, then through this chained address of wooden horse killing engine inspection, judges that whether this link safety, if safety deposits this chained address in the safe class storehouse, and changes step P1 over to.
5. method according to claim 1 is characterized in that, also comprises following safety detection step:
Through put on record information and/or website real name Verification System of the chained address icp that comprises said clauses and subclauses said clauses and subclauses are carried out safety detection.
6. method according to claim 1 is characterized in that, when all clauses and subclauses of obtaining said all webpages of industry and relevant information, comprises:
Automatically grasp step, be used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry;
Replenish the typing step, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.
7. method according to claim 1 is characterized in that, when category is given user side with the clauses and subclauses of each said classification and correlated information exhibition, comprises:
The user behavior inlet that clauses and subclauses are relevant directly offers user side.
8. method according to claim 1 is characterized in that, also comprises:
The intelligent correction step, the query word of importing for user error through the intelligent correction engine carries out error correction.
9. method according to claim 1 is characterized in that:
The occurrence number ratio of each keyword i of said clauses and subclauses and the difference of the probability of occurrence of each keyword i of class entry; Through the logarithm weighting; Calculate the corresponding classification c deviate G (c) of clauses and subclauses, the more little then similarity of deviate is high more, gets minimum deviate place classification and confirms to classify under the clauses and subclauses.
10. an intelligent uprightness searching system is characterized in that, comprising:
Search engine is used for obtaining the query word that a industry that user side selects is imported; According to the result of classified each entries match of the sector in said query word and the taxonomy database, obtain the clauses and subclauses relevant of all categories, and category is given the user with the clauses and subclauses and the correlated information exhibition of each said classification with query word; Wherein, described clauses and subclauses are or multinomial business tine in the ecommerce webpage;
Taxonomy database is used to store every profession and trade grouped data;
Data acquisition module is used to obtain all clauses and subclauses and the relevant information of said all webpages of industry;
Sorter is used for morphology matching result with the keyword of each clauses and subclauses and the sector corresponding keyword of all categories and carries out weighted analysis and confirm classification under each clauses and subclauses; And/or, confirm classification under each clauses and subclauses according to the occurrence number ratio of each keyword of unfiled clauses and subclauses and the similarity of the probability of occurrence of each keyword of the class entry that obtains according to statistics in advance.
11. system according to claim 10 is characterized in that, also comprises:
Safety check module is used for the chained address for said each clauses and subclauses place to be classified that obtains, and checks whether safety of said chained address through safety inspection engine and wooden horse killing engine, if safety is then classified to said clauses and subclauses;
And/or the chained address for the classified clauses and subclauses place in the said taxonomy database constantly travels through each chained address through the safety inspection engine, if dangerous, then the related data of this chained address is deleted from taxonomy database;
And/or, the chained address at the clauses and subclauses place of clicking for the user, through this chained address of safety inspection engine inspection safety whether, if dangerous, then point out the user and in taxonomy database with the related data deletion of this chained address.
12. system according to claim 10 is characterized in that, also comprises:
The keyword suggestion engine is used for returning the suggestion speech according to user's input speech; Described suggestion speech obtains with corresponding result's click relation extraction through the input speech of user's input of statistics in advance.
13. system according to claim 10 is characterized in that, also comprises:
The intelligent correction engine is used for carrying out error correction for the query word of user error input.
14. system according to claim 10 is characterized in that, described data acquisition module comprises:
The data grabber is used for grasping automatically all clauses and subclauses and the relevant information of said all webpages of industry;
Replenish the typing module, be used for replenishing the clauses and subclauses and the relevant information of typing one webpage.
15. system according to claim 10 is characterized in that, also comprises:
Interface provides module, is used for that the user behavior inlet that clauses and subclauses are relevant directly offers user side when category is given user side with the clauses and subclauses of each said classification and correlated information exhibition.
CN201110300772.3A 2011-09-29 2011-09-29 Intelligent vertical search method and system Active CN102332025B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201110300772.3A CN102332025B (en) 2011-09-29 2011-09-29 Intelligent vertical search method and system
CN201410328560.XA CN104063514B (en) 2011-09-29 2011-09-29 Method for vertical search

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110300772.3A CN102332025B (en) 2011-09-29 2011-09-29 Intelligent vertical search method and system

Related Child Applications (2)

Application Number Title Priority Date Filing Date
CN201410328404.3A Division CN104063513A (en) 2011-09-29 2011-09-29 Intelligent vertical search method and system
CN201410328560.XA Division CN104063514B (en) 2011-09-29 2011-09-29 Method for vertical search

Publications (2)

Publication Number Publication Date
CN102332025A true CN102332025A (en) 2012-01-25
CN102332025B CN102332025B (en) 2014-08-27

Family

ID=45483801

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110300772.3A Active CN102332025B (en) 2011-09-29 2011-09-29 Intelligent vertical search method and system

Country Status (1)

Country Link
CN (1) CN102332025B (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102739653A (en) * 2012-06-06 2012-10-17 奇智软件(北京)有限公司 Detection method and device aiming at webpage address
CN103136339A (en) * 2013-02-01 2013-06-05 百度在线网络技术(北京)有限公司 Searching method, client-side and network server-side based on service information
CN103366002A (en) * 2013-07-17 2013-10-23 北京奇虎科技有限公司 Personalized vertical search method and device
WO2014022172A2 (en) * 2012-07-30 2014-02-06 Alibaba Group Holding Limited Information classification based on product recognition
CN103593386A (en) * 2013-08-14 2014-02-19 北京觅缘信息科技有限公司 Dating search solution
CN103793418A (en) * 2012-10-31 2014-05-14 珠海富讯网络科技有限公司 Search method of real-time vertical search engine for security industry
CN103942043A (en) * 2014-03-17 2014-07-23 可牛网络技术(北京)有限公司 Method and device for managing mobile terminal desktop icons
CN104063514A (en) * 2011-09-29 2014-09-24 北京奇虎科技有限公司 Vertical search method
CN104182467A (en) * 2014-07-21 2014-12-03 安徽华贞信息科技有限公司 Network data source detection method
CN104363206A (en) * 2014-10-28 2015-02-18 北京恩信创业科技有限公司 Cloud-computing ERP safety realizing method
CN105100053A (en) * 2015-05-29 2015-11-25 北京奇虎科技有限公司 Website security detection method, website security detection device and cloud monitoring system
CN105468782A (en) * 2015-12-21 2016-04-06 北京奇虎科技有限公司 Method and device for judging matching rate of query and resources
CN105468729A (en) * 2015-11-23 2016-04-06 深圳大粤网络视界有限公司 Internet mobile vertical search engine
CN106156116A (en) * 2015-04-07 2016-11-23 富士通株式会社 Information issuing method and system
CN106250369A (en) * 2016-07-28 2016-12-21 海信集团有限公司 voice interactive method, device and terminal
CN107066533A (en) * 2017-03-01 2017-08-18 北京奇艺世纪科技有限公司 Search inquiry error correction system and method
CN107301241A (en) * 2017-07-03 2017-10-27 北京奇艺世纪科技有限公司 Query word error correction method and device in a kind of video search
CN107423362A (en) * 2017-06-20 2017-12-01 阿里巴巴集团控股有限公司 Industry determines method, Method of Get Remote Object and device, client, server
CN108701014A (en) * 2016-03-09 2018-10-23 电子湾有限公司 Inquiry database for tail portion inquiry
CN110990598A (en) * 2019-11-18 2020-04-10 北京声智科技有限公司 Resource retrieval method and device, electronic equipment and computer-readable storage medium
CN111460119A (en) * 2020-03-27 2020-07-28 海信集团有限公司 Intelligent question and answer method and system for economic knowledge and intelligent equipment
CN112329423A (en) * 2020-11-05 2021-02-05 上海钐昆网络科技有限公司 ICP filing company classification method and device, electronic equipment and computer storage medium
CN113221004A (en) * 2021-05-21 2021-08-06 珠海金山网络游戏科技有限公司 Keyword display method and device
US20220222277A1 (en) * 2021-01-12 2022-07-14 Tealbook Inc. System and method for data profiling
US11593855B2 (en) 2015-12-30 2023-02-28 Ebay Inc. System and method for computing features that apply to infrequent queries

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1983255A (en) * 2006-05-17 2007-06-20 唐红春 Internet searching method
CN101266603A (en) * 2007-03-12 2008-09-17 北京搜狗科技发展有限公司 Webpage information sorting method, system and service system applying the classification
CN102012922A (en) * 2010-11-30 2011-04-13 无锡快度信息技术有限公司 Modeling method for industrial application model of universal vertical search engine
CN102043831A (en) * 2010-11-19 2011-05-04 四川圣望科技有限公司 Engine agency search method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1983255A (en) * 2006-05-17 2007-06-20 唐红春 Internet searching method
CN101266603A (en) * 2007-03-12 2008-09-17 北京搜狗科技发展有限公司 Webpage information sorting method, system and service system applying the classification
CN102043831A (en) * 2010-11-19 2011-05-04 四川圣望科技有限公司 Engine agency search method
CN102012922A (en) * 2010-11-30 2011-04-13 无锡快度信息技术有限公司 Modeling method for industrial application model of universal vertical search engine

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063514A (en) * 2011-09-29 2014-09-24 北京奇虎科技有限公司 Vertical search method
CN102739653A (en) * 2012-06-06 2012-10-17 奇智软件(北京)有限公司 Detection method and device aiming at webpage address
CN102739653B (en) * 2012-06-06 2015-05-20 北京奇虎科技有限公司 Detection method and device aiming at webpage address
WO2014022172A2 (en) * 2012-07-30 2014-02-06 Alibaba Group Holding Limited Information classification based on product recognition
WO2014022172A3 (en) * 2012-07-30 2014-06-26 Alibaba Group Holding Limited Information classification based on product recognition
CN103793418A (en) * 2012-10-31 2014-05-14 珠海富讯网络科技有限公司 Search method of real-time vertical search engine for security industry
CN103136339A (en) * 2013-02-01 2013-06-05 百度在线网络技术(北京)有限公司 Searching method, client-side and network server-side based on service information
CN103366002A (en) * 2013-07-17 2013-10-23 北京奇虎科技有限公司 Personalized vertical search method and device
CN103593386A (en) * 2013-08-14 2014-02-19 北京觅缘信息科技有限公司 Dating search solution
CN103942043A (en) * 2014-03-17 2014-07-23 可牛网络技术(北京)有限公司 Method and device for managing mobile terminal desktop icons
CN103942043B (en) * 2014-03-17 2017-11-07 可牛网络技术(北京)有限公司 A kind of method and device for managing mobile terminal desktop icon
CN104182467A (en) * 2014-07-21 2014-12-03 安徽华贞信息科技有限公司 Network data source detection method
CN104363206A (en) * 2014-10-28 2015-02-18 北京恩信创业科技有限公司 Cloud-computing ERP safety realizing method
CN106156116A (en) * 2015-04-07 2016-11-23 富士通株式会社 Information issuing method and system
CN105100053A (en) * 2015-05-29 2015-11-25 北京奇虎科技有限公司 Website security detection method, website security detection device and cloud monitoring system
CN105468729A (en) * 2015-11-23 2016-04-06 深圳大粤网络视界有限公司 Internet mobile vertical search engine
CN105468782A (en) * 2015-12-21 2016-04-06 北京奇虎科技有限公司 Method and device for judging matching rate of query and resources
CN105468782B (en) * 2015-12-21 2019-05-17 北京奇虎科技有限公司 A kind of method and device of the resource matched degree judgement of inquiry-
US11593855B2 (en) 2015-12-30 2023-02-28 Ebay Inc. System and method for computing features that apply to infrequent queries
CN108701014A (en) * 2016-03-09 2018-10-23 电子湾有限公司 Inquiry database for tail portion inquiry
CN106250369A (en) * 2016-07-28 2016-12-21 海信集团有限公司 voice interactive method, device and terminal
CN106250369B (en) * 2016-07-28 2019-08-06 海信集团有限公司 Voice interactive method, device and terminal
CN107066533B (en) * 2017-03-01 2020-10-27 北京奇艺世纪科技有限公司 Search query error correction system and method
CN107066533A (en) * 2017-03-01 2017-08-18 北京奇艺世纪科技有限公司 Search inquiry error correction system and method
CN107423362A (en) * 2017-06-20 2017-12-01 阿里巴巴集团控股有限公司 Industry determines method, Method of Get Remote Object and device, client, server
CN107301241A (en) * 2017-07-03 2017-10-27 北京奇艺世纪科技有限公司 Query word error correction method and device in a kind of video search
CN107301241B (en) * 2017-07-03 2020-06-05 北京奇艺世纪科技有限公司 Method and device for correcting error of query word in video search
CN110990598A (en) * 2019-11-18 2020-04-10 北京声智科技有限公司 Resource retrieval method and device, electronic equipment and computer-readable storage medium
CN111460119A (en) * 2020-03-27 2020-07-28 海信集团有限公司 Intelligent question and answer method and system for economic knowledge and intelligent equipment
CN111460119B (en) * 2020-03-27 2024-04-12 海信集团有限公司 Intelligent question-answering method and system for economic knowledge and intelligent equipment
CN112329423A (en) * 2020-11-05 2021-02-05 上海钐昆网络科技有限公司 ICP filing company classification method and device, electronic equipment and computer storage medium
US20220222277A1 (en) * 2021-01-12 2022-07-14 Tealbook Inc. System and method for data profiling
CN113221004A (en) * 2021-05-21 2021-08-06 珠海金山网络游戏科技有限公司 Keyword display method and device

Also Published As

Publication number Publication date
CN102332025B (en) 2014-08-27

Similar Documents

Publication Publication Date Title
CN102332025B (en) Intelligent vertical search method and system
CN104063513A (en) Intelligent vertical search method and system
US11100124B2 (en) Systems and methods for similarity and context measures for trademark and service mark analysis and repository searches
CN108304526B (en) Data processing method and device and server
CN103744981B (en) System for automatic classification analysis for website based on website content
US8935197B2 (en) Systems and methods for facilitating open source intelligence gathering
Sharifi et al. Summarizing microblogs automatically
US20170293864A1 (en) System and Method for Searching and Matching Content Over Social Networks Relevant to an Individual
EP2560111A2 (en) Systems and methods for facilitating the gathering of open source intelligence
CN103106275B (en) The text classification Feature Selection method of feature based distributed intelligence
Blanco et al. Repeatable and reliable semantic search evaluation
KR20160125375A (en) Method, system and storage medium for realizing intelligent answering of questions
CN105930469A (en) Hadoop-based individualized tourism recommendation system and method
US20160253428A1 (en) Searching user-created finite keyword profiles based on one keyword and metadata filters and randomness
Vidinli et al. New query suggestion framework and algorithms: A case study for an educational search engine
CN104899335A (en) Method for performing sentiment classification on network public sentiment of information
Mulder et al. Operationalizing framing to support multiperspective recommendations of opinion pieces
CN114254615A (en) Volume assembling method and device, electronic equipment and storage medium
US20180365324A1 (en) Method of data organization and data searching for use in constructing evidence-based beliefs
CN105528419B (en) Personalized search method based on modeling considering user-author relation
CN106997340A (en) The generation of dictionary and the Document Classification Method and device using dictionary
Kardkovács et al. The ferrety algorithm for the KDD Cup 2005 problem
CN104063514B (en) Method for vertical search
Harris Searching for Diverse Perspectives in News Articles: Using an LSTM Network to Classify Sentiment.
Sharma et al. Tourview: Sentiment based analysis on tourist domain

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: TIANJIN QISI TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: QIZHI SOFTWARE (BEIJING) CO., LTD.

Effective date: 20141125

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100016 CHAOYANG, BEIJING TO: 300384 NANKAI, TIANJIN

TR01 Transfer of patent right

Effective date of registration: 20141125

Address after: 300384 Tianjin hi New Technology Industrial Park Huayuan Industrial District No. 18 West North 2-102 industrial incubation -5

Patentee after: Tianjin Qi Si Science and Technology Ltd.

Address before: The 4 layer 100016 unit of Beijing city Chaoyang District Jiuxianqiao Road No. 14 Building C

Patentee before: Qizhi Software (Beijing) Co., Ltd.

CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 300000 Binhai high tech Zone, Tianjin Binhai hi tech Park Science and Technology Park, No. 39, No. six, No. 9-3-401

Patentee after: 360 Polytron Technologies Inc

Address before: 300384 Tianjin hi New Technology Industrial Park Huayuan Industrial District No. 18 West North 2-102 industrial incubation -5

Patentee before: Tianjin Qi Si Science and Technology Ltd.

CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 300000 Tianjin Binhai New Area Binhai Science Park hi-tech No. six 9-3-401 No. 39

Patentee after: 360 science and Technology Co., Ltd.

Address before: 300000 Binhai high tech Zone, Tianjin Binhai hi tech Park Science and Technology Park, No. 39, No. six, No. 9-3-401

Patentee before: 360 Polytron Technologies Inc

CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 300000 Tianjin Binhai New Area Binhai Science Park hi-tech No. six 9-3-401 No. 39

Patentee after: 3600 Technology Group Co., Ltd

Address before: 300000 Tianjin Binhai New Area Binhai Science Park hi-tech No. six 9-3-401 No. 39

Patentee before: 360 science and Technology Co.,Ltd.