CN106919588A - A kind of application program search system and method - Google Patents

A kind of application program search system and method Download PDF

Info

Publication number
CN106919588A
CN106919588A CN201510993113.0A CN201510993113A CN106919588A CN 106919588 A CN106919588 A CN 106919588A CN 201510993113 A CN201510993113 A CN 201510993113A CN 106919588 A CN106919588 A CN 106919588A
Authority
CN
China
Prior art keywords
application program
search
search word
keyword
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510993113.0A
Other languages
Chinese (zh)
Inventor
王振凯
曹国栋
唐竞胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201510993113.0A priority Critical patent/CN106919588A/en
Publication of CN106919588A publication Critical patent/CN106919588A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3338Query expansion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

It is used for the Back ground Information according to application program the invention discloses a kind of application program search system and method, including Distributor, obtains the basic keyword of application program;The Back ground Information of historical search record and application program according to each search word, obtains the matching keywords as application program with the search word of application matches;The keywords database of application program is generated according to the basic keyword and the matching keywords;User terminal, for obtaining the search keyword of input, and is sent to the Distributor by the search keyword;The Distributor, is additionally operable to, according to the search keyword for receiving, the search keyword be matched with the keywords database of each application program;And according to matching result, obtain application program corresponding with the search keyword and feed back to the user terminal.Application program search system disclosed by the invention and method, solve the problems, such as that application developers need the indexing key words by cumbersome operation selection application program.

Description

A kind of application program search system and method
Technical field
The present invention relates to search technique field, and in particular to a kind of application program search system and method.
Background technology
With the development of intelligent mobile terminal, increasing user downloads various APP in intelligent mobile terminal (application, application program) is used.Based on this kind of situation, application program distribution platform arises at the historic moment, and user can pass through Intelligent mobile terminal access application distribution platform, such as the application program delivery applications by being installed in intelligent mobile terminal Access application distribution platform is removed, such that it is able to download various application programs from platform.Wherein, application program delivery applications Such as various mobile phone assistants.
And in application program distribution platform, in order to be the application program owner for having popularization demand, such as application journey Sequence developer, the application program of application program owner can be applied in application program searched page with forward displaying Program owner can bid word as indexing key words for the purchase of these application programs.
But, the word of bidding of application developers purchase may in itself be mismatched with application program, be made flat using distribution The search engine of platform may be returned actually with the search word degree of correlation very when being retrieved according to the search word of user input The information of low application program, causes user to search during the application program with its demand, it is necessary to more operated, than Such as page turning operation, influence obtains the efficiency of the application program of its demand.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome above mentioned problem or at least in part solve on State the application program search system and method for problem.
On the one hand, the application provides a kind of application program search system, the system by an embodiment of the application Including:
Distributor, for the Back ground Information according to application program, obtains the basic keyword of application program;According to each The historical search record and the Back ground Information of application program of search word, obtain the search word with application matches as application journey The matching keywords of sequence;The keywords database of application program is generated according to the basic keyword and the matching keywords;
User terminal, the search keyword for obtaining input, and the search keyword is sent to the distribution clothes Business device;
The Distributor, be additionally operable to according to receive the search keyword, by the search keyword with respectively should Matched with the keywords database of program;And according to matching result, obtain application program corresponding with the search keyword simultaneously The user terminal is fed back to, to cause to show application program corresponding with the search keyword on the user terminal.
Optionally, the Distributor includes:
First matching keywords acquiring unit, for the search Download History in the search history of each search word record With the title and/or classification in the Back ground Information of application program, the search word with application matches is obtained as application journey The matching keywords of sequence.
Optionally, the Distributor includes:
Second matching keywords acquiring unit, searches for the description information in the Back ground Information according to application program and respectively Search word and the click relation of each application program in the search history record of rope word, obtain the search word with application matches As the matching keywords of application program.
Optionally, the Distributor includes:
3rd matching keywords acquiring unit, for the classification in the Back ground Information according to application program and each search word pair The classification answered, obtains the matching keywords as application program with the search word of application matches.
Optionally, the first matching keywords acquiring unit, specifically includes:
Text similarity acquiring unit, for each search word in search Download History, for calculating search word and application The text similarity between title in the Back ground Information of program;If the text similarity is more than first threshold, obtain The search word as application program matching keywords.
Optionally, the first matching keywords acquiring unit, specifically includes:
Independent access search word extraction unit, for each search word in search Download History, for judging the search Whether the independent access download time of word is more than Second Threshold, and classification and the application program of the search word Back ground Information In classification whether belong to same classification;If the independent access download time of the search word is more than the Second Threshold, And the classification of the search word belongs to same classification with the classification in the Back ground Information of application program, then obtain the search Word as application program matching keywords.
Optionally, the second matching keywords acquiring unit, specifically includes:
Application program theme distribution computing unit, for the description information in the Back ground Information of each application program, for leading to Cross the theme distribution that topic model calculates application program;
Search word theme distribution computing unit, to each search word, for being recorded according to search history in search word with it is each The click relation of application program, calculates the theme distribution of search word;
Theme similarity word extraction unit, the search word for volumes of searches more than the 3rd threshold value, for being searched according to The theme distribution of rope word and the theme distribution of application program, calculate the Topic Similarity between the search word and application program; If the Topic Similarity between the search word and application program is more than theme threshold value, the search word is obtained as application The matching keywords of program.
Optionally, the 3rd matching keywords acquiring unit, specifically includes:
Application program classification subdivision unit, for each one-level class application program now, for using one-level class now The description information of each application program, corresponding one-level class two grades of classifications now are divided into using grader by each application program;
Search word taxon, to each search word, for being recorded according to search history in search word and each application program Click relation, and two grades of classifications belonging to each application program calculate two grades of classifications corresponding to the search word;
Class heading search word extracts form unit, for two grades of classifications according to where application program, obtains to should two grades of classes Each search word of purpose is then as the matching keywords of application program.
Optionally, the Distributor includes:
Participle keyword extracting unit, participle operation is carried out for the title in the Back ground Information by application program, will be divided Word result as application program basic keyword.
Optionally, the Distributor includes:
Phonetic keyword extracting unit, for the name translation in the Back ground Information by application program be pinyin string and/or The word segmentation result that participle obtains is carried out by the title and is converted to pinyin string, closed the pinyin string as the basis of application program Keyword.
Optionally, the Distributor also includes:
Label keyword extracting unit, for using the label word of application program as application program basic keyword.
Optionally, the Distributor also includes:
Application program acquiring unit, for each application program, journey is applied specifically for being characterized in the matching result When there is the keyword matched with the search keyword in the keywords database of sequence, determine that the application program is closed with the search Keyword is corresponding, to obtain application program corresponding with the search keyword.
Optionally, the user terminal includes:
Search keyword acquiring unit, specifically for the input information according to user, obtains the search keyword.
On the other hand, the application provides a kind of application program searching method, the side by an embodiment of the application Method includes:
By Distributor according to the Back ground Information of application program, the basic keyword of application program is obtained;According to each The historical search record and the Back ground Information of application program of search word, obtain the search word with application matches as application journey The matching keywords of sequence;The keywords database of application program is generated according to the basic keyword and the matching keywords;
The search keyword of input is obtained by user terminal, and the search keyword is sent to the distribution service Device;
The search keyword received by the Distributor, by the search keyword and each application program Keywords database is matched;And according to matching result, obtain application program corresponding with the search keyword and feed back to institute User terminal is stated, to cause to show application program corresponding with the search keyword on the user terminal.
Optionally, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
The name in the Back ground Information for searching for Download History and application program in search history record according to each search word Claim and/or classification, obtain the matching keywords as application program with the search word of application matches.
Optionally, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
Search word in the search history record of description information and each search word in the Back ground Information of application program With the click relation of each application program, the matching keywords as application program with the search word of application matches are obtained.
Optionally, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
Classification and the corresponding classification of each search word in the Back ground Information of application program, obtain and application matches Search word as application program matching keywords.
Optionally, the acquisition is specifically wrapped with the search word of application matches as the matching keywords of application program Include:
For each search word in search Download History, for the name in the Back ground Information for calculating search word and application program Text similarity between referred to as;If the text similarity is more than first threshold, the search word is obtained as application journey The matching keywords of sequence.
Optionally, the acquisition is specifically wrapped with the search word of application matches as the matching keywords of application program Include:
For each search word in search Download History, judge whether the independent access download time of the search word is more than Whether Second Threshold, and the classification of the search word belongs to same classification with the classification in the Back ground Information of application program; If the independent access download time of the search word is more than the Second Threshold, and the search word classification and application journey Classification in the Back ground Information of sequence belongs to same classification, then obtain matching keywords of the search word as application program.
Optionally, the acquisition is specifically wrapped with the search word of application matches as the matching keywords of application program Include:
For the description information in the Back ground Information of each application program, the theme of application program is calculated by topic model Distribution;
To each search word, according to search word in search history record and the click relation of each application program, search is calculated The theme distribution of word;
Search word for volumes of searches more than the 3rd threshold value, the master of theme distribution and application program according to the search word Topic distribution, calculates the Topic Similarity between the search word and application program;If between the search word and application program Topic Similarity be more than theme threshold value, then obtain matching keywords of the search word as application program.
Optionally, the acquisition is specifically wrapped with the search word of application matches as the matching keywords of application program Include:
For each one-level class application program now, using the description information of one-level class each application program now, use Each application program is divided into corresponding one-level class two grades of classifications now by grader;
To each search word, according to the click relation of search word in search history record and each application program, and respectively should With two grades of classifications belonging to program, two grades of classifications corresponding to the search word are calculated;
Two grades of classifications according to where application program, obtain to should two grades of each search words of classification then as application program Matching keywords.
Optionally, the Back ground Information according to application program, obtains the basic keyword of application program, specifically includes:
Title in the Back ground Information of application program is carried out into participle operation, using word segmentation result as the basis of application program Keyword.
Optionally, the Back ground Information according to application program, obtains the basic keyword of application program, specifically includes:
Name translation in the Back ground Information of application program is carried out what participle was obtained for pinyin string and/or by the title Word segmentation result is converted to pinyin string, using the pinyin string as application program basic keyword.
Optionally, the Back ground Information according to application program, obtains the basic keyword of application program, specifically includes:
Using the label word of application program as application program basic keyword.
Optionally, it is described according to matching result, application program corresponding with the search keyword is obtained, specifically include:
For each application program, exist in the matching result characterizes the keywords database of application program and searched with described During the keyword that rope keyword matches, determine that the application program is corresponding with the search keyword, searched with described with obtaining The corresponding application program of rope keyword.
Optionally, the search keyword for obtaining input, specifically includes:
Input information according to user, obtains the search keyword.
One or more technical schemes provided in the embodiment of the present application, at least have the following technical effect that or advantage:
Application according to the present invention program search system and method, Distributor, according to the Back ground Information of application program, Obtain the basic keyword of application program;The Back ground Information of historical search record and application program according to each search word, obtains With the search word of application matches as application program matching keywords;Closed according to the basic keyword and the matching Keyword generates the keywords database of application program;User terminal, the search keyword for obtaining input, and the search is crucial Word is sent to the Distributor;The Distributor, it is according to the search keyword for receiving, the search is crucial Word is matched with the keywords database of each application program;And according to matching result, obtain answer corresponding with the search keyword With program and feed back to the user terminal, with cause to be shown on the user terminal it is corresponding with the search keyword should Use program;Generated because the keywords database of application program is basic keyword and matching keywords by application program, So that the keyword in the keywords database of application program is improved with the correlation of application program, application program is thus solved Developer needs the problem of the indexing key words by cumbersome operation selection application program, and because the index for selecting is crucial Word is incorrect, and the probability for causing application program to appear in the Search Results very low with the search word degree of correlation of user input is higher Problem, achieve can by the keywords database of application program automatically for application program automatically selects indexing key words, reduce Application developers effectively improve application program and appear in and user input to the selection course of application index keyword Search word degree of correlation Search Results higher in probability.
Brief description of the drawings
Technical scheme in order to illustrate more clearly the embodiments of the present invention, below will be to that will make needed for embodiment description Accompanying drawing is briefly described, it should be apparent that, drawings in the following description are some embodiments of the present invention, for this For the those of ordinary skill of field, on the premise of not paying creative work, can also obtain other according to these accompanying drawings Accompanying drawing.
Fig. 1 is the Organization Chart of the application program search system in the embodiment of the present invention;
Fig. 2 is the flow chart of application program searching method in the embodiment of the present invention.
Specific embodiment
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome above mentioned problem or at least in part solve on State the application program search system and method for problem.
In order to be better understood from above-mentioned technical proposal, below in conjunction with Figure of description and specific embodiment to upper Technical scheme is stated to be described in detail.
Illustrate first, herein presented term "and/or", only a kind of incidence relation for describing affiliated partner, table Show there may be three kinds of relations, for example, A and/or B, can represent:Individualism A, while there is A and B, individualism B this three The situation of kind.In addition, character "/" herein, typicallys represent forward-backward correlation pair as if a kind of relation of "or".
Referring to Fig. 1, the embodiment of the application one provides a kind of application program search system, and the system includes:
Distributor 10, for the Back ground Information according to application program, obtains the basic keyword of application program;According to The historical search record and the Back ground Information of application program of each search word, obtain the search word with application matches as application The matching keywords of program;The keywords database of application program is generated according to the basic keyword and the matching keywords;
User terminal 20, the search keyword for obtaining input, and the search keyword is sent to distribution service Device 10;
Distributor 10, is additionally operable to according to the search keyword for receiving, by the search keyword and each application The keywords database of program is matched;And according to matching result, obtain application program corresponding with the search keyword simultaneously anti- Feed user terminal 20, to cause to show application program corresponding with the search keyword on user terminal 20.
In embodiments of the present invention, owner of application program etc. can upload application program in Distributor 10, so The request for promoting the application program is sent to Distributor 10 afterwards.Distributor 10 upon receipt of the request, is generated The keywords database of the application program, wherein, the request of the above-mentioned popularization application program can be, application program owner can be to Certain application program that Distributor 10 is uploaded to it sends payment data.
Wherein, the Back ground Information of above-mentioned application program includes:The title of application program, the label of application program, using journey Classification belonging to the description information of sequence, application program etc..
Wherein, the label word of above-mentioned application program is the label word stamped for the application program in advance, such as " take journey Travelling " application program with artificial operation label:" tourism ", " train ticket ", " tourism strategy ", " air ticket ", " trip ", " wine Shop " etc..The description information of application program is the detailed description information of application program.Also, Distributor 10 can pre-set The classification such as classification, such as game class, sport category, for all application programs for uploading, in all being assigned to corresponding classification.
So in the embodiment of the present invention, correspondence application program can be directly extracted from the Back ground Information of application program Keyword.Keyword is extracted such as from title, keyword etc. is extracted from label word.
Further, when the keywords database of application program is generated, Distributor 10 should in basis for Distributor 10 With the Back ground Information of program, after obtaining the basic keyword of application program;Further according to each search word historical search record and The Back ground Information of application program, obtains the matching keywords as application program with the search word of application matches;Certainly The matching keywords of the basic keyword and application program that obtain application program can be simultaneously performed, page can first be obtained and apply journey The matching keywords of sequence, then the basic keyword of application program is obtained, the application is not specifically limited.
In specific implementation process, enable application program delivery applications in user terminal 20 and access Distributor 10.Than As user starts 360 mobile phone assistant in its mobile phone, 360 mobile phone assistant is then connected to Distributor 10.User can answer Search word is input into search box with program distribution application, the search word uploads to Distributor 10, Distributor 10 According to the search word and search application program Search Results and return to application program delivery applications, application program delivery applications then show Sequentially show the application program Search Results, user can click in Search Results and check or click on download application program. So in the search procedure of a large number of users, Distributor 10 can be recorded to the search history of each search word, be obtained To each search word search history record, such as Distributor 10 can be by the above-mentioned search history record of log recording.
And because some search words actually may carry out phase with application program Back ground Information in itself according to certain rule Close, therefore, Distributor 10 can be gone through according to the search of the Back ground Information of application program and each search word in the embodiment of the present invention The Records of the Historian is recorded, and obtains the matching keywords as application program with the search word of application matches.
Specifically, Distributor 10 is after the basic keyword and the matching keywords are obtained, according to described Basic keyword and the matching keywords, generate the keywords database of application program so that wrapped in the keywords database of application program The matching keywords of the basic keyword containing the application program and the application program;Then Distributor 10 can then be based on The keywords database of the application program builds the index for the application program, so as to user in its terminal with the application program When related search keyword is retrieved, can be sorted forward display.
Distributor 10 can perform aforesaid operations to each application program in advance so that each application program is present and it Corresponding keywords database.
User terminal 20, obtains the search keyword of input, and the search keyword is sent into Distributor 10, Wherein, user terminal 20 specifically includes search keyword acquiring unit, and the search keyword acquiring unit is used for according to user Input information, obtain the search keyword, then the search keyword is sent to by application program delivery applications Distributor 10.
In actual application, after application program delivery applications are opened in user terminal 20, get user and lead to Cross after the input information of the input blocks such as dummy keyboard, physical keyboard input, institute is directly obtained according to the input information Search keyword is stated, the input presentation of information of such as user is axxx, it is determined that the search keyword is axxx.
Distributor 10 receive user terminal 20 transmission the search keyword after, according to receive described in Search keyword, the search keyword is matched with the keywords database of each application program;And according to matching result, obtain Application program corresponding with the search keyword simultaneously feeds back to user terminal 20, to cause display and institute on user terminal 20 State the corresponding application program of search keyword.
In specific implementation process, application program acquiring unit can be set in Distributor 10, should for each With program, specifically for existing and the search keyword phase in the keywords database that application program is characterized in the matching result During the keyword matched somebody with somebody, determine that the application program is corresponding with the search keyword, it is corresponding with the search keyword to obtain Application program, in this way, matched with the search keyword to the keywords database of each application program, according to described Application program corresponding with the search keyword can be obtained with result, application journey corresponding with the search keyword is being got The quantity of sequence for it is multiple when, the degree of correlation according to the search keyword and application program is answered come pair corresponding with the search keyword It is ranked up with program.
In embodiments of the present invention, for aforementioned index, can be marked by advertisement and identifier in Distributor 10 It is popularization and application program, then when retrieving application program again, if the application program has advertisement and identifier, can be shifted to an earlier date Displaying.The advertisement and identifier such as " popularization ", " recommending ".Additionally, can set various advertisement and identifiers in the embodiment of the present invention, different is wide Accuse mark and possess different displaying weights.The displaying weight such as " promoted " is high, displaying of " recommending " the displaying weight less than " popularization " Weight.
Wherein, mark " popularization " and " recommending " printed words is popularization and application program, then love is advanced and managed money matters and favourable net financing It is popularization and application program.Search " financing " keyword represents above-mentioned application program.
In sum, the embodiment of the present invention, can be by distributing for the application program that application developers need to promote Server 10 extracts the corresponding basic keyword of application program automatically according to the Back ground Information of application program, and according to application journey The search history record of the Back ground Information of sequence and each search word, obtains the search word with application matches as application program Matching keywords, the keywords database of application program is then generated according to the basic keyword and the matching keywords;Again The search keyword of input is matched with the keywords database of each application program;According to matching result, obtain and the search The corresponding application program of keyword.First, said process can automatically for the application program of application developers is automatically selected Indexing key words, reduces selection course of the application developers to indexing key words.Secondly as the keyword of application program Storehouse is basic keyword and matching keywords by application program to be generated so that the pass in the keywords database of application program Keyword is improved with the correlation of application program such that it is able to which effectively reduction application program appears in the search with user input Probability in the very low Search Results of the word degree of correlation, effectively improves application program and appears in the search word degree of correlation with user input Probability in Search Results higher, improves the accuracy of search.
With continued reference to Fig. 1, another embodiment of the application provides a kind of application program search system and method, it is preferred that Distributor 10 can include:
Participle keyword extracting unit, participle operation is carried out for the title in the Back ground Information by application program, will be divided Word result as application program basic keyword.
In embodiments of the present invention, the Back ground Information of application program includes title, such as " takes journey travelling ", then the present invention can Directly to carry out participle operation to the title, after " taking journey travelling " participle, word segmentation result is " taking journey " and " travelling ", then can Using " journey will be taken " and " travelling " as the application program " taking journey travelling " basic keyword.
And/or, Distributor 10 can include:
Phonetic keyword extracting unit, for the name translation in the Back ground Information by application program be pinyin string and/or The word segmentation result that participle obtains is carried out by the title and is converted to pinyin string, closed the pinyin string as the basis of application program Keyword.
For the title of application program, can convert it directly to phonetic such as " xiechenglvxing ", or by its Word segmentation result is converted to phonetic, and such as the phonetic of " taking journey " is " xiecheng ", then these phonetics can be as the application program Basic keyword.
And/or, Distributor 10 can also include:
Label keyword extracting unit, for using the label word of application program as application program basic keyword.
For a default label word for application program, such as " journey is taken to travel " mark with artificial operation of application program Sign word:" tourism ", " train ticket ", " tourism strategy ", " air ticket ", " trip ", " hotel ", then can using these label words as The basic keyword of the application program.
Preferably, Distributor 10 can also include:
First matching keywords acquiring unit, for the search Download History in the search history of each search word record With the title and/or classification in the Back ground Information of application program, the search word with application matches is obtained as application journey The matching keywords of sequence.
In actual applications, user have input search word and scans in the terminal, and it may click on download application program It is likely to not download application program, then situation is downloaded in the search that Distributor 10 can then record each search word, such as User A searches for " financing ", and application program 1 has been downloaded in search results pages, and user B searches for " financing ", then may be in search Application program 2 is downloaded in result page, by the record of the search download behavior to a large number of users, then can have been obtained to each search word Search Download History.
In implementing, the search Download History is with storage in the form of searching for download log in Distributor 10.
So in the embodiment of the present invention, can according to search download log in extract search word, according to the search word with should With the relation between the title and/or classification of program, using related search word as the application program matching keywords.
Preferably, the first matching keywords acquiring unit, specifically includes:
Text similarity acquiring unit, for each search word in search Download History, for calculating search word and application The text similarity between title in the Back ground Information of program;If the text similarity is more than first threshold, obtain The search word as application program matching keywords.
The embodiment of the present invention can extract each search word for having used from search download log, calculate the search word The text similarity and title of application program between.Such as calculate the cosine between search word text and application name text Distance.
The embodiment of the present invention can set a first threshold for text similarity, if the text similarity is more than First threshold, then obtain matching keywords of the search word as the application program.If the text similarity is less than the One threshold value, then ignore the word.
Preferably, the first matching keywords acquiring unit, specifically includes:
Independent access search word extraction unit, for each search word in search Download History, for judging the search Whether the independent access download time of word is more than Second Threshold, and classification and the application program of the search word Back ground Information In classification whether belong to same classification;If the independent access download time of the search word is more than the Second Threshold, And the classification of the search word belongs to same classification with the classification in the Back ground Information of application program, then obtain the search Word as application program matching keywords.
For a search word in search download log, there may be search of multiple users in the search word of terminal display Download application program in result, and its terminal downloads that there is same IP multiple application programs or same application program Download repeatedly.And in order to reduce the influence that the terminal-pair search word of same IP downloads weight, the embodiment of the present invention is then counted The independent access download time of each search word, i.e. UV (Unique Visitor) is downloaded, even if the terminal of that is, same IP Download repeatedly, its UV download time is also only calculated once.Then for a search word, the terminal for counting how many IP is used The Search Results of the search word have downloaded application program.
Then, the embodiment of the present invention is provided with the Second Threshold for UV download times, if it is determined that under the UV of search word Carry number of times and be more than the Second Threshold, then can determine whether whether is classification in the classification of the search word and the Back ground Information of application program Belong to same classification, if now the classification of search word belongs to same class with the classification in the Back ground Information of application program Mesh, then using the search word as the application program matching keyword.And for a search word, its independent access download time Less than or equal to Second Threshold, and classification in the Back ground Information of its classification and application program is not belonging to same classification, can be with Ignore the search word.
Certainly, application program is classified in the embodiment of the present invention.For search word, it is also possible to which it is classified. The specific assorting process present invention is not any limitation as to it.Certain Distributor 10 can be using following steps to application program Classify with search word:
Sub-step A11, for each one-level class application program now, using the description of one-level class each application program now Information, corresponding one-level class two grades of classifications now are divided into using grader by each application program;
The default various classification in Distributor 10, the classification has been played class, physical culture since first-level class, such as Class.And in fact, application program for an one-level class now, can be carried out thinner according to the description information of its application program Classification.In actual applications, it is possible to use Bayes classifier is classified to description information, by one-level class now respectively should Each two grades of classes are assigned to program now.
Sub-step A12, to each search word, closes according to search word in search history record and the click of each application program System, and two grades of classifications belonging to each application program, calculate two grades of classifications corresponding to the search word.
In the search procedure of user, possible its details of checking application program are clicked in result of page searching and Do not download, it is also possible to click on lower application program.The embodiment of the present invention can according to the click relation of search word and each application program, With reference to two grades of application programs of classification of sub-step A12, each search word is also assigned into corresponding two grades of classes now.Certainly application Program also assists in assorting process.
The accounting that such as search word 1 clicks on the number of times of the application program in two grades of classifications 1 is more than accounting threshold value, then search this Rope word is grouped under two grades of classifications 1.
Above-mentioned search word and the click relation of each application program, can check it for search word with the click of each application program Between relation, or the click of search word and each application program download between relation, naturally it is also possible to for search word with The total relation between downloading is checked and clicked in the click of each application program.
And/or, Distributor 10 can also include:
Second matching keywords acquiring unit, searches for the description information in the Back ground Information according to application program and respectively Search word and the click relation of each application program in the search history record of rope word, obtain the search word with application matches As the matching keywords of application program.
The embodiment of the present invention can be according to the search in the search history of the description information of application program, each search word record Word and the click relation of each application program, go to calculate the topic relativity between application program and search word.Work as topic relativity During more than theme threshold value, then can using the search word as the application program matching keywords.Otherwise can then ignore this to search Rope word.
Preferably, the second matching keywords acquiring unit, specifically includes:
Application program theme distribution computing unit, for the description information in the Back ground Information of each application program, for leading to Cross the theme distribution that topic model calculates application program;
In the embodiment of the present invention, theme can be inputted using the description information of all of application program as input Model, calculates the theme distribution of each application program.
In implementing, because the description information of application program is actually an article, above-mentioned topic model can be with It is LDA (Latent Dirichlet Allocation, latent Dirichletal location theme) model.Can be right by LDA models Each article is analyzed, and obtains the theme distribution of each description information of correspondence, the i.e. probability distribution of each theme, such as theme 1 Probability be 0.6, the probability of theme 2 is 0.3, obtains a vector (0.6,0.4).
Search word theme distribution computing unit, to each search word, for being recorded according to search history in search word with it is each The click relation of application program, calculates the theme distribution of search word;
As it was previously stated, each search word has click relation with each application program, such as in the Search Results of a search word Which application program is clicked is checked, and/or which application program is clicked download.In this way, each search word point can be counted Which application program, number of clicks of each application program etc. are hit.
So because the application program in the application program theme distribution computing unit calculates theme distribution, then one The application program that individual search word can be clicked on according to it, indirectly determines the theme distribution of the search word.Such as search 1 is clicked on should With the accounting 0.8 of program 1, the accounting for clicking on application program 2 is 0.2, and the theme distribution of application program 1 is (0.6,0.4), (0.7,0.3), then the theme distribution of search word can be ((0.6+.07) * 0.8, (0.4+0.3) * 0.2).
Theme similarity word extraction unit, the search word for volumes of searches more than the 3rd threshold value, for being searched according to The theme distribution of rope word and the theme distribution of application program, calculate the Topic Similarity between the search word and application program; If the Topic Similarity between the search word and application program is more than theme threshold value, the search word is obtained as application The matching keywords of program.
In actual applications, small some the search word volumes of searches of some search word volumes of searches are big, for the application journey to be promoted For sequence, the big search word of volumes of searches is easier to make for promoting.Thus the present invention then counts each and searches in search history record The volumes of searches of rope word, and default 3rd threshold value, if the search word for volumes of searches more than the 3rd threshold value, just according to search word The theme distribution of theme distribution and application program, calculates the Topic Similarity between the search word and application program.
In embodiments of the present invention, for search word it is similar between theme distribution and the theme distribution of application program Degree, can be calculated using KL distances and/or JS distances.Wherein, KL distances are Kullback-Leibler divergence, and Claim relative entropy, for the two of discrete random variable probability distribution a P and Q, their KL divergences are defined as he:D(P|| Q)=Σ P (i) log (P (i)/Q (i)) ... formula(1).
It is bottom with 2 when wherein seeking log.
It is Jensen-Shannon divergence for JS distances, it is the prioritization scheme of KL distances, and its formula is:
... formula (2),
Wherein... (formula 3).Wherein, D is calculated using formula (1).
JSD values are between 0 to 1.Bigger to represent that two theme distributions are more consistent, similitude is higher.
The theme distribution of search word of the invention and the theme distribution of application program correspond to P and Q respectively, if the search Topic Similarity between word and application program is more than theme threshold value, then obtain the search word and closed as the matching of application program Keyword.
The embodiment of the present invention presets a theme threshold value, and the Topic Similarity between search word and application program is more than the master Topic threshold value, then obtain matching keywords of the corresponding search word as the application program.Conversely, then ignoring.
And/or, Distributor 10 can also include:
3rd matching keywords acquiring unit, for the classification in the Back ground Information according to application program and each search word pair The classification answered, obtains the matching keywords as application program with the search word of application matches.
Preferably, the 3rd matching keywords acquiring unit is specifically included:
Application program classification subdivision unit, for each one-level class application program now, for using one-level class now The description information of each application program, corresponding one-level class two grades of classifications now are divided into using grader by each application program;
Search word taxon, to each search word, for being recorded according to search history in search word and each application program Click relation, and two grades of classifications belonging to each application program calculate two grades of classifications corresponding to the search word;
Application program classification subdivision unit is similar with foregoing sub-step A11 and A12 with search word taxon.Due to similar Search word 1 click on two grades of classifications 1 in application program number of times accounting be more than accounting threshold value, then by the search word be grouped into this two , there are certain two grades of class now in the situation under level classification 1, the click accounting very little of search word, namely the search word is this two grades The probability of classification is small, then it can be removed from two grades of classifications.
After by search word two grades of classifications of correspondence, by should the small search word of probability of two grades of classifications delete, will be surplus The search word of two grades of remaining classifications is generated as a word bag, is then applied in class heading search word extracts form unit.
Class heading search word extracts form unit, for two grades of classifications according to where application program, obtains to should two grades of classes Each search word of purpose is then as the matching keywords of application program.
For application program, two grades of classes where calculating each application program due to application program classification subdivision unit Mesh, then two grades of classifications of application program also determine, determines two grades of word bags of the keyword of classification in search word taxon, So can using the word in the word bag as the application program matching keywords.
In embodiments of the present invention, the first matching keywords acquiring unit, second matching keywords obtain single First, described 3rd matching keywords acquiring unit each for all of search word calculate and obtains term, described the One matching keywords acquiring unit, the second matching keywords acquiring unit and the 3rd matching keywords acquiring unit can To be used alone, wherein several use can be selected, it is also possible to which selection is all used.The present invention is not limited to it.
Specifically, Distributor 10 by said units obtain the basic keyword and the matching keywords it Afterwards, the keywords database of application program is generated according to the basic keyword and the matching keywords;
In embodiments of the present invention, the basic keyword and matching keywords that obtain are combined for various, can be entered first Row normalization, identical keyword is merged, and is obtained after most simple keyword, and application program is generated according to most simple keyword Keywords database.
Distributor 10 can perform aforesaid operations to each application program in advance so that each application program is present and it Corresponding keywords database.
User terminal 20, obtains the search keyword of input, and the search keyword is sent into Distributor 10, Wherein, user terminal 20 specifically includes search keyword acquiring unit, and the search keyword acquiring unit is used for according to user Input information, obtain the search keyword, then the search keyword is sent to by application program delivery applications Distributor 10.
In actual application, after application program delivery applications are opened in user terminal 20, get user and lead to Cross after the input information of the input blocks such as dummy keyboard, physical keyboard input, institute is directly obtained according to the input information Search keyword is stated, the input presentation of information of such as user is axxx, it is determined that the search keyword is axxx.
Distributor 10 receive user terminal 20 transmission the search keyword after, according to receive described in Search keyword, the search keyword is matched with the keywords database of each application program;And according to matching result, obtain Application program corresponding with the search keyword simultaneously feeds back to user terminal 20, to cause display and institute on user terminal 20 State the corresponding application program of search keyword.
In specific implementation process, application program acquiring unit can be set in Distributor 10, should for each With program, specifically for existing and the search keyword phase in the keywords database that application program is characterized in the matching result During the keyword matched somebody with somebody, determine that the application program is corresponding with the search keyword, it is corresponding with the search keyword to obtain Application program, in this way, matched with the search keyword to the keywords database of each application program, according to described Application program corresponding with the search keyword can be obtained with result, application journey corresponding with the search keyword is being got The quantity of sequence for it is multiple when, the degree of correlation according to the search keyword and application program is answered come pair corresponding with the search keyword It is ranked up with program.
Distributor 10 can perform aforesaid operations to each application program in advance so that each application program is present and it Corresponding keywords database.
User terminal 20, obtains the search keyword of input, and the search keyword is sent into Distributor 10, Wherein, user terminal 20 specifically includes search keyword acquiring unit, and the search keyword acquiring unit is used for according to user Input information, obtain the search keyword, then the search keyword is sent to by application program delivery applications Distributor 10.
In actual application, after application program delivery applications are opened in user terminal 20, get user and lead to Cross after the input information of the input blocks such as dummy keyboard, physical keyboard input, institute is directly obtained according to the input information Search keyword is stated, the input presentation of information of such as user is axxx, it is determined that the search keyword is axxx.
Distributor 10 receive user terminal 20 transmission the search keyword after, according to receive described in Search keyword, the search keyword is matched with the keywords database of each application program;And according to matching result, obtain Application program corresponding with the search keyword simultaneously feeds back to user terminal 20, to cause display and institute on user terminal 20 State the corresponding application program of search keyword.
In specific implementation process, application program acquiring unit can be set in Distributor 10, should for each With program, specifically for existing and the search keyword phase in the keywords database that application program is characterized in the matching result During the keyword matched somebody with somebody, determine that the application program is corresponding with the search keyword, it is corresponding with the search keyword to obtain Application program, in this way, matched with the search keyword to the keywords database of each application program, according to described Application program corresponding with the search keyword can be obtained with result, application journey corresponding with the search keyword is being got The quantity of sequence for it is multiple when, the degree of correlation according to the search keyword and application program is answered come pair corresponding with the search keyword It is ranked up with program.
Based on said system identical technology design, the embodiment of the application one additionally provides a kind of application program searcher Method, referring to Fig. 2, methods described includes:
S201:By Distributor according to the Back ground Information of application program, the basic keyword of application program is obtained;Root According to the historical search record and the Back ground Information of application program of each search word, obtain with the search word of application matches as should With the matching keywords of program;The keywords database of application program is generated according to the basic keyword and the matching keywords;
S202:The search keyword of input is obtained by user terminal, and the search keyword is sent to distribution clothes Business device;
S203:The search keyword received by Distributor, by the search keyword and each application program Keywords database matched;And according to matching result, obtain application program corresponding with the search keyword and feed back to User terminal, to cause to show application program corresponding with the search keyword on the subscriber terminal.
Specifically, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
The name in the Back ground Information for searching for Download History and application program in search history record according to each search word Claim and/or classification, obtain the matching keywords as application program with the search word of application matches.
Specifically, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
Search word in the search history record of description information and each search word in the Back ground Information of application program With the click relation of each application program, the matching keywords as application program with the search word of application matches are obtained.
Specifically, the historical search record and the Back ground Information of application program according to each search word, obtains and application The search word of procedure match is specifically included as the matching keywords of application program:
Classification and the corresponding classification of each search word in the Back ground Information of application program, obtain and application matches Search word as application program matching keywords.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag Include:
For each search word in search Download History, for the name in the Back ground Information for calculating search word and application program Text similarity between referred to as;If the text similarity is more than first threshold, the search word is obtained as application journey The matching keywords of sequence.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag Include:
For each search word in search Download History, judge whether the independent access download time of the search word is more than Whether Second Threshold, and the classification of the search word belongs to same classification with the classification in the Back ground Information of application program; If the independent access download time of the search word is more than the Second Threshold, and the search word classification and application journey Classification in the Back ground Information of sequence belongs to same classification, then obtain matching keywords of the search word as application program.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag Include:
For the description information in the Back ground Information of each application program, the theme of application program is calculated by topic model Distribution;
To each search word, according to search word in search history record and the click relation of each application program, search is calculated The theme distribution of word;
Search word for volumes of searches more than the 3rd threshold value, the master of theme distribution and application program according to the search word Topic distribution, calculates the Topic Similarity between the search word and application program;If between the search word and application program Topic Similarity be more than theme threshold value, then obtain matching keywords of the search word as application program.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag Include:
For each one-level class application program now, using the description information of one-level class each application program now, use Each application program is divided into corresponding one-level class two grades of classifications now by grader;
To each search word, according to the click relation of search word in search history record and each application program, and respectively should With two grades of classifications belonging to program, two grades of classifications corresponding to the search word are calculated;
Two grades of classifications according to where application program, obtain to should two grades of each search words of classification then as application program Matching keywords.
Specifically, the Back ground Information according to application program, obtains the basic keyword of application program, specifically include:
Title in the Back ground Information of application program is carried out into participle operation, using word segmentation result as the basis of application program Keyword.
Specifically, the Back ground Information according to application program, obtains the basic keyword of application program, specifically include:
Name translation in the Back ground Information of application program is carried out what participle was obtained for pinyin string and/or by the title Word segmentation result is converted to pinyin string, using the pinyin string as application program basic keyword.
Specifically, the Back ground Information according to application program, obtains the basic keyword of application program, specifically include:
Using the label word of application program as application program basic keyword.
Specifically, it is described according to matching result, application program corresponding with the search keyword is obtained, specifically include:
For each application program, exist in the matching result characterizes the keywords database of application program and searched with described During the keyword that rope keyword matches, determine that the application program is corresponding with the search keyword, searched with described with obtaining The corresponding application program of rope keyword.
Specifically, the search keyword for obtaining input, specifically includes:
Input information according to user, obtains the search keyword.
Technical scheme in above-mentioned the embodiment of the present application, at least has the following technical effect that or advantage:
Application according to the present invention program search system and method, Distributor, according to the Back ground Information of application program, Obtain the basic keyword of application program;The Back ground Information of historical search record and application program according to each search word, obtains With the search word of application matches as application program matching keywords;Closed according to the basic keyword and the matching Keyword generates the keywords database of application program;User terminal, the search keyword for obtaining input, and the search is crucial Word is sent to the Distributor;The Distributor, it is according to the search keyword for receiving, the search is crucial Word is matched with the keywords database of each application program;And according to matching result, obtain answer corresponding with the search keyword With program and feed back to the user terminal, with cause to be shown on the user terminal it is corresponding with the search keyword should Use program;Generated because the keywords database of application program is basic keyword and matching keywords by application program, So that the keyword in the keywords database of application program is improved with the correlation of application program, application program is thus solved Developer needs the problem of the indexing key words by cumbersome operation selection application program, and because the index for selecting is crucial Word is incorrect, and the probability for causing application program to appear in the Search Results very low with the search word degree of correlation of user input is higher Problem, achieve can by the keywords database of application program automatically for application program automatically selects indexing key words, reduce Application developers effectively improve application program and appear in and user input to the selection course of application index keyword Search word degree of correlation Search Results higher in probability.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program Product.Therefore, the present invention can be using the reality in terms of complete hardware embodiment, complete software embodiment or combination software and hardware Apply the form of example.And, the present invention can be used and wherein include the computer of computer usable program code at one or more The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) is produced The form of product.
The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product Figure and/or block diagram are described.It should be understood that every first-class during flow chart and/or block diagram can be realized by computer program instructions The combination of flow and/or square frame in journey and/or square frame and flow chart and/or block diagram.These computer programs can be provided The processor of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that produced for reality by the instruction of computer or the computing device of other programmable data processing devices The device of the function of being specified in present one flow of flow chart or multiple one square frame of flow and/or block diagram or multiple square frames.
These computer program instructions may be alternatively stored in can guide computer or other programmable data processing devices with spy In determining the computer-readable memory that mode works so that instruction of the storage in the computer-readable memory is produced and include finger Make the manufacture of device, the command device realize in one flow of flow chart or multiple one square frame of flow and/or block diagram or The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that in meter Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented treatment, so as in computer or The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in individual square frame or multiple square frames.
, but those skilled in the art once know basic creation although preferred embodiments of the present invention have been described Property concept, then can make other change and modification to these embodiments.So, appended claims are intended to be construed to include excellent Select embodiment and fall into having altered and changing for the scope of the invention.
Obviously, those skilled in the art can carry out various changes and modification without deviating from essence of the invention to the present invention God and scope.So, if these modifications of the invention and modification belong to the scope of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to comprising these changes and modification.
The present invention discloses A1, a kind of application program search system, it is characterised in that the system includes:
Distributor, for the Back ground Information according to application program, obtains the basic keyword of application program;According to each The historical search record and the Back ground Information of application program of search word, obtain the search word with application matches as application journey The matching keywords of sequence;The keywords database of application program is generated according to the basic keyword and the matching keywords;
User terminal, the search keyword for obtaining input, and the search keyword is sent to the distribution clothes Business device;
The Distributor, be additionally operable to according to receive the search keyword, by the search keyword with respectively should Matched with the keywords database of program;And according to matching result, obtain application program corresponding with the search keyword simultaneously The user terminal is fed back to, to cause to show application program corresponding with the search keyword on the user terminal.
A2, the system as described in A1, it is characterised in that the Distributor includes:
First matching keywords acquiring unit, for the search Download History in the search history of each search word record With the title and/or classification in the Back ground Information of application program, the search word with application matches is obtained as application journey The matching keywords of sequence.
A3, the system as described in A1, it is characterised in that the Distributor includes:
Second matching keywords acquiring unit, searches for the description information in the Back ground Information according to application program and respectively Search word and the click relation of each application program in the search history record of rope word, obtain the search word with application matches As the matching keywords of application program.
A4, the system as described in A1, it is characterised in that the Distributor includes:
3rd matching keywords acquiring unit, for the classification in the Back ground Information according to application program and each search word pair The classification answered, obtains the matching keywords as application program with the search word of application matches.
A5, the system as described in A2, it is characterised in that the first matching keywords acquiring unit, specifically include:
Text similarity acquiring unit, for each search word in search Download History, for calculating search word and application The text similarity between title in the Back ground Information of program;If the text similarity is more than first threshold, obtain The search word as application program matching keywords.
A6, the system as described in A2, it is characterised in that the first matching keywords acquiring unit, specifically include:
Independent access search word extraction unit, for each search word in search Download History, for judging the search Whether the independent access download time of word is more than Second Threshold, and classification and the application program of the search word Back ground Information In classification whether belong to same classification;If the independent access download time of the search word is more than the Second Threshold, And the classification of the search word belongs to same classification with the classification in the Back ground Information of application program, then obtain the search Word as application program matching keywords.
A7, the system as described in A3, it is characterised in that the second matching keywords acquiring unit, specifically include:
Application program theme distribution computing unit, for the description information in the Back ground Information of each application program, for leading to Cross the theme distribution that topic model calculates application program;
Search word theme distribution computing unit, to each search word, for being recorded according to search history in search word with it is each The click relation of application program, calculates the theme distribution of search word;
Theme similarity word extraction unit, the search word for volumes of searches more than the 3rd threshold value, for being searched according to The theme distribution of rope word and the theme distribution of application program, calculate the Topic Similarity between the search word and application program; If the Topic Similarity between the search word and application program is more than theme threshold value, the search word is obtained as application The matching keywords of program.
A8, the system as described in A4, it is characterised in that the 3rd matching keywords acquiring unit, specifically include:
Application program classification subdivision unit, for each one-level class application program now, for using one-level class now The description information of each application program, corresponding one-level class two grades of classifications now are divided into using grader by each application program;
Search word taxon, to each search word, for being recorded according to search history in search word and each application program Click relation, and two grades of classifications belonging to each application program calculate two grades of classifications corresponding to the search word;
Class heading search word extracts form unit, for two grades of classifications according to where application program, obtains to should two grades of classes Each search word of purpose is then as the matching keywords of application program.
A9, the system as described in A1, it is characterised in that the Distributor includes:
Participle keyword extracting unit, participle operation is carried out for the title in the Back ground Information by application program, will be divided Word result as application program basic keyword.
A10, the system as described in A1, it is characterised in that the Distributor includes:
Phonetic keyword extracting unit, for the name translation in the Back ground Information by application program be pinyin string and/or The word segmentation result that participle obtains is carried out by the title and is converted to pinyin string, closed the pinyin string as the basis of application program Keyword.
A11, the system as described in A1, it is characterised in that the Distributor also includes:
Label keyword extracting unit, for using the label word of application program as application program basic keyword.
A12, the system as described in A1, it is characterised in that the Distributor also includes:
Application program acquiring unit, for each application program, journey is applied specifically for being characterized in the matching result When there is the keyword matched with the search keyword in the keywords database of sequence, determine that the application program is closed with the search Keyword is corresponding, to obtain application program corresponding with the search keyword.
A13, the system as described in A1, it is characterised in that the user terminal includes:
Search keyword acquiring unit, specifically for the input information according to user, obtains the search keyword.
B14, a kind of application program searching method, it is characterised in that methods described includes:
By Distributor according to the Back ground Information of application program, the basic keyword of application program is obtained;According to each The historical search record and the Back ground Information of application program of search word, obtain the search word with application matches as application journey The matching keywords of sequence;The keywords database of application program is generated according to the basic keyword and the matching keywords;
The search keyword of input is obtained by user terminal, and the search keyword is sent to the distribution service Device;
The search keyword received by the Distributor, by the search keyword and each application program Keywords database is matched;And according to matching result, obtain application program corresponding with the search keyword and feed back to institute User terminal is stated, to cause to show application program corresponding with the search keyword on the user terminal.
B15, the method as described in B14, it is characterised in that described to be recorded according to the historical search of each search word and apply journey The Back ground Information of sequence, obtains the search word with application matches as the matching keywords of application program, specifically includes:
The name in the Back ground Information for searching for Download History and application program in search history record according to each search word Claim and/or classification, obtain the matching keywords as application program with the search word of application matches.
B16, the method as described in B14, it is characterised in that described to be recorded according to the historical search of each search word and apply journey The Back ground Information of sequence, obtains the search word with application matches as the matching keywords of application program, specifically includes:
Search word in the search history record of description information and each search word in the Back ground Information of application program With the click relation of each application program, the matching keywords as application program with the search word of application matches are obtained.
B17, the method as described in B14, it is characterised in that described to be recorded according to the historical search of each search word and apply journey The Back ground Information of sequence, obtains the search word with application matches as the matching keywords of application program, specifically includes:
Classification and the corresponding classification of each search word in the Back ground Information of application program, obtain and application matches Search word as application program matching keywords.
B18, the method as described in B15, it is characterised in that the search word of the acquisition and application matches is used as application The matching keywords of program, specifically include:
For each search word in search Download History, for the name in the Back ground Information for calculating search word and application program Text similarity between referred to as;If the text similarity is more than first threshold, the search word is obtained as application journey The matching keywords of sequence.
B19, the method as described in B15, it is characterised in that the search word of the acquisition and application matches is used as application The matching keywords of program, specifically include:
For each search word in search Download History, judge whether the independent access download time of the search word is more than Whether Second Threshold, and the classification of the search word belongs to same classification with the classification in the Back ground Information of application program; If the independent access download time of the search word is more than the Second Threshold, and the search word classification and application journey Classification in the Back ground Information of sequence belongs to same classification, then obtain matching keywords of the search word as application program.
B20, the method as described in B16, it is characterised in that the search word of the acquisition and application matches is used as application The matching keywords of program, specifically include:
For the description information in the Back ground Information of each application program, the theme of application program is calculated by topic model Distribution;
To each search word, according to search word in search history record and the click relation of each application program, search is calculated The theme distribution of word;
Search word for volumes of searches more than the 3rd threshold value, the master of theme distribution and application program according to the search word Topic distribution, calculates the Topic Similarity between the search word and application program;If between the search word and application program Topic Similarity be more than theme threshold value, then obtain matching keywords of the search word as application program.
B21, the method as described in B17, it is characterised in that the search word of the acquisition and application matches is used as application The matching keywords of program, specifically include:
For each one-level class application program now, using the description information of one-level class each application program now, use Each application program is divided into corresponding one-level class two grades of classifications now by grader;
To each search word, according to the click relation of search word in search history record and each application program, and respectively should With two grades of classifications belonging to program, two grades of classifications corresponding to the search word are calculated;
Two grades of classifications according to where application program, obtain to should two grades of each search words of classification then as application program Matching keywords.
Journey is applied in B22, the method as described in B14, it is characterised in that the Back ground Information according to application program, acquisition The basic keyword of sequence, specifically includes:
Title in the Back ground Information of application program is carried out into participle operation, using word segmentation result as the basis of application program Keyword.
Journey is applied in B23, the method as described in B14, it is characterised in that the Back ground Information according to application program, acquisition The basic keyword of sequence, specifically includes:
Name translation in the Back ground Information of application program is carried out what participle was obtained for pinyin string and/or by the title Word segmentation result is converted to pinyin string, using the pinyin string as application program basic keyword.
Journey is applied in B24, the method as described in B14, it is characterised in that the Back ground Information according to application program, acquisition The basic keyword of sequence, specifically includes:
Using the label word of application program as application program basic keyword.
B25, the method as described in B14, it is characterised in that described according to matching result, obtain and the search keyword Corresponding application program, specifically includes:
For each application program, exist in the matching result characterizes the keywords database of application program and searched with described During the keyword that rope keyword matches, determine that the application program is corresponding with the search keyword, searched with described with obtaining The corresponding application program of rope keyword.
B26, the method as described in B14, it is characterised in that the search keyword of the acquisition input, specifically include:
Input information according to user, obtains the search keyword.

Claims (10)

1. a kind of application program search system, it is characterised in that the system includes:
Distributor, for the Back ground Information according to application program, obtains the basic keyword of application program;According to each search The historical search record and the Back ground Information of application program of word, obtain the search word with application matches as application program Matching keywords;The keywords database of application program is generated according to the basic keyword and the matching keywords;
User terminal, for obtaining the search keyword of input, and is sent to the Distributor by the search keyword;
The Distributor, is additionally operable to according to the search keyword for receiving, by the search keyword and each application journey The keywords database of sequence is matched;And according to matching result, obtain application program corresponding with the search keyword and feed back To the user terminal, to cause to show application program corresponding with the search keyword on the user terminal.
2. the system as claimed in claim 1, it is characterised in that the Distributor includes:
First matching keywords acquiring unit, for according to the search history of each search word record in search Download History and should With title and/or classification in the Back ground Information of program, the search word with application matches is obtained as application program Matching keywords.
3. the system as claimed in claim 1, it is characterised in that the Distributor includes:
Second matching keywords acquiring unit, for the description information in the Back ground Information according to application program and each search word Search history record in search word and each application program click relation, obtain and the search word conduct of application matches The matching keywords of application program.
4. the system as claimed in claim 1, it is characterised in that the Distributor includes:
3rd matching keywords acquiring unit, it is corresponding for the classification in the Back ground Information according to application program and each search word Classification, obtains the matching keywords as application program with the search word of application matches.
5. system as claimed in claim 2, it is characterised in that the first matching keywords acquiring unit, specifically includes:
Text similarity acquiring unit, for each search word in search Download History, for calculating search word and application program Back ground Information in title between text similarity;If the text similarity is more than first threshold, obtain described Search word as application program matching keywords.
6. system as claimed in claim 2, it is characterised in that the first matching keywords acquiring unit, specifically includes:
Independent access search word extraction unit, for each search word in search Download History, for judging the search word Whether independent access download time is more than in Second Threshold, and the classification of the search word and the Back ground Information of application program Whether classification belongs to same classification;If the independent access download time of the search word is more than the Second Threshold, and The classification of the search word belongs to same classification with the classification in the Back ground Information of application program, then obtain the search word and make It is the matching keywords of application program.
7. system as claimed in claim 3, it is characterised in that the second matching keywords acquiring unit, specifically includes:
Application program theme distribution computing unit, for the description information in the Back ground Information of each application program, for by master Topic model calculates the theme distribution of application program;
Search word theme distribution computing unit, to each search word, for being recorded according to search history in search word and each application The click relation of program, calculates the theme distribution of search word;
Theme similarity word extraction unit, the search word for volumes of searches more than the 3rd threshold value, for according to the search word Theme distribution and application program theme distribution, calculate the Topic Similarity between the search word and application program;If Topic Similarity between the search word and application program is more than theme threshold value, then obtain the search word as application program Matching keywords.
8. system as claimed in claim 4, it is characterised in that the 3rd matching keywords acquiring unit, specifically includes:
Application program classification subdivision unit, for each one-level class application program now, for utilization one-level class respectively should now With the description information of program, each application program is divided into by corresponding one-level class two grades of classifications now using grader;
Search word taxon, to each search word, for being recorded according to search history in search word and each application program point Relation, and two grades of classifications belonging to each application program are hit, two grades of classifications corresponding to the search word are calculated;
Class heading search word extracts form unit, for two grades of classifications according to where application program, obtains to should two grades of classifications Each search word is then as the matching keywords of application program.
9. the system as claimed in claim 1, it is characterised in that the Distributor includes:
Participle keyword extracting unit, participle operation is carried out for the title in the Back ground Information by application program, by participle knot Really as the basic keyword of application program.
10. a kind of application program searching method, it is characterised in that methods described includes:
By Distributor according to the Back ground Information of application program, the basic keyword of application program is obtained;According to each search The historical search record and the Back ground Information of application program of word, obtain the search word with application matches as application program Matching keywords;The keywords database of application program is generated according to the basic keyword and the matching keywords;
The search keyword of input is obtained by user terminal, and the search keyword is sent to the Distributor;
The search keyword received by the Distributor, by the key of the search keyword and each application program Dictionary is matched;And according to matching result, obtain application program corresponding with the search keyword and feed back to the use Family terminal, to cause to show application program corresponding with the search keyword on the user terminal.
CN201510993113.0A 2015-12-24 2015-12-24 A kind of application program search system and method Pending CN106919588A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510993113.0A CN106919588A (en) 2015-12-24 2015-12-24 A kind of application program search system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510993113.0A CN106919588A (en) 2015-12-24 2015-12-24 A kind of application program search system and method

Publications (1)

Publication Number Publication Date
CN106919588A true CN106919588A (en) 2017-07-04

Family

ID=59460223

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510993113.0A Pending CN106919588A (en) 2015-12-24 2015-12-24 A kind of application program search system and method

Country Status (1)

Country Link
CN (1) CN106919588A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107622090A (en) * 2017-08-22 2018-01-23 上海艾融软件股份有限公司 Acquisition methods, the apparatus and system of object
CN107767172A (en) * 2017-10-12 2018-03-06 百度在线网络技术(北京)有限公司 Information-pushing method, device, server and medium
CN108920652A (en) * 2018-07-03 2018-11-30 佛山市影腾科技有限公司 A kind of searching method, device and terminal
CN110196833A (en) * 2018-03-22 2019-09-03 腾讯科技(深圳)有限公司 Searching method, device, terminal and the storage medium of application program
CN110704729A (en) * 2019-09-09 2020-01-17 上海博泰悦臻网络技术服务有限公司 Application search method and cloud server
CN111488510A (en) * 2020-04-17 2020-08-04 支付宝(杭州)信息技术有限公司 Method and device for determining related words of small program, processing equipment and search system

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN101179472A (en) * 2007-05-31 2008-05-14 腾讯科技(深圳)有限公司 Network resource searching method and searching system
US20110219015A1 (en) * 2008-08-28 2011-09-08 Nhn Business Platform Corporation Searching method using extended keyword pool and system thereof
CN102236711A (en) * 2011-06-30 2011-11-09 百度在线网络技术(北京)有限公司 Method and equipment for determining displayed information corresponding to promotion keyword
CN102737045A (en) * 2011-04-08 2012-10-17 北京百度网讯科技有限公司 Method and device for relevancy computation
CN103914552A (en) * 2014-04-14 2014-07-09 百度在线网络技术(北京)有限公司 Method and device for retrieving applications
CN105095187A (en) * 2015-08-07 2015-11-25 广州神马移动信息科技有限公司 Search intention identification method and device
CN105117479A (en) * 2015-09-11 2015-12-02 北京金山安全软件有限公司 Acquisition method and processing method of user search behavior information and electronic equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN101179472A (en) * 2007-05-31 2008-05-14 腾讯科技(深圳)有限公司 Network resource searching method and searching system
US20110219015A1 (en) * 2008-08-28 2011-09-08 Nhn Business Platform Corporation Searching method using extended keyword pool and system thereof
CN102737045A (en) * 2011-04-08 2012-10-17 北京百度网讯科技有限公司 Method and device for relevancy computation
CN102236711A (en) * 2011-06-30 2011-11-09 百度在线网络技术(北京)有限公司 Method and equipment for determining displayed information corresponding to promotion keyword
CN103914552A (en) * 2014-04-14 2014-07-09 百度在线网络技术(北京)有限公司 Method and device for retrieving applications
CN105095187A (en) * 2015-08-07 2015-11-25 广州神马移动信息科技有限公司 Search intention identification method and device
CN105117479A (en) * 2015-09-11 2015-12-02 北京金山安全软件有限公司 Acquisition method and processing method of user search behavior information and electronic equipment

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107622090A (en) * 2017-08-22 2018-01-23 上海艾融软件股份有限公司 Acquisition methods, the apparatus and system of object
CN107622090B (en) * 2017-08-22 2020-10-16 上海艾融软件股份有限公司 Object acquisition method, device and system
CN107767172A (en) * 2017-10-12 2018-03-06 百度在线网络技术(北京)有限公司 Information-pushing method, device, server and medium
CN110196833A (en) * 2018-03-22 2019-09-03 腾讯科技(深圳)有限公司 Searching method, device, terminal and the storage medium of application program
CN110196833B (en) * 2018-03-22 2023-06-09 腾讯科技(深圳)有限公司 Application searching method, device, terminal and storage medium
CN108920652A (en) * 2018-07-03 2018-11-30 佛山市影腾科技有限公司 A kind of searching method, device and terminal
CN110704729A (en) * 2019-09-09 2020-01-17 上海博泰悦臻网络技术服务有限公司 Application search method and cloud server
CN111488510A (en) * 2020-04-17 2020-08-04 支付宝(杭州)信息技术有限公司 Method and device for determining related words of small program, processing equipment and search system
CN111488510B (en) * 2020-04-17 2023-09-29 支付宝(杭州)信息技术有限公司 Method and device for determining related words of applet, processing equipment and search system

Similar Documents

Publication Publication Date Title
CN106919575B (en) Application program searching method and device
CN106919588A (en) A kind of application program search system and method
CN106709040B (en) Application search method and server
US20190114668A1 (en) Application recommendation method and server
CN104111933B (en) Obtain business object label, set up the method and device of training pattern
CN102982153B (en) A kind of information retrieval method and device thereof
CN106445963B (en) Advertisement index keyword automatic generation method and device of APP platform
CN110532451A (en) Search method and device for policy text, storage medium, electronic device
CN105653562B (en) The calculation method and device of correlation between a kind of content of text and inquiry request
CN109299344A (en) The generation method of order models, the sort method of search result, device and equipment
CN105095187A (en) Search intention identification method and device
CN106982256A (en) Information-pushing method, device, equipment and storage medium
CN104951468A (en) Data searching and processing method and system
CN105023165A (en) Method, device and system for controlling release tasks in social networking platform
CN106294783A (en) A kind of video recommendation method and device
CN106415537A (en) Inserting native application search results into web search results
CN108319376B (en) Input association recommendation method and device for optimizing commercial word promotion
CN109409928A (en) A kind of material recommended method, device, storage medium, terminal
US11144594B2 (en) Search method, search apparatus and non-temporary computer-readable storage medium for text search
CN104778283B (en) A kind of user's occupational classification method and system based on microblogging
CN107818491A (en) Electronic installation, Products Show method and storage medium based on user's Internet data
CN108304490A (en) Text based similarity determines method, apparatus and computer equipment
CN107273391A (en) Document recommends method and apparatus
CN106445954A (en) Business object display method and apparatus
CN113570413A (en) Method and device for generating advertisement keywords, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170704