CN106919588A - Application program search system and method - Google Patents
- Publication number
- CN106919588A (application number CN201510993113.0A)
- Authority
- CN
- China
- Prior art keywords
- application program
- search
- search word
- keyword
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3338—Query expansion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses an application program search system and method. The system includes a distribution server and a user terminal. The distribution server obtains the basic keywords of an application from the application's basic information; obtains, from the historical search records of each search term and the application's basic information, the search terms that match the application as its matching keywords; and generates the application's keyword library from the basic keywords and the matching keywords. The user terminal obtains an input search keyword and sends it to the distribution server. The distribution server also matches the received search keyword against the keyword library of each application and, according to the matching result, obtains the applications corresponding to the search keyword and feeds them back to the user terminal. The disclosed system and method solve the problem that application developers must select an application's index keywords through cumbersome operations.
Description
Technical field
The present invention relates to the field of search technology, and in particular to an application program search system and method.
Background
With the development of smart mobile terminals, more and more users download and use various apps (applications) on them. Application distribution platforms have emerged in response. A user can access such a platform from a smart mobile terminal, for example through an application distribution app installed on the terminal, and download applications from the platform. Examples of application distribution apps include the various mobile phone assistants.
On an application distribution platform, an application owner who wants promotion, such as an application developer, can purchase bid words as index keywords for its applications, so that those applications are displayed near the top of the application search results page.
However, the bid words a developer purchases may not actually match the application. As a result, when the platform's search engine retrieves results for a search term entered by a user, it may return applications that have very low relevance to that term. The user then needs extra operations, such as paging through results, to find the application they want, which makes finding it less efficient.
Summary of the invention
In view of the above problems, the present invention is proposed to provide an application program search system and method that overcome, or at least partially solve, those problems.
In one aspect, an embodiment of the present application provides an application program search system, the system including:
a distribution server, configured to obtain the basic keywords of an application from the application's basic information; obtain, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords; and generate the application's keyword library from the basic keywords and the matching keywords;
a user terminal, configured to obtain an input search keyword and send the search keyword to the distribution server;
the distribution server being further configured to match the received search keyword against the keyword library of each application and, according to the matching result, obtain the applications corresponding to the search keyword and feed them back to the user terminal, so that the user terminal displays the applications corresponding to the search keyword.
Optionally, the distribution server includes:
a first matching keyword acquiring unit, configured to obtain, from the search-download records in the search history of each search term and the name and/or category in the application's basic information, the search terms that match the application as the application's matching keywords.
Optionally, the distribution server includes:
a second matching keyword acquiring unit, configured to obtain, from the description information in the application's basic information and the click relations between search terms and applications in the search history of each search term, the search terms that match the application as the application's matching keywords.
Optionally, the distribution server includes:
a third matching keyword acquiring unit, configured to obtain, from the category in the application's basic information and the category corresponding to each search term, the search terms that match the application as the application's matching keywords.
Optionally, the first matching keyword acquiring unit specifically includes:
a text similarity acquiring unit, configured to calculate, for each search term in the search-download records, the text similarity between the search term and the name in the application's basic information, and, if the text similarity is greater than a first threshold, to take the search term as a matching keyword of the application.
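The name-similarity rule above can be sketched as follows. This is only an illustration: the patent does not say which similarity measure or threshold value is used, so the `difflib` ratio and the value of `FIRST_THRESHOLD` are assumptions, and the function names are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical value; the patent names a "first threshold" but gives no number.
FIRST_THRESHOLD = 0.6


def text_similarity(search_term: str, app_name: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in metric, not the patent's."""
    return SequenceMatcher(None, search_term.lower(), app_name.lower()).ratio()


def matching_keywords_by_name(download_terms, app_name, threshold=FIRST_THRESHOLD):
    """Keep the search terms from the search-download records whose
    similarity to the application name is greater than the threshold."""
    return [t for t in download_terms if text_similarity(t, app_name) > threshold]
```

A misspelled query such as "ctrip travl" would pass the threshold for an application named "Ctrip Travel", while an unrelated query such as "weather" would not.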
Optionally, the first matching keyword acquiring unit specifically includes:
an independent-visit search term extraction unit, configured to determine, for each search term in the search-download records, whether the search term's independent-visit download count is greater than a second threshold and whether the search term's category is the same as the category in the application's basic information, and, if the download count is greater than the second threshold and the two categories are the same, to take the search term as a matching keyword of the application.
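A minimal sketch of this two-condition filter is shown below. It reads "independent-visit download count" as the number of distinct visitors who downloaded the application after searching for the term, which is an interpretation rather than something the text spells out. `TermStats`, `SECOND_THRESHOLD`, and the function name are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical value; the patent names a "second threshold" but gives no number.
SECOND_THRESHOLD = 100


@dataclass(frozen=True)
class TermStats:
    term: str              # the search term
    category: str          # category assigned to the search term
    unique_downloads: int  # independent-visit download count for this application


def matching_keywords_by_downloads(stats, app_category, threshold=SECOND_THRESHOLD):
    """Keep terms that pass both checks: download count above the
    threshold and the same category as the application."""
    return [
        s.term
        for s in stats
        if s.unique_downloads > threshold and s.category == app_category
    ]
```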
Optionally, the second matching keyword acquiring unit specifically includes:
an application topic distribution computing unit, configured to calculate, from the description information in each application's basic information, the application's topic distribution using a topic model;
a search term topic distribution computing unit, configured to calculate, for each search term, the search term's topic distribution from the click relations between the search term and the applications in the search history;
a topic similarity term extraction unit, configured to calculate, for each search term whose search volume is greater than a third threshold, the topic similarity between the search term and the application from the topic distribution of the search term and the topic distribution of the application, and, if the topic similarity is greater than a topic threshold, to take the search term as a matching keyword of the application.
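The topic-based rule can be illustrated with the sketch below. It assumes the application topic distributions have already been produced by some topic model and are passed in as plain lists. Two choices here are assumptions, not statements from the patent: a search term's topic distribution is taken as the click-weighted average of the distributions of the applications clicked for it, and similarity is measured with cosine similarity. All names are hypothetical.

```python
import math


def term_topic_distribution(clicks, app_topics):
    """clicks: {app_id: click_count}; app_topics: {app_id: [p_topic0, ...]}.
    Returns the click-weighted average distribution, or None without clicks."""
    total = sum(clicks.values())
    if total == 0:
        return None
    size = len(next(iter(app_topics.values())))
    dist = [0.0] * size
    for app_id, count in clicks.items():
        for k, p in enumerate(app_topics[app_id]):
            dist[k] += p * count / total
    return dist


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def matching_keywords_by_topic(term_clicks, term_volume, app_topics, app_id,
                               third_threshold, topic_threshold):
    """Keep sufficiently frequent terms whose topic distribution is close
    to the topic distribution of the given application."""
    result = []
    for term, clicks in term_clicks.items():
        if term_volume.get(term, 0) <= third_threshold:
            continue  # search volume too low
        dist = term_topic_distribution(clicks, app_topics)
        if dist is not None and cosine(dist, app_topics[app_id]) > topic_threshold:
            result.append(term)
    return result
```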
Optionally, the third matching keyword acquiring unit specifically includes:
an application category subdivision unit, configured to use a classifier, for the applications under each first-level category, to assign each application to a second-level category under that first-level category based on the description information of the applications under that first-level category;
a search term classification unit, configured to calculate, for each search term, the second-level category corresponding to the search term from the click relations between the search term and the applications in the search history and the second-level categories to which those applications belong;
a category search term extraction unit, configured to obtain, according to the second-level category to which the application belongs, the search terms corresponding to that second-level category as the application's matching keywords.
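The category-based rule might look like the sketch below. The classifier that assigns applications to second-level categories is out of scope here, so its output is passed in as a plain mapping. Assigning each search term to the second-level category that received the most clicks is an assumption; the patent only says the category is calculated from the click relations. All names are hypothetical.

```python
from collections import Counter, defaultdict


def term_secondary_category(clicks, app_subcategory):
    """clicks: {app_id: click_count}; app_subcategory: {app_id: category}.
    Returns the second-level category with the most clicks, or None."""
    votes = Counter()
    for app_id, count in clicks.items():
        votes[app_subcategory[app_id]] += count
    return votes.most_common(1)[0][0] if votes else None


def terms_by_subcategory(term_clicks, app_subcategory):
    """Group search terms under the second-level category assigned to them."""
    grouped = defaultdict(list)
    for term, clicks in term_clicks.items():
        category = term_secondary_category(clicks, app_subcategory)
        if category is not None:
            grouped[category].append(term)
    return dict(grouped)


def matching_keywords_by_category(app_id, term_clicks, app_subcategory):
    """Terms assigned to the application's own second-level category."""
    grouped = terms_by_subcategory(term_clicks, app_subcategory)
    return grouped.get(app_subcategory[app_id], [])
```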
Optionally, the distribution server includes:
a word segmentation keyword extraction unit, configured to perform word segmentation on the name in the application's basic information and to take the segmentation results as basic keywords of the application.
Optionally, the distribution server includes:
a pinyin keyword extraction unit, configured to convert the name in the application's basic information into a pinyin string and/or convert the segmentation results obtained from the name into pinyin strings, and to take the pinyin strings as basic keywords of the application.
Optionally, the distribution server further includes:
a tag keyword extraction unit, configured to take the application's tag words as basic keywords of the application.
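The three sources of basic keywords (segmented name, pinyin strings, tag words) can be combined as in the sketch below. Two simplifications are assumptions made to keep the example self-contained: whitespace splitting stands in for a real word segmenter, and pinyin conversion uses a caller-supplied character table (`pinyin_table`) rather than a real conversion library. The function names are hypothetical.

```python
def to_pinyin(text, pinyin_table):
    """Concatenate per-character pinyin from the table.
    Returns None if the text is empty or any character is missing."""
    syllables = []
    for ch in text:
        if ch.isspace():
            continue
        if ch not in pinyin_table:
            return None
        syllables.append(pinyin_table[ch])
    return "".join(syllables) or None


def basic_keywords(name, tags, pinyin_table):
    """Collect basic keywords from the name tokens, their pinyin, and the tags."""
    tokens = name.split()  # stand-in: a real system would use a word segmenter
    keywords = set(tokens)
    for text in [name, *tokens]:
        pinyin = to_pinyin(text, pinyin_table)
        if pinyin is not None:
            keywords.add(pinyin)
    keywords.update(tags)
    return keywords
```

For example, with a table that maps 携, 程, 旅, 行 to "xie", "cheng", "lv", "xing", the pre-segmented name "携程 旅行" yields the tokens 携程 and 旅行 plus the pinyin strings "xiecheng", "lvxing", and "xiechenglvxing".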
Optionally, the distribution server further includes:
an application acquiring unit, configured to determine, for each application, that the application corresponds to the search keyword when the matching result indicates that the application's keyword library contains a keyword matching the search keyword, so as to obtain the applications corresponding to the search keyword.
Optionally, the user terminal includes:
a search keyword acquiring unit, configured to obtain the search keyword from the user's input information.
In another aspect, an embodiment of the present application provides an application program search method, the method including:
obtaining, by a distribution server, the basic keywords of an application from the application's basic information; obtaining, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords; and generating the application's keyword library from the basic keywords and the matching keywords;
obtaining, by a user terminal, an input search keyword and sending the search keyword to the distribution server;
matching, by the distribution server, the received search keyword against the keyword library of each application and, according to the matching result, obtaining the applications corresponding to the search keyword and feeding them back to the user terminal, so that the user terminal displays the applications corresponding to the search keyword.
Optionally, obtaining, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords specifically includes:
obtaining, from the search-download records in the search history of each search term and the name and/or category in the application's basic information, the search terms that match the application as the application's matching keywords.
Optionally, obtaining, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords specifically includes:
obtaining, from the description information in the application's basic information and the click relations between search terms and applications in the search history of each search term, the search terms that match the application as the application's matching keywords.
Optionally, obtaining, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords specifically includes:
obtaining, from the category in the application's basic information and the category corresponding to each search term, the search terms that match the application as the application's matching keywords.
Optionally, obtaining the search terms that match the application as the application's matching keywords specifically includes:
for each search term in the search-download records, calculating the text similarity between the search term and the name in the application's basic information; and, if the text similarity is greater than a first threshold, taking the search term as a matching keyword of the application.
Optionally, obtaining the search terms that match the application as the application's matching keywords specifically includes:
for each search term in the search-download records, determining whether the search term's independent-visit download count is greater than a second threshold and whether the search term's category is the same as the category in the application's basic information; and, if the download count is greater than the second threshold and the two categories are the same, taking the search term as a matching keyword of the application.
Optionally, obtaining the search terms that match the application as the application's matching keywords specifically includes:
calculating, from the description information in each application's basic information, the application's topic distribution using a topic model;
for each search term, calculating the search term's topic distribution from the click relations between the search term and the applications in the search history;
for each search term whose search volume is greater than a third threshold, calculating the topic similarity between the search term and the application from the topic distribution of the search term and the topic distribution of the application; and, if the topic similarity is greater than a topic threshold, taking the search term as a matching keyword of the application.
Optionally, obtaining the search terms that match the application as the application's matching keywords specifically includes:
for the applications under each first-level category, using a classifier to assign each application to a second-level category under that first-level category based on the description information of the applications under that first-level category;
for each search term, calculating the second-level category corresponding to the search term from the click relations between the search term and the applications in the search history and the second-level categories to which those applications belong;
obtaining, according to the second-level category to which the application belongs, the search terms corresponding to that second-level category as the application's matching keywords.
Optionally, obtaining the basic keywords of the application from the application's basic information specifically includes:
performing word segmentation on the name in the application's basic information and taking the segmentation results as basic keywords of the application.
Optionally, obtaining the basic keywords of the application from the application's basic information specifically includes:
converting the name in the application's basic information into a pinyin string and/or converting the segmentation results obtained from the name into pinyin strings, and taking the pinyin strings as basic keywords of the application.
Optionally, obtaining the basic keywords of the application from the application's basic information specifically includes:
taking the application's tag words as basic keywords of the application.
Optionally, obtaining, according to the matching result, the applications corresponding to the search keyword specifically includes:
for each application, when the matching result indicates that the application's keyword library contains a keyword matching the search keyword, determining that the application corresponds to the search keyword, so as to obtain the applications corresponding to the search keyword.
Optionally, obtaining the input search keyword specifically includes:
obtaining the search keyword from the user's input information.
The one or more technical solutions provided in the embodiments of the present application have at least the following technical effects or advantages:
In the application program search system and method according to the present invention, the distribution server obtains the basic keywords of an application from the application's basic information; obtains, from the historical search records of each search term and the application's basic information, the search terms that match the application as its matching keywords; and generates the application's keyword library from the basic keywords and the matching keywords. The user terminal obtains an input search keyword and sends it to the distribution server. The distribution server matches the received search keyword against the keyword library of each application and, according to the matching result, obtains the applications corresponding to the search keyword and feeds them back to the user terminal, so that the user terminal displays them. Because an application's keyword library is generated from the application's own basic keywords and matching keywords, the keywords in the library are more relevant to the application. This solves two problems: application developers having to select an application's index keywords through cumbersome operations, and, when the selected index keywords are wrong, the application being likely to appear in search results that have very low relevance to the search term entered by the user. Index keywords are selected for the application automatically through its keyword library, which reduces the selection work required of application developers and effectively raises the probability that the application appears in search results that are highly relevant to the search term entered by the user.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings needed in the description of the embodiments are briefly introduced below. The drawings described below show some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is an architecture diagram of the application program search system in an embodiment of the present invention;
Fig. 2 is a flow chart of the application program search method in an embodiment of the present invention.
Detailed description
In view of the above problems, the present invention is proposed to provide an application program search system and method that overcome, or at least partially solve, those problems.
For a better understanding of the above technical solutions, they are described in detail below with reference to the drawings and specific embodiments.
It should first be noted that the term "and/or" herein merely describes an association between associated objects and indicates that three relations are possible. For example, "A and/or B" can mean A alone, both A and B, or B alone. In addition, the character "/" herein generally indicates an "or" relation between the objects before and after it.
Referring to Fig. 1, an embodiment of the present application provides an application program search system, the system including:
a distribution server 10, configured to obtain the basic keywords of an application from the application's basic information; obtain, from the historical search records of each search term and the application's basic information, the search terms that match the application as the application's matching keywords; and generate the application's keyword library from the basic keywords and the matching keywords;
a user terminal 20, configured to obtain an input search keyword and send the search keyword to the distribution server 10;
the distribution server 10 being further configured to match the received search keyword against the keyword library of each application and, according to the matching result, obtain the applications corresponding to the search keyword and feed them back to the user terminal 20, so that the user terminal 20 displays the applications corresponding to the search keyword.
In an embodiment of the present invention, an application owner or the like can upload an application to the distribution server 10 and then send the distribution server 10 a request to promote the application. After receiving the request, the distribution server 10 generates the application's keyword library. The request to promote the application may be, for example, the application owner sending the distribution server 10 payment data for an application it has uploaded.
The basic information of an application includes the application's name, its tags, its description information, the category it belongs to, and so on.
The tag words of an application are tags assigned to the application in advance. For example, the "Ctrip Travel" application carries manually assigned tags such as "tourism", "train tickets", "travel guides", "air tickets", "travel", and "hotels". The description information is the detailed description of the application. In addition, the distribution server 10 can define categories in advance, such as games and sports, and every uploaded application is assigned to the corresponding category.
Accordingly, in an embodiment of the present invention, the keywords of an application can be extracted directly from the application's basic information, for example from the name or from the tag words.
Further, when generating an application's keyword library, the distribution server 10 may first obtain the application's basic keywords from its basic information and then obtain, from the historical search records of each search term and the application's basic information, the search terms that match the application as its matching keywords. The basic keywords and the matching keywords may of course be obtained at the same time, or the matching keywords may be obtained first and the basic keywords afterwards; the present application places no specific limit on the order.
In a specific implementation, the user terminal 20 launches an application distribution app to access the distribution server 10. For example, when a user starts 360 Mobile Assistant on a mobile phone, 360 Mobile Assistant connects to the distribution server 10. The user can enter a search term in the search box of the application distribution app, and the search term is uploaded to the distribution server 10. The distribution server 10 searches for applications according to the search term and returns the application search results to the application distribution app, which displays them in order. The user can then click a result to view it or click to download the application.
As large numbers of users search, the distribution server 10 can record the search history of each search term and thus obtain a search history record for each search term; for example, the distribution server 10 can record the search history in logs.
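As an illustration of how logged search activity could be turned into per-term history records, the sketch below aggregates a flat event log into the three quantities used elsewhere in this document: search volume, click relations, and independent-visit download counts. The record layout `(user_id, search_term, action, app_id)` and the action names are assumptions; the patent does not describe a log format.

```python
from collections import defaultdict


def aggregate_search_log(records):
    """records: iterable of (user_id, search_term, action, app_id) tuples,
    where action is "search", "click", or "download" (app_id is None for
    a plain search). Returns (volume, clicks, unique_downloads)."""
    volume = defaultdict(int)                       # term -> number of searches
    clicks = defaultdict(lambda: defaultdict(int))  # term -> app -> click count
    downloaders = defaultdict(set)                  # (term, app) -> distinct users
    for user_id, term, action, app_id in records:
        if action == "search":
            volume[term] += 1
        elif action == "click":
            clicks[term][app_id] += 1
        elif action == "download":
            downloaders[(term, app_id)].add(user_id)
    unique_downloads = {key: len(users) for key, users in downloaders.items()}
    return dict(volume), {t: dict(c) for t, c in clicks.items()}, unique_downloads
```

Counting distinct users per (term, application) pair means repeated downloads by the same user are counted once.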
Because some search terms are in fact related, according to certain rules, to the basic information of an application itself, the distribution server 10 in an embodiment of the present invention can obtain, from the application's basic information and the search history of each search term, the search terms that match the application as its matching keywords.
Specifically, after obtaining the basic keywords and the matching keywords, the distribution server 10 generates the application's keyword library from them, so that the library contains both the application's basic keywords and its matching keywords. The distribution server 10 can then build an index for the application based on its keyword library, so that when a user searches on a terminal with a search keyword related to the application, the application is ranked and displayed near the top.
The distribution server 10 can perform the above operations on every application in advance, so that each application has its own corresponding keyword library.
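One way to picture the keyword library and the index built from it is the sketch below. The text does not describe the index structure, so the inverted index (keyword to set of application ids) and the case-insensitive exact lookup are assumptions, and all function names are hypothetical.

```python
from collections import defaultdict


def build_keyword_library(basic_keywords, matching_keywords):
    """The library is the union of the basic and the matching keywords."""
    return set(basic_keywords) | set(matching_keywords)


def build_index(keyword_libraries):
    """keyword_libraries: {app_id: set of keywords}.
    Returns an inverted index {lowercased keyword: set of app ids}."""
    index = defaultdict(set)
    for app_id, library in keyword_libraries.items():
        for keyword in library:
            index[keyword.lower()].add(app_id)
    return dict(index)


def search(index, search_keyword):
    """Ids of applications whose library contains the search keyword."""
    return set(index.get(search_keyword.strip().lower(), set()))
```

Building the index ahead of time for every application means a query is a single dictionary lookup instead of a scan over every keyword library.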
The user terminal 20 obtains the input search keyword and sends it to the distribution server 10. Specifically, the user terminal 20 includes a search keyword acquiring unit, which obtains the search keyword from the user's input information; the search keyword is then sent to the distribution server 10 through the application distribution app.
In practice, after the application distribution app is opened on the user terminal 20 and the user's input information is received through an input unit such as a virtual keyboard or a physical keyboard, the search keyword is obtained directly from that input information. For example, if the user's input information is displayed as axxx, the search keyword is determined to be axxx.
After receiving the search keyword sent by the user terminal 20, the distribution server 10 matches the search keyword against the keyword library of each application and, according to the matching result, obtains the applications corresponding to the search keyword and feeds them back to the user terminal 20, so that the user terminal 20 displays the applications corresponding to the search keyword.
In a specific implementation, an application acquiring unit can be provided in the distribution server 10. For each application, when the matching result indicates that the application's keyword library contains a keyword matching the search keyword, the unit determines that the application corresponds to the search keyword. In this way, the search keyword is matched against the keyword library of each application, and the applications corresponding to the search keyword are obtained from the matching result. When more than one application corresponds to the search keyword, those applications are sorted by the relevance between the search keyword and each application.
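The sorting step can be sketched as below. The patent does not define how relevance is computed, so the relevance function is supplied by the caller; the word-overlap (Jaccard) score over application names is only a hypothetical example of such a function.

```python
def jaccard(a, b):
    """Word-overlap score in [0, 1] between two strings (illustrative only)."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    union = words_a | words_b
    return len(words_a & words_b) / len(union) if union else 0.0


def rank_results(search_keyword, candidate_ids, relevance):
    """Sort matched applications by descending relevance.
    relevance: callable (search_keyword, app_id) -> float.
    The sort is stable, so ties keep their original order."""
    return sorted(
        candidate_ids,
        key=lambda app_id: relevance(search_keyword, app_id),
        reverse=True,
    )
```

A caller could, for instance, pass `lambda keyword, app_id: jaccard(keyword, names[app_id])`, where `names` is a hypothetical mapping from application id to application name.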
In an embodiment of the present invention, an application in the above index can be marked as a promoted application in the distribution server 10 by means of an advertisement label. When applications are later retrieved, an application that carries an advertisement label can be displayed ahead of others. Advertisement labels include, for example, "Promoted" and "Recommended". Several advertisement labels can be defined in an embodiment of the present invention, each with a different display weight. For example, "Promoted" has a high display weight, and the display weight of "Recommended" is lower than that of "Promoted".
Here, an application program marked with the words "promoted" or "recommended" is a promoted application program, so the two wealth-management application programs in the example ("Ai Qian Jin Wealth Management" and "Youli Wang Wealth Management") are promoted application programs. Searching for the keyword "wealth management" displays these application programs.
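The display-weight rule can be sketched as a stable sort. This is a minimal illustration rather than the patented implementation: the numeric weights, the `ad_identifier` field name, and the function name `order_for_display` are assumptions; the text only states that "promoted" outranks "recommended" and that marked application programs are shown earlier.

```python
# Illustrative weights (assumption): the text only says that "promoted"
# has a higher display weight than "recommended".
DISPLAY_WEIGHT = {"promoted": 2, "recommended": 1}


def order_for_display(results):
    """Move application programs carrying an advertisement identifier forward.

    `results` is a list of dicts; the optional "ad_identifier" key is a
    hypothetical field name. Unmarked results keep their original order
    because Python's sort is stable.
    """
    def weight(result):
        return DISPLAY_WEIGHT.get(result.get("ad_identifier"), 0)

    return sorted(results, key=weight, reverse=True)
```

Because the sort is stable, results with the same identifier (or none) stay in whatever relevance order they already had.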
In summary, in an embodiment of the present invention, for an application program that an application developer needs to promote, distribution server 10 can automatically extract the basic keywords of the application program from its basic information and, based on the basic information of the application program and the search history records of the search words, obtain search words matching the application program as its matching keywords; the keyword library of the application program is then generated from the basic keywords and the matching keywords. The input search keyword is then matched against the keyword library of each application program, and the application programs corresponding to the search keyword are obtained from the matching result. First, this process selects index keywords for the developer's application program automatically, sparing the developer the work of selecting index keywords. Second, because the keyword library of an application program is generated from its basic keywords and matching keywords, the keywords in the library are more relevant to the application program. This effectively lowers the probability that the application program appears in search results with low relevance to the search word entered by the user, raises the probability that it appears in search results with high relevance to that search word, and improves search accuracy.
With continued reference to Fig. 1, another embodiment of the present application provides an application program search system and method. Preferably, distribution server 10 may include:
A word-segmentation keyword extraction unit, configured to perform word segmentation on the name in the basic information of the application program and to use the segmentation result as basic keywords of the application program.
In an embodiment of the present invention, the basic information of an application program includes its name, for example "Ctrip Travel". Word segmentation can be applied directly to the name: segmenting "Ctrip Travel" yields "Ctrip" and "Travel", which can then be used as basic keywords of the application program "Ctrip Travel".
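The segmentation step can be illustrated with forward maximum matching against a vocabulary. This is only a sketch under assumptions: the text does not say which segmenter is used, the vocabulary and the function name `segment_name` are hypothetical, and the romanized name "xiechenglvxing" stands in for the original name.

```python
def segment_name(name, vocabulary):
    """Split `name` into words by forward maximum matching.

    At each position the longest vocabulary entry is taken; if none
    matches, a single character is emitted so the scan always advances.
    """
    max_len = max((len(word) for word in vocabulary), default=1)
    tokens = []
    i = 0
    while i < len(name):
        for size in range(min(max_len, len(name) - i), 0, -1):
            piece = name[i:i + size]
            if size == 1 or piece in vocabulary:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical vocabulary for the example name used in the text.
basic_keywords = segment_name("xiechenglvxing", {"xiecheng", "lvxing"})
```

A production system would more likely rely on a trained segmenter; the greedy scan is used here only because it is short and deterministic.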
And/or, distribution server 10 may include:
A pinyin keyword extraction unit, configured to convert the name in the basic information of the application program into a pinyin string and/or to convert the word segmentation result obtained from the name into pinyin strings, and to use the pinyin strings as basic keywords of the application program.
The name of the application program may be converted directly into pinyin, for example "xiechenglvxing", or its word segmentation result may be converted into pinyin, for example the pinyin of "Ctrip" is "xiecheng". These pinyin strings can then serve as basic keywords of the application program.
And/or, distribution server 10 may also include:
A label keyword extraction unit, configured to use the label words of the application program as basic keywords of the application program.
An application program may have preset label words. For example, the application program "Ctrip Travel" has the manually assigned label words "tourism", "train ticket", "travel guide", "air ticket", "trip", and "hotel"; these label words can be used as basic keywords of the application program.
Preferably, distribution server 10 may also include:
A first matching keyword acquiring unit, configured to obtain, from the search download records in the search history records of the search words and from the name and/or category in the basic information of the application program, search words matching the application program as matching keywords of the application program.
In practice, a user enters a search word in the terminal and searches; the user may or may not then click to download an application program. Distribution server 10 can record the search download behaviour for each search word. For example, user A searches for "wealth management" and downloads application program 1 from the search results page, while user B searches for "wealth management" and may download application program 2 from the search results page. By recording the search download behaviour of a large number of users, the search download records for each search word are obtained.
In a specific implementation, the search download records are stored in distribution server 10 in the form of a search download log.
In an embodiment of the present invention, search words can therefore be extracted from the search download log and, according to the relation between a search word and the name and/or category of the application program, related search words are used as matching keywords of the application program.
Preferably, the first matching keyword acquiring unit specifically includes:
A text similarity acquiring unit, configured to calculate, for each search word in the search download records, the text similarity between the search word and the name in the basic information of the application program; if the text similarity is greater than a first threshold, the search word is obtained as a matching keyword of the application program.
The embodiment of the present invention can extract every search word that has been used from the search download log and calculate the text similarity between the search word and the name of the application program, for example the cosine distance between the search word text and the application program name text.
The embodiment of the present invention can set a first threshold for the text similarity. If the text similarity is greater than the first threshold, the search word is obtained as a matching keyword of the application program; if the text similarity is less than the first threshold, the search word is ignored.
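A minimal sketch of the first-threshold filter follows. It assumes character-count vectors as the text representation, which the text does not specify, and it computes the cosine similarity (higher means more alike), matching the way the threshold is applied. The function names and the threshold value are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity of character-count vectors (representation assumed)."""
    vec_a, vec_b = Counter(text_a), Counter(text_b)
    dot = sum(count * vec_b[ch] for ch, count in vec_a.items())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def matching_keywords_by_name(search_words, app_name, first_threshold):
    """Keep search words whose similarity to the name exceeds the threshold."""
    return [word for word in search_words
            if cosine_similarity(word, app_name) > first_threshold]
```

Word-level or n-gram vectors would slot into the same structure; only the `Counter` construction would change.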
Preferably, the first matching keyword acquiring unit specifically includes:
A unique-visitor search word extraction unit, configured to judge, for each search word in the search download records, whether the unique-visitor download count of the search word is greater than a second threshold and whether the category of the search word and the category in the basic information of the application program belong to the same category; if the unique-visitor download count of the search word is greater than the second threshold and the category of the search word belongs to the same category as the category in the basic information of the application program, the search word is obtained as a matching keyword of the application program.
For a given search word in the search download log, several users may have downloaded application programs from the search results displayed for that search word on their terminals, and a terminal with a single IP may have downloaded several application programs or downloaded the same application program several times. To reduce the influence of a single IP's terminal on the download weight of a search word, the embodiment of the present invention counts the unique-visitor download count of each search word, i.e. UV (Unique Visitor) downloads: even if the terminal of one IP downloads many times, it contributes only once to the UV download count. In other words, for each search word, the count is the number of distinct IPs whose terminals downloaded an application program from the search results of that search word.
The embodiment of the present invention then sets a second threshold for the UV download count. If the UV download count of a search word is judged to be greater than the second threshold, it is further judged whether the category of the search word and the category in the basic information of the application program belong to the same category; if they do, the search word is used as a matching keyword of the application program. A search word whose UV download count is less than or equal to the second threshold, or whose category does not belong to the same category as the category in the basic information of the application program, can be ignored.
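The UV counting and the second-threshold test can be sketched as follows. The log layout (tuples of search word, IP, downloaded application program), the category lookup, and the function names are assumptions made for illustration.

```python
from collections import defaultdict


def uv_download_counts(download_log):
    """Count distinct IPs that downloaded something after each search word.

    `download_log` is an iterable of (search_word, ip, app) tuples;
    this layout is an assumption, not the format used by the system.
    """
    ips_per_word = defaultdict(set)
    for search_word, ip, _app in download_log:
        ips_per_word[search_word].add(ip)  # repeat downloads from one IP count once
    return {word: len(ips) for word, ips in ips_per_word.items()}


def matching_keywords_by_uv(download_log, word_category, app_category,
                            second_threshold):
    """Keep words with UV count above the threshold and the same category."""
    counts = uv_download_counts(download_log)
    return sorted(word for word, uv in counts.items()
                  if uv > second_threshold
                  and word_category.get(word) == app_category)
```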
Of course, application programs are classified in the embodiment of the present invention, and search words can be classified as well. The present invention places no restriction on the specific classification process. Distribution server 10 may, for example, classify application programs and search words using the following steps:
Sub-step A11: for the application programs under each first-level category, using the description information of each application program under that first-level category, a classifier assigns each application program to a second-level category under the corresponding first-level category.
Various categories are preset in distribution server 10, starting from first-level categories such as the games category and the sports category. The application programs under one first-level category can in fact be classified more finely according to their description information. In practice, a Bayes classifier can be used to classify the description information, so that each application program under a first-level category is assigned to one of its second-level categories.
Sub-step A12: for each search word, the second-level category corresponding to the search word is calculated from the click relation between the search word and each application program in the search history records and from the second-level category to which each application program belongs.
During a search, a user may click an application program on the search results page to view its details without downloading it, or may click to download the application program. Based on the click relation between search words and application programs, combined with the second-level categories assigned to the application programs in sub-step A11, the embodiment of the present invention can also assign each search word to the corresponding second-level category. The application programs thus also take part in this classification process.
For example, if the share of search word 1's clicks that fall on application programs in second-level category 1 is greater than a share threshold, the search word is grouped under second-level category 1.
The click relation between a search word and each application program may be the relation between the search word and click-to-view actions on each application program, the relation between the search word and click-to-download actions on each application program, or the relation between the search word and the combined click-to-view and click-to-download actions on each application program.
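Sub-step A12 can be sketched as a click-share computation. The input shapes (click counts per application program, a second-level category per application program), the function name, and the threshold value are assumptions for illustration.

```python
from collections import Counter


def secondary_categories_for_word(clicks, app_secondary_category,
                                  share_threshold):
    """Return the second-level categories whose click share exceeds the threshold.

    `clicks` maps app -> number of clicks received from one search word;
    `app_secondary_category` maps app -> second-level category (from A11).
    """
    per_category = Counter()
    for app, count in clicks.items():
        per_category[app_secondary_category[app]] += count
    total = sum(per_category.values())
    if total == 0:
        return []
    return sorted(category for category, count in per_category.items()
                  if count / total > share_threshold)
```

The counts may come from click-to-view actions, click-to-download actions, or both, as the text allows.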
And/or, distribution server 10 may also include:
A second matching keyword acquiring unit, configured to obtain, from the description information in the basic information of the application program and from the click relation between search words and application programs in the search history records of the search words, search words matching the application program as matching keywords of the application program.
The embodiment of the present invention can calculate the topic relevance between an application program and a search word from the description information of the application program and the click relation between search words and application programs in the search history records. When the topic relevance is greater than a topic threshold, the search word can be used as a matching keyword of the application program; otherwise the search word can be ignored.
Preferably, the second matching keyword acquiring unit specifically includes:
An application program topic distribution calculation unit, configured to calculate the topic distribution of each application program from the description information in its basic information by means of a topic model.
In the embodiment of the present invention, the description information of all application programs can be fed into the topic model as input, and the topic distribution of each application program is calculated.
In a specific implementation, because the description information of an application program is in effect a document, the topic model may be an LDA (Latent Dirichlet Allocation) model. The LDA model analyses each document and yields the topic distribution of each piece of description information, i.e. the probability of each topic. For example, if the probability of topic 1 is 0.6 and the probability of topic 2 is 0.4, the vector (0.6, 0.4) is obtained.
A search word topic distribution calculation unit, configured to calculate, for each search word, the topic distribution of the search word from the click relation between the search word and each application program in the search history records.
As described above, each search word has a click relation with the application programs, for example which application programs were clicked to view and/or clicked to download in the search results of a search word. It is thus possible to count which application programs each search word led to clicks on, how many clicks each application program received, and so on.
Because the application program topic distribution calculation unit has calculated the topic distribution of each application program, the topic distribution of a search word can be determined indirectly from the application programs clicked for it. For example, if the share of search word 1's clicks on application program 1 is 0.8 and the share of its clicks on application program 2 is 0.2, and the topic distribution of application program 1 is (0.6, 0.4) and that of application program 2 is (0.7, 0.3), then the topic distribution of the search word can be ((0.6+0.7)*0.8, (0.4+0.3)*0.2).
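One way to make the indirect derivation concrete is a click-share-weighted average of the application programs' topic distributions, which always yields a valid probability distribution. This is an assumed reading, not the expression printed in the text; the function and parameter names are hypothetical.

```python
def search_word_topic_distribution(click_share, app_topics):
    """Mix application topic distributions, weighted by click share.

    `click_share` maps app -> share of the search word's clicks (sums to 1);
    `app_topics` maps app -> topic distribution (same length for every app).
    """
    n_topics = len(next(iter(app_topics.values())))
    mixture = [0.0] * n_topics
    for app, share in click_share.items():
        for k, probability in enumerate(app_topics[app]):
            mixture[k] += share * probability
    return mixture


# Numbers taken from the example in the text.
topics = search_word_topic_distribution(
    {"app1": 0.8, "app2": 0.2},
    {"app1": (0.6, 0.4), "app2": (0.7, 0.3)},
)
```

With the shares and distributions from the example, the mixture is approximately (0.62, 0.38), which sums to 1 as a topic distribution should.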
A topic-similar search word extraction unit, configured to calculate, for search words whose search volume is greater than a third threshold, the topic similarity between the search word and the application program from the topic distribution of the search word and the topic distribution of the application program; if the topic similarity between the search word and the application program is greater than a topic threshold, the search word is obtained as a matching keyword of the application program.
In practice, some search words have small search volumes and others have large ones, and for an application program that is to be promoted, search words with large search volumes are easier to promote with. The present invention therefore counts the search volume of each search word in the search history records and presets a third threshold; only for search words whose search volume is greater than the third threshold is the topic similarity between the search word and the application program calculated from the topic distribution of the search word and the topic distribution of the application program.
In an embodiment of the present invention, the similarity between the topic distribution of a search word and the topic distribution of an application program can be calculated using the KL distance and/or the JS distance. The KL distance is the Kullback-Leibler divergence, also called relative entropy. For two probability distributions P and Q of a discrete random variable, their KL divergence is defined as:
$$D(P\|Q)=\sum_i P(i)\,\log\frac{P(i)}{Q(i)} \qquad \text{formula (1)}$$
where the logarithm is taken to base 2.
The JS distance is the Jensen-Shannon divergence, a refinement of the KL distance, with the formula:
$$JSD(P\|Q)=\tfrac{1}{2}\,D(P\|M)+\tfrac{1}{2}\,D(Q\|M) \qquad \text{formula (2)}$$
where
$$M=\tfrac{1}{2}\,(P+Q) \qquad \text{formula (3)}$$
and D is calculated using formula (1).
The JSD value lies between 0 and 1. A smaller value means the two topic distributions are more consistent, i.e. their similarity is higher.
In the present invention, the topic distribution of the search word and the topic distribution of the application program correspond to P and Q respectively. The embodiment of the present invention presets a topic threshold: if the topic similarity between a search word and the application program is greater than the topic threshold, the search word is obtained as a matching keyword of the application program; otherwise the search word is ignored.
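The KL and JS computations can be sketched with the standard definitions and base-2 logarithms. Turning the divergence into a similarity as 1 - JSD is an assumption made here so that a larger value means a closer match, which is how the topic threshold is applied; the function names are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(P||Q) with base-2 logarithms.

    Terms with P(i) == 0 contribute nothing. Q(i) must be positive
    wherever P(i) is positive, otherwise a ZeroDivisionError is raised.
    """
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: average KL divergence to the midpoint M."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def topic_similarity(p, q):
    """Similarity in [0, 1]; defined as 1 - JSD (an assumption)."""
    return 1.0 - js_divergence(p, q)


def is_matching_keyword(word_topics, app_topics, topic_threshold):
    """Apply the topic threshold to the similarity of two distributions."""
    return topic_similarity(word_topics, app_topics) > topic_threshold
```

Inside `js_divergence` the midpoint M is positive wherever P or Q is, so the division in `kl_divergence` is safe there even for distributions containing zeros.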
And/or, distribution server 10 may also include:
A third matching keyword acquiring unit, configured to obtain, from the category in the basic information of the application program and the category corresponding to each search word, search words matching the application program as matching keywords of the application program.
Preferably, the third matching keyword acquiring unit specifically includes:
An application program category subdivision unit, configured, for the application programs under each first-level category, to use the description information of each application program under that first-level category and a classifier to assign each application program to a second-level category under the corresponding first-level category;
A search word classification unit, configured to calculate, for each search word, the second-level category corresponding to the search word from the click relation between the search word and each application program in the search history records and from the second-level category to which each application program belongs.
The application program category subdivision unit and the search word classification unit work similarly to sub-steps A11 and A12 above. Just as search word 1 is grouped under second-level category 1 when the share of its clicks on application programs in second-level category 1 is greater than the share threshold, there are also cases where a search word's click share under some second-level category is very small, i.e. the probability that the search word belongs to that second-level category is small; the search word can then be removed from that second-level category.
After search words have been mapped to second-level categories, the search words with a small probability of belonging to a second-level category are deleted from it, and the remaining search words of that second-level category are assembled into a word bag, which is then used by the category search word extraction unit.
A category search word extraction unit, configured to obtain, according to the second-level category of the application program, the search words corresponding to that second-level category as matching keywords of the application program.
Because the application program category subdivision unit has calculated the second-level category of each application program, the second-level category of the application program is known, and the search word classification unit has determined the word bag of keywords for that second-level category; the words in that word bag can therefore be used as matching keywords of the application program.
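The word-bag construction can be sketched as below. The per-category click shares, the minimum share used to prune unlikely search words, and the function names are assumptions for illustration.

```python
def build_word_bags(word_category_share, min_share):
    """Group search words into one word bag per second-level category.

    `word_category_share` maps search word -> {second-level category: click
    share}. Words whose share in a category is below `min_share` are left
    out of that category's bag.
    """
    bags = {}
    for word, shares in word_category_share.items():
        for category, share in shares.items():
            if share >= min_share:
                bags.setdefault(category, set()).add(word)
    return bags


def matching_keywords_by_category(app_secondary_category, bags):
    """Use the word bag of the app's second-level category as its keywords."""
    return sorted(bags.get(app_secondary_category, set()))
```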
In an embodiment of the present invention, the first matching keyword acquiring unit, the second matching keyword acquiring unit, and the third matching keyword acquiring unit each perform their calculation over all search words to obtain matching search words. The three units may be used alone, several of them may be selected and used together, or all of them may be used. The present invention places no restriction on this.
Specifically, after obtaining the basic keywords and the matching keywords through the above units, distribution server 10 generates the keyword library of the application program from the basic keywords and the matching keywords.
In an embodiment of the present invention, the basic keywords and matching keywords obtained in these various ways can first be normalized so that identical keywords are merged; once the most concise set of keywords has been obtained, the keyword library of the application program is generated from it.
Distribution server 10 can perform the above operations on each application program in advance, so that each application program has its own corresponding keyword library.
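Generating the keyword library can be sketched as normalize-then-merge. What normalization involves is not specified in the text; trimming whitespace and lower-casing are assumptions, as is the function name.

```python
def build_keyword_library(basic_keywords, matching_keywords):
    """Merge basic and matching keywords into a duplicate-free library.

    Normalization here (strip, collapse inner whitespace, lower-case) is
    an illustrative assumption. First-seen order is preserved.
    """
    library = []
    seen = set()
    for keyword in list(basic_keywords) + list(matching_keywords):
        normalized = " ".join(keyword.split()).lower()
        if normalized and normalized not in seen:
            seen.add(normalized)
            library.append(normalized)
    return library
```

Running this once per application program ahead of time gives each application program its own library to match against at query time.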
User terminal 20 obtains the input search keyword and sends it to distribution server 10. User terminal 20 specifically includes a search keyword acquiring unit, which obtains the search keyword from the user's input information; the search keyword is then sent to distribution server 10 through the application program distribution application.
In practical use, after the application program distribution application is opened on user terminal 20 and the input information entered by the user through an input unit such as a virtual keyboard or a physical keyboard is received, the search keyword is obtained directly from the input information; for example, if the user's input information is displayed as axxx, the search keyword is determined to be axxx.
After receiving the search keyword sent by user terminal 20, distribution server 10 matches the received search keyword against the keyword library of each application program; based on the matching result, it obtains the application programs corresponding to the search keyword and feeds them back to user terminal 20, so that the application programs corresponding to the search keyword are displayed on user terminal 20.
In a specific implementation, an application program acquiring unit may be provided in distribution server 10. For each application program, when the matching result indicates that the keyword library of the application program contains a keyword matching the search keyword, the unit determines that the application program corresponds to the search keyword, thereby obtaining the application programs corresponding to the search keyword. In this way, the search keyword is matched against the keyword library of each application program, and the application programs corresponding to the search keyword are obtained from the matching result. When multiple application programs correspond to the search keyword, they are sorted by the relevance between the search keyword and each application program.
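The matching and sorting step can be sketched as a lookup over the per-application keyword libraries followed by a sort on relevance. Exact-membership matching, the `relevance` callable, and the function name are assumptions; the text does not specify how relevance is scored.

```python
def search_applications(search_keyword, keyword_libraries, relevance):
    """Return matching application programs, most relevant first.

    `keyword_libraries` maps app -> collection of keywords;
    `relevance(search_keyword, app)` is a hypothetical scoring callable.
    """
    hits = [app for app, library in keyword_libraries.items()
            if search_keyword in library]
    return sorted(hits, key=lambda app: relevance(search_keyword, app),
                  reverse=True)
```

The returned list is what would be fed back to the user terminal for display.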
Based on the same technical concept as the above system, an embodiment of the present application further provides an application program search method. Referring to Fig. 2, the method includes:
S201: The distribution server obtains the basic keywords of an application program from the basic information of the application program; obtains, from the historical search records of the search words and the basic information of the application program, search words matching the application program as matching keywords of the application program; and generates the keyword library of the application program from the basic keywords and the matching keywords;
S202: The user terminal obtains the input search keyword and sends the search keyword to the distribution server;
S203: The distribution server receives the search keyword and matches it against the keyword library of each application program; based on the matching result, it obtains the application programs corresponding to the search keyword and feeds them back to the user terminal, so that the application programs corresponding to the search keyword are displayed on the user terminal.
Specifically, obtaining, from the historical search records of the search words and the basic information of the application program, search words matching the application program as matching keywords of the application program specifically includes:
Obtaining, from the search download records in the search history records of the search words and from the name and/or category in the basic information of the application program, search words matching the application program as matching keywords of the application program.
Specifically, obtaining, from the historical search records of the search words and the basic information of the application program, search words matching the application program as matching keywords of the application program specifically includes:
Obtaining, from the description information in the basic information of the application program and from the click relation between search words and application programs in the search history records of the search words, search words matching the application program as matching keywords of the application program.
Specifically, obtaining, from the historical search records of the search words and the basic information of the application program, search words matching the application program as matching keywords of the application program specifically includes:
Obtaining, from the category in the basic information of the application program and the category corresponding to each search word, search words matching the application program as matching keywords of the application program.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag
Include:
For each search word in search Download History, for the name in the Back ground Information for calculating search word and application program
Text similarity between referred to as;If the text similarity is more than first threshold, the search word is obtained as application journey
The matching keywords of sequence.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag
Include:
For each search word in search Download History, judge whether the independent access download time of the search word is more than
Whether Second Threshold, and the classification of the search word belongs to same classification with the classification in the Back ground Information of application program;
If the independent access download time of the search word is more than the Second Threshold, and the search word classification and application journey
Classification in the Back ground Information of sequence belongs to same classification, then obtain matching keywords of the search word as application program.
Specifically, matching keywords of the search word of the acquisition and application matches as application program, specific bag
Include:
For the description information in the Back ground Information of each application program, the theme of application program is calculated by topic model
Distribution;
To each search word, according to search word in search history record and the click relation of each application program, search is calculated
The theme distribution of word;
Search word for volumes of searches more than the 3rd threshold value, the master of theme distribution and application program according to the search word
Topic distribution, calculates the Topic Similarity between the search word and application program;If between the search word and application program
Topic Similarity be more than theme threshold value, then obtain matching keywords of the search word as application program.
Specifically, obtaining the search words that match the application program as matching keywords of the application program includes:
For the application programs under each first-level category, using a classifier and the description information of each application program under that first-level category to assign each application program to a second-level category under the corresponding first-level category;
For each search word, calculating the second-level category corresponding to the search word according to the click relation between the search word and each application program in the search history records and the second-level category to which each application program belongs;
According to the second-level category to which the application program belongs, taking each search word corresponding to that second-level category as a matching keyword of the application program.
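The category-based step can be sketched as below. The classifier itself is out of scope here: its output (the second-level category of each application program) is taken as given. How a search word's second-level category is calculated from clicks is not specified in the text, so this sketch assumes the category that receives the most clicks wins. All identifiers and data are hypothetical.

```python
from collections import Counter


def search_word_secondary_category(clicks, app_secondary):
    """Pick the second-level category that received the most clicks (assumption)."""
    votes = Counter()
    for app_id, count in clicks.items():
        votes[app_secondary[app_id]] += count
    return votes.most_common(1)[0][0] if votes else None


def category_keywords(query_clicks, app_secondary):
    """Return {app_id: set of search words mapped to the app's second-level category}."""
    by_category = {}
    for word, clicks in query_clicks.items():
        category = search_word_secondary_category(clicks, app_secondary)
        if category is not None:
            by_category.setdefault(category, set()).add(word)
    return {app_id: set(by_category.get(category, set()))
            for app_id, category in app_secondary.items()}


# Hypothetical classifier output and click log.
app_secondary = {
    "app_taxi": "travel/ride-hailing",
    "app_cab2": "travel/ride-hailing",
    "app_map": "travel/navigation",
}
secondary_clicks = {
    "cab": {"app_taxi": 8, "app_map": 2},
    "route planner": {"app_map": 5},
}
secondary_matches = category_keywords(secondary_clicks, app_secondary)
```

Note that `app_cab2` receives "cab" even though it was never clicked for that search word, because it shares the second-level category.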
Specifically, obtaining the basic keywords of the application program according to the basic information of the application program includes:
Performing word segmentation on the name in the basic information of the application program, and taking the segmentation result as basic keywords of the application program.
Specifically, obtaining the basic keywords of the application program according to the basic information of the application program includes:
Converting the name in the basic information of the application program into a pinyin string, and/or converting the segmentation result obtained by segmenting the name into pinyin strings, and taking the pinyin strings as basic keywords of the application program.
Specifically, obtaining the basic keywords of the application program according to the basic information of the application program includes:
Taking the tag words of the application program as basic keywords of the application program.
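The three sources of basic keywords (segmented name, pinyin strings, tag words) can be combined as in the sketch below. It is only illustrative: the segmenter is a simple forward maximum-matching stand-in, and the `PINYIN` table is a hypothetical lookup that covers just the characters of this example. A real system would rely on a full segmenter and pinyin dictionary, which the text does not name.

```python
# Hypothetical character-to-pinyin table covering only this example.
PINYIN = {"地": "di", "图": "tu", "导": "dao", "航": "hang"}


def segment(name, vocabulary):
    """Greedy forward maximum matching against a small vocabulary (stand-in segmenter)."""
    words, i = [], 0
    while i < len(name):
        # Try the longest candidate first; a single character always matches.
        for j in range(len(name), i, -1):
            if name[i:j] in vocabulary or j == i + 1:
                words.append(name[i:j])
                i = j
                break
    return words


def to_pinyin(text):
    """Convert known characters to pinyin and leave unknown characters unchanged."""
    return "".join(PINYIN.get(ch, ch) for ch in text)


def basic_keywords(name, tags, vocabulary):
    """Union of segmented words, pinyin of the name, pinyin of each word, and tags."""
    words = segment(name, vocabulary)
    keywords = set(words)
    keywords.add(to_pinyin(name))
    keywords.update(to_pinyin(w) for w in words)
    keywords.update(tags)
    return keywords


basic = basic_keywords("地图导航", ["navigation"], {"地图", "导航"})
```

Including the pinyin forms lets a user who types a romanized name still reach the application program.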
Specifically, obtaining the application program corresponding to the search keyword according to the matching result includes:
For each application program, when the matching result indicates that the keyword library of the application program contains a keyword that matches the search keyword, determining that the application program corresponds to the search keyword, thereby obtaining the application program corresponding to the search keyword.
Specifically, obtaining the input search keyword includes:
Obtaining the search keyword according to the input information of the user.
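The matching step can be sketched as follows. The text does not define what counts as a match, so this sketch assumes exact equality after trimming whitespace and lowercasing. The function name and sample data are hypothetical.

```python
def find_apps(search_keyword, keyword_libraries):
    """Return the ids of apps whose keyword library contains the search keyword."""
    needle = search_keyword.strip().lower()
    return sorted(
        app_id
        for app_id, keywords in keyword_libraries.items()
        if needle in {k.lower() for k in keywords}
    )


# Hypothetical keyword libraries, one per application program.
libraries = {
    "app_taxi": {"cab", "Ride Hailing"},
    "app_map": {"navigation", "cab"},
    "app_game": {"puzzle"},
}
hits = find_apps(" Cab ", libraries)
```

Sorting the result is only for a deterministic order in the example; ranking of the returned application programs is not addressed here.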
The technical solutions in the above embodiments of the present application have at least the following technical effects or advantages:
According to the application program search system and method of the present invention, the distribution server obtains the basic keywords of an application program according to the basic information of the application program; obtains, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program; and generates the keyword library of the application program from the basic keywords and the matching keywords. The user terminal obtains the input search keyword and sends it to the distribution server. The distribution server matches the received search keyword against the keyword library of each application program and, according to the matching result, obtains the application program corresponding to the search keyword and feeds it back to the user terminal, so that the application program corresponding to the search keyword is displayed on the user terminal. Because the keyword library of an application program is generated from the basic keywords and the matching keywords of that application program, the keywords in the library are more relevant to the application program. This solves two problems: application developers having to select index keywords for an application program through cumbersome operations, and incorrectly selected index keywords making the application program likely to appear in search results that have little relevance to the search word entered by the user. Index keywords are selected for the application program automatically through its keyword library, which reduces the developer's work in selecting index keywords and effectively increases the probability that the application program appears in search results that are highly relevant to the search word entered by the user.
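As a rough illustration of how the keyword library is generated from the two keyword sources, the sketch below takes a per-application union. The function name and data are hypothetical; the text only states that the library is generated from the basic keywords and the matching keywords.

```python
def build_keyword_library(basic_keywords, matching_keywords):
    """Union the two keyword sources per application; sets remove duplicates."""
    app_ids = set(basic_keywords) | set(matching_keywords)
    return {
        app_id: set(basic_keywords.get(app_id, ()))
        | set(matching_keywords.get(app_id, ()))
        for app_id in app_ids
    }


# Hypothetical inputs: basic keywords from app metadata,
# matching keywords mined from search history.
basic_kw = {"app_taxi": {"taxi", "ditu"}, "app_game": {"puzzle"}}
matching_kw = {"app_taxi": {"cab", "taxi"}}
library = build_keyword_library(basic_kw, matching_kw)
```

Because the library is derived entirely from metadata and search logs, no keyword has to be chosen by hand by the developer.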
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, etc.) that contain computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operation steps is performed on the computer or other programmable device to produce computer-implemented processing, such that the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although preferred embodiments of the present invention have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn of the basic inventive concept. Therefore, the appended claims are intended to be construed as including the preferred embodiments and all changes and modifications that fall within the scope of the present invention.
Obviously, those skilled in the art can make various changes and variations to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to include them.
The present invention discloses A1, an application program search system, characterized in that the system includes:
A distribution server, configured to obtain the basic keywords of an application program according to the basic information of the application program; to obtain, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program; and to generate the keyword library of the application program from the basic keywords and the matching keywords;
A user terminal, configured to obtain the input search keyword and send the search keyword to the distribution server;
The distribution server is further configured to match the received search keyword against the keyword library of each application program, and, according to the matching result, to obtain the application program corresponding to the search keyword and feed it back to the user terminal, so that the application program corresponding to the search keyword is displayed on the user terminal.
A2, the system as described in A1, characterized in that the distribution server includes:
A first matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the search-download records in the search history records of each search word and the name and/or category in the basic information of the application program.
A3, the system as described in A1, characterized in that the distribution server includes:
A second matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the description information in the basic information of the application program and the click relation between the search word and each application program in the search history records of each search word.
A4, the system as described in A1, characterized in that the distribution server includes:
A third matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the category in the basic information of the application program and the category corresponding to each search word.
A5, the system as described in A2, characterized in that the first matching keyword acquiring unit includes:
A text similarity acquiring unit, configured to calculate, for each search word in the search-download records, the text similarity between the search word and the name in the basic information of the application program; and, if the text similarity is greater than the first threshold, to take the search word as a matching keyword of the application program.
A6, the system as described in A2, characterized in that the first matching keyword acquiring unit includes:
A unique-visitor search word extraction unit, configured to judge, for each search word in the search-download records, whether the unique-visitor download count of the search word is greater than the second threshold and whether the category of the search word is the same as the category in the basic information of the application program; and, if the unique-visitor download count of the search word is greater than the second threshold and the category of the search word is the same as the category in the basic information of the application program, to take the search word as a matching keyword of the application program.
A7, the system as described in A3, characterized in that the second matching keyword acquiring unit includes:
An application program topic distribution computing unit, configured to calculate, for the description information in the basic information of each application program, the topic distribution of the application program with a topic model;
A search word topic distribution computing unit, configured to calculate, for each search word, the topic distribution of the search word according to the click relation between the search word and each application program in the search history records;
A topic-similar word extraction unit, configured to calculate, for each search word whose search volume is greater than the third threshold, the topic similarity between the search word and the application program from the topic distribution of the search word and the topic distribution of the application program; and, if the topic similarity between the search word and the application program is greater than the topic threshold, to take the search word as a matching keyword of the application program.
A8, the system as described in A4, characterized in that the third matching keyword acquiring unit includes:
An application program category subdivision unit, configured, for the application programs under each first-level category, to use a classifier and the description information of each application program under that first-level category to assign each application program to a second-level category under the corresponding first-level category;
A search word classification unit, configured to calculate, for each search word, the second-level category corresponding to the search word according to the click relation between the search word and each application program in the search history records and the second-level category to which each application program belongs;
A category search word extraction subunit, configured to take, according to the second-level category to which the application program belongs, each search word corresponding to that second-level category as a matching keyword of the application program.
A9, the system as described in A1, characterized in that the distribution server includes:
A word segmentation keyword extraction unit, configured to perform word segmentation on the name in the basic information of the application program and to take the segmentation result as basic keywords of the application program.
A10, the system as described in A1, characterized in that the distribution server includes:
A pinyin keyword extraction unit, configured to convert the name in the basic information of the application program into a pinyin string, and/or to convert the segmentation result obtained by segmenting the name into pinyin strings, and to take the pinyin strings as basic keywords of the application program.
A11, the system as described in A1, characterized in that the distribution server further includes:
A tag keyword extraction unit, configured to take the tag words of the application program as basic keywords of the application program.
A12, the system as described in A1, characterized in that the distribution server further includes:
An application program acquiring unit, configured, for each application program, to determine that the application program corresponds to the search keyword when the matching result indicates that the keyword library of the application program contains a keyword that matches the search keyword, thereby obtaining the application program corresponding to the search keyword.
A13, the system as described in A1, characterized in that the user terminal includes:
A search keyword acquiring unit, configured to obtain the search keyword according to the input information of the user.
B14, an application program search method, characterized in that the method includes:
Obtaining, by a distribution server, the basic keywords of an application program according to the basic information of the application program; obtaining, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program; and generating the keyword library of the application program from the basic keywords and the matching keywords;
Obtaining, by a user terminal, the input search keyword, and sending the search keyword to the distribution server;
Matching, by the distribution server, the received search keyword against the keyword library of each application program; and, according to the matching result, obtaining the application program corresponding to the search keyword and feeding it back to the user terminal, so that the application program corresponding to the search keyword is displayed on the user terminal.
B15, the method as described in B14, characterized in that obtaining, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program includes:
Obtaining the search words that match the application program as matching keywords of the application program, according to the search-download records in the search history records of each search word and the name and/or category in the basic information of the application program.
B16, the method as described in B14, characterized in that obtaining, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program includes:
Obtaining the search words that match the application program as matching keywords of the application program, according to the description information in the basic information of the application program and the click relation between the search word and each application program in the search history records of each search word.
B17, the method as described in B14, characterized in that obtaining, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program includes:
Obtaining the search words that match the application program as matching keywords of the application program, according to the category in the basic information of the application program and the category corresponding to each search word.
B18, the method as described in B15, characterized in that obtaining the search words that match the application program as matching keywords of the application program includes:
For each search word in the search-download records, calculating the text similarity between the search word and the name in the basic information of the application program; if the text similarity is greater than the first threshold, taking the search word as a matching keyword of the application program.
B19, the method as described in B15, characterized in that obtaining the search words that match the application program as matching keywords of the application program includes:
For each search word in the search-download records, judging whether the unique-visitor download count of the search word is greater than the second threshold, and whether the category of the search word is the same as the category in the basic information of the application program; if the unique-visitor download count of the search word is greater than the second threshold and the category of the search word is the same as the category in the basic information of the application program, taking the search word as a matching keyword of the application program.
B20, the method as described in B16, characterized in that obtaining the search words that match the application program as matching keywords of the application program includes:
For the description information in the basic information of each application program, calculating the topic distribution of the application program with a topic model;
For each search word, calculating the topic distribution of the search word according to the click relation between the search word and each application program in the search history records;
For each search word whose search volume is greater than the third threshold, calculating the topic similarity between the search word and the application program from the topic distribution of the search word and the topic distribution of the application program; if the topic similarity between the search word and the application program is greater than the topic threshold, taking the search word as a matching keyword of the application program.
B21, the method as described in B17, characterized in that obtaining the search words that match the application program as matching keywords of the application program includes:
For the application programs under each first-level category, using a classifier and the description information of each application program under that first-level category to assign each application program to a second-level category under the corresponding first-level category;
For each search word, calculating the second-level category corresponding to the search word according to the click relation between the search word and each application program in the search history records and the second-level category to which each application program belongs;
According to the second-level category to which the application program belongs, taking each search word corresponding to that second-level category as a matching keyword of the application program.
B22, the method as described in B14, characterized in that obtaining the basic keywords of the application program according to the basic information of the application program includes:
Performing word segmentation on the name in the basic information of the application program, and taking the segmentation result as basic keywords of the application program.
B23, the method as described in B14, characterized in that obtaining the basic keywords of the application program according to the basic information of the application program includes:
Converting the name in the basic information of the application program into a pinyin string, and/or converting the segmentation result obtained by segmenting the name into pinyin strings, and taking the pinyin strings as basic keywords of the application program.
B24, the method as described in B14, characterized in that obtaining the basic keywords of the application program according to the basic information of the application program includes:
Taking the tag words of the application program as basic keywords of the application program.
B25, the method as described in B14, characterized in that obtaining the application program corresponding to the search keyword according to the matching result includes:
For each application program, when the matching result indicates that the keyword library of the application program contains a keyword that matches the search keyword, determining that the application program corresponds to the search keyword, thereby obtaining the application program corresponding to the search keyword.
B26, the method as described in B14, characterized in that obtaining the input search keyword includes:
Obtaining the search keyword according to the input information of the user.
Claims (10)
1. An application program search system, characterized in that the system includes:
A distribution server, configured to obtain the basic keywords of an application program according to the basic information of the application program; to obtain, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program; and to generate the keyword library of the application program from the basic keywords and the matching keywords;
A user terminal, configured to obtain the input search keyword and send the search keyword to the distribution server;
The distribution server is further configured to match the received search keyword against the keyword library of each application program, and, according to the matching result, to obtain the application program corresponding to the search keyword and feed it back to the user terminal, so that the application program corresponding to the search keyword is displayed on the user terminal.
2. The system as claimed in claim 1, characterized in that the distribution server includes:
A first matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the search-download records in the search history records of each search word and the name and/or category in the basic information of the application program.
3. The system as claimed in claim 1, characterized in that the distribution server includes:
A second matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the description information in the basic information of the application program and the click relation between the search word and each application program in the search history records of each search word.
4. The system as claimed in claim 1, characterized in that the distribution server includes:
A third matching keyword acquiring unit, configured to obtain the search words that match the application program as matching keywords of the application program, according to the category in the basic information of the application program and the category corresponding to each search word.
5. The system as claimed in claim 2, characterized in that the first matching keyword acquiring unit includes:
A text similarity acquiring unit, configured to calculate, for each search word in the search-download records, the text similarity between the search word and the name in the basic information of the application program; and, if the text similarity is greater than the first threshold, to take the search word as a matching keyword of the application program.
6. The system as claimed in claim 2, characterized in that the first matching keyword acquiring unit includes:
A unique-visitor search word extraction unit, configured to judge, for each search word in the search-download records, whether the unique-visitor download count of the search word is greater than the second threshold and whether the category of the search word is the same as the category in the basic information of the application program; and, if the unique-visitor download count of the search word is greater than the second threshold and the category of the search word is the same as the category in the basic information of the application program, to take the search word as a matching keyword of the application program.
7. The system as claimed in claim 3, characterized in that the second matching keyword acquiring unit includes:
An application program topic distribution computing unit, configured to calculate, for the description information in the basic information of each application program, the topic distribution of the application program with a topic model;
A search word topic distribution computing unit, configured to calculate, for each search word, the topic distribution of the search word according to the click relation between the search word and each application program in the search history records;
A topic-similar word extraction unit, configured to calculate, for each search word whose search volume is greater than the third threshold, the topic similarity between the search word and the application program from the topic distribution of the search word and the topic distribution of the application program; and, if the topic similarity between the search word and the application program is greater than the topic threshold, to take the search word as a matching keyword of the application program.
8. The system as claimed in claim 4, characterized in that the third matching keyword acquiring unit includes:
An application program category subdivision unit, configured, for the application programs under each first-level category, to use a classifier and the description information of each application program under that first-level category to assign each application program to a second-level category under the corresponding first-level category;
A search word classification unit, configured to calculate, for each search word, the second-level category corresponding to the search word according to the click relation between the search word and each application program in the search history records and the second-level category to which each application program belongs;
A category search word extraction subunit, configured to take, according to the second-level category to which the application program belongs, each search word corresponding to that second-level category as a matching keyword of the application program.
9. The system as claimed in claim 1, characterized in that the distribution server includes:
A word segmentation keyword extraction unit, configured to perform word segmentation on the name in the basic information of the application program and to take the segmentation result as basic keywords of the application program.
10. An application program search method, characterized in that the method includes:
Obtaining, by a distribution server, the basic keywords of an application program according to the basic information of the application program; obtaining, according to the historical search records of each search word and the basic information of the application program, the search words that match the application program as matching keywords of the application program; and generating the keyword library of the application program from the basic keywords and the matching keywords;
Obtaining, by a user terminal, the input search keyword, and sending the search keyword to the distribution server;
Matching, by the distribution server, the received search keyword against the keyword library of each application program; and, according to the matching result, obtaining the application program corresponding to the search keyword and feeding it back to the user terminal, so that the application program corresponding to the search keyword is displayed on the user terminal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510993113.0A CN106919588A (en) | 2015-12-24 | 2015-12-24 | A kind of application program search system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510993113.0A CN106919588A (en) | 2015-12-24 | 2015-12-24 | A kind of application program search system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106919588A true CN106919588A (en) | 2017-07-04 |
Family
ID=59460223
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510993113.0A Pending CN106919588A (en) | 2015-12-24 | 2015-12-24 | A kind of application program search system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106919588A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107622090A (en) * | 2017-08-22 | 2018-01-23 | 上海艾融软件股份有限公司 | Acquisition methods, the apparatus and system of object |
CN107767172A (en) * | 2017-10-12 | 2018-03-06 | 百度在线网络技术(北京)有限公司 | Information-pushing method, device, server and medium |
CN108920652A (en) * | 2018-07-03 | 2018-11-30 | 佛山市影腾科技有限公司 | A kind of searching method, device and terminal |
CN110196833A (en) * | 2018-03-22 | 2019-09-03 | 腾讯科技(深圳)有限公司 | Searching method, device, terminal and the storage medium of application program |
CN110704729A (en) * | 2019-09-09 | 2020-01-17 | 上海博泰悦臻网络技术服务有限公司 | Application search method and cloud server |
CN111488510A (en) * | 2020-04-17 | 2020-08-04 | 支付宝(杭州)信息技术有限公司 | Method and device for determining related words of small program, processing equipment and search system |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1389811A (en) * | 2002-02-06 | 2003-01-08 | 北京造极人工智能技术有限公司 | Intelligent search method of search engine |
CN101179472A (en) * | 2007-05-31 | 2008-05-14 | 腾讯科技(深圳)有限公司 | Network resource searching method and searching system |
US20110219015A1 (en) * | 2008-08-28 | 2011-09-08 | Nhn Business Platform Corporation | Searching method using extended keyword pool and system thereof |
CN102236711A (en) * | 2011-06-30 | 2011-11-09 | 百度在线网络技术(北京)有限公司 | Method and equipment for determining displayed information corresponding to promotion keyword |
CN102737045A (en) * | 2011-04-08 | 2012-10-17 | 北京百度网讯科技有限公司 | Method and device for relevancy computation |
CN103914552A (en) * | 2014-04-14 | 2014-07-09 | 百度在线网络技术(北京)有限公司 | Method and device for retrieving applications |
CN105095187A (en) * | 2015-08-07 | 2015-11-25 | 广州神马移动信息科技有限公司 | Search intention identification method and device |
CN105117479A (en) * | 2015-09-11 | 2015-12-02 | 北京金山安全软件有限公司 | Acquisition method and processing method of user search behavior information and electronic equipment |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107622090A (en) * | 2017-08-22 | 2018-01-23 | 上海艾融软件股份有限公司 | Acquisition methods, the apparatus and system of object |
CN107622090B (en) * | 2017-08-22 | 2020-10-16 | 上海艾融软件股份有限公司 | Object acquisition method, device and system |
CN107767172A (en) * | 2017-10-12 | 2018-03-06 | 百度在线网络技术(北京)有限公司 | Information-pushing method, device, server and medium |
CN110196833A (en) * | 2018-03-22 | 2019-09-03 | 腾讯科技(深圳)有限公司 | Searching method, device, terminal and the storage medium of application program |
CN110196833B (en) * | 2018-03-22 | 2023-06-09 | 腾讯科技(深圳)有限公司 | Application searching method, device, terminal and storage medium |
CN108920652A (en) * | 2018-07-03 | 2018-11-30 | 佛山市影腾科技有限公司 | A kind of searching method, device and terminal |
CN110704729A (en) * | 2019-09-09 | 2020-01-17 | 上海博泰悦臻网络技术服务有限公司 | Application search method and cloud server |
CN111488510A (en) * | 2020-04-17 | 2020-08-04 | 支付宝(杭州)信息技术有限公司 | Method and device for determining related words of small program, processing equipment and search system |
CN111488510B (en) * | 2020-04-17 | 2023-09-29 | 支付宝(杭州)信息技术有限公司 | Method and device for determining related words of applet, processing equipment and search system |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN106919575B (en) | Application program searching method and device | |
CN106919588A (en) | A kind of application program search system and method | |
CN106709040B (en) | Application search method and server | |
US20190114668A1 (en) | Application recommendation method and server | |
CN104111933B (en) | Obtain business object label, set up the method and device of training pattern | |
CN102982153B (en) | A kind of information retrieval method and device thereof | |
CN106445963B (en) | Advertisement index keyword automatic generation method and device of APP platform | |
CN110532451A (en) | Search method and device for policy text, storage medium, electronic device | |
CN105653562B (en) | The calculation method and device of correlation between a kind of content of text and inquiry request | |
CN109299344A (en) | The generation method of order models, the sort method of search result, device and equipment | |
CN105095187A (en) | Search intention identification method and device | |
CN106982256A (en) | Information-pushing method, device, equipment and storage medium | |
CN104951468A (en) | Data searching and processing method and system | |
CN105023165A (en) | Method, device and system for controlling release tasks in social networking platform | |
CN106294783A (en) | A kind of video recommendation method and device | |
CN106415537A (en) | Inserting native application search results into web search results | |
CN108319376B (en) | Input association recommendation method and device for optimizing commercial word promotion | |
CN109409928A (en) | A kind of material recommended method, device, storage medium, terminal | |
US11144594B2 (en) | Search method, search apparatus and non-temporary computer-readable storage medium for text search | |
CN104778283B (en) | A kind of user's occupational classification method and system based on microblogging | |
CN107818491A (en) | Electronic installation, Products Show method and storage medium based on user's Internet data | |
CN108304490A (en) | Text-based similarity determination method, apparatus and computer equipment | |
CN107273391A (en) | Document recommendation method and apparatus | |
CN106445954A (en) | Business object display method and apparatus | |
CN113570413A (en) | Method and device for generating advertisement keywords, storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170704 |