Summary of the Invention
The purpose of the embodiments of the present application is to propose an improved information pushing method and device that solve the technical problems mentioned in the Background section above.
In a first aspect, an embodiment of the present application provides an information pushing method. The method includes: extracting target search data from a log file of a search engine; parsing the target search data to generate a target search word set; for each target search word in the target search word set, extracting from the log file the search behavior information that matches the target search word, and parsing the search behavior information to generate a heat value of the target search word; and generating information to be pushed based on the heat value of each target search word and/or the search behavior information, and pushing the information to be pushed to a client.
In some embodiments, extracting the target search data from the log file of the search engine includes: determining the search data in the log file that meets a first preset condition to be interference search data, and deleting the interference search data; and obtaining a preset keyword set and, for each preset keyword in the preset keyword set, extracting the search data associated with the preset keyword from the log file from which the interference search data has been deleted, and determining the extracted search data to be the target search data.
In some embodiments, parsing the target search data to generate the target search word set includes: performing semantic analysis on the target search data to extract at least one target search word; and clustering the at least one target search word to generate the target search word set.
In some embodiments, after parsing the target search data to generate the target search word set, the method further includes: obtaining a preset historical target search word set; for each target search word in the target search word set, determining whether a historical target search word matching the target search word exists in the historical target search word set and, if not, determining the target search word to be a newly added hotspot concept word; and pushing prompt information containing the determined newly added hotspot concept words to the client.
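The comparison against the historical target search word set can be sketched as a simple set lookup. This is only an illustration: the function name `find_new_hotspot_words` is hypothetical, and "matching" is assumed here to mean case-insensitive equality, which the text does not specify.

```python
def find_new_hotspot_words(target_words, history_words):
    """Return the target search words that have no match in the historical set.

    Assumption: two words "match" when they are equal ignoring case.
    """
    history = {w.casefold() for w in history_words}
    return [w for w in target_words if w.casefold() not in history]


# Example data is made up for illustration.
new_words = find_new_hotspot_words(
    ["state-owned enterprise reform", "Blockchain"],
    ["blockchain", "new energy"],
)
```

A real system would probably also match synonyms and abbreviations, as the clustering step suggests.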
In some embodiments, the search behavior information matching each target search word includes at least one of the following: the time at which the search behavior occurred, user information of the user who performed the search behavior, and geographic location information of the user.
In some embodiments, for each target search word in the target search word set, extracting from the log file the search behavior information matching the target search word and parsing the search behavior information to generate the heat value of the target search word includes: for each target search word in the target search word set, extracting from the log file the search behavior information matching the target search word; parsing the search behavior information to determine the daily number of searches that match the target search word and meet a second preset condition within a preset number of days; and generating the heat value of the target search word based on the determined daily numbers of searches and the preset number of days.
In some embodiments, generating the information to be pushed based on the heat value of each target search word and/or the search behavior information, and pushing the information to be pushed to the client, includes: determining each target search word whose heat value is greater than a preset value to be a rising-heat hotspot concept word, and pushing information to be pushed that contains the determined rising-heat hotspot concept words to the client.
In some embodiments, generating the information to be pushed based on the heat value of each target search word and/or the search behavior information, and pushing the information to be pushed to the client, includes: sorting the target search words in descending order of heat value; and pushing information to be pushed that contains the target search words sorted in descending order of heat value to the client.
In some embodiments, generating the information to be pushed based on the heat value of each target search word and/or the search behavior information, and pushing the information to be pushed to the client, includes: parsing each piece of search behavior information to determine the total number of searches for each target search word within a preset time period; sorting the target search words in descending order of total number of searches; and pushing information to be pushed that contains the target search words sorted in descending order of number of searches to the client.
In a second aspect, an embodiment of the present application provides an information pushing device. The device includes: an extraction unit configured to extract target search data from a log file of a search engine; a parsing unit configured to parse the target search data to generate a target search word set; a generation unit configured to, for each target search word in the target search word set, extract from the log file the search behavior information matching the target search word, and parse the search behavior information to generate a heat value of the target search word; and a first push unit configured to generate information to be pushed based on the heat value of each target search word and/or the search behavior information, and push the information to be pushed to a client.
In some embodiments, the extraction unit includes: a deletion module configured to determine the search data in the log file of the search engine that meets a first preset condition to be interference search data, and delete the interference search data; and a determination module configured to obtain a preset keyword set and, for each preset keyword in the preset keyword set, extract the search data associated with the preset keyword from the log file from which the interference search data has been deleted, and determine the extracted search data to be the target search data.
In some embodiments, the parsing unit includes: an analysis module configured to perform semantic analysis on the target search data to extract at least one target search word; and a clustering module configured to cluster the at least one target search word to generate the target search word set.
In some embodiments, the device further includes: an acquisition unit configured to obtain a preset historical target search word set; a determination unit configured to, for each target search word in the target search word set, determine whether a historical target search word matching the target search word exists in the historical target search word set and, if not, determine the target search word to be a newly added hotspot concept word; and a second push unit configured to push prompt information containing the determined newly added hotspot concept words to the client.
In some embodiments, the search behavior information matching each target search word includes at least one of the following: the time at which the search behavior occurred, user information of the user who performed the search behavior, and geographic location information of the user.
In some embodiments, the generation unit is further configured to: for each target search word in the target search word set, extract from the log file the search behavior information matching the target search word; parse the search behavior information to determine the daily number of searches that match the target search word and meet a second preset condition within a preset number of days; and generate the heat value of the target search word based on the determined daily numbers of searches and the preset number of days.
In some embodiments, the first push unit is further configured to: determine each target search word whose heat value is greater than a preset value to be a rising-heat hotspot concept word, and push information to be pushed that contains the determined rising-heat hotspot concept words to the client.
In some embodiments, the first push unit includes: a first sorting module configured to sort the target search words in descending order of heat value; and a first push module configured to push information to be pushed that contains the target search words sorted in descending order of heat value to the client.
In some embodiments, the first push unit includes: a parsing module configured to parse each piece of search behavior information to determine the total number of searches for each target search word within a preset time period; a second sorting module configured to sort the target search words in descending order of total number of searches; and a second push module configured to push information to be pushed that contains the target search words sorted in descending order of number of searches to the client.
The information pushing method and device provided by the embodiments of the present application parse the target search data extracted from the log file of a search engine to generate a target search word set, then analyze the target search words based on the search behavior information extracted from the log file to generate the heat value of each target search word, and finally generate and push information to be pushed based on the heat value of each target search word and/or the search behavior information, thereby realizing information pushing based on search engine log data. Because the log data of a search engine directly reflects users' current search intent, this way of pushing information improves the accuracy and timeliness of information pushing.
Detailed Description
The present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the relevant invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts related to the invention.
It should be noted that, provided there is no conflict, the embodiments of the present application and the features in those embodiments may be combined with one another. The present application is described in detail below with reference to the drawings and in conjunction with the embodiments.
Fig. 1 shows an exemplary system architecture 100 to which the information pushing method or information pushing device of the present application can be applied.
As shown in Fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104 and a server 105. The network 104 is the medium that provides communication links between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired or wireless communication links, or fiber optic cables.
A user may use the terminal devices 101, 102, 103 to interact with the server 105 through the network 104, for example to receive or send messages. Various communication client applications may be installed on the terminal devices 101, 102, 103, such as web browser applications and securities trading applications.
The terminal devices 101, 102, 103 may be various electronic devices that have a display screen and support web browsing, including but not limited to smartphones, tablet computers, laptop computers and desktop computers.
The server 105 may be a server that provides various services, for example a background server that supports the search web pages displayed on the terminal devices 101, 102, 103. The background server may extract target search data from the log file of a search engine, analyze and otherwise process the extracted target search data, and push the processing result (for example, information to be pushed) to the terminal devices.
It should be noted that the information pushing method provided by the embodiments of the present application is generally executed by the server 105; accordingly, the information pushing device is generally arranged in the server 105.
It should be understood that the numbers of terminal devices, networks and servers in Fig. 1 are merely illustrative. There may be any number of terminal devices, networks and servers, depending on implementation needs.
With continued reference to Fig. 2, a flow 200 of one embodiment of the information pushing method according to the present application is shown. The information pushing method includes the following steps:
Step 201, extracting target search data from the log file of a search engine.
In this embodiment, the electronic device on which the information pushing method runs (for example, the server shown in Fig. 1) may store the log file of the search engine in its own memory; in that case, the electronic device can obtain the log file locally and extract the target search data from it. Alternatively, the log file may be stored in another server connected to the electronic device; in that case, the electronic device can obtain the log file from that server through a wired or wireless connection and extract the target search data from it. It should be pointed out that the wireless connection may include, but is not limited to, 3G/4G, WiFi, Bluetooth, WiMAX, Zigbee and UWB (ultra wideband) connections, as well as other wireless connection types that are currently known or developed in the future.
It should be noted that the log file may record the search data generated when users search for information with the search engine. The search data may include, but is not limited to, the search statement, the time at which the search behavior occurred, the URL of the website clicked and visited after the search, the IP (Internet Protocol) address of the client, information on the operating system installed on the client, information on the browser installed on the client, user information and so on. The search statements recorded in the log file may relate to various fields (for example, tourism, finance or computing). It should be pointed out that the target search data may be multiple search statements related to a preset field (for example, the securities field), such as "how is the trend", "latest hotspot sectors" and "stock trend". The electronic device may extract the target search data from the log file in various ways.
In some optional implementations of this embodiment, a preset keyword set may be stored in the electronic device in advance, where the preset keywords in the set may be keywords that are set in advance and related to the preset field, such as "trend", "sector" and "concept stock". The electronic device may first determine the search data in the log file that meets a first preset condition to be interference search data, and delete the interference search data. Here, the first preset condition may include at least one of the following: the length of the search statement contained in the search data exceeds a preset length, the daily number of searches for the contained search statement exceeds a preset number, or the contained IP address matches a preset IP address. It should be noted that the first preset condition is not limited to those listed above. In practice, interference search data is typically search data generated by web crawlers, malicious attacks, fake searches and the like, and the first preset condition may be determined and set in advance by technical personnel based on statistics and analysis of large amounts of search data produced in such situations. After deleting the interference search data, the electronic device may obtain the preset keyword set. Then, for each preset keyword in the preset keyword set, the electronic device may extract the search data associated with the preset keyword from the log file from which the interference search data has been deleted, where the search data associated with the preset keyword may be the search statements that contain the preset keyword. As an example, the preset keyword may be "sector", and the search statements containing it may be "hotspot sectors", "latest hotspot sector ranking", "Shanghai and Shenzhen concept sectors" and so on. Finally, the electronic device may determine the extracted search data to be the target search data.
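The two stages described above (removing interference search data, then keeping the statements that contain a preset keyword) can be sketched as follows. The record layout, the function names and the threshold defaults are all assumptions made for illustration; the text does not specify a log format or concrete limits.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class SearchRecord:
    """Hypothetical shape of one log entry."""
    query: str  # search statement
    ip: str     # client IP address
    day: str    # date of the search, e.g. "2017-09-01"


def remove_interference(records, max_len=50, max_daily=1000, blocked_ips=frozenset()):
    """Drop records that meet any of the three example first preset conditions."""
    daily = Counter((r.query, r.day) for r in records)
    return [
        r for r in records
        if len(r.query) <= max_len                # statement not overly long
        and daily[(r.query, r.day)] <= max_daily  # not searched suspiciously often in a day
        and r.ip not in blocked_ips               # IP not on the preset list
    ]


def extract_target_search_data(records, preset_keywords):
    """Keep records whose search statement contains at least one preset keyword."""
    return [r for r in records if any(k in r.query for k in preset_keywords)]


records = [
    SearchRecord("latest hotspot sector ranking", "10.0.0.1", "2017-09-01"),
    SearchRecord("weather tomorrow", "10.0.0.2", "2017-09-01"),
    SearchRecord("hotspot sector", "10.0.0.9", "2017-09-01"),
]
clean = remove_interference(records, blocked_ips={"10.0.0.9"})
targets = extract_target_search_data(clean, ["trend", "sector", "concept stock"])
```

Filtering before keyword matching keeps crawler traffic from inflating the counts computed in later steps.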
In some optional implementations of this embodiment, a preset URL set may be stored in the electronic device in advance, where the preset URLs in the set may be the addresses of websites that provide information of a preset type (for example, securities information). In practice, a web address is generally represented by a uniform resource locator (URL). The electronic device may first extract from the log file the URLs of the websites that users clicked and visited after searching. Next, it may retrieve, from the extracted URLs, those that match a preset URL in the preset URL set. For each URL retrieved, the electronic device may extract the search statement recorded before the user clicked and visited the web page indicated by that URL. Finally, the electronic device may determine the extracted search statements to be the target search data.
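A minimal sketch of this URL-based extraction follows. It assumes the log has already been reduced to (search statement, clicked URL) pairs and that a URL "matches" a preset one when the host names are equal; both are illustrative assumptions, and `extract_by_clicked_url` is a hypothetical name.

```python
from urllib.parse import urlparse


def extract_by_clicked_url(log_entries, preset_hosts):
    """Return search statements whose clicked URL points at a preset website.

    log_entries: iterable of (search_statement, clicked_url) pairs (assumed layout).
    """
    hosts = {h.lower() for h in preset_hosts}
    return [
        query for query, url in log_entries
        if (urlparse(url).hostname or "") in hosts
    ]


entries = [
    ("bank stock trend", "https://stocks.example.com/quote/1"),
    ("cake recipe", "https://food.example.org/cake"),
]
matched = extract_by_clicked_url(entries, ["stocks.example.com"])
```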
In some optional implementations of this embodiment, both the preset keyword set and the preset URL set may be stored in the electronic device in advance. For each preset keyword in the preset keyword set, the electronic device may first determine the search statements in the log file that contain the preset keyword. Then, for each search statement so determined, it may determine from the log file the URL of the website that the user clicked and visited after searching with that statement, and determine whether a preset URL matching the determined URL exists in the preset URL set; if so, it extracts the search statement. Finally, the electronic device may determine the extracted search statements to be the target search data.
Step 202, parsing the target search data to generate a target search word set.
In this embodiment, the electronic device may segment the search statements that make up the target search data into words, and then analyze the words obtained from segmentation to generate the target search word set.
In this embodiment, the electronic device may split a search statement into words using various segmentation methods. The segmentation method may be statistics-based. Specifically, for each search statement in the target search data, the electronic device may count how often each combination of adjacent characters occurs in the search statement and compute the frequency of each combination. It then evaluates the probability of each combination; when the probability of a combination is higher than a preset probability threshold, the combination is judged to form a word, which achieves segmentation of the search statement. Alternatively, the segmentation method may be based on string matching, in which the text to be parsed is matched against the character strings in a machine dictionary preset in the electronic device. The string matching approach may be forward maximum matching, reverse maximum matching, segmentation-mark matching, word-by-word traversal matching, forward optimal matching, reverse optimal matching and so on.
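Of the string matching approaches listed, forward maximum matching is the simplest to illustrate. The sketch below is a textbook version, not the applicant's implementation; the dictionary contents and the `max_word_len` limit are made up, and the Chinese sample string is an assumed original of the "latest hotspot sectors" example.

```python
def forward_max_match(text, dictionary, max_word_len=4):
    """Segment text greedily, always taking the longest dictionary word at the cursor."""
    words, i = [], 0
    while i < len(text):
        # Try the longest candidate first and shrink until the dictionary matches;
        # a single character is accepted as a fallback so the scan always advances.
        for size in range(min(max_word_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                words.append(piece)
                i += size
                break
    return words


tokens = forward_max_match("最新热点板块", {"最新", "热点", "板块", "热点板块"})
```

Reverse maximum matching works the same way but scans from the end of the string.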
In some optional implementations of this embodiment, the words obtained from segmentation may be analyzed statistically. As an example, the frequency with which each obtained word occurs in the target search data may be counted and the words ranked by that frequency. One or more of the words ranked highest by frequency are then selected as target search words to generate the target search word set.
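A frequency-based selection of this kind can be sketched with a counter. The function name and the `top_k` parameter are hypothetical, and the input is assumed to be search statements that have already been segmented.

```python
from collections import Counter


def top_words_by_frequency(segmented_queries, top_k=2):
    """Count every word across all segmented statements and keep the most frequent."""
    counts = Counter(word for words in segmented_queries for word in words)
    return [word for word, _ in counts.most_common(top_k)]


top = top_words_by_frequency([
    ["latest", "hotspot", "sector"],
    ["hotspot", "sector", "ranking"],
    ["sector", "trend"],
])
```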
In some optional implementations of this embodiment, before statistically analyzing the words obtained from segmentation, the electronic device may also delete the invalid words among them. Invalid words may include colloquial words, modal particles, verbs and the like. In practice, the electronic device may store a preset invalid word set and match each word obtained from segmentation against the invalid words in that set to determine whether the word is invalid.
In some optional implementations of this embodiment, the words obtained from segmentation may be analyzed semantically. As an example, an importance score may be computed for each obtained word (for example, using Term Frequency-Inverse Document Frequency, TF-IDF), and the target search words are determined from the importance scores to generate the target search word set.
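One way to compute such importance scores is sketched below. It assumes each search statement is treated as one document, uses the plain logarithmic IDF without smoothing, and sums the per-statement scores; these choices and the function name are illustrative, since the text only names TF-IDF.

```python
import math
from collections import Counter


def tf_idf_scores(segmented_queries):
    """Sum each word's TF-IDF over all statements, treating a statement as a document."""
    n_docs = len(segmented_queries)
    # Document frequency: in how many statements does each word appear?
    doc_freq = Counter(w for words in segmented_queries for w in set(words))
    scores = Counter()
    for words in segmented_queries:
        tf = Counter(words)
        for w, c in tf.items():
            scores[w] += (c / len(words)) * math.log(n_docs / doc_freq[w])
    return dict(scores)


scores = tf_idf_scores([
    ["hotspot", "sector"],
    ["sector", "trend"],
    ["sector", "ranking"],
])
```

With unsmoothed IDF, a word that appears in every statement scores zero, so a practical system might smooth the IDF or combine it with raw frequency.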
In some optional implementations of this embodiment, after segmentation, the electronic device may first apply semantic analysis to the words obtained from segmentation to determine at least one target search word, and then cluster the at least one target search word to generate the target search word set. As an example, English words that differ only in letter case may be treated as the same word; Chinese words, English words and abbreviations with the same meaning may be treated as synonyms; and semantically similar words among the at least one target search word may be treated as the same word (for example, "SOE reform" and "state-owned enterprise reform").
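The merging rules in this example can be sketched as normalization to a canonical form. The synonym table here is a hypothetical, hand-written mapping; how synonyms and semantically similar words are actually detected is left open by the text.

```python
def cluster_search_words(words, synonyms):
    """Group words under a canonical form.

    synonyms maps a lowercase variant to its canonical word (hypothetical table).
    Words that differ only in letter case fall into the same group.
    """
    clusters = {}
    for w in words:
        key = w.casefold()
        canonical = synonyms.get(key, key)
        clusters.setdefault(canonical, []).append(w)
    return clusters


clusters = cluster_search_words(
    ["ETF", "etf", "SOE reform", "state-owned enterprise reform"],
    {"soe reform": "state-owned enterprise reform"},
)
```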
It should be noted that the segmentation methods, semantic analysis methods and clustering methods mentioned above are well-known techniques that are widely studied and applied at present, and they are not described further here.
Step 203, for each target search word in the target search word set, extracting from the log file the search behavior information matching the target search word, and parsing the search behavior information to generate the heat value of the target search word.
In this embodiment, for each target search word in the target search word set, the electronic device may first determine the search statements in the log file that contain the target search word or a synonym of it (for example, an abbreviation of the target search word or its English or Chinese equivalent). Next, it determines the search behavior information in the log file that matches the determined search statements, and takes that information as the search behavior information matching the target search word. Finally, it parses the search behavior information to generate the heat value of the target search word. It should be pointed out that the search behavior information may include, but is not limited to, various information associated with the search behavior, for example the time at which the search behavior occurred, the URL of the website clicked and visited after the search, the IP address of the client, information on the operating system installed on the client, information on the browser installed on the client, location information of the client, and user information (such as the user's age and occupation). The heat value may be a number that characterizes how much attention the target search word is receiving. The electronic device may generate the heat value of the target search word using various analysis methods.
In some optional implementations of this embodiment, for each target search word in the target search word set, the electronic device may generate the heat value of the target search word as follows. First, the electronic device may parse the search behavior information matching the target search word and determine the times at which the matching search behaviors occurred. Next, it may compile statistics on the determined times to obtain the daily number of searches matching the target search word within a preset number of days (for example, 10 or 20). Finally, it may generate the heat value of the target search word based on the determined daily numbers of searches and the preset number of days. As an example, the electronic device may calculate the heat value of the target search word using a formula [formula not preserved in this text] whose symbols are defined as follows:
n is a positive integer denoting the preset number of days; i is a positive integer not less than 1 and not greater than n; $x_i$ denotes the day number of the i-th day, with $x_i = i$ (for example, the day number of the 1st day is 1, that is, $x_1 = 1$); $\bar{x}$ is the average day number, with $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$; $y_i$ is the determined number of searches on the i-th day; and $\bar{y}$ is the average daily number of searches over the n days, which may be calculated as $\bar{y} = \frac{1}{n}\sum_{i=1}^{n} y_i$.
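The heat value formula itself is not preserved in this text. The symbols it defines (day numbers $x_i = i$, their mean $\bar{x}$, daily counts $y_i$ and their mean $\bar{y}$) are the ones used by a least-squares trend line, so the sketch below assumes, purely for illustration, that the heat value is the slope of that line. This is an assumption, not the formula stated by the applicant, and the function name is hypothetical.

```python
def heat_value_as_slope(daily_counts):
    """ASSUMED heat value: least-squares slope of daily search counts over day numbers.

    daily_counts[i - 1] is y_i, the number of searches on day i (x_i = i).
    """
    n = len(daily_counts)
    if n < 2:
        raise ValueError("need at least two days of counts")
    xs = range(1, n + 1)
    x_mean = sum(xs) / n
    y_mean = sum(daily_counts) / n
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, daily_counts))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    return numerator / denominator


rising = heat_value_as_slope([10, 20, 30, 40])  # counts grow by 10 per day
flat = heat_value_as_slope([5, 5, 5])           # no change from day to day
```

Under this reading, a positive value means the word is being searched more each day, which fits the later use of a threshold to find words whose heat is rising.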
In some optional implementations of this embodiment, for each target search word in the target search word set, the electronic device may generate the heat value of the target search word as follows. First, the electronic device may parse the search behavior information matching the target search word and extract the times at which the matching search behaviors occurred, the geographic location information of the client, and the user information (for example, the user's age and occupation). Based on the extracted information, it determines the daily number of searches that match the target search word within a preset number of days (for example, 10 or 20) and that meet a preset location condition and/or a preset user condition. The preset location condition may be that the client is located in a preset province or city (for example, Beijing), that the client is located in a preset region (for example, North China), and so on. The preset user condition may be that the user's age falls within a preset age range (for example, 30 to 40 years old), that the user's occupation is a preset occupation (for example, teacher), and so on. Finally, the heat value of the target search word may be generated according to the above formula, based on the determined daily numbers of searches and the preset number of days.
In some optional implementations of this embodiment, for each target search word in the target search word set, the electronic device may also parse the search behavior information matching the target search word and determine the times at which the matching search behaviors occurred. It may then compile statistics on the determined times to obtain the total number of searches matching the target search word within a preset duration (for example, the last 24 hours, the last 48 hours or the last 5 days), and take that total number of searches as the heat value of the target search word.
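Counting the searches that fall within a preset duration can be sketched as follows. The function name, the inclusive window boundaries and the sample timestamps are assumptions made for illustration.

```python
from datetime import datetime, timedelta


def total_searches_in_window(search_times, now, window=timedelta(hours=24)):
    """Count search occurrence times that fall within [now - window, now]."""
    start = now - window
    return sum(1 for t in search_times if start <= t <= now)


now = datetime(2017, 9, 2, 12, 0)
times = [
    datetime(2017, 9, 2, 9, 0),   # 3 hours ago
    datetime(2017, 9, 1, 13, 0),  # 23 hours ago
    datetime(2017, 8, 30, 8, 0),  # more than 3 days ago
]
total = total_searches_in_window(times, now)
```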
Step 204, generating information to be pushed based on the heat value of each target search word and/or the search behavior information, and pushing the information to be pushed to a client.
In this embodiment, the electronic device may generate information to be pushed based on the heat value of each target search word and/or the search behavior information, and push the information to be pushed to the clients connected to the electronic device (for example, the clients 101, 102, 103 shown in Fig. 1).
In some optional implementations of this embodiment, the electronic device may generate the information to be pushed based on the heat value of each target search word. Specifically, each target search word whose heat value is greater than a preset value may be determined to be a rising-heat hotspot concept word, and information to be pushed that contains the determined rising-heat hotspot concept words is pushed to the client. It should be noted that the information to be pushed may also include a character string that indicates the content of the pushed information, such as "[Alert] Concepts with continuously rising heat".
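Selecting the words above the preset value and composing the message can be sketched as follows. The message layout, the function name and the sample heat values are illustrative assumptions; the text only says the pushed information contains the words and may carry a prompt string.

```python
def build_rising_alert(heat_values, threshold,
                       title="[Alert] Concepts with continuously rising heat"):
    """Build a push message listing the words whose heat value exceeds the threshold.

    Returns None when no word qualifies, so that nothing is pushed.
    """
    rising = [word for word, heat in heat_values.items() if heat > threshold]
    if not rising:
        return None
    return f"{title}: " + ", ".join(rising)


msg = build_rising_alert({"blockchain": 12.5, "new energy": 3.0}, threshold=5.0)
```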
In some optional implementations of this embodiment, the electronic device may sort the target search words in descending order of heat value, and then push information to be pushed that contains the sorted target search words to the client. It should be noted that the information to be pushed may also include a character string that indicates the content of the pushed information, such as "Hotspot concept ranking".
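The descending sort can be sketched in a few lines. The heat values and the message layout are made up for illustration; the same function would work for ranking by total number of searches if the mapping held search counts instead.

```python
def rank_by_heat(heat_values):
    """Return the target search words sorted by heat value, largest first."""
    return sorted(heat_values, key=heat_values.get, reverse=True)


ranking = rank_by_heat({"new energy": 3.0, "blockchain": 12.5, "5G": 7.1})
message = "Hotspot concept ranking: " + " > ".join(ranking)
```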
In some optional implementations of this embodiment, the electronic device may parse each piece of search behavior information to determine the total number of searches for each target search word within a preset time period (for example, the last 10 days or the last 24 hours). It then sorts the target search words in descending order of total number of searches. Finally, it pushes information to be pushed that contains the target search words sorted in descending order of number of searches to the client. It should be noted that the information to be pushed may also include a character string that indicates the content of the pushed information, such as "[Alert] Recent search volume ranking".
In some optional implementations of this embodiment, the electronic device may generate the information to be pushed based on both the heat value of each target search word and the search behavior information, where the information to be pushed may include one or more of the following at the same time: the determined rising-heat hotspot concept words, the target search words sorted in descending order of heat value, and the target search words sorted in descending order of number of searches.
With continued reference to Fig. 3, Fig. 3 is a schematic diagram 300 of an application scenario of the information pushing method according to this embodiment. In the application scenario of Fig. 3, the server 301 first extracts target search data 303 from the log data of the search engine 302. Then, the server 301 parses the target search data 303 to generate a target search word set 304. Next, the server 301 extracts the search behavior information 305 that matches each target search word and parses the search behavior information 305 to generate a hot value 306 for each target search word. Finally, the server 301 generates information to be pushed 307 based on the hot value 306 and/or the search behavior information 305, and pushes the information to be pushed 307 to the server side 308.
The method provided by the above embodiment of the present application parses the target search data extracted from the log file of the search engine to generate a target search word set; then, based on the search behavior information extracted from the log file, generates the hot value of each target search word; and finally generates and pushes information to be pushed based on the hot value and/or search behavior information of each target search word, thereby realizing information pushing based on the log data of a search engine. Because the log data of a search engine directly reflects users' current search intent, this manner of pushing improves the accuracy and timeliness of information pushing.
With further reference to Fig. 4, a flow 400 of another embodiment of the information pushing method is shown. The flow 400 of the information pushing method includes the following steps:
Step 401: determine the search data in the log file of the search engine that meets a first preset condition to be interfering search data, and delete the interfering search data.
In this embodiment, the electronic device on which the information pushing method runs (for example, the server shown in Fig. 1) may store the log file of the search engine in its own memory. The electronic device may first determine the search data in the log file that meets the first preset condition to be interfering search data, and delete the interfering search data. Here, the first preset condition may include at least one of the following: the length of the contained search statement exceeds a preset length, the daily number of searches of the contained search statement exceeds a preset number, or the contained IP address matches a preset IP address. It should be noted that the first preset condition is not limited to those listed above.
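A minimal sketch of the interference filter in step 401 is given below, under stated assumptions: the `LogRecord` fields, the function name `remove_interference`, and the choice to count daily searches per (search statement, day) pair are all hypothetical, since the application does not fix a log format or say how the daily count is keyed.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Set


@dataclass(frozen=True)
class LogRecord:
    query: str  # the search statement
    day: str    # calendar day of the search, e.g. "2017-06-01"
    ip: str     # client IP address


def remove_interference(
    records: List[LogRecord],
    max_query_length: int,
    max_daily_count: int,
    blocked_ips: Set[str],
) -> List[LogRecord]:
    """Drop records that meet any of the three example first preset conditions."""
    # Assumption: the daily number of searches is counted per (statement, day).
    daily_counts = Counter((r.query, r.day) for r in records)
    kept = []
    for r in records:
        too_long = len(r.query) > max_query_length
        too_frequent = daily_counts[(r.query, r.day)] > max_daily_count
        blocked = r.ip in blocked_ips
        if not (too_long or too_frequent or blocked):
            kept.append(r)
    return kept


records = [
    LogRecord("blockchain stocks", "2017-06-01", "10.0.0.1"),
    LogRecord("x" * 200, "2017-06-01", "10.0.0.2"),     # statement too long
    LogRecord("new energy", "2017-06-01", "10.0.0.9"),  # preset IP address
    LogRecord("buy now", "2017-06-01", "10.0.0.3"),     # searched too often
    LogRecord("buy now", "2017-06-01", "10.0.0.4"),
    LogRecord("buy now", "2017-06-01", "10.0.0.5"),
]
kept = remove_interference(records, max_query_length=100, max_daily_count=2, blocked_ips={"10.0.0.9"})
```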
Step 402: obtain a preset keyword set; for each preset keyword in the preset keyword set, extract the search data associated with that preset keyword from the log file from which the interfering search data has been deleted; and determine the extracted search data to be the target search data.
In this embodiment, the electronic device may store a preset keyword set, where the preset keywords in the set may be keywords set in advance that relate to a preset field (for example, the securities field). The electronic device may obtain the preset keyword set; then, for each preset keyword in the set, extract the search data associated with that preset keyword from the log file from which the interfering search data has been deleted, where the search data associated with the preset keyword may be the search statements containing the preset keyword; finally, the electronic device may determine the extracted search data to be the target search data.
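The keyword-based extraction in step 402 can be illustrated as follows. The function name `extract_target_search_data`, the sample search statements, and the case-insensitive substring match are assumptions for the example; the application only states that associated search data may be the search statements containing the preset keyword.

```python
from typing import Dict, List


def extract_target_search_data(queries: List[str], preset_keywords: List[str]) -> Dict[str, List[str]]:
    """For each preset keyword, collect the search statements that contain it."""
    result: Dict[str, List[str]] = {}
    for keyword in preset_keywords:
        needle = keyword.casefold()  # assumption: matching ignores letter case
        result[keyword] = [q for q in queries if needle in q.casefold()]
    return result


# Hypothetical search statements left after the interfering data was deleted.
queries = [
    "SOE reform stocks",
    "weather tomorrow",
    "blockchain concept shares",
    "Blockchain ETF",
]
target = extract_target_search_data(queries, ["blockchain", "SOE reform"])
```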
Step 403: perform semantic analysis on the target search data and extract at least one target search word.
In this embodiment, the electronic device may analyze the target search data by means of semantic analysis. As an example, the target search data may first be segmented into words; then, an importance calculation is performed on the resulting words, and at least one target search word is extracted based on the result of the importance calculation.
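One possible reading of "segment, then score importance" is sketched below. It is deliberately simplified: whitespace splitting stands in for a real word segmenter (Chinese text would need a dedicated one), raw frequency stands in for the unspecified importance calculation, and the stop-word list, `top_k` parameter, and function name are all hypothetical.

```python
from collections import Counter
from typing import List

# Assumption: a tiny stop-word list; a real system would use a fuller one.
STOPWORDS = {"the", "of", "what", "is", "a", "in", "for"}


def extract_target_search_words(queries: List[str], top_k: int) -> List[str]:
    """Segment each search statement and keep the top_k most frequent words."""
    counts: Counter = Counter()
    for query in queries:
        tokens = query.lower().split()  # stand-in for word segmentation
        counts.update(t for t in tokens if t not in STOPWORDS)
    # Importance is approximated by frequency; ties are broken alphabetically.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:top_k]]


queries = ["what is blockchain", "blockchain stocks", "stocks in the energy sector"]
words = extract_target_search_words(queries, top_k=2)
```

A weighting such as TF-IDF could replace the frequency count without changing the surrounding structure.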
Step 404: perform clustering on the at least one target search word to generate a target search word set.
In this embodiment, the electronic device may perform clustering on the at least one target search word to generate a target search word set. As an example, English words that differ only in letter case may be treated as the same word; Chinese words, English words, and abbreviations with the same meaning may be treated as synonyms; and semantically similar words among the at least one target search word may be treated as the same word (for example, "SOE reform" and "state-owned enterprise reform"). After generating the target search word set, the electronic device may execute steps 405 to 407; at the same time, it may also execute steps 408 to 409.
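The merging rules in step 404 can be sketched with case folding plus a synonym table. The function name `cluster_search_words` and the hand-written `synonyms` mapping are assumptions for illustration; the application does not say how synonyms or semantically similar words are detected.

```python
from typing import Dict, List


def cluster_search_words(words: List[str], synonyms: Dict[str, str]) -> Dict[str, List[str]]:
    """Group words under a canonical form; the keys form the target search word set."""
    clusters: Dict[str, List[str]] = {}
    for word in words:
        key = word.casefold()               # words differing only in case are merged
        canonical = synonyms.get(key, key)  # known synonyms map to one canonical word
        clusters.setdefault(canonical, []).append(word)
    return clusters


# Hypothetical synonym table keyed by case-folded variants.
synonyms = {"soe reform": "state-owned enterprise reform"}
words = ["SOE reform", "soe Reform", "state-owned enterprise reform", "Blockchain"]
clusters = cluster_search_words(words, synonyms)
target_search_word_set = set(clusters)
```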
Step 405: obtain a preset historical target search word set.
In this embodiment, the electronic device may store a historical target search word set, which may be stored in the electronic device in advance, before the information pushing method is executed. The electronic device may obtain the historical target search word set directly from local storage. It should be noted that, after execution of the information pushing method is complete, the electronic device may also store the target search word set generated in step 404 as a historical target search word set.
Step 406: for each target search word in the target search word set, determine whether the historical target search word set contains a historical target search word that matches the target search word; if not, determine the target search word to be a new hotspot concept word.
In this embodiment, for each target search word in the target search word set, the electronic device may determine whether the historical target search word set contains a historical target search word that matches the target search word; if not, the target search word may be determined to be a new hotspot concept word.
Step 407: push prompt information containing the determined new hotspot concept words to the client.
In this embodiment, the electronic device may generate prompt information based on the new hotspot concept words determined in step 406. The prompt information may contain each new hotspot concept word determined in step 406. It may also contain a character string indicating the content of the prompt information, such as "【Reminder】New hotspot concepts".
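Steps 406 and 407 amount to a membership test against the historical set followed by message assembly, as in the sketch below. Exact string equality is used as the matching rule, which is an assumption; the function names and sample words are likewise hypothetical.

```python
from typing import Iterable, List, Set


def find_new_hotspot_words(target_words: Iterable[str], history_words: Set[str]) -> List[str]:
    """Return the target search words with no match in the historical set."""
    return [word for word in target_words if word not in history_words]


def build_prompt(new_words: List[str]) -> str:
    """Combine a content-describing string with the new hotspot concept words."""
    return "[Reminder] New hotspot concepts: " + ", ".join(new_words)


history = {"new energy", "SOE reform"}
new_words = find_new_hotspot_words(["blockchain", "new energy", "SOE reform"], history)
prompt = build_prompt(new_words)
```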
Step 408: for each target search word in the target search word set, extract from the log file the search behavior information that matches the target search word; parse the search behavior information to determine the daily numbers of searches that fall within a preset number of days, meet a second preset condition, and match the target search word; and generate the hot value of the target search word based on the determined daily numbers of searches and the preset number of days.
In this embodiment, for each target search word in the target search word set generated in step 404, the electronic device may first determine the search statements in the log file that contain the target search word or a synonym of the target search word. Then, it determines the search behavior information in the log file that matches the determined search statements, and takes that search behavior information as the search behavior information matching the target search word. It should be noted that the search behavior information may include, but is not limited to, various kinds of information associated with a search behavior, for example, the time at which the search behavior occurred, the address of the website clicked and visited after the search, the IP address of the client, information on the operating system installed on the client, information on the browser installed on the client, the geographic location information of the client, and user information. Next, the search behavior information matching the target search word may be parsed to extract the time of occurrence of the matching search behaviors, the client geographic location information, and the user information, and, based on the extracted information, to determine the daily numbers of searches that fall within the preset number of days, meet the second preset condition, and match the target search word. The second preset condition may include at least one of the following: a preset geographic location condition and a preset user condition. The preset geographic location condition may be, for example, that the client is located in a preset province or city, or that the client is located in a preset region; the preset user condition may be, for example, that the user's age is within a preset age range, or that the user's occupation is a preset occupation. Finally, the electronic device may generate the hot value of the target search word based on the determined daily numbers of searches and the preset number of days.
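The application does not give a formula for the hot value, so the sketch below assumes one: the average number of qualifying searches per day over the preset number of days. That averaging rule, the `SearchBehavior` fields, and the decision to apply both a province condition and an age condition (the text requires only at least one) are assumptions for illustration.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Dict, Iterable, Set, Tuple


@dataclass(frozen=True)
class SearchBehavior:
    day: date       # date on which the search behavior occurred
    province: str   # client geographic location
    user_age: int   # taken from the user information


def daily_search_counts(
    behaviors: Iterable[SearchBehavior],
    today: date,
    preset_days: int,
    provinces: Set[str],
    age_range: Tuple[int, int],
) -> Dict[date, int]:
    """Count, per day, the searches inside the window that meet both conditions."""
    first_day = today - timedelta(days=preset_days - 1)
    low, high = age_range
    counts: Counter = Counter()
    for b in behaviors:
        in_window = first_day <= b.day <= today
        if in_window and b.province in provinces and low <= b.user_age <= high:
            counts[b.day] += 1
    return dict(counts)


def hot_value(daily_counts: Dict[date, int], preset_days: int) -> float:
    """Assumed formula: mean qualifying searches per day over the preset days."""
    return sum(daily_counts.values()) / preset_days


behaviors = [
    SearchBehavior(date(2017, 6, 1), "Beijing", 30),
    SearchBehavior(date(2017, 6, 2), "Beijing", 35),
    SearchBehavior(date(2017, 6, 2), "Beijing", 70),   # fails the age condition
    SearchBehavior(date(2017, 6, 3), "Shanghai", 28),  # fails the location condition
    SearchBehavior(date(2017, 5, 20), "Beijing", 30),  # outside the 3-day window
    SearchBehavior(date(2017, 6, 3), "Beijing", 40),
    SearchBehavior(date(2017, 6, 3), "Beijing", 25),
]
counts = daily_search_counts(behaviors, date(2017, 6, 3), 3, {"Beijing"}, (18, 60))
value = hot_value(counts, preset_days=3)
```

Other choices, such as weighting recent days more heavily or using the day-over-day growth rate, would fit the same inputs.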
Step 409: generate information to be pushed based on the hot value and/or search behavior information of each target search word, and push the information to be pushed to the client.
In this embodiment, the electronic device may generate information to be pushed based on the hot value and/or search behavior information of each target search word, and push the information to be pushed to a client connected to the electronic device (for example, the clients 101, 102, and 103 shown in Fig. 1). Specifically, the electronic device may determine a target search word whose hot value exceeds a preset value to be a rising-heat hotspot concept word; it may also sort the target search words in descending order of hot value; in addition, it may parse each piece of search behavior information to determine the total number of searches for each target search word within a preset time period, and sort the target search words in descending order of total number of searches. Then, the electronic device may push to the client information to be pushed that contains the determined rising-heat hotspot concept words, the target search words sorted in descending order of hot value, and the target search words sorted in descending order of number of searches.
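Assembling the three kinds of content named in step 409 into one payload could look like the sketch below. The dictionary keys, the JSON serialization, the function name, and the sample figures are assumptions; the application does not define a message format.

```python
import json
from typing import Dict, List


def build_information_to_push(
    hot_values: Dict[str, float],
    total_searches: Dict[str, int],
    threshold: float,
) -> Dict[str, List[str]]:
    """Collect rising hotspot words and both rankings into one structure."""
    by_hot = sorted(hot_values, key=lambda w: (-hot_values[w], w))
    by_searches = sorted(total_searches, key=lambda w: (-total_searches[w], w))
    return {
        "rising_hotspot_words": [w for w in by_hot if hot_values[w] > threshold],
        "ranked_by_hot_value": by_hot,
        "ranked_by_total_searches": by_searches,
    }


# Hypothetical inputs produced by the earlier steps.
hot_values = {"blockchain": 91.2, "new energy": 42.0, "SOE reform": 87.5}
total_searches = {"blockchain": 1200, "new energy": 3100, "SOE reform": 800}
info = build_information_to_push(hot_values, total_searches, threshold=80.0)
payload = json.dumps(info, ensure_ascii=False)  # serialized form sent to the client
```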
As can be seen from Fig. 4, compared with the embodiment corresponding to Fig. 2, the flow 400 of the information pushing method in this embodiment highlights the step of determining new hotspot concept words. The scheme described in this embodiment can therefore mine new hotspot concept words from the log data of a search engine, which improves the richness of the pushed information while also improving the accuracy and timeliness of information pushing.
With further reference to Fig. 5, as an implementation of the methods shown in the above figures, the present application provides an embodiment of an information pushing device. This device embodiment corresponds to the method embodiment shown in Fig. 2, and the device can be applied in various electronic devices.
As shown in Fig. 5, the information pushing device 500 of this embodiment includes: an extraction unit 501, configured to extract target search data from the log file of a search engine; a parsing unit 502, configured to parse the target search data and generate a target search word set; a generation unit 503, configured to, for each target search word in the target search word set, extract from the log file the search behavior information that matches the target search word, and parse the search behavior information to generate the hot value of the target search word; and a first push unit 504, configured to generate information to be pushed based on the hot value and/or search behavior information of each target search word, and push the information to be pushed to the client.
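The division of the device into four units can be pictured as a small pipeline in which each unit is a replaceable callable. The class `InformationPushDevice`, its field names, and the toy lambdas below are hypothetical stand-ins chosen for brevity; they are not the application's implementation of units 501 to 504.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class InformationPushDevice:
    extract: Callable[[List[str]], List[str]]    # role of extraction unit 501
    parse: Callable[[List[str]], List[str]]      # role of parsing unit 502
    generate: Callable[[str, List[str]], float]  # role of generation unit 503
    push: Callable[[Dict[str, float]], str]      # role of first push unit 504

    def run(self, log_lines: List[str]) -> str:
        target_data = self.extract(log_lines)
        words = self.parse(target_data)
        hot_values = {w: self.generate(w, log_lines) for w in words}
        return self.push(hot_values)


# Toy stand-ins: keep lines mentioning "stock", take the first token as the
# search word, and use the number of matching lines as the hot value.
device = InformationPushDevice(
    extract=lambda lines: [l for l in lines if "stock" in l],
    parse=lambda data: sorted({l.split()[0] for l in data}),
    generate=lambda word, lines: float(sum(word in l for l in lines)),
    push=lambda hv: ", ".join(f"{w}={v}" for w, v in hv.items()),
)
result = device.run(["blockchain stock", "weather today", "energy stock", "blockchain stock"])
```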
In this embodiment, the information pushing device 500 may store the log file of the search engine; the extraction unit 501 may obtain the log file directly from local storage and extract target search data from it. The target search data may be a plurality of search statements related to a preset field (for example, the securities field).
In some optional implementations of this embodiment, the extraction unit 501 may include a deletion module and a determination module (not shown). The deletion module may be configured to determine the search data in the log file of the search engine that meets the first preset condition to be interfering search data, and to delete the interfering search data. The determination module may be configured to obtain a preset keyword set and, for each preset keyword in the preset keyword set, extract the search data associated with that preset keyword from the log file from which the interfering search data has been deleted, and determine the extracted search data to be the target search data.
In this embodiment, the parsing unit 502 may segment the search statements that constitute the target search data into words, and then analyze the words obtained after segmentation to generate a target search word set.
In some optional implementations of this embodiment, the parsing unit 502 may include an analysis module and a clustering module (not shown). The analysis module is configured to perform semantic analysis on the target search data and extract at least one target search word. The clustering module may be configured to perform clustering on the at least one target search word to generate a target search word set.
In some optional implementations of this embodiment, the information pushing device 500 further includes an acquisition unit, a determination unit, and a second push unit (not shown). The acquisition unit may be configured to obtain a preset historical target search word set. The determination unit may be configured to, for each target search word in the target search word set, determine whether the historical target search word set contains a historical target search word that matches the target search word, and if not, determine the target search word to be a new hotspot concept word. The second push unit may be configured to push prompt information containing the determined new hotspot concept words to the client.
In this embodiment, for each target search word in the target search word set, the generation unit 503 may first determine the search statements in the log file that contain the target search word or a synonym of the target search word; then determine the search behavior information in the log file that matches the determined search statements, and take that search behavior information as the search behavior information matching the target search word; and finally parse the search behavior information to generate the hot value of the target search word.
In some optional implementations of this embodiment, the search behavior information that matches each target search word includes at least one of the following: the time at which the search behavior occurred, the user information of the user who performed the search behavior, and the geographic location information of that user.
In some optional implementations of this embodiment, the generation unit 503 may be further configured to, for each target search word in the target search word set, extract from the log file the search behavior information that matches the target search word; parse the search behavior information to determine the daily numbers of searches that fall within a preset number of days, meet the second preset condition, and match the target search word; and generate the hot value of the target search word based on the determined daily numbers of searches and the preset number of days.
In this embodiment, the first push unit 504 may generate information to be pushed based on the hot value and/or search behavior information of each target search word, and push the information to be pushed to a client connected to the information pushing device 500 (for example, the clients 101, 102, and 103 shown in Fig. 1).
In some optional implementations of this embodiment, the first push unit 504 may be further configured to determine a target search word whose hot value exceeds a preset value to be a rising-heat hotspot concept word, and to push to the client information to be pushed that contains the determined rising-heat hotspot concept words.
In some optional implementations of this embodiment, the first push unit 504 may include a first sorting module and a first pushing module (not shown). The first sorting module may be configured to sort the target search words in descending order of hot value. The first pushing module may be configured to push to the client information to be pushed that contains the target search words sorted in descending order of hot value.
In some optional implementations of this embodiment, the first push unit 504 may include a parsing module, a second sorting module, and a second pushing module (not shown). The parsing module may be configured to parse each piece of search behavior information and determine the total number of searches for each target search word within a preset time period. The second sorting module may be configured to sort the target search words in descending order of total number of searches. The second pushing module may be configured to push to the client information to be pushed that contains the target search words sorted in descending order of number of searches.
In the device provided by the above embodiment of the present application, the parsing unit 502 parses the target search data that the extraction unit 501 extracts from the log file of the search engine to generate a target search word set; the generation unit 503 then generates the hot value of each target search word based on the search behavior information extracted from the log file; and finally the first push unit 504 generates and pushes information to be pushed based on the hot value and/or search behavior information of each target search word, thereby realizing information pushing based on the log data of a search engine. Because the log data of a search engine directly reflects users' current search intent, this manner of pushing improves the accuracy and timeliness of information pushing.
Referring now to Fig. 6, a schematic structural diagram of a computer system 600 suitable for implementing the server of the embodiments of the present application is shown. The server shown in Fig. 6 is only an example and should not impose any limitation on the function or scope of use of the embodiments of the present application.
As shown in Fig. 6, the computer system 600 includes a central processing unit (CPU) 601, which can perform various appropriate actions and processing according to a program stored in a read-only memory (ROM) 602 or a program loaded from a storage section 608 into a random access memory (RAM) 603. The RAM 603 also stores the various programs and data required for the operation of the system 600. The CPU 601, the ROM 602, and the RAM 603 are connected to one another via a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.
The following components are connected to the I/O interface 605: an input section 606 including a keyboard, a mouse, and the like; an output section 607 including a cathode ray tube (CRT), a liquid crystal display (LCD), and the like, as well as a speaker; a storage section 608 including a hard disk and the like; and a communication section 609 including a network interface card such as a LAN card or a modem. The communication section 609 performs communication processing via a network such as the Internet. A drive 610 is also connected to the I/O interface 605 as needed. A removable medium 611, such as a magnetic disk, an optical disc, a magneto-optical disc, or a semiconductor memory, is mounted on the drive 610 as needed, so that a computer program read from it can be installed into the storage section 608 as needed.
In particular, according to embodiments of the present disclosure, the process described above with reference to the flowcharts may be implemented as a computer software program. For example, an embodiment of the present disclosure includes a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication section 609, and/or installed from the removable medium 611. When the computer program is executed by the central processing unit (CPU) 601, the above functions defined in the method of the present application are performed. It should be noted that the computer-readable medium described in the present application may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present application, a computer-readable storage medium may be any tangible medium that contains or stores a program, where the program may be used by or in connection with an instruction execution system, apparatus, or device. In the present application, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device. The program code contained on a computer-readable medium may be transmitted by any suitable medium, including but not limited to: wireless, wire, optical cable, RF, and the like, or any suitable combination of the above.
The flowcharts and block diagrams in the drawings illustrate the architecture, functions, and operations of possible implementations of systems, methods, and computer program products according to various embodiments of the present application. In this regard, each block in a flowchart or block diagram may represent a module, a program segment, or a portion of code, which contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur in an order different from that noted in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or they may sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, may be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The units described in the embodiments of the present application may be implemented in software or in hardware. The described units may also be provided in a processor; for example, it may be described as: a processor includes an extraction unit, a parsing unit, a generation unit, and a first push unit. The names of these units do not, in some cases, constitute a limitation on the units themselves; for example, the extraction unit may also be described as "a unit that extracts target search data".
In another aspect, the present application also provides a computer-readable medium, which may be included in the device described in the above embodiments, or may exist separately without being assembled into the device. The computer-readable medium carries one or more programs which, when executed by the device, cause the device to: extract target search data from the log file of a search engine; parse the target search data to generate a target search word set; for each target search word in the target search word set, extract from the log file the search behavior information that matches the target search word, and parse the search behavior information to generate the hot value of the target search word; and generate information to be pushed based on the hot value and/or search behavior information of each target search word, and push the information to be pushed to the client.
The above description is only of preferred embodiments of the present application and an explanation of the technical principles applied. Those skilled in the art should understand that the scope of the invention involved in the present application is not limited to technical solutions formed by the specific combination of the above technical features, and should also cover other technical solutions formed by any combination of the above technical features or their equivalents without departing from the inventive concept, for example, technical solutions formed by replacing the above features with technical features having similar functions disclosed in (but not limited to) the present application.