CN110889050A - Method and device for mining generic brand words - Google Patents

Method and device for mining generic brand words Download PDF

Info

Publication number
CN110889050A
CN110889050A CN201811043835.XA CN201811043835A CN110889050A CN 110889050 A CN110889050 A CN 110889050A CN 201811043835 A CN201811043835 A CN 201811043835A CN 110889050 A CN110889050 A CN 110889050A
Authority
CN
China
Prior art keywords
brand
word
search result
words
click log
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811043835.XA
Other languages
Chinese (zh)
Inventor
蔡舵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201811043835.XA priority Critical patent/CN110889050A/en
Publication of CN110889050A publication Critical patent/CN110889050A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for mining generic brand words, which comprises the following steps: acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words. The invention solves the technical problem of lower related query flow for the brand in the existing search, and realizes automatic mining of the generic brand words, thereby improving the related query flow of the brand and improving the brand search effect. Meanwhile, the invention also discloses a device for excavating the generic brand words.

Description

Method and device for mining generic brand words
Technical Field
The invention relates to the technical field of internet, in particular to a method and a device for mining generic brand words.
Background
Search advertisement marketing is mainly realized by depending on an advertisement search engine, and when a search word (namely, a query word) input by a user is an advertisement keyword of a certain commodity, a search result corresponding to the commodity is output so as to realize the purpose of advertisement promotion.
The advertising keywords are manually picked and added by the advertiser and require bid purchases by the advertiser. The advertisement keywords may be brand words or other words related to the brand words (i.e., over-brand words).
However, since the advertisement keywords are usually selected manually by the advertiser, the advertiser cannot enumerate all the brand-related queries of the search engine user (i.e., cannot effectively obtain the broad-brand words), which results in low brand-related query traffic and poor brand search effect.
Disclosure of Invention
The embodiment of the invention provides the method and the device for mining the broad-band brand words, solves the technical problem of low related query flow for brands in the existing search, and realizes automatic mining of the broad-band brand words, thereby improving the related query flow of the brands and improving the brand search effect.
In a first aspect, the present invention provides the following technical solutions through an embodiment of the present invention:
a method for mining generic brand words comprises the following steps:
acquiring at least one brand word;
obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word;
and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
Preferably, the obtaining at least one brand word includes:
crawling one or more brand words from a registered brand display website page through a web crawler;
creating a list of brand words based on the one or more brand words;
extracting the brand word from the list of brand words.
Preferably, the performing, based on the specific search result click log and the natural search result click log, a broad-brand word mining on the brand word to obtain a broad-brand word corresponding to the brand word includes:
extracting a first specific search result click log which takes the brand word as a query word from the specific search result click log;
extracting a first-level domain name corresponding to the brand word from the first specific search result click log;
searching a click Uniform Resource Locator (URL) related to a first-level domain name corresponding to the brand word in the natural search result click log, and acquiring a query word corresponding to the click URL;
and when the query word corresponding to the clicked URL meets a preset condition, determining the query word as a universal brand word corresponding to the brand word.
Preferably, the searching for a click uniform resource locator URL related to a primary domain name corresponding to the brand word in the natural search result click log includes:
searching a first click URL in the natural search result click log, wherein a first-level domain name of the first click URL is the same as a first-level domain name corresponding to the brand word; and/or
And searching a second click URL in the natural search result click log, wherein the primary domain name of the second click URL is the same as the URL of the website homepage to which the primary domain name corresponding to the brand word belongs.
Preferably, when the query term corresponding to the click URL satisfies a preset condition, determining the query term as a generic brand term corresponding to the brand term includes:
acquiring the click frequency of the query word, wherein the click frequency is used for representing the total number of times that all search results corresponding to the query word are clicked within a preset time period;
judging whether the click frequency is greater than a preset frequency or not;
and if so, determining the query word as a universal brand word corresponding to the brand word.
Preferably, after obtaining the pan-brand word corresponding to the brand word, the method further includes:
setting the generic brand word as a brand search keyword of the brand word, thereby establishing association between the generic brand word and a specific search result corresponding to the brand word.
In a second aspect, based on the same inventive concept, the invention provides the following technical solutions through an embodiment of the invention:
an extensive brand word mining device, comprising:
the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring at least one brand word;
the second acquisition unit is used for acquiring a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word;
and the mining unit is used for mining the brand words according to the specific search result click log and the natural search result click log and obtaining the generic brand words corresponding to the brand words.
Preferably, the first obtaining unit is specifically configured to:
crawling one or more brand words from a registered brand display website page through a web crawler; creating a list of brand words based on the one or more brand words; extracting the brand word from the list of brand words.
Preferably, the excavation unit is specifically configured to:
extracting a first specific search result click log which takes the brand word as a query word from the specific search result click log; extracting a first-level domain name corresponding to the brand word from the first specific search result click log; searching a click Uniform Resource Locator (URL) related to a first-level domain name corresponding to the brand word in the natural search result click log, and acquiring a query word corresponding to the click URL; and when the query word corresponding to the clicked URL meets a preset condition, determining the query word as a universal brand word corresponding to the brand word.
Preferably, the excavation unit is specifically configured to:
searching a first click URL in the natural search result click log, wherein a first-level domain name of the first click URL is the same as a first-level domain name corresponding to the brand word; and/or
And searching a second click URL in the natural search result click log, wherein the primary domain name of the second click URL is the same as the URL of the website homepage to which the primary domain name corresponding to the brand word belongs.
Preferably, the excavation unit is specifically configured to:
acquiring the click frequency of the query word, wherein the click frequency is used for representing the total number of times that all search results corresponding to the query word are clicked within a preset time period; judging whether the click frequency is greater than a preset frequency or not; and if so, determining the query word as a universal brand word corresponding to the brand word.
Preferably, the mining device for the generic terms further comprises:
and the setting unit is used for setting the universal brand word as a brand search keyword of the brand word so as to establish the association between the universal brand word and a specific search result corresponding to the brand word.
In a third aspect, based on the same inventive concept, the invention provides the following technical solutions through an embodiment of the invention:
a generic term mining device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor executes the program to perform the steps of:
acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
In a fourth aspect, based on the same inventive concept, the present invention provides the following technical solutions through an embodiment of the present invention:
a computer-readable storage medium, on which a computer program is stored which, when executed by a processor, carries out the steps of:
acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
One or more technical solutions provided in the embodiments of the present invention have at least the following technical effects or advantages:
in the embodiment of the invention, a method for mining generic brand words is disclosed, which comprises the following steps: acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words. Because a large amount of broad brand words corresponding to the brand words can be mined in advance, the rear end of the search engine can automatically set the broad brand words as the brand search keywords of the brand words, and therefore enough brand-related query flow can be effectively covered. Therefore, the technical problem that the related query flow for the brand is low in the existing search is solved, and the automatic mining of the universal brand words is realized, so that the related query flow of the brand is improved, and the brand search effect is improved.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.
FIG. 1 is a flowchart of a method for mining generic brand words according to an embodiment of the present invention;
FIG. 2 is a flowchart illustrating the step S103 of the mining method for generic brand words according to an embodiment of the present invention;
FIG. 3 is a block diagram of an apparatus for mining generic terms in accordance with an embodiment of the present invention;
FIG. 4 is a block diagram of a mining device for generic terms in accordance with an embodiment of the present invention;
fig. 5 is a structural diagram of a mining device for generic terms as a server according to an embodiment of the present invention.
Detailed Description
The embodiment of the invention provides the method and the device for mining the broad-band brand words, solves the technical problem of low related query flow for brands in the existing search, and realizes automatic mining of the broad-band brand words, thereby improving the related query flow of the brands and improving the brand search effect.
In order to solve the technical problems, the embodiment of the invention has the following general idea:
a method for mining generic brand words comprises the following steps: acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
In order to better understand the technical solution, the technical solution will be described in detail with reference to the drawings and the specific embodiments.
First, it is stated that the term "and/or" appearing herein is merely one type of associative relationship that describes an associated object, meaning that three types of relationships may exist, e.g., a and/or B may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" herein generally indicates that the former and latter related objects are in an "or" relationship.
To be explained, the brand search keyword may refer to: search keywords corresponding to brand terms; it can be understood that: when a user inputs a search keyword in a search engine, a search result corresponding to a certain brand word can be obtained, and the search keyword can be regarded as a brand search keyword corresponding to the brand word. For example, in search advertising marketing, brand search keywords may also be referred to as "advertising keywords," i.e., words, phrases, or sentences used by the provider (also typically the advertiser) to which the brand words correspond for advertising placement.
Again, the generic brand word may refer to: words, phrases, or sentences that are closely related to brand words; the general brand word is mostly used in the search advertising marketing, is a brand search keyword that can be used by advertisers for brand promotion, except for the brand word (the same: brand word) owned by the advertisers, and generally includes: a brand search keyword consisting of an advertising commodity name of a business, a commodity logo word, a business name, or a broader range of words associated therewith.
Example one
The embodiment provides a method for mining generic brand words, which is applied to a search engine, and as shown in fig. 1, the method includes:
step S101: at least one brand word is obtained.
As an alternative embodiment, when acquiring the brand word, the brand word list may be acquired first, and then any brand word may be extracted from the brand word list.
In a specific implementation process, a brand word list may be obtained, where the brand word list includes one or more brand words (also referred to as brand words), and the brand words are brand words that need to be subjected to broad-band brand word mining in the present invention, and the brand words are used to indicate brand information of a provider (e.g., a merchant) corresponding to a brand. For example: "Tianmao", "Jingdong", "Suning", etc. are common brands. In search advertisement marketing, the above-mentioned providers may also be referred to as advertisers.
As an alternative embodiment, when the brand word list is obtained, one or more brand words can be crawled from a page of a registered brand display website through a web crawler, and the brand word list is created based on the one or more brand words.
In particular implementations, on some registered brand display websites, many brand words are displayed, for example,http://www.10brandchina.com/in such a website, the brand words generally have a fixed pattern, and therefore, the brand words can be extracted from the fixed pattern of the registered brand display website pages by a web crawler, and a brand word list is created.
Wherein the fixed mode includes but is not limited to:
(1) and text in the < title > </title > tag in html source code of a website home page of a provider corresponding to the brand. For example: the "Tianmao TMALL" is available on the "Tianmao" website homepage.
(2) Brand presentation websites (e.g.:http://www.10brandchina.com/) Of html source code<divclass="box1"></div>The alt attribute value of the inner img label. For example, brand words such as "suning", "china mobile", "hel", etc. are included therein.
In a specific implementation process, background workers can manually collect the brand words and create a brand word list based on the collected brand words.
After the brand word list is obtained, each brand word in the brand word list can be extracted one by one, and the subsequent step 102 and step S103 are executed for each brand word to perform the general brand word mining.
Step S102: and acquiring a specific search result click log and a natural search result click log in a search engine.
The natural search result click log refers to: when a search is performed by a certain search word (namely, a query word), the obtained natural search result is clicked to view, and click log information is generated. Wherein the natural search result click log comprises: query (query term), is _ click (click or not), pos (query result presentation location), cli-URL (click URL).
The specific search result click log refers to: when searching with a certain search word (namely, a query word), if a specific search result related to certain brand words is included in the obtained search result, and the specific search result is clicked to view, click log information is generated. Wherein, the specific search result may be: search results preset by a third-party provider and related to brand words; examples may include: advertisement search results corresponding to certain brand words, search results corresponding to specified third-party providers, search results corresponding to specified websites and the like.
The particular search result click log may be an advertisement click log in a search engine, and the particular search result may include, but is not limited to, an advertisement search result that includes a brand word correspondence. Wherein the advertisement click log may include: query, account-ID, ad-click-url (ad promotion link, i.e., landing page for ads). The specific search result click log is stored in the background independently and belongs to two different logs from the natural search result click log.
In a specific implementation process, a specific search result click log and a natural search result click log generated by a search engine within a preset time period (for example, a last month or a last two months) can be obtained.
Step S103: and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
As an alternative embodiment, as shown in fig. 2, step S103 includes:
step S201: extracting click logs (namely, the first specific search result click logs) taking brand words as query words from the specific search result click logs;
step S202: extracting a first-level domain name corresponding to the brand word from the first specific search result click log;
step S203: searching a click URL (namely cli-URL) related to a first-level domain name corresponding to a brand word in a natural search result click log, and acquiring a query word (namely query) corresponding to the click URL;
step S204: and when the query word corresponding to the clicked URL meets a preset condition, determining the query word as a universal brand word corresponding to the brand word.
For example, taking a brand word as "tianmao" as an example, firstly, an advertisement click log with "tianmao" as a query word is extracted from an advertisement click log, secondly, a first-level domain name "tmal.com" corresponding to the brand word "tianmao" is extracted from the advertisement click log, thirdly, a click URL related to the first-level domain name "tmal.com" is searched in a natural search result click log, and a query word (for example, "tianmao shopping mall", "naobao", "tmal", "naughty", or "naughty", etc.) corresponding to the click URL is obtained, and finally, screening is performed, and a query word satisfying a preset condition in the query words is a generic brand word with the brand word "tianmao".
In a specific implementation process, in step S202, an advertisement promotion link (i.e., ad-click-url) corresponding to a brand word may be extracted from a first specific search result click log, and a primary domain name corresponding to the brand word may be extracted from the advertisement promotion link by using a regular expression; and/or; and extracting a first-level domain name corresponding to the brand word from the first specific search result click log by using a domain name resolution model.
Specifically, because the first-level domain name has a special grammar rule, usually a character string is added with a ". com" (for example, tmall. com, jd. com, suning. com, etc.), a regular expression meeting the grammar rule can be constructed, after the advertisement promotion link corresponding to the brand word is extracted from the first specific search result click log, the advertisement promotion link is filtered by using the regular expression, and the first-level domain name in the advertisement promotion link is obtained, and is the first-level domain name corresponding to the brand word. Or, a domain name resolution model may be trained in advance, and the domain name resolution model may analyze click log (e.g., advertisement click log) information of a specific search result and extract a first-level domain name corresponding to the brand word.
In a specific implementation process, in step S203, a first click URL may be searched in the natural search result click log, where a first-level domain name of the first click URL is the same as a first-level domain name corresponding to the brand word; and/or; and searching a second click URL in the click log of the natural search result, wherein the primary domain name of the second click URL is the same as the URL of the website homepage to which the primary domain name corresponding to the brand word belongs.
For example, taking the brand word "Tianmao" as an example, the click URL (i.e., the first click URL) with the primary domain name "Tianmao" (i.e., tmall. com) may be searched from the natural search result click log, or the click URL (i.e., the second click URL) with the primary domain name "Tianmao" (i.e., tmall. com) corresponding to the home page URL (i.e., www.tmall.com) of the site may be searched from the natural search result click log.
In a specific implementation process, in step S204, a click frequency of the query term may be obtained. The click frequency of the query term is used to indicate the total number of times that all search results corresponding to the query term are clicked within a preset time period (for example, the total number of times that all search results are clicked within 1 day, the total number of times that all search results are clicked within 1 week, and the like); judging whether the click frequency is greater than a preset frequency (for example, 50 times/day, or 100 times/day, etc.); and if so, determining the query word as a universal brand word corresponding to the brand word.
For example, taking the brand word "tianmao" as an example, after obtaining the query words "tianmao mall web", "naobao mall", "tmall", "naughty sale" and "naughty", if only the click frequency of "tianmao mall web", "naobao mall" and "tmall" is greater than the preset frequency, the "tianmao mall web", "naobao mall" and "tmall" are taken as the brand word "tianmao" of the brand word "tianmao".
As an alternative embodiment, after step S103, the method further includes: in a search engine, the generic brand word is set as a brand search keyword corresponding to the brand word, so that the association between the generic brand word and a specific search result (such as an advertisement link) corresponding to the brand word is established, and the purpose of advertising based on the generic brand word corresponding to the brand word is further achieved. In this way, the user may obtain search results (e.g., advertising links) when searching for a broad brand word.
For example, taking a brand word as "tianmao" as an example, after obtaining the pan brand words, such as "tianmao mall web", "naobao mall" and "tmall", the pan brand words can be all set as the brand search keywords of the brand word "tianmao" in the search engine, so that when the user searches the pan brand words, the search engine can display the search results corresponding to the brand word "tianmao" to the user, thereby improving the search traffic, bringing more click conversions to the user, and improving the effect of brand search.
The invention effectively utilizes the existing specific search result click logs (such as advertisement click logs) and natural search result click logs to realize the off-line digging out a large amount of universal brand words corresponding to the brand words in advance, so that the rear end of a search engine can automatically set the universal brand words as the brand search keywords of the brand words, thereby realizing the purpose of automatically putting the brand advertisements, improving the related query flow of the brands, improving the brand search effect and improving the advertisement effect.
The technical scheme in the embodiment of the invention at least has the following technical effects or advantages:
in the embodiment of the invention, a method for mining generic brand words is disclosed, which comprises the following steps: acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words. Because a large amount of broad brand words corresponding to the brand words can be mined in advance, the rear end of the search engine can automatically set the broad brand words as the brand search keywords of the brand words, and therefore enough brand-related query flow can be effectively covered. Therefore, the technical problem that the related query flow for the brand is low in the existing search is solved, and the automatic mining of the universal brand words is realized, so that the related query flow of the brand is improved, and the brand search effect is improved.
Example two
Based on the same inventive concept, the present embodiment provides an excavating device for generic brand words, as shown in fig. 3, including:
a first obtaining unit 301, configured to obtain at least one brand word;
a second obtaining unit 302, configured to obtain a specific search result click log and a natural search result click log in a search engine, where the specific search result is a search result related to a brand word preset by a third party provider;
and the mining unit 303 is configured to perform broad-brand word mining on the brand word based on the specific search result click log and the natural search result click log to obtain a broad-brand word corresponding to the brand word.
As an optional embodiment, the first obtaining unit 301 is specifically configured to:
crawling one or more brand words from a registered brand display website page through a web crawler; creating a list of brand words based on the one or more brand words; extracting the brand word from the list of brand words.
As an alternative embodiment, the digging unit 303 is specifically configured to:
extracting a first specific search result click log which takes the brand word as a query word from the specific search result click log; extracting a first-level domain name corresponding to the brand word from the first specific search result click log; searching a click Uniform Resource Locator (URL) related to a first-level domain name corresponding to the brand word in the natural search result click log, and acquiring a query word corresponding to the click URL; and when the query word corresponding to the clicked URL meets a preset condition, determining the query word as a universal brand word corresponding to the brand word.
As an alternative embodiment, the digging unit 303 is specifically configured to:
searching a first click URL in the natural search result click log, wherein a first-level domain name of the first click URL is the same as a first-level domain name corresponding to the brand word; and/or
And searching a second click URL in the natural search result click log, wherein the primary domain name of the second click URL is the same as the URL of the website homepage to which the primary domain name corresponding to the brand word belongs.
As an alternative embodiment, the digging unit 303 is specifically configured to:
acquiring the click frequency of the query word, wherein the click frequency is used for representing the total number of times that all search results corresponding to the query word are clicked within a preset time period; judging whether the click frequency is greater than a preset frequency or not; and if so, determining the query word as a universal brand word corresponding to the brand word.
As an optional embodiment, the mining device for generic terms further includes:
and the setting unit is used for setting the universal brand word as a brand search keyword of the brand word so as to establish the association between the universal brand word and a specific search result corresponding to the brand word.
Since the mining device for the generic terms introduced in this embodiment is a device used for implementing the mining method for the generic terms in the embodiment of the present invention, based on the mining method for the generic terms introduced in the embodiment of the present invention, those skilled in the art can understand the specific implementation manner and various variations of the mining device for the generic terms in this embodiment, and therefore, how to implement the method in the embodiment of the present invention by the mining device for the generic terms is not described in detail herein. The scope of the present invention is intended to be covered by the claims so long as those skilled in the art can implement the apparatus for the method for mining the generic terms in the embodiment of the present invention.
The technical scheme in the embodiment of the invention at least has the following technical effects or advantages:
in the embodiment of the invention, the invention discloses a universal brand word mining device, which comprises: the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring at least one brand word; the second acquisition unit is used for acquiring a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and the mining unit is used for mining the brand words according to the specific search result click log and the natural search result click log and obtaining the generic brand words corresponding to the brand words. Because a large amount of broad brand words corresponding to the brand words can be mined in advance, the rear end of the advertisement search engine can automatically set the broad brand words as the brand search keywords of the brand words, and therefore enough brand-related query flow can be effectively covered. Therefore, the technical problem that the related query flow for the brand is low in the existing search is solved, and the automatic mining of the universal brand words is realized, so that the related query flow of the brand is improved, and the brand search effect is improved.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
FIG. 4 is a block diagram illustrating a generalized brand word mining apparatus according to one exemplary embodiment. For example, the apparatus 800 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and the like.
Referring to fig. 4, the apparatus 800 may include one or more of the following components: processing component 802, memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, and communication component 816.
The processing component 802 generally controls overall operation of the device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing elements 802 may include one or more processors 820 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 802 can include one or more modules that facilitate interaction between the processing component 802 and other components. For example, the processing component 802 can include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.
The memory 804 is configured to store various types of data to support operation at the device 800. Examples of such data include instructions for any application or method operating on device 800, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 804 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
Power component 806 provides power to the various components of device 800. The power components 806 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 800.
The multimedia component 808 includes a screen that provides an output interface between the device 800 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 808 includes a front facing camera and/or a rear facing camera. The front-facing camera and/or the rear-facing camera may receive external multimedia data when the device 800 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a Microphone (MIC) configured to receive external audio signals when the apparatus 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 804 or transmitted via the communication component 816. In some embodiments, audio component 810 also includes a speaker for outputting audio signals.
The I/O interface 812 provides an interface between the processing component 802 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
The sensor assembly 814 includes one or more sensors for providing various aspects of state assessment for the device 800. For example, the sensor assembly 814 may detect the open/closed state of the device 800, the relative positioning of the components, such as a display and keypad of the apparatus 800, the sensor assembly 814 may also detect a change in position of the apparatus 800 or a component of the apparatus 800, the presence or absence of user contact with the apparatus 800, orientation or acceleration/deceleration of the apparatus 800, and a change in temperature of the apparatus 800. Sensor assembly 814 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 816 is configured to facilitate communications between the apparatus 800 and other devices in a wired or wireless manner. The device 800 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communications component 816 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 800 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.
In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions, such as the memory 804 comprising instructions, executable by the processor 820 of the device 800 to perform the above-described method is also provided. For example, the non-transitory computer readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
A non-transitory computer readable storage medium having instructions therein which, when executed by a processor of an apparatus 800, enable the apparatus 800 to perform a method of mining of overt brand words, the method comprising: acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
Fig. 5 is a structural diagram of a mining device for generic terms as a server according to an embodiment of the present invention. The server 1900 may vary widely by configuration or performance and may include one or more Central Processing Units (CPUs) 1922 (e.g., one or more processors) and memory 1932, one or more storage media 1930 (e.g., one or more mass storage devices) storing applications 1942 or data 1944. Memory 1932 and storage medium 1930 can be, among other things, transient or persistent storage. The program stored in the storage medium 1930 may include one or more modules (not shown), each of which may include a series of instructions operating on a server. Still further, a central processor 1922 may be provided in communication with the storage medium 1930 to execute a series of instruction operations in the storage medium 1930 on the server 1900.
The server 1900 may also include one or more power supplies 1926, one or more wired or wireless network interfaces 1950, one or more input-output interfaces 1958, one or more keyboards 1956, and/or one or more operating systems 1941, such as Windows Server, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This invention is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
It will be understood that the invention is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present invention is defined only by the appended claims, which are not intended to limit the present invention, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. A method for mining generic brand words, comprising:
acquiring at least one brand word;
obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word;
and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
2. The method of mining generic brand words as defined in claim 1, wherein said obtaining at least one brand word comprises:
crawling one or more brand words from a registered brand display website page through a web crawler;
creating a list of brand words based on the one or more brand words;
extracting the brand word from the list of brand words.
3. The method for mining generic brand words according to claim 1, wherein the generic brand word mining is performed on the brand words based on the specific search result click log and the natural search result click log to obtain generic brand words corresponding to the brand words, and the method comprises:
extracting a first specific search result click log which takes the brand word as a query word from the specific search result click log;
extracting a first-level domain name corresponding to the brand word from the first specific search result click log;
searching a click Uniform Resource Locator (URL) related to a first-level domain name corresponding to the brand word in the natural search result click log, and acquiring a query word corresponding to the click URL;
and when the query word corresponding to the clicked URL meets a preset condition, determining the query word as a universal brand word corresponding to the brand word.
4. The method of mining generic brand words as claimed in claim 3, wherein said searching said natural search result click log for click Uniform Resource Locators (URLs) associated with a primary domain name corresponding to said brand word comprises:
searching a first click URL in the natural search result click log, wherein a first-level domain name of the first click URL is the same as a first-level domain name corresponding to the brand word; and/or
And searching a second click URL in the natural search result click log, wherein the primary domain name of the second click URL is the same as the URL of the website homepage to which the primary domain name corresponding to the brand word belongs.
5. The method for mining the pan-brand word according to claim 3, wherein when the query word corresponding to the clicked URL meets a preset condition, determining the query word as the pan-brand word corresponding to the brand word comprises:
acquiring the click frequency of the query word, wherein the click frequency is used for representing the total number of times that all search results corresponding to the query word are clicked within a preset time period;
judging whether the click frequency is greater than a preset frequency or not;
and if so, determining the query word as a universal brand word corresponding to the brand word.
6. The method for mining the pan-brand word according to any one of claims 1 to 5, wherein after obtaining the pan-brand word corresponding to the brand word, the method further comprises:
setting the generic brand word as a brand search keyword of the brand word so as to establish an association between the generic brand word and a specific search result corresponding to the brand word.
7. An apparatus for mining generic terms, comprising:
the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring at least one brand word;
the second acquisition unit is used for acquiring a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word;
and the mining unit is used for mining the brand words according to the specific search result click log and the natural search result click log and obtaining the generic brand words corresponding to the brand words.
8. The mining device of generic terms as defined in claim 7, wherein the first obtaining unit is specifically configured to:
crawling one or more brand words from a registered brand display website page through a web crawler; creating a list of brand words based on the one or more brand words; extracting the brand word from the list of brand words.
9. A generic term mining device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor executes the program to perform the steps of:
acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
10. A computer-readable storage medium, on which a computer program is stored, which program, when executed by a processor, carries out the steps of:
acquiring at least one brand word; obtaining a specific search result click log and a natural search result click log in a search engine, wherein the specific search result is a search result which is preset by a third-party provider and is related to a brand word; and performing broad-brand word mining on the brand words based on the specific search result click log and the natural search result click log to obtain broad-brand words corresponding to the brand words.
CN201811043835.XA 2018-09-07 2018-09-07 Method and device for mining generic brand words Pending CN110889050A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811043835.XA CN110889050A (en) 2018-09-07 2018-09-07 Method and device for mining generic brand words

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811043835.XA CN110889050A (en) 2018-09-07 2018-09-07 Method and device for mining generic brand words

Publications (1)

Publication Number Publication Date
CN110889050A true CN110889050A (en) 2020-03-17

Family

ID=69744600

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811043835.XA Pending CN110889050A (en) 2018-09-07 2018-09-07 Method and device for mining generic brand words

Country Status (1)

Country Link
CN (1) CN110889050A (en)

Citations (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1985255A (en) * 2004-07-16 2007-06-20 郑月信 Target advertising method and system using secondary keywords having relation to first internet searching keywords, and method and system for providing a list of the secondary keywords
CN101038596A (en) * 2007-04-29 2007-09-19 北京搜狗科技发展有限公司 Method and system for classifying website
CN101241512A (en) * 2008-03-10 2008-08-13 北京搜狗科技发展有限公司 Search method for redefining enquiry word and device therefor
US20090192983A1 (en) * 2008-01-28 2009-07-30 Yahoo! Inc. Method and system for mining, ranking and visualizing lexically similar search queries for advertisers
CN101551806A (en) * 2008-04-03 2009-10-07 北京搜狗科技发展有限公司 Personalized website navigation method and system
CN101576916A (en) * 2009-06-18 2009-11-11 清华大学 Method and device for obtaining synonyms
KR20090119532A (en) * 2008-05-16 2009-11-19 엔에이치엔비즈니스플랫폼 주식회사 Method and system for recommending advertisement keyword by analyzing log
KR20100068964A (en) * 2008-12-15 2010-06-24 한국전자통신연구원 Apparatus for recommending related query and method thereof
CN102254039A (en) * 2011-08-11 2011-11-23 武汉安问科技发展有限责任公司 Searching engine-based network searching method
CN102609539A (en) * 2012-02-16 2012-07-25 北京搜狗信息服务有限公司 Search method and search system
CN103020067A (en) * 2011-09-21 2013-04-03 北京百度网讯科技有限公司 Method and device for determining webpage type
CN103106189A (en) * 2011-11-11 2013-05-15 北京百度网讯科技有限公司 Method and device for excavating synonymous attribute words
CN103136212A (en) * 2011-11-23 2013-06-05 北京百度网讯科技有限公司 Mining method of class new words and device
CN103744856A (en) * 2013-12-03 2014-04-23 北京奇虎科技有限公司 Method, device and system for linkage extended search
CN103873601A (en) * 2012-12-11 2014-06-18 百度在线网络技术(北京)有限公司 Addressing class query word mining method and system
CN103870573A (en) * 2014-03-18 2014-06-18 北京奇虎科技有限公司 Method and device for website analysis
CN103885947A (en) * 2012-12-19 2014-06-25 北京百度网讯科技有限公司 Mining method for searching demands, intelligent searching method and device thereof
CN104199833A (en) * 2014-08-01 2014-12-10 北京奇虎科技有限公司 Network search term clustering method and device
CN104715064A (en) * 2015-03-31 2015-06-17 北京奇虎科技有限公司 Method and server for marking keywords on webpage
CN105243083A (en) * 2015-09-08 2016-01-13 百度在线网络技术(北京)有限公司 Document topic mining method and apparatus
CN105243149A (en) * 2015-10-26 2016-01-13 深圳市智搜信息技术有限公司 Semantic-based query recommendation method and system
CN105446984A (en) * 2014-06-30 2016-03-30 阿里巴巴集团控股有限公司 Expansion word pair screening method and device
CN106021418A (en) * 2016-05-13 2016-10-12 北京奇虎科技有限公司 News event clustering method and device
CN106528569A (en) * 2015-09-11 2017-03-22 北京国双科技有限公司 Method and device for calculating validity of site search
CN106599297A (en) * 2016-12-28 2017-04-26 北京百度网讯科技有限公司 Method and device for searching question-type search terms on basis of deep questions and answers
CN106611022A (en) * 2015-10-27 2017-05-03 北京国双科技有限公司 Method and device for increasing website search efficiency
CN106610972A (en) * 2015-10-21 2017-05-03 阿里巴巴集团控股有限公司 Query rewriting method and apparatus
CN106611029A (en) * 2015-10-27 2017-05-03 北京国双科技有限公司 Method and device for improving site search efficiency in website
CN107193987A (en) * 2017-05-27 2017-09-22 广东神马搜索科技有限公司 Obtain the methods, devices and systems of the search term related to the page
CN107463600A (en) * 2017-06-12 2017-12-12 百度在线网络技术(北京)有限公司 Advertisement putting keyword recommendation method and device, advertisement placement method and device
CN107679119A (en) * 2017-09-19 2018-02-09 北京京东尚科信息技术有限公司 The method and apparatus for generating brand derivative words
CN107885875A (en) * 2017-11-28 2018-04-06 北京百度网讯科技有限公司 Synonymous transform method, device and the server of term
CN107958078A (en) * 2017-12-13 2018-04-24 北京百度网讯科技有限公司 Information generating method and device
CN108319376A (en) * 2017-12-29 2018-07-24 北京奇虎科技有限公司 A kind of input association recommendation method and device that optimization business word is promoted

Patent Citations (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1985255A (en) * 2004-07-16 2007-06-20 郑月信 Target advertising method and system using secondary keywords having relation to first internet searching keywords, and method and system for providing a list of the secondary keywords
CN101038596A (en) * 2007-04-29 2007-09-19 北京搜狗科技发展有限公司 Method and system for classifying website
US20090192983A1 (en) * 2008-01-28 2009-07-30 Yahoo! Inc. Method and system for mining, ranking and visualizing lexically similar search queries for advertisers
CN101241512A (en) * 2008-03-10 2008-08-13 北京搜狗科技发展有限公司 Search method for redefining enquiry word and device therefor
CN101551806A (en) * 2008-04-03 2009-10-07 北京搜狗科技发展有限公司 Personalized website navigation method and system
KR20090119532A (en) * 2008-05-16 2009-11-19 엔에이치엔비즈니스플랫폼 주식회사 Method and system for recommending advertisement keyword by analyzing log
KR20100068964A (en) * 2008-12-15 2010-06-24 한국전자통신연구원 Apparatus for recommending related query and method thereof
CN101576916A (en) * 2009-06-18 2009-11-11 清华大学 Method and device for obtaining synonyms
CN102254039A (en) * 2011-08-11 2011-11-23 武汉安问科技发展有限责任公司 Searching engine-based network searching method
CN103020067A (en) * 2011-09-21 2013-04-03 北京百度网讯科技有限公司 Method and device for determining webpage type
CN103106189A (en) * 2011-11-11 2013-05-15 北京百度网讯科技有限公司 Method and device for excavating synonymous attribute words
CN103136212A (en) * 2011-11-23 2013-06-05 北京百度网讯科技有限公司 Mining method of class new words and device
CN102609539A (en) * 2012-02-16 2012-07-25 北京搜狗信息服务有限公司 Search method and search system
CN103873601A (en) * 2012-12-11 2014-06-18 百度在线网络技术(北京)有限公司 Addressing class query word mining method and system
CN103885947A (en) * 2012-12-19 2014-06-25 北京百度网讯科技有限公司 Mining method for searching demands, intelligent searching method and device thereof
CN103744856A (en) * 2013-12-03 2014-04-23 北京奇虎科技有限公司 Method, device and system for linkage extended search
CN103870573A (en) * 2014-03-18 2014-06-18 北京奇虎科技有限公司 Method and device for website analysis
CN105446984A (en) * 2014-06-30 2016-03-30 阿里巴巴集团控股有限公司 Expansion word pair screening method and device
CN104199833A (en) * 2014-08-01 2014-12-10 北京奇虎科技有限公司 Network search term clustering method and device
CN104715064A (en) * 2015-03-31 2015-06-17 北京奇虎科技有限公司 Method and server for marking keywords on webpage
CN105243083A (en) * 2015-09-08 2016-01-13 百度在线网络技术(北京)有限公司 Document topic mining method and apparatus
CN106528569A (en) * 2015-09-11 2017-03-22 北京国双科技有限公司 Method and device for calculating validity of site search
CN106610972A (en) * 2015-10-21 2017-05-03 阿里巴巴集团控股有限公司 Query rewriting method and apparatus
CN105243149A (en) * 2015-10-26 2016-01-13 深圳市智搜信息技术有限公司 Semantic-based query recommendation method and system
CN106611022A (en) * 2015-10-27 2017-05-03 北京国双科技有限公司 Method and device for increasing website search efficiency
CN106611029A (en) * 2015-10-27 2017-05-03 北京国双科技有限公司 Method and device for improving site search efficiency in website
CN106021418A (en) * 2016-05-13 2016-10-12 北京奇虎科技有限公司 News event clustering method and device
CN106599297A (en) * 2016-12-28 2017-04-26 北京百度网讯科技有限公司 Method and device for searching question-type search terms on basis of deep questions and answers
CN107193987A (en) * 2017-05-27 2017-09-22 广东神马搜索科技有限公司 Obtain the methods, devices and systems of the search term related to the page
CN107463600A (en) * 2017-06-12 2017-12-12 百度在线网络技术(北京)有限公司 Advertisement putting keyword recommendation method and device, advertisement placement method and device
CN107679119A (en) * 2017-09-19 2018-02-09 北京京东尚科信息技术有限公司 The method and apparatus for generating brand derivative words
CN107885875A (en) * 2017-11-28 2018-04-06 北京百度网讯科技有限公司 Synonymous transform method, device and the server of term
CN107958078A (en) * 2017-12-13 2018-04-24 北京百度网讯科技有限公司 Information generating method and device
CN108319376A (en) * 2017-12-29 2018-07-24 北京奇虎科技有限公司 A kind of input association recommendation method and device that optimization business word is promoted

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
欧阳柳波,等: "一种基于本体和用户日志的查询扩展方法", 《计算机工程及应用》, 3 May 2013 (2013-05-03), pages 151 - 155 *

Similar Documents

Publication Publication Date Title
KR101769058B1 (en) Hashtags and content presentation
US20220321521A1 (en) Computerized system and method for determining and displaying message content in a user&#39;s inbox
CN109614482B (en) Label processing method and device, electronic equipment and storage medium
CN110020148B (en) Information recommendation method and device and information recommendation device
US20160034977A1 (en) System and method for embedded search within messaging applications
US20160103758A1 (en) Online product testing using bucket tests
CN109308334B (en) Information recommendation method and device and search engine system
WO2017181663A1 (en) Method and device for matching image to search information
CN110598098A (en) Information recommendation method and device and information recommendation device
US20210173875A1 (en) Computerized system and method for extracting entity information from text communications and displaying content based therefrom
US20160027044A1 (en) Presenting information cards for events associated with entities
CN107515869B (en) Searching method and device and searching device
US20230328028A1 (en) Method and apparatus for providing information
US20160117727A1 (en) Adaptive retargeting
CN107515870B (en) Searching method and device and searching device
US10560408B2 (en) Computerized system and method for selectively communicating HTML content to a user&#39;s inbox as a native message
CN112784142A (en) Information recommendation method and device
CN110633391B (en) Information searching method and device
CN111127053B (en) Page content recommendation method and device and electronic equipment
CN110110046B (en) Method and device for recommending entities with same name
CN110020082B (en) Searching method and device
CN110110078B (en) Data processing method and device for data processing
WO2016085585A1 (en) Presenting information cards for events associated with entities
CN110889050A (en) Method and device for mining generic brand words
CN110020206B (en) Search result ordering method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination