CN109214864A - A kind of advertisement recognition method and device, electronic equipment - Google Patents
A kind of advertisement recognition method and device, electronic equipment Download PDFInfo
- Publication number
- CN109214864A CN109214864A CN201810980581.8A CN201810980581A CN109214864A CN 109214864 A CN109214864 A CN 109214864A CN 201810980581 A CN201810980581 A CN 201810980581A CN 109214864 A CN109214864 A CN 109214864A
- Authority
- CN
- China
- Prior art keywords
- attribute value
- advertisement
- visibility
- web page
- identification device
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
Abstract
The invention discloses a kind of advertisement recognition method and devices, electronic equipment, comprising the following steps: obtains and the newly-increased advertisement formwork Information Embedding that sends network server is to advertisement formwork feature database;Grab the web page element in target webpage in browser window predetermined position;The attribute value in the web page element is extracted, the attribute value includes the first visibility attribute value and first position attribute value;It is found out respectively in the advertisement formwork feature database prestored and the first visibility attribute value, the matched second visibility attribute value of first position attribute value and second position attribute value;The first visibility attribute value, first position attribute value and the second visibility attribute value, second position attribute value are compared respectively, when comparing out the first visibility attribute value and identical, the described first position attribute value of second visibility attribute value identical with the second position attribute value simultaneously, judge the web page element for advertisement, and then improve Experience Degree.
Description
Technical field
The invention belongs to internet detection technique field more particularly to a kind of advertisement recognition methods and device, electronic equipment.
Background technique
With the continuous development of modern science and technology, miscellaneous waste advertisements carry out in webpage extensively by Internet technology
General propagation.Currently, most users, which mainly pass through search related web page, obtains effective information, due to the advertisement being inserted into webpage,
It makes troubles to user's browsing, the highly desirable advertisement search that can skip is to required web page contents.In addition, for screen size
It is unnecessary that lesser electronic equipment, the online experience of meeting greatly influence user, and a large amount of advertisement can also be caused to user
Flow waste.
What is proposed in the prior art carries out knowing method for distinguishing that often data amount of analysis is big, time-consuming more and a large amount of to web advertisement
The memory source for consuming processor, causes Experience Degree of the user when browsing webpage bad.
Summary of the invention
In view of this, the embodiment of the present invention is designed to provide a kind of advertisement recognition method and device, electronic equipment, purport
It is occupied in the memory source for reducing data amount of analysis and processor in web advertisement identification process.
The technical solution adopted by the invention is as follows:
In a first aspect, a kind of advertisement recognition method provided in an embodiment of the present invention, is applied to advertisement identification device, it is described wide
It accuses identification device and one and is stored with the network server communication connection of newest advertisement formwork information, the advertisement recognition method includes
Following steps:
The advertisement identification device sends the request of an advertisement formwork newly-added information to the network server, so that described
Network server sends according to the advertisement formwork information prestored in the advertisement identification device to the advertisement identification device newly-increased
Advertisement formwork information;
By the newly-increased advertisement formwork Information Embedding into the advertisement formwork feature database, with the real-time update advertisement mould
Plate features library;
Grab the web page element in target webpage in browser window predetermined position;
The web page element is analyzed, and extracts the attribute value in the web page element, the attribute value includes the first visibility
Attribute value and first position attribute value;
It is found out in the advertisement formwork feature database prestored and matched second visibility of the first visibility attribute value
Attribute value and with the matched second position attribute value of the first position attribute value;
The first visibility attribute value and the second visibility attribute value are compared, the first position attribute
Value and the second position attribute value are compared;
When simultaneously compare out the first visibility attribute value and second visibility attribute value it is identical, described first
Set attribute value it is identical with the second position attribute value when, judge the web page element for advertisement.
Further, described that the first visibility attribute value and the second visibility attribute value are compared, institute
State first position attribute value and the step of the second position attribute value is compared after, which comprises
When comparing out the first visibility attribute value and the second visibility attribute value, the first position attribute value
With the second position attribute value there are it is different when, judge the non-advertisement of the web page element.
Further, the network server is according to the advertisement formwork information prestored in the advertisement identification device to described
Advertisement identification device sends the step of newly-increased advertisement formwork information and includes:
The network server parses the request of the advertisement formwork newly-added information received, obtains in the request
Device identification number;
The advertisement formwork information stored in the advertisement identification device is searched and analyzed according to the device identification number, is obtained
Newly-increased advertisement formwork information simultaneously sends it to the advertisement identification device.
Further, the predeterminated position includes the phase between the web page element and each frame of the browser window
It adjusts the distance.
Second aspect, a kind of advertisement identification device provided in an embodiment of the present invention are stored with newest advertisement formwork with one and believe
The network server of breath communicates to connect, and the advertisement identification device includes:
Sending module, for sending the request of an advertisement formwork newly-added information to the network server, so that the net
Network server increases newly according to the advertisement formwork information prestored in the advertisement identification device to advertisement identification device transmission
Advertisement formwork information;
Implant module, for by the newly-increased advertisement formwork Information Embedding into the advertisement formwork feature database, with reality
The Shi Gengxin advertisement formwork feature database;
Handling module, for grabbing in target webpage in the web page element of browser window predetermined position;
Extraction module is analyzed, for analyzing the web page element, and extracts the attribute value in the web page element, the attribute
Value includes the first visibility attribute value and first position attribute value;
Searching module is matched for finding out in the advertisement formwork feature database prestored with the first visibility attribute value
The second visibility attribute value and with the matched second position attribute value of the first position attribute value;
Comparison module, for the first visibility attribute value and the second visibility attribute value to be compared, institute
It states first position attribute value and the second position attribute value is compared;
Judgment module, for the first visibility attribute value and the second visibility attribute value phase ought to be compared out simultaneously
When with, the first position attribute value and the identical second position attribute value, judge the web page element for advertisement.
Further, the judgment module, being also used to that the first visibility attribute value and described second ought be compared out can
Opinion property attribute value, the first position attribute value and the second position attribute value there are it is different when, judge the web page element
Non- advertisement.
Further, the network server includes:
Parsing module is parsed for the request to the advertisement formwork newly-added information received, is obtained in the request
Device identification number;
Sending module is searched, is stored in the advertisement identification device for being searched and analyzing according to the device identification number
Advertisement formwork information obtains newly-increased advertisement formwork information and sends it to the advertisement identification device.
Further, the predeterminated position includes the phase between the web page element and each frame of the browser window
It adjusts the distance.
The third aspect, a kind of electronic equipment provided in an embodiment of the present invention are non-volatile including can be performed with processor
Program code computer-readable medium, said program code makes the processor execute above method.
In conclusion the present invention passes through the newly-increased advertisement formwork Information Embedding that sends network server to advertisement first
In template characteristic library, the predeterminated position in target webpage in browser window is then determined, extract the webpage member of predetermined position
The attribute value of element, and the visibility attribute value and position attribution value that are related to are carried out respectively in the advertisement formwork feature database prestored
It compares, judges whether web page element is advertisement according to comparison result, to effectively be identified to web advertisement, while it is counted
It is small according to amount of analysis, EMS memory occupation resource that is time-consuming short and reducing processor.
Detailed description of the invention
It to describe the technical solutions in the embodiments of the present invention more clearly, below will be to needed in the embodiment
Attached drawing is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as
Restriction to range for those of ordinary skill in the art without creative efforts, can also basis
These attached drawings obtain other relevant attached drawings.
Fig. 1 shows the block diagram of a kind of electronic equipment provided in an embodiment of the present invention.
Fig. 2 shows a kind of flow diagrams of advertisement recognition method provided in an embodiment of the present invention.
Fig. 3 shows a kind of the functional block diagram of advertisement identification device provided in an embodiment of the present invention.
Fig. 4 shows a kind of the functional block diagram of network server provided in an embodiment of the present invention.
Main element symbol description:
Electronic equipment 000;Network server 010;Advertisement identification device 100;Memory 200;
Storage control 300;Processor 400;Peripheral Interface 500;Input-output unit 600;
Audio unit 700;Display unit 800;Sending module 101;Implant module 102;
Handling module 103;Analyze extraction module 104;Searching module 105;Comparison module 106;
Judgment module 107;Parsing module 011;Search sending module 012.
Specific embodiment
Below in conjunction with attached drawing in the embodiment of the present invention, technical solution in the embodiment of the present invention carries out clear, complete
Ground description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Usually exist
The component of the embodiment of the present invention described and illustrated in attached drawing can be arranged and be designed with a variety of different configurations herein.Cause
This, is not intended to limit claimed invention to the detailed description of the embodiment of the present invention provided in the accompanying drawings below
Range, but it is merely representative of selected embodiment of the invention.Based on the embodiment of the present invention, those skilled in the art are not doing
Every other embodiment obtained under the premise of creative work out, shall fall within the protection scope of the present invention.
As shown in Figure 1, being the block diagram of a kind of electronic equipment 000 provided in an embodiment of the present invention.The electronics is set
Standby 000 can be PC (personal computer, PC), tablet computer, smart phone, personal digital assistant
(personal digital assistant, PDA) etc..The electronic equipment 000 may include advertisement identification device 100, deposit
Reservoir 200, storage control 300, processor 400, Peripheral Interface 500, input-output unit 600, audio unit 700 and display
Unit 800.
Wherein, the memory 200, storage control 300, processor 400, Peripheral Interface 500, input-output unit
600, audio unit 700, each element of display unit 800 are directly or indirectly electrically connected between each other, to realize the biography of data
Defeated or interaction.It is electrically connected for example, these elements can be realized between each other by one or more communication bus or signal wire.Institute
Stating electronic equipment 000 includes that at least one can be stored in the memory 200 in the form of software or firmware (firmware)
Operating system (operating system, OS) in software function module.The processor 400 is for executing memory
The executable module stored in 200, such as software function module or computer program that the advertisement identification device 100 includes.
As shown in Fig. 2, being a kind of flow diagram of advertisement recognition method provided in an embodiment of the present invention.The present embodiment
In, the advertisement recognition method is applied to advertisement identification device 100, the advertisement identification device 100 with one be stored with it is newest extensively
The network server 010 of slide former information communicates to connect, i.e., the described advertisement identification device 100 can be with the network server 010
Data access is carried out each other, and the advertisement recognition method may comprise steps of:
S101: the advertisement identification device 100 sends asking for an advertisement formwork newly-added information to the network server 010
It asks.
Before advertisement identification device 100 is loaded into web data, the request of transmission advertisement formwork newly-added information is to net first
Network server 010, in this way, making the network server 010 according to the advertisement formwork prestored in the advertisement identification device 100
Information sends newly-increased advertisement formwork information to the advertisement identification device 100.
Specifically, the network server 010 parses the request of the advertisement formwork newly-added information received, and obtains
Take the device identification number in the request.Corresponding advertisement identification device 100 is found according to the device identification number, and to wide
It accuses the advertisement formwork information stored in identification device 100 to be analyzed, by being compared with newest advertisement formwork information, obtain
The advertisement identification device 100 is sent to newly-increased advertisement formwork information and by the newly-increased advertisement formwork information.
S102:, should with real-time update by the newly-increased advertisement formwork Information Embedding into the advertisement formwork feature database
Advertisement formwork feature database.
The data of advertisement formwork feature database in the present embodiment constantly carry out real-time update, can constantly adapt to advertisement form
Variability, and then effectively judge the corresponding advertisement of newest web page element.
S103: in the web page element of browser window predetermined position in crawl target webpage.
Preferably, after the target webpage accessed needed for user is inputted using the electronic equipment 000, browser takes to network
Business device 010 initiates access request, and receives the html file of the network server 010 return.The browser should by being loaded into
Html file realizes the load to webpage.Wherein, the load of webpage includes the assembling to web page element each in webpage, such as
Text, picture, Flash animation etc..
In addition, the predeterminated position may include the phase between the web page element and each frame of the browser window
It adjusts the distance.When user by browser window come browsing objective webpage when, the predeterminated position may is that the net in target webpage
The distance between left part and the left part frame of browser window of page element, the top of web page element and the top of browser window
The distance between frame.
It is worth noting that, the advertisement identification device 100 is in determining target webpage first in the pre- of browser window
If position, then the web page element of predetermined position is grabbed.Rather than to all web page elements in entire web data
Attribute value obtained, therefore reduce crawl web data range, that is, be limited to the predetermined position of browser window, should
Predeterminated position occupies a part of browser window, to largely reduce the range of data analysis, committed memory is few and divides
It is fast to analyse speed.In addition, being that first determining predeterminated position determines pre- in the web page element of crawl predetermined position in the present embodiment
If the attribute value of web page element is not taken when position.
S104: analyzing the web page element, and extracts the attribute value in the web page element, and the attribute value can including first
Opinion property attribute value and first position attribute value.
It should be noted that the code in the html file that the browser is loaded into can form corresponding DOM
(Document Object Model, document object model) structure, i.e. DOM tree, each node table in DOM tree
It is now the text items in a HTML markup or HTML markup.Therefore, target can be extracted by the analysis to DOM tree
The attribute value of any web page element on webpage.In the present embodiment, the attribute value includes the first visibility attribute value and first
Position attribution value.
S105: finding out in the advertisement formwork feature database prestored can with the first visibility attribute value matched second
Opinion property attribute value and with the matched second position attribute value of the first position attribute value.
Wherein, in CSS (Cascading Style Sheets, Cascading Style Sheet) language, " visibility " i.e.
" display ", the attribute value of display may include " none ", " block " and " inline " type.Wherein, work as display
Attribute value when being none, show corresponding web page element on webpage by " hiding ";When the attribute value of display is block
When, show that corresponding web page element is shown as " block grade " on webpage, and account for a line on webpage;When the attribute of display
When value is inline, show that corresponding web page element is shown as " row grade " on webpage, but be not take up a line.
Similarly, " position " i.e. in CSS (Cascading Style Sheets, Cascading Style Sheet) language
" position ", the attribute value of position may include " static ", " relative ", " absolute " and " fixed " class
Type.Wherein, when the attribute value of position is static, show corresponding web page element by " static immobilization " on webpage
It is positioned, i.e., carries out the localization process of web page element using default value;When the attribute value of position is relative, table
Bright corresponding web page element is positioned on webpage by " relative positioning ", i.e., when web page element is by relative to static immobilization
Position is adjusted;When the attribute value of position is absolute, show that corresponding web page element passes through " absolute fix "
It is positioned on webpage, so that web page element will be adjusted according to the position of the element comprising it;When the category of position
Property value when being fixed, show that corresponding web page element is positioned on webpage by " stationary positioned ", so that web page element quilt
The fixed position of one be arranged on browser window.
Fig. 2 is further regarded to, compares out the first visibility attribute value and the second visibility attribute value when simultaneously
When identical, described first position attribute value is identical with the second position attribute value, step S106 is executed, that is, judges the webpage
Element is advertisement.When compare out the first visibility attribute value and the second visibility attribute value, the first position belongs to
Property value and the second position attribute value there are it is different when, such as the first visibility attribute value and the second visibility category
Property value it is identical, when the first position attribute value and the second position attribute value difference, execute step S107, i.e., described in judgement
Web page element is non-advertisement.By above step, the advertisement in webpage can effectively be identified, accurately judge the webpage
Whether element is advertisement.
As shown in figure 3, being a kind of the functional block diagram of advertisement identification device 100 provided in an embodiment of the present invention, institute
The network server 010 that newest advertisement formwork information can be stored with one by stating advertisement identification device 100 communicates to connect.Wherein, institute
State advertisement identification device 100 mainly include sending module 101, implant module 102, handling module 103, analyze extraction module 104,
Searching module 105, comparison module 106 and judgment module 107.It will elaborate below to above functions module.
The sending module 101, for sending the request of an advertisement formwork newly-added information to the network server 010,
So that the network server 010 is known according to the advertisement formwork information prestored in the advertisement identification device 100 to the advertisement
Other device 100 sends newly-increased advertisement formwork information.
In the present embodiment, before advertisement identification device 100 is loaded into web data, the sending module 101 is sent first
The request of advertisement formwork newly-added information is to network server 010.In this way, making the network server 010 according to the advertisement
The advertisement formwork information prestored in identification device 100 sends newly-increased advertisement formwork information.
As shown in figure 4, the network server 010 may include parsing module 011 and lookup sending module in the present embodiment
012.Wherein:
The parsing module 011, parses for the request to the advertisement formwork newly-added information received, described in acquisition
Device identification number in request.
The lookup sending module 012, for the advertisement identification device to be searched and analyzed according to the device identification number
The advertisement formwork information stored in 100 obtains newly-increased advertisement formwork information and sends it to the advertisement identification device
100。
Preferably, the parsing module 011 parses the request of the advertisement formwork newly-added information received, and obtains
Device identification number in the request.
The lookup sending module 012 finds corresponding advertisement identification device 100 according to the device identification number, and right
The advertisement formwork information stored in advertisement identification device 100 is analyzed, by being compared with newest advertisement formwork information,
Obtain newly-increased advertisement formwork information.And the newly-increased advertisement formwork information is sent to the advertisement identification device 100.
Further regard to Fig. 3, the implant module 102, for by the newly-increased advertisement formwork Information Embedding to described
In advertisement formwork feature database, with the real-time update advertisement formwork feature database.
The handling module 103, for grabbing in target webpage in the web page element of browser window predetermined position.
Preferably, after the target webpage accessed needed for user is inputted using the electronic equipment 000, browser takes to network
Business device 010 initiates access request, and receives the html file of the network server 010 return.The browser should by being loaded into
Html file realizes the load to webpage.Wherein, the load of webpage includes the assembling to web page element each in webpage, such as
Text, picture, Flash animation etc..
In addition, the predeterminated position may include the phase between the web page element and each frame of the browser window
It adjusts the distance.When user by browser window come browsing objective webpage when, the predeterminated position may is that the net in target webpage
The distance between left part and the left part frame of browser window of page element, the top of web page element and the top of browser window
The distance between frame.
It is worth noting that, the advertisement identification device 100 is in determining target webpage first in the pre- of browser window
If position, then the web page element of predetermined position is grabbed.Rather than to all web page elements in entire web data
Attribute value obtained, therefore reduce crawl web data range, that is, be limited to the predetermined position of browser window, should
Predeterminated position occupies a part of browser window, to largely reduce the range of data analysis, committed memory is few and divides
It is fast to analyse speed.In addition, being that first determining predeterminated position determines pre- in the web page element of crawl predetermined position in the present embodiment
If not taking the attribute value of web page element when position.
The analysis extraction module 104, for analyzing the web page element, and extracts the attribute value in the web page element,
The attribute value includes the first visibility attribute value and first position attribute value.
It should be noted that the code in the html file that the browser is loaded into can form corresponding DOM
(Document Object Model, document object model) structure, i.e. DOM tree, each node table in DOM tree
It is now the text items in a HTML markup or HTML markup.Therefore, target can be extracted by the analysis to DOM tree
The attribute value of any web page element on webpage.In the present embodiment, the attribute value includes the first visibility attribute value and first
Position attribution value.
The searching module 105, for being found out in the advertisement formwork feature database prestored and the first visibility category
Property the matched second visibility attribute value of value and with the matched second position attribute value of the first position attribute value.
The comparison module 106, for carrying out the first visibility attribute value and the second visibility attribute value
It compares, the first position attribute value and the second position attribute value are compared.
The judgment module 107, for the first visibility attribute value and second visibility ought to be compared out simultaneously
When attribute value is identical, the first position attribute value is identical with the second position attribute value, judge that the web page element is wide
It accuses;And the first visibility attribute value and the second visibility attribute value, first position category are compared out for working as
Property value and the second position attribute value there are it is different when, judge the non-advertisement of the web page element, pass through above functions module
Work cooperation, can effectively be identified accurately judge whether the web page element is advertisement to the advertisement in webpage.
In conclusion the present invention first by the newly-increased advertisement formwork Information Embedding that sends network server 010 to
In advertisement formwork feature database, then determines the predeterminated position in target webpage in browser window, extract the net of predetermined position
Page attribute of an element value, and the visibility attribute value and position attribution value being related to are distinguished in the advertisement formwork feature database prestored
It is compared, judges whether web page element is advertisement according to comparison result, to effectively identify to web advertisement, simultaneously
Its data amount of analysis is small, EMS memory occupation resource that is time-consuming short and reducing processor, and then improves when user browses webpage
Experience Degree.
In embodiment provided herein, it should be understood that disclosed device and method, it can also be by other
Mode realize.The apparatus embodiments described above are merely exemplary, for example, the flow chart and block diagram in attached drawing are shown
Device, the architectural framework in the cards of method and computer program product, function of multiple embodiments according to the present invention
And operation.In this regard, each box in flowchart or block diagram can represent one of a module, section or code
Point, a part of the module, section or code includes one or more for implementing the specified logical function executable
Instruction.It should also be noted that function marked in the box can also be attached to be different from some implementations as replacement
The sequence marked in figure occurs.For example, two continuous boxes can actually be basically executed in parallel, they sometimes may be used
To execute in the opposite order, this depends on the function involved.It is also noted that each of block diagram and or flow chart
The combination of box in box and block diagram and or flow chart can be based on the defined function of execution or the dedicated of movement
The system of hardware is realized, or can be realized using a combination of dedicated hardware and computer instructions.In addition, each in the present invention
Each functional module in embodiment can integrate one independent part of formation together, is also possible to modules and individually deposits
An independent part can also be integrated to form with two or more modules.
It, can be with if the function is realized and when sold or used as an independent product in the form of software function module
It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words
The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a
People's computer, server or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention.
And storage medium above-mentioned includes: that USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), arbitrary access are deposited
The various media that can store program code such as reservoir (RAM, Random Access Memory), magnetic or disk.It needs
Illustrate, herein, relational terms such as first and second and the like be used merely to by an entity or operation with
Another entity or operation distinguish, and without necessarily requiring or implying between these entities or operation, there are any this realities
The relationship or sequence on border.Moreover, the terms "include", "comprise" or its any other variant are intended to the packet of nonexcludability
Contain, so that the process, method, article or equipment for including a series of elements not only includes those elements, but also including
Other elements that are not explicitly listed, or further include for elements inherent to such a process, method, article, or device.
In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including the element
Process, method, article or equipment in there is also other identical elements.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field
For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair
Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.It should also be noted that similar label and letter exist
Similar terms are indicated in following attached drawing, therefore, once being defined in a certain Xiang Yi attached drawing, are then not required in subsequent attached drawing
It is further defined and explained.
Claims (9)
1. a kind of advertisement recognition method, is applied to advertisement identification device, the advertisement identification device and one is stored with newest advertisement
The network server of Template Information communicates to connect, which is characterized in that the advertisement recognition method the following steps are included:
The advertisement identification device sends the request of an advertisement formwork newly-added information to the network server, so that the network
Server increases newly according to the advertisement formwork information prestored in the advertisement identification device to advertisement identification device transmission wide
Slide former information;
By the newly-increased advertisement formwork Information Embedding into the advertisement formwork feature database, with real-time update, the advertisement formwork is special
Levy library;
Grab the web page element in target webpage in browser window predetermined position;
The web page element is analyzed, and extracts the attribute value in the web page element, the attribute value includes the first visibility attribute
Value and first position attribute value;
It is found out in the advertisement formwork feature database prestored and matched second visibility attribute of the first visibility attribute value
Value and with the matched second position attribute value of the first position attribute value;
The first visibility attribute value and the second visibility attribute value are compared, the first position attribute value and
The second position attribute value is compared;
When comparing out the first visibility attribute value simultaneously and second visibility attribute value is identical, the first position belongs to
When property value is identical with the second position attribute value, judge the web page element for advertisement.
2. advertisement recognition method according to claim 1, which is characterized in that it is described by the first visibility attribute value and
The second visibility attribute value is compared, what the first position attribute value and the second position attribute value were compared
After step, which comprises
When comparing out the first visibility attribute value and the second visibility attribute value, the first position attribute value and institute
State second position attribute value there are it is different when, judge the non-advertisement of the web page element.
3. advertisement recognition method according to claim 1, which is characterized in that the network server is known according to the advertisement
The step of advertisement formwork information prestored in other device sends newly-increased advertisement formwork information to the advertisement identification device include:
The network server parses the request of the advertisement formwork newly-added information received, obtains setting in the request
Standby identifier;
The advertisement formwork information stored in the advertisement identification device is searched and analyzed according to the device identification number, is increased newly
Advertisement formwork information and send it to the advertisement identification device.
4. advertisement recognition method according to claim 1, which is characterized in that the predeterminated position includes the web page element
Relative distance between each frame of the browser window.
5. a kind of advertisement identification device, the network server for being stored with newest advertisement formwork information with one is communicated to connect, feature
It is, the advertisement identification device includes:
Sending module, for sending the request of an advertisement formwork newly-added information to the network server, so that the network takes
Business device sends newly-increased advertisement to the advertisement identification device according to the advertisement formwork information prestored in the advertisement identification device
Template Information;
Implant module, for by the newly-increased advertisement formwork Information Embedding into the advertisement formwork feature database, in real time more
The new advertisement formwork feature database;
Handling module, for grabbing in target webpage in the web page element of browser window predetermined position;
Extraction module is analyzed, for analyzing the web page element, and extracts the attribute value in the web page element, the attribute value packet
Include the first visibility attribute value and first position attribute value;
Searching module, for being found out in the advertisement formwork feature database prestored and the first visibility attribute value matched
Two visibility attribute values and with the matched second position attribute value of the first position attribute value;
Comparison module, for the first visibility attribute value and the second visibility attribute value to be compared, described
One position attribution value and the second position attribute value are compared;
Judgment module, for ought compare out simultaneously the first visibility attribute value and second visibility attribute value it is identical,
When the first position attribute value is identical with the second position attribute value, judge the web page element for advertisement.
6. advertisement identification device according to claim 5, which is characterized in that
The judgment module is also used to that the first visibility attribute value and the second visibility attribute value, institute ought be compared out
State first position attribute value and the second position attribute value there are it is different when, judge the non-advertisement of the web page element.
7. advertisement identification device according to claim 5, which is characterized in that the network server includes:
Parsing module is parsed for the request to the advertisement formwork newly-added information received, obtains setting in the request
Standby identifier;
Sending module is searched, for searching according to the device identification number and analyzing the advertisement stored in the advertisement identification device
Template Information obtains newly-increased advertisement formwork information and sends it to the advertisement identification device.
8. advertisement identification device according to claim 5, which is characterized in that the predeterminated position includes the web page element
Relative distance between each frame of the browser window.
9. a kind of electronic equipment, including the computer-readable medium for the non-volatile program code that can be performed with processor,
It is characterized in that, said program code makes the processor execute described any the method for claim 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810980581.8A CN109214864A (en) | 2018-08-27 | 2018-08-27 | A kind of advertisement recognition method and device, electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810980581.8A CN109214864A (en) | 2018-08-27 | 2018-08-27 | A kind of advertisement recognition method and device, electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109214864A true CN109214864A (en) | 2019-01-15 |
Family
ID=64989378
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810980581.8A Pending CN109214864A (en) | 2018-08-27 | 2018-08-27 | A kind of advertisement recognition method and device, electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109214864A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113722640A (en) * | 2021-08-26 | 2021-11-30 | 长沙博为软件技术股份有限公司 | Method, device and medium for collecting webpage configurable items based on RPA |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101957209A (en) * | 2009-07-15 | 2011-01-26 | 江苏新科软件有限公司 | Navigation device map data increment updating method |
CN102510357A (en) * | 2011-09-26 | 2012-06-20 | 深圳中兴网信科技有限公司 | Synchronous method of enterprise organization structure address book and system thereof |
CN103235956A (en) * | 2013-03-28 | 2013-08-07 | 天脉聚源(北京)传媒科技有限公司 | Method and device for detecting advertisements |
CN103886088A (en) * | 2014-03-28 | 2014-06-25 | 北京金山网络科技有限公司 | Method and device for intercepting advertisements in webpage |
CN104021172A (en) * | 2014-05-30 | 2014-09-03 | 北京搜狗科技发展有限公司 | Advertisement filtering method and advertisement filtering device |
CN104182505A (en) * | 2014-08-19 | 2014-12-03 | 小米科技有限责任公司 | Webpage rearrangement method and device |
CN104239422A (en) * | 2014-08-21 | 2014-12-24 | 小米科技有限责任公司 | Advertisement identification method, advertisement identification device and electronic equipment |
CN105335869A (en) * | 2015-09-24 | 2016-02-17 | 精硕世纪科技(北京)有限公司 | Early warning method and system for advertisement monitoring |
CN105373728A (en) * | 2014-09-01 | 2016-03-02 | 深圳富泰宏精密工业有限公司 | Advertisement prompting system and method |
CN105934759A (en) * | 2015-10-13 | 2016-09-07 | 深圳还是威健康科技有限公司 | Data updating method and device, terminal and server |
CN106919690A (en) * | 2017-03-03 | 2017-07-04 | 北京金山安全软件有限公司 | Information shielding method and device and electronic equipment |
-
2018
- 2018-08-27 CN CN201810980581.8A patent/CN109214864A/en active Pending
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101957209A (en) * | 2009-07-15 | 2011-01-26 | 江苏新科软件有限公司 | Navigation device map data increment updating method |
CN102510357A (en) * | 2011-09-26 | 2012-06-20 | 深圳中兴网信科技有限公司 | Synchronous method of enterprise organization structure address book and system thereof |
CN103235956A (en) * | 2013-03-28 | 2013-08-07 | 天脉聚源(北京)传媒科技有限公司 | Method and device for detecting advertisements |
CN103886088A (en) * | 2014-03-28 | 2014-06-25 | 北京金山网络科技有限公司 | Method and device for intercepting advertisements in webpage |
CN104021172A (en) * | 2014-05-30 | 2014-09-03 | 北京搜狗科技发展有限公司 | Advertisement filtering method and advertisement filtering device |
CN104182505A (en) * | 2014-08-19 | 2014-12-03 | 小米科技有限责任公司 | Webpage rearrangement method and device |
CN104239422A (en) * | 2014-08-21 | 2014-12-24 | 小米科技有限责任公司 | Advertisement identification method, advertisement identification device and electronic equipment |
CN105373728A (en) * | 2014-09-01 | 2016-03-02 | 深圳富泰宏精密工业有限公司 | Advertisement prompting system and method |
CN105335869A (en) * | 2015-09-24 | 2016-02-17 | 精硕世纪科技(北京)有限公司 | Early warning method and system for advertisement monitoring |
CN105934759A (en) * | 2015-10-13 | 2016-09-07 | 深圳还是威健康科技有限公司 | Data updating method and device, terminal and server |
CN106919690A (en) * | 2017-03-03 | 2017-07-04 | 北京金山安全软件有限公司 | Information shielding method and device and electronic equipment |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113722640A (en) * | 2021-08-26 | 2021-11-30 | 长沙博为软件技术股份有限公司 | Method, device and medium for collecting webpage configurable items based on RPA |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5721818B2 (en) | Use of model information group in search | |
US9594730B2 (en) | Annotating HTML segments with functional labels | |
US20150067476A1 (en) | Title and body extraction from web page | |
CN102156737B (en) | Method for extracting subject content of Chinese webpage | |
US10296552B1 (en) | System and method for automated identification of internet advertising and creating rules for blocking of internet advertising | |
CN108566399B (en) | Phishing website identification method and system | |
US11907644B2 (en) | Detecting compatible layouts for content-based native ads | |
US20180285331A1 (en) | Method, server, browser, and system for recommending text information | |
CN101539918A (en) | Method and system for internet search | |
CN110245069A (en) | The methods of exhibiting and device of the test method and device of page versions, the page | |
CN112699295A (en) | Webpage content recommendation method and device and computer readable storage medium | |
CN105868290A (en) | Search result presentation method and apparatus | |
CN103136259B (en) | A kind of method and apparatus based on content block identification processing web page contents | |
CN102314494A (en) | Method and equipment for processing webpage contents | |
CN104090923A (en) | Method and device for displaying rich media information in browser | |
CN103942211A (en) | Text page recognition method and device | |
CN105117434A (en) | Webpage classification method and webpage classification system | |
CN105204806A (en) | Individual display method and device for mobile terminal webpage | |
CN102902794A (en) | Web page classification system and method | |
CN110134844A (en) | Subdivision field public sentiment monitoring method, device, computer equipment and storage medium | |
US10963690B2 (en) | Method for identifying main picture in web page | |
CN109214864A (en) | A kind of advertisement recognition method and device, electronic equipment | |
CN107862016A (en) | A kind of collocation method of the thematic page | |
CN113407678B (en) | Knowledge graph construction method, device and equipment | |
CN103870275B (en) | Information processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190115 |