CN109214864A - Advertisement recognition method and device, and electronic device

Info

Publication number
CN109214864A
Authority
CN
China
Prior art keywords
attribute value
advertisement
visibility
web page
identification device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810980581.8A
Other languages
Chinese (zh)
Inventor
李旋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan Fengtai Photoelectric Technology Co Ltd
Original Assignee
Henan Fengtai Photoelectric Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan Fengtai Photoelectric Technology Co Ltd
Priority to CN201810980581.8A
Publication of CN109214864A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00: Commerce
    • G06Q30/02: Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241: Advertisements

Abstract

The invention discloses an advertisement recognition method and device, and an electronic device. The method comprises the following steps: obtaining newly added advertisement template information sent by a network server and embedding it into an advertisement template feature library; grabbing the web page element at a preset position of the browser window in a target web page; extracting attribute values from the web page element, the attribute values including a first visibility attribute value and a first position attribute value; finding, in the pre-stored advertisement template feature library, a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value; and comparing the first visibility attribute value with the second visibility attribute value, and the first position attribute value with the second position attribute value. When the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value, the web page element is judged to be an advertisement, thereby improving the user experience.

Description

Advertisement recognition method and device, and electronic device
Technical field
The invention belongs to the field of Internet detection technology, and in particular relates to an advertisement recognition method and device, and an electronic device.
Background
With the continuous development of modern science and technology, all kinds of junk advertisements are spread widely through web pages by means of Internet technology. At present, most users obtain useful information mainly by searching for relevant web pages. Advertisements inserted into web pages inconvenience browsing, and users strongly wish to skip them and reach the web content they need. In addition, on electronic devices with small screens, advertisements greatly degrade the browsing experience, and a large number of advertisements also wastes the user's data traffic unnecessarily.
The methods proposed in the prior art for recognizing web advertisements often involve a large amount of data analysis, take a long time, and consume a large amount of the processor's memory resources, which degrades the user's experience when browsing web pages.
Summary of the invention
In view of this, the embodiments of the present invention aim to provide an advertisement recognition method and device, and an electronic device, intended to reduce the amount of data analysis and the occupation of the processor's memory resources during web advertisement recognition.
The technical solution adopted by the invention is as follows:
In a first aspect, an embodiment of the present invention provides an advertisement recognition method applied to an advertisement recognition device, the advertisement recognition device being communicatively connected to a network server that stores the latest advertisement template information. The advertisement recognition method includes the following steps:
The advertisement recognition device sends a request for newly added advertisement template information to the network server, so that the network server sends newly added advertisement template information to the advertisement recognition device according to the advertisement template information pre-stored in the advertisement recognition device;
The newly added advertisement template information is embedded into the advertisement template feature library, so as to update the advertisement template feature library in real time;
The web page element at a preset position of the browser window in the target web page is grabbed;
The web page element is analyzed, and the attribute values in the web page element are extracted, the attribute values including a first visibility attribute value and a first position attribute value;
A second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value are found in the pre-stored advertisement template feature library;
The first visibility attribute value is compared with the second visibility attribute value, and the first position attribute value is compared with the second position attribute value;
When the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value, the web page element is judged to be an advertisement.
Further, after the step of comparing the first visibility attribute value with the second visibility attribute value and comparing the first position attribute value with the second position attribute value, the method includes:
When the comparison shows that the first visibility attribute value differs from the second visibility attribute value, or that the first position attribute value differs from the second position attribute value, the web page element is judged not to be an advertisement.
Further, the step in which the network server sends newly added advertisement template information to the advertisement recognition device according to the advertisement template information pre-stored in the advertisement recognition device includes:
The network server parses the received request for newly added advertisement template information and obtains the device identification number in the request;
According to the device identification number, the network server looks up and analyzes the advertisement template information stored in the advertisement recognition device, obtains the newly added advertisement template information, and sends it to the advertisement recognition device.
Further, the preset position includes the relative distances between the web page element and each border of the browser window.
In a second aspect, an embodiment of the present invention provides an advertisement recognition device communicatively connected to a network server that stores the latest advertisement template information. The advertisement recognition device includes:
A sending module, configured to send a request for newly added advertisement template information to the network server, so that the network server sends newly added advertisement template information to the advertisement recognition device according to the advertisement template information pre-stored in the advertisement recognition device;
An implant module, configured to embed the newly added advertisement template information into the advertisement template feature library, so as to update the advertisement template feature library in real time;
A handling module, configured to grab the web page element at a preset position of the browser window in the target web page;
An analysis extraction module, configured to analyze the web page element and extract the attribute values in the web page element, the attribute values including a first visibility attribute value and a first position attribute value;
A searching module, configured to find, in the pre-stored advertisement template feature library, a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value;
A comparison module, configured to compare the first visibility attribute value with the second visibility attribute value, and the first position attribute value with the second position attribute value;
A judgment module, configured to judge that the web page element is an advertisement when the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value.
Further, the judgment module is also configured to judge that the web page element is not an advertisement when the comparison shows that the first visibility attribute value differs from the second visibility attribute value, or that the first position attribute value differs from the second position attribute value.
Further, the network server includes:
A parsing module, configured to parse the received request for newly added advertisement template information and obtain the device identification number in the request;
A lookup sending module, configured to look up and analyze, according to the device identification number, the advertisement template information stored in the advertisement recognition device, obtain the newly added advertisement template information, and send it to the advertisement recognition device.
Further, the preset position includes the relative distances between the web page element and each border of the browser window.
In a third aspect, an embodiment of the present invention provides an electronic device including a non-volatile computer-readable medium storing program code executable by a processor, the program code causing the processor to perform the above method.
In summary, the present invention first embeds the newly added advertisement template information sent by the network server into the advertisement template feature library, then determines the preset position of the browser window in the target web page, extracts the attribute values of the web page element at the preset position, and compares the visibility attribute value and the position attribute value involved against the pre-stored advertisement template feature library. Whether the web page element is an advertisement is judged from the comparison result, so that web advertisements are recognized effectively, while the amount of data analysis is small, the time taken is short, and the occupation of the processor's memory resources is reduced.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed in the embodiments are briefly introduced below. It should be understood that the following drawings illustrate only certain embodiments of the present invention and should therefore not be regarded as limiting its scope. Those of ordinary skill in the art can obtain other related drawings from these drawings without creative effort.
Fig. 1 shows a block diagram of an electronic device provided in an embodiment of the present invention.
Fig. 2 shows a flow diagram of an advertisement recognition method provided in an embodiment of the present invention.
Fig. 3 shows a functional block diagram of an advertisement recognition device provided in an embodiment of the present invention.
Fig. 4 shows a functional block diagram of a network server provided in an embodiment of the present invention.
Description of main reference numerals:
Electronic device 000; network server 010; advertisement recognition device 100; memory 200;
Storage controller 300; processor 400; peripheral interface 500; input-output unit 600;
Audio unit 700; display unit 800; sending module 101; implant module 102;
Handling module 103; analysis extraction module 104; searching module 105; comparison module 106;
Judgment module 107; parsing module 011; lookup sending module 012.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. The components of the embodiments of the present invention, as generally described and illustrated in the drawings herein, can be arranged and designed in a variety of different configurations. Therefore, the following detailed description of the embodiments provided in the drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. All other embodiments obtained by those skilled in the art on the basis of the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
Fig. 1 is a block diagram of an electronic device 000 provided in an embodiment of the present invention. The electronic device 000 may be a personal computer (PC), a tablet computer, a smartphone, a personal digital assistant (PDA), or the like. The electronic device 000 may include an advertisement recognition device 100, a memory 200, a storage controller 300, a processor 400, a peripheral interface 500, an input-output unit 600, an audio unit 700 and a display unit 800.
The memory 200, storage controller 300, processor 400, peripheral interface 500, input-output unit 600, audio unit 700 and display unit 800 are electrically connected to one another, directly or indirectly, to transmit or exchange data. For example, these elements may be electrically connected to one another through one or more communication buses or signal lines. The electronic device 000 includes at least one software function module of the operating system (OS) that can be stored in the memory 200 in the form of software or firmware. The processor 400 is configured to execute the executable modules stored in the memory 200, such as the software function modules or computer programs included in the advertisement recognition device 100.
Fig. 2 is a flow diagram of an advertisement recognition method provided in an embodiment of the present invention. In the present embodiment, the advertisement recognition method is applied to an advertisement recognition device 100 that is communicatively connected to a network server 010 storing the latest advertisement template information; that is, the advertisement recognition device 100 and the network server 010 can access each other's data. The advertisement recognition method may include the following steps:
S101: the advertisement recognition device 100 sends a request for newly added advertisement template information to the network server 010.
Before the advertisement recognition device 100 loads web page data, it first sends the request for newly added advertisement template information to the network server 010, so that the network server 010 sends newly added advertisement template information to the advertisement recognition device 100 according to the advertisement template information pre-stored in the advertisement recognition device 100.
Specifically, the network server 010 parses the received request for newly added advertisement template information and obtains the device identification number in the request. It finds the corresponding advertisement recognition device 100 according to the device identification number, analyzes the advertisement template information stored in that advertisement recognition device 100, obtains the newly added advertisement template information by comparing the stored information with the latest advertisement template information, and sends the newly added advertisement template information to the advertisement recognition device 100.
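The server-side handling in S101 can be sketched as a set difference keyed by device. This is only a minimal illustration under stated assumptions: the `AdTemplate` record, the `device_id` request field and the in-memory `templates_by_device` mapping are hypothetical names, and the source does not specify how templates are identified or stored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AdTemplate:
    """Hypothetical template record: an id plus one visibility and one position value."""
    template_id: str
    display: str
    position: str


def newly_added_templates(latest, device_templates):
    """Return the templates in the server's latest set that the device does not yet hold."""
    known_ids = {t.template_id for t in device_templates}
    return [t for t in latest if t.template_id not in known_ids]


def handle_update_request(request, latest, templates_by_device):
    """Read the device identification number from the request and compute its missing templates."""
    device_id = request["device_id"]  # assumed request field name
    stored = templates_by_device.get(device_id, [])  # unknown device: nothing stored yet
    return newly_added_templates(latest, stored)
```

Sending only the difference keeps the response small, which fits the stated aim of reducing the data the device has to process.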
S102: the newly added advertisement template information is embedded into the advertisement template feature library, so as to update the advertisement template feature library in real time.
In the present embodiment, the data in the advertisement template feature library is updated continuously in real time, so the library can keep adapting to changes in advertisement formats and the advertisements corresponding to the newest web page elements can be judged effectively.
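A minimal sketch of the embedding step in S102, assuming the feature library can be modelled as a set of `(display, position)` pairs. That representation and the function name `embed_templates` are assumptions made for illustration; the source does not describe the library's internal structure.

```python
def embed_templates(feature_library, new_templates):
    """Add new (display, position) pairs to the library in place.

    Returns how many pairs were not already present, so a caller can tell
    whether the update changed anything.
    """
    before = len(feature_library)
    feature_library.update(new_templates)
    return len(feature_library) - before


# Example: one pair is already known, one is new.
library = {("block", "fixed")}
added = embed_templates(library, [("block", "fixed"), ("inline", "absolute")])
```

Using a set makes repeated updates idempotent: embedding the same template twice leaves the library unchanged.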
S103: the web page element at the preset position of the browser window in the target web page is grabbed.
Preferably, after the user enters the target web page to be accessed on the electronic device 000, the browser sends an access request to the network server 010 and receives the HTML file returned by the network server 010. The browser loads the web page by loading this HTML file. Loading the web page includes assembling each web page element in the page, such as text, pictures and Flash animations.
In addition, the preset position may include the relative distances between the web page element and each border of the browser window. When the user browses the target web page through the browser window, the preset position may be: the distance between the left edge of the web page element in the target web page and the left border of the browser window, and the distance between the top of the web page element and the top border of the browser window.
It is worth noting that the advertisement recognition device 100 first determines the preset position of the browser window in the target web page and then grabs the web page element at that preset position, rather than obtaining the attribute values of all web page elements in the entire web page data. The range of web page data to be grabbed is therefore narrowed to the preset position of the browser window, which occupies only part of the browser window. This greatly reduces the scope of data analysis, so that little memory is occupied and analysis is fast. In addition, in the present embodiment the preset position is determined first and the web page element at the preset position is grabbed afterwards; the attribute values of the web page element are not retrieved while the preset position is being determined.
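The position filter in S103 can be sketched as follows, assuming each element's offsets from the left and top borders of the browser window are already known in pixels. `ElementBox`, `PresetPosition` and the `tolerance` parameter are hypothetical; the source only says that the preset position is expressed as distances to the window borders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ElementBox:
    """Hypothetical record: element id and its offsets from the window's left and top borders."""
    element_id: str
    left: float
    top: float


@dataclass(frozen=True)
class PresetPosition:
    """Expected offsets from the left and top borders; tolerance is an added assumption."""
    left: float
    top: float
    tolerance: float = 0.0


def elements_at_preset(boxes, preset):
    """Keep only elements whose left and top offsets match the preset within the tolerance."""
    return [
        box
        for box in boxes
        if abs(box.left - preset.left) <= preset.tolerance
        and abs(box.top - preset.top) <= preset.tolerance
    ]
```

Only the elements returned here would go on to attribute extraction, which is where the reduction in analysis scope comes from.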
S104: the web page element is analyzed, and the attribute values in the web page element are extracted, the attribute values including a first visibility attribute value and a first position attribute value.
It should be noted that the code in the HTML file loaded by the browser forms a corresponding DOM (Document Object Model) structure, i.e. a DOM tree, in which each node represents an HTML tag or a text item within an HTML tag. The attribute values of any web page element on the target web page can therefore be extracted by analyzing the DOM tree. In the present embodiment, the attribute values include the first visibility attribute value and the first position attribute value.
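As a rough illustration of the extraction in S104, the sketch below walks the HTML with Python's standard `html.parser` and reads `display` and `position` from inline `style` attributes. This is a simplification and an assumption: a browser would use the computed style from the DOM and stylesheets, which the standard library cannot reproduce, and the helper names are hypothetical.

```python
from html.parser import HTMLParser


def parse_inline_style(style):
    """Split an inline style string into a {property: value} dict (lowercased)."""
    props = {}
    for declaration in style.split(";"):
        if ":" in declaration:
            name, value = declaration.split(":", 1)
            props[name.strip().lower()] = value.strip().lower()
    return props


class StyleCollector(HTMLParser):
    """Collect (tag, display, position) for every start tag carrying an inline style."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        style = dict(attrs).get("style")
        if style is None:
            return  # no inline style on this element
        props = parse_inline_style(style)
        self.found.append((tag, props.get("display"), props.get("position")))


def extract_attribute_values(html):
    """Return the (tag, display, position) triples found in the given HTML text."""
    collector = StyleCollector()
    collector.feed(html)
    return collector.found
```

A property that is not declared inline comes back as `None`, leaving it to the caller to decide how to treat undeclared values.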
S105: a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value are found in the pre-stored advertisement template feature library.
In the CSS (Cascading Style Sheets) language, "visibility" here corresponds to the "display" property, whose values may include "none", "block" and "inline". When the value of display is none, the corresponding web page element is hidden on the web page; when the value of display is block, the corresponding web page element is displayed as a block-level element and occupies a whole line on the web page; when the value of display is inline, the corresponding web page element is displayed as an inline element and does not occupy a whole line.
Similarly, "position" here corresponds to the CSS "position" property, whose values may include "static", "relative", "absolute" and "fixed". When the value of position is static, the corresponding web page element is placed on the web page by static positioning, i.e. the default positioning is used; when the value of position is relative, the element is placed by relative positioning, i.e. it is adjusted relative to the position it would have under static positioning; when the value of position is absolute, the element is placed by absolute positioning, i.e. it is adjusted according to the position of the element that contains it (in CSS, its nearest positioned ancestor); when the value of position is fixed, the element is placed by fixed positioning, i.e. it is set at a fixed position in the browser window.
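The value lists above can be captured in a small normalization helper. This is a sketch rather than the patent's implementation: the keyword sets are limited to the values named in the description (real CSS defines more), and falling back to the CSS initial values `inline` and `static` for undeclared properties is an assumption added here.

```python
# Keyword lists taken from the description; real CSS defines more values.
DISPLAY_VALUES = frozenset({"none", "block", "inline"})
POSITION_VALUES = frozenset({"static", "relative", "absolute", "fixed"})

# CSS initial values, used here when an element declares nothing (assumption).
DEFAULT_DISPLAY = "inline"
DEFAULT_POSITION = "static"


def normalize_attribute_values(display, position):
    """Return a cleaned (display, position) pair, or None if a value is outside the lists above."""
    display = (display or DEFAULT_DISPLAY).strip().lower()
    position = (position or DEFAULT_POSITION).strip().lower()
    if display not in DISPLAY_VALUES or position not in POSITION_VALUES:
        return None
    return display, position


def is_hidden(display):
    """An element whose display value is "none" is hidden on the page."""
    return display == "none"
```

Normalizing before lookup means that differences in case or surrounding whitespace do not cause a template match to be missed.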
Referring further to Fig. 2, when the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value, step S106 is executed, i.e. the web page element is judged to be an advertisement. When the comparison shows a difference in either pair, for example when the first visibility attribute value is identical to the second visibility attribute value but the first position attribute value differs from the second position attribute value, step S107 is executed, i.e. the web page element is judged not to be an advertisement. Through the above steps, advertisements in a web page can be recognized effectively, and whether a web page element is an advertisement can be judged accurately.
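The decision in steps S106 and S107 reduces to requiring both comparisons to succeed. The sketch below assumes the feature library is a collection of `(display, position)` pairs and that both values must match the same template; the source does not say explicitly whether the two matched values have to come from one template, so that reading is an assumption.

```python
def is_advertisement(element_values, feature_library):
    """Return True only if some template matches both the visibility and the position value.

    element_values: (first_display, first_position) extracted from the web page element.
    feature_library: iterable of (second_display, second_position) template pairs.
    """
    first_display, first_position = element_values
    for second_display, second_position in feature_library:
        same_visibility = first_display == second_display
        same_position = first_position == second_position
        if same_visibility and same_position:
            return True  # S106: both comparisons identical
    return False  # S107: at least one comparison differs for every template
```

With a library containing only `("block", "fixed")`, an element with values `("block", "absolute")` is not an advertisement, which mirrors the example in the text where visibility matches but position differs.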
Fig. 3 is a functional block diagram of an advertisement recognition device 100 provided in an embodiment of the present invention. The advertisement recognition device 100 can be communicatively connected to a network server 010 that stores the latest advertisement template information. The advertisement recognition device 100 mainly includes a sending module 101, an implant module 102, a handling module 103, an analysis extraction module 104, a searching module 105, a comparison module 106 and a judgment module 107. These functional modules are described in detail below.
The sending module 101 is configured to send a request for newly added advertisement template information to the network server 010, so that the network server 010 sends newly added advertisement template information to the advertisement recognition device 100 according to the advertisement template information pre-stored in the advertisement recognition device 100.
In the present embodiment, before the advertisement recognition device 100 loads web page data, the sending module 101 first sends the request for newly added advertisement template information to the network server 010, so that the network server 010 sends newly added advertisement template information according to the advertisement template information pre-stored in the advertisement recognition device 100.
As shown in Fig. 4, in the present embodiment the network server 010 may include a parsing module 011 and a lookup sending module 012, where:
The parsing module 011 is configured to parse the received request for newly added advertisement template information and obtain the device identification number in the request.
The lookup sending module 012 is configured to look up and analyze, according to the device identification number, the advertisement template information stored in the advertisement recognition device 100, obtain the newly added advertisement template information, and send it to the advertisement recognition device 100.
Preferably, the parsing module 011 parses the received request for newly added advertisement template information and obtains the device identification number in the request.
The lookup sending module 012 finds the corresponding advertisement recognition device 100 according to the device identification number, analyzes the advertisement template information stored in that advertisement recognition device 100, and obtains the newly added advertisement template information by comparing the stored information with the latest advertisement template information. It then sends the newly added advertisement template information to the advertisement recognition device 100.
Referring further to Fig. 3, the implant module 102 is configured to embed the newly added advertisement template information into the advertisement template feature library, so as to update the advertisement template feature library in real time.
The handling module 103 is configured to grab the web page element at the preset position of the browser window in the target web page.
Preferably, after the user enters the target web page to be accessed on the electronic device 000, the browser sends an access request to the network server 010 and receives the HTML file returned by the network server 010. The browser loads the web page by loading this HTML file. Loading the web page includes assembling each web page element in the page, such as text, pictures and Flash animations.
In addition, the preset position may include the relative distances between the web page element and each border of the browser window. When the user browses the target web page through the browser window, the preset position may be: the distance between the left edge of the web page element in the target web page and the left border of the browser window, and the distance between the top of the web page element and the top border of the browser window.
It is worth noting that the advertisement recognition device 100 first determines the preset position of the browser window in the target web page and then grabs the web page element at that preset position, rather than obtaining the attribute values of all web page elements in the entire web page data. The range of web page data to be grabbed is therefore narrowed to the preset position of the browser window, which occupies only part of the browser window. This greatly reduces the scope of data analysis, so that little memory is occupied and analysis is fast. In addition, in the present embodiment the preset position is determined first and the web page element at the preset position is grabbed afterwards; the attribute values of the web page element are not retrieved while the preset position is being determined.
The analysis extraction module 104 is configured to analyze the web page element and extract the attribute values in the web page element, the attribute values including a first visibility attribute value and a first position attribute value.
It should be noted that the code in the HTML file loaded by the browser forms a corresponding DOM (Document Object Model) structure, i.e. a DOM tree, in which each node represents an HTML tag or a text item within an HTML tag. The attribute values of any web page element on the target web page can therefore be extracted by analyzing the DOM tree. In the present embodiment, the attribute values include the first visibility attribute value and the first position attribute value.
The searching module 105 is configured to find, in the pre-stored advertisement template feature library, a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value.
The comparison module 106 is configured to compare the first visibility attribute value with the second visibility attribute value, and the first position attribute value with the second position attribute value.
The judgment module 107 is configured to judge that the web page element is an advertisement when the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value, and to judge that the web page element is not an advertisement when the comparison shows a difference in either pair. Through the cooperation of the above functional modules, advertisements in a web page can be recognized effectively, and whether a web page element is an advertisement can be judged accurately.
In summary, the present invention first embeds the newly added advertisement template information sent by the network server 010 into the advertisement template feature library, then determines the preset position of the browser window in the target web page, extracts the attribute values of the web page element at the preset position, and compares the visibility attribute value and the position attribute value involved against the pre-stored advertisement template feature library. Whether the web page element is an advertisement is judged from the comparison result, so that web advertisements are recognized effectively. At the same time, the amount of data analysis is small, the time taken is short, and the occupation of the processor's memory resources is reduced, which improves the user's experience when browsing web pages.
In embodiment provided herein, it should be understood that disclosed device and method, it can also be by other Mode realize.The apparatus embodiments described above are merely exemplary, for example, the flow chart and block diagram in attached drawing are shown Device, the architectural framework in the cards of method and computer program product, function of multiple embodiments according to the present invention And operation.In this regard, each box in flowchart or block diagram can represent one of a module, section or code Point, a part of the module, section or code includes one or more for implementing the specified logical function executable Instruction.It should also be noted that function marked in the box can also be attached to be different from some implementations as replacement The sequence marked in figure occurs.For example, two continuous boxes can actually be basically executed in parallel, they sometimes may be used To execute in the opposite order, this depends on the function involved.It is also noted that each of block diagram and or flow chart The combination of box in box and block diagram and or flow chart can be based on the defined function of execution or the dedicated of movement The system of hardware is realized, or can be realized using a combination of dedicated hardware and computer instructions.In addition, each in the present invention Each functional module in embodiment can integrate one independent part of formation together, is also possible to modules and individually deposits An independent part can also be integrated to form with two or more modules.
If the functions are implemented as software function modules and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied as a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random-access memory (RAM), a magnetic disk, or an optical disc. It should be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to the process, method, article, or device. Absent further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes it.
The foregoing is merely a preferred embodiment of the present invention and is not intended to limit it; those skilled in the art may make various modifications and changes. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection. It should also be noted that similar reference numerals and letters denote similar items in the following drawings, so once an item is defined in one drawing, it need not be further defined or explained in subsequent drawings.

Claims (9)

1. An advertisement recognition method, applied to an advertisement identification device that is communicatively connected to a network server storing the latest advertisement template information, characterized in that the advertisement recognition method comprises the following steps:
the advertisement identification device sends a request for newly added advertisement template information to the network server, so that the network server sends newly added advertisement template information to the advertisement identification device according to the advertisement template information pre-stored in the advertisement identification device;
embedding the newly added advertisement template information into the advertisement template feature library, so as to update the advertisement template feature library in real time;
grabbing the web page element at a preset position of the target web page in the browser window;
analyzing the web page element and extracting attribute values from it, the attribute values including a first visibility attribute value and a first position attribute value;
finding, in the pre-stored advertisement template feature library, a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value;
comparing the first visibility attribute value with the second visibility attribute value, and comparing the first position attribute value with the second position attribute value;
when the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value, judging the web page element to be an advertisement.
2. The advertisement recognition method according to claim 1, characterized in that, after the step of comparing the first visibility attribute value with the second visibility attribute value and comparing the first position attribute value with the second position attribute value, the method comprises:
when the comparison shows that the first visibility attribute value differs from the second visibility attribute value, or that the first position attribute value differs from the second position attribute value, judging that the web page element is not an advertisement.
3. The advertisement recognition method according to claim 1, characterized in that the step in which the network server sends newly added advertisement template information to the advertisement identification device according to the advertisement template information pre-stored in the advertisement identification device comprises:
the network server parses the received request for newly added advertisement template information to obtain the device identifier in the request;
according to the device identifier, looking up and analyzing the advertisement template information stored in the advertisement identification device, obtaining the newly added advertisement template information, and sending it to the advertisement identification device.
4. The advertisement recognition method according to claim 1, characterized in that the preset position includes the relative distance between the web page element and each border of the browser window.
5. An advertisement identification device, communicatively connected to a network server storing the latest advertisement template information, characterized in that the advertisement identification device comprises:
a sending module, configured to send a request for newly added advertisement template information to the network server, so that the network server sends newly added advertisement template information to the advertisement identification device according to the advertisement template information pre-stored in the advertisement identification device;
an embedding module, configured to embed the newly added advertisement template information into the advertisement template feature library, so as to update the advertisement template feature library in real time;
a grabbing module, configured to grab the web page element at a preset position of the target web page in the browser window;
an analysis and extraction module, configured to analyze the web page element and extract attribute values from it, the attribute values including a first visibility attribute value and a first position attribute value;
a lookup module, configured to find, in the pre-stored advertisement template feature library, a second visibility attribute value matching the first visibility attribute value and a second position attribute value matching the first position attribute value;
a comparison module, configured to compare the first visibility attribute value with the second visibility attribute value, and to compare the first position attribute value with the second position attribute value;
a judgment module, configured to judge the web page element to be an advertisement when the comparison shows both that the first visibility attribute value is identical to the second visibility attribute value and that the first position attribute value is identical to the second position attribute value.
6. The advertisement identification device according to claim 5, characterized in that
the judgment module is further configured to judge that the web page element is not an advertisement when the comparison shows that the first visibility attribute value differs from the second visibility attribute value, or that the first position attribute value differs from the second position attribute value.
7. The advertisement identification device according to claim 5, characterized in that the network server comprises:
a parsing module, configured to parse the received request for newly added advertisement template information to obtain the device identifier in the request;
a lookup-and-send module, configured to look up and analyze, according to the device identifier, the advertisement template information stored in the advertisement identification device, obtain the newly added advertisement template information, and send it to the advertisement identification device.
8. The advertisement identification device according to claim 5, characterized in that the preset position includes the relative distance between the web page element and each border of the browser window.
9. An electronic device, including a computer-readable medium storing non-volatile program code executable by a processor, characterized in that the program code causes the processor to execute the method of any one of claims 1-4.
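The incremental update recited in claims 3 and 7 (the server obtains the device identifier from the request, determines which templates that device already stores, and returns only the newly added ones) can be sketched as follows. This is an illustrative assumption rather than the claimed implementation: templates are represented by hypothetical string identifiers, the request is a plain dictionary with a hypothetical `device_id` key, and the server is assumed to keep its own record of what each device holds.

```python
from typing import Dict, Set


def handle_update_request(request: Dict[str, str],
                          latest_templates: Set[str],
                          known_by_device: Dict[str, Set[str]]) -> Set[str]:
    """Return only the templates the requesting device does not yet hold."""
    # Parse the request to obtain the device identifier.
    device_id = request["device_id"]
    # Look up what this device already stores (empty for an unknown device).
    already_stored = known_by_device.get(device_id, set())
    # The newly added templates are the set difference.
    newly_added = latest_templates - already_stored
    # Record that the device will hold these templates after the update.
    known_by_device[device_id] = already_stored | newly_added
    return newly_added


def embed_into_library(library: Set[str], newly_added: Set[str]) -> None:
    """Device side: merge the newly added templates into the local feature library."""
    library |= newly_added
```

Sending only the difference keeps each update small, which fits the document's emphasis on low data volume; a real system would also need to handle removed or changed templates, which this section does not describe.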

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810980581.8A CN109214864A (en) 2018-08-27 2018-08-27 A kind of advertisement recognition method and device, electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810980581.8A CN109214864A (en) 2018-08-27 2018-08-27 A kind of advertisement recognition method and device, electronic equipment

Publications (1)

Publication Number Publication Date
CN109214864A true CN109214864A (en) 2019-01-15

Family

ID=64989378

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810980581.8A Pending CN109214864A (en) 2018-08-27 2018-08-27 A kind of advertisement recognition method and device, electronic equipment

Country Status (1)

Country Link
CN (1) CN109214864A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113722640A (en) * 2021-08-26 2021-11-30 长沙博为软件技术股份有限公司 Method, device and medium for collecting webpage configurable items based on RPA

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101957209A (en) * 2009-07-15 2011-01-26 江苏新科软件有限公司 Navigation device map data increment updating method
CN102510357A (en) * 2011-09-26 2012-06-20 深圳中兴网信科技有限公司 Synchronous method of enterprise organization structure address book and system thereof
CN103235956A (en) * 2013-03-28 2013-08-07 天脉聚源(北京)传媒科技有限公司 Method and device for detecting advertisements
CN103886088A (en) * 2014-03-28 2014-06-25 北京金山网络科技有限公司 Method and device for intercepting advertisements in webpage
CN104021172A (en) * 2014-05-30 2014-09-03 北京搜狗科技发展有限公司 Advertisement filtering method and advertisement filtering device
CN104182505A (en) * 2014-08-19 2014-12-03 小米科技有限责任公司 Webpage rearrangement method and device
CN104239422A (en) * 2014-08-21 2014-12-24 小米科技有限责任公司 Advertisement identification method, advertisement identification device and electronic equipment
CN105335869A (en) * 2015-09-24 2016-02-17 精硕世纪科技(北京)有限公司 Early warning method and system for advertisement monitoring
CN105373728A (en) * 2014-09-01 2016-03-02 深圳富泰宏精密工业有限公司 Advertisement prompting system and method
CN105934759A (en) * 2015-10-13 2016-09-07 深圳还是威健康科技有限公司 Data updating method and device, terminal and server
CN106919690A (en) * 2017-03-03 2017-07-04 北京金山安全软件有限公司 Information shielding method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190115