CN111767454A - Business object filtering method and device, electronic equipment and computer storage medium - Google Patents

Info

Publication number
CN111767454A
Authority
CN
China
Prior art keywords
page
business object
filtered
information
filtering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910261039.1A
Other languages
Chinese (zh)
Inventor
刘凌凌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201910261039.1A priority Critical patent/CN111767454A/en
Publication of CN111767454A publication Critical patent/CN111767454A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The embodiment of the invention provides a business object filtering method, a business object filtering device, electronic equipment and a computer storage medium, wherein the business object filtering method comprises the following steps: acquiring the information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered; determining page elements matched with the information of the business object in the page to be filtered; and carrying out filtering processing on the page elements of the page to be filtered, and displaying the filtered page. The embodiment of the invention greatly reduces the implementation cost of the business object filtering.

Description

Business object filtering method and device, electronic equipment and computer storage medium
Technical Field
The embodiment of the invention relates to the technical field of computers, in particular to a business object filtering method and device, electronic equipment and a computer storage medium.
Background
As distribution and promotion technologies mature, more and more vendors of business objects such as advertisements distribute and promote those business objects through various applications. However, users usually do not need these business objects, and the objects can seriously interfere with the user's use of the application or program. Filtering business objects has therefore become an important function of many applications and programs.
For example, a very high proportion of online user problem feedback is advertisement-related, so advertisement filtering has always been a very important browser function; many browsers in China currently provide it, and it is one of the functions users care about most. The traditional advertisement filtering mechanism works mainly as follows: the advertisement features of a page are extracted manually; the features are turned into rules that can run online, and the rules are delivered to the browser; the browser kernel parses the filtering method and the target elements from the rules, and then deletes or hides the target elements or intercepts the advertisement requests. After such advertisement filtering rules go online, they must be continuously maintained by hand and iteratively re-released in order to cope with newly appearing advertisement types. The basic flow is: a new advertisement appears, the page is analyzed manually, a rule is extracted, and the new filtering rule is added to update the existing advertisement filtering rules.
However, this filtering approach has the following problems: first, as business objects such as advertisements appear in differently written forms, the rules must be updated continuously, and the operating cost is high; second, added business object filtering rules, such as advertisement filtering rules, may on the one hand be incomplete and on the other hand easily cause 'false killing', in which normal page content is filtered out; third, extracting the rules requires manual analysis, which is costly.
Disclosure of Invention
In view of the above, embodiments of the present invention provide a business object filtering scheme to solve at least some of the above problems.
According to a first aspect of the embodiments of the present invention, a method for filtering a business object is provided, including: acquiring the information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered; determining page elements matched with the information of the business object in the page to be filtered; and carrying out filtering processing on the page elements of the page to be filtered, and displaying the filtered page.
According to a second aspect of the embodiments of the present invention, there is provided a business object filtering apparatus, including: the acquisition module is used for acquiring the information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered; the determining module is used for determining page elements matched with the information of the business objects in the page to be filtered; and the filtering module is used for filtering the page elements of the page to be filtered and displaying the filtered page.
According to a third aspect of embodiments of the present invention, there is provided an electronic apparatus, including: the system comprises a processor, a memory, a communication interface and a communication bus, wherein the processor, the memory and the communication interface complete mutual communication through the communication bus; the memory is used for storing at least one executable instruction, and the executable instruction enables the processor to execute the operation corresponding to the business object filtering method according to the first aspect.
According to a fourth aspect of embodiments of the present invention, there is provided a computer storage medium having stored thereon a computer program which, when executed by a processor, implements the business object filtering method according to the first aspect.
According to the business object filtering scheme provided by the embodiment of the invention, firstly, the business object identification is carried out on the page screenshot corresponding to the page through the image identification technology so as to identify the business object to be filtered from the page screenshot, and the information of the identified business object is obtained; then, according to the identified information of the business object, determining a page element corresponding to the information of the business object from the page; and further, filtering the page element from the page, so that the displayed page has no business object any more.
According to the business object filtering scheme provided by the embodiment of the invention, first, image recognition is used as the means of recognizing business objects, and because an image recognition algorithm can be continuously optimized as the amount of recognition data grows, a business object can be recognized effectively no matter how it is written; second, image recognition can identify business objects accurately, so business objects can be handled comprehensively while 'false killing' of normal page elements is avoided; third, because the page elements matching the recognized business objects are found by searching the page, the business object filtering rules do not need to be updated even when business objects written in new ways appear, which greatly reduces the implementation cost of business object filtering.
Drawings
To illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some of the embodiments of the present invention, and a person skilled in the art can derive other drawings from them.
Fig. 1 is a flowchart illustrating the steps of a business object filtering method according to a first embodiment of the present invention;
Fig. 2 is a flowchart illustrating the steps of a business object filtering method according to a second embodiment of the present invention;
Fig. 3 is a flowchart illustrating the steps of a business object filtering method according to a third embodiment of the present invention;
Fig. 4 is a schematic diagram of a training scenario of the business object recognition engine in the embodiment shown in Fig. 3;
Fig. 5 is a block diagram illustrating a business object filtering apparatus according to a fourth embodiment of the present invention;
Fig. 6 is a block diagram illustrating a business object filtering apparatus according to a fifth embodiment of the present invention;
Fig. 7 is a schematic structural diagram of an electronic device according to a sixth embodiment of the present invention.
Detailed Description
In order to make those skilled in the art better understand the technical solutions in the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be described clearly and completely with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments obtained by a person skilled in the art based on the embodiments of the present invention shall fall within the scope of the protection of the embodiments of the present invention.
The following further describes specific implementation of the embodiments of the present invention with reference to the drawings.
Example one
Referring to fig. 1, a flowchart illustrating steps of a business object filtering method according to a first embodiment of the present invention is shown.
The business object filtering method of the embodiment comprises the following steps:
Step S102: acquire the information of the identified business object according to the result of performing business object recognition on a page screenshot of the page to be filtered.
The page to be filtered may be any appropriate page of an application that a business object vendor can use to distribute and promote business objects, such as a browser webpage or any page of some other application (app). The page screenshot of the page to be filtered can likewise be generated or acquired in any appropriate manner, for example by generating it from the information of the page elements of the page to be filtered, or by taking a screen snapshot.
In the embodiment of the present invention, a person skilled in the art may set the business object to any suitable object to be filtered according to actual needs, including but not limited to an advertisement or some other specific object. The business objects in the page screenshot may be identified using a suitable image recognition algorithm, such as an image recognition algorithm based on a convolutional neural network, the SURF algorithm, the SIFT algorithm, the K-MEANS algorithm, the DFS algorithm, or the BFS algorithm; the embodiment of the present invention places no limitation on this.
Business object recognition on the page screenshot of the page to be filtered can be performed locally at the client; alternatively, the page screenshot can be sent to the server, which performs the business object recognition and returns the recognition result. Performing the recognition locally at the client gives high recognition efficiency and speed, whereas performing it at the server reduces the local data processing load and system consumption of the client.
If the business object to be filtered exists in the page screenshot of the page to be filtered, the information of the corresponding business object, such as the position information, the size information, the identification information, the name information, and the like of the business object, can be obtained, so as to subsequently determine the page element corresponding to the business object.
Step S104: determine the page elements in the page to be filtered that match the information of the business object.
The page elements of a page may include some or all of text, pictures, audio, animation, and video, and the business object may also be displayed in the form of some of the above page elements. Each page element has corresponding information in the page, such as identification information, attribute information, layout information, position information, and the like, so as to determine the display position and the display mode of the page element in the page. Based on this, after the information of the business object is obtained, the page element matched with the information of the business object can be searched and determined from the page elements according to the information of the business object and the information of the page elements.
Step S106: perform filtering processing on those page elements in the page to be filtered, and display the filtered page.
The page elements matched with the information of the business object are the page elements which need to be filtered, so that the page elements in the page to be filtered need to be filtered, such as deleted or hidden, so as to achieve the effect of limiting the display of the page elements. After the page elements are filtered, the displayed page does not have the page elements, that is, the business objects do not exist any more, so that the filtering of the business objects in the page is realized.
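Steps S102 to S106 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described in the embodiment: the `Box` and `PageElement` types, the `recognize` callback, and the rule that an element matches when a detected area fully contains it are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle in screenshot coordinates (hypothetical type)."""
    x: float
    y: float
    width: float
    height: float

    def contains(self, other: "Box") -> bool:
        return (self.x <= other.x and self.y <= other.y
                and other.x + other.width <= self.x + self.width
                and other.y + other.height <= self.y + self.height)


@dataclass
class PageElement:
    element_id: str
    box: Box
    visible: bool = True


def filter_page(elements: List[PageElement],
                screenshot: bytes,
                recognize: Callable[[bytes], List[Box]]) -> List[PageElement]:
    # S102: run recognition on the screenshot and collect the detected areas.
    detections = recognize(screenshot)
    # S104: find the page elements that match a detected area.
    for element in elements:
        if any(area.contains(element.box) for area in detections):
            # S106: filter the element (hidden here; deletion is the alternative).
            element.visible = False
    # The "displayed" page is whatever remains visible.
    return [e for e in elements if e.visible]
```

Here `recognize` stands in for whichever image recognition algorithm is chosen; any callable that maps screenshot bytes to a list of detected areas would fit.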
According to the embodiment, firstly, the business object recognition is carried out on the page screenshot corresponding to the page through the image recognition technology, so that the business object needing to be filtered is recognized; then, according to the identified information of the business object, determining a page element corresponding to the information of the business object from the page; and further, filtering the page element from the page, so that the displayed page has no business object any more.
According to the embodiment, first, image recognition is used as the means of recognizing business objects, and because an image recognition algorithm can be continuously optimized as the amount of recognition data grows, a business object can be recognized effectively no matter how it is written; second, image recognition can identify business objects accurately, so business objects can be handled comprehensively while 'false killing' of normal page elements is avoided; third, because the page elements matching the recognized business objects are found by searching the page, the business object filtering rules do not need to be updated even when business objects written in new ways appear, which greatly reduces the implementation cost of business object filtering.
The business object filtering method of the present embodiment may be executed by any suitable electronic device with data processing capability, including but not limited to: mobile terminals (such as mobile phones and tablets), servers, PCs, etc.
Example two
Referring to fig. 2, a flowchart illustrating steps of a business object filtering method according to a second embodiment of the present invention is shown.
The business object filtering method of the embodiment comprises the following steps:
Step S202: acquire a page screenshot of the page to be filtered.
In this embodiment, the page screenshot of the page to be filtered is obtained by generating it, which includes: obtaining rendering data of the page to be filtered from the cache of the application in which the page to be filtered is located; and generating the page screenshot of the page to be filtered from the rendering data.
Many applications are provided with an application cache, such as a browser cache, that stores copies of requested resources in the application so that, the next time a resource is requested, the copy can be used to respond according to the caching mechanism. For example, a browser cache may store copies of requested Web resources (e.g., HTML pages, pictures, JS files, data) in the browser, keeping a copy of the output content for each access request received; when the next access request arrives for the same URL, the cache decides, according to the caching mechanism, whether to answer the request directly from the copy or to send the request to the origin server again. Commonly, the browser caches the web pages of websites that have been visited, and when the same URL is visited again and the web page has not been updated, the page is not downloaded again; the locally cached page is used directly. On this basis, this embodiment obtains the rendering data of the page to be filtered from the application cache and generates the corresponding page screenshot from that rendering data. For a page to be filtered that is being accessed for the first time, the data of the page can be cached by preloading, the corresponding page screenshot is generated from the rendering data in the preloaded data, the business objects in the page are filtered before the page is displayed, and the corresponding page is then displayed according to the preloaded data.
Generating the page screenshot from rendering data in the cache has two benefits. On the one hand, because the rendering data is obtained from the cache, the page screenshot is generated more quickly and efficiently; on the other hand, the information of all page elements of the page to be filtered can be acquired, including both the page elements shown to the user within the current display range and those outside it that are not shown, so the page elements in the generated page screenshot are more complete and comprehensive, which improves the efficiency of business object recognition.
The scheme is not limited to this, however: other ways of generating the page screenshot, such as screen capture or taking a screenshot of a VIEW control, are also applicable to this embodiment. In addition, a page screenshot of the page to be filtered can be generated and stored in advance and then fetched directly from its storage address when needed; this approach is also applicable to this embodiment.
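The cache-first lookup with a preloading fallback for first-time visits might look like the sketch below. It assumes a plain dictionary as the application cache and a hypothetical `preload` callable that fetches and returns rendering data; neither is specified by the embodiment.

```python
from typing import Callable, Dict


def get_rendering_data(url: str,
                       cache: Dict[str, bytes],
                       preload: Callable[[str], bytes]) -> bytes:
    """Return rendering data for `url`, preferring the application cache."""
    data = cache.get(url)
    if data is None:
        # First visit: preload the page data and cache it, so that the
        # screenshot can be generated and filtered before the page is shown.
        data = preload(url)
        cache[url] = data
    return data
```

A second request for the same URL is then served from the cache without invoking `preload` again.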
Step S204: perform business object recognition on the page screenshot of the page to be filtered.
In this embodiment, business object recognition on the page screenshot of the page to be filtered is performed locally at the client, so that business objects can be filtered quickly.
The business object can be set by a person skilled in the art according to actual needs, and the specific implementation of business object recognition on the screenshot can also be implemented by a person skilled in the art by adopting any appropriate image recognition mode according to actual needs.
Step S206: acquire the information of the identified business object according to the result of performing business object recognition on the page screenshot of the page to be filtered.
In this embodiment, the information of the identified business object includes at least the coordinate information and/or the position information of the business object. Optionally, it may also include the width and height of the area where the business object is located and, of course, other information about the business object, such as identification information or name information. The coordinate information indicates the coordinates of the business object in the page screenshot; the position information indicates the relative position of the business object in the page screenshot, for example its distance from the boundary of the page screenshot or its position relative to some target in the page screenshot. Because a business object in the page screenshot corresponds to a page element in the page to be filtered, using the coordinate information and/or position information of the business object allows the corresponding page element to be determined more quickly and efficiently.
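One possible shape for this information is sketched below. The `BusinessObjectInfo` record and its field names are assumptions for illustration; the `margins` method shows one way to derive relative position (distance from each screenshot boundary) from the coordinates, width, and height.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class BusinessObjectInfo:
    """Hypothetical record for one recognized business object."""
    x: int          # left edge in screenshot pixels
    y: int          # top edge in screenshot pixels
    width: int
    height: int
    name: str = "advertisement"

    def margins(self, shot_width: int, shot_height: int) -> Dict[str, int]:
        # Relative position expressed as the distance to each screenshot boundary.
        return {
            "left": self.x,
            "top": self.y,
            "right": shot_width - (self.x + self.width),
            "bottom": shot_height - (self.y + self.height),
        }
```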
Step S208: determine the page elements in the page to be filtered that match the information of the business object.
In one feasible manner, the page element matching the information of the business object can be searched for and determined according to the node information of the page elements in the page to be filtered. The node information of a page element may be that of its corresponding node in the DOM (Document Object Model) tree of the page to be filtered, or that of its corresponding node in the Render tree of the page to be filtered, and so on. This approach makes effective use of information that the page to be filtered already has, which lowers the cost and increases the speed of locating business objects in the page.
Taking a browser as an example, after the browser parses an HTML web page, it generates a corresponding DOM tree. Each node on the DOM tree corresponds to a page element in the web page. On this basis, the node whose node information matches the information of the business object can be found in the DOM tree, and the page element corresponding to that node is determined to be the page element matching the information of the business object.
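A DOM-tree search of this kind can be sketched as below. The embodiment does not say how node information is compared with the business object information; this example assumes each node carries a bounding rectangle and picks the node with the highest intersection-over-union (IoU) against the recognized area, subject to a hypothetical threshold of 0.8. `DomNode` is a stand-in type, not a browser API.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Rect = Tuple[float, float, float, float]  # (x, y, width, height)


@dataclass
class DomNode:
    tag: str
    rect: Rect
    children: List["DomNode"] = field(default_factory=list)


def iou(a: Rect, b: Rect) -> float:
    """Intersection-over-union of two rectangles."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    iw = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def find_matching_node(root: DomNode, target: Rect,
                       threshold: float = 0.8) -> Optional[DomNode]:
    """Depth-first walk returning the node whose rectangle best matches target."""
    best, best_score = None, threshold
    stack = [root]
    while stack:
        node = stack.pop()
        score = iou(node.rect, target)
        if score >= best_score:
            best, best_score = node, score
        stack.extend(node.children)
    return best
```

Using a threshold rather than exact equality tolerates small differences between the recognized area and the element's actual box.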
As another example, because the DOM tree itself cannot be used directly for layout and rendering, the browser also generates a Render tree for the web page, in which each node (a RenderObject) corresponds one-to-one to a node on the DOM tree. To make page elements easier to process, the browser also generates corresponding RenderLayers for some RenderObjects; each RenderObject belongs to a RenderLayer, and the RenderLayers together form a Layer tree. On this basis, when determining the page elements matching the information of the business object, the Layer tree can also be traversed: each RenderLayer is visited, the RenderObjects belonging to it are traversed, and the RenderObject matching the information of the business object is determined from among them. The page element corresponding to that RenderObject is the page element matching the information of the business object.
The scheme is not limited to this, however: other ways of determining the page element from the information of the business object, such as determining the corresponding page element from the coordinate information of the business object through the Hittest algorithm, are also applicable to the solution of the embodiment of the present invention.
Step S210: perform filtering processing on those page elements in the page to be filtered, and display the filtered page.
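The Layer-tree traversal described above can be sketched as follows. The `RenderLayer` and `RenderObject` classes here are simplified stand-ins that borrow the names from the text, not the browser kernel's real classes, and the matching rule (each rectangle component within a few pixels of the recognized area) is an assumption.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple

Rect = Tuple[float, float, float, float]  # (x, y, width, height)


@dataclass
class RenderObject:
    name: str
    rect: Rect


@dataclass
class RenderLayer:
    objects: List[RenderObject] = field(default_factory=list)
    children: List["RenderLayer"] = field(default_factory=list)


def walk_layers(layer: RenderLayer) -> Iterator[RenderLayer]:
    """Visit every RenderLayer in the Layer tree, parent before children."""
    yield layer
    for child in layer.children:
        yield from walk_layers(child)


def close_enough(a: Rect, b: Rect, tolerance: float) -> bool:
    return all(abs(p - q) <= tolerance for p, q in zip(a, b))


def find_render_objects(root: RenderLayer, target: Rect,
                        tolerance: float = 4.0) -> List[RenderObject]:
    """Collect RenderObjects whose rectangle approximately equals target."""
    return [obj
            for layer in walk_layers(root)
            for obj in layer.objects
            if close_enough(obj.rect, target, tolerance)]
```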
Wherein, the filtering process of the page elements includes but is not limited to: a delete process or a hide process. In a feasible manner, the deleting process or the hiding process may be determined according to the layout information of the page element in the page to be filtered. For example, according to the layout information of the page element in the page to be filtered, if it is determined that the layout information of other page elements in the page to be filtered changes after the page element is deleted, the page element is hidden; otherwise, the deletion process is performed.
The layout information of the page elements in the page to be filtered indicates the display positions of the page elements in the page to be filtered, the relative position relationship between the page elements and other page elements, and the like.
Deciding whether to delete or hide a page element according to its layout information minimizes the impact that filtering the business object has on the layout of the page to be filtered, so the filtering result is better.
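The delete-or-hide decision could be sketched as below. The embodiment only states the rule (hide when deletion would change the layout of other elements, otherwise delete); using the CSS `position` value as the test for "deletion would shift other elements" is an assumption made for this example, based on the fact that absolutely and fixed positioned elements are out of normal flow.

```python
from enum import Enum


class FilterAction(Enum):
    DELETE = "delete"
    HIDE = "hide"


# Assumption: elements with these CSS position values are out of normal flow,
# so removing them does not move their neighbours.
OUT_OF_FLOW_POSITIONS = {"absolute", "fixed"}


def choose_filter_action(css_position: str) -> FilterAction:
    """Hide in-flow elements to keep the layout stable; delete the rest."""
    if css_position in OUT_OF_FLOW_POSITIONS:
        return FilterAction.DELETE
    return FilterAction.HIDE
```

Hiding an in-flow element while keeping its box (for example with `visibility: hidden`) leaves the surrounding layout unchanged, which is the effect the rule aims for.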
After the filtering processing is carried out, the displayed page has no business object, so that the filtering requirement on the business object is met, and the use experience of a user is improved.
Step S212: update the business object recognition engine using the page screenshot of the page to be filtered and the information of the identified business object.
This step is optional. The business object recognition engine can be deployed at the client and/or the server. An engine deployed at the client can perform business object recognition on the page screenshot of the page to be filtered. An engine deployed at the server can be continuously optimized using the page screenshots and recognized business object information reported by the clients, and the optimized engine can then be delivered to the clients to update and optimize their business object recognition engines. Alternatively, if the client has no business object recognition engine, it may send the page screenshot of the page to be filtered to the server, whose business object recognition engine performs the recognition and returns the information of the recognized business object to the client for subsequent filtering processing.
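The choice between client-side and server-side recognition can be sketched as a simple fallback. Both engines are modelled here as hypothetical callables that take screenshot bytes and return a list of `(x, y, width, height)` areas; the network transport to the server is omitted.

```python
from typing import Callable, List, Optional, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height)
Engine = Callable[[bytes], List[Rect]]


def recognize_business_objects(screenshot: bytes,
                               local_engine: Optional[Engine],
                               server_engine: Engine) -> List[Rect]:
    """Use the client-side engine when present, otherwise ask the server."""
    if local_engine is not None:
        return local_engine(screenshot)
    # No engine on the client: the screenshot is sent to the server, which
    # returns the recognized areas for the client to filter.
    return server_engine(screenshot)
```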
Through the steps, the business object recognition engine for business object recognition can be continuously optimized, so that support is provided for better subsequent business object recognition.
According to the embodiment, firstly, the business object recognition is carried out on the page screenshot corresponding to the page through the image recognition technology, so that the business object needing to be filtered is recognized from the page screenshot, and the information of the recognized business object is obtained; then, according to the identified information of the business object, determining a page element corresponding to the information of the business object from the page; and further, filtering the page element from the page, so that the displayed page has no business object any more.
According to the embodiment, first, image recognition is used as the means of recognizing business objects, and because an image recognition algorithm can be continuously optimized as the amount of recognition data grows, a business object can be recognized effectively no matter how it is written; second, image recognition can identify business objects accurately, so business objects can be handled comprehensively while 'false killing' of normal page elements is avoided; third, because the page elements matching the recognized business objects are found by searching the page, the business object filtering rules do not need to be updated even when business objects written in new ways appear, which greatly reduces the implementation cost of business object filtering.
The business object filtering method of the present embodiment may be executed by any suitable electronic device with data processing capability, including but not limited to: mobile terminals (such as mobile phones and tablets), servers, PCs, etc.
Example three
Referring to fig. 3, a flowchart illustrating steps of a business object filtering method according to a third embodiment of the present invention is shown.
In this embodiment, the business object filtering method according to the embodiment of the present invention is described by taking advertisement filtering on a browser webpage as an example. However, those skilled in the art should understand that advertisement filtering in other applications, filtering of other business objects in a browser webpage, and other similar scenarios can be implemented with reference to the present embodiment.
The business object filtering method of the embodiment comprises the following steps:
Step S302: load and render the webpage.
For example, when a user needs to access a web page, the user opens it in a browser, and the browser kernel then loads and renders it.
Step S304: generate a screenshot of the webpage content.
For example, the browser kernel may generate the screenshot of the web page content of the web page according to rendering data stored in a browser cache when the web page is rendered by the browser.
Step S306: identify the advertisement area through a business object recognition engine.
For example, the generated screenshot of the web page content may be input to the business object recognition engine, which identifies the advertisement area in it.
Step S308: determine from the recognition result whether an advertisement has been identified; if so, go to step S310; otherwise, end the process.
Step S310: calculate the coordinates, width, and height of the advertisement area.
The specific implementation of this step can be realized by those skilled in the art in any appropriate manner according to actual situations, for example, the width of the advertisement area is calculated by calculating the distance between corresponding coordinate points, and the like.
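As a minimal sketch of step S310, assuming the recognition engine reports an advertisement area as a set of corner points in page coordinates, the coordinates, width, and height could be derived as follows. The `Region` type and the `region_from_corners` helper are hypothetical names introduced for illustration and are not part of the disclosure.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass(frozen=True)
class Region:
    """Top-left coordinate plus width and height of a recognized area."""
    x: float
    y: float
    width: float
    height: float


def region_from_corners(points: Iterable[Tuple[float, float]]) -> Region:
    """Derive the bounding region from corner points (assumed input format)."""
    pts = list(points)
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    left, top = min(xs), min(ys)
    # Width and height are the distances between the extreme coordinates.
    return Region(left, top, max(xs) - left, max(ys) - top)
```

Taking the minimum and maximum of each axis keeps the result independent of the order in which the corner points are reported.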
Step S312: The browser kernel uses the Hittest algorithm to search the web page for advertisement elements according to the coordinates, width, and height of the advertisement area.
The Hittest algorithm detects whether two objects or targets overlap or intersect; if they do, a preset operation may be performed. Therefore, when the Hittest algorithm is used to search for advertisement elements in the web page, the coordinates, width, and height of the advertisement area must all be passed to it, so that it can accurately determine whether the area they describe overlaps or intersects the area of an advertisement element in the web page. In this way the advertisement elements in the web page can be found accurately, and the preset filtering processing can then be applied to them.
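The overlap check described above can be sketched as an axis-aligned rectangle intersection test. This is an illustration only, not the browser kernel's actual Hittest implementation: `Rect`, `Element`, `overlap_area`, and `hit_test` are hypothetical names, and the `min_coverage` threshold is an added assumption used to keep large container elements, which also intersect the advertisement area, from being matched.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Rect:
    x: float
    y: float
    width: float
    height: float


@dataclass(frozen=True)
class Element:
    node_id: str
    rect: Rect


def overlap_area(a: Rect, b: Rect) -> float:
    """Area shared by two axis-aligned rectangles, or 0.0 if they are disjoint."""
    w = min(a.x + a.width, b.x + b.width) - max(a.x, b.x)
    h = min(a.y + a.height, b.y + b.height) - max(a.y, b.y)
    return w * h if w > 0 and h > 0 else 0.0


def hit_test(region: Rect, elements: List[Element],
             min_coverage: float = 0.5) -> List[Element]:
    """Return elements whose own area is covered by `region` to at least
    `min_coverage` (an assumed threshold, not taken from the source)."""
    hits = []
    for element in elements:
        area = element.rect.width * element.rect.height
        if area > 0 and overlap_area(region, element.rect) / area >= min_coverage:
            hits.append(element)
    return hits
```

Measuring coverage relative to each element's own area means that a page-sized container is not reported merely because the advertisement lies inside it.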
Step S314: the browser kernel deletes or hides the advertisement elements.
After the width, the height and the coordinates of the advertisement area are transmitted to the browser kernel, the browser kernel finds corresponding advertisement element nodes in the webpage through the data, and then deletes or hides the advertisement element nodes, so that the purpose of filtering advertisements is achieved.
Step S316: and uploading the information of the webpage content screenshot and the advertisement area to a cloud server so as to train a business object recognition engine of the server.
For example, as shown in fig. 4, the web page content screenshot is sent to the business object recognition engine in the browser kernel for business object recognition, the advertisement area information is obtained, and advertisement filtering is completed according to that information. The screenshot and the related advertisement area information (such as the coordinates, width, and height) can then be transmitted to the cloud server. The cloud server stores the uploaded data in a training database as training data for optimizing the server-side business object recognition engine. The server-side business object recognition engine, which has deep learning capability, obtains the training data from the training database and optimizes its algorithm model. After the optimization training is completed, the server may push the business object recognition engine to the client (or notify the client to pull it) when certain conditions are met (such as the size of the accuracy improvement or the time elapsed since the last update), thereby updating the client's business object recognition engine (for example, by hot-update replacement).
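The push conditions mentioned above could be checked on the server roughly as follows. This is a sketch under stated assumptions: the source names accuracy improvement and update interval only as examples, so the `EngineVersion` record, the `should_push_update` function, the threshold defaults, and the choice to require both conditions together are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EngineVersion:
    version: int
    accuracy: float      # validation accuracy in [0, 1]
    released_at: float   # release time in seconds since the epoch


def should_push_update(current: EngineVersion, candidate: EngineVersion,
                       now: float,
                       min_accuracy_gain: float = 0.01,
                       min_interval_s: float = 86400.0) -> bool:
    """Push only if accuracy improved enough AND enough time has passed
    since the client's current engine was released (assumed policy)."""
    improved = candidate.accuracy - current.accuracy >= min_accuracy_gain
    waited = now - current.released_at >= min_interval_s
    return improved and waited
```

A deployment could just as well treat either condition alone as sufficient; the source leaves that policy open.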
According to this embodiment, first, image recognition is applied to the filtering of web page advertisements, so that advertisements in the web page are effectively identified and handled. Second, web pages no longer need to be analyzed manually afterwards to extract advertisement filtering rules, so the maintenance cost is low. Third, deep learning is used to optimize the business object recognition engine continuously, which improves the accuracy of advertisement recognition and allows the engine to adapt to external changes in advertisements in a timely manner.
The business object filtering method of this embodiment may be executed by any suitable electronic device with data processing capability, including but not limited to mobile terminals (such as mobile phones and tablets), servers, and PCs.
Example four
Referring to fig. 5, a block diagram of a business object filtering apparatus according to a fourth embodiment of the present invention is shown.
The business object filtering apparatus of this embodiment comprises: an obtaining module 402, configured to obtain information of an identified business object according to the result of business object recognition performed on a page screenshot of a page to be filtered; a determining module 404, configured to determine a page element in the page to be filtered that matches the information of the business object; and a filtering module 406, configured to filter the page element out of the page to be filtered and display the filtered page.
The business object filtering apparatus of this embodiment is used to implement the corresponding business object filtering methods of the foregoing method embodiments and has the beneficial effects of those method embodiments, which are not described again here. In addition, for the functional implementation of each module in the business object filtering apparatus of this embodiment, reference may be made to the description of the corresponding part of the foregoing method embodiments, which is likewise not repeated here.
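The division into obtaining, determining, and filtering modules can be pictured as a three-stage pipeline. The sketch below is illustrative only: the class and method names are hypothetical, the recognition engine is represented by a caller-supplied function, and elements are matched by exact rectangle equality purely to keep the example short.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Region = Tuple[int, int, int, int]  # x, y, width, height


@dataclass
class PageElement:
    node_id: str
    rect: Region
    hidden: bool = False


class BusinessObjectFilter:
    def __init__(self, recognize: Callable[[bytes], List[Region]]):
        # `recognize` stands in for the business object recognition engine.
        self._recognize = recognize

    def obtain(self, screenshot: bytes) -> List[Region]:
        """Obtaining module: information of the identified business objects."""
        return self._recognize(screenshot)

    def determine(self, regions: List[Region],
                  elements: List[PageElement]) -> List[PageElement]:
        """Determining module: page elements matching that information."""
        return [e for e in elements if e.rect in regions]

    def filter_out(self, matched: List[PageElement]) -> None:
        """Filtering module: here every matched element is simply hidden."""
        for element in matched:
            element.hidden = True

    def run(self, screenshot: bytes,
            elements: List[PageElement]) -> List[PageElement]:
        matched = self.determine(self.obtain(screenshot), elements)
        self.filter_out(matched)
        return [e for e in elements if not e.hidden]  # the filtered page
```

Keeping the three stages as separate methods mirrors the module boundaries, so any one stage can be replaced without touching the others.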
Example five
Referring to fig. 6, a block diagram of a business object filtering apparatus according to a fifth embodiment of the present invention is shown.
The business object filtering apparatus of this embodiment comprises: an obtaining module 502, configured to obtain information of an identified business object according to the result of business object recognition performed on a page screenshot of a page to be filtered; a determining module 504, configured to determine a page element in the page to be filtered that matches the information of the business object; and a filtering module 506, configured to filter the page element out of the page to be filtered and display the filtered page.
Optionally, the determining module 504 is configured to search for and determine the page element matching the information of the business object according to the node information of the page elements in the page to be filtered.
Optionally, the information of the business object includes coordinate information and/or position information of the business object.
Optionally, the business object filtering apparatus of this embodiment further includes: a generating module 508, configured to, before the obtaining module 502 obtains the information of the identified business object according to the result of business object recognition performed on the page screenshot of the page to be filtered, obtain rendering data of the page to be filtered from a cache of the application in which the page is located, and generate the page screenshot of the page to be filtered from the rendering data.
Optionally, the filtering module 506 is configured to determine, according to the layout information of the page element in the page to be filtered, whether to delete or hide the page element, and to display the filtered page.
Optionally, the filtering module 506 is configured to hide the page element if it determines, according to the layout information of the page element in the page to be filtered, that deleting the page element would change the layout information of other page elements in the page; to delete the page element otherwise; and to display the page after the hiding or deletion.
Optionally, the business object filtering apparatus of this embodiment further includes: an updating module 510, configured to update the business object recognition engine using the page screenshot and the information of the business object.
The business object filtering apparatus of this embodiment is used to implement the corresponding business object filtering methods of the foregoing method embodiments and has the beneficial effects of those method embodiments, which are not described again here. In addition, for the functional implementation of each module in the business object filtering apparatus of this embodiment, reference may be made to the description of the corresponding part of the foregoing method embodiments, which is likewise not repeated here.
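The hide-or-delete decision made by the filtering module 506 can be illustrated with a toy layout model. This is a sketch under simplifying assumptions, not the apparatus's actual layout logic: in-flow boxes are stacked vertically, boxes with `in_flow=False` keep a fixed position, and "hide" is taken to mean making the element invisible while it keeps its space. All names are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Box:
    node_id: str
    height: int
    in_flow: bool = True  # False models a fixed or absolutely positioned element
    top: int = 0          # used only when in_flow is False


def layout(boxes: List[Box]) -> Dict[str, int]:
    """Toy vertical layout: return the top offset of every box."""
    positions: Dict[str, int] = {}
    cursor = 0
    for box in boxes:
        if box.in_flow:
            positions[box.node_id] = cursor
            cursor += box.height
        else:
            positions[box.node_id] = box.top
    return positions


def choose_filter_action(boxes: List[Box], target_id: str) -> str:
    """Return "hide" if deleting the target would move any other box,
    otherwise "delete"."""
    before = layout(boxes)
    after = layout([b for b in boxes if b.node_id != target_id])
    moved = any(before[node_id] != after[node_id] for node_id in after)
    return "hide" if moved else "delete"
```

For example, an in-flow advertisement placed between a header and an article would be hidden, because deleting it would shift the article upward, whereas a floating advertisement with a fixed position could be deleted outright.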
Example six
Referring to fig. 7, a schematic structural diagram of an electronic device according to a sixth embodiment of the present invention is shown, and the specific embodiment of the present invention does not limit the specific implementation of the electronic device.
As shown in fig. 7, the electronic device may include: a processor 602, a communication interface 604, a memory 606, and a communication bus 608.
Wherein:
The processor 602, the communication interface 604, and the memory 606 communicate with one another via the communication bus 608.
The communication interface 604 is used for communicating with other electronic devices or servers.
The processor 602 is configured to execute the program 610 and may specifically perform the relevant steps of the foregoing business object filtering method embodiments.
In particular, the program 610 may include program code comprising computer operating instructions.
The processor 602 may be a central processing unit (CPU), an application-specific integrated circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention. The electronic device comprises one or more processors, which may be of the same type, such as one or more CPUs, or of different types, such as one or more CPUs and one or more ASICs.
The memory 606 is used for storing the program 610. The memory 606 may comprise high-speed RAM and may also include non-volatile memory, such as at least one disk memory.
The program 610 may specifically be configured to cause the processor 602 to perform the following operations: obtaining information of an identified business object according to the result of business object recognition performed on a page screenshot of a page to be filtered; determining a page element in the page to be filtered that matches the information of the business object; and filtering the page element out of the page to be filtered and displaying the filtered page.
In an optional implementation, the program 610 is further configured to cause the processor 602, when determining the page element in the page to be filtered that matches the information of the business object, to search for and determine the page element matching the information of the business object according to the node information of the page elements in the page to be filtered.
In an optional implementation, the information of the business object includes coordinate information and/or position information of the business object.
In an optional implementation, the program 610 is further configured to cause the processor 602, before obtaining the information of the identified business object according to the result of business object recognition performed on the page screenshot of the page to be filtered, to obtain rendering data of the page to be filtered from a cache of the application in which the page is located, and to generate the page screenshot of the page to be filtered from the rendering data.
In an optional implementation, the program 610 is further configured to cause the processor 602, when filtering the page element out of the page to be filtered, to determine whether to delete or hide the page element according to the layout information of the page element in the page to be filtered.
In an optional implementation, the program 610 is further configured to cause the processor 602, when determining whether to delete or hide the page element according to its layout information, to hide the page element if it determines that deleting the page element would change the layout information of other page elements in the page to be filtered, and to delete the page element otherwise.
In an optional implementation, the program 610 is further configured to cause the processor 602 to update the business object recognition engine using the page screenshot and the information of the business object.
For the specific implementation of each step in the program 610, reference may be made to the descriptions of the corresponding steps and units in the foregoing business object filtering method embodiments, which are not repeated here. Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the devices and modules described above may refer to the corresponding process descriptions in the foregoing method embodiments, which are likewise not repeated here.
With the electronic device of this embodiment, business object recognition is first performed on the page screenshot corresponding to the page by means of image recognition, so as to identify the business object to be filtered; the page element corresponding to the information of the identified business object is then determined in the page; and that page element is then filtered out of the page, so that the displayed page no longer contains the business object.
With the electronic device of this embodiment, first, image recognition is used to identify the business object. Because an image recognition algorithm can be continuously optimized as recognition data accumulates, the business object can be identified effectively regardless of how the business object is written. Second, because image recognition identifies the business object accurately, business objects can be handled comprehensively while normal page elements are not filtered out by mistake. Third, because the page elements matching the identified business object are found by searching the page, the business object filtering rules need not be updated even when business objects written in new ways appear, which greatly reduces the implementation cost of business object filtering.
It should be noted that, according to the implementation requirement, each component/step described in the embodiment of the present invention may be divided into more components/steps, and two or more components/steps or partial operations of the components/steps may also be combined into a new component/step to achieve the purpose of the embodiment of the present invention.
The above-described method according to an embodiment of the present invention may be implemented in hardware or firmware, or as software or computer code storable in a recording medium such as a CD-ROM, a RAM, a floppy disk, a hard disk, or a magneto-optical disk, or as computer code originally stored in a remote recording medium or a non-transitory machine-readable medium, downloaded through a network, and stored in a local recording medium, so that the method described herein can be processed by such software stored on a recording medium using a general-purpose computer, a dedicated processor, or programmable or dedicated hardware such as an ASIC or FPGA. It will be appreciated that the computer, processor, microprocessor controller, or programmable hardware includes memory components (e.g., RAM, ROM, flash memory, etc.) that can store or receive software or computer code that, when accessed and executed by the computer, processor, or hardware, implements the business object filtering methods described herein. Further, when a general-purpose computer accesses code for implementing the business object filtering methods illustrated herein, execution of the code transforms the general-purpose computer into a special-purpose computer for performing those methods.
Those of ordinary skill in the art will appreciate that the various illustrative elements and method steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present embodiments.
The above embodiments are only for illustrating the embodiments of the present invention and not for limiting the embodiments of the present invention, and those skilled in the art can make various changes and modifications without departing from the spirit and scope of the embodiments of the present invention, so that all equivalent technical solutions also belong to the scope of the embodiments of the present invention, and the scope of patent protection of the embodiments of the present invention should be defined by the claims.

Claims (16)

1. A business object filtering method comprises the following steps:
acquiring the information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered;
determining page elements matched with the information of the business object in the page to be filtered;
and carrying out filtering processing on the page elements of the page to be filtered, and displaying the filtered page.
2. The method of claim 1, wherein the determining page elements in the page to be filtered that match the information of the business object comprises:
and searching for and determining the page element matching the information of the business object according to the node information of the page elements in the page to be filtered.
3. The method according to claim 1 or 2, wherein the information of the business object comprises coordinate information and/or position information of the business object.
4. The method of claim 1, wherein before the obtaining information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered, the method further comprises:
obtaining rendering data of the page to be filtered from a cache of an application where the page to be filtered is located;
and generating a page screenshot of the page to be filtered according to the rendering data.
5. The method according to claim 1, wherein the filtering the page element on the page to be filtered includes:
and determining whether to delete or hide the page element according to the layout information of the page element in the page to be filtered.
6. The method according to claim 5, wherein the determining whether to delete or hide the page element according to the layout information of the page element in the page to be filtered includes:
hiding the page element if it is determined, according to the layout information of the page element in the page to be filtered, that the layout information of other page elements in the page to be filtered would change after the page element is deleted; otherwise, deleting the page element.
7. The method of claim 1, wherein the method further comprises:
and updating a business object identification engine by using the page screenshot and the information of the business object.
8. A business object filtering apparatus comprising:
the acquisition module is used for acquiring the information of the identified business object according to the identification result of the business object identification of the page screenshot of the page to be filtered;
the determining module is used for determining page elements matched with the information of the business objects in the page to be filtered;
and the filtering module is used for filtering the page elements of the page to be filtered and displaying the filtered page.
9. The apparatus according to claim 8, wherein the determining module is configured to find and determine a page element that matches the information of the business object according to node information of page elements in the page to be filtered.
10. The apparatus according to claim 8 or 9, wherein the information of the business object comprises coordinate information and/or position information of the business object.
11. The apparatus of claim 8, wherein the apparatus further comprises:
the generating module is used for acquiring rendering data of the page to be filtered from a cache of an application where the page to be filtered is located before the acquiring module acquires information of the identified business object according to an identification result of business object identification carried out on a page screenshot of the page to be filtered; and generating a page screenshot of the page to be filtered according to the rendering data.
12. The apparatus according to claim 8, wherein the filtering module is configured to determine to delete or hide the page element according to the layout information of the page element in the page to be filtered, and display the filtered page.
13. The apparatus according to claim 12, wherein the filtering module is configured to, according to the layout information of the page element in the page to be filtered, determine that, after the page element is deleted, the layout information of other page elements in the page to be filtered changes, and hide the page element; otherwise, carrying out deletion processing; and displaying the page after the hiding processing or the deleting processing.
14. The apparatus of claim 8, wherein the apparatus further comprises:
and the updating module is used for updating the business object recognition engine by using the page screenshot and the information of the business object.
15. An electronic device, comprising: a processor, a memory, a communication interface, and a communication bus, wherein the processor, the memory, and the communication interface communicate with one another through the communication bus;
the memory is used for storing at least one executable instruction, and the executable instruction causes the processor to execute the operation corresponding to the business object filtering method according to any one of claims 1-7.
16. A computer storage medium having stored thereon a computer program which, when executed by a processor, implements the business object filtering method of any one of claims 1-7.
CN201910261039.1A 2019-04-02 2019-04-02 Business object filtering method and device, electronic equipment and computer storage medium Pending CN111767454A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910261039.1A CN111767454A (en) 2019-04-02 2019-04-02 Business object filtering method and device, electronic equipment and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910261039.1A CN111767454A (en) 2019-04-02 2019-04-02 Business object filtering method and device, electronic equipment and computer storage medium

Publications (1)

Publication Number Publication Date
CN111767454A true CN111767454A (en) 2020-10-13

Family

ID=72718795

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910261039.1A Pending CN111767454A (en) 2019-04-02 2019-04-02 Business object filtering method and device, electronic equipment and computer storage medium

Country Status (1)

Country Link
CN (1) CN111767454A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination