CN110619103A - Webpage image-text detection method and device and storage medium - Google Patents

Webpage image-text detection method and device and storage medium Download PDF

Info

Publication number
CN110619103A
CN110619103A CN201910882771.0A CN201910882771A CN110619103A CN 110619103 A CN110619103 A CN 110619103A CN 201910882771 A CN201910882771 A CN 201910882771A CN 110619103 A CN110619103 A CN 110619103A
Authority
CN
China
Prior art keywords
target
web page
picture
content
webpage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910882771.0A
Other languages
Chinese (zh)
Inventor
王立颖
康林林
王沅召
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gree Electric Appliances Inc of Zhuhai
Zhuhai Lianyun Technology Co Ltd
Original Assignee
Gree Electric Appliances Inc of Zhuhai
Zhuhai Lianyun Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gree Electric Appliances Inc of Zhuhai, Zhuhai Lianyun Technology Co Ltd filed Critical Gree Electric Appliances Inc of Zhuhai
Priority to CN201910882771.0A priority Critical patent/CN110619103A/en
Publication of CN110619103A publication Critical patent/CN110619103A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce, e.g. shopping or e-commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping

Abstract

The present disclosure provides a method, an apparatus and a storage medium for detecting web page graphics, wherein the method comprises: according to the acquired webpage link address to be detected, a target picture in webpage content and character description information corresponding to the target picture are acquired, identification content corresponding to the target picture is acquired after the target picture is identified, whether the character description information corresponding to the target picture and the identification content are consistent or not is confirmed, and the confirmed result is used as a detection result. Through the arrangement, the error condition of manual checking whether the pictures and texts are consistent is effectively avoided, and the checking accuracy and the working efficiency are improved while the human resources are saved.

Description

Webpage image-text detection method and device and storage medium
Technical Field
The present disclosure relates to the field of computer technologies, and in particular, to a method and an apparatus for detecting web page images and texts, and a storage medium.
Background
With the popularization of the internet, online shopping has become an important channel for shopping and consumption of consumers. The online shopping mall achieves a virtual shop in the process from buying to selling by various means of electronic commerce, thereby reducing intermediate links, eliminating transportation cost and price difference between agents, and leading online shopping to be popular and accepted by wide consumers. People can buy needed things at any time and any place by opening a shopping website or application and browsing commodity pictures. In order to attract the consumers to buy the commodities of the merchants, the most important means is to rely on attractive commodity pictures. The consumer sees the commodity picture first when selecting the commodity, so when the merchant releases the commodity information, the most important step is to check whether the commodity picture is the picture of the character description of the commodity or not, and at present, the check is manually identified.
Disclosure of Invention
The disclosure provides a webpage image-text detection method, a webpage image-text detection device and a storage medium, which are used for solving the problems of error condition and low working efficiency when manually checking whether the website images and texts are consistent.
In order to achieve the above object, in a first aspect of the embodiments of the present disclosure, a method for detecting web page graphics context is provided, including:
acquiring a webpage link address to be detected;
according to the webpage link address, a target picture in webpage content and character description information corresponding to the target picture are obtained;
identifying the target picture to obtain identification content corresponding to the target picture;
and confirming whether the character description information corresponding to the target picture is consistent with the identification content or not, and taking a confirmation result as a detection result.
Optionally, the determining whether the text description information corresponding to the target picture is consistent with the identification content includes:
confirming whether the text description information corresponding to the target picture is matched with the identification content through fuzzy matching;
if the text description information is matched with the identification content, confirming that the text description information is consistent with the identification content;
and if the text description information is not matched with the identification content, confirming that the text description information is inconsistent with the identification content.
Optionally, the obtaining a target picture in web page content according to the web page link address includes:
searching a target element node corresponding to the webpage label from the webpage content corresponding to the webpage link address;
and obtaining a target picture corresponding to the picture link on the target element node according to the resource positioning attribute.
Optionally, the searching for the target element node corresponding to the web tag from the web content corresponding to the web link address includes:
and searching a target element node corresponding to the tag name from the webpage content according to the tag name of the webpage tag.
Optionally, the searching for the target element node corresponding to the web tag from the web content corresponding to the web link address further includes:
and searching a target element node corresponding to the characteristic attribute from the webpage content according to the characteristic attribute of the webpage label.
Optionally, the searching for the target element node corresponding to the web tag from the web content corresponding to the web link address includes:
and searching a target element node corresponding to the characteristic value from the webpage content according to the characteristic value of the preset attribute on the webpage label.
Optionally, the obtaining, according to the web page link address, text description information corresponding to the target picture in the web page content includes:
and obtaining the text description information on the target element node according to the text description attribute.
Optionally, the obtaining a target picture corresponding to the picture link on the target element node according to the resource positioning attribute includes:
obtaining a picture link on the target element node according to the resource positioning attribute;
and when the request resource corresponding to the picture link is confirmed to be a picture resource, taking the picture resource as a target picture.
In a second aspect of the embodiments of the present disclosure, a web page image-text detection apparatus is provided, which includes:
a memory having a computer program stored thereon; and
a processor for executing the computer program in the memory to implement the steps of the method of any of the first aspects above.
In a third aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which a computer program is stored, which when executed by a processor implements the steps of the method of any one of the above first aspects.
By adopting the technical scheme, the following technical effects can be at least achieved:
according to the method and the device, after the webpage link address to be detected is obtained, the target picture in the webpage content and the character description information corresponding to the target picture are obtained according to the webpage link address, the identification content of the target picture is obtained after the target picture is identified, whether the character description information is matched with the identification content is judged, and if the character description information is matched with the identification content, the character description information is determined to be consistent with the identification content, so that the problems of error condition and low working efficiency existing when the website pictures and texts are manually checked to be consistent are solved.
Drawings
The present disclosure will be described in more detail below based on embodiments and with reference to the accompanying drawings. Wherein the included drawings are:
fig. 1 is a schematic flow chart of a web page image-text detection method provided by the present disclosure;
FIG. 2 is a schematic flowchart of step S120 in FIG. 1;
fig. 3 is a block diagram of a web page image-text detection apparatus provided in the present disclosure;
in the drawings, like parts are designated with like reference numerals, and the drawings are not drawn to scale.
Detailed Description
Embodiments of the present disclosure will be described in detail with reference to the accompanying drawings and examples, so that how to apply technical means to solve technical problems and achieve the corresponding technical effects can be fully understood and implemented. Various modifications may be made and equivalents may be substituted for elements thereof without departing from the scope of the disclosure. The embodiments and various features in the embodiments of the present application can be combined with each other without conflict, and the formed technical solutions are all within the protection scope of the present disclosure.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
In the related art, the pictures and descriptive texts of the commodities of the selling websites are usually recognized by human eyes.
The inventor of the present disclosure finds that, by manually checking whether the images and texts of the website are consistent, human resources are wasted. And when manual inspection is inevitable and has errors, the accuracy cannot be ensured, and the working efficiency is reduced. Therefore, effective automatic inspection means are urgently needed to be adopted to inspect the pictures and texts of the webpage, so that the accuracy is ensured, and the working efficiency is improved.
Example one
The invention provides a webpage image-text detection method, which aims to solve the problems of errors and low working efficiency caused by manual identification of whether website images and texts are consistent or not in the prior art. Fig. 1 is a schematic flow chart of a web page image-text detection method provided by the present disclosure, and as shown in fig. 1, the web page image-text detection method mainly includes steps S110 to S140.
In step S110, a web page link address to be detected is acquired.
The web page link address to be detected may be input by a user or queried by the system.
The webpage link address can comprise webpage addresses of various network resources and jump addresses of local resources in different formats, different attributes and different positions.
In step S120, a target picture in the web content and the text description information corresponding to the target picture are obtained according to the web link address. As shown in fig. 2, the process of acquiring the target picture may include the following steps:
step S1201, searching for a target element node corresponding to the web tag from the web content corresponding to the web link address.
Step S1202, according to the resource positioning attribute, a target picture corresponding to the picture link on the target element node is obtained.
In step S1201, corresponding web page content is obtained according to the web page link address, where the web page content includes a hypertext link, a picture, audio, a video, a text, and the like. Since the web page is a plain text file containing HTML tags. The HTML tag comprises various tag names, attributes on the tag and attribute values corresponding to the attributes on the tag. Different HTML labels correspond to different element nodes of the webpage, and corresponding target element nodes can be found through APIs provided by different programming languages and CSS selectors. The target element corresponding to the picture resource can be identified by a special tag name, a characteristic attribute of the webpage tag and a characteristic value of a preset attribute on the webpage tag.
The CSS selector comprises at least one of a webpage label, an attribute on the label and an attribute value corresponding to the attribute on the label.
The different program languages may be, but are not limited to, JS or Python, and are not specifically limited herein and may be set according to actual requirements.
And when the identification of the target element is the tag name, searching a target element node corresponding to the tag name from the webpage content according to the tag name of the webpage tag.
For example, the webpage usually displays the picture by using the < img > tag, so that the < img > tag can be used as the identifier of the target element corresponding to the picture resource, and the target element node corresponding to the picture resource can be found by using the tag name of the img and the corresponding API.
When the identification of the target element is the feature attribute of the web page label, the web page may display the picture by using some common labels without picture features, and at this time, the feature attribute of the common label may be used as the identification of the target element, and the target element node corresponding to the label name may be searched from the web page content according to the feature attribute of the common label of the web page label.
For example, if some pictures in a web page do not display pictures by using < img > tags, but use general tags such as < div >, if an element node is sought by using the < div > tags, most of the sought element nodes may be text element nodes that are not needed in this embodiment, at this time, a feature attribute data-img is added to the < div > tag (this is because the web page tag is allowed to set a custom attribute, and the custom attribute is agreed to be only used for displaying the web page tag at the innermost layer of the pictures), and the target element node corresponding to the picture resource can be sought by looking up the attribute data-img.
When the identification of the target element is the characteristic value of the preset attribute on the webpage label, the webpage may display the picture by some common labels without picture characteristics, and at this time, the characteristic value of the preset attribute on the webpage label may be used as the identification of the target element, and the target element node corresponding to the label name is searched from the webpage content according to the characteristic value of the preset attribute on the webpage label.
For example, some pictures in the web page do not adopt < img > tags to display the pictures, and do not adopt characteristic attributes such as data-img as the identifiers of the picture elements, at this time, a characteristic value flag-img is added to some attributes (for example, class attributes) of the common tags as the identifiers of the picture elements, and the characteristic value is agreed to be only used for displaying the web page tags at the innermost layer of the pictures, and at this time, the target element node corresponding to the picture resource can be found only by searching the attribute value of the characteristic value flag-img.
Therefore, all picture resources needing to be searched in the webpage can be comprehensively inquired, and the condition of picture omission is effectively avoided.
And after the target element node is obtained, executing step S1202, wherein the target element node comprises a resource positioning attribute storing a target resource address, obtaining a picture link on the target element node according to the resource positioning attribute, and obtaining a corresponding target picture according to the picture link.
The resource location attribute may be, but is not limited to, an src attribute or a url attribute, and may be set according to an actual requirement, which is not specifically limited herein.
For example, if a webpage displays a picture by using an < img > tag, a picture link can be obtained through an src attribute on the < img > tag (the value of the src attribute is the picture link); if the webpage adopts a < div > tag to display the picture as a background picture, a picture link can be obtained through url attribute; and obtaining the corresponding target picture according to the picture link.
It should be noted that the link corresponding to the resource location attribute on the target element node found in the above steps may not be a picture link, and at this time, it is necessary to determine whether the filename suffix of the resource downloaded through the link is the filename suffix corresponding to the picture, and if the filename suffix of the resource corresponding to the link is the picture filename suffix, the resource is taken as the target picture; and if not, the resource obtained by the link is not subjected to the next identification processing.
The picture filename suffix can be, but is not limited to, ". jpg", ". jpeg", ". gif", and ". png", and is set according to actual requirements, and is not specifically limited herein.
In step S130, the target picture is identified to obtain an identification content corresponding to the target picture.
Optionally, in this embodiment, the identification content corresponding to the target picture is obtained after the target picture is identified by a fuzzy identification technology.
Firstly, obtaining an original data set to be identified of a picture, namely the obtained target picture, extracting and identifying the features of the target picture, calling a function of a fuzzy identification feature library to calculate the membership between the original data set and a reference set (the reference set is a preset sample set containing a plurality of feature subsets), and thus forming a membership set. If there is a number a (x) in the interval [0,1] for any element x of the study range U, a is the reference set on U, a (x) is the membership of x to a, which can be said to be equivalent to the probability that the original data set falls into a subset, and when x varies in U, a (x) is a function, called the membership function of a. The closer the degree of membership A (x) is to 1, the higher the degree to which x belongs to A. Then, a function in the fuzzy recognition dynamic library is called to transmit the membership set as a parameter to obtain a reference set with a smaller range, the process is circulated until the membership value obtained by calculation is smaller than the initially set target matching value, and a matching result is obtained at the moment. And finally, calling a function of the fuzzy recognition feature library, and transmitting the subscript of the obtained matching result as a parameter into a data set of the fuzzy recognition feature library to obtain the recognition content.
For example, if the picture is a white washing machine picture, the identification content obtained by fuzzy identification is "white washing machine" or "washing machine".
When the picture is released, the website can automatically extract the keywords of the description information of the picture and then store the extracted information on the character description attribute of the label corresponding to the picture. In the above steps, the target element node is found in various ways, and at this time, the text description information corresponding to the target picture can be obtained according to the specific text description attribute.
For example, if a certain washing machine picture is displayed by using an < img > tag, an alt text description attribute exists on the < img > tag, and the text description attribute "washing machine" corresponding to the washing machine picture is found through the alt attribute.
In step S140, it is determined whether the text description information and the identification content, which all correspond to the target picture, are consistent, and the determination result is used as a detection result.
Matching the character description information and the identification content which both correspond to the target graph through a preset algorithm, if the character description information is matched with the identification content, determining that the character description information is consistent with the identification content, and the detection result is image-text consistency, if the character description information is not matched with the identification content, determining that the character description information is inconsistent with the identification content, and the detection result is image-text inconsistency.
Optionally, in this embodiment, the preset algorithm may be a fuzzy matching algorithm.
For example, for a picture of a washing machine, the corresponding identification content is "white washing machine", the corresponding text description information is "washing machine", and if the keyword of the "white washing machine" is the same as the keyword of the "washing machine" through fuzzy matching, it is determined that the "white washing machine" is matched with the "washing machine", and accordingly, it can be determined that the text description information is consistent with the identification content, and at this time, the detection result is that the texts and the texts are inconsistent.
By applying the webpage image-text detection method, a target picture in webpage content and character description information corresponding to the target picture can be obtained according to the obtained webpage link address to be detected, the target picture is subjected to fuzzy recognition to obtain recognition content corresponding to the target picture, whether the character description information and the recognition content which are both corresponding to the target picture are matched or not is confirmed, and the confirmation result is used as the detection result of whether the webpage images and texts are consistent or not. Through the arrangement, the error condition of manual checking whether the pictures and texts are consistent can be effectively avoided, and the checking accuracy and the working efficiency are improved while the manpower resources are saved.
Example two
The embodiment provides a web page image-text detection device, which can apply the web page image-text detection method, and comprises the following steps:
a memory having a computer program stored thereon; and
a processor for executing the computer program in the memory to implement the steps of the web page teletext detection method according to any one of the above-mentioned alternative embodiments.
Fig. 3 is a block diagram of a web page graph-text detection apparatus 400 provided by the present disclosure, and as shown in fig. 3, the web page graph-text detection apparatus 400 may include: a processor 401, a memory 402, a multimedia component 403, an input/output (I/O) interface 404, and a communication component 405.
The processor 401 is configured to control the overall operation of the apparatus 400, so as to complete all or part of the steps of the web page image and text detection method. The memory 402 is used to store various types of data to support operation of the apparatus 400, and such data may include, for example, instructions for any application or method operating on the apparatus 400, as well as application-related data. The Memory 402 may be implemented by any type of volatile or non-volatile Memory device or combination thereof, such as Static Random Access Memory (SRAM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Erasable Programmable Read-Only Memory (EPROM), Programmable Read-Only Memory (PROM), Read-Only Memory (ROM), magnetic Memory, flash Memory, magnetic disk or optical disk. The multimedia components 403 may include a screen and an audio component. Wherein the screen may be, for example, a touch screen and the audio component is used for outputting and/or inputting audio signals. For example, the audio component may include a microphone for receiving external audio signals. The received audio signal may further be stored in the memory 402 or transmitted through the communication component 405. The audio assembly also includes at least one speaker for outputting audio signals. The I/O interface 404 provides an interface between the processor 401 and other interface modules, such as a keyboard, mouse, buttons, etc. These buttons may be virtual buttons or physical buttons. The communication component 405 is used for wired or wireless communication between the apparatus 400 and other devices. Wireless Communication, such as Wi-Fi, bluetooth, Near Field Communication (NFC), 2G, 3G, or 4G, or a combination of one or more of them, so that the corresponding Communication component 405 may include: Wi-Fi module, bluetooth module, NFC module.
In this embodiment, the apparatus 400 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, microcontrollers, microprocessors or other electronic components, and is used to perform the web page graph detection method.
EXAMPLE III
The present embodiment provides a storage medium having a computer program stored thereon, where the computer program can be executed by one or more processors to implement the web page teletext detection method described in the first embodiment.
The method implemented when the computer program of the web page image-text detection method run on the processor is executed may refer to the specific embodiment of the web page image-text detection method of the present disclosure, and details are not described herein again.
The processor may be an integrated circuit chip having information processing capabilities. The processor may be a general-purpose processor including a Central Processing Unit (CPU), a Network Processor (NP), and the like.
It should be understood that the disclosed methods and apparatus may be implemented in other ways. The apparatus embodiments described above are merely illustrative, and for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Although the embodiments disclosed in the present disclosure are described above, the descriptions are only for the convenience of understanding the present disclosure, and are not intended to limit the present disclosure. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the disclosure, and that the scope of the disclosure is to be limited only by the appended claims.

Claims (10)

1. A webpage image-text detection method is characterized by comprising the following steps:
acquiring a webpage link address to be detected;
according to the webpage link address, a target picture in webpage content and character description information corresponding to the target picture are obtained;
identifying the target picture to obtain identification content corresponding to the target picture;
and confirming whether the character description information corresponding to the target picture is consistent with the identification content or not, and taking a confirmation result as a detection result.
2. The web page image-text detection method according to claim 1, wherein the confirming whether the text description information and the identification content, both of which correspond to the target picture, are consistent comprises:
confirming whether the character description information corresponding to the target picture is matched with the identification content through fuzzy matching;
if the text description information is matched with the identification content, confirming that the text description information is consistent with the identification content;
and if the text description information is not matched with the identification content, confirming that the text description information is inconsistent with the identification content.
3. The method for detecting the web page graphics context according to claim 1, wherein the obtaining the target picture in the web page content according to the web page link address comprises:
searching a target element node corresponding to the webpage label from the webpage content corresponding to the webpage link address;
and obtaining a target picture corresponding to the picture link on the target element node according to the resource positioning attribute.
4. The web page image-text detection method according to claim 3, wherein the searching for the target element node corresponding to the web page tag from the web page content corresponding to the web page link address comprises:
and searching a target element node corresponding to the tag name from the webpage content according to the tag name of the webpage tag.
5. The web page image-text detection method according to claim 3, wherein the searching for the target element node corresponding to the web page tag from the web page content corresponding to the web page link address comprises:
and searching a target element node corresponding to the characteristic attribute from the webpage content according to the characteristic attribute of the webpage label.
6. The web page image-text detection method according to claim 3, wherein the searching for the target element node corresponding to the web page tag from the web page content corresponding to the web page link address comprises:
and searching a target element node corresponding to the characteristic value from the webpage content according to the characteristic value of the preset attribute on the webpage label.
7. The method for detecting the web page image-text according to claim 3, wherein the obtaining the text description information corresponding to the target picture in the web page content according to the web page link address comprises:
and obtaining the text description information on the target element node according to the text description attribute.
8. The web page image-text detection method according to claim 3, wherein the obtaining of the target image corresponding to the image link on the target element node according to the resource positioning attribute comprises:
obtaining a picture link on the target element node according to the resource positioning attribute;
and when the request resource corresponding to the picture link is confirmed to be a picture resource, taking the picture resource as a target picture.
9. A web page image-text detection device is characterized by comprising:
a memory having a computer program stored thereon; and
a processor for executing the computer program in the memory to carry out the steps of the method of any one of claims 1 to 8.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 8.
CN201910882771.0A 2019-09-18 2019-09-18 Webpage image-text detection method and device and storage medium Pending CN110619103A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910882771.0A CN110619103A (en) 2019-09-18 2019-09-18 Webpage image-text detection method and device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910882771.0A CN110619103A (en) 2019-09-18 2019-09-18 Webpage image-text detection method and device and storage medium

Publications (1)

Publication Number Publication Date
CN110619103A true CN110619103A (en) 2019-12-27

Family

ID=68923413

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910882771.0A Pending CN110619103A (en) 2019-09-18 2019-09-18 Webpage image-text detection method and device and storage medium

Country Status (1)

Country Link
CN (1) CN110619103A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112036521A (en) * 2020-11-09 2020-12-04 北京沃东天骏信息技术有限公司 Information consistency detection method, device, equipment and storage medium
CN112187949A (en) * 2020-10-09 2021-01-05 珠海格力电器股份有限公司 Picture batch downloading method and device, storage medium and electronic device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112187949A (en) * 2020-10-09 2021-01-05 珠海格力电器股份有限公司 Picture batch downloading method and device, storage medium and electronic device
CN112036521A (en) * 2020-11-09 2020-12-04 北京沃东天骏信息技术有限公司 Information consistency detection method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN110619103A (en) Webpage image-text detection method and device and storage medium
US8224823B1 (en) Browsing history restoration
US10235712B1 (en) Generating product image maps
US10489475B2 (en) Presentation of information on multiple devices
CN104036011B (en) Webpage element display method and browser device
CN106687949A (en) Search results for native applications
US20090199077A1 (en) Creating first class objects from web resources
US20150227276A1 (en) Method and system for providing an interactive user guide on a webpage
US20130262463A1 (en) Method and system to provide smart tagging of search input
US20190188729A1 (en) System and method for detecting counterfeit product based on deep learning
US20140337699A1 (en) Method and apparatus for extracting web page content
CN104346464A (en) Processing method and device of webpage element information and browser client
US10452723B2 (en) Detecting malformed application screens
CN104462590A (en) Information searching method and device
CN107329981B (en) Page detection method and device
CN106919711B (en) Method and device for labeling information based on artificial intelligence
CN109451333B (en) Bullet screen display method, device, terminal and system
CN109684015B (en) Interface data loading method and device, electronic equipment and storage medium
KR102208027B1 (en) Operation method of terminal, terminal, and phone number information server
CN104965912A (en) Information acquisition method and apparatus
US11113461B2 (en) Generating edit suggestions for transforming digital documents
CN107622135B (en) Method and apparatus for displaying information
US20210120074A1 (en) Browser management system, browser management method, browser management program, and client program
US20210042384A1 (en) Generating Edit Suggestions for Transforming Digital Documents
CN110399063B (en) Method and device for viewing page element attributes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination