CN110365776B - Picture batch downloading method and device, electronic equipment and storage medium - Google Patents

Picture batch downloading method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN110365776B
CN110365776B CN201910646745.8A CN201910646745A CN110365776B CN 110365776 B CN110365776 B CN 110365776B CN 201910646745 A CN201910646745 A CN 201910646745A CN 110365776 B CN110365776 B CN 110365776B
Authority
CN
China
Prior art keywords
picture
pictures
downloaded
picture address
addresses
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910646745.8A
Other languages
Chinese (zh)
Other versions
CN110365776A (en
Inventor
许蕾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BOE Technology Group Co Ltd filed Critical BOE Technology Group Co Ltd
Priority to CN201910646745.8A priority Critical patent/CN110365776B/en
Publication of CN110365776A publication Critical patent/CN110365776A/en
Application granted granted Critical
Publication of CN110365776B publication Critical patent/CN110365776B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558Details of hyperlinks; Management of linked annotations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]

Abstract

The invention discloses a method and a device for downloading pictures in batch, electronic equipment and a storage medium; the method comprises the following steps: determining a target webpage, and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage; generating a picture address set according to a plurality of picture addresses; accessing and downloading each picture address in the picture address set to obtain a plurality of pictures; storing a plurality of said pictures. According to the invention, a plurality of picture addresses corresponding to a plurality of pictures on the target webpage are obtained and then fused into a picture address set comprising a plurality of picture addresses, and then access, download and storage are carried out according to the picture address set, so that fast, simple and efficient picture batch downloading is realized.

Description

Picture batch downloading method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for downloading pictures in batch, an electronic device, and a storage medium.
Background
With the development of information technology and the internet, people have moved from the information-deficient era to the information-overloaded era. When a user browses a webpage on a browser by using a network, the user can find a large number of pictures to be stored, and the pictures can be used as materials and the like. At present, pictures appearing on a webpage are all public and can be downloaded by a user. However, the user needs to manually click the pictures to download the pictures by opening the web pages through the browser, which is cumbersome and time-consuming.
Disclosure of Invention
In view of this, the present invention provides a method, an apparatus, an electronic device and a storage medium for downloading pictures in batch, which can realize batch downloading of pictures quickly, easily and efficiently.
Based on the above purpose, the invention provides a method for downloading pictures in batch, which comprises the following steps:
determining a target webpage, and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage;
generating a picture address set according to a plurality of picture addresses;
accessing and downloading each picture address in the picture address set to obtain a plurality of pictures;
storing a plurality of said pictures.
In addition, the invention also provides a device for downloading pictures in batches, which comprises:
the system comprises an acquisition module, a display module and a display module, wherein the acquisition module is used for determining a target webpage and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage;
the fusion module is used for generating a picture address set according to the plurality of picture addresses;
the downloading module is used for accessing and downloading each picture address in the picture address set to obtain a plurality of pictures;
and the storage module is used for storing a plurality of pictures.
Furthermore, the present invention provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method as described above when executing the program.
Furthermore, the present invention also provides a non-transitory computer-readable storage medium, characterized in that the non-transitory computer-readable storage medium stores computer instructions for causing the computer to execute the method as described above.
As can be seen from the above description, the method, the device, the electronic device, and the storage medium for batch downloading of pictures provided by the present invention obtain a plurality of picture addresses corresponding to a plurality of pictures on a target webpage, and then fuse the plurality of picture addresses into a picture address set including the plurality of picture addresses, so as to access, download, and store the plurality of picture addresses according to the picture address set, thereby realizing fast, simple, and efficient batch downloading of pictures.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flowchart of a method for downloading pictures in batch according to an embodiment of the present invention;
FIG. 2 is a flowchart of a method for obtaining a picture address according to an embodiment of the present invention;
FIG. 3 is a flowchart illustrating the deduplication processing steps performed during generation of a picture address set according to an embodiment of the present invention;
FIG. 4 is a flowchart of a first verification step in an embodiment of the present invention;
FIG. 5 is a flowchart illustrating a second verification step in accordance with an embodiment of the present invention;
FIG. 6 is a schematic structural diagram of a device for downloading pictures in batch according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of a device for downloading pictures in batch according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.
It is to be noted that technical terms or scientific terms used in the embodiments of the present invention should have the ordinary meanings as understood by those having ordinary skill in the art to which the present disclosure belongs, unless otherwise defined. The use of "first," "second," and similar terms in this disclosure is not intended to indicate any order, quantity, or importance, but rather is used to distinguish one element from another. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items. The terms "connected" or "coupled" and the like are not restricted to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "upper", "lower", "left", "right", and the like are used merely to indicate relative positional relationships, and when the absolute position of the object being described is changed, the relative positional relationships may also be changed accordingly.
The embodiment of the invention provides a method for downloading pictures in batches, which comprises the following steps with reference to fig. 1:
step 101, determining a target webpage, and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage.
In this step, a target web page is first determined. The target webpage is any internet page accessible to the user on the internet. The target web pages generally include pictures for content presentation, and the pictures on the target web pages are download objects.
And after the target webpage is determined, further acquiring the picture address of the picture on the target webpage. The crawler technology can be selected according to a specific picture address acquisition mode. The crawler technology is to capture a picture website on a webpage based on a preset picture address extraction rule corresponding to a crawler program or a script. For example, a CrawSpider type spider in a script frame is adopted to set a grabbing rule corresponding to a picture website, and then after a target webpage is accessed, the picture address of a picture on the target webpage is extracted through a LinkExtractor script. Obviously, according to specific implementation requirements, the picture address of the picture on the target webpage can be acquired through other data acquisition modes.
In this embodiment, the picture address is a URL (Uniform Resource Locator). In the following embodiments, the description will be made by taking the picture address as the URL address. Obviously, the picture address may be in other forms according to different data systems, i.e. it should not be understood that the present application only defines the picture address as a URL address.
It should be noted that the number of target web pages may be one or more. For each target webpage, the pictures to be downloaded may be all pictures thereon or may be partial pictures. The number of the target web pages and the number of the pictures on the target web pages needing to be downloaded can be determined by receiving instructions of a user or reading a setting file preset by the user.
And 102, generating a picture address set according to the plurality of picture addresses.
In this step, based on the multiple picture addresses acquired in step 101, the multiple picture addresses are integrated to form a picture address set including the multiple picture addresses. The specific implementation form of the picture address set may be that a plurality of picture addresses are stored in a database, such as a mySQL database, as a data set; alternatively, the plurality of picture addresses may be generated as a file, such as a text file, a table file, and the like, and the file may be stored locally or in the cloud. In any way, the picture address set in the step integrates a plurality of picture addresses, so that the subsequent data transmission and data processing are facilitated.
And 103, accessing and downloading each picture address in the picture address set to obtain a plurality of pictures.
In this step, based on the picture address set generated in step 102, each picture address in the picture address set is accessed, and a picture corresponding to the picture address is downloaded after the access, so as to finally obtain a plurality of pictures to be downloaded.
When downloading, the operations of accessing the picture address and downloading the picture can be executed in a background execution mode, namely, in a mode that a user does not sense the picture. Some websites set up a mask for background data operations on the web page, which may cause failure of the picture download. Therefore, the picture downloading can be realized in a mode of simulating user operation. Specifically, for the simulation of the operation of accessing the picture address, a webpage testing tool can be called, and the picture address can be accessed through the webpage testing tool; for example, a web page test tool may choose webdriver. For the simulation of the picture downloading operation, an input device simulation tool can be called, and the downloading operation is simulated through the input device simulation tool so as to download and obtain the picture. For example, the mouse action simulation can be completed through PyMouse, and the specific simulated action may include selecting an address bar for input, right-clicking a picture to call out a menu, selecting a download option in the menu, and the like; the keyboard action simulation can be completed by the keyboard simulation tool PyKeyboard, and the specific simulation action can comprise selecting corresponding options or functions by keyboard shortcut keys, such as pressing "v" to select a download option, and pressing "alt" + "s" to select a save function.
And 104, storing a plurality of pictures.
In this step, the obtained plurality of pictures are stored. Specifically, the storage location of the picture is set, and the picture can be locally or further uploaded to a cloud storage. In the storage position, a plurality of pictures downloaded from the same target webpage can be stored in the same storage position, so that the pictures can be conveniently searched and classified.
Other related attributes of the picture, such as picture name, picture format, etc., may also be set during storage. For example, a picture is named according to the following format: the last segmentation data obtained after '/' segmentation in the URL of the 'website domain name' + '+' picture; the naming mode can intuitively obtain certain related information about the pictures from the picture names on one hand, and can ensure that the naming of each picture is unique on the other hand, so that the operations such as searching the pictures and the like are facilitated.
In addition, when a plurality of pictures are stored, a log file can be generated according to the execution of the above steps. The log file records the relevant information of the plurality of downloaded and stored pictures. For example, for each picture, the log file records therein: and acquiring a source website of the picture address, the picture name, the picture storage address and other available information and the like. When stored, the picture and the log file may be stored in the same storage location, such as both locally; or the images can be stored in different storage positions respectively, for example, the images are uploaded and stored in a cloud, and the log files are stored locally.
It can be seen that, in the method for downloading pictures in batches according to the embodiment, after a plurality of picture addresses corresponding to a plurality of pictures on a target webpage are obtained, the picture addresses are fused into a picture address set comprising the plurality of picture addresses, and then access, download and storage are performed according to the picture address set, so that fast, simple and efficient picture batch downloading is realized.
In an optional embodiment, referring to fig. 2, the step of obtaining a plurality of picture addresses corresponding to a plurality of pictures on the target webpage includes:
step 201, accessing a database of the target webpage.
In the step, a database of the background of the target webpage is accessed, and a picture address corresponding to a picture on the target webpage is searched and obtained from a data structure forming the target webpage.
Step 202, determining a data segment for recording the picture address in the database of the target webpage according to a preset rule.
For target web pages based on different data construction technologies, different types of data are respectively allocated with a given data position in a corresponding background data structure. Therefore, in this step, a data position setting rule preset in the data structure of the web page under the technology is determined by the data construction technology based on which the target web page is identified, and further, a preset rule corresponding to the picture is determined. Then, according to the preset rule, a data segment for recording the picture address can be determined in the database of the target webpage.
Step 203, obtaining a plurality of picture addresses corresponding to a plurality of pictures on the target webpage according to the data segment for recording the picture addresses.
In this step, according to the data segment for recording the picture address determined in step 202, the data segment in the database of the target webpage is accessed, so as to obtain the picture address corresponding to the picture on the target webpage.
In an optional embodiment, the step of generating the picture address set according to a plurality of picture addresses further includes a step of removing duplicate of the picture addresses. Among the acquired multiple picture addresses, there may be the same picture address, which may cause multiple repeated downloads of the same picture. In this embodiment, a deduplication step is added when a picture address set is generated, and only one duplicated picture address can be reserved in a plurality of acquired picture addresses through the deduplication step, that is, each picture address in the picture address set is unique.
Specifically, referring to fig. 3, the step of removing the duplicate of the picture address includes:
step 301, comparing the plurality of picture addresses pairwise, and determining whether at least two same picture addresses exist.
In this step, two-by-two comparison is performed on all the picture addresses in the picture address set, that is, any two picture addresses are compared, and whether the two picture addresses are the same or not is checked. After the same picture addresses are obtained through comparison, the same picture addresses can be associated through grouping, setting labels and the like. If the same picture addresses are obtained through comparison, the number of the same picture addresses is at least two; obviously, the same number of picture addresses may also be more than two.
Step 302, if there are at least two same picture addresses, one of the at least two same picture addresses is reserved, and the rest are deleted.
In this step, only one of the same picture addresses is reserved, and the rest of the same picture addresses are deleted from the picture address set, so as to realize the duplicate removal of the picture addresses.
After the deduplication step of this embodiment, each picture address in the picture address set is unique, so that it can be ensured with a high probability that there is no duplicate picture in a plurality of subsequently downloaded pictures.
In an optional embodiment, after storing the plurality of pictures, a first checking step for the pictures is further included, and it is determined whether the pictures corresponding to each picture address included in the picture address set have been successfully downloaded and successfully stored through the first checking step, and if there are pictures that have not been successfully downloaded and successfully stored, the pictures are re-downloaded and stored again.
Specifically, referring to fig. 4, the first verifying step includes:
step 401, according to the accessed picture address, generating a downloaded picture address set.
In this step, the picture addresses included in the downloaded picture address set are recorded in the process of downloading after accessing the picture addresses. If the access of the picture address is successfully realized and the downloading operation is successfully realized, correspondingly recording the picture address into a downloaded picture address set; for the picture address access or the downloading operation which is not successfully realized, the corresponding picture address is not recorded into the downloaded picture address set.
Step 402, calculating a difference set of the picture address set and the downloaded picture address set, and generating a first un-downloaded picture address set.
The picture address set is the picture address of all pictures to be downloaded, the downloaded picture address set is the picture address of the downloaded picture, and the difference set of the two is that the picture address corresponding to all pictures which are not successfully accessed or not successfully downloaded is recorded in the first un-downloaded picture address set.
Step 403, judging whether the first un-downloaded picture address set is empty; if yes, go to step 404, otherwise go to step 405.
And step 404, if the first un-downloaded picture address set is empty, ending the verification.
If the first un-downloaded picture address set is empty, it indicates that all pictures corresponding to all picture addresses in the picture address set have been downloaded, and at this time, the first verification step of this embodiment may be ended.
Step 405, if the first un-downloaded picture address set is not empty, accessing and downloading each picture address of the first un-downloaded picture address set.
If the first un-downloaded picture address set is not empty, it indicates that pictures corresponding to some picture addresses in the picture address set are not successfully downloaded, and the picture writing address is recorded in the generated first un-downloaded picture address set. At this time, in the method of this embodiment, in the step of generating the picture address set, the first un-downloaded picture address set is used to replace the originally generated picture address set, and the subsequent steps are executed to access the picture addresses in the first un-downloaded picture address set for downloading and storing. The above process is repeated until the first un-downloaded picture address set is empty, and the first verification step of this embodiment is ended.
In some cases, due to the fact that a picture address is wrong or a downloading operation is forbidden, a picture corresponding to a certain picture address cannot be successfully downloaded all the time, that is, after the steps are repeatedly executed for many times, the first un-downloaded picture address set is not empty all the time, at this time, the remaining picture addresses in the first un-downloaded picture address set can be deleted, prompt information is generated, and a user is notified to perform corresponding processing.
Further, in the process of downloading the picture, even if the picture address can be successfully accessed and the downloading operation is successfully performed, an error may still occur in the process of storing the picture, so that the picture is not successfully stored. Therefore, in this embodiment, after the first verification step, a second verification step may be performed. It is further verified by a second verification step whether the picture to be downloaded has been successfully stored.
Specifically, referring to fig. 5, the second checking step includes:
step 501, extracting picture names of pictures corresponding to the picture addresses in the downloaded picture address set, and generating a downloaded picture name set.
In this step, whether the picture is successfully stored is judged according to the name of the picture. Firstly, according to the picture addresses in the downloaded picture address set, pictures corresponding to the picture addresses are recorded as successfully downloaded. And for the picture addresses, extracting the picture names of the pictures corresponding to the picture addresses, and generating a downloaded picture name set.
Step 502, extracting the picture names of the stored pictures to generate a stored picture name set.
In this step, a storage location (local or cloud) of the downloaded picture is accessed, and picture names of the pictures stored in the storage location are extracted to generate a stored picture name set. Each picture name in the stored picture name set corresponds to a picture that has been actually stored in a corresponding storage location.
Step 503, calculating a difference set of the downloaded picture name set and the stored picture name set, determining a picture address corresponding to a picture name included in the difference set, and generating a second un-downloaded picture address set.
In this step, the pictures corresponding to the picture names included in the difference set of the downloaded picture name set and the stored picture name set indicate that the pictures are recorded as successfully downloaded but are not actually stored. And further integrating the picture addresses corresponding to the picture names in the difference set to generate a second un-downloaded picture address set.
Step 504, determining whether the second un-downloaded picture address set is empty, if yes, performing step 505, otherwise, performing step 506.
And 505, if the second un-downloaded picture address set is empty, ending the verification.
If the second un-downloaded picture address set is empty, it indicates that the pictures recorded as successfully downloaded are also successfully stored, and the second verification step of this embodiment may be ended.
Step 506, if the second un-downloaded picture address set is not empty, accessing and downloading each picture address of the second un-downloaded picture address set.
If the second un-downloaded picture address set is not empty, it indicates that some pictures recorded as successfully downloaded are not successfully stored in reality, and the picture addresses corresponding to the pictures are recorded in the second un-downloaded picture address set. At this time, in the method of this embodiment, in the step of generating the picture address set, the second un-downloaded picture address set is used to replace the originally generated picture address set, and the subsequent steps are executed to access the picture addresses in the second un-downloaded picture address set for downloading and storing. The above process is repeated until the second un-downloaded picture address set is empty, and the second verification step of this embodiment is ended.
Therefore, according to the method for downloading the pictures in batches, the results of the picture batch downloading are verified through the first verification step and the second verification step, and the accuracy and the integrity of the picture batch downloading can be effectively guaranteed.
Based on the same inventive concept, an embodiment of the present invention further provides a device for downloading pictures in batches, where, referring to fig. 6, the device includes:
an obtaining module 601, configured to determine a target webpage and obtain a plurality of picture addresses corresponding to a plurality of pictures on the target webpage;
a fusion module 602, configured to generate a picture address set according to a plurality of picture addresses;
a downloading module 603, configured to access and download each picture address in the picture address set to obtain multiple pictures;
a storage module 604, configured to store a plurality of pictures.
In an optional embodiment, the obtaining module 601 is specifically configured to: accessing a database of the target web page; determining a data segment for recording a picture address in a database of the target webpage according to a preset rule; and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage according to the data segment for recording the picture addresses.
In an optional embodiment, the fusion module 602 is specifically configured to: comparing the plurality of picture addresses pairwise to determine whether at least two same picture addresses exist; if at least two same picture addresses exist, one of the at least two same picture addresses is reserved, and the rest are deleted.
In an optional embodiment, the downloading module 603 is specifically configured to: calling a webpage testing tool, and accessing the picture address in the picture address set through the webpage testing tool; and calling an input equipment simulation tool, and simulating a downloading operation through the input equipment simulation tool so as to download and obtain the picture.
In an optional embodiment, referring to fig. 7, the apparatus for downloading pictures in bulk further includes:
a first checking module 605, configured to generate a downloaded picture address set according to the accessed picture address; calculating a difference set of the picture address set and the downloaded picture address set to generate a first un-downloaded picture address set; and if the first un-downloaded picture address set is not empty, accessing and downloading each picture address of the first un-downloaded picture address set.
Further, the device for downloading pictures in batches according to the present embodiment further includes:
a second check module 606, configured to extract picture names of the stored multiple pictures and generate a picture name set with successful downloading; according to the accessed picture address, extracting a picture name of a picture corresponding to the picture address, and generating a downloaded picture name set; calculating a difference set of the downloaded picture name set and the downloaded picture name set, determining a picture address corresponding to a picture name included in the difference set, and generating a second un-downloaded picture address set; and if the second un-downloaded picture address set is not empty, accessing and downloading each picture address of the second un-downloaded picture address set.
The apparatus of the foregoing embodiment is used to implement the corresponding method in the foregoing embodiment, and has the beneficial effects of the corresponding method embodiment, which are not described herein again.
Based on the same inventive concept, an embodiment of the present invention further provides an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the method for downloading the pictures in batch according to any of the above embodiments when executing the program.
The electronic device of the foregoing embodiment is used to implement the corresponding method in the foregoing embodiment, and has the beneficial effects of the corresponding method embodiment, which are not described herein again.
Based on the same inventive concept, an embodiment of the present invention further provides a non-transitory computer-readable storage medium storing computer instructions for causing the computer to execute the method for downloading the batch of pictures according to any one of the above embodiments.
The storage medium of the foregoing embodiment is used to implement the corresponding method in the foregoing embodiment, and has the beneficial effects of the corresponding method embodiment, which are not described herein again.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, is limited to these examples; within the idea of the invention, also features in the above embodiments or in different embodiments may be combined, steps may be implemented in any order, and there are many other variations of the different aspects of the invention as described above, which are not provided in detail for the sake of brevity.
While the present invention has been described in conjunction with specific embodiments thereof, many alternatives, modifications, and variations of these embodiments will be apparent to those of ordinary skill in the art in light of the foregoing description. For example, other memory architectures (e.g., dynamic ram (dram)) may use the discussed embodiments.
The embodiments of the invention are intended to embrace all such alternatives, modifications and variances that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, substitutions, improvements and the like that may be made without departing from the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (10)

1. A method for downloading pictures in batch is characterized by comprising the following steps:
determining a target webpage, and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage;
generating a picture address set according to a plurality of picture addresses;
accessing and downloading each picture address in the picture address set to obtain a plurality of pictures;
storing a plurality of said pictures;
generating a downloaded picture address set according to the accessed picture address;
calculating a difference set of the picture address set and the downloaded picture address set to generate a first un-downloaded picture address set;
if the first un-downloaded picture address set is not empty, accessing and downloading each picture address of the first un-downloaded picture address set;
extracting picture names of pictures corresponding to the picture addresses in the downloaded picture address set to generate a downloaded picture name set;
extracting picture names of a plurality of stored pictures to generate a stored picture name set;
calculating a difference set of the downloaded picture name set and the stored picture name set, determining a picture address corresponding to a picture name included in the difference set, and generating a second un-downloaded picture address set;
and if the second un-downloaded picture address set is not empty, accessing and downloading each picture address of the second un-downloaded picture address set.
2. The method for downloading pictures in batches according to claim 1, wherein the obtaining the addresses of the multiple pictures corresponding to the multiple pictures on the target web page includes:
accessing a database of the target web page;
determining a data segment for recording a picture address in a database of the target webpage according to a preset rule;
and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage according to the data segment for recording the picture addresses.
3. The method for downloading pictures in batches according to claim 1, wherein the generating a picture address set according to the plurality of picture addresses comprises:
comparing the plurality of picture addresses pairwise to determine whether at least two same picture addresses exist;
if at least two same picture addresses exist, one of the at least two same picture addresses is reserved, and the rest are deleted.
4. The method for downloading pictures in batches according to claim 1, wherein said accessing and downloading each of said picture addresses in said set of picture addresses to obtain a plurality of pictures comprises:
calling a webpage testing tool, and accessing the picture address in the picture address set through the webpage testing tool;
and calling an input equipment simulation tool, and simulating a downloading operation through the input equipment simulation tool so as to download and obtain the picture.
5. A picture batch downloading device is characterized by comprising:
the system comprises an acquisition module, a display module and a display module, wherein the acquisition module is used for determining a target webpage and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage;
the fusion module is used for generating a picture address set according to the plurality of picture addresses;
the downloading module is used for accessing and downloading each picture address in the picture address set to obtain a plurality of pictures;
a storage module for storing a plurality of said pictures;
the first checking module is used for generating a downloaded picture address set according to the accessed picture address; calculating a difference set of the picture address set and the downloaded picture address set to generate a first un-downloaded picture address set; if the first un-downloaded picture address set is not empty, accessing and downloading each picture address of the first un-downloaded picture address set;
the second check module is used for extracting the picture name of the picture corresponding to the picture address in the downloaded picture address set and generating a downloaded picture name set; extracting picture names of a plurality of stored pictures to generate a stored picture name set; calculating a difference set of the downloaded picture name set and the stored picture name set, determining a picture address corresponding to a picture name included in the difference set, and generating a second un-downloaded picture address set; and if the second un-downloaded picture address set is not empty, accessing and downloading each picture address of the second un-downloaded picture address set.
6. The device for batch downloading of pictures according to claim 5, wherein the obtaining module is specifically configured to: accessing a database of the target web page; determining a data segment for recording a picture address in a database of the target webpage according to a preset rule; and acquiring a plurality of picture addresses corresponding to a plurality of pictures on the target webpage according to the data segment for recording the picture addresses.
7. The device for batch downloading of pictures according to claim 5, wherein the fusion module is specifically configured to: comparing the plurality of picture addresses pairwise to determine whether at least two same picture addresses exist; if at least two same picture addresses exist, one of the at least two same picture addresses is reserved, and the rest are deleted.
8. The device for batch downloading of pictures according to claim 5, wherein the downloading module is specifically configured to: calling a webpage testing tool, and accessing the picture address in the picture address set through the webpage testing tool; and calling an input equipment simulation tool, and simulating a downloading operation through the input equipment simulation tool so as to download and obtain the picture.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method according to any of claims 1 to 4 when executing the computer program.
10. A non-transitory computer readable storage medium storing computer instructions for causing a computer to perform the method of any one of claims 1 to 4.
CN201910646745.8A 2019-07-17 2019-07-17 Picture batch downloading method and device, electronic equipment and storage medium Active CN110365776B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910646745.8A CN110365776B (en) 2019-07-17 2019-07-17 Picture batch downloading method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910646745.8A CN110365776B (en) 2019-07-17 2019-07-17 Picture batch downloading method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN110365776A CN110365776A (en) 2019-10-22
CN110365776B true CN110365776B (en) 2021-05-04

Family

ID=68220941

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910646745.8A Active CN110365776B (en) 2019-07-17 2019-07-17 Picture batch downloading method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110365776B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597421B (en) * 2020-04-30 2022-08-30 武汉思普崚技术有限公司 Method, device, equipment and storage medium for realizing website picture crawler
CN111651418B (en) * 2020-05-29 2022-03-08 腾讯科技(深圳)有限公司 Document content downloading method and device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079057A (en) * 2007-03-14 2007-11-28 腾讯科技(深圳)有限公司 System and method for keeping multiple link object of web page
CN103702176A (en) * 2013-12-09 2014-04-02 乐视致新电子科技(天津)有限公司 HLS (http live streaming) protocol-based video downloading method and device
CN107395672A (en) * 2017-06-12 2017-11-24 维沃移动通信有限公司 A kind of picture method for down loading and mobile terminal
CN109614536A (en) * 2018-11-30 2019-04-12 平安科技(深圳)有限公司 Video batch crawling method, system, device based on YouTuBe and can storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102438031B (en) * 2011-03-11 2015-06-10 奇智软件(北京)有限公司 Transmission resuming downloading method and system
CN103593354B (en) * 2012-08-15 2018-09-07 腾讯科技(深圳)有限公司 A kind of method, apparatus, server and the system of screen page ad
CN105991699B (en) * 2015-02-06 2019-07-19 北京中搜云悦网络技术有限公司 A kind of distributed downloads system of internet crawler
CN109165357A (en) * 2018-09-07 2019-01-08 北京三快在线科技有限公司 Picture Generation Method, server, electronic equipment and readable storage medium storing program for executing
CN109803006A (en) * 2019-01-04 2019-05-24 福建天泉教育科技有限公司 Multifile batch packaging method, storage medium under distributed file system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079057A (en) * 2007-03-14 2007-11-28 腾讯科技(深圳)有限公司 System and method for keeping multiple link object of web page
CN103702176A (en) * 2013-12-09 2014-04-02 乐视致新电子科技(天津)有限公司 HLS (http live streaming) protocol-based video downloading method and device
CN107395672A (en) * 2017-06-12 2017-11-24 维沃移动通信有限公司 A kind of picture method for down loading and mobile terminal
CN109614536A (en) * 2018-11-30 2019-04-12 平安科技(深圳)有限公司 Video batch crawling method, system, device based on YouTuBe and can storage medium

Also Published As

Publication number Publication date
CN110365776A (en) 2019-10-22

Similar Documents

Publication Publication Date Title
CN110365776B (en) Picture batch downloading method and device, electronic equipment and storage medium
CN104077387A (en) Webpage content display method and browser device
CN108572823B (en) Front-end and back-end development management method and system based on interface engine
CN103678487A (en) Method and device for generating web page snapshot
CN104036011A (en) Webpage element display method and browser device.
CN107528718B (en) Method, device and system for acquiring resources
CN104765746B (en) Data processing method and device for mobile communication terminal browser
CN103678704A (en) Picture recognition method, system, equipment and device based on picture information
US20160328110A1 (en) Method, system, equipment and device for identifying image based on image
CN109491763B (en) System deployment method and device and electronic equipment
CN107294918B (en) Phishing webpage detection method and device
CN106648569B (en) Target serialization realization method and device
JP2018116496A (en) Difference detection device and program
CN110955428A (en) Page display method and device, electronic equipment and medium
CN114168875A (en) Page program generation method and device, computer equipment and storage medium
JP6505849B2 (en) Generation of element identifier
CN113590564A (en) Data storage method and device, electronic equipment and storage medium
US9589067B2 (en) Converting electronic documents having visible objects
CN111130845B (en) Method and device for testing IPv6 support degree of website page based on visual information
CN111222065A (en) Information display method and device, electronic equipment and medium
CN107832052B (en) Method and device for displaying preview page, storage medium and electronic equipment
CN111367595A (en) Data processing method, program running method, device and processing equipment
CN110968314A (en) Page generation method and device
CN115080114B (en) Application program transplanting processing method, device and medium
CN114238048B (en) Automatic testing method and system for Web front-end performance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant