CN114138712A - File processing method, processing device, electronic equipment and storage medium - Google Patents

File processing method, processing device, electronic equipment and storage medium Download PDF

Info

Publication number
CN114138712A
CN114138712A CN202111472843.8A CN202111472843A CN114138712A CN 114138712 A CN114138712 A CN 114138712A CN 202111472843 A CN202111472843 A CN 202111472843A CN 114138712 A CN114138712 A CN 114138712A
Authority
CN
China
Prior art keywords
files
target
format type
file
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111472843.8A
Other languages
Chinese (zh)
Inventor
方礼蓉
汪维
肖翔
黄梦蓓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
Original Assignee
China Construction Bank Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Construction Bank Corp filed Critical China Construction Bank Corp
Priority to CN202111472843.8A priority Critical patent/CN114138712A/en
Publication of CN114138712A publication Critical patent/CN114138712A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/116Details of conversion of file system types or formats
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure provides a file processing method, which can be applied to the technical field of computers and the financial field. The file processing method comprises the following steps: acquiring a plurality of target source files according to the identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through client pages and used for business auditing; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images. The present disclosure also provides a file processing apparatus, a device, a storage medium, and a program product.

Description

File processing method, processing device, electronic equipment and storage medium
Technical Field
The present disclosure relates to the field of computer technology, applicable and financial, and in particular to a file processing method, processing apparatus, electronic device, storage medium, and program product.
Background
At present, when various financial institutions transact various services, a client needs to submit a large amount of document materials, for example, document materials such as client basic information, invoice information, transaction detail information and the like, and a counter staff of the financial institution needs to process a large amount of document data submitted by the client and then to be audited by an auditor of the financial institution.
Disclosure of Invention
In view of the problems, the present disclosure provides a file presentation method, a processing apparatus, an electronic device, a storage medium, and a program product.
According to a first aspect of the present disclosure, there is provided a file processing method including: acquiring a plurality of target source files according to the identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through client pages and used for business auditing; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images.
According to an embodiment of the present disclosure, the file processing method further includes: and under the condition that the target source files are not compressed files, calling a file parsing tool to convert the target source files from a fourth format type to a fifth format type so as to generate second target parsing files, wherein the fifth format type is the same as the second format type, and the fourth format type is the same as the first format type.
According to an embodiment of the present disclosure, obtaining a plurality of target source files according to a plurality of source file identification information includes: acquiring service flow information; determining a plurality of source file identification information associated with the service flow information according to the service flow information; and acquiring a plurality of target source files according to the plurality of source file identification information.
According to an embodiment of the present disclosure, determining, according to the service flow information, a plurality of source file identification information associated with the service flow information includes: receiving a plurality of source files corresponding to the service types; generating service flow information corresponding to the service type and a plurality of source file identification information of a plurality of source files corresponding to the service type according to the service type and the plurality of source files corresponding to the service type; and matching the service flow information with a plurality of source file identification information, and determining a plurality of source file identification information associated with the service flow information.
According to an embodiment of the present disclosure, in a case that the plurality of target source files are compressed files, decompressing the plurality of target source files to obtain a plurality of target decompressed files, includes: performing first loop traversal on a plurality of target source files, and determining the compression format type of each target source file; and calling a decompression interface corresponding to the compression format type, decompressing each target source file, and obtaining a plurality of target decompressed files obtained after each target source file is decompressed.
According to an embodiment of the present disclosure, wherein invoking a file parsing tool to convert a plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target parsed files comprises: converting the plurality of target decompressed files from the first format type into a third format type to obtain a plurality of intermediate files, wherein the third format type is a second text format type representing a non-image format; and converting the plurality of intermediate files from the third format type to the second format type to generate a plurality of first target analysis files.
According to an embodiment of the present disclosure, the file processing method further includes: and displaying the plurality of first target analysis files in a second format type in an image area of the auditing platform.
A second aspect of the present disclosure provides a file processing apparatus including: the acquisition module is used for acquiring a plurality of target source files according to the identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through a client page and are used for business audit; the decompression module is used for decompressing the target source files to obtain a plurality of target decompressed files under the condition that the target source files are compressed files; the first conversion module is used for calling a file analysis tool and converting the plurality of target decompressed files from a first format type to a second format type so as to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images.
A third aspect of the present disclosure provides an electronic device, comprising: one or more processors; a memory for storing one or more programs, wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the file processing method described above.
A fourth aspect of the present disclosure also provides a computer-readable storage medium having stored thereon executable instructions that, when executed by a processor, cause the processor to perform the above-described file processing method.
A fifth aspect of the present disclosure also provides a computer program product comprising a computer program which, when executed by a processor, implements the above-described file processing method.
According to the embodiment of the disclosure, a plurality of target source files are obtained according to identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through a client page and used for business audit; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images. The technical problems that workload is large, working efficiency is low, client information is easy to leak and the like caused in the existing handling mode are solved, one-time automatic file data analysis is achieved, working error rate and operation risk are reduced, and technical effects of file analysis efficiency and working quality are improved.
Drawings
The foregoing and other objects, features and advantages of the disclosure will be apparent from the following description of embodiments of the disclosure, which proceeds with reference to the accompanying drawings, in which:
fig. 1 schematically shows an application scenario diagram of a file processing method and processing apparatus according to an embodiment of the present disclosure;
FIG. 2 schematically illustrates a flow diagram of a document processing method according to an embodiment of the disclosure;
FIG. 3 schematically shows a schematic diagram of a document processing method according to an embodiment of the disclosure;
FIG. 4 is a block diagram schematically showing the configuration of a document processing apparatus according to an embodiment of the present disclosure; and
FIG. 5 schematically illustrates a block diagram of an electronic device suitable for implementing a file method according to an embodiment of the disclosure.
Detailed Description
Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings. It should be understood that the description is illustrative only and is not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present disclosure.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. The terms "comprises," "comprising," and the like, as used herein, specify the presence of stated features, steps, operations, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, or components.
All terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art unless otherwise defined. It is noted that the terms used herein should be interpreted as having a meaning that is consistent with the context of this specification and should not be interpreted in an idealized or overly formal sense.
Where a convention analogous to "at least one of A, B and C, etc." is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., "a system having at least one of A, B and C" would include but not be limited to systems that have a alone, B alone, C alone, a and B together, a and C together, B and C together, and/or A, B, C together, etc.).
When a financial institution handles various types of businesses, a large amount of file materials of related businesses need to be submitted to the financial institution for auditing by auditors. When the client transacts the business submission document, the business submission document can be transacted in the following way.
One mode is that a customer carries paper document data needing to handle related business to a financial institution website for handling, and a website teller scans and uploads the paper document data needing to handle related business to a website background centralized processing system page by page for an auditor to audit the document data.
The other mode is that a client initiates a transaction application of related services through a client, file data of different formats of the related services can be compressed and uploaded to a background centralized processing system of a financial institution, and auditors download the file data of different formats uploaded by the client to a local manual decompression device and then read and audit the file data.
In the inventive concept disclosed by the present invention, it is found that, aiming at a mode, a customer needs to carry a large amount of document data to be transacted by a website, which causes inconvenience to the customer, a website teller needs to scan page by page, which increases the transaction time of website services, and the phenomenon of wrong scanning and missing scanning is easy to occur in the scanning process, thereby reducing the security of information.
In addition, the auditor needs to manually decompress the uploaded file data, and the effect of business processing is reduced.
To this end, an embodiment of the present disclosure provides a file processing method, including: acquiring a plurality of target source files according to the identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through client pages and used for business auditing; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images.
Fig. 1 schematically shows an application scenario diagram of a file processing method and a processing apparatus according to an embodiment of the present disclosure.
As shown in fig. 1, the application scenario 100 according to this embodiment may include terminal devices 101, 102, 103, a network 104 and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. The terminal devices 101, 102, 103 may have installed thereon various communication client applications, such as shopping-like applications, web browser applications, search-like applications, instant messaging tools, mailbox clients, social platform software, etc. (by way of example only).
The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like.
The server 105 may be a server providing various services, such as a background management server (for example only) providing support for websites browsed by users using the terminal devices 101, 102, 103. The background management server may analyze and perform other processing on the received data such as the user request, and feed back a processing result (e.g., a webpage, information, or data obtained or generated according to the user request) to the terminal device.
It should be noted that the file processing method provided by the embodiment of the present disclosure may be generally executed by the server 105. Accordingly, the file processing apparatus provided by the embodiment of the present disclosure may be generally disposed in the server 105. The file processing method provided by the embodiment of the present disclosure may also be executed by a server or a server cluster different from the server 105 and capable of communicating with the terminal devices 101, 102, 103 and/or the server 105. Accordingly, the file processing apparatus provided by the embodiment of the present disclosure may also be disposed in a server or a server cluster different from the server 105 and capable of communicating with the terminal devices 101, 102, 103 and/or the server 105.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
FIG. 2 schematically shows a flow chart of a file processing method according to an embodiment of the present disclosure.
As shown in fig. 2, the file processing method of this embodiment includes operations S210 to S230.
In operation S210, a plurality of target source files are obtained according to the source file identification information, where the target source files are file materials uploaded through a client page and used for service auditing.
According to the embodiment of the disclosure, the target source file is file data which is uploaded by a client through a client page and is used for business audit. The obtained target source file corresponds to the source file identification information one by one, and the source file obtained according to the source file identification information is the source file needing file processing.
In operation S220, in the case that the plurality of target source files are compressed files, the plurality of target source files are decompressed to obtain a plurality of target decompressed files.
According to the embodiment of the disclosure, the target source files may be one file, or may be a compressed file composed of a plurality of single files with different format types, and the format of the compressed file may be arbitrary, for example, the compressed file may be in a zip format or a rar format.
According to the embodiment of the disclosure, whether each target source file in a plurality of target source files is a compressed file is judged, and if the target source file is the compressed file, the target source file is decompressed.
According to an embodiment of the present disclosure, the process of decompressing the plurality of target source files may be an automatic decompression process, and the decompression process may further include: when the target source file is a compressed file, if a plurality of sub-compressed files are nested in the compressed file, each sub-compressed file can be sequentially decompressed.
According to the embodiment of the disclosure, the decompression operation is performed on the compressed file of the target source file, the file information of the target decompressed file can be obtained, and the target decompressed file is stored to the specified saving path. The file information of the target decompressed file may include: identification information of the target decompressed file, file name information of the target decompressed file, format type information of a plurality of target decompressed files, and the like.
In operation S230, a file parsing tool is called to convert the plurality of target decompressed files from a first format type to a second format type, so as to generate a plurality of first target parsed files, where the second format type is a format type representing an image, and the first format type is a first text format type representing a non-image.
According to an embodiment of the present disclosure, the first format type is a first text format type characterizing non-images, which may include, for example, but is not limited to: formats such as doc, docx, txt, pdf, excel, etc.; the second format type is a format type characterizing the image, and may be a jpg or jpeg picture format type, for example.
According to an embodiment of the present disclosure, the file parsing tool may be a server software tool, and the server software tool is used to parse the target decompressed file by calling an application programming interface of the server software tool, so as to obtain a plurality of target parsed files.
According to the embodiment of the disclosure, file analysis is performed on the target decompressed files, file information of a plurality of first target analysis files can be obtained, and the first target analysis files are stored to the specified storage path. The file information of the first target resolution file may include: the file format information comprises identification information of the first target analysis file, file name information of the first target analysis file, format type information of the plurality of first target analysis files and the like.
According to the embodiment of the disclosure, a plurality of target source files are obtained according to identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through a client page and used for business audit; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images. The technical problems that workload is large, working efficiency is low, client information is easy to leak and the like caused in the existing handling mode are solved, one-time automatic file data analysis is achieved, working error rate and operation risk are reduced, and technical effects of file analysis efficiency and working quality are improved.
According to an embodiment of the present disclosure, the file processing method may further include: and under the condition that the target source files are not compressed files, calling a file parsing tool to convert the target source files from a fourth format type to a fifth format type so as to generate second target parsing files, wherein the fifth format type is the same as the second format type, and the fourth format type is the same as the first format type.
According to the embodiment of the disclosure, if a plurality of target source files are not compressed files, a file analysis tool is called to perform file analysis on the target source files, the files of the non-image format types of the target source files are converted into the files of the image format types, file information of a second target analysis file is obtained, and meanwhile, the second target analysis file is stored to the designated storage path. The file information of the second target resolution file may include: identification information of the second target analysis file, file name information of the second target analysis file, format type information of the plurality of second target analysis files, and the like. According to an embodiment of the present disclosure, the fifth format type may be the same as the second format type, may be a format type characterizing an image, and may be a jpg or jpeg picture format type, for example.
According to an embodiment of the present disclosure, the fourth format type may be the same as the first format type, may be a format type characterizing non-images, for example, may include but is not limited to: doc, docx, txt, pdf, excel, etc.
According to an embodiment of the present disclosure, obtaining a plurality of target source files according to a plurality of source file identification information includes: acquiring service flow information; determining a plurality of source file identification information associated with the service flow information according to the service flow information; and acquiring a plurality of target source files according to the plurality of source file identification information.
According to the embodiment of the disclosure, when a client needs to transact business at a financial institution, a plurality of file materials of the related business transacted by the client need to be submitted on a client page. When a client applies for related services to be handled, unique service flow information of the applied services is generated. Because a plurality of file materials are required under the service, when the service flow information of the service is generated, the identification information of the plurality of file materials corresponding to the service is generated at the same time, that is, a plurality of source file identification information, for example, the source file identification information may be a source file ID.
According to the embodiment of the disclosure, the service flow information under the service corresponds to a plurality of source file identification information, the source file identification information is a unique identification of each source file, and a target source file corresponding to the source file identification information can be obtained through the source file identification information.
According to the embodiment of the disclosure, determining a plurality of source file identification information associated with the business flow information according to the business flow information includes: receiving a plurality of source files corresponding to the service types; generating service flow information corresponding to the service type and a plurality of source file identification information of a plurality of source files corresponding to the service type according to the service type and the plurality of source files corresponding to the service type; and matching the service flow information with a plurality of source file identification information, and determining a plurality of source file identification information associated with the service flow information.
According to the embodiment of the disclosure, under the condition that a client applies for transacting related services, transacting the services of each service type requires submitting a plurality of source files related to the service type. In processing each transaction, no more than 10 attachments, no more than 10M files per attachment, are received from the source file submitted by the client.
According to the embodiment of the disclosure, when different types of services are handled, service flow information corresponding to the service types and a plurality of source file identification information corresponding to the services are generated. And matching the corresponding relation between the service flow information and the source file identification information to ensure that the service flow information of the service and the source file identification information corresponding to the service form an incidence relation.
According to an embodiment of the present disclosure, in a case that a plurality of target source files are compressed files, decompressing the plurality of target source files to obtain a plurality of target decompressed files includes: performing first loop traversal on a plurality of target source files, and determining the compression format type of each target source file; and calling a decompression interface corresponding to the compression format type, decompressing each target source file, and obtaining a plurality of target decompressed files obtained after each target source file is decompressed.
According to the embodiment of the disclosure, in the case that the multiple target source files are compressed files, the compressed files may include multiple single files of different format types, may also include multiple nested sub-compressed files, and may also include multiple single files of different format types. Different format types may include: doc, docx, tXt, pdf, jpeg, jpg, excel, etc.
According to the embodiment of the disclosure, the compressed files of a plurality of target source files are circularly traversed, and the format type of the current compressed file is judged, for example, when the format type of the compressed file is a zip compressed file, a decompression application programming interface is called to perform corresponding decompression operation for a Java zip API; and when the format type of the compressed file is a rar compressed file, calling a decompression application programming interface to perform corresponding decompression operation for a Java rar API. Compressed files that loop through multiple target source files may be implemented with while loops.
According to the embodiment of the disclosure, after each target source file is decompressed, decompressed file information, that is, target decompressed file information is obtained, and after all target source files are decompressed, the entity file corresponding to the target decompressed file information can be stored to the specified storage path. The target decompressed file information may include target decompressed file identification information, a file name of the target decompressed file, a file format of the target decompressed file, and identification information of the target decompressed file, for example, may be a target decompressed file ID.
According to an embodiment of the present disclosure, invoking a file parsing tool to convert a plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target parsed files includes: converting the plurality of target decompressed files from the first format type into a third format type to obtain a plurality of intermediate files, wherein the third format type is a second text format type representing a non-image format; and converting the plurality of intermediate files from the third format type to the second format type to generate a plurality of first target analysis files.
According to the embodiment of the present disclosure, the third format type may be an intermediate format type that converts the first format type into the second format type in the parsing operation process of the target decompressed file, and a file corresponding to the intermediate format type is an intermediate file.
According to the embodiment of the disclosure, for example, the first format type file is a word format type, the second format type file is a jpg format type, and by calling a file parsing tool, the first format type file may be converted into a third format type, for example, a pdf format type, to generate an intermediate file, and then the intermediate file is converted into the second format type file.
According to the embodiment of the disclosure, after each target decompressed file is parsed, parsed file information, that is, first target parsed file information is obtained, and after all target decompressed files are parsed, an entity file corresponding to the first target parsed file information can be stored to a specified storage path. The first target parse file information may include first target parse file identification information, a file name of the first target parse file, and a file format of the first target parse file, where the identification information of the first target parse file may be, for example, a target parse file ID.
According to an embodiment of the present disclosure, the source file information, the target decompressed file information, and the first target parsed file information correspond to each other, for example, the source file identification information in the source file information is ID1, the identification information of the target decompressed file information may be ID1+ ID2, and the identification information of the first target parsed file information may be ID1+ ID2+ ID3, so that the corresponding target file may be acquired according to the identification information.
According to an embodiment of the present disclosure, when determining that the plurality of target source files are not compressed files, the converting process of converting the plurality of target source files from the fourth format type to the fifth format type may include converting the plurality of target source files from the fourth format type to an intermediate file of the intermediate format type, and then converting the intermediate file to the fifth format type to generate a plurality of second target parse files. It should be noted that the intermediate format type may be the same as the third format type. According to the embodiment of the present disclosure, after the second target analysis file is generated, the entity file corresponding to the file information of the second target analysis file is stored in the same manner as the first target analysis file, which is not described herein again.
According to an embodiment of the present disclosure, the file processing method may further include: and displaying the plurality of first target analysis files in a second format type in an image area of the auditing platform.
According to the embodiment of the disclosure, in the case that the target source file is determined to be a compressed file, the plurality of first target parse files converted into the second format type are displayed in the auditing platform image area.
According to the embodiment of the disclosure, under the condition that the target source file is determined not to be a compressed file, file parsing is performed on the target source file, the target source file is converted into a plurality of second target parsing files of a fifth format type, and the plurality of second target parsing files converted into the fifth format type are displayed in the auditing platform influence area.
According to the embodiment of the disclosure, the second format type is an image format type, the image area of the auditing platform can be divided into a left screen and a right screen, the displayed contents are the same, and the auditing personnel can compare the contents of different files by dragging the left screen and the right screen, so that the inconvenience of browsing before and after auditing is avoided.
Fig. 3 schematically shows a schematic diagram of a document processing method according to an embodiment of the present disclosure.
As shown in fig. 3, a plurality of source files 301 corresponding to the service types are received, a plurality of source file identification information 303 of the plurality of source files corresponding to the service types and service flow information 304 corresponding to the service types are generated according to the service type 302 and the plurality of source files 301 corresponding to the service types, the service flow information 303 is matched with the plurality of source file identification information 304, and a plurality of source file identification information 305 associated with the service flow information is determined. A plurality of target source files 306 are acquired based on the plurality of source file identification information 305, and it is determined whether the plurality of target source files 306 are compressed files 307.
When the target source files 306 are compressed files, the target source files 306 are decompressed to obtain target decompressed files 308, a file parsing tool is called to convert the target decompressed files 308 from a first format type to a second format type 309, and a plurality of first target parsed files 310 are generated. In the case where the plurality of target source files 306 are not compressed files, a file parsing tool is invoked to convert the plurality of target source files 306 from the fourth format type to the fifth format type 311, generating a plurality of second target parse files 312.
It should be noted that the fifth format type is the same as the second format type, and is a format type representing an image; the fourth format type is the same as the first format type and is the first text format type for representing the non-image.
Based on the file processing method, the disclosure also provides a file processing device. The apparatus will be described in detail below with reference to fig. 4.
Fig. 4 schematically shows a block diagram of the structure of a document processing apparatus according to an embodiment of the present disclosure.
As shown in fig. 4, the document processing apparatus 400 of this embodiment may include: an acquisition module 410, a decompression module 420 and a first conversion module 430.
The obtaining module 410 is configured to obtain a plurality of target source files according to the identification information of the plurality of source files, where the target source files are file data used for service auditing and uploaded through a client page. In an embodiment, the obtaining module 410 may be configured to perform the operation S210 described above, which is not described herein again.
The decompression module 420 is configured to, when the plurality of target source files are compressed files, decompress the plurality of target source files to obtain a plurality of target decompressed files. In an embodiment, the decompression module 420 may be configured to perform the operation S220 described above, which is not described herein again.
The first conversion module 430 is configured to invoke a file parsing tool, and convert the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target parsed files, where the second format type is a format type representing an image, and the first format type is a first text format type representing a non-image. In an embodiment, the converting module 430 may be configured to perform the operation S230 described above, which is not described herein again.
According to the embodiment of the disclosure, a plurality of target source files are obtained according to identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through a client page and used for business audit; decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files; and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images. The technical problems that workload is large, working efficiency is low, client information is easy to leak and the like caused in the existing handling mode are solved, one-time automatic file data analysis is achieved, working error rate and operation risk are reduced, and technical effects of file analysis efficiency and working quality are improved.
According to an embodiment of the present disclosure, the document processing apparatus 400 may include: and a second conversion module.
The second conversion module is used for calling a file parsing tool to convert the plurality of target source files from a fourth format type to a fifth format type under the condition that the plurality of target source files are not compressed files so as to generate a plurality of second target parsing files, wherein the fifth format type is the same as the second format type, and the fourth format type is the same as the first format type.
According to an embodiment of the present disclosure, the obtaining module 410 may include: the device comprises an acquisition submodule, a first determination submodule and a second acquisition submodule.
And the acquisition submodule is used for acquiring the service flow information.
And the first determining submodule is used for determining a plurality of source file identification information associated with the service flow information according to the service flow information.
And the second obtaining submodule is used for obtaining a plurality of target source files according to the identification information of the source files.
According to an embodiment of the present disclosure, determining the sub-module may include: the device comprises a receiving unit, a generating unit and a determining unit.
And the receiving unit is used for receiving a plurality of source files corresponding to the service types.
And the generating unit is used for generating service flow information corresponding to the service type and a plurality of source file identification information of the source files corresponding to the service type according to the service type and the source files corresponding to the service type.
And the determining unit is used for matching the service flow information with the plurality of source file identification information and determining the plurality of source file identification information associated with the service flow information.
According to an embodiment of the present disclosure, the decompression module 420 may include: a second determining submodule and a calling submodule.
And the second determining submodule is used for performing first loop traversal on the plurality of target source files and determining the compression format type of each target source file.
And the calling submodule is used for calling the decompression interfaces corresponding to the compression format types, decompressing each target source file and obtaining a plurality of target decompressed files obtained after each target source file is decompressed.
According to an embodiment of the present disclosure, the first conversion module 430 may include: the first conversion submodule and the second conversion submodule.
And the first conversion submodule is used for firstly converting the plurality of target decompressed files from the first format type into a third format type to obtain a plurality of intermediate files, wherein the third format type is a second text format type representing a non-image format.
And the second conversion submodule is used for converting the plurality of intermediate files from the third format type to the second format type so as to generate a plurality of first target analysis files.
According to an embodiment of the present disclosure, the document processing apparatus 400 may further include: and a display module.
And the display module is used for displaying the plurality of target analysis files in the second format type in the image area of the auditing platform.
According to the embodiment of the present disclosure, any plurality of the obtaining module 410, the decompressing module 420, and the first converting module 430 may be combined into one module to be implemented, or any one of them may be split into a plurality of modules. Alternatively, at least part of the functionality of one or more of these modules may be combined with at least part of the functionality of the other modules and implemented in one module. According to an embodiment of the present disclosure, at least one of the obtaining module 410, the decompressing module 420, and the first converting module 430 may be implemented at least partially as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or may be implemented by hardware or firmware in any other reasonable manner of integrating or packaging a circuit, or may be implemented in any one of three implementations of software, hardware, and firmware, or in a suitable combination of any of them. Alternatively, at least one of the obtaining module 410, the decompressing module 420 and the first converting module 430 may be at least partially implemented as a computer program module, which when executed may perform the corresponding functions.
FIG. 5 schematically illustrates a block diagram of an electronic device suitable for implementing a file method according to an embodiment of the disclosure.
As shown in fig. 5, an electronic device 500 according to an embodiment of the present disclosure includes a processor 501 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)502 or a program loaded from a storage section 508 into a Random Access Memory (RAM) 503. The processor 501 may comprise, for example, a general purpose microprocessor (e.g., a CPU), an instruction set processor and/or associated chipset, and/or a special purpose microprocessor (e.g., an Application Specific Integrated Circuit (ASIC)), among others. The processor 501 may also include onboard memory for caching purposes. Processor 501 may include a single processing unit or multiple processing units for performing different actions of a method flow according to embodiments of the disclosure.
In the RAM 503, various programs and data necessary for the operation of the electronic apparatus 500 are stored. The processor 501, the ROM 502, and the RAM 503 are connected to each other by a bus 504. The processor 501 performs various operations of the method flows according to the embodiments of the present disclosure by executing programs in the ROM 502 and/or the RAM 503. Note that the programs may also be stored in one or more memories other than the ROM 502 and the RAM 503. The processor 501 may also perform various operations of method flows according to embodiments of the present disclosure by executing programs stored in the one or more memories.
According to an embodiment of the present disclosure, electronic device 500 may also include an input/output (I/O) interface 505, input/output (I/O) interface 505 also being connected to bus 504. The electronic device 500 may also include one or more of the following components connected to the I/O interface 505: an input portion 506 including a keyboard, a mouse, and the like; an output portion 507 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage portion 508 including a hard disk and the like; and a communication section 509 including a network interface card such as a LAN card, a modem, or the like. The communication section 509 performs communication processing via a network such as the internet. The driver 510 is also connected to the I/O interface 505 as necessary. A removable medium 511 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 510 as necessary, so that a computer program read out therefrom is mounted into the storage section 508 as necessary.
The present disclosure also provides a computer-readable storage medium, which may be contained in the apparatus/device/system described in the above embodiments; or may exist separately and not be assembled into the device/apparatus/system. The computer-readable storage medium carries one or more programs which, when executed, implement the method according to an embodiment of the disclosure.
According to embodiments of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, which may include, for example but is not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. For example, according to embodiments of the present disclosure, a computer-readable storage medium may include ROM 502 and/or RAM 503 and/or one or more memories other than ROM 502 and RAM 503 described above.
Embodiments of the present disclosure also include a computer program product comprising a computer program containing program code for performing the method illustrated in the flow chart. When the computer program product runs in a computer system, the program code is used for causing the computer system to realize the file processing method provided by the embodiment of the disclosure.
The computer program performs the above-described functions defined in the system/apparatus of the embodiments of the present disclosure when executed by the processor 501. The systems, apparatuses, modules, units, etc. described above may be implemented by computer program modules according to embodiments of the present disclosure.
In one embodiment, the computer program may be hosted on a tangible storage medium such as an optical storage device, a magnetic storage device, or the like. In another embodiment, the computer program may also be transmitted, distributed in the form of a signal on a network medium, downloaded and installed through the communication section 509, and/or installed from the removable medium 511. The computer program containing program code may be transmitted using any suitable network medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 509, and/or installed from the removable medium 511. The computer program, when executed by the processor 501, performs the above-described functions defined in the system of the embodiments of the present disclosure. The systems, devices, apparatuses, modules, units, etc. described above may be implemented by computer program modules according to embodiments of the present disclosure.
In accordance with embodiments of the present disclosure, program code for executing computer programs provided by embodiments of the present disclosure may be written in any combination of one or more programming languages, and in particular, these computer programs may be implemented using high level procedural and/or object oriented programming languages, and/or assembly/machine languages. The programming language includes, but is not limited to, programming languages such as Java, C + +, python, the "C" language, or the like. The program code may execute entirely on the user computing device, partly on the user device, partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Those skilled in the art will appreciate that various combinations and/or combinations of features recited in the various embodiments and/or claims of the present disclosure can be made, even if such combinations or combinations are not expressly recited in the present disclosure. In particular, various combinations and/or combinations of the features recited in the various embodiments and/or claims of the present disclosure may be made without departing from the spirit or teaching of the present disclosure. All such combinations and/or associations are within the scope of the present disclosure.
The embodiments of the present disclosure have been described above. However, these examples are for illustrative purposes only and are not intended to limit the scope of the present disclosure. Although the embodiments are described separately above, this does not mean that the measures in the embodiments cannot be used in advantageous combination. The scope of the disclosure is defined by the appended claims and equivalents thereof. Various alternatives and modifications can be devised by those skilled in the art without departing from the scope of the present disclosure, and such alternatives and modifications are intended to be within the scope of the present disclosure.

Claims (11)

1. A method of file processing, comprising:
acquiring a plurality of target source files according to the identification information of the source files, wherein the target source files are file materials which are uploaded through client pages and used for business auditing;
decompressing the target source files to obtain target decompressed files under the condition that the target source files are compressed files;
and calling a file analysis tool, and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images.
2. The method of claim 1, further comprising:
and under the condition that the target source files are not compressed files, calling the file parsing tool to convert the target source files from a fourth format type to a fifth format type so as to generate second target parsing files, wherein the fifth format type is the same as the second format type, and the fourth format type is the same as the first format type.
3. The method of claim 1, wherein said obtaining a plurality of target source files from a plurality of source file identification information comprises:
acquiring service flow information;
determining a plurality of source file identification information associated with the service flow information according to the service flow information;
and acquiring a plurality of target source files according to the identification information of the source files.
4. The method of claim 3, wherein said determining a plurality of said source file identification information associated with said traffic flow information from said traffic flow information comprises:
receiving a plurality of source files corresponding to the service types;
generating the service flow information corresponding to the service type and a plurality of source file identification information of a plurality of source files corresponding to the service type according to the service type and the plurality of source files corresponding to the service type;
and matching the service flow information with a plurality of source file identification information, and determining a plurality of source file identification information associated with the service flow information.
5. The method of claim 1, wherein, in the case that the target source files are compressed files, decompressing the target source files to obtain target decompressed files comprises:
performing first loop traversal on a plurality of target source files, and determining the compression format type of each target source file;
and calling a decompression interface corresponding to the compression format type, decompressing each target source file, and obtaining a plurality of target decompressed files obtained after each target source file is decompressed.
6. The method of claim 1, wherein said invoking a file parsing tool to convert the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target parsed files comprises:
converting the target decompressed files from the first format type into a third format type to obtain a plurality of intermediate files, wherein the third format type is a second text format type representing a non-image format;
converting the plurality of intermediate files from the third format type to the second format type to generate a plurality of first target parse files.
7. The method of claim 1, further comprising:
and displaying the plurality of first target analysis files in the second format type in an image area of an auditing platform.
8. A document processing apparatus comprising:
the acquisition module is used for acquiring a plurality of target source files according to the identification information of the plurality of source files, wherein the target source files are file materials which are uploaded through a client page and are used for business audit;
the decompression module is used for decompressing the target source files to obtain a plurality of target decompressed files under the condition that the target source files are compressed files;
the first conversion module is used for calling a file analysis tool and converting the plurality of target decompressed files from a first format type to a second format type to generate a plurality of first target analyzed files, wherein the second format type is a format type representing images, and the first format type is a first text format type representing non-images.
9. An electronic device, comprising:
one or more processors;
a storage device for storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the method of any of claims 1-7.
10. A computer readable storage medium having stored thereon executable instructions which, when executed by a processor, cause the processor to perform the method of any one of claims 1 to 7.
11. A computer program product comprising a computer program which, when executed by a processor, implements a method according to any one of claims 1 to 7.
CN202111472843.8A 2021-12-03 2021-12-03 File processing method, processing device, electronic equipment and storage medium Pending CN114138712A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111472843.8A CN114138712A (en) 2021-12-03 2021-12-03 File processing method, processing device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111472843.8A CN114138712A (en) 2021-12-03 2021-12-03 File processing method, processing device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114138712A true CN114138712A (en) 2022-03-04

Family

ID=80388011

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111472843.8A Pending CN114138712A (en) 2021-12-03 2021-12-03 File processing method, processing device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114138712A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116127922A (en) * 2023-04-19 2023-05-16 北京路浩知识产权集团有限公司 Intellectual property file processing method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116127922A (en) * 2023-04-19 2023-05-16 北京路浩知识产权集团有限公司 Intellectual property file processing method and device

Similar Documents

Publication Publication Date Title
US20100223543A1 (en) Automating Interrogative Population of Electronic Forms Using a Real-Time Communication Platform
CN113159970A (en) Data format conversion method, device, system, electronic equipment and storage medium
CN111950249A (en) Message generation method and device, electronic equipment and storage medium
CN114022031A (en) Data processing method, data processing apparatus, electronic device, medium, and computer program product
CN113507419B (en) Training method of traffic distribution model, traffic distribution method and device
CN114237651A (en) Installation method and device of cloud native application, electronic equipment and medium
CN114138712A (en) File processing method, processing device, electronic equipment and storage medium
US20170083600A1 (en) Creating data objects to separately store common data included in documents
CN112486482A (en) Page display method and device
CN113515271B (en) Service code generation method and device, electronic equipment and readable storage medium
CN113448578A (en) Page data processing method, processing system, electronic device and readable storage medium
CN113395292A (en) Message processing method and device, electronic equipment and storage medium
CN111199432A (en) Data processing method, device, system and computer readable storage medium
CN113760582A (en) Document declaration method and device
CN113176907A (en) Interface data calling method and device, computer system and readable storage medium
CN112784195A (en) Page data publishing method and system
CN113779133B (en) Data processing method, device, electronic equipment, storage medium and program product
CN113553123B (en) Data processing method, device, electronic equipment and storage medium
CN110858240A (en) Front-end module loading method and device
CN114201214A (en) File generation method, file generation device, electronic equipment, medium and program product
CN114066425A (en) Electronic approval method, device, equipment and medium
CN113989046A (en) Transaction processing method, apparatus, electronic device, storage medium, and program product
CN116310423A (en) Image recognition method, device, electronic equipment and readable storage medium
CN114218254A (en) Report generation method, device, equipment and storage medium
CN116629230A (en) Document processing method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination