CN106339362B - A kind of big Document encapsulation method of archive information packet and client - Google Patents

A kind of big Document encapsulation method of archive information packet and client Download PDF

Info

Publication number
CN106339362B
CN106339362B CN201610797042.1A CN201610797042A CN106339362B CN 106339362 B CN106339362 B CN 106339362B CN 201610797042 A CN201610797042 A CN 201610797042A CN 106339362 B CN106339362 B CN 106339362B
Authority
CN
China
Prior art keywords
original text
file
electronics
encapsulation
electronics original
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610797042.1A
Other languages
Chinese (zh)
Other versions
CN106339362A (en
Inventor
李铮
江威
李翔宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ding Xin Tongfang Polytron Technologies Inc
Original Assignee
Ding Xin Tongfang Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ding Xin Tongfang Polytron Technologies Inc filed Critical Ding Xin Tongfang Polytron Technologies Inc
Priority to CN201610797042.1A priority Critical patent/CN106339362B/en
Publication of CN106339362A publication Critical patent/CN106339362A/en
Application granted granted Critical
Publication of CN106339362B publication Critical patent/CN106339362B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/14Tree-structured documents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a kind of big Document encapsulation of archive information packet, de-encapsulation method and systems.The packaging method includes obtaining electronics original text and metadata;It generates and contains the original xml document of the location information of electronics original text;The original xml document is parsed, the byte for being successively read regular length flows and is spliced to form continuous byte stream;Electronics original text after base64 coded treatment is inserted at the position of the electronics original text marked in the byte stream, merges write-in with byte stream and meet the new xml document that encapsulation pack arrangement defines;The signed data of encryption is generated for new xml document, and is packaged into the complete package packet with private key.The present invention solves the problems, such as of the existing technology, is packaged processing to archive information packet using base64 coding techniques and xml technology, can satisfy the package requirements of big file, and is with good expansibility and compatibility.

Description

A kind of big Document encapsulation method of archive information packet and client
Technical field
The present invention relates to archives and information management fields, and in particular to a kind of big Document encapsulation method of archive information packet and client End.
Background technique
OAIS (Reference Model for an Open Archival Information System), i.e., it is open Formula Documentary Information System model, it is nineteen ninety-five, under the request of International Organization for standardization (ISO), American National aviation and boat The consultative committee for space data system (CCSDS) of its office starts access standard and the length being intended to digital resource of exploitation The regulation concept and reference frame that phase saves.Later several modified expansions, in January, 2002, reference model eventually by audit, Formally become a new international standard (ISO:14721), becomes the research and development of many digital archives in the world The standard criterion that project is preferentially abided by.
OAIS information model proposes the concept of packet, the structure that information is inputted, operated and exported in systems by it Generalities are divided into and submit packet (Submission Information Package, SIP), archive information packet (Archival Information Package, AIP) and distribution packet (Dissemination Information Package, DIP). Submitting packet (SIP) is the packet that information producer is supplied to OAIS;It is to be converted that one or more submits packet to need It is saved as one or more archive information packets.Archive information is surrounded by a series of complete preservation description informations and relevant Content information.Finally, OAIS needs provide an archive letter in a manner of distributing packet (DIP) according to the request of consumer All or part of content of packet is ceased to consumer.
The english abbreviation of XML, that is, Extensible Markup Language is extensible markup language.It is a kind of function Energy is strong, purposes is wide, open-ended common index language.XML is that occur blank nineteen ninety-five, is issued as within 2 months within 1998 the mark of W3C It is quasi-.XML term marks electronic document, makes it have structural markup language, can be used to flag data, define data class Type is a kind of original language that permission user is defined the markup language of oneself.
It is very convenient to the exchange of text since XML file is the data exchange structure of text-type, but for non-textual Data are helpless.And there are a large amount of binary data, such as the data such as formatted document and image in archives economy.It needs It above-mentioned non-document file is mapped as text file is put into XML to transmit, and Base64 coding just meets the demand.
The principle of Base64 coding is as follows: the data flow of input taken 6bit (the preceding benefit 0 less than 6bit) every time, it is every in this way 38 bytes will be encoded to 46 bit bytes;Discontented 4 bytes with "=" be filled.46 bit bytes after coding It is transmitted still according to 8, only front two is all configured to 0.When a byte only has 6 significance bits, it Valued space is limited in 0-63.The information of 3 bytes can be converted into the information of 4 bytes by we in this way, but this 4 Information after a conversion can hint obliquely in character.It, can be by letter, number, word due to this characteristic of base64 coding Symbol, Chinese character and text generation restoring files turn to the coded format that can be split, parse, therefore in this way I Text can be read into memory in batches by way of stream, change into form corresponding base64 coded data, and will Data are written in packet, form AIP and DIP file packet, and catalogue data and the archives mounting data for realizing archives are unified Storage.When parsing, since base64 coding is fixed 8bit coding mode, we can be segmented file pair The base64 coding memory parsing answered, and restore document.
Archive information packet (AIP), distribution packet (DIP) are to seal the structural metadata of archives and archives plaintext data It is attached in packet, packet infrastructure is XML file;Traditional packet encapsulation technology is using all the elements as the section of XML Point content is saved, and wherein structural metadata saves original data content, and archives original text is to convert electronic document itself to Base64 coding, and base64 coding is saved as node content.Traditional archives are usually papery, are scanned processing Electronics original text size afterwards is not too large, probably also can control within 10,000,000 after the original text conversion more than file, therefore, traditional Packing manner is disposably to be cached to original text in memory, and complete the conversion of base64 coding in memory, and be saved in XML Term preservation is carried out in file, for original text size within 30,000,000, such operation is all what there is no problem;But with archives electricity The type of sub- original text increases the multimedia class of more and more photos, audio-video on the basis of traditional document type Type, therefore the size of original text was originally bigger, video file some have arrived the size rank of G, thus traditional packing manner without Method meets the encapsulation requirement of the packet of big files electronic original text.
Archival digitalization is the inevitable trend of Archival Profession, with the development and social progress of society, with difference Kind medium is that the archives of carrier are more and more: secretarial document, photo file, audio file and video archive.How these to be situated between Matter is unified to be saved in archive information packet, while being convenient for the archives text of compatible format in future with preferable scalability again Part is current urgent problem to be solved.
Summary of the invention
The technical problem to be solved by the present invention is in view of the deficiencies of the prior art, provide a kind of big file of archive information packet Encapsulation, parsing inspection method and system.
The technical scheme to solve the above technical problems is that
A kind of big Document encapsulation method of archive information packet is applied to server-side, includes the following steps:
Receive the large file and package request that client is sent;
The large file is encapsulated as encapsulation package using base64 coding techniques and xml technology;
After checking request receive that client sends, the encapsulation package is returned to client.
The beneficial effects of the present invention are: being packaged place to archive information packet using base64 coding techniques and xml technology Reason, can satisfy the package requirements of big file, and be with good expansibility and compatibility, solve of the existing technology to ask Topic.
Based on the above technical solution, the present invention can also be improved as follows.
Further, the specific implementation that the large file is encapsulated as to encapsulation package by scheduled packing rule are as follows:
Received large file is stored under scheduled file path by S100, forms the electronics original text of mounting;
S101 obtains the electronics original text for the mounting stored under the scheduled file path, and member is obtained from data source Data, the not limited to of the electronics original text;
S102, generation do not include electronics original text and data signature but contain the original xml of the location information of electronics original text File, the original xml document include metadata;
S103, parses the original xml document in a manner of SAX, and the byte for being successively read regular length is flowed and is spliced to form Continuous byte stream, until reading whole original xml documents;
Electronics original text after base64 coded treatment is inserted into the position of the electronics original text marked in the byte stream by S104 Place merges write-in with byte stream and meets the new xml document that encapsulation pack arrangement defines;
S105, the signed data of encryption is generated for new xml document, and is packaged into the complete package packet with private key.
Beneficial effect using above-mentioned further scheme is: continuously being handled the electronics original text of mounting, is generated full The files encapsulation package of foot application and archiving requirement;SAX mode higher using performance, less to Installed System Memory demand is handled Processing for the XML file of structuring in encapsulation package avoids longer system latency time.
Further, the electronics original text includes archive files, archive files papery scanned copy, image file, image file Papery scanned copy, audio file and video archive.
A kind of de-encapsulation method of the big file of archive information packet is applied to client, includes the following steps:
Large file and package request are sent to server-side;
Big Fileview, which is sent, to server-side requests and receive the big Document encapsulation packet that server-side returns;
Big Document encapsulation packet is decapsulated by reversed packing rule and provides corresponding reader for checking.
The beneficial effects of the present invention are: sending package request and checking request, while providing the deblocking of big the file information packet Fill function.
Based on the above technical solution, the present invention can also be improved as follows.
Further, described that big Document encapsulation packet is decapsulated by reversed packing rule and corresponding reader is provided The specific implementation for for checking are as follows:
S200, generates xsd file according to encapsulation package structure definition file, encapsulates pack arrangement using the xsd file checking Legitimacy, if it is illegal, prompting structure is invalid and ends processing process;If legal, S201 is executed;
S201, loading private key, splits signature object part, signs and relatively newly signs and original again to signature object part It whether consistent signs, if inconsistent, prompt signature invalid and end processing process;If consistent, S202 is executed;
S202 separates the electronics original text and metadata in encapsulation package according to reversed packing rule;
S203 provides corresponding reader electronics for checking according to metadata and the type of electronics original text respectively.
Beneficial effect using above-mentioned further scheme is: the legitimacy of verifying encapsulation pack arrangement and the legitimacy of signature, Encapsulation package is decapsulated simultaneously, separates metadata and electronics original text, and by metadata and the type of electronics original text, respectively Corresponding reader electronics is provided for checking.
A kind of server-side, the server-side include:
Receiving module, for receiving the large file and package request of client transmission;
Package module, for the large file to be encapsulated as encapsulation package using base64 coding techniques and xml technology;
Sending module returns to the encapsulation package to client after checking request for what is sent in reception client.
The beneficial effects of the present invention are: being packaged by client request to big file and being returned when client request is checked Return encapsulation package.
Based on the above technical solution, the present invention can also be improved as follows.
Further, the package module includes:
Storage unit forms the electronics of mounting for received large file to be stored under scheduled file path Original text;
First acquisition unit, for obtaining the electronics original text for the mounting stored under the scheduled file path, and from number According to obtaining metadata, the not limited to of the electronics original text in source.
Original document generation unit, for generating not comprising electronics original text and data signature but containing the position of electronics original text The original xml document of confidence breath, the original xml document include metadata;
Resolution unit is flowed, for parsing the original xml document in a manner of SAX, is successively read the byte stream of regular length And it is spliced to form continuous byte stream, until reading whole original xml documents;
Electronics original text after base64 coded treatment is inserted into the electronics marked in the byte stream by big File write unit At the position of original text, merges write-in with byte stream and meet the new xml document that encapsulation pack arrangement defines;
Packaged unit for generating the signed data of encryption for new xml document, and is packaged into the complete package with private key Packet.
A kind of client, the client include:
First sending module, for sending large file and package request to server-side;
The big file envelope that server-side returns is requested and received to second sending module for sending big Fileview to server-side Dress packet;
Decapsulation module is looked into accordingly for big Document encapsulation packet to be decapsulated and provided by reversed packing rule See device for checking.
Based on the above technical solution, the present invention can also be improved as follows.
Further, the decapsulation module includes:
Structure verification unit is examined for generating xsd file according to encapsulation package structure definition file using the xsd file Close down the legitimacy of dress pack arrangement;
Signature check unit splits signature object part, signs and compare again to signature object part for being loaded into private key Whether relatively new signature and original signature are consistent;
Separative unit, for separating the electronics original text and metadata in encapsulation package according to reversed packing rule;
It checks unit, according to metadata and the type of electronics original text, provides corresponding reader electronics respectively for checking.
A kind of big Document encapsulation of archive information packet, decapsulating system, including a kind of server-side of claim 6 or 7 and A kind of client of claim 8 or 9.
The beneficial effects of the present invention are: realize the quick of the big file of archive information packet, super large Document encapsulation and decapsulation, solution Problem certainly of the existing technology.
Detailed description of the invention
Fig. 1 is first embodiment of the invention schematic diagram;
Fig. 2 is a kind of big Document encapsulation method flow diagram of archive information packet of the present invention
Fig. 3 is second embodiment of the invention schematic diagram;
Fig. 4 is second embodiment of the invention module class figure;
Fig. 5 is a kind of big file de-encapsulation method flow chart of archive information packet of the present invention;
Fig. 6 is third embodiment of the invention module map;
Fig. 7 is that third embodiment of the invention encapsulation package verifies flow chart;
Fig. 8 is that third embodiment of the invention EEP packet browser is loaded into flow chart;
Fig. 9 is a kind of server module figure of the present invention;
Figure 10 is a kind of client modules figure of the present invention;
Figure 11 is a kind of big file envelope of archive information packet of the present invention, decapsulation dress system embodiment schematic diagram.
Specific embodiment
The principle and features of the present invention will be described below with reference to the accompanying drawings, and the given examples are served only to explain the present invention, and It is non-to be used to limit the scope of the invention.
It is as shown in Figure 1 first embodiment of the invention schematic diagram, the big Document encapsulation of archive information packet that the present invention realizes, solution Package system realizes the big Document encapsulation method of archive information packet in server-side, as follows:
Receive the large file and package request that client is sent;
The large file is encapsulated as encapsulation package using base64 coding techniques and xml technology;
After checking request receive that client sends, the encapsulation package is returned to client.
The big file de-encapsulation method of archive information packet is realized in client, as follows:
Large file and package request are sent to server-side;
Big Fileview, which is sent, to server-side requests and receive the big Document encapsulation packet that server-side returns;
Big Document encapsulation packet is decapsulated by reversed packing rule and provides corresponding reader for checking.
Wherein the large file is encapsulated as encapsulation package using base64 coding techniques and xml technology by the server-side Specific implementation flow as shown in Fig. 2, including the following steps:
Received large file is stored under scheduled file path by S100, forms the electronics original text of mounting;
S101 obtains the electronics original text for the mounting stored under the scheduled file path, and member is obtained from data source Data, the not limited to of the electronics original text;
The electronics original text includes archive files, archive files papery scanned copy, image file, the scanning of image file papery Part, audio file and video archive.
The electronics original text of the mounting refers to the multiple electronics original texts being sequentially stored under same file path, and system obtains The file path is simultaneously loaded into memory array from electronics original text is read under path.It is also possible to that electronics original is not present under path Text.
S102, generation do not include electronics original text and data signature but contain the original xml of the location information of electronics original text File, the original xml document include metadata;
S103, parses the original xml document in a manner of SAX, and the byte for being successively read regular length is flowed and is spliced to form Continuous byte stream, until reading whole original xml documents;
The regular length of the byte stream of the regular length can be 1024 bytes.
Electronics original text after base64 coded treatment is inserted into the position of the electronics original text marked in the byte stream by S104 Place merges write-in with byte stream and meets the new xml document that encapsulation pack arrangement defines;
Specific implementation procedure is as follows:
The location information that electronics original text is searched in the byte stream once read, if not finding, by the primary reading Byte stream new xml document is written;If finding, new xml document is written into the byte before position, obtains and believes with the position Corresponding electronics original text is ceased, new xml is written in the electronics original text after carrying out base64 coding to electronics original text and encoding base64 New xml document is written in byte stream remaining data after position by file;
Above-mentioned process successively is executed to the byte stream of the regular length read every time, until whole byte stream and electronics original text New xml document is all written.
S105, the signed data of encryption is generated for new xml document, and is packaged into the complete package packet with private key;
With the development and progress of society, archival digitalization is following trend, using not same media as the archives of carrier It is more and more: secretarial document, photo file, audio file and video archive.These are situated between by big Document encapsulation technology of the invention Matter is unified to be saved in archive information packet, is supporting also there is preferable scalability outside existing format, in the future convenient for compatibility The files of format.
Generally in files, there are super large files, such as the video file as unit of G, packet encapsulation technology The processing for supporting this super large file can be properly completed package and the unpacking processing of super large file.
Fig. 3 is second embodiment of the invention schematic diagram, realizes big Document encapsulation method, and wherein a.xml is original xml text Part, the document coding back end of the inside and the signature node below signature object node are sky;B.xml file is new xml text Part, electronics plaintext data and other non-three types nodes after having base64 coding, but not including that signed data;b.xml File is packaged by encryption, is EEP packet after modification file extension.EEP packet is electronic document packaging packet (Electronic Record Encapsulation Package) abbreviation.
Detailed process is as follows:
S1 loads a.xml file;
S2 reads 1024 bytes from a.xml file;
S3 judges whether it is to be read out for the first time, if it is, being put into interim byte stream, otherwise reads the last time The data taken are intercepted, the data that this reads in spelling and become a new interim stream;
S4, the lookup<document coding data>node in interim byte stream if it exists and are to find for the first time, then general It is put into interim chained list list with the offset after character;It is to find for the first time if it exists but not, then it will be inclined after matching character Shifting amount is put into interim chained list list after subtracting regular length;
What the document coding data were stored is the location information of electronics original text;The regular length refers to < document coding Data > length -1;
S5, obtains the quantity count of interim chained list list, and judges whether quantity is greater than 0, executes S6 if more than 0, if S7 is executed equal to 0;
S6 is inserted into first part's data of the interim byte stream, i.e., first<document coding data>section into b.xml Data before point;Following processing is done by circular treatment parameter of quantity count, and every circulation primary count subtracts 1 certainly, directly To count be 0 when circulation terminate:
Obtain current<document coding data>information;
The electronics original text of mounting is obtained from interim array array, and b.xml is written after carrying out base64 coding;
By the data write-in of current<document coding data>to next<document coding data>direct interim byte stream In b.xml;
In S7, the data write-in b.xml that interim byte is flowed;
S8 judges whether a.xml reads and finishes, if so, ending processing;Otherwise S2-S7 is continued to execute.
Fig. 4 is second embodiment of the invention embodiment module class figure, and the function of each module and interface is as described in Table 1.
Table 1
The client decapsulates big Document encapsulation packet by reversed packing rule and provides corresponding reader The specific implementation for for checking is as shown in figure 5, include the following steps:
S200, generates xsd file according to encapsulation package structure definition file, encapsulates pack arrangement using the xsd file checking Legitimacy, if it is illegal, prompting structure is invalid and ends processing process;If legal, S201 is executed;
S201, loading private key, splits signature object part, signs and relatively newly signs and original again to signature object part It whether consistent signs, if inconsistent, prompt signature invalid and end processing process;If consistent, S202 is executed;
S202 separates the electronics original text and metadata in encapsulation package according to reversed packing rule;
S203 provides corresponding reader electronics for checking according to metadata and the type of electronics original text respectively.
Specifically, being directed to metadata, metadata reader is provided, for electronics original text, document files is provided respectively and is checked Device, image file reader and video file reader.
The usual capacity of audio-video document archives is larger, less using depositing to demand in system during package and parsing SAX mode handles the processing for the XML file of structuring in EEP packet, avoids longer system latency time.
Fig. 6 is third embodiment of the invention module map, and each functions of modules is as shown in table 2:
Table 2
Fig. 7 is that third embodiment of the invention encapsulation package verifies flow chart, and the process is as follows:
It is loaded into encapsulation package structure definition file;
According to definition file generated xsd file;
It is loaded into encapsulation package;
Pack arrangement is encapsulated with xsd file checking, then return structure is invalid if it is illegal, then continues to execute if legal;
It is loaded into signature key;
Split signature object part;
It is signed again to signature object part;
New signature is compared with original signature, and it is invalid to sign if inconsistent, if being unanimously verified.
Fig. 8 is that third embodiment of the invention EEP packet browser is loaded into flow chart, and the process is as follows:
It is loaded into EEP packet;
The legitimacy for verifying EEP packet, then shows error message if it is illegal, then continues to execute if legal;
According to reversed packing rule separation electronics original text and metadata;
It shows that electronics original text and metadata provide metadata reader for metadata, for electronics original text, mentions respectively For document files reader, image file reader and video file reader.
Fig. 9 is a kind of server module figure of the present invention, realizes the encapsulation of the big file of archive information packet, comprising:
Receiving module, for receiving the large file and package request of client transmission;
Package module, for the large file to be encapsulated as encapsulation package using base64 coding techniques and xml technology;
Sending module returns to the encapsulation package to client after checking request for what is sent in reception client.
The package module includes:
Storage unit forms the electronics of mounting for received large file to be stored under scheduled file path Original text;
First acquisition unit, for obtaining the electronics original text for the mounting stored under the scheduled file path, and from number According to obtaining metadata, the not limited to of the electronics original text in source.
Original document generation unit, for generating not comprising electronics original text and data signature but containing the position of electronics original text The original xml document of confidence breath, the original xml document include metadata;
Resolution unit is flowed, for parsing the original xml document in a manner of SAX, is successively read the byte stream of regular length And it is spliced to form continuous byte stream, until reading whole original xml documents;
Electronics original text after base64 coded treatment is inserted into the electronics marked in the byte stream by big File write unit At the position of original text, merges write-in with byte stream and meet the new xml document that encapsulation pack arrangement defines;
Packaged unit for generating the signed data of encryption for new xml document, and is packaged into the complete package with private key Packet.
The big file writing module includes:
Single writing unit, for searching the location information of electronics original text in the byte stream once read, if not finding, New xml document then is written into the byte stream once read;If finding, new xml document is written into the byte before position, Electronics original text corresponding with the location information is obtained, the electricity after carrying out base64 coding to electronics original text and encoding base64 New xml document is written in sub- original text, and new xml document is written in the byte stream remaining data after position;
Cycling element executes above-mentioned process for the byte stream successively to the regular length read every time, until whole New xml document is all written in byte stream and electronics original text.
Figure 10 is a kind of client modules figure of the present invention, realizes the parsing of encapsulation package and checks, comprising:
The client includes:
First sending module, for sending large file and package request to server-side;
The big file envelope that server-side returns is requested and received to second sending module for sending big Fileview to server-side Dress packet;
Decapsulation module is looked into accordingly for big Document encapsulation packet to be decapsulated and provided by reversed packing rule See device for checking.
The decapsulation module includes:
Structure verification unit is examined for generating xsd file according to encapsulation package structure definition file using the xsd file Close down the legitimacy of dress pack arrangement;
Signature check unit splits signature object part, signs and compare again to signature object part for being loaded into private key Whether relatively new signature and original signature are consistent;
Separative unit, for separating the electronics original text and metadata in encapsulation package according to reversed packing rule;
It checks unit, according to metadata and the type of electronics original text, provides corresponding reader electronics respectively for checking.
A kind of big Document encapsulation of archive information packet, decapsulating system, server-side and client.
Figure 11 is a kind of big Document encapsulation decapsulating system embodiment schematic diagram of archive information packet of the present invention, the number of client Amount no less than one, server-side includes application server, assembling server and Message Queuing server, wherein application server Quantity is no less than one.
The client is used to send large file and package request to server-side, monitors in real time, when the generation of EEP packet at Function or failure will close real-time monitoring;
The assembling server, the big Document encapsulation for receiving client are requested and are sent out after being assembled request data Give Message Queuing server;The archives for receiving client check request, and obtain archives encapsulation by Message Queuing server Packet is sent to client;
The Message Queuing server, for receiving the message sent of assembling server and all message being put into message Queue, and detect which platform application server is in idle condition, queue scheduling is carried out, the message of message queue is sent to application Server;Receiving archives, which check request and obtain big Document encapsulation packet from application server, is sent to assembling server;
The application server receives large file and uses base64 coding techniques and xml technology by large file It is packaged into the encapsulation package for meeting application demand;When receive client returns to big Document encapsulation packet to client when checking request, After receiving the message of Message Queuing server and completing processing, more new database.
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims (3)

1. a kind of big Document encapsulation method of archive information packet is applied to server-side, which comprises the steps of:
S000 receives large file and package request that client is sent;
Received large file is stored under scheduled file path by S100, forms the electronics original text of mounting;
S101, obtains the electronics original text for the mounting stored under the scheduled file path, and metadata is obtained from data source, The not limited to of the electronics original text;
S102, generation do not include electronics original text and data signature but contain the original xml document of the location information of electronics original text, The original xml document includes metadata;
S103 parses the original xml document in a manner of SAX, and the byte for being successively read regular length flows and encodes base64 Treated, and electronics original text is inserted at the position of the electronics original text marked in the byte stream, and the byte stream is spliced to form company Continuous byte stream, until reading whole original xml documents;
The continuous byte stream is merged write-in and meets the new xml document that encapsulation pack arrangement defines by S104;
S105, the signed data of encryption is generated for new xml document, and is packaged into the complete package packet with private key;
S200 returns to the encapsulation package to client after checking request receive that client sends.
2. a kind of big Document encapsulation method of archive information packet according to claim 1, which is characterized in that the electronics original text packet Include archive files, archive files papery scanned copy, image file, image file papery scanned copy, audio file and video archive.
3. a kind of server-side, which is characterized in that the server-side includes:
Receiving module, for receiving the large file and package request of client transmission;
Package module, the package module include:
Storage unit forms the electronics original text of mounting for received large file to be stored under scheduled file path;
First acquisition unit, for obtaining the electronics original text for the mounting stored under the scheduled file path, and from data source Middle acquisition metadata, the not limited to of the electronics original text;
Original document generation unit, for generating not comprising electronics original text and data signature but containing the position letter of electronics original text The original xml document of breath, the original xml document include metadata;
Big File write unit is successively read the byte stream of regular length and is inserted into the electronics original text after base64 coded treatment At the position of the electronics original text marked in the byte stream, the byte stream is spliced to form continuous byte and is flowed, until reading The continuous byte stream is merged write-in and meets the new xml document that encapsulation pack arrangement defines by whole original xml documents;
Packaged unit for generating the signed data of encryption for new xml document, and is packaged into the complete package packet with private key;
Sending module returns to the encapsulation package to client after checking request for what is sent in reception client.
CN201610797042.1A 2016-08-31 2016-08-31 A kind of big Document encapsulation method of archive information packet and client Active CN106339362B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610797042.1A CN106339362B (en) 2016-08-31 2016-08-31 A kind of big Document encapsulation method of archive information packet and client

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610797042.1A CN106339362B (en) 2016-08-31 2016-08-31 A kind of big Document encapsulation method of archive information packet and client

Publications (2)

Publication Number Publication Date
CN106339362A CN106339362A (en) 2017-01-18
CN106339362B true CN106339362B (en) 2019-09-24

Family

ID=57822421

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610797042.1A Active CN106339362B (en) 2016-08-31 2016-08-31 A kind of big Document encapsulation method of archive information packet and client

Country Status (1)

Country Link
CN (1) CN106339362B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110737629A (en) * 2019-08-30 2020-01-31 华迪计算机集团有限公司 method and system for archiving electronic files
CN112632009A (en) * 2020-12-29 2021-04-09 航天信息股份有限公司 Electronic file processing method and device, storage medium and electronic equipment
CN116150105B (en) * 2023-04-20 2023-07-11 北京云唤维科技有限公司 Reading and analyzing method and system for electronic file long-term storage package

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101043512A (en) * 2006-03-21 2007-09-26 环达电脑(上海)有限公司 Electronic mail system
CN101158996A (en) * 2007-06-01 2008-04-09 华中科技大学 Digital resource copyright controller
CN101436141A (en) * 2008-11-21 2009-05-20 深圳创维数字技术股份有限公司 Firmware upgrading and encapsulating method and device based on digital signing
CN101478730A (en) * 2007-11-12 2009-07-08 华为技术有限公司 Data exchanging method, system and device
CN102075568A (en) * 2010-12-28 2011-05-25 广东万维博通信息技术有限公司 Key item project file management method based on software as a service (SaaS) mode
CN105282124A (en) * 2014-07-24 2016-01-27 上海未来宽带技术股份有限公司 Transmission method and presentation method of progressive picture based on XMPP

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101043512A (en) * 2006-03-21 2007-09-26 环达电脑(上海)有限公司 Electronic mail system
CN101158996A (en) * 2007-06-01 2008-04-09 华中科技大学 Digital resource copyright controller
CN101478730A (en) * 2007-11-12 2009-07-08 华为技术有限公司 Data exchanging method, system and device
CN101436141A (en) * 2008-11-21 2009-05-20 深圳创维数字技术股份有限公司 Firmware upgrading and encapsulating method and device based on digital signing
CN102075568A (en) * 2010-12-28 2011-05-25 广东万维博通信息技术有限公司 Key item project file management method based on software as a service (SaaS) mode
CN105282124A (en) * 2014-07-24 2016-01-27 上海未来宽带技术股份有限公司 Transmission method and presentation method of progressive picture based on XMPP

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于OAIS电子文件管理系统的设计与实现;周佳;《中国优秀硕士学位论文全文数据库 信息科技辑》;20120315(第03期);第I138-1257页 *
电子档案元数据方案设计与应用初探;毛海帆;《档案学研究》;20100125(第1期);第77页第3节第1段 *

Also Published As

Publication number Publication date
CN106339362A (en) 2017-01-18

Similar Documents

Publication Publication Date Title
KR102125162B1 (en) Media encapsulation and decapsulation techniques
US9530012B2 (en) Processing extensible markup language security messages using delta parsing technology
CN108345685A (en) More granularity data processing methods, system, equipment and storage medium under block chain
CN106339362B (en) A kind of big Document encapsulation method of archive information packet and client
JP2008537259A (en) Efficient description of relationships between resources
US7716290B2 (en) Send by reference in a customizable, tag-based protocol
CN103098484A (en) Method and apparatus for encapsulating coded multi-component video
JP7391963B2 (en) Apparatus and method for signaling information in container file format
CN101997643B (en) Method and system for packing electronic files
US20090030924A1 (en) Device and Method for Generating a Media Package
CN103618781A (en) File transmission method of service system and electronic file management system
CN111930708B (en) Ceph object storage-based object tag expansion system and method
CN103596011B (en) The storage processing method and device of view data
CN110532233A (en) A kind of epub document generating method and system
CN101488139B (en) Document management method and device thereof
CN107122433A (en) A kind of merging method of compound document and the system for realizing this method
WO2004029837A1 (en) Multimedia file format
US9081755B2 (en) Method for processing a data tree structure
Płoszajski Metadata in long-term digital preservation
KR102170738B1 (en) Method, apparatus and system for transmitting and receiving message
EP4068781A1 (en) File format with identified media data box mapping with track fragment box
Harada et al. Archive and preservation of media content using MPEG-A
WO2007006090A1 (en) Systems and methods for use in transforming electronic information into a format
CN101997864A (en) System architecture for realizing electronic document packaging and constructing method thereof
Baldwin et al. Content Packaging Approach for a Large OAIS Repository

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant