WO2010026223A1 - Method and device for encoding elements - Google Patents

Method and device for encoding elements Download PDF

Info

Publication number
WO2010026223A1
WO2010026223A1 PCT/EP2009/061479 EP2009061479W WO2010026223A1 WO 2010026223 A1 WO2010026223 A1 WO 2010026223A1 EP 2009061479 W EP2009061479 W EP 2009061479W WO 2010026223 A1 WO2010026223 A1 WO 2010026223A1
Authority
WO
WIPO (PCT)
Prior art keywords
data structure
current element
encoded
encoding
attribute value
Prior art date
Application number
PCT/EP2009/061479
Other languages
French (fr)
Inventor
Ren Lei Chen
Guang Hua Zhou
Wen Juan Song
Xiao Jun Ma
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Priority to EP09811130.5A priority Critical patent/EP2327028B1/en
Priority to JP2011525563A priority patent/JP5536066B2/en
Priority to CN200980131070.8A priority patent/CN102119384B/en
Priority to US12/737,936 priority patent/US8193952B2/en
Publication of WO2010026223A1 publication Critical patent/WO2010026223A1/en

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/14Tree-structured documents
    • G06F40/143Markup, e.g. Standard Generalized Markup Language [SGML] or Document Type Definition [DTD]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/14Tree-structured documents
    • G06F40/146Coding or compression of tree-structured data

Definitions

  • the present invention relates to data process, and more particularly, relates to a method and a device for encoding elements. Background
  • a structured document is a set of elements each associated with a type and at least one attribute, and interconnected by relationships that are mainly hierarchical.
  • a typical example of the structured document is the extensible markup language (XML) document.
  • the structured document includes markers (also called "tags") for separating different elements.
  • An element may itself comprise a plurality of attributes and lower-level elements, which are also called sub-elements.
  • the structured document presents a tree or hierarchical structure, each node represents an element and is connected to a node at a higher hierarchical level representing an element that contains the elements at lower level.
  • the nodes located at the ends of branches in such a tree structure represent elements containing data that can not be divided into information sub-elements.
  • the data of the node located at the ends of branches is considered as the attribute value of a certain type.
  • the schema-based compression method There are several compression methods for encoding structured documents, of which one is the schema-based compression method.
  • the schema for defining a structured document itself is also a structured document.
  • a typical example of the schema is the XML schema.
  • an XML schema is a set of schema components that define the structure of an XML instance.
  • the schema component, which itself is also an element is a generic term for the building blocks that comprise the data model template of the schema.
  • a Finite State Automaton is derived from the definition of a schema, and then an instance of the schema or portion of such instance can be converted to a bit stream with the aid of the corresponding FSA.
  • Some schema components may have an occurrence constraint, which is defined by the attributes of minOccurs and maxOccurs . This kind of schema components is usually called occurrence node.
  • each element comprises a data structure of a type and at least one attribute value.
  • the method comprises the steps of: selecting a current element for encoding; determining whether the current element has the same structure type as a previously encoded element; in the negative, encoding the data structure of the current element and the at least one attribute value of the current element; and in the affirmative, encoding the at least one attribute value of the current element and providing an indication value indicating the current element has the same data structure type as the previously encoded element.
  • each element comprises a data structure of a type and at least one attribute value.
  • the method comprises the steps of: selecting the encoded data of a current element for decoding; and if determining said current element has same structure type as the previously decoded element based on a portion of the encoded data indicating the current element has the same data structure type as the previous decoded element, deriving the at least one attribute value by decoding said encoded data and deriving the data structure of said current element by using the data structure of said previous decoded element.
  • the present invention provides a data structure for carrying the encoded data of a current element, wherein the current element has a data structure of a type and at least one attribute value.
  • the data structure comprises an attribute value field used to carry the encoded data of the at least one attribute value; and an indication field used to indicate whether the current element has the same data structure type as a previously encoded element.
  • an encoder for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value.
  • the encoder comprises: an input module (402) configured to receive data; and a process module (403) configured to determine whether the current element has the same structure type as a previously encoded element, encode the data structure of the current element and the at least one attribute value in response to the negation of said determination , and encode the at least one attribute value of the current element and provide an indication value indicating the current element has the same data structure type as the previously encoded element in response to the affirmation of said determination.
  • the present invention provides a decoder for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value.
  • the decoder comprises: an input module (502) configured to receive the encode data of a current element for decoding; and a process module (503) configured to determine whether said current element has same structure type as the previously decoded element based on a portion of the encoded data, wherein the portion of the encoded d indicats the current element has the same data structure type as the previous decoded element, and responsive to the affirmation of the determination derive the at least one attribute value by decoding said encoded data and derive the data structure of said current element by using the data structure of said previous decoded element.
  • it reduces the encoding redundancy of the structure information .
  • Fig. 1 is a diagram illustrating the state transition for the occurrence node according to an embodiment of the present invention.
  • Fig. 2 is a flow chart illustrating the encoding method carried out by the encoder device according to the embodiment of the present invention.
  • Fig. 3 is a flow chart illustrating the decoding method carried out by the decoder device according to the embodiment of the present invention.
  • Fig. 4 is a block diagram illustrating the encoder device according to the embodiment of the present invention.
  • Fig. 5 is a block diagram illustrating the decoder device according to the embodiment of the present invention .
  • the embodiment is elaborated in a data processing environment employing a schema-based compression method.
  • a schema-based compression method As an example, the document ISO/IEC 15938-1 : 2002/Amd 2: 2006 Information Technology -Multimedia Content Description Interface-Parti, Systems, available in ISO website, defines certain aspects of a schema-based compression environment.
  • the embodiment described below is placed in a framework of such environment along with the changes indicated in the description. However, the invention should not be limited to the described embodiment .
  • an FSA is used to encode the structure information of elements.
  • the structure information of an instance includes information about components of the element except the data value contained in the element in an instance of a structured document, for example, sequence, choice, properties and other structures which compose the element.
  • the FSA uses the Shunt transition and Loop transition (Loop start transition, Loop end transition and Loop continue transition) to encode one or more elements or groups of elements.
  • states of "Repeat state” and "Unrepeat state” are added in order to reduce the redundant structure information.
  • Element transition when crossed, it specifies to the decoder which element is present.
  • Loop transition it is used to model the decoding of one or more elements or groups of elements.
  • "Loop transition” comprises the "loop start transition", the “loop end transition”, the “loop continue transition”, the “Repeat transition” and the "Unrepeat transition”.
  • Loop start transition it is crossed when there are many occurrences of some elements or groups of elements to be decoded.
  • Loop continue transition it is crossed when there is at least one more element or group of elements to be decoded.
  • Loop end transition it is crossed when there are no more elements or groups of elements to be decoded.
  • Code transition it is associated with a binary code and a signature. Code transition is crossed when its associated binary code is read from the binary description stream. The binary code is deduced from its signature .
  • Shunt transition it is a special kind of code transition. Its binary code value is always equal to 0.
  • Simple state it has no specific behavior and is used to structure the automaton.
  • Repeat state it is crossed when the element has the same structure information as the previous element.
  • Unrepeat state it is crossed when the element has different structure information compared to the previous element.
  • each element is parsed one by one, and recursively for the nested elements.
  • it loops in the FSA as shown in Fig. 1, and the codes of passed transitions form the encoded result.
  • the example of the XML instance in the background shows that element el occurs 5 times with different data values. Firstly, it crosses code transition, loop transition and element transition to type state. Since it is the first time that element el has occurred, it crosses loop continue transition to simple state directly.
  • the type state second time in the course of encoding the second element el determines whether the structure information of the second element el is the same as that of the previous encoded element. If they are the same and the next element is also el, then it will cross the repeat state to the simple state. Or otherwise, if they are not same, it will cross the unrepeat state. The procedure is iterated until all elements are parsed. And at last, it will cross the loop end transition to the end state .
  • the encoder device compresses the XML instance by encoding it with the aid of the corresponding XML schema.
  • Fig. 2 is a flow chart illustrating the encoding method with the aid of the XML schema carried out by the encoder device according to the embodiment: [0060] -In step 201, the encoder device generates all FSAs, which are used to encode elements in an XML file, based on the XML schema.
  • step 202 the encoder device receives an XML file associated with the XML schema to be encoded.
  • step 203 the encoder device gets an element from the XML file as current element.
  • step 204 the encoder device determines whether the current element is EOF (end of file) . If it is EOF, then the encoder device will end the encoding process in step 205. Or otherwise, if not, it will go to step 206.
  • EOF end of file
  • the encoder device encodes the structure information of the current element by using the corresponding FSA to generate encoded structure information.
  • the encoded structure information is typically in binary format.
  • the data value contained in the current element can be encoded at this step or at a later step after outputting the final encoded structure information to generate an encoded data value.
  • the combination of the encoded structure information and the encoded data value forms the resulting encoded element.
  • the following steps will mainly be focused on the aspect of encoding the structure information.
  • step 207 the encoder device determines whether the current element corresponds to a sub-element of the occurrence node and the previous element corresponds to a sub-element of the same occurrence node. If not, it will go to the step 209, or if yes, it will go to the step 208. Due to the method is intended to reduce the redundancy during the encoding of the occurrence node, this step is intended to determine whether the current element belongs to the same occurrence node. Therefore, it can save the following determination steps in case the current element and the previous element do not belong to the same occurrence node.
  • the encoder device outputs the encoded structure information of the current element.
  • the encoder device determines whether the element definition information of the current element is the same as that of the previous element. If not, it will go to the step 212, or if yes, it will go to the step 210. This step is used to distinguish the elements being different sub-element of the same occurrence node. Sometimes, this step is necessary because different structure information of different elements may have the same encoded structure information.
  • the element definition information is the information used to define the detail structure of the element in the schema.
  • element definitions of element el and element e2 in the XML schema are not the same.
  • the information relating to the previous element such as element definition information, FSA and the encoded structure information, is stored in a temporary or volatile storage device, such as RAM, when the encoder device performs the step 206 to the previous element. And the storage device is updated after the current element is encoded.
  • the encoder outputs an indication value indicating the difference, such as bit "0", and the encoded structure information of the current element.
  • the encoder device determines whether the encoded structure information of the current element is the same as that of the previous element in the XML file by comparing the encoded structure information of the current element generated in the step
  • the encoder outputs an indication value indicating the sameness, such as bit "1".
  • the indication value can also be recognized as a flag to indicate the presence of the encoded structure information .
  • the steps 208 and 210 are jointly used to determine whether the structure information of the current element in the XML file is the same as the previous one because different elements with different structure information may have the same encoded structure information in binary format. Furthermore, it is apparent to one skilled in the art that other means are possible to determine the sameness of the structure information between the current element and the previous element when applying the method to other environment where the instance of a structured document is encoded with the aid the schema. Through using the flag indicative of the presence of the encoded structure information, it reduces the size of encoded elements having the same structure information so as to save the storage size and the bandwidth when transmitting the XML file containing such elements .
  • the encoder device does not generate all FSAs in step 201. Instead, the encoder device merely generates the necessary corresponding FSA for an element before encoding the element, or the FSAs are pre-stored in the device instead of being generated.
  • the step 206 is not necessarily performed before the step 207. If other methods or means are used to perform the determination of step 210 without the use of the encoded structure information. But the structure information of the element should be encoded before it is outputted.
  • the method can be applied on a fragment of an XML file instead of the whole XML file.
  • FIG. 3 is a flow char illustrating the decoding method carried out by the decoder device according to the present embodiment of the invention.
  • the decoder device generates all the FSAs for decoding the encoded elements based on the corresponding XML schema. [0077] -In step 302, the decoder device gets the current encoded element belonging to the occurrence node.
  • the decoder device determines whether the structure information of the current element is the same as that of the previous element based on the indication value contained in the encoded element. If the same, it will go to step 304, or if not it will go to step 305.
  • the indication value of bit "1" indicates the structure information of the current element is the same as that of the previous one, and the bit "0" indicates they are different.
  • the indication value can be considered as a flag used to indicate whether the encoded structure information is present or not.
  • information relating to the previous element is temporarily stored in a buffer or a storage device when the decoder device decodes the previous element, and the content of the buffer or storage device is updated after the decoder device decodes the current element.
  • step 304 the decoder device outputs the stored structure information of the previous element.
  • step 305 the decoder device decodes the encoded structure information based on the corresponding FSA to generate the structure information of the current element and outputs the structure information of the current element.
  • the decoding process of the encoded data value can be done in the course of decoding the encoded structure information or after the encoded structure information is decoded.
  • a data structure for carrying encoded element of occurrence node type in a schema-based compression environment.
  • the data structure comprises an indication field, and further may comprise a structure information field and a content field for conveying the encoded structure information and the encoded data value of the element separately.
  • the indication field is used to indicate whether the structure information of the element is the same as that of the previous element. If the structure information of the element is the same as that of the previous one, the indication field is set a value indicating the sameness and the structure information field is not present, or otherwise, the indication field is set a value indicating the difference and the structure information field is present. Therefore, the indication field can also be used to indicate whether the structure information field is present or not.
  • Fig. 4 is a block diagram illustrating the encoder device according to the present embodiment of the invention.
  • the encoder device 400 comprises an FSA module 401, an input module 402, a process module 403, an output module 404 and a buffer module 405.
  • the FSA module 401 is configured to provide FSA based on an XML schema for the process module 403.
  • the FSA can be provided in a way that the FSA module 401 generates the FSA upon the process module's request for the FSA, or the FSA module 401 firstly generates all FSAs based on the XML schema, stores all the FSAs in a storage device, and then returns the FSA to the process module 403 responsive to the request for the FSA.
  • the input module 402 is configured to receive data.
  • the output module 404 is configured to output data.
  • the buffer module 405 is configured to buffer data.
  • the process module 403 is configured to receive an element to be encoded from the input module 402 as the current element, and determines whether the structure information of the current element is the same as that of the previous element based on the structure information of the previous element provided by the buffer module 405. If the same, the process module 403 will use the output module 404 to output an indication value indicating the absence of the encoded structure information. If not, the process module 403 will use the output module 404 to output an indication value indicating the presence of the encoded structure information, and the process module is further configured to encode the structure information of the current element to generate the encoded structure information of the current element based on the corresponding FSA received from the FSA module 401.
  • Fig. 5 is a block diagram illustrating the decoder device according to the present embodiment of the invention.
  • the decoder device 500 comprises an FSA module 501, an input module 502, a process module 503, an output module 504 and a buffer module 505.
  • the FSA module 501 is configured to provide FSA for the process module 503 based on an XML schema.
  • the input module 502 is configured to receive data.
  • the output module 404 is configured to output data.
  • the buffer module 505 is configured to buffer data.
  • the process module 503 of the decoder device 500 is configured to generate the structure information of the element based on the data received from the input module 502 by using the corresponding FSA derived from the FSA module 501.
  • the decoder device 500 receives the encoded element from the input module 502 as the current encoded element, and determines whether the structure information of the current element is the same as that of the previous element based on the indication value contained in the encoded element. If they are the same, the process module 503 will use the output module 504 to output the structure information of the previous element, which is stored in the buffer module 505 when decoding the encoded structure information of the previous element.
  • the process module 503 will decode encoded structure information of the current element based on the corresponding FSA received from the FSA module 501, and uses the output module 504 to output the structure information of the current element.
  • information relating to the previous element such as the structure information of the previous element, is stored in the buffer module 505 when the process module 503 decodes the previous element, and the information stored in the buffer module 505 is updated after the current encoded element is decoded.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A method for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value is provided. The method comprises the steps of: selecting a current element for encoding; determining whether the current element has the same data structure type as a previously encoded element; in the negative, encoding the data structure of the current element and the at least one attribute value of the current element; and in the affirmative, encoding the at least one attribute value of the current element and providing an indication value indicating the current element has the same data structure type as the previously encoded element. The method is used to reduce the encoding redundancy of the structure information.

Description

METHOD AND DEVICE FOR ENCODING ELEMENTS
Technical Field
[0001] The present invention relates to data process, and more particularly, relates to a method and a device for encoding elements. Background
[0002] Presently, data is often stored and transmitted in a structured document that contains a plurality of different types of data. A structured document is a set of elements each associated with a type and at least one attribute, and interconnected by relationships that are mainly hierarchical. A typical example of the structured document is the extensible markup language (XML) document. [0003] The structured document includes markers (also called "tags") for separating different elements. An element may itself comprise a plurality of attributes and lower-level elements, which are also called sub-elements. Thus, the structured document presents a tree or hierarchical structure, each node represents an element and is connected to a node at a higher hierarchical level representing an element that contains the elements at lower level. The nodes located at the ends of branches in such a tree structure represent elements containing data that can not be divided into information sub-elements. Herein, the data of the node located at the ends of branches is considered as the attribute value of a certain type. [0004] There are several compression methods for encoding structured documents, of which one is the schema-based compression method. The schema for defining a structured document itself is also a structured document. A typical example of the schema is the XML schema. Generally, an XML schema is a set of schema components that define the structure of an XML instance. The schema component, which itself is also an element, is a generic term for the building blocks that comprise the data model template of the schema. In the process of compressing an instance of a structured document using a schema-based compression method, a Finite State Automaton (FSA) is derived from the definition of a schema, and then an instance of the schema or portion of such instance can be converted to a bit stream with the aid of the corresponding FSA. Some schema components may have an occurrence constraint, which is defined by the attributes of minOccurs and maxOccurs . This kind of schema components is usually called occurrence node.
[0005] Below is an example of an XML schema containing an occurrence node with maxOccurs attribute set to 100. [0006] <?xml version="1.0" encoding="ISO-8859-l" ?> <schema targetNamespace="urn : thomson : SchemaExample" xmlns="http: //www. w3. org/2001/XMLSchema" xmlns : s="urn : thomson : SchemaExample" xmlns:xs="http: //www. w3. org/2001/XMLSchema" xmlns : xsi="http : //www. w3. org/2001/XMLSchema-instance" > [0007] <element name="testSchema"> [0008] <complexType>
[0009] <choice maxOccurs="100">
[0010] <element name="el" type="xs : string" />
[0011] <element name="e2" type="xs : string" />
[0012] <element name="e3" type="xs : string"/> [0013] <element name="e4" type="xs : string" />
[0014] <element name="e5" type="xs : string"/>
[0015] </choice> [0016] </complexType>
[0017] </element> [0018] </schema>
[0019] Below is an example of an instance according to the above XML schema.
[0020] <?xml version="1.0" encoding="ISO-8859-l" ?> [0021] <s : testSchema xmlns : s="urn : thomson : SchemaExample" xmlns :b="urn : thomson : SchemaB" xmlns : a="urn : thomson : SchemaA" xmlns : c="urn : thomson : SchemaC" xmlns : xsi="http : //www. w3. org/2001/XMLSchema-instance" xsi : schemaLocation="urn : thomson : SchemaExample . /SchemaExa mple . xsd">
[0022] <el>AAAA</el> [0023] <el>BBBB</el> [0024] <el>CCCC</el> [0025] <el>DDDD</el> [0026] <el>EEEE</el> [0027] </s:testSchema> [0028] It can be seen that element el repeats 5 times with different data values in this XML instance. The conventional schema-based compression method generates 5 times the same structure information of element el in the resulting encoded bit stream, which is deemed redundant. SUMMARY
[0029] According to an aspect of the present invention, it provides a method for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value. The method comprises the steps of: selecting a current element for encoding; determining whether the current element has the same structure type as a previously encoded element; in the negative, encoding the data structure of the current element and the at least one attribute value of the current element; and in the affirmative, encoding the at least one attribute value of the current element and providing an indication value indicating the current element has the same data structure type as the previously encoded element.
[0030] According to an aspect of the present invention, it provides a method for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value. The method comprises the steps of: selecting the encoded data of a current element for decoding; and if determining said current element has same structure type as the previously decoded element based on a portion of the encoded data indicating the current element has the same data structure type as the previous decoded element, deriving the at least one attribute value by decoding said encoded data and deriving the data structure of said current element by using the data structure of said previous decoded element.
[0031] According to an aspect of the present invention, it provides a data structure for carrying the encoded data of a current element, wherein the current element has a data structure of a type and at least one attribute value. The data structure comprises an attribute value field used to carry the encoded data of the at least one attribute value; and an indication field used to indicate whether the current element has the same data structure type as a previously encoded element.
[0032] According to an aspect of the present invention, it provides an encoder for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value. The encoder comprises: an input module (402) configured to receive data; and a process module (403) configured to determine whether the current element has the same structure type as a previously encoded element, encode the data structure of the current element and the at least one attribute value in response to the negation of said determination , and encode the at least one attribute value of the current element and provide an indication value indicating the current element has the same data structure type as the previously encoded element in response to the affirmation of said determination. [0033] According to an aspect of the present invention, it provides a decoder for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value. The decoder comprises: an input module (502) configured to receive the encode data of a current element for decoding; and a process module (503) configured to determine whether said current element has same structure type as the previously decoded element based on a portion of the encoded data, wherein the portion of the encoded d indicats the current element has the same data structure type as the previous decoded element, and responsive to the affirmation of the determination derive the at least one attribute value by decoding said encoded data and derive the data structure of said current element by using the data structure of said previous decoded element. [0034] According to an aspect of the present invention, it reduces the encoding redundancy of the structure information . [0035] It is to be understood that other aspects and advantages of the present invention will be found after reading the following detailed description of the present invention .
BRIEF DESCRIPTION OF THE DRAWINGS
[0036] The below description explains an embodiment of the invention with the help of the accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application. The invention is not limited to the embodiment .
[0037] In the drawings :
[0038] Fig. 1 is a diagram illustrating the state transition for the occurrence node according to an embodiment of the present invention.
[0039] Fig. 2 is a flow chart illustrating the encoding method carried out by the encoder device according to the embodiment of the present invention. [0040] Fig. 3 is a flow chart illustrating the decoding method carried out by the decoder device according to the embodiment of the present invention.
[0041] Fig. 4 is a block diagram illustrating the encoder device according to the embodiment of the present invention.
[0042] Fig. 5 is a block diagram illustrating the decoder device according to the embodiment of the present invention .
DETAILED DESCRIPTION [0043] The embodiment of the present invention will now be described in detail in conjunction with the drawings. In the following description, some detailed descriptions of known functions and configurations may be omitted for clarity and conciseness.
[0044] The embodiment is elaborated in a data processing environment employing a schema-based compression method. As an example, the document ISO/IEC 15938-1 : 2002/Amd 2: 2006 Information Technology -Multimedia Content Description Interface-Parti, Systems, available in ISO website, defines certain aspects of a schema-based compression environment. The embodiment described below is placed in a framework of such environment along with the changes indicated in the description. However, the invention should not be limited to the described embodiment .
[0045] In the schema-based compression method, an FSA is used to encode the structure information of elements. Herein, the structure information of an instance includes information about components of the element except the data value contained in the element in an instance of a structured document, for example, sequence, choice, properties and other structures which compose the element. As can be seen from Fig. 1 the state transition diagram for the occurrence node according to the embodiment of the present invention, the FSA uses the Shunt transition and Loop transition (Loop start transition, Loop end transition and Loop continue transition) to encode one or more elements or groups of elements. Furthermore, states of "Repeat state" and "Unrepeat state" are added in order to reduce the redundant structure information. [0046] Since the embodiment is placed in the framework stipulated by ISO/IEC 15938-1 : 2002/Amd 2: 2006 Information Technology -Multimedia Content Description Interface-Parti along with some changes, the below gives a brief introduction about states and transitions:
[0047] Element transition: when crossed, it specifies to the decoder which element is present.
[0048] Type state: when activated, it triggers type decoders . [0049] Loop transition: it is used to model the decoding of one or more elements or groups of elements. In the embodiment, "Loop transition" comprises the "loop start transition", the "loop end transition", the "loop continue transition", the "Repeat transition" and the "Unrepeat transition".
[0050] Loop start transition: it is crossed when there are many occurrences of some elements or groups of elements to be decoded. [0051] Loop continue transition: it is crossed when there is at least one more element or group of elements to be decoded.
[0052] Loop end transition: it is crossed when there are no more elements or groups of elements to be decoded. [0053] Code transition: it is associated with a binary code and a signature. Code transition is crossed when its associated binary code is read from the binary description stream. The binary code is deduced from its signature . [0054] Shunt transition: it is a special kind of code transition. Its binary code value is always equal to 0.
[0055] Simple state: it has no specific behavior and is used to structure the automaton. [0056] Repeat state: it is crossed when the element has the same structure information as the previous element. [0057] Unrepeat state: it is crossed when the element has different structure information compared to the previous element.
[0058] When an XML file or a fragment thereof is compressed, each element is parsed one by one, and recursively for the nested elements. As to the process for an occurrence element, it loops in the FSA as shown in Fig. 1, and the codes of passed transitions form the encoded result. The example of the XML instance in the background shows that element el occurs 5 times with different data values. Firstly, it crosses code transition, loop transition and element transition to type state. Since it is the first time that element el has occurred, it crosses loop continue transition to simple state directly. Secondly, when it reaches the type state second time in the course of encoding the second element el, it determines whether the structure information of the second element el is the same as that of the previous encoded element. If they are the same and the next element is also el, then it will cross the repeat state to the simple state. Or otherwise, if they are not same, it will cross the unrepeat state. The procedure is iterated until all elements are parsed. And at last, it will cross the loop end transition to the end state .
[0059] The encoder device compresses the XML instance by encoding it with the aid of the corresponding XML schema. Fig. 2 is a flow chart illustrating the encoding method with the aid of the XML schema carried out by the encoder device according to the embodiment: [0060] -In step 201, the encoder device generates all FSAs, which are used to encode elements in an XML file, based on the XML schema.
[0061] -In step 202, the encoder device receives an XML file associated with the XML schema to be encoded.
[0062] -In step 203, the encoder device gets an element from the XML file as current element.
[0063] -In step 204, the encoder device determines whether the current element is EOF (end of file) . If it is EOF, then the encoder device will end the encoding process in step 205. Or otherwise, if not, it will go to step 206.
[0064] -In step 206, the encoder device encodes the structure information of the current element by using the corresponding FSA to generate encoded structure information. Herein, the encoded structure information is typically in binary format. Furthermore, the data value contained in the current element can be encoded at this step or at a later step after outputting the final encoded structure information to generate an encoded data value. And the combination of the encoded structure information and the encoded data value forms the resulting encoded element. In order to reduce the redundancy of structure information encoding, the following steps will mainly be focused on the aspect of encoding the structure information.
[0065] -In step 207, the encoder device determines whether the current element corresponds to a sub-element of the occurrence node and the previous element corresponds to a sub-element of the same occurrence node. If not, it will go to the step 209, or if yes, it will go to the step 208. Due to the method is intended to reduce the redundancy during the encoding of the occurrence node, this step is intended to determine whether the current element belongs to the same occurrence node. Therefore, it can save the following determination steps in case the current element and the previous element do not belong to the same occurrence node.
[0066] -In step 209, the encoder device outputs the encoded structure information of the current element. [0067] -In step 208, the encoder device determines whether the element definition information of the current element is the same as that of the previous element. If not, it will go to the step 212, or if yes, it will go to the step 210. This step is used to distinguish the elements being different sub-element of the same occurrence node. Sometimes, this step is necessary because different structure information of different elements may have the same encoded structure information. Herein, the element definition information is the information used to define the detail structure of the element in the schema. It can be seen from the previous example of the XML schema that element definitions of element el and element e2 in the XML schema are not the same. The information relating to the previous element, such as element definition information, FSA and the encoded structure information, is stored in a temporary or volatile storage device, such as RAM, when the encoder device performs the step 206 to the previous element. And the storage device is updated after the current element is encoded. [0068] -In step 212, the encoder outputs an indication value indicating the difference, such as bit "0", and the encoded structure information of the current element. [0069] -In step 210, the encoder device determines whether the encoded structure information of the current element is the same as that of the previous element in the XML file by comparing the encoded structure information of the current element generated in the step
206 to the encoded structure information of the previous element. If yes, it will go to the step 211, or if not, it will go to the step 212.
[0070] -In step 211, the encoder outputs an indication value indicating the sameness, such as bit "1". To some extent, the indication value can also be recognized as a flag to indicate the presence of the encoded structure information .
[0071] According to an aspect of the present invention, a man skilled in the art will understand that the step
207 is intended to determine whether the current element and the previous element belong to the same occurrence node, and the steps 208 and 210 are jointly used to determine whether the structure information of the current element in the XML file is the same as the previous one because different elements with different structure information may have the same encoded structure information in binary format. Furthermore, it is apparent to one skilled in the art that other means are possible to determine the sameness of the structure information between the current element and the previous element when applying the method to other environment where the instance of a structured document is encoded with the aid the schema. Through using the flag indicative of the presence of the encoded structure information, it reduces the size of encoded elements having the same structure information so as to save the storage size and the bandwidth when transmitting the XML file containing such elements .
[0072] According to a variant of the present embodiment, the encoder device does not generate all FSAs in step 201. Instead, the encoder device merely generates the necessary corresponding FSA for an element before encoding the element, or the FSAs are pre-stored in the device instead of being generated. [0073] According to a variant of the present embodiment, the step 206 is not necessarily performed before the step 207. If other methods or means are used to perform the determination of step 210 without the use of the encoded structure information. But the structure information of the element should be encoded before it is outputted. [0074] According to a variant of the present embodiment, the method can be applied on a fragment of an XML file instead of the whole XML file.
[0075] Fig. 3 is a flow char illustrating the decoding method carried out by the decoder device according to the present embodiment of the invention.
[0076] -In step 301, the decoder device generates all the FSAs for decoding the encoded elements based on the corresponding XML schema. [0077] -In step 302, the decoder device gets the current encoded element belonging to the occurrence node.
[0078] -In step 303, the decoder device determines whether the structure information of the current element is the same as that of the previous element based on the indication value contained in the encoded element. If the same, it will go to step 304, or if not it will go to step 305. As an example, the indication value of bit "1" indicates the structure information of the current element is the same as that of the previous one, and the bit "0" indicates they are different. In other words, the indication value can be considered as a flag used to indicate whether the encoded structure information is present or not. Herein, information relating to the previous element is temporarily stored in a buffer or a storage device when the decoder device decodes the previous element, and the content of the buffer or storage device is updated after the decoder device decodes the current element.
[0079] -In step 304, the decoder device outputs the stored structure information of the previous element. [0080] -In step 305, the decoder device decodes the encoded structure information based on the corresponding FSA to generate the structure information of the current element and outputs the structure information of the current element.
[0081] Furthermore, the decoding process of the encoded data value can be done in the course of decoding the encoded structure information or after the encoded structure information is decoded.
[0082] According to the embodiment of the present invention, there is provided a data structure for carrying encoded element of occurrence node type in a schema-based compression environment. The data structure comprises an indication field, and further may comprise a structure information field and a content field for conveying the encoded structure information and the encoded data value of the element separately. The indication field is used to indicate whether the structure information of the element is the same as that of the previous element. If the structure information of the element is the same as that of the previous one, the indication field is set a value indicating the sameness and the structure information field is not present, or otherwise, the indication field is set a value indicating the difference and the structure information field is present. Therefore, the indication field can also be used to indicate whether the structure information field is present or not. [0083] Fig. 4 is a block diagram illustrating the encoder device according to the present embodiment of the invention. The encoder device 400 comprises an FSA module 401, an input module 402, a process module 403, an output module 404 and a buffer module 405. The FSA module 401 is configured to provide FSA based on an XML schema for the process module 403. The FSA can be provided in a way that the FSA module 401 generates the FSA upon the process module's request for the FSA, or the FSA module 401 firstly generates all FSAs based on the XML schema, stores all the FSAs in a storage device, and then returns the FSA to the process module 403 responsive to the request for the FSA. The input module 402 is configured to receive data. The output module 404 is configured to output data. The buffer module 405 is configured to buffer data. The process module 403 is configured to receive an element to be encoded from the input module 402 as the current element, and determines whether the structure information of the current element is the same as that of the previous element based on the structure information of the previous element provided by the buffer module 405. If the same, the process module 403 will use the output module 404 to output an indication value indicating the absence of the encoded structure information. If not, the process module 403 will use the output module 404 to output an indication value indicating the presence of the encoded structure information, and the process module is further configured to encode the structure information of the current element to generate the encoded structure information of the current element based on the corresponding FSA received from the FSA module 401. Herein, information relating to the previous element, such as encoded structure information of the previous element, is stored in the buffer module 405 when the process module 403 encodes the previous element, and the information stored in the buffer module 405 is updated after the current element is encoded. [0084] Fig. 5 is a block diagram illustrating the decoder device according to the present embodiment of the invention. The decoder device 500 comprises an FSA module 501, an input module 502, a process module 503, an output module 504 and a buffer module 505. The FSA module 501 is configured to provide FSA for the process module 503 based on an XML schema. The input module 502 is configured to receive data. The output module 404 is configured to output data. The buffer module 505 is configured to buffer data. The process module 503 of the decoder device 500 is configured to generate the structure information of the element based on the data received from the input module 502 by using the corresponding FSA derived from the FSA module 501. To be specific, the decoder device 500 receives the encoded element from the input module 502 as the current encoded element, and determines whether the structure information of the current element is the same as that of the previous element based on the indication value contained in the encoded element. If they are the same, the process module 503 will use the output module 504 to output the structure information of the previous element, which is stored in the buffer module 505 when decoding the encoded structure information of the previous element. Or otherwise, if not the same, the process module 503 will decode encoded structure information of the current element based on the corresponding FSA received from the FSA module 501, and uses the output module 504 to output the structure information of the current element. Herein, information relating to the previous element, such as the structure information of the previous element, is stored in the buffer module 505 when the process module 503 decodes the previous element, and the information stored in the buffer module 505 is updated after the current encoded element is decoded.
[0085] Below is experimental data along with annotations. Regarding the example of schema and XML instance thereof, the output under the framework stipulated by ISO/IEC 15938-1 :2002/Amd 2: 2006 Information Technology Multimedia Content Description Interface-Parti is shown below. 0000 0100 #number of element 000 #position code 0 #typecast flag 0 0100 #size of the string
0100 0001 0100 0001 0100 0001 0100 0001 #value of the string 000 #position code 0 #typecast flag 0 0100 0100 0010 0100 0010 0100 0010 0100 0010 #value of the string
000 #position code
0 #typecast flag 0 0100 #size of the string
0100 0011 0100 0011 0100 0011 0100 0011 #value of the string
000 #position code
0 #typecast flag 0 0100 #size of the string
0100 0100 0100 0100 0100 0100 0100 0100 #value of the string
000 #position code
0 #typecast flag 0 0100 #size of the string
0100 0101 0100 0101 0100 0101 0100 0101 #value of the string
000
[0086] The output in according to the embodiment of the invention is shown below:
0000 0100 #number of element
000 #position code
0 #typecast flag
0 0100 #size of the string 0100 0001 0100 0001 0100 0001 0100 0001 #value of the string
1 #repeat flag
0 0100 #size of the string
0100 0010 0100 0010 0100 0010 0100 0010 #value of the string
1 #repeat flag
0 0100 #size of the string 0100 0011 0100 0011 0100 0011 0100 0011 #value of the string
1 #repeat flag
0 0100 #size of the string 0100 0100 0100 0100 0100 0100 0100 0100 #value of the string
1 #repeat flag
0 0100 #size of the string
0100 0101 0100 0101 0100 0101 0100 0101 #value of the string 000
[0087] It can be seen from the above experimental data that the redundant structure information is reduced. [0088] A number of embodiments have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different embodiments may be combined, supplemented, modified, or removed to produce other embodiments. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting embodiments will perform at least substantially the same function (s), in at least substantially the same way(s), to achieve at least substantially the same result (s) as the embodiments disclosed.

Claims

1. A method for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by the steps of: selecting a current element for encoding; determining whether the current element has the same data structure type as a previously encoded element; in the negative, encoding the data structure of the current element and the at least one attribute value of the current element; and in the affirmative, encoding the at least one attribute value of the current element and providing an indication value indicating the current element has the same data structure type as the previously encoded element.
2. The method according to claim 1, characterized in that the current element immediately follows the previously encoded element in the set of elements.
3. The method according to claims 1 or 2, characterized in that the current element and the previously encoded element are of occurrence node type and the encoding process employs a schema.
4. The method according to any one of the claims 1 to 3, characterized in that the method further comprises: in the negative, providing another indication value indicating the current element has a different data structure type as the previously encoded element.
5. The method according to the claim 3, characterized in that the step of encoding by using said schema further comprises: deriving an encoding tool corresponding to said current element from said schema for encoding the data structure of said current element (201); and encoding the data structure of said current element based on said encoding tool (206) .
6. The method according to any one of claims 1 to 5, characterized in that said set of elements are received from a file or a fragment associated with said schema.
7. A method for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by the steps of: selecting the encoded data of a current element for decoding; and if determining said current element has the same data structure type as a previously decoded element based on a portion of the encoded data indicating the current element has the same data structure type as the previously decoded element, deriving the at least one attribute value by decoding said encoded data and deriving the data structure of said current element by using the data structure of said previous decoded element.
8. The method according to the claim 7, characterized by further comprising: if determining said current element has a different data structure type compared to the previously decoded element, deriving the at least one attribute value and the data structure by decoding said encoded data of said current element.
9. A data structure for carrying the encoded data of a current element having a data structure of a type and at least one attribute value, characterized by comprising: an attribute value field used to carry the encoded data of the at least one attribute value of the current element; and an indication field used to indicate whether the current element has the same data structure type as a previously encoded element.
10. An encoder for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by comprising: an input module (402) configured to receive data; and a process module (403) configured to determine whether a current element to be encoded has the same data structure type as a previously encoded element, encode the data structure of the current element and the at least one attribute value in response to the negation of said determination , and in response to the affirmation of said determination, encode the at least one attribute value of the current element and provide an indication value indicating the current element has the same data structure type as the previously encoded element.
11. The encoder according to the claim 10, characterized in that the current element and the previously encoded element are of occurrence node type and the process module (403) employs a schema to encode the data structure.
12. The encoder according to the claim 11, characterized in that the encoder further comprises: an encoding tool module (501) configured to provide a tool for encoding the data structure of an element by deriving from the schema; and the encoding of the data structure further comprises: said process module (403) is further configured to encode the data structure of said current element based on the corresponding tool of said current element provided by said encoding tool module (501) .
13. A decoder for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by comprising: an input module (502) configured to receive the encode data of a current element for decoding; and a process module (503) configured to determine whether said current element has the same structure type as a previously decoded element based on a portion of the encode data indicating the current element has the same data structure type as the previously decoded element, and responsive to the affirmation of the determination derive the at least one attribute value by decoding said encoded data and derive the data structure of said current element by using the data structure of said previous decoded element .
14. The decoder according to the claim 13, characterized in that in response to the negation of said determination, the said process module (503) is further configured to derive the at least one attribute value and the data structure of said current element by decoding said encoded data of said current element.
15. A storage medium for encoding a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by comprising instructions for: selecting a current element for encoding; determining whether the current element has the same data structure type as a previously encoded element; in the negative, encoding the data structure of the current element and the at least one attribute value of the current element; and in the affirmative, encoding the at least one attribute value of the current element and providing an indication value indicating the current element has the same data structure type as the previously encoded element.
16. A storage medium for decoding encoded data of a set of elements wherein each element comprises a data structure of a type and at least one attribute value, characterized by comprising instructions for: selecting the encoded data of a current element for decoding; and if determining said current element has the same data structure type as a previously decoded element based on a portion of the encoded data indicating the current element has the same data structure type as the previously decoded element, deriving the at least one attribute value by decoding said encoded data and deriving the data structure of said current element by using the data structure of said previous decoded element.
PCT/EP2009/061479 2008-09-08 2009-09-04 Method and device for encoding elements WO2010026223A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP09811130.5A EP2327028B1 (en) 2008-09-08 2009-09-04 Method and device for encoding elements
JP2011525563A JP5536066B2 (en) 2008-09-08 2009-09-04 Element encoding method and apparatus
CN200980131070.8A CN102119384B (en) 2008-09-08 2009-09-04 Method and device for encoding elements
US12/737,936 US8193952B2 (en) 2008-09-08 2009-09-04 Method and device for encoding elements

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP08305534.3 2008-09-08
EP08305534A EP2161667A1 (en) 2008-09-08 2008-09-08 Method and device for encoding elements

Publications (1)

Publication Number Publication Date
WO2010026223A1 true WO2010026223A1 (en) 2010-03-11

Family

ID=41090333

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2009/061479 WO2010026223A1 (en) 2008-09-08 2009-09-04 Method and device for encoding elements

Country Status (5)

Country Link
US (1) US8193952B2 (en)
EP (2) EP2161667A1 (en)
JP (1) JP5536066B2 (en)
CN (1) CN102119384B (en)
WO (1) WO2010026223A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10311137B2 (en) * 2015-03-05 2019-06-04 Fujitsu Limited Grammar generation for augmented datatypes for efficient extensible markup language interchange

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2813743B1 (en) * 2000-09-06 2003-01-03 Claude Seyrat COMPRESSION / DECOMPRESSION PROCESS FOR STRUCTURED DOCUMENTS
JP3894280B2 (en) * 2001-02-02 2007-03-14 インターナショナル・ビジネス・マシーンズ・コーポレーション Encoding method of XML data, decoding method of encoded XML data, encoding system of XML data, decoding system of encoded XML data, program, and recording medium
US7158990B1 (en) * 2002-05-31 2007-01-02 Oracle International Corporation Methods and apparatus for data conversion
JP2005148970A (en) * 2003-11-13 2005-06-09 Meidensha Corp Method for converting data
JP2005284903A (en) * 2004-03-30 2005-10-13 Matsushita Electric Ind Co Ltd Document encoding system, document decoding system, method for encoding document, and method for decoding document
JP2005332274A (en) * 2004-05-20 2005-12-02 Toshiba Corp Data structure of metadata stream for object in dynamic image, retrieval method and reproduction method
US8346737B2 (en) * 2005-03-21 2013-01-01 Oracle International Corporation Encoding of hierarchically organized data for efficient storage and processing
TWI295446B (en) * 2005-12-30 2008-04-01 Ind Tech Res Inst Executing system and executing method of intelligent rule base service
CN100458793C (en) * 2007-05-10 2009-02-04 浪潮集团山东通用软件有限公司 Mapping conversion method between data access level Xml format data and relational data
KR20090017030A (en) * 2007-08-13 2009-02-18 삼성전자주식회사 A method for encoding/decoding metadata and an apparatus thereof

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BUSATTO G ET AL: "Efficient memory representation of XML document trees", INFORMATION SYSTEMS, PERGAMON PRESS, OXFORD, GB, vol. 33, no. 4-5, 1 June 2008 (2008-06-01), pages 456 - 474, XP022668371, ISSN: 0306-4379, [retrieved on 20080115] *
BYRON CHOI ED - MONG LI LEE ET AL: "Document Decomposition for XML Compression: A Heuristic Approach", DATABASE SYSTEMS FOR ADVANCED APPLICATIONS LECTURE NOTES IN COMPUTER SCIENCE;;LNCS, SPRINGER, BERLIN, DE, vol. 3882, 1 January 2006 (2006-01-01), pages 202 - 217, XP019029997, ISBN: 978-3-540-33337-1 *
LIEFKE H ET AL: "XMILL: AN EFFICIENT COMPRESSOR FOR XML DATA", SIGMOD RECORD, ACM, NEW YORK, NY, US, vol. 29, no. 2, 1 June 2000 (2000-06-01), pages 153 - 164, XP001002286, ISSN: 0163-5808 *
RAMEZ ALKHATIB ET AL: "Efficient Compression and Querying of XML Repositories", DATABASE AND EXPERT SYSTEMS APPLICATION, 2008. DEXA '08. 19TH INTERNATIONAL CONFERENCE ON, IEEE, PISCATAWAY, NJ, USA, 1 September 2008 (2008-09-01), pages 365 - 369, XP031320675, ISBN: 978-0-7695-3299-8 *
SEBASTIAN MANETH ET AL: "XML Tree Structure Compression", DATABASE AND EXPERT SYSTEMS APPLICATION, 2008. DEXA '08. 19TH INTERNATIONAL CONFERENCE ON, IEEE, PISCATAWAY, NJ, USA, 1 September 2008 (2008-09-01), pages 243 - 247, XP031320655, ISBN: 978-0-7695-3299-8 *

Also Published As

Publication number Publication date
JP5536066B2 (en) 2014-07-02
EP2327028A1 (en) 2011-06-01
CN102119384A (en) 2011-07-06
EP2161667A1 (en) 2010-03-10
US20110148673A1 (en) 2011-06-23
JP2012502337A (en) 2012-01-26
CN102119384B (en) 2014-06-11
US8193952B2 (en) 2012-06-05
EP2327028B1 (en) 2023-06-28

Similar Documents

Publication Publication Date Title
AU2002253002B2 (en) Method and system for compressing structured descriptions of documents
JP3368883B2 (en) Data compression device, database system, data communication system, data compression method, storage medium, and program transmission device
US20110283183A1 (en) Method for compressing/decompressing structured documents
JP4884438B2 (en) Method for compressing hierarchical tree and method for decoding compressed multimedia signal
US20080077606A1 (en) Method and apparatus for facilitating efficient processing of extensible markup language documents
US20070044012A1 (en) Encoding of markup language data
AU2002253002A1 (en) Method and system for compressing structured descriptions of documents
CN1251135C (en) Self-descriptive data tag
JP2004536481A (en) Encoding and decoding method of path in tree structure of structured document
US7676742B2 (en) System and method for processing of markup language information
US7627586B2 (en) Method for encoding a structured document
JP2006221656A (en) High-speed encoding method and system of data document
US7797346B2 (en) Method for improving the functionality of the binary representation of MPEG-7 and other XML based content descriptions
EP2327028B1 (en) Method and device for encoding elements
US7571152B2 (en) Method for compressing and decompressing structured documents
US20060013322A1 (en) Method for encoding an xml-based document
US8922798B2 (en) Storage medium and method for producing a printed product
JP2008512886A (en) Method for encoding XML-based documents

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200980131070.8

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09811130

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12737936

Country of ref document: US

ENP Entry into the national phase

Ref document number: 2011525563

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2009811130

Country of ref document: EP