CN100449530C - Schema, syntactic analysis method and method of generating a bit stream based on schema - Google Patents
Schema, syntactic analysis method and method of generating a bit stream based on schema Download PDFInfo
- Publication number
- CN100449530C CN100449530C CNB028014421A CN02801442A CN100449530C CN 100449530 C CN100449530 C CN 100449530C CN B028014421 A CNB028014421 A CN B028014421A CN 02801442 A CN02801442 A CN 02801442A CN 100449530 C CN100449530 C CN 100449530C
- Authority
- CN
- China
- Prior art keywords
- data
- bit stream
- xsd
- file
- relevant
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
- G06F16/986—Document structures and storage, e.g. HTML extensions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/258—Data format conversion from or to a database
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention proposes a new type of schema derived from the (B2)XML schema which permits to describe a coding format in a generic fashion. Such a schema is used to carry out the syntactic analysis of a bit stream (B1) to produce a document (B4) that represents the bit stream which is an instance of said schema, or to generate a bit stream from a document representing the bit stream.
Description
The present invention relates to a kind ofly to comprising a method that bit stream carries out grammatical analysis of the data with the structure of observing a certain form and content, this method was intended to tree-shaped an expression that generates this stream.The present invention relates to the method that generates a bit stream according to a certain form from a document equally, and the document is the tree-shaped expression of this bit stream and comprises data, particularly by using the data of a certain input pattern input.The invention still further relates to an outline, its the definition may have one or more minutes face (facet) one or more data types and comprise a plurality of elements, it is for these elements describe that a title, data type, one are staggered, an order and a plurality of predefine or incidence at random, and the incidence of an element is compulsory or selectable.
The invention still further relates to the computer program that is used to realize this method, a processing unit that comprises the memory storage that is used to store a kind of like this outline and such computer program, and a transmission system that comprises at least one source entity and a destination entity, this source entity comprises the device that is used to store such outline and one or more such programs.
The present invention is applied on editor, modification and the merging field of content.
An example of application of the present invention is as the adaptive content (memory capacity is transmitted used link for screen, calculated capacity ...) that will be transferred to the destination of a function of user profiles.Like this adaptive allows to avoid the useless transmission with out of use data to the recipient especially, and therefore saved passband.
About this point, No. 0101530 a generation method is described by the french patent application that Koninklijke Philips E1ectronics N.V. proposed in May 2 calendar year 2001, be used for generating file with some characteristic element from a basic file of writing and describe the progressive file on a basis with SGML, this method comprises a switch process, generate a document after the conversion by being applied to this basic file as an intended conversion of the function of this characteristic element, the file with this characteristic element generates from the document after this conversion.This method is on a basic file to carry out necessary conversion, and in this basic file, the structure of progressive file is conspicuous, and it provides possibility for the progressive document in this basis of avoiding having to decode differently writes down it.
But, this document generating method has hinted a generation of describing the basic file of this progressive file in basis, and generates another file from the document after this conversion.
An object of the present invention is to propose to carry out an automatic and method in common of such operation.
Therefore, the present invention comprises new outline-type of proposition especially, and the type allows to describe a kind of coded format with a kind of universal mode.
One or more data types that may have one or more minutes faces have been defined according to a kind of outline of the present invention.This outline comprises a plurality of elements, and it is for these elements describe that a title, data type, one are staggered, an order and a plurality of predefine or incidence at random, and the incidence of an element is compulsory or selectable.And it is characterized in that it has at least one in the following characteristic element:
-it has defined a kind of data type, the type is corresponding to the scale-of-two section of uncertain length, and has at least one branch face relevant with this data entry mode, and have alternatively with this scale-of-two section stop to indicate a relevant branch face,
-it has defined one or more data types, and these types are corresponding to the binary word of predetermined length, and have at least one branch face relevant with this length, and are the relevant branch face of position of filling up that comprises with this binary word alternatively,
-it has defined one or more variablees that are made of data access path and has comprised that one or more conditional branchings describe as the various structures of the function of this value or this variable or possible content.
According to method of the present invention, carry out grammatical analysis to comprising a bit stream that has according to the data of the structure of a certain form and content, generate a tree-shaped expression of this stream, the method is characterized in that it comprises:
A) read an outline, it is used for always describing this form;
A) definition may comprise one or more data types of one or more minutes faces, particularly:
-when this form uses the binary word of predetermined length; The binary word of the corresponding predetermined length of one or more data types, this data type has at least one branch face relevant with this length, and when the binary word of this predetermined length may comprise one or more fill up, this data type had one and fills up the relevant branch face in position with this
-use the scale-of-two section of uncertain length when this form, when it has the content of planning to import with this expression by using a certain input pattern, data type is to should the scale-of-two section, this data type has at least one branch face relevant with described input pattern, and when this scale-of-two Duan Youyi stops sign and defines, this data type has one and stops to indicate relevant branch face with this
B) comprise a plurality of elements, it is for these elements describe that a title, data type, one are staggered, predefine or order at random and a plurality of incidence, and the incidence of an element is compulsory or selectable,
When c) data that are positioned at this bit stream front end when this format specification provide the information of the structure of relevant this bit stream remainder or content,
-one or more variablees of forming to the described access path that is positioned at the data of front end of definition in this tree-shaped expression,
-and comprise one or more conditional branchings with the various possible structure or the content of description as a function of this variate-value,
B) in this bit stream search to the data of the element that should outline comprises,
C) generate an example of this outline, this outline is included in the data that find in this bit stream and its forms this tree-shaped expression.
Therefore the present invention includes the instrument of making, it allows always to describe a form in the outline.The present invention includes then and use such outline to be used to analyze grammer, so that generate this stream of expression and as the document of an example of this stream according to the bit stream of this form.This outline stipulated according in the bit stream of this form the various data types that may comprise, and the mode that in last document, comprises these data.The data type of defined and the rule that in the end comprises in the document depend on the form of being considered in this outline.
By way of example, when the form of being considered is the JPEG2000 form when (ISO/IEC FCD standard 15444-1), this outline:
-definition is corresponding to the various data types of the binary word of predetermined length, they may comprise fill up the position and have a branch face relevant with this length and one or more and this fill up a branch face of being correlated with,
-defined a data type, it is corresponding to the scale-of-two section that stops to indicate the uncertain length of definition by, and has plan by using the content of a certain input pattern with this expression input, and have the branch face relevant and one and stop to indicate relevant branch face with this with input pattern
-definition is by one or more variablees of forming to an access path of the data that are positioned at this bit stream front end in this tree-shaped expression, and it provides about the structure of this bit stream remainder or the information of content,
-and comprise that one or more conditional branchings are described may be as the various structures or the content of the function of this variate-value.
According to a kind of method of the present invention, from document, generates a bit stream according to a certain form, the document is the tree-shaped expression of this bit stream and comprises data, particularly the data of importing by a certain input pattern.The method is characterized in that and comprise:
A) read the document,
B) read an outline that is used for always describing this form with parallel mode:
A) the one or more data types that may have one or more minutes faces of definition, particularly:
-one data type wherein has at least one branch face relevant with this input pattern corresponding to the scale-of-two section of uncertain length
-and, when this form uses the binary word of predetermined length, one or more data types are corresponding to the binary word of predetermined length, wherein has at least one branch face relevant with this length, and when this binary word comprises one or more fill up, have one and fill up the relevant branch face in position with this
B) comprise a plurality of elements, it is for these elements describe that a title, data type, one are staggered, an order and a plurality of predefine or incidence at random, and the incidence of an element is compulsory or selectable.
C) when this format specification: when the data that are positioned at this bit stream front end provide the information of the structure of relevant this bit stream remainder or content:
-in this tree-shaped expression, be defined into one or more variablees that an access path of the described data that are positioned at front end is formed,
-and comprise that one or more conditional branchings are described may be as the various structures or the content of a function of this variate-value, be included in data type in the document with definition,
C) encode these data as a function of this define styles,
D) data behind this coding are formed a bit stream.
Therefore the present invention comprises the outline of using the front to describe type equally, is used for from representing this bit stream and generating bit stream as the document of an example of this outline.Parallel read data type that the document and outline determine that the document comprised in case according to form coding that form provided they and with bit stream of the composition of the data behind this coding.
By limiting examples, with reference to the embodiment of describing later, these and other aspects of the present invention will become obviously and be carried out explanation.
In the accompanying drawings:
What Fig. 1 represented is a block diagram, has summarized the basic model according to the operation of parsing method of the present invention,
What Fig. 2 represented is a block diagram, has summarized the basic model according to the operation of bit stream generation method of the present invention, and
Fig. 3 is the example according to a transmission system of the present invention.
In Fig. 1 represented block diagram illustrations according to the basic model of the operation of parsing method of the present invention.Frame B1 represents a bit stream according to predetermined format.What frame B2 represented is the outline of always describing this predetermined format.What frame B3 represented is a syntax analyzer, and it allows bit stream B1 is carried out grammatical analysis so that generates document B4, and the document B4 is the tree-shaped expression of bit stream B1 and is the example of outline B2.
Outline B2 has described the grammer of this bit stream.It has stipulated that especially the data type that may occur in this bit stream and these types are included in the mode in the document.Tree-shaped expression B4 relies on the analysis of this bit stream B1 is generated.
In Fig. 2 represented block diagram illustrations according to the operating principle of bit stream generation method of the present invention.What frame B ' 2 represented is the outline of in general manner describing coded format.What frame B ' 4 represented is a document of outline B ' 2 examples.Frame B ' 3 expression be a bit stream maker, its allows parallel document B ' 4 and the outline B ' 2 of reading to generate bit stream B ' 1.Be easily, be used for the outline B2 and B ' the 2nd of same coded format, identical.
Document B ' 4 is read the data type that is comprised among the document B ' 4 to judge with outline B ' 2.For the value that from document B ' 4, reads, the corresponding data type that in outline B ' 2, the finds coding mode that this value will be used of having determined to encode.
Usually certain unnecessary full details that provides coded format structure in the outline.The degree of details depends on the application of being considered.Because mention in the foreword in the application in the use described in the French patent application number 0101530 (generate JPEG2000 file of the method for a file-for example-have the characteristic element that is suitable for recipient's profile), some section of JPEG2000 mark will be provided to parameter one by one in detail to allow these parameters of covering and to revise them.Other sections of mark are handled as a monoblock, and this is because there is no need to visit their included parameters.
Document B4 and B ' 4 be XML document and outline B2 and B ' the 2nd preferably, writes with the newspeak that is derived from XML outline language.XML is (extend markup language of issue on October 6th, 2,000 1.0 editions) that the W3C of association is recommended, and the XML outline is the recommended project of same W3C association issue on October 24th, 2000.This XML recommends and the XML outline recommended project can obtain on interconnected network address http://www.w3.org/.But, this is not restricted and may uses by using dissimilar language in the inventive principle that the residue branch face of instructions is described in detail, for example, and by AT﹠amp; The BRICS of the Aarhus university of T and Denmark advises and available DSD type language (file structure description) on interconnected network address http://www.brics.dk/DSD/.
From physical viewpoint, an XML document comprises entity especially, and their possibilities are analyzed and comprise text, that is, and and the characteristic sequence that belongs to the predetermined characteristic collection and represent a mark or text data.
From the viewpoint of logic, an XML document comprises one or more elements, stops its boundary of identifier mark by beginning flag symbol and one.Element may interlock.Each element is accorded with and is stopped title pointed in the identifier by its beginning flag and identifies.An element may have a value.Element value is placed in its beginning flag symbol and stops between the identifier.
In the example that will describe, select data below so that it is directly placed the element of XML document and has simplified this realization (that is to say that the data that XML document comprised have been formed the value of XML element).
The XML outline is the outline language that allows specified content and XML document structure: more particularly, XML outline allows descriptive element and title of each element, a data type, staggered, as a to occur frequency and an a plurality of incidence.Order that occurs and incidence number may be predefine or at random.The incidence of an element may be compulsory or selectable.
An outline has defined a class of XML document.The example of an XML outline is with reference to the effective XML document of this outline.
Realize a concrete example of the present invention in the following description and for providing, will consider the bit stream of JPEG2000 form.This no longer is that restricted and obvious the present invention can be applied to extended formatting.For some form, it may be necessary that the other types data are added in these types described herein.
In annex A, provided an example of describing the outline of JPEG coded format.This outline has been used the simple data type that defines in general and the accessories B, and the simple data type that derives from these universal classes of annex C definition.Annex D has provided an example of XML document, and it is the example and the expression JPEG bit stream of the outline that provides of annex A.Among annex A, the B and C that is described below, letter " xsd " is identified at the instrument that defines in the XML outline.And letter " bsd " has identified the instrument that is added by the present invention.
According to the present invention, outline B2 (thinking identical with outline B ' 2 in being described below) has defined the type of all data that may comprise according to the JPEG2000 form in a stream.The a certain type of data is Already in the XML outline language and can directly use.Other will make an amendment.Other must be added.
Bit stream according to the JPEG2000 form more particularly comprises following data type:
1) the scale-of-two section of uncertain length, first or second input pattern that its content may describe below by use and in an XML document, importing,
2) binary word of different length may comprise the unessential position of filling up,
3) definition value and in the hexadecimal code of JPEG2000 standard with the mark of the hexadecimal code of XML document input.The 3rd input pattern is formed in this input with hexadecimal code.
This first input pattern comprises binary data is converted to the character that belongs to the used book character collection of XML.Be this purpose, used one easily with known to the name " base 64 " and the coding method of in 6.8 sections of the document RFC2045 of IETF issue, describing.This method comprises a character dividing binary data and convert thereof into 6 hytes and the related book character of each 6 hyte is concentrated.This first input pattern has caused data extending 33%.
In second input pattern, replacement converts this binary data to character and it directly is inserted in the XML document, has introduced the pointer that points to the binary file zone that comprises this binary data in XML document.This XML document just becomes and depends on this binary file then.
In this XML outline language, a data type is defined as three-bit byte, comprising:
-be called as the value collection in value space,
-be called as the lexical representation collection of lexical space,
-Fen Mianji, a branch face correspondence is forced at a constraint in this value space.
Be to realize that the present invention, the digital coding of being write in the bit stream will be clear and definite and undoubtedly.Preexist in the XML outline and a certain type clear and definite and data undoubtedly directly used: promptly this situation, for example, data type " xsd:unsignedShort ", expression-32768 and+integer between 32767 and therefore may impliedly encoding with two bytes." xsd:unsignedInt " and " xsd:unsignedByte " equally also is this situation.
Used data type " xsd:binary " equally according to the present invention, but revised:
1) added a branch face in addition, with a called after " bsd:stopFlagExclusive " stop the sign relevant.Face will be used to point out the end of the scale-of-two section of uncertain length this minute: for example, the JPEG2000 of the scale-of-two section of uncertain length grouping is defined by a mark SOP (grouping beginning) or a mark SOT (beginning of tiling) or a mark EOC (end of code flow); Therefore for a JPEG2000 grouping, this minute face will have FF51 (SOP), in FF90 (SOT) or three values of FFD9 (EOC) one.
2) add a new value to " encoding " branch face, this minute face Already in the XML outline language, value " hex " is used for hexadecimal code and " base 64 " are used for the input after base 64 conversions.Should new value of adding be called as " externalData ".It is used to point out that data import (introduce pointer in XML document, this pointed comprises the bit stream zone of related data) by using above-mentioned second input mode in XML document.
This amended data type called after " binaryNoLength " and in annex C, defining in the following manner:
<xsd:simpleType?name=″binaryNoLength″>
<xsd:annotation>
<xsd:appinfo>
<!--Read?data?until?a?flag?is?found-->
<xsd:hasFacet?name=″stopFlagExclusive″/>
<!--How?binary?data?should?be?instantiated:-->
<!--base64,hex(same?as?for?xsd:binary)-->
<|--externalData(URI?pointing?to?an?external?entity?data?segment)-->
<xsd:hasFacet?name=″encoding″/>
</xsd:appinfo>
</xsd:annotation>
<xsd:restriction?base=″xsd:anySimpleType″/>
</xsd:simpleType>
This amended data type is a general type, and it is used to derive other specific data types that may find in an XML bit stream then.For example, as pointed in the accessories B, the corresponding JPEG2000 grouping of type " PacketDataTYpe ", it derives from universal class " binaryNoLength ".In this example, these data that comprise in dividing into groups are imported in XML document by using second input mode (to the pointer in the bit stream zone that comprises these data) as can be seen.A JPEG2000 grouping has following three value FF51 by one equally as can be seen, and the mark of one of FFD9 or FF90 defines.
<xsd:simpleType?name=″packetDataType″>
<xsd:restriction?base=″bsd:binaryNoLength″>
<xsd:encoding?value=″extemalData″/>
<bsd:stopFlagExclusive?value=″FF51|FFD9|FF90″/>
</xsd:restriction>
</xsd:simpleType>
When reference annex A, find that other elements have the type that derives from " binaryNoLength ", for example, the element of called after " Data " in the complicated COC types of elements.This element " Data " has " encoding " branch face of a value for " base 64 ", this means by using a base 64 conversions to import corresponding data from the bit stream of XML document.
<xsd:complexType?name=″COCType″>
<xsd:sequence>
<xsd:element?name=″Marker>>type=″jp2:markerType>>fixed=″ff53″/>
<xsd:element?name=″Lcoc>>type=″xsd:unsignedShort″/>
<xsd:element?name=″data″>
<xsd:simpleType>
<xsd:restriction?base=″xsd:binary″>
<xsd:length?value=″$Lcoc″/>
<xsd:encoding?value=″base64″/>
</xsd:restriction>
</xsd:simpleType>
</xsd:element>
</xsd:sequence>
</xsd:complexType>
The present invention also comprises increases a new data type, and this data type will be used for the binary word of predetermined length.This new data type called after " bsd:bitArray " and it comprise three branch faces: called after " bitLength " relates to a branch face of the length of binary word, called after " prePadding " relates to a branch face filling up bits number that places before the significance bit, and called after " postPadding " relates to a branch face filling up bits number that places after this significance bit.This new data type defines in annex C in the following manner:
<xsd:simpleType?name=″bitsArray″>
<xsd:annotation>
<xsd:appinfo>
<xsd:hasFacet?name=″bitsLength″/>
<xsd:hasFacet?name=″prePadding″/>
<xsd:hasFacet?name=″postPadding″/>
</xsd:appinfo>
</xsd:annotation>
<xsd:restriction?base=″xsd:anySimpleType″/>
</xsd:simpleType>
Pointed as accessories B, the new data type of this conventional data type is used for deriving the specific data type that JPEG2000 stream can find then.For example, the data type of called after " 11b ", it derives from conventional data type " bitsArray ", does not comprise 11 binary words filling up the position corresponding to one:
<xsd:simpleType?name=″11b″>
<xsd:restriction?base=″bsd:bitsArray″>
<bsd:bitsLength?value=″11″/>
</xsd:restriction>
</xsd:simpleType>
And the data type of called after " 5b3p ", derive from conventional data type " bitsArray " equally, comprise the back with 35 binary words filling up 2 significant bits of position corresponding to one.
<xsd:simpleType?name=″5b3p″>
<xsd:restriction?base=″bsd:bitsArray″>
<bsd:bitsLength?value=″5″/>
<bsd:postPadding?value=″3″/>
</xsd:restriction>
</xsd:simpleType>
With reference to annex A, find to have used data type " 11b ", for example, be used to be named as the element of " mantissa ":
<xsd:element?name=″mantissa”type=″jp2:11b″/>
And use data type " 5b3p ", for example, be used to be named as the element of " exponent ".
<xsd:element?name=″exponent”type=″jp2:5b3p″/>
From the structure viewpoint, the present invention has used the following instrument in the XML outline Already in:
-simple and complicated data type " xsd:simple Type " and " xsd:complexType ",
-element " xsd:element ",
-group model " xsd:group ",
-connector " xsd:sequence ", " xsd:all " and, possibly, " xsd:choice ".
Syntax analyzer B3 reads the tree-shaped expression B4 that bit stream B1 and outline B2 generate bit stream B1.This tree-shaped expression B4 is the example of outline B2.Recurrence generates by explaining the connector that finds in the outline.
More particularly, explain connector " xsd:sequence " in the following manner by syntax analyzer: when this syntax analyzer finds connector " xsd:sequence ", with outline in identical order read element in the bit stream.For example, used connector " xsd:sequence " in the element of the type " CodestreamType " of annex A definition.
<xsd:complexType?name=″CodestreamType″>
<xsd:sequence>
<xsd:element?name=″MainHeader”type=jp2:MainHeaderType″/>
<xsd:element?name=″Tile”type=″jp2:TileType″/>
<xsd:element?name=″EOC”type=″jp2:EOCType″/>
</xsd:sequence>
</xsd:complexType>
According to this definition, element " Codestream " must comprise the back with element " Tile ", again the back with element " Eoc ", an element " MainHeader ".
Connector " xsd:all " is explained in the following manner by syntax analyzer: when this syntax analyzer finds connector " xsd:all ", it is by attempting to read the statement element of first in the connector " xsd:all " (declared element) beginning, if and the failure, then attempt to read second element or the like ...When finding the element that will search for, it forwards the next element of connector " xsd:all " to.When all elements of connector " xsd:all " has traveled through, just forward next connector to.
One of daughter element that comprises for this element or for it when outline defines a fixed value, and when the value that finds in the not corresponding bit stream of this value, the not corresponding element of searching for of the element that finds in this bit stream.For example, when this syntax analyzer managed to realize element " SIZ ", if not to distribute to value " FF51 " beginning of mark " SIZ " in the JPEG2000 standard, the element that then reads from bit stream was not the element of being searched for.
For example, in the element of the type " MainHeader Type " that annex A defines, use connector " xsd:all ".
<xsd:complexType?name=″MainHeaderType″>
<xsd:sequence>
<xsd:element?name=″SOC″type=″jp2:SOCType″/>
<xsd:element?name=″SIZ″type=″jp2:SIZType″/>
<xsd:group>
<xsd:all>
<xsd:element?name=″COD″type=″jp2:CODType″minOccurs=″0″/>
<xsd:element?name=″QCD″type=″jp2:QCDType″minOccurs=″0″/>
<xsd:element?name=″COC″type=″jp2:COCType″minOccurs=″0″/>
<xsd:element?name=″QCC″type=″jp2:QCCType″minOccurs=″0″/>
<xsd:element?name=″RGN″type=″jp2:RGNType″minOccurs=″0″/>
<xsd:element?name=″POC″type=″jp2:POCType″minOccurs=″0″/>
<xsd:element?name=″PPM″type=″jp2:PPMType″minOccurs=″0″/>
<xsd:element?name=″TLM″type=″jp2:TLMType″minOccurs=″0″/>
<xsd:element?name=″PLM″type=″jp2:PLMType″minOccurs=″0″/>
<xsd:element?name=″CRG″type=″jp2:CRGType″minOccurs=″0″/>
<xsd:element?name=″COM″type=″jp2:COMType″minOccurs=″0″/>
</xsd:all>
</xsd:group>
</xsd:sequence>
</xsd:complexType>
According to this definition, an element " MainHeader " must comprise " SOC " element, follows " SIZ " element thereafter, follow a group that can comprise the following element that one or more or zero occurs with at random order: " COD " thereafter, " QCD ", " COC ", " QCC ", " RGN ", " POC ", " PPM ", " TLM ", " PLM ", " CRG ".
Although do not provide any example in annex A, connector " xsd:choice " also can be used.When this syntax analyzer was found a connector " xsd:choice ", it began to manage to read first element of asserting in the connector " xsd:choice " in this bit stream.When syntax analyzer finds the element of being searched, it will forward the next connector in the outline to.When the element that reads was not the element of being searched, syntax analyzer forwarded the following element in this same connector " xsd:choice " to.
And the present invention has introduced the use of variable in outline.The notion of variable is used in XSLT language (conversion of XML Extensible Stylesheet Language (XSL)).XSLT is a kind of language by W3C association regulation, allows definition to can be applicable to the conversion of XML document.Specifically, a variable is a character string that is linked to a value.It can appear at ad-hoc location of document and use Anywhere at other.According to the grammer that defines among the XSLT, conduct interviews by before this character string, placing $ symbol (different word , $ZZZ is the value by the variable of character string ZZZ sign) by the value of the variable of a character string identification.The value of a variable is pointed out in the definition of this variable.The grammer that the present invention allows to use definition in the Xpa th language (XML path language) in the XML tree with the value of the variable of formal definition in path.Xpath equally also is the language of W3C association regulation.XSLT and Xpath standard can obtain at internet sites http://www.w3.org.
The use of variable allows, and for example defines many elements by a parameter, rather than by a normal value defined it.Given at the front end of bit stream when the value of a parameter, and when informing the structure of this bit stream remainder or content, the value of this variable defines by using the Xpath grammer.
For example, in the JPEG2000 form, be included in the number " comp-siz " of the assembly among the marker field SIZ by a parameter " Csiz " definition that is positioned over marker field SIZ front end.In appendix A, element " Csiz " is defined as a variable.Its value is defined by the path in the tree-shaped expression in the structure: to this, expression formula " SIZ/Csiz " indication " Csiz " is a daughter element of element " SIZ ".Then, in the definition of complicated type " SIZ ", this variable " Csiz " is called to the generation number of decision element " comp-siz ".
<xsl:variable?name=″Csiz″>
<xsl:value-of?select=″SIZ/Csiz″/>
</xsl:variable>
<xsd:element?name=″Comp_siz″minOccurs=″$Csiz″maxOccurs=″$Csiz″>
The present invention has also comprised an additional new group model " bsd:conditionalChoice " and two new connectors " xsl:if ", " xsl:choose ".This allows to introduce conditional branching and thereby expresses the condition that may be present in the form that someone attempts to describe and select in outline.Should be noted that connector " xsl:if " and " xsl:choose " use XSLT language definition.According to the standard of XSLT language, connector " xsl:if " and " xsl:choose " have used an attribute " test ", and this attribute allows a choice function of an experimental result of definition.This connector " xsl:if " allows the function of selection of definition as the value of a Boolean variable.This connector " xsl:choose " allows a selection in the many optional values of definition.
By example, the JPEG2000 form provides the type of the appearance of element-specific or data to depend on the value of a given parameter of bit stream front end.
Specifically be contained in the situation of the element " SPcod " in the element " COD ".When the value of variable " PredinctsUsed " was 1, element " SPcod " had comprised " PredinctSize " element separately.This constrains in the outline of having used connector " xsl:if " and expresses.
<xsd:element?name=″SPcod″>
<xsd:eomplexType>
<xsd:sequence>
<xsd:element?name=″nDecompLevels″type=″xsd:unsignedByte″/>
<xsd:element?name=″codeBlockWidth″type=″jp2:4p4b″/>
<xsd:element?name=″codeBlockHeight″type=″jp2:4p4b″/>
<xsd:element?name=″codeBlockStyle″>
<xsd:complexType>
<xsd:sequence>
<xsd:element?name=″optSegMarkers″type=″jp2:2p1b″/>
<xsd:element?name=″optErTerm″type=″jp2:1b″/>
<xsd:element?name=″optVertStrCausal″type=″jp2:1b″/>
<xsd:element?name=″optRegTerm″type=″jp2:1b″/>
<xsd:element?name=″optResetMQ″type=″jp2:1b″/>
<xsd:element?name=″optByPass″type=″jp2:1b″/>
</xsd:sequence>
</xsd:complexType>
</xsd:element>
<xsd:element?name=″Transformation″type=″xsd:unsignedByte″/>
<xsl:iftest=″$PrecinctsUsed=1″>
<xsd:element?name=″PrecinctSize″minOccurs=″0″>
<xsd:complexType>
<xsd:sequence>
<xsd:element?name=″PPy″type=″jp2:4b″/>
<xsd:element?name=″PPx″type=″jp2:4b″/>
</xsd:sequence>
</xsd:complexType>
</xsd:element>
</xsd:sequence>
</xsl:if>
</xsd:complexType>
</xsd:element>
In element " QCD ", find the example of one " bsd:conditionalChoice " group, this group use connector " xsl:choose " to point out one will to follow (Spqcd_1 ", Spqcd_2 or Spqcd_3) the type of element depend on the value of a parameter " quantStyle ", its be arranged in when the position of the tree-shaped expression of pre-treatment given by path " QCD/Spqcd/QuantStyle ".
<bsd:conditionalChoice>
<xsl:choose>
<xsl:when?test=″QCD/Sqcd/quantStyle?=′0′″>
<xsd:element?name=″Spqcd_1″minOccurs=″3*$nDecompLevels+1″
maxOccurs=″3*$nDecompLevels+1″>
<xsd:complexType>
<xsd:sequence>
<xsd:element?name=″exponent″type=″jp2:5b3p″/>
</xsd:sequence>
</xsd:complexType>
</xsd:element>
</xsl:when>
<xsl:when?test=″QCD/Sqcd/quantStyle?=′1′″>
<xsd:element?name=″Spqcd_2″minOccurs=″1″maxOccurs=″1″>
<xsd:complexType>
<xsd:sequence>
<xsd:element?name=″exponent″type=″jP2:5b″/>
<xsd:element?name=″mantissa″type=″jp2:11b″/>
</xsd:sequence>
</xsd:complexType>
</xsd:element>
</xsl:when>
<xsl:when?test=″QCD/Sqcd/quantStyle=′2′″>
<xsd:element?name=″Spqcd_3″minOccurs=″3*$nDecompLevels+1″
maxOccurs=″3*$nDecompLevels+1″>
<xsd:complexType>
<xsd:sequence>
<xsd:element?name=″exponent″type=″jp2:5b″/>
<xsd:element?name=″mantissa″type=″jp2:11b″/>
</xsd:sequence>
</xsd:complexType>
</xsd:element>
</xsl:when>
</xsl:choose>
</bsd:conditionalChoice
When syntax analyzer read this bit stream, it generated the XML tree progressively.When it runs into variable in the outline, for example, an attribute " test " of a connector " if ", this variable is assessed in its path of pointing out by application.
Should be noted that the XML language allows to define the expansion of himself.First embodiment of the present invention thereby comprise the expansion of new tool that additional the present invention proposes as existing XML outline language.Another embodiment of the present invention comprises a complete newspeak that redefines, and this language is got the instrument of XML outline and added the new instrument that is proposed by the present invention.
Fig. 3 has illustrated according to a transmission system of the present invention.Transmission system shown in Figure 3 has comprised a server S V and many client computer CT.Server S V and client computer CT are linked to internet WWW.Server S V comprises storage arrangement MEM and treating apparatus PROC.This storage arrangement comprises an outline B2 especially, and the first bit stream B1 and a computer program PG1 are used to realize one according to parsing method of the present invention, to obtain this first bit stream B1 of expression and to be the first document B4 of the example of this outline B2.This storage arrangement MEM preferably also comprises a computer program PG2, is used to realize represent that from one same document B ' 4 and this outline B2 generate the method for one second bit stream B ' 1.
By way of example, the document B ' 4 obtains by the document B4 is used a conversion, and this conversion depends on the client's of a bit stream of previous request conversion profile.
Claims (6)
1, a kind of adaptive method that will be sent to the content of a destination entity (CT) with profile, this method comprises the step that produces an electronic data file (B4) from a bit stream (B1) that comprises data, these data have structure and the content according to a certain form, this electronic data file (B4) is a tree-shaped expression of this bit stream (B1), it is characterized in that this method may further comprise the steps:
A) read an electronics outline file (B2), be used for always describing this form:
A) definition may comprise one or more data types of one or more minutes faces:
-when this form uses the binary word of predetermined length: the binary word of the corresponding described predetermined length of one or more data types, this one or more data type has at least one branch face relevant with this length, and when the binary word of this predetermined length may comprise one or more fill up, this one or more data type has one and fills up the relevant branch face in position with this
-use the scale-of-two section of uncertain length when this form, this scale-of-two section has plan when using a certain input pattern with this content of representing to import: data type is to should the scale-of-two section, it is relevant with described input pattern that this data type has at least one branch face, and when this scale-of-two Duan Youyi stops sign and defines, this data type has a branch face and stops to indicate relevant with this
B) comprise a plurality of elements, for these elements describe that a title, data type, one are staggered, predefine or order at random and a plurality of incidence, the incidence of an element is compulsory or selectable,
C) when this format specification: when the data that are positioned at this bit stream (B1) front end provide the information of the structure of relevant this bit stream (B1) remainder or content,
-define by one or more variablees of forming to a described access path that is positioned at the data of front end in this tree-shaped expression,
-and comprise that one or more conditional branchings are used to describe various possible structure or the content as a function of this variate-value,
B) in this bit stream (B1) search to the data of the element that comprises in should electronics outline file (B2),
C) example of generation (B3) this electronics outline file (B2), this electronics outline file is included in the data that find in this bit stream (B1) and forms this electronic data file (B4), and this electronic data file is the tree-shaped expression of this bit stream (B1),
D) described electronic data file (B4) is converted to the function of the profile of this destination entity (CT), so that obtain an adaptive electronic data file (B ' 4).
2. a foundation a certain form that is used to have the destination entity (CT) of profile generates the method for a bit stream (B ' 1), this bit stream (B ' 1) produces from a basic electronic data file (B4), it is characterized in that, this method comprises switch process, wherein this basic electronic data file (B4) is converted into the function of the profile of this destination entity (CT), so that obtain an adaptive electronic data file (B ' 4), this electronic data file is the tree-shaped expression of this bit stream (B ' 1) and comprises that this method is further comprising the steps of by using the data of a certain input pattern input:
A) read this adaptive electronic data file (B ' 4);
B) read an electronics outline file (B ' 2) with parallel mode, be used for always describing this form:
A) definition may have one or more data types of one or more minutes faces:
-one data type is corresponding to the scale-of-two section of uncertain length, and it is relevant with this input pattern wherein to have at least one branch face,
-and when this form used the binary word of predetermined length, one or more data types were corresponding to the binary word of predetermined length, it is relevant with this length wherein to have at least one branch face, and when this binary word comprised one or more fill up, to fill up the position relevant with this for branch face
B) comprise a plurality of elements, for these elements describe that a title, data type, one are staggered, an order and a plurality of predefine or incidence at random, the incidence of an element is compulsory or selectable,
C) encode these data as a function of institute's define styles,
D) data behind this coding are formed (B ' 3) described bit stream (B ' 1).
3. server (SV) that can be coupled to the internet, this server (SV) comprising:
-being used for producing the device of an electronic data file (B4) from a bit stream (B1) that comprises data, these data have structure and the content according to a certain form,
-being used for searching for the device of the data of the element that comprises in the corresponding electronics outline file (B2) at this bit stream (B1), this electronics outline file is used for always describing this form:
A) definition may comprise one or more data types of one or more minutes faces:
-when this form uses the binary word of predetermined length, the binary word of the corresponding described predetermined length of one or more data types, these one or more data types have at least one branch face relevant with this length, and when the binary word of this predetermined length may comprise one or more fill up, these one or more data types have one and fill up the relevant branch face in position with this
-use the scale-of-two section of uncertain length when this form, this scale-of-two section has plan when using a certain input pattern with this content of representing to import: data type is to should the scale-of-two section, it is relevant with described input pattern that this data type has at least one branch face, and when this scale-of-two Duan Youyi stops sign and defines, this data type has a branch face and stops to indicate relevant with this
B) comprise a plurality of elements, for these elements describe that a title, data type, one are staggered, predefine or order at random and a plurality of incidence, the incidence of an element is compulsory or selectable,
C) when this format specification: when the data that are positioned at this bit stream (B1) front end provide the information of the structure of relevant this bit stream (B1) remainder or content,
-define by one or more variablees of forming to a described access path that is positioned at the data of front end in this tree-shaped expression,
-and comprise that one or more conditional branchings are used to describe various possible structure or the content as a function of this variate-value;
Be used for generating the device of the example of the electronics outline file (B2) be included in the data that described bit stream (B1) finds, this example constitutes electronic data file (B4), and it is the tree-shaped expression of described bit stream (B1), and
Be used for described electronic data file (B4) is converted to the function of the profile of this destination entity (CT), so that obtain the device of an adaptive electronic data file (B ' 4).
4. server (SV) that can be coupled to the internet, this server (SV) comprising:
-generating the device of a bit stream (B ' 1) according to a certain form of the destination entity (CT) that is used to have profile, this bit stream produces from a basic electronic data file (B4),
Be used for will this basic electronic data file (B4) being converted to the function of the profile of this destination entity (CT), so that obtain the device of an electronic data file (B ' 4), this electronic data file is the tree-shaped expression of this bit stream (B ' 1) and comprises by using the data of a certain input pattern input
Be used to read the device of this electronic data file (B ' 4);
Be used for reading with parallel mode the device of an electronics outline file (B ' 2), this electronics outline file is used for always describing described form;
A) definition may have one or more data types of one or more minutes faces, particularly:
-one data type is corresponding to the scale-of-two section of uncertain length, and it is relevant with this input pattern wherein to have at least one branch face,
-and when this form used the binary word of predetermined length, one or more data types were corresponding to the binary word of predetermined length, it is relevant with this length wherein to have at least one branch face, and when this binary word comprised one or more fill up, to fill up the position relevant with this for branch face
B) comprise a plurality of elements, for these elements describe that a title, data type, one are staggered, an order and a plurality of predefine or incidence at random, the incidence of an element is compulsory or selectable;
Be used for encoding these data as the device of a function of described electronics outline file (B ' 2) institute's define styles, and
Be used for data behind this coding and form the device of described bit stream (B ' 1).
5. a transmission system comprises the server (SV) according to claim 3.
6. a transmission system comprises the server (SV) according to claim 4.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0102764A FR2821458A1 (en) | 2001-02-28 | 2001-02-28 | SCHEME, SYNTAX ANALYSIS METHOD, AND METHOD FOR GENERATING A BINARY STREAM FROM A SCHEME |
FR01/02764 | 2001-02-28 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1462400A CN1462400A (en) | 2003-12-17 |
CN100449530C true CN100449530C (en) | 2009-01-07 |
Family
ID=8860580
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB028014421A Expired - Fee Related CN100449530C (en) | 2001-02-28 | 2002-02-08 | Schema, syntactic analysis method and method of generating a bit stream based on schema |
Country Status (11)
Country | Link |
---|---|
EP (1) | EP1366439A1 (en) |
JP (1) | JP4260481B2 (en) |
KR (1) | KR100898614B1 (en) |
CN (1) | CN100449530C (en) |
BR (1) | BR0204315A (en) |
FR (1) | FR2821458A1 (en) |
MX (1) | MXPA02010534A (en) |
PL (1) | PL363513A1 (en) |
RU (1) | RU2294012C2 (en) |
TW (1) | TW563036B (en) |
WO (1) | WO2002069187A1 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4040577B2 (en) | 2001-11-26 | 2008-01-30 | コーニンクリク・フィリップス・エレクトロニクス・ナムローゼ・フエンノートシャップ | Schema, parsing, and how to generate a bitstream based on a schema |
ATE513415T1 (en) * | 2001-12-28 | 2011-07-15 | Koninkl Philips Electronics Nv | METHOD FOR PROCESSING MULTIMEDIA CONTENT |
CN1751308A (en) * | 2003-02-19 | 2006-03-22 | 皇家飞利浦电子股份有限公司 | Method of producing a document based on a scheme that generically describes the format of a bit stream |
EP2242273A1 (en) * | 2009-04-14 | 2010-10-20 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Transmission scheme for text-based information |
CN104598635B (en) * | 2015-02-06 | 2018-01-19 | 无锡江南计算技术研究所 | A kind of complex documents automatic generation method based on XML descriptions |
CN107092656B (en) * | 2017-03-23 | 2019-12-03 | 中国科学院计算技术研究所 | A kind of tree data processing method and system |
RU2762398C2 (en) * | 2019-12-03 | 2021-12-21 | Владимир Дмитриевич Мазур | Method for transmitting binary data in a standard audio media stream |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999067784A1 (en) * | 1998-06-24 | 1999-12-29 | Seagate Technology Llc | Efficient run length limited code with short interleaved constraint |
WO2000038329A1 (en) * | 1998-12-21 | 2000-06-29 | Koninklijke Philips Electronics N.V. | Device for encoding n-bit source words into corresponding m-bit channel words and decoding m-bit channel words into corresponding n-bit source words |
-
2001
- 2001-02-28 FR FR0102764A patent/FR2821458A1/en not_active Withdrawn
-
2002
- 2002-02-08 WO PCT/IB2002/000393 patent/WO2002069187A1/en active Application Filing
- 2002-02-08 EP EP02710247A patent/EP1366439A1/en not_active Ceased
- 2002-02-08 PL PL02363513A patent/PL363513A1/en not_active Application Discontinuation
- 2002-02-08 CN CNB028014421A patent/CN100449530C/en not_active Expired - Fee Related
- 2002-02-08 KR KR1020027014532A patent/KR100898614B1/en not_active IP Right Cessation
- 2002-02-08 RU RU2003128962/09A patent/RU2294012C2/en not_active IP Right Cessation
- 2002-02-08 JP JP2002568241A patent/JP4260481B2/en not_active Expired - Fee Related
- 2002-02-08 BR BR0204315-7A patent/BR0204315A/en not_active Application Discontinuation
- 2002-02-08 MX MXPA02010534A patent/MXPA02010534A/en unknown
- 2002-02-25 TW TW091103306A patent/TW563036B/en not_active IP Right Cessation
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999067784A1 (en) * | 1998-06-24 | 1999-12-29 | Seagate Technology Llc | Efficient run length limited code with short interleaved constraint |
WO2000038329A1 (en) * | 1998-12-21 | 2000-06-29 | Koninklijke Philips Electronics N.V. | Device for encoding n-bit source words into corresponding m-bit channel words and decoding m-bit channel words into corresponding n-bit source words |
Non-Patent Citations (1)
Title |
---|
Millau: an encoding format for efficient representation and exchange of XML over the Web. GIRARDOT M ET AL.Computer Networks,Vol.33 . 2000 * |
Also Published As
Publication number | Publication date |
---|---|
WO2002069187A1 (en) | 2002-09-06 |
RU2003128962A (en) | 2005-03-10 |
RU2294012C2 (en) | 2007-02-20 |
BR0204315A (en) | 2003-02-18 |
KR20020092459A (en) | 2002-12-11 |
PL363513A1 (en) | 2004-11-29 |
MXPA02010534A (en) | 2003-09-22 |
JP2004519771A (en) | 2004-07-02 |
JP4260481B2 (en) | 2009-04-30 |
CN1462400A (en) | 2003-12-17 |
KR100898614B1 (en) | 2009-05-21 |
FR2821458A1 (en) | 2002-08-30 |
EP1366439A1 (en) | 2003-12-03 |
TW563036B (en) | 2003-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7080318B2 (en) | Schema, syntactic analysis method and method of generating a bit stream based on a schema | |
ES2262000T3 (en) | PROCEDURE AND DEVICES FOR CODING / DECODING STRUCTURED DOCUMENTS, IN PARTICULAR XML DOCUMENTS. | |
KR101011663B1 (en) | Method and apparatus for structured streaming of an xml document | |
US7043686B1 (en) | Data compression apparatus, database system, data communication system, data compression method, storage medium and program transmission apparatus | |
CN1104696C (en) | System and method for automatically adding informational hypertext links to received documents | |
US7886223B2 (en) | Generating a statistical tree for encoding/decoding an XML document | |
CN1998241B (en) | Method and device for coding and encoding an XML document | |
US20050144556A1 (en) | XML schema token extension for XML document compression | |
US20070234192A1 (en) | Encoding and distribution of schema for multimedia content descriptions | |
JP3865694B2 (en) | Path encoding and decoding method in tree structure of structured document | |
CN100449530C (en) | Schema, syntactic analysis method and method of generating a bit stream based on schema | |
US7607080B2 (en) | Method for coding structured documents | |
KR101032240B1 (en) | Method for the creation of a bit stream from an indexing tree | |
US7797346B2 (en) | Method for improving the functionality of the binary representation of MPEG-7 and other XML based content descriptions | |
US8898559B2 (en) | Method for encoding an XML-based document | |
CN102119384B (en) | Method and device for encoding elements | |
KR100919231B1 (en) | Apparatus and Method for Linking Real-Time Data for Rich Media Service | |
KR100910061B1 (en) | Metadata encoding apparatus and method for digital broadcasting and metadata decoding apparatus and method | |
Balalaieva | DATA MODEL FOR EMBEDDING METADATA IN WEB DOCUMENTS BY RNEWS STANDARD. | |
Hong et al. | XFlavor: providing XML features in media representation | |
Timmerer et al. | Efficient processing of MPEG-21 metadata in the binary domain | |
Sanmartino et al. | Secure representation of multimedia content licenses |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090107 Termination date: 20140208 |