WO2006042460A1 - Hidden data communication method and the application thereof in text digital watermark technology - Google Patents

Hidden data communication method and the application thereof in text digital watermark technology Download PDF

Info

Publication number
WO2006042460A1
WO2006042460A1 PCT/CN2005/001703 CN2005001703W WO2006042460A1 WO 2006042460 A1 WO2006042460 A1 WO 2006042460A1 CN 2005001703 W CN2005001703 W CN 2005001703W WO 2006042460 A1 WO2006042460 A1 WO 2006042460A1
Authority
WO
WIPO (PCT)
Prior art keywords
character
string
glyphs
encoding
glyph
Prior art date
Application number
PCT/CN2005/001703
Other languages
French (fr)
Chinese (zh)
Inventor
Dong Liu
Original Assignee
Dong Liu
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN 200410040853 external-priority patent/CN1601956A/en
Application filed by Dong Liu filed Critical Dong Liu
Publication of WO2006042460A1 publication Critical patent/WO2006042460A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/0021Image watermarking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2201/00General purpose image data processing
    • G06T2201/005Image watermarking
    • G06T2201/0062Embedding of the watermark in text images, e.g. watermarking text documents using letter skew, letter distance or row distance

Definitions

  • the invention belongs to the field of communication and information engineering, and specifically relates to data hiding, data encoding and decoding, and digital water printing technology.
  • Digital watermarking technology is an important part of information hiding technology. It uses specific information (digital watermark information) to be hidden in various digital image, sound, video and text digital products by digital embedding. On the one hand, these electronic products with digital watermark information can be used not easily and perceptibly, and on the other hand, digital watermark information embedded in these digital products can be detected by specific technical means. Digital watermarking technology is widely used in many watersheds such as copyright protection of digital products, content verification and anti-counterfeiting, prevention of illegal copying, operation tracking, and secret data communication. According to different digital watermark carriers, digital watermarks can be divided into image digital watermarks, sound digital watermarks, video digital watermarks, and text digital watermarks.
  • the invention relates mainly to the field of text digital watermarking, characterized in that watermark information is hidden in a file composed of characters as elements, and the main existing text digital watermarking techniques are as follows:
  • the text file is converted into an image file, and the watermark information is loaded according to the method provided by the image digital watermarking technology.
  • a disadvantage of this method is that the display and processing of image electronic files with watermark information cannot be performed with most word processing software.
  • the object of the present invention is to provide a text digital watermarking technology for carrying hidden watermark information by using a topology of characters (strings), which is used for solving the visual influence of watermark information, such as watermark information, existing in existing text digital watermarking technology.
  • the carrier file carries the capacity of the watermark information is small, and the digital watermark information of the printed matter is difficult to detect.
  • the technique of the present invention is suitable for the case where the carrier file of the digital watermark information is an electronic file or a printed document.
  • the basic principle of the present invention is to design a plurality of glyphs of semantically identical characters (strings) by appropriately changing the topological structure of characters (strings), and to properly encode glyph features based on the character (string) glyph topology.
  • the digital watermark information is embedded by the encoding of the character (string) glyph, thereby forming a new text digital watermarking technology.
  • the present invention includes the following closely related contents: Confirmation, (1) A method for designing the same character (string) into a plurality of glyphs for carrying digital watermark information;
  • a method for designing the same character (string) into a plurality of glyphs for carrying digital watermark information :
  • the main design idea is to design a variety of character (string) shapes of the same character (string) that are semantically identical by appropriately changing the topology of the characters (strings).
  • the more natural glyph design method is: change the topology of the character (string) by changing the concatenation relationship between the strokes that make up the character (string).
  • any method of changing the character (string) topology is feasible, as long as the change does not cause the semantic confusion of the character (string) itself for the human visual recognition ability.
  • the design method of character glyphs can be similarly extended to the design of string glyphs.
  • the entire string is treated as a whole to change the topology of the entire string.
  • the topology of the individual characters that make up the string be changed, but also the transition between the characters that make up the string can be changed.
  • the relationship changes the topology of the string, thereby designing a variety of topological shapes for the string, and using the glyphs of different topological structures of the string to carry the digital watermark information.
  • the minimum carrier unit carrying the watermark information is a single character
  • the minimum carrier unit carrying the watermark information is a character composed of multiple characters. string.
  • the glyph design ideas and encoding rules in these two methods are not fundamentally different. You can think of a string as a character with a complex topology.
  • the present invention recommends that a complete word (group) composed of a plurality of characters be used as a basic unit for carrying digital watermark information, and the design of the string glyph is also Design for complete words (groups).
  • Latin-based languages such as English
  • English are more suitable for carrying digital watermark information using strings, usually in handwritten and cursive glyphs.
  • art can be designed. Font style (or handwritten, cursive) glyphs to carry digital watermark information.
  • the essence of the present invention is to represent hidden information by glyphs of different topologies of semantically identical characters (strings), which requires encoding of glyphs of different topologies.
  • the basic encoding rules of the present invention are: characters of the same topology ( The string) glyphs have the same encoding, and the encoding of the character (string) glyphs of different topologies cannot be identical (ie, there are at least two different encodings).
  • glyphs of different topologies should be coded as different code values as much as possible.
  • the method includes the following steps:
  • the glyph of a character (string) and the "picture” are a many-to-one mapping relationship, that is, a glyph of one character (string) is mapped onto a "picture", and a "picture” may be mapped to a plurality of character (string) glyphs.
  • the character (string) glyphs corresponding to different "pictures” should be encoded into different codes to maximize the capacity of the character (string) to carry the watermark information.
  • the capacity of a plurality of different characters (strings) to carry information is preferably the same, and the number of codes of the desired character (string) glyph is rounded (for example, usually required A factor such as a multiple of 2 or a power exponent of 2) allows a plurality of differently constructed character (string) glyphs to have the same encoding.
  • the character (string) font carrying the digital watermark information must have at least Two different encoding states.
  • the method encodes the number of independent connected regions (ie, those connected regions that are not connected to each other) contained in the character (string) glyph.
  • the number of independent connected regions of the character (string) glyph is equal to the number of components of the "graph" corresponding to the character (string) glyph.
  • the coding rule takes into account the same reason as the "image structure-based coding method" step (2), and the character (string) glyphs of the same independent connected area number have the same coding, and the characters of different independent connected areas ( The encoding of the string) glyphs cannot be identical (ie there are at least two different encodings).
  • the encoding method encodes a combined set of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph.
  • the coding rule takes into account the same reason as the "image structure-based coding method" step (2), and the character (string) glyph corresponding to the same combination set has the same coding, and the corresponding combination of characters (string)
  • the encoding of the glyphs cannot be identical (ie there are at least two different encodings).
  • this method provides greater flexibility and more coding space.
  • the method encodes the sum of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph.
  • the coding rule takes into account the same reason as the "pattern-based coding method" step (2), and the character (string) glyphs of the same number of independent connected areas and the number of independent closed areas have the same coding, two The encoding of different character (string) glyphs cannot be identical (ie, there are at least two different encodings).
  • the method encodes the number of independent connected regions included in the character (string) glyph divided by the integer, and the character (string) glyphs with the same remainder have the same encoding, and the character (string) glyphs with different remainders have different encodings.
  • an integer in this method is flexible.
  • the method is equivalent to the number of independent connected regions. Parity is encoded.
  • the characters (strings) with independent odd-numbered connected areas have the same encoding, and the characters with independent number of connected areas (strings) have the same encoding, but the number of independent connected areas is different.
  • the encoding between character (string) glyphs is different.
  • the integer value ranges from 2 to 4, and from the perspective of ease of use, the integer value of the present invention is 2 or 4.
  • the method encodes the sum of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph divided by the remainder of the integer, and the characters (strings) having the same remainder have the same Encoding, the remainder of the different character (string) glyphs have different encodings.
  • the integer value range is preferably between 2 and 8. From the perspective of ease of use, the integer value of the present invention is 2, 4 or 8. It is to be noted that there are many variations of the above six typical coding methods of the present invention.
  • a variant is as follows: The mathematical operation result of the number of independent connected regions of the character (string) and the number of independent closed regions is used as a parameter. For example, similar to the above-mentioned "encoding method based on the number of independent connected regions", it is possible to encode the results of mathematical operations such as square, cubic, parity, and whether or not the number of independent connected regions is a prime number. It is also possible to encode the product of the number of independent connected regions and the number of independent closed regions, or the number of independent connected regions, similar to the above-mentioned "encoding method based on the sum of the number of independent connected regions and the number of independent closed regions”.
  • the number of independent closed regions is used as a parameter to perform mathematical operations such as exponential operations and logarithmic operations, and the results are encoded. Further, it is also possible to encode the array of arrangement combinations formed by the above various mathematical operation results. And so on, you can transform a lot of coding methods.
  • the character (string) glyphs are encoded by a plurality of encoding methods.
  • the character (string) glyphs having the same coded value based on the coding method are further coded by other methods, and may be multi-timed by multiple methods.
  • coding For example, firstly, the method based on the method of the number of independent connected regions is used, and then the plurality of character (string) glyphs having the same number of independent connected regions are secondarily encoded, and the method of secondary encoding can be based on "independent communication.
  • the encoding method of the combined number of regions and the number of independent closed regions may be further encoded three times, and the third encoding method may adopt a "coding method based on the graph structure".
  • many encoding methods can be transformed, which can be selected according to the self-topology of the character (string) glyph. Combining multiple encoding methods to encode character (string) glyphs can expand the capacity of characters (strings) to carry digital watermark information.
  • the method is a method of uniformly encoding a plurality of characters of a plurality of characters (strings).
  • the method adopts one of the "methods for encoding a plurality of glyphs of the same character (string)" (such as the six typical coding methods and the deformation method thereof described in the foregoing section), for a plurality of characters (strings)
  • the various glyphs are encoded using the same method, and the specific correspondence between the glyph features of the characters (strings) and the encoding is uniform between different characters (strings).
  • the encoding method of the structure has the same encoding not only for the various glyphs of the same character (string) corresponding to the isomorphic "graph", but also for the various glyphs of different characters (strings) corresponding to the isomorphic "graph”. If the "encoding method based on the number of independent connected regions" is used, all the character (string) glyphs having the same number of independent connected regions correspond to the same encoded value, regardless of whether the glyphs of these characters (strings) are the same character ( The glyph of the string), or the glyph of multiple different characters (strings).
  • the digital watermark information is embedded in a plurality of glyphs of the character (string) of the carrier file, and the encoding of the character (string) glyph is used to represent the digital watermark information.
  • the design method of the glyph adopts the method of changing the topology of the character (string) glyph of the present invention, and designs a plurality of character (string) glyphs for the same character (string).
  • the glyph coding method employs six typical methods of encoding a plurality of glyphs of the same character (string) of the present invention and a plurality of variations thereof.
  • the feature of the watermark information detecting method of the text digital watermarking technology is: It is necessary to clarify the unique glyph encoding method of each character (string) in the carrier file.
  • the detection process should generally first determine the semantic information of each character (string) in the carrier file, query each character (string) specific glyph encoding method according to the semantic information of the character (string), and then detect the corresponding glyph feature according to the specific glyph encoding method.
  • the encoding of each character (string) glyph in the carrier file is determined to detect the digital watermark information.
  • the encoding method corresponding to a character (string) is "coding method based on graph structure”
  • the "graph” structure feature corresponding to the character (string) glyph should be detected to determine the corresponding glyph encoding.
  • the encoding method corresponding to another character (string) is "the encoding method based on the number of connected regions”, and the glyph feature of the number of connected regions included in the character (string) glyph should be detected to further determine the encoding of the glyph.
  • the basis of the digital watermark information detection process is to first specify the specific glyph encoding method of each character (string) in the carrier file, and the detection process of the digital watermark information carried by one character (string) and the character ( The semantic information of a string is usually associated.
  • digital watermark information is embedded in a plurality of glyphs of carrier file characters (strings).
  • the encoding of the character (string) glyph is used to represent the digital watermark information.
  • a plurality of character (string) shapes are designed for the same character (string).
  • the main difference of this technique is that: the encoding method of the glyphs uses the "several pairs of characters" (string) of the present invention. ) A method of unified coding of multiple glyphs and its deformation method.
  • the encoding of multiple glyphs should not only consider the encoding factors of the different topological glyph features of the character (string) itself, but also the encoding of other character (string) glyphs, which should be combined with other characters (strings).
  • the coding is coordinated.
  • the feature of the watermark information detection method of the text digital watermarking technology is: Since each character (string) in the carrier file has only one common glyph coding method, the detection process does not need to know the semantic information of each character (string), and can directly The encoding characteristics of the character (string) font are detected for a common encoding method to determine the encoding of each character (string) font in the carrier file, thereby detecting the digital watermark information. For example, if the carrier file uses the "coding method based on the graph structure", the same method is used for multiple characters (strings) in the carrier file, and the "graph" structure feature corresponding to each character (string) font can be directly detected. Determine its encoding.
  • the carrier file uses "method of encoding based on the remainder of dividing the number of independent connected regions by an integer", and the integer is taken as 2, the detection method is extremely simple. It is not necessary to know the semantic information of each character (string), and directly calculate the number of independent connected regions of each character (string), the odd number is one type of code (for example, code is 1), and the even number is another code (for example) The code is 0), thereby directly determining the encoding of each character (string) glyph.
  • the digital watermark information carried by the entire carrier file is obtained.
  • the detection process of the digital watermark information carried by one character (string) is independent of the semantic information of the character (string).
  • the method for detecting digital watermark information of the present invention is only related to the topology of the character (string) font, and is independent of the size of the character (string) and the tilt angle, and is convenient for detection.
  • the scaling and rotation of the character (string) glyph does not affect the detection of the watermark information, and the anti-noise ability is strong, and the Lu Gang is good.
  • the change of the character (string) glyph of the present invention is to appropriately change the topological structure of the character (string), and the shape of the character (string) glyph may not be changed, the overall style, the visual influence caused by the watermark information is small, and the embedded Digital watermark information is not easily noticeable.
  • the character (string) glyph design method of the present invention is flexible, and after determining a specific encoding rule, a plurality of glyphs having the same encoding but different font styles can be designed for the same character (string) as needed, without changing
  • the detection method and related programs have good scalability.
  • the "text digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings) respectively) of the present invention "specially designs the glyphs of characters (strings) for the specific case of each character (string), and selects A specific encoding method, such that the character (string) carries a large amount of watermark information.
  • the text digital watermarking technique based on the uniform encoding of a plurality of characters of a plurality of characters (strings) of the present invention can directly determine the text carried by each character (string) without detecting the semantic information of each character (string).
  • the digital watermark information simplifies the detection method of the watermark information and reduces the error.
  • Fig. 1 shows, by way of example, a method of designing a plurality of glyphs of the same character by appropriately changing the topological structure of characters, and shows a method in which character glyphs correspond to "graphs" in the "Graphics" of the mathematical discipline.
  • Fig. 2 shows, by way of example, a glyph design and encoding method based on the number of connected regions and the number of closed regions included in the character glyph.
  • Figure 3 shows, by way of example, a method of designing multiple glyphs for a string.
  • Figure 4 shows, by way of example, a glyph design and coding method for each set of characters.
  • Figure 5 shows, by way of example, various forms of character (string) glyph design methods.
  • Fig. 6 shows, by way of example, the principle of loading and detecting the watermark information based on the digital watermarking technique of separately encoding a plurality of glyphs of a plurality of characters (strings).
  • Fig. 7 shows, by way of example, the principle of loading and detecting watermark information "based on a digital watermarking technique of uniformly encoding a plurality of glyphs for a plurality of characters (strings)". 5) A specific implementation method of encoding a remainder after dividing the number of independent connected regions by an integer. If this method is used, four different glyphs (400) to (403) of the character "you" in Fig. 4 are used.
  • Row coding is equivalent to encoding the parity of the number of independent connected regions included in the glyph, and assuming that the code corresponding to the number of independent connected regions is "1", and the independent connected regions are If the code corresponding to an even number of character glyphs is "0", the following results are obtained:
  • the number of independent connected regions included in the glyphs (400) and (402) is 4, and the remainder after dividing by 2 is 0, that is, the number of independent connected regions included in the glyphs (400) and (402) is even. , their encoding is "0"; glyph (401),
  • the 15 glyphs of the character "initial" shown in Fig. 2 are integrated and encoded using a plurality of encoding methods.
  • the encoding is performed using "the encoding method based on the combined set of the number of independent connected regions and the number of independent closed regions".
  • the glyphs (200) include the same glyphs (2001) and (2002). (2003); the glyphs (230) include the same glyphs (2301), (2302), and (2303).
  • the fonts of the three glyphs (2601), (2602), and (2603) included in the glyph group (260) are the same.
  • the glyphs in the glyph groups (200), (230), and (260) are secondarily encoded by the "coding method based on the graph structure". If the encoding method based on the undirected graph is adopted, the glyph group (200) contains the glyphs (2001), (2002),
  • (2003) corresponds to three different structures of "graphs”; then the glyphs (230) contain glyphs (2301), (2302), (2303) corresponding to two different structures of "graphs", where glyphs (2302), (2303) The corresponding "graph” is isomorphic; glyph group
  • the included glyphs (2601), (2602), and (2603) also correspond to "graphs" of three different structures. After two encodings, the 15 character glyphs in Figure 2 have 14 different states and can be programmed into 14 different codes.
  • This encoding method is characterized by a plurality of characters (strings) of a plurality of glyphs (note: a plurality of glyphs including the same character (string)), using the same method for multiple characters (strings)
  • a plurality of glyphs are uniformly coded, and a rule for determining a character (string) glyph code value is uniform among a plurality of characters (strings).
  • the specific encoding method can be determined based on the "several methods for encoding a plurality of glyphs of the same character (string)" of the present invention.
  • the glyphs of the character set ⁇ "you", “good", "! ⁇ are encoded, and the integer is assumed to be 2
  • the mapping rules of the glyphs and the encoded values shown in FIG. 4 satisfy the requirements of the encoding method of this item. Since the integer is 2, it is equivalent to encoding the parity of the number of independent connected regions, and it is assumed that the code corresponding to the character glyph with an odd number of independent connected regions is "1", and the number of independent connected regions is even.
  • the corresponding code is "0", as shown in Figure 4:
  • the number of independent connected areas of the glyphs (400) (402) is 4, and the number of independent connected areas of the glyphs (410) (412) is 2, glyphs (440)
  • the number of independent connected regions of (442) is also 2.
  • the number of independent connected regions included in these glyphs is divided by 2 and the remainder is 0. That is, the number of independent connected regions included in these glyphs is even, so their encoding is The same, all are "0".
  • the number of independent connected areas of the glyphs (401) (403) is 5, the number of independent connected areas of the glyphs (411) (413) is 1, and the number of independent connected areas of the glyphs (441) and (443) is also 1.
  • the number of independent connected regions included in these glyphs is divided by 2 and the remainder is 1. That is, the number of independent connected regions included in these glyphs is odd, and their codes are the same, all being "1".
  • the mapping rules between glyphs and coded values between different characters are consistent, and the mapping relationship between the remainder and the glyph code values cannot be changed between different characters. From this, it can be seen that the method of encoding based on the remainder after dividing the number of independent connected regions by an integer, the character set ⁇ "you", “good”, "! ⁇ is determined by the method of determining the glyph code value of FIG. Meet the requirements of this coding method.
  • Figure 6 shows, by way of example, the principle of watermark information loading and detection of the digital watermarking technique.
  • the flow from block diagrams (600), (601) to (610), to block diagrams (620), (630), and finally to block diagrams (621), (631) represents the flow of watermark information loading.
  • the flow represents loading the digital watermark information "0101100” (600) into the text "Hello, Mom! (601).
  • the number of digits carrying the watermark information is one digit; the character “Mom” carries the watermark information with two digits; the character ",” has no ability to carry watermark information.
  • the watermark information is segmented by the method shown in (610).
  • the watermark information corresponding to the character "you” is "0"; the watermark information corresponding to the character “good” is "1";",,,, does not correspond to any watermark information (because, as shown in Figure 4, "," does not have the ability to carry watermark information), the watermark information corresponding to the previous character “mother” is "01”; the latter character “mother” corresponds The watermark information is "10”; the character "! “The corresponding watermark information is "0".
  • the table in Fig. 4 is queried to find the glyph of each character glyph code equal to the watermark information corresponding to the character. As shown in Fig.
  • each character has a two-character code equal to the watermark information corresponding to the character, and the two glyphs belong to different font styles: Song and Lishu.
  • the glyphs of multiple characters of the same font style are combined to obtain a text string (621), (631) carrying the watermark information "0101100” (600), wherein the font style of the string (621) is a librarian, a character The font style of the string (631) is Song. Due to the uniform font style between the characters in the strings (621) and (631), the visual impact on the digital watermark information "0101100" (600) is small.
  • Figure 8 shows the process of digital watermark information detection by this technique.
  • the carrier file (800) with digital watermark information identifies the original carrier electronic file (820) by the character semantic recognition system (810).
  • the character glyph recognition system (830) performs character (string) glyph coding recognition on the carrier file (800) with digital watermark information, and recognizes the coding of each character (string) glyph, and then combines the carrier files.
  • the encoding of the character (string) yields digital watermark information (840).
  • the character glyph recognition system (830) needs to use the recognition result of the character semantic recognition system (810) to perform character (string) glyph coding recognition because in the present technology, the encoding method of each character (string) in the carrier file can be Differently, the watermark information detection process first needs to clarify the specific glyph coding method of each character (string) in the carrier file. Therefore, it is necessary to perform semantic recognition on each character (string) in the carrier file, find the coding table shown in FIG. 4 by the semantic information of the character (string), and determine a specific glyph coding method for each character (string), thereby detecting each The glyph feature corresponding to the encoding method further determines the encoding of each character (string) glyph.
  • the original carrier electronic file can be directly used as a template for the detection of watermark information.
  • the original carrier electronic file template (850) provides the semantic information of the characters (strings) in the carrier file, and the character font recognition system (830) can perform character (string) feature and code recognition.
  • the watermark information detection process at this time is a Non-blind watermark detection process.
  • the detection system needs to know the specific glyph coding mode of each character constituting the character string (621) or (631). Therefore, the detection system should first identify the semantic information of each character (including manual recognition), or directly obtain the original carrier electronic file, use the original carrier electronic file as a template to provide the semantic information of the character, and then obtain each character through the semantic information of the character. Specific coding method.
  • the characters "you”, “good”, “! are based on “encoding method based on the number of independent connected areas", and the character “mother” is based on “number of independent connected areas and number of independent closed areas”
  • the encoding method of the combined set ", character”, “does not carry watermark information.
  • the detecting system detects the glyph features corresponding to the encoding method according to the specific glyph encoding method of each character. For example, the glyph features of the number of independent connected regions corresponding to the glyphs of the characters "you", “good”, and “! should be detected.
  • the glyph feature of the combined set of the number of independent connected regions corresponding to the glyph of the character "mother” and the number of independent closed regions is detected.
  • the result of detecting the character glyph feature is to obtain the corresponding character glyph encoding, and the character string (621).
  • the correspondence between each character glyph and the encoding in (631) is as shown in the block diagrams (620) and (630).
  • the encoding of each character glyph is combined to obtain the digital watermark information carried by the character strings (621) and (631) as "0101100" (600).
  • the method for detecting glyph features corresponding to the coding method of the present invention is a prior art mature technology.
  • the isomorphism judgment of the "graph" of the character (string) glyph mapping, the calculation of the number of independent connected regions included in the character (string) glyph, and the number of independent closed regions can be completed by using existing mature techniques.
  • Techniques are not included in the scope of the invention.
  • a specific implementation of a textual digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings) - Figure 7 shows, by way of example, the principle of loading and detecting watermark information of the present digital watermarking technique.
  • FIG. 7 the flow from block diagrams (700), (701) to (710), to block diagrams (720), (730), and finally to block diagrams (721), (731) represents the flow of watermark information loading.
  • This flow means loading the digital watermark information "010" (700) into the text "Hello I" (701).
  • Fig. 7 the flow from block diagrams (700), (701) to (710), to block diagrams (720), (730), and finally to block diagrams (721), (731) represents the flow of watermark information loading.
  • This flow means loading the digital watermark information "010" (700) into the text "Hello I" (701).
  • each character order corresponds to one bit of information in the watermark information "010" (700). Then query the table in Figure 4 to find the glyph of each character glyph code equal to the watermark information corresponding to the character.
  • the encoding of (400) and (402) is 0, corresponding to the watermark letter.
  • glyphs of multiple characters of the same font style are combined to obtain a text string (721), (731) carrying the watermark information "010" (700), wherein the font style of the string (721) is a librarian, a character The font style of string (731) is Song. Due to the uniform font style between the characters in the strings (721) and (731), the visual effect of loading the digital watermark information "010" (700) is small.
  • the sentence (350) carries the digital watermark information as follows: The sum of the number of independent connected areas (3501) and the number of independent closed areas is 4, (3502) is 13, (3503) is 11, (3504) is 2, (3505) is 4, and (3506) is 2. (3507) is 4, (3508) is 7.
  • Figure 9 shows the process of digital watermark information detection by this technique.
  • the character glyph recognition system (910) directly performs character (string) glyph code recognition on the carrier file (900) with digital watermark information. Since each character (string) in the carrier file (900) with digital watermark information has a common glyph encoding method, the glyph features of each character (string) corresponding to the encoding method can be directly detected, and each character (string) is further determined.
  • the encoding of the glyphs, the encoding of each character (string) in the carrier file results in digital watermark information (920).
  • the whole watermark detection process is a blind watermark detection process, which does not require obtaining a template of the original carrier electronic file, or performing character (string) semantic recognition.
  • the character font recognition system (910) performs character glyph recognition in accordance with a unified glyph encoding method.
  • the characters "you", “good”, and “! in the example use the "method based on the number of independent connected regions divided by 2" (equivalent to "based on the number of independent connected regions”
  • the method of encoding the parity is "), so the character glyph recognition system (910) can directly judge the string (721) or

Abstract

A hidden data communication method and the application thereof in text digital watermark technology are disclosed. The invention belongs to the field of communication and information engineering, particularly, to technologies of the data hiding, data encoding and decoding, and digital watermark. In the invention, the hidden information is carried by using the shapes of characters or character strings which have different topological structures. The invention is characterized in easy detecting, strong anti-noise capability and good robustness. According to the invention, the visual influence caused by the watermark information is imperceptible, and the embedded digital watermark information has the advantages of being imperceptible, extendable, and having a capability of carrying large quantity of information.

Description

一种隐藏数据通信方法及其在文本数字水印技术中的应用 所属技术领域  Hidden data communication method and application thereof in text digital watermarking technology
本发明属于通信与信息工程领域, 具体涉及到数据的隐藏、 数据的编码与解码、 数字水 印技术。  The invention belongs to the field of communication and information engineering, and specifically relates to data hiding, data encoding and decoding, and digital water printing technology.
背景技术 Background technique
数字水印技术是信息隐藏技术领域的一个重要组成部分, 它将具有特定意义的信息 (数 字水印信息), 利用数字嵌入方法隐藏在各种数字图像、 声音、 视频、 文本数字产品中。 这 些带有数字水印信息的电子产品一方面可以不易被感知地正常使用, 另一方面, 通过特定的 技术手段可以检测出嵌入在这些数字产品中的数字水印信息。数字水印技术广泛应用在数字 产品的版权保护、 内容验证与防伪、 防止非法拷贝、 操作跟踪、 秘密数据通信等众多流域。 按照数字水印载体的不同, 数字水印可分为图像数字水印、 声音数字水印、 视频数字水印和 文本数字水印等主要的几个种类。本发明涉及的主要是文本数字水印领域, 其特点是水印信 息隐藏在由字符为组成元素的文件中, 主要的现有文本数字水印技术如下:  Digital watermarking technology is an important part of information hiding technology. It uses specific information (digital watermark information) to be hidden in various digital image, sound, video and text digital products by digital embedding. On the one hand, these electronic products with digital watermark information can be used not easily and perceptibly, and on the other hand, digital watermark information embedded in these digital products can be detected by specific technical means. Digital watermarking technology is widely used in many watersheds such as copyright protection of digital products, content verification and anti-counterfeiting, prevention of illegal copying, operation tracking, and secret data communication. According to different digital watermark carriers, digital watermarks can be divided into image digital watermarks, sound digital watermarks, video digital watermarks, and text digital watermarks. The invention relates mainly to the field of text digital watermarking, characterized in that watermark information is hidden in a file composed of characters as elements, and the main existing text digital watermarking techniques are as follows:
( 1 )利用文本文件的格式信息来保存数字水印信息。通常利用文本的字间距、行间距进 行编码来嵌入水印信息。 这种思路的缺陷在于: 对于利用字间距、 行间距的编码方法, 以拉 丁字母为基础的语言体系(如英语)有一定的优势, 但对于类似中文这样以方块文字为基础 的语言, 由于不存在英文意义下的字间距, 水印的嵌入比较困难。 同时, 对利用字间距编码 的水印信息进行检测的误差较大, 而利用行间距编码的水印技术携带的水印信息较少。  (1) Using the format information of the text file to save the digital watermark information. Watermark information is usually embedded by encoding the word spacing and line spacing of the text. The shortcoming of this kind of thinking is: For the coding method using word spacing and line spacing, the Latin alphabet-based language system (such as English) has certain advantages, but for languages like Chinese, which are based on block text, because There is a word spacing in the English sense, and the embedding of the watermark is difficult. At the same time, the error of detecting the watermark information encoded by the word spacing is large, and the watermarking technique using the line spacing coding carries less watermark information.
(2)以标点信息编码、 字符的字体编码来携带水印信息。其缺点在于: 由于文本文件中 标点符号相对较少, 所以利用标点信息编码携带信息较少。利用字符的字体编码的文本数字 水印技术的主要问题在于检测以印刷品文件为载体文件的水印信息很困难,文中没有提及在 这种情况下的检测方法。  (2) Carrying watermark information by punctuation information encoding and character font encoding. The disadvantages are: Since the punctuation marks in the text file are relatively small, the information carried by the punctuation information is less. Text Digital Coding Using Character Fonts The main problem with watermarking techniques is that it is difficult to detect watermark information using printed documents as a carrier file. The detection method in this case is not mentioned herein.
( 3)利用字符的特征编码来保存数字水印信息。主要涉及了改变部分字符笔划的长度或 整个字符的高度来嵌入水印信息。该项技术的主要问题同样是对以印刷品文件为载体文件的 水印信息的检测很困难, 同时会带来较大的视觉影响。  (3) Using the feature encoding of the character to save the digital watermark information. It mainly involves changing the length of a part of the character stroke or the height of the entire character to embed the watermark information. The main problem of this technology is also that it is difficult to detect the watermark information using the printed documents as the carrier file, and it will bring a large visual impact.
(4) 文本文件转换为图像文件, 按照图像数字水印技术提供的方法进行水印信息加载。 该方法的缺点是不能用大多数的文字处理软件进行带有水印信息的图像电子文件的显示和 处理。  (4) The text file is converted into an image file, and the watermark information is loaded according to the method provided by the image digital watermarking technology. A disadvantage of this method is that the display and processing of image electronic files with watermark information cannot be performed with most word processing software.
( 5)通过对文本中的特定词组进行同义词替代, 对同义的不同词汇进行编码, 用于加载 水印信息。这种技术的缺点是难以为所有的词汇找到恰当的同义词, 造成文本可嵌入水印信 息的容量相当有限, 毕竟不是每一个词汇都有与之对应的同义词。  (5) Encoding different words of the same meaning by synonymous substitution of specific phrases in the text for loading watermark information. The disadvantage of this technique is that it is difficult to find the proper synonym for all vocabulary, and the capacity of the text to embed the watermark information is rather limited. After all, not every vocabulary has a synonym corresponding to it.
发明内容 Summary of the invention
本发明的目的是提供一种利用字符 (串) 的拓扑结构携带隐藏水印信息的文本数字水 印技术,用于解决现有文本数字水印技术中出现的诸如水印信息给用户带来的视觉上的影响 较大、 载体文件携带水印信息的容量小、 印刷品数字水印信息检测困难等问题。 本发明的技 术对于数字水印信息的载体文件为电子文件、 印刷品文件的情况均适合。  The object of the present invention is to provide a text digital watermarking technology for carrying hidden watermark information by using a topology of characters (strings), which is used for solving the visual influence of watermark information, such as watermark information, existing in existing text digital watermarking technology. Larger, the carrier file carries the capacity of the watermark information is small, and the digital watermark information of the printed matter is difficult to detect. The technique of the present invention is suitable for the case where the carrier file of the digital watermark information is an electronic file or a printed document.
本文所述的 "字符 (串) "是 "字符或字符串" 的简写形式。  The "character (string)" described in this article is shorthand for "character or string".
本发明的基本原理在于通过适当改变字符 (串) 的拓扑结构, 设计出语义上相同的字 符 (串) 的多种字形, 并对基于字符 (串)字形拓扑结构的字形特征进行恰当的编码, 利用 字符 (串) 字形的编码来嵌入数字水印信息, 从而构成一种新的文本数字水印技术。  The basic principle of the present invention is to design a plurality of glyphs of semantically identical characters (strings) by appropriately changing the topological structure of characters (strings), and to properly encode glyph features based on the character (string) glyph topology. The digital watermark information is embedded by the encoding of the character (string) glyph, thereby forming a new text digital watermarking technology.
本发明包括如下紧密相关的内容: 确 认 本 , ( 1 ) 用于携带数字水印信息的将同一字符 (串) 设计成多种字形的方法;The present invention includes the following closely related contents: Confirmation, (1) A method for designing the same character (string) into a plurality of glyphs for carrying digital watermark information;
(2 ) 若干种对同一字符 (串) 的多种字形进行编码的方法; (2) Several methods of encoding multiple glyphs of the same character (string);
( 3 ) 若干种对多个字符 (串) 的多种字形进行统一编码的方法;  (3) a number of methods for uniformly encoding multiple glyphs of multiple characters (strings);
(4) 基于对多个字符 (串) 的多种字形分别编码的文本数字水印技术;  (4) A textual digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings);
( 5) 基于对多个字符 (串) 的多种字形统一编码的文本数字水印技术;  (5) A textual digital watermarking technique based on uniform coding of multiple glyphs for multiple characters (strings);
用于携带数字水印信息的将同一字符 (串) 设计成多种字形的方法: A method for designing the same character (string) into a plurality of glyphs for carrying digital watermark information:
主要的设计思想是通过适当改变字符 (串) 的拓扑结构, 从而设计出语义上相同的同 一字符(串) 的多种字符(串)外形。 其中, 较为自然的字形设计方法是: 通过改变组成字 符(串) 的各笔划之间的连断关系来改变字符(串) 的拓扑结构。 但不仅限于此, 任何改变 字符(串)拓扑结构的方法都可行, 只要这种改变对于人的视觉识别能力来说, 不至于引起 字符 (串)本身语义上的混淆。  The main design idea is to design a variety of character (string) shapes of the same character (string) that are semantically identical by appropriately changing the topology of the characters (strings). Among them, the more natural glyph design method is: change the topology of the character (string) by changing the concatenation relationship between the strokes that make up the character (string). However, it is not limited to this, any method of changing the character (string) topology is feasible, as long as the change does not cause the semantic confusion of the character (string) itself for the human visual recognition ability.
需要特别说明的是, 对字符字形的设计方法可以类似地推广到字符串字形的设计中。 这种情况下, 将整个字符串看作一个整体来改变整个字符串的拓扑结构, 不但可以改变组成 字符串的单个字符的拓扑结构,还可以通过改变组成字符串的各字符之间的连断关系来改变 字符串的拓扑结构, 从而为字符串设计出多种拓扑形状, 并使用字符串不同拓扑结构的字形 来携带数字水印信息。在利用字符携带数字水印信息的方法中, 携带水印信息的最小载体单 位为单个的字符, 而在利用字符串携带数字水印信息的方法中, 携带水印信息的最小载体单 位为多个字符组成的字符串。这两种方法中的字形设计思想及编码规则没有本质的不同, 可 以将字符串看成一个有复杂拓扑结构的字符。  It should be specially noted that the design method of character glyphs can be similarly extended to the design of string glyphs. In this case, the entire string is treated as a whole to change the topology of the entire string. Not only can the topology of the individual characters that make up the string be changed, but also the transition between the characters that make up the string can be changed. The relationship changes the topology of the string, thereby designing a variety of topological shapes for the string, and using the glyphs of different topological structures of the string to carry the digital watermark information. In the method of carrying digital watermark information by using characters, the minimum carrier unit carrying the watermark information is a single character, and in the method of carrying the digital watermark information by using the character string, the minimum carrier unit carrying the watermark information is a character composed of multiple characters. string. The glyph design ideas and encoding rules in these two methods are not fundamentally different. You can think of a string as a character with a complex topology.
在实际的文本数字水印系统中, 如果希望以字符串为单位携带数字水印信息, 本发明 推荐将多个字符组成的完整单词(组)作为携带数字水印信息的基本单位, 字符串字形的设 计也针对完整的单词 (组)进行设计。 以拉丁字母为基础的语言(如英语)较为适合利用字 符串携带数字水印信息, 通常以手写体、 草书体的字形来实现; 同时, 对于汉语、 韩语等以 方块字为基础的语言, 可以设计有美术字体风格(或手写体、 草书体) 的字形来携带数字水 印信息。  In an actual text digital watermarking system, if it is desired to carry digital watermark information in units of character strings, the present invention recommends that a complete word (group) composed of a plurality of characters be used as a basic unit for carrying digital watermark information, and the design of the string glyph is also Design for complete words (groups). Latin-based languages (such as English) are more suitable for carrying digital watermark information using strings, usually in handwritten and cursive glyphs. At the same time, for Chinese, Korean and other block-based languages, art can be designed. Font style (or handwritten, cursive) glyphs to carry digital watermark information.
若干种对同一字符 (串) 的多种字形进行编码的方法: Several ways to encode multiple glyphs of the same character (string):
本发明的本质在于通过语义上相同的字符 (串) 的不同拓扑结构的字形来表示隐藏信 息, 这需要对不同拓扑结构的字形进行编码, 本发明的基本编码规则是: 相同拓扑结构的字 符(串)字形有相同的编码, 不同拓扑结构的字符(串)字形的编码不能完全相同 (即至少 有两个不同的编码)。 通常, 应尽可能的将不同拓扑结构的字形编为不同的码值。  The essence of the present invention is to represent hidden information by glyphs of different topologies of semantically identical characters (strings), which requires encoding of glyphs of different topologies. The basic encoding rules of the present invention are: characters of the same topology ( The string) glyphs have the same encoding, and the encoding of the character (string) glyphs of different topologies cannot be identical (ie, there are at least two different encodings). In general, glyphs of different topologies should be coded as different code values as much as possible.
以下是遵守上述规则的 6种具体的编码方法及其若干种的变形编码方法。  The following are six specific encoding methods that comply with the above rules and several variant encoding methods.
1 ) 基于 "图"结构的编码方法  1) Encoding method based on "graph" structure
本方法包括以下步骤:  The method includes the following steps:
( 1 ) '按照一定的规则将字符 (串) 字形映射为数学学科 "图论" 中定义的 "图 "。 一种具体的规则是将字符笔划的顶点、 交叉点、 拐点等特征点映射为数学学科"图论" 中定义的 "图" 的节点 (端点), 而连接这些特征点 (顶点、 交叉点、 拐点等) 之间的笔划 映射为 "图" 的边。 这样, 可以将字符 (串) 的字形映射为 "图论" 中定义的无向 "图"。 字符(串)的字形与 "图"是多对一的映射关系, 即一个字符(串)的字形映射到一个 "图" 上, 而一个 "图"可能映射为多个字符(串)字形。 还可在无向图的基础上, 加入特定的空 间顺序规则 (例如从左到右, 从上到下等), 将字符 (串) 字形映射为有向图。  (1) 'The character (string) glyphs are mapped to the "graphs" defined in the "Graphics" of the mathematics according to certain rules. A specific rule is to map feature points such as vertices, intersections, and inflection points of character strokes to nodes (endpoints) of "graphs" defined in the mathematical theory "graph theory", and connect these feature points (vertices, intersections, The stroke between the inflection point, etc. is mapped to the side of the "graph". In this way, the glyphs of characters (strings) can be mapped to the undirected "graphs" defined in "Graphics". The glyph of a character (string) and the "picture" are a many-to-one mapping relationship, that is, a glyph of one character (string) is mapped onto a "picture", and a "picture" may be mapped to a plurality of character (string) glyphs. You can also map character (string) glyphs to directed graphs by adding specific spatial order rules (for example, from left to right, top to bottom, etc.) on the basis of undirected graphs.
(2 ) 同构的 "图"对应的字符(串)字形有相同的编码, 不同构的 "图"对应的字符 (串) 字形的编码不能完全相同 (即至少要有两个不同的编码)。 在步骤(1 )所得的同一个字符(串) 的不同字形对应的多个 "图"中, 有可能出现其 中一些 "图"是同构的 (按照 "图论"中对同构的定义), 编码时应将同构的 "图"对应的字 符(串)字形编码为相同的码。 通常, 应将不同构的 "图"对应的字符(串)字形编码为不 同的码, 以尽量提高字符(串)携带水印信息的容量。但是, 考虑到在实际的文本数字水印 系统中, 多个不同的字符(串)携带信息的容量最好相同, 以及希望字符(串)字形的编码 个数是圆整的(例如, 通常要求是 2的倍数或以 2为底的幂指数)等因素, 允许多个不同构 的 "图"对应的字符(串)字形有相同的编码。 为了保证字符(串)至少有一位二进制的水 印信息携带容量, 则至少要有两个不同构的 "图"对应的字形有不同的编码, 即携带数字水 印信息的字符 (串)字形至少要有两个不同的编码状态。 (2) The character (string) glyphs corresponding to the isomorphic "graph" have the same encoding, and the encoding of the character (string) glyphs corresponding to different "pictures" cannot be identical (that is, at least two different encodings are required) . In the plurality of "graphs" corresponding to different glyphs of the same character (string) obtained in step (1), it is possible that some of the "graphs" are isomorphic (according to the definition of isomorphism in "graph theory") When encoding, the character (string) glyph corresponding to the isomorphic "graph" should be encoded into the same code. In general, the character (string) glyphs corresponding to different "pictures" should be encoded into different codes to maximize the capacity of the character (string) to carry the watermark information. However, considering that in an actual text digital watermarking system, the capacity of a plurality of different characters (strings) to carry information is preferably the same, and the number of codes of the desired character (string) glyph is rounded (for example, usually required A factor such as a multiple of 2 or a power exponent of 2) allows a plurality of differently constructed character (string) glyphs to have the same encoding. In order to ensure that the character (string) has at least one binary watermark information carrying capacity, at least two different fonts corresponding to the "picture" have different encodings, that is, the character (string) font carrying the digital watermark information must have at least Two different encoding states.
2)基于独立连通区域个数的编码方法  2) Encoding method based on the number of independent connected regions
本方法针对字符(串)字形包含的独立连通区域(即相互之间不连通的那些连通区域) 的个数进行编码。 字符(串)字形的独立连通区域个数等于该字符(串)字形对应的 "图" 的分量个数。  The method encodes the number of independent connected regions (ie, those connected regions that are not connected to each other) contained in the character (string) glyph. The number of independent connected regions of the character (string) glyph is equal to the number of components of the "graph" corresponding to the character (string) glyph.
编码的规则考虑到与 "基于图结构的编码方法"步骤(2)相同的原因, 仍然是相同独 立连通区域个数的字符(串)字形有相同的编码, 不同独立连通区域个数的字符(串)字形 的编码不能完全相同 (即至少有两个不同的编码)。  The coding rule takes into account the same reason as the "image structure-based coding method" step (2), and the character (string) glyphs of the same independent connected area number have the same coding, and the characters of different independent connected areas ( The encoding of the string) glyphs cannot be identical (ie there are at least two different encodings).
"基于图结构的编码方法"需要对不同 "图"之间的同构性进行判断, 而 "图"的同 构判断算法在数学理论上计算复杂性较高(为 NP问题)。 虽然字符(串)字形对应的 "图" 通常不会太复杂, 直接利用现有 "图"的同构判断算法进行处理是可行的, 但处理过程仍然 相对复杂。 本编码方法是对 "基于图结构编码方法"的简化方法。  "Coding method based on graph structure" needs to judge the isomorphism between different "graphs", and the isomorphism judgment algorithm of "graph" has higher computational complexity in mathematics theory (for NP problem). Although the "graph" corresponding to the character (string) glyph is usually not too complicated, it is feasible to directly use the isomorphic judgment algorithm of the existing "graph", but the process is still relatively complicated. This coding method is a simplified method of "pattern-based coding method".
3)基于独立连通区域个数与独立封闭区域个数的组合集合的编码方法  3) Coding method based on a combined set of the number of independent connected regions and the number of independent closed regions
一些特定的字符(串), 特别是汉语、 韩语等语言的字符(串), 存在一个或多个由笔 划围成的封闭区域。 此外, 采用本发明的字符(串)字形设计方法, 也可为一些特定的字符 (串)设计出由字符(串)字形笔划围成的封闭区域。 本编码方法针对字符(串)字形包含 的独立连通区域个数与字符(串)字形包含的独立封闭区域个数两者的组合集合进行编码。  Some specific characters (strings), especially characters (strings) in languages such as Chinese and Korean, have one or more enclosed areas enclosed by strokes. Further, with the character (string) font design method of the present invention, it is also possible to design a closed area surrounded by character (string) font strokes for some specific characters (strings). The encoding method encodes a combined set of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph.
编码的规则考虑到与 "基于图结构的编码方法"步骤(2)相同的原因, 仍然是相同的 组合集合对应的字符(串)字形有相同的编码, 不同的组合集合对应的字符(串)字形的编 码不能完全相同 (即至少有两个不同的编码)。  The coding rule takes into account the same reason as the "image structure-based coding method" step (2), and the character (string) glyph corresponding to the same combination set has the same coding, and the corresponding combination of characters (string) The encoding of the glyphs cannot be identical (ie there are at least two different encodings).
与单独 "基于独立连通区域个数的编码方法"相比, 本方法提供了更大的灵活性和更 多的编码空间。  Compared with the separate "encoding method based on the number of independent connected regions", this method provides greater flexibility and more coding space.
4)基于独立连通区域个数与独立封闭区域个数的和的编码方法  4) Encoding method based on the sum of the number of independent connected regions and the number of independent closed regions
本方法针对字符 (串)字形包含的独立连通区域个数与字符(串)字形包含的独立封 闭区域个数之和进行编码。  The method encodes the sum of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph.
编码的规则考虑到与 "基于图结构的编码方法"步骤(2)相同的原因, 仍然是独立连 通区域个数与独立封闭区域个数之和相同的字符(串)字形有相同的编码, 两者之和不同的 字符(串)字形的编码不能完全相同 (即至少有两个不同的编码)。  The coding rule takes into account the same reason as the "pattern-based coding method" step (2), and the character (string) glyphs of the same number of independent connected areas and the number of independent closed areas have the same coding, two The encoding of different character (string) glyphs cannot be identical (ie, there are at least two different encodings).
与 "基于独立连通区域个数与独立封闭区域个数的组合集合的编码方法"相比, 本方 法更简单。  Compared with the "encoding method based on the combined set of the number of independent connected areas and the number of independent closed areas", the method is simpler.
5) 基于对独立连通区域个数除以整数后的余数进行编码的方法  5) Method for encoding based on the remainder of dividing the number of independent connected regions by an integer
本方法针对字符(串)字形包含的独立连通区域个数除以整数后的余数进行编码, 余 数相同的字符 (串)字形有相同的编码, 余数不同的字符(串)字形有不同的编码。  The method encodes the number of independent connected regions included in the character (string) glyph divided by the integer, and the character (string) glyphs with the same remainder have the same encoding, and the character (string) glyphs with different remainders have different encodings.
本方法中整数的取值是灵活的, 当整数取 2时, 本方法等价于对独立连通区域个数的 奇偶性进行编码。 独立连通区域个数为奇数的字符(串)字形之间有相同的编码, 独立连通 区域个数为偶数的字符(串)字形之间也有相同的编码, 但独立连通区域个数奇偶性不同的 字符(串)字形之间的编码是不同的。本方法中, 整数的取值范围在 2~4之间较为合适, 从 简单易用的角度, 本发明推荐整数的取值为 2或 4。 The value of an integer in this method is flexible. When an integer is 2, the method is equivalent to the number of independent connected regions. Parity is encoded. The characters (strings) with independent odd-numbered connected areas have the same encoding, and the characters with independent number of connected areas (strings) have the same encoding, but the number of independent connected areas is different. The encoding between character (string) glyphs is different. In this method, the integer value ranges from 2 to 4, and from the perspective of ease of use, the integer value of the present invention is 2 or 4.
6) 基于对独立连通区域个数与独立封闭区域个数之和除以整数后的余数进行编码的 方法  6) A method of encoding based on the remainder of the number of independent connected regions and the number of independent closed regions divided by an integer
本方法针对字符 (串)字形包含的独立连通区域个数与字符 (串)字形包含的独立封 闭区域个数之和除以整数后的余数进行编码, 余数相同的字符(串)字形有相同的编码, 余 数不同的字符(串)字形有不同的编码。  The method encodes the sum of the number of independent connected regions included in the character (string) glyph and the number of independent closed regions included in the character (string) glyph divided by the remainder of the integer, and the characters (strings) having the same remainder have the same Encoding, the remainder of the different character (string) glyphs have different encodings.
与前述仅针对独立连通区域个数除以整数后的余数进行编码的方法相比, 本方法提供 了更大的灵活性, 但本质上是相似的。 本方法中, 整数的取值范围在 2~8之间较为合适, 从 简单易用的角度, 本发明推荐整数的取值为 2、 4或 8。 需要特别指出的是, 本发明的上述 6种典型编码方法有很多种变形。  This method provides greater flexibility, but is essentially similar, compared to the foregoing method of encoding only the remainder after dividing the number of independent connected regions by an integer. In this method, the integer value range is preferably between 2 and 8. From the perspective of ease of use, the integer value of the present invention is 2, 4 or 8. It is to be noted that there are many variations of the above six typical coding methods of the present invention.
一种变形的方式为: 针对将字符(串)字形独立连通区域个数、 独立封闭区域个数作 为参数的数学运算结果进行编码。例如, 类似前述的 "基于独立连通区域个数的编码方法", 可以对独立连通区域的个数做平方、三次方、求奇偶性、判断是否为素数等数学运算的结果 进行编码。也可类似前述的 "基于独立连通区域个数与独立封闭区域个数的和的编码方法", 对独立连通区域个数与独立封闭区域个数的乘积进行编码,或者将独立连通区域个数与独立 封闭区域个数作为参数进行指数运算、对数运算等数学运算, 对其结果进行编码。此外, 还 可以对上述各种数学运算结果形成的排列组合集合进行编码。依此类推, 可以变形出很多编 码方法。  A variant is as follows: The mathematical operation result of the number of independent connected regions of the character (string) and the number of independent closed regions is used as a parameter. For example, similar to the above-mentioned "encoding method based on the number of independent connected regions", it is possible to encode the results of mathematical operations such as square, cubic, parity, and whether or not the number of independent connected regions is a prime number. It is also possible to encode the product of the number of independent connected regions and the number of independent closed regions, or the number of independent connected regions, similar to the above-mentioned "encoding method based on the sum of the number of independent connected regions and the number of independent closed regions". The number of independent closed regions is used as a parameter to perform mathematical operations such as exponential operations and logarithmic operations, and the results are encoded. Further, it is also possible to encode the array of arrangement combinations formed by the above various mathematical operation results. And so on, you can transform a lot of coding methods.
另一种变形的方式为: 综合应用多个编码方法对字符 (串)字形进行编码。 在利用一 种方法进行编码的基础上, 对基于该编码方法的有相同编码值的字符(串)字形再利用其他 方法进行二次编码, 并可依此类推, 综合利用多个方法进行多次编码。 例如, 先采用 "基于 独立连通区域个数的方法"进行编码, 再对有相同独立连通区域个数的多个字符(串)字形 进行二次编码, 二次编码的方式可采用 "基于独立连通区域个数与独立封闭区域个数的组合 集合的编码方法"。 还可进一步针对独立连通区域个数与独立封闭区域个数的组合集合相同 的字符(串)字形进行三次编码, 三次编码的方式可采用 "基于图结构的编码方法"。 依此 类推, 可以变形出很多编码方法, 可根据字符(串)字形的自身拓扑结构进行选择。 综合多 个编码方法对字符(串)字形进行编码可以扩大字符(串)携带数字水印信息的容量。  Another variant is that the character (string) glyphs are encoded by a plurality of encoding methods. On the basis of coding by a method, the character (string) glyphs having the same coded value based on the coding method are further coded by other methods, and may be multi-timed by multiple methods. coding. For example, firstly, the method based on the method of the number of independent connected regions is used, and then the plurality of character (string) glyphs having the same number of independent connected regions are secondarily encoded, and the method of secondary encoding can be based on "independent communication. The encoding method of the combined number of regions and the number of independent closed regions." Further, the character (string) glyph of the same number of independent connected regions and the combined set of independent closed regions may be further encoded three times, and the third encoding method may adopt a "coding method based on the graph structure". By analogy, many encoding methods can be transformed, which can be selected according to the self-topology of the character (string) glyph. Combining multiple encoding methods to encode character (string) glyphs can expand the capacity of characters (strings) to carry digital watermark information.
虽然上述变形方法从形式上看起来与本发明的 6种典型编码方法有所不同, 实质上是 这些典型编码方法的简单延伸。  Although the above described deformation method is similar in form to the six typical coding methods of the present invention, it is essentially a simple extension of these typical coding methods.
若干种对多个字符(串) 的多种字形进行统一编码的方法: Several methods for uniformly encoding multiple glyphs of multiple characters (strings):
在多个字符(串) 的多种字形(注: 包括同一字符(串) 的多种字形)组成的集合范 围内,利用本发明的 "若干种对同一字符(串)的多种字形进行编码的方法 "中的一种方法, 对多个字符(串)的多种字形进行统一编码, 并且字符(串)字形编码值的确定规则在多个 字符(串)之间是统一的。  Encoding a plurality of glyphs of the same character (string) of the present invention within a set of a plurality of glyphs of a plurality of characters (strings) (note: a plurality of glyphs including the same character (string)) One of the methods "uniformly encodes a plurality of glyphs of a plurality of characters (strings), and the rule for determining the character (string) glyph code value is uniform among a plurality of characters (strings).
在实际的数字水印系统中, 通常需要由载体文件的多个字符(串)共同携带数字水印, 而本方法是对多个字符(串)的多种字形进行统一编码的方法。 本方法采用本发明 "若干种 对同一字符(串) 的多种字形进行编码的方法"(如前节所述 6种典型编码方法及其变形方 法) 中的一种, 对多个字符(串) 的多种字形使用相同的方法进行编码, 而且字符(串) 的 字形特征与编码的具体对应关系在不同字符(串)之间是统一的。 例如, 如果采用 "基于图 结构的编码方法", 不仅对应同构 "图" 的同一字符 (串) 的多种字形有相同编码, 而且对 应同构 "图"的不同字符(串) 的多种字形也有相同编码。 再例如, 如果采用 "基于独立连 通区域个数的编码方法", 则有相同独立连通区域个数的所有字符 (串) 字形都对应相同的 编码值, 不论这些字符 (串) 的字形是同一个字符 (串) 的字形, 还是多个不同字符 (串) 的字形。 In an actual digital watermarking system, it is usually required to carry a digital watermark by a plurality of characters (strings) of a carrier file, and the method is a method of uniformly encoding a plurality of characters of a plurality of characters (strings). The method adopts one of the "methods for encoding a plurality of glyphs of the same character (string)" (such as the six typical coding methods and the deformation method thereof described in the foregoing section), for a plurality of characters (strings) The various glyphs are encoded using the same method, and the specific correspondence between the glyph features of the characters (strings) and the encoding is uniform between different characters (strings). For example, if using "based on graphs The encoding method of the structure has the same encoding not only for the various glyphs of the same character (string) corresponding to the isomorphic "graph", but also for the various glyphs of different characters (strings) corresponding to the isomorphic "graph". If the "encoding method based on the number of independent connected regions" is used, all the character (string) glyphs having the same number of independent connected regions correspond to the same encoded value, regardless of whether the glyphs of these characters (strings) are the same character ( The glyph of the string), or the glyph of multiple different characters (strings).
基于对多个字符 (串) 的多种字形分别编码的文本数字水印技术: A textual digital watermarking technique based on a plurality of glyphs for multiple characters (strings):
在本项文本数字水印技术中, 数字水印信息嵌入到载体文件字符(串) 的多种字形中, 字符(串)字形的编码用来表示数字水印信息。字形的设计方法采用本发明的改变字符(串) 字形的拓扑结构的方法, 为同一字符(串)设计出多种字符(串)字形。 字形的编码方法采 用本发明的 6种典型的对同一字符 (串) 的多种字形进行编码的方法及其若干种变形方法。  In the text digital watermarking technique of this item, the digital watermark information is embedded in a plurality of glyphs of the character (string) of the carrier file, and the encoding of the character (string) glyph is used to represent the digital watermark information. The design method of the glyph adopts the method of changing the topology of the character (string) glyph of the present invention, and designs a plurality of character (string) glyphs for the same character (string). The glyph coding method employs six typical methods of encoding a plurality of glyphs of the same character (string) of the present invention and a plurality of variations thereof.
本项文本数字水印技术的一个重要特点是针对载体文件包含的多个不同字符 (串) 的 字形编码方法可以不同, 可以根据字符(串) 自身的特点为某个字符(串)选择专门的字形 编码方法。 通常来说, 字符(串)笔划的繁简程度和拓扑结构有其自身的特点, 在维持一定 的视觉感官质量的前提下, 不同字符(串)可以设计出的不同拓扑结构字形的数量是有差异 的。 实质上各字符(串)通过字形的变化携带数字水印信息的能力是有差异的, 对不同字符 (串)使用不同的字形编码方法可以充分反映出这种差异, 从而增大整个载体文件携带数字 水印信息的能力。 在具体编码值的确定上, 本项文本数字水印技术中不同字符(串) 的字形 与编码值的对应规则是相互独立的。对同一字符(串)的多种字形来说,仅考虑对该字符(串) 的不同拓扑结构进行编码, 不考虑其他字符(串)字形的拓扑结构可能带来的对该字符(串) 的影响, 从而编码较为简单。  An important feature of this text digital watermarking technology is that the glyph encoding methods for a plurality of different characters (strings) contained in the carrier file can be different, and a special glyph can be selected for a certain character (string) according to the characteristics of the character (string) itself. Coding method. Generally speaking, the complexity and topology of character (string) strokes have their own characteristics. Under the premise of maintaining a certain visual sensory quality, the number of different topological structure glyphs that can be designed by different characters (strings) is Difference. In essence, the ability of each character (string) to carry digital watermark information through the change of glyph is different. Different font coding methods for different characters (strings) can fully reflect this difference, thereby increasing the entire carrier file carrying numbers. The ability to watermark information. In the determination of the specific code value, the corresponding rules of the glyphs and code values of different characters (strings) in the text digital watermarking technology are independent of each other. For multiple glyphs of the same character (string), only the different topologies of the character (string) are considered to be encoded, regardless of the other character (string) glyph topology may bring the character (string) The effect is thus simpler to code.
本项文本数字水印技术的水印信息检测方法的特点是: 需要明确载体文件中各个字符 (串) 的特有的字形编码方法。 检测过程通常应首先确定载体文件中各字符(串) 的语义信 息, 根据字符(串) 的语义信息查询各字符(串)特定的字形编码方法, 然后根据特定的字 形编码方法检测对应的字形特征以确定载体文件中各字符(串)字形的编码, 从而检测出数 字水印信息。 例如, 一个字符 (串) 对应的编码方法是 "基于图结构的编码方法", 则应检 测该字符(串)字形对应的 "图"结构特征, 以确定相应的字形编码。 另一个字符 (串)对 应的编码方法是 "基于连通区域个数的编码方法", 则应检测该字符 (串) 字形包含的连通 区域个数这个字形特征, 从而进一步确定字形的编码。 将载体文件中各字符(串)对应的字 形编码组合起来, 就得到了整个载体文件携带的数字水印信息。 本项文本数字水印技术中, 数字水印信息检测过程的基础是首先应明确载体文件中各字符 (串)特定的字形编码方法, 一个字符 (串) 携带的数字水印信息的检测过程与该字符 (串) 的语义信息通常是关联的。 基于对多个字符 (串) 的多种字形统一编码的文本数字水印技术:  The feature of the watermark information detecting method of the text digital watermarking technology is: It is necessary to clarify the unique glyph encoding method of each character (string) in the carrier file. The detection process should generally first determine the semantic information of each character (string) in the carrier file, query each character (string) specific glyph encoding method according to the semantic information of the character (string), and then detect the corresponding glyph feature according to the specific glyph encoding method. The encoding of each character (string) glyph in the carrier file is determined to detect the digital watermark information. For example, if the encoding method corresponding to a character (string) is "coding method based on graph structure", the "graph" structure feature corresponding to the character (string) glyph should be detected to determine the corresponding glyph encoding. The encoding method corresponding to another character (string) is "the encoding method based on the number of connected regions", and the glyph feature of the number of connected regions included in the character (string) glyph should be detected to further determine the encoding of the glyph. By combining the font codes corresponding to the characters (strings) in the carrier file, the digital watermark information carried by the entire carrier file is obtained. In the text digital watermarking technology of this text, the basis of the digital watermark information detection process is to first specify the specific glyph encoding method of each character (string) in the carrier file, and the detection process of the digital watermark information carried by one character (string) and the character ( The semantic information of a string is usually associated. A textual digital watermarking technique based on a uniform encoding of multiple glyphs for multiple characters (strings):
与上述的 "基于对多个字符 (串) 的多种字形分别编码的文本数字水印技术"相似, 在本项文本数字水印技术中, 数字水印信息嵌入到载体文件字符(串) 的多种字形中, 字符 (串)字形的编码用来表示数字水印信息。 仍然采用本发明的改变字符(串) 的拓扑结构的 方法, 为同一字符 (串) 设计出多种字符 (串) 外形。  Similar to the above-mentioned "text digital watermarking technique based on encoding a plurality of glyphs of a plurality of characters (strings)", in the text digital watermarking technique, digital watermark information is embedded in a plurality of glyphs of carrier file characters (strings). The encoding of the character (string) glyph is used to represent the digital watermark information. Still using the method of changing the topology of a character (string) of the present invention, a plurality of character (string) shapes are designed for the same character (string).
与 "基于对多个字符 (串) 的多种字形分别编码的文本数字水印技术"相比, 本项技 术的主要区别在于: 字形的编码方法采用本发明的 "若干种对多个字符(串) 的多种字形进 行统一编码的方法"及其变形方法。  Compared with "text digital watermarking technology based on encoding multiple fonts of multiple characters (strings) respectively, the main difference of this technique is that: the encoding method of the glyphs uses the "several pairs of characters" (string) of the present invention. ) A method of unified coding of multiple glyphs and its deformation method.
本项文本数字水印技术的一个重要特点是针对载体文件的多个字符 (串) 的字形编码 方法相同, 只能使用一种共同的方法对多个字符(串) 的多种字形进行编码。 在具体编码值 的确定上, 不同字符 (串) 的字形编码值的确定规则是统一的, 即对于多个不同字符 (串) 的多种字形,只要它们对应的拓扑结构特征相同, 则它们的编码值应相同。对同一字符(串) 的多种字形进行编码, 不仅应考虑该字符(串) 自身的不同拓扑结构字形特征的编码因素, 而且还应考虑到其他字符(串)字形的编码情况,应与其他字符(串)字形的编码协调一致。 An important feature of this text digital watermarking technique is that the glyph encoding methods for multiple characters (strings) of a carrier file are the same, and only a common method can be used to encode multiple glyphs of multiple characters (strings). In determining the specific code value, the rule for determining the glyph code value of different characters (strings) is uniform, that is, for a plurality of glyphs of a plurality of different characters (strings), as long as their corresponding topological features are the same, their The encoded values should be the same. For the same character (string) The encoding of multiple glyphs should not only consider the encoding factors of the different topological glyph features of the character (string) itself, but also the encoding of other character (string) glyphs, which should be combined with other characters (strings). The coding is coordinated.
本项文本数字水印技术的水印信息检测方法的特点是: 由于载体文件中各字符(串) 只有一种共同的字形编码方法, 检测过程不需要知道每个字符(串)的语义信息, 可直接针 对共同的编码方法检测字符(串)字形的编码特征, 以确定载体文件中各字符(串)字形的 编码, 从而检测出数字水印信息。 例如, 如果载体文件使用 "基于图结构的编码方法", 则 载体文件中的多个字符(串)都使用这个相同的方法, 可直接检测各字符(串)字形对应的 "图"结构特征以确定其编码。再例如, 如果载体文件使用 "基于对独立连通区域个数除以 整数后的余数进行编码的方法",并且整数取 2,则检测方法极为简单。不需要知道各字符(串) 的语义信息, 直接计算每个字符(串)独立连通区域个数, 奇数个数为一种编码(例如编码 为 1), 偶数个数为另一种编码 (例如编码为 0), 从而直接确定各字符 (串)字形的编码。 将载体文件中各字符(串)对应的字形编码组合起来, 就得到了整个载体文件携带的数字水 印信息。本项文本数字水印技术中, 一个字符(串)携带的数字水印信息的检测过程与该字 符(串) 的语义信息无关。  The feature of the watermark information detection method of the text digital watermarking technology is: Since each character (string) in the carrier file has only one common glyph coding method, the detection process does not need to know the semantic information of each character (string), and can directly The encoding characteristics of the character (string) font are detected for a common encoding method to determine the encoding of each character (string) font in the carrier file, thereby detecting the digital watermark information. For example, if the carrier file uses the "coding method based on the graph structure", the same method is used for multiple characters (strings) in the carrier file, and the "graph" structure feature corresponding to each character (string) font can be directly detected. Determine its encoding. For another example, if the carrier file uses "method of encoding based on the remainder of dividing the number of independent connected regions by an integer", and the integer is taken as 2, the detection method is extremely simple. It is not necessary to know the semantic information of each character (string), and directly calculate the number of independent connected regions of each character (string), the odd number is one type of code (for example, code is 1), and the even number is another code (for example) The code is 0), thereby directly determining the encoding of each character (string) glyph. By combining the glyph codes corresponding to the characters (strings) in the carrier file, the digital watermark information carried by the entire carrier file is obtained. In the text digital watermarking technique, the detection process of the digital watermark information carried by one character (string) is independent of the semantic information of the character (string).
对照现有文本数字水印技术, 本发明的主要特点是: Compared with the existing text digital watermarking technology, the main features of the present invention are:
( 1 ) 本发明的数字水印信息的检测方法仅与字符(串)字形的拓扑结构有关,与字符(串) 的大小, 倾斜角度无关, 便于检测。字符(串)字形的缩放、 旋转不影响对水印信息 的检测, 抗噪声能力强, 鲁帮性好。  (1) The method for detecting digital watermark information of the present invention is only related to the topology of the character (string) font, and is independent of the size of the character (string) and the tilt angle, and is convenient for detection. The scaling and rotation of the character (string) glyph does not affect the detection of the watermark information, and the anti-noise ability is strong, and the Lu Gang is good.
(2) 本发明对字符(串)字形的改变方式是适当改变字符(串) 的拓扑结构, 可以不改变 字符(串)字形的外形大小, 整体风格, 水印信息造成的视觉影响小, 嵌入的数字水 印信息不易被觉察。  (2) The change of the character (string) glyph of the present invention is to appropriately change the topological structure of the character (string), and the shape of the character (string) glyph may not be changed, the overall style, the visual influence caused by the watermark information is small, and the embedded Digital watermark information is not easily noticeable.
(3) 本发明的字符(串)字形设计方法灵活, 确定了特定的编码规则后, 可以根据需要为 同一字符(串)设计出有相同编码但有不同字体风格的多种字形, 不需要改变检测方 法及相关的程序, 可扩展性好。  (3) The character (string) glyph design method of the present invention is flexible, and after determining a specific encoding rule, a plurality of glyphs having the same encoding but different font styles can be designed for the same character (string) as needed, without changing The detection method and related programs have good scalability.
(4) 本发明的 "基于对多个字符(串)的多种字形分别编码的文本数字水印技术"可以针 对每一字符(串)的具体情况特别地设计字符(串)的字形,并选择特定的编码方法, 从而使得字符 (串)携带水印信息的容量较大。  (4) The "text digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings) respectively) of the present invention "specially designs the glyphs of characters (strings) for the specific case of each character (string), and selects A specific encoding method, such that the character (string) carries a large amount of watermark information.
(5) 本发明的 "基于对多个字符(串)的多种字形统一编码的文本数字水印技术"不需要 检测各字符(串)的语义信息便可以直接确定各字符(串)携带的文本数字水印信息, 简化了水印信息的检测方法, 并减少了出错的环节。  (5) The text digital watermarking technique based on the uniform encoding of a plurality of characters of a plurality of characters (strings) of the present invention can directly determine the text carried by each character (string) without detecting the semantic information of each character (string). The digital watermark information simplifies the detection method of the watermark information and reduces the error.
附图说明 DRAWINGS
图 1以示例的方式显示了通过适当改变字符的拓扑结构来设计同一字符多种字形的方 法, 并显示了字符字形与数学学科 "图论"中的 "图"对应的方法。  Fig. 1 shows, by way of example, a method of designing a plurality of glyphs of the same character by appropriately changing the topological structure of characters, and shows a method in which character glyphs correspond to "graphs" in the "Graphics" of the mathematical discipline.
图 2以示例的方式显示了基于字符字形包含的连通区域个数和封闭区域个数的字形设 计和编码方法。  Fig. 2 shows, by way of example, a glyph design and encoding method based on the number of connected regions and the number of closed regions included in the character glyph.
图 3以示例的方式显示了字符串的多种字形设计的方法。  Figure 3 shows, by way of example, a method of designing multiple glyphs for a string.
图 4以示例的方式显示了一组字符各自不同的字形设计与编码方法。  Figure 4 shows, by way of example, a glyph design and coding method for each set of characters.
图 5以示例的方式显示了多种形式的字符 (串)字形设计方法。  Figure 5 shows, by way of example, various forms of character (string) glyph design methods.
图 6以示例的方式显示了 "基于对多个字符(串) 的多种字形分别编码的数字水印技 术"的水印信息的加载与检测原理。  Fig. 6 shows, by way of example, the principle of loading and detecting the watermark information based on the digital watermarking technique of separately encoding a plurality of glyphs of a plurality of characters (strings).
图 7以示例的方式显示了 "基于对多个字符(串) 的多种字形统一编码的数字水印技 术"的水印信息的加载与检测原理。 5) 基于对独立连通区域个数除以整数后的余数进行编码的方法的具体实施方式 如果利用本项方法对图 4中的字符 "你"的 4种不同字形(400)〜(403)迸行编码, 假设整数取 2, 等价于对字形包含的独立连通区域个数的奇偶性进行编码, 又假设独立连通 区域个数为奇数的字符字形对应的编码为 " 1 ", 独立连通区域个数为偶数的字符字形对应的 编码为 "0", 则有以下结果: Fig. 7 shows, by way of example, the principle of loading and detecting watermark information "based on a digital watermarking technique of uniformly encoding a plurality of glyphs for a plurality of characters (strings)". 5) A specific implementation method of encoding a remainder after dividing the number of independent connected regions by an integer. If this method is used, four different glyphs (400) to (403) of the character "you" in Fig. 4 are used. Row coding, assuming an integer of 2, is equivalent to encoding the parity of the number of independent connected regions included in the glyph, and assuming that the code corresponding to the number of independent connected regions is "1", and the independent connected regions are If the code corresponding to an even number of character glyphs is "0", the following results are obtained:
如图 4所示, 字形 (400)、 (402)包含的独立连通区域个数为 4, 除以 2后余数为 0, 即字形 (400)、 (402)包含的独立连通区域个数为偶数, 它们的编码为 "0"; 字形(401)、 As shown in FIG. 4, the number of independent connected regions included in the glyphs (400) and (402) is 4, and the remainder after dividing by 2 is 0, that is, the number of independent connected regions included in the glyphs (400) and (402) is even. , their encoding is "0"; glyph (401),
(403)包含的独立连通区域个数为 5, 除以 2后余数为 1, 即字形(401 )、 (403)包含的独 立连通区域个数为奇数, 它们的编码为 " 1"。 这样, 字符 "你"的 4种不同字形(400)〜(403) The number of independent connected areas included is 5, and the remainder after dividing by 2 is 1, that is, the number of independent connected areas included in the glyphs (401) and (403) is an odd number, and their codes are "1". In this way, the characters "you" are 4 different glyphs (400) ~
(403) 按照本项方法进行编码后, 对应了 2种不同的编码状态, 被编为 2种不同的码。 (403) After encoding according to this method, two different encoding states are corresponding, and two different codes are encoded.
6 ) 基于对独立连通区域个数与独立封闭区域个数之和除以整数后的余数进行编码的 方法的具体实施方法  6) A specific implementation method for encoding a remainder based on the sum of the number of independent connected regions and the number of independent closed regions divided by an integer
如果利用本项方法对图 2中的字符 "启"的 15种不同字形(200)〜(280)进行编码, 假设整数取 4, 独立连通区域个数与独立封闭区域个数之和除以 4后余数为 0的字形对应的 编码为 "00", 余数为 1的字形对应的编码为 "01 ", 余数为 2的字形对应的编码为 "10", 余数为 3的字形对应的编码为 "11 ", 则有以下结果:  If this method is used to encode the 15 different glyphs (200)~(280) of the character "in" in Figure 2, assume that the integer is 4, the sum of the number of independent connected regions and the number of independent closed regions divided by 4 The code corresponding to the glyph with the remainder of 0 is "00", the code corresponding to the glyph with a remainder of 1 is "01", the code with the remainder of 2 is "10", and the code with the remainder of 3 is "10". 11 ", then the following results:
、 如图 2所示, 字形 (2001)、 (2002)、 (2003)包含的独立连通区域个数与独立封闭区 域个数的和为 1 ,除以 4后余数为 1,字形编码为" 01 ";字形(210)、(2301 )、(2302)、 (2303) 包含的独立连通区域个数与独立封闭区域个数的和为 2,除以 4后余数为 2,字形编码为" 10"; 字形(220)、 (240)、 (2601)、 (2602)、 (2603)包含的独立连通区域个数与独立封闭区域个 数的和为 3, 除以 4后余数为 3, 字形编码为 " 11 "; 字形(250)、 (270)包含的独立连通区 域个数与独立封闭区域个数的和为 4, 除以 4后余数为 0, 字形编码为 " 00"; 字形(280) 包含的独立连通区域个数与独立封闭区域个数的和为 5,除以 4后余数为 1,字形编码为" 01", 与字形组(200) 的编码相同。 这样, 图 2中的 15个不同的字形按照本项方法进行编码后, 对应了 4种不同的编码状态, 被编为 4种不同的码。  As shown in Figure 2, the sum of the number of independent connected areas and the number of independent closed areas included in the glyphs (2001), (2002), and (2003) is 1, and the remainder is divided by 4, and the glyph is encoded as "01". "; glyphs (210), (2301), (2302), (2303) contain the number of independent connected areas and the number of independent closed areas is 2, divided by 4, the remainder is 2, the glyph is encoded as "10" ; The sum of the number of independent connected areas and the number of independent closed areas included in the glyphs (220), (240), (2601), (2602), and (2603) is 3, and the remainder after dividing by 4 is 3, and the glyph is encoded as "11"; The sum of the number of independent connected areas and the number of independent closed areas contained in glyphs (250) and (270) is 4, and the remainder after dividing by 4 is 0, the glyph is encoded as "00"; the glyph (280) contains The sum of the number of independent connected areas and the number of independent closed areas is 5, and the remainder after division by 4 is 1 and the glyph is coded as "01", which is the same as the code of the glyph group (200). Thus, the 15 different glyphs in Figure 2 are encoded according to this method, corresponding to 4 different encoding states, and are encoded into 4 different codes.
综合应用多种编码方法进行编码的具体实施方法: The specific implementation method of comprehensively applying multiple coding methods for coding:
例如, 对如图 2所示的字符 "启"的 15种字形采用多种编码方法进行综合编码。 首先, 利用 "基于独立连通区域个数与独立封闭区域个数的组合集合的编码方法"进 行编码, 如前所述, 15种字形有 9种编码状态, 可以编为 9种不同的码。其中字形组(200) 包括的三种字形(2001 )、 (2002). (2003)的编码相同;字形组(230)包括的三种字形(2301 )、 (2302)、 (2303) 的编码相同; 字形组 (260)包括的三种字形 (2601 )、 (2602)、 (2603) 的编码相同。  For example, the 15 glyphs of the character "initial" shown in Fig. 2 are integrated and encoded using a plurality of encoding methods. First, the encoding is performed using "the encoding method based on the combined set of the number of independent connected regions and the number of independent closed regions". As described above, there are nine kinds of encoding states for the nine types of glyphs, which can be coded into nine different codes. The glyphs (200) include the same glyphs (2001) and (2002). (2003); the glyphs (230) include the same glyphs (2301), (2302), and (2303). The fonts of the three glyphs (2601), (2602), and (2603) included in the glyph group (260) are the same.
然后, 再用 "基于图结构的编码方法"对字形组(200)、 (230)、 (260) 中的字形进行 二次编码。假如采用基于无向图的编码方式, 则字形组(200)包含的字形(2001)、(2002)、 Then, the glyphs in the glyph groups (200), (230), and (260) are secondarily encoded by the "coding method based on the graph structure". If the encoding method based on the undirected graph is adopted, the glyph group (200) contains the glyphs (2001), (2002),
(2003)对应三种不同结构的 "图 "; 则字形组 (230)包含的字形(2301)、 (2302)、 (2303) 对应两种不同结构的 "图", 其中, 字形 (2302)、 (2303)对应的 "图"是同构的; 字形组(2003) corresponds to three different structures of "graphs"; then the glyphs (230) contain glyphs (2301), (2302), (2303) corresponding to two different structures of "graphs", where glyphs (2302), (2303) The corresponding "graph" is isomorphic; glyph group
(260)包含的字形(2601 )、 (2602)、 (2603)也对应了三种不同结构的 "图"。 经过两次编 码后, 图 2中的 15种字符字形有 14中不同的状态, 可编为 14种不同的码。 (260) The included glyphs (2601), (2602), and (2603) also correspond to "graphs" of three different structures. After two encodings, the 15 character glyphs in Figure 2 have 14 different states and can be programmed into 14 different codes.
对多个字符(串) 的多种字形进行统一编码的方法的具体实施方法: A specific implementation method for uniformly encoding multiple glyphs of multiple characters (strings):
本项编码方法的特点是在多个字符(串) 的多种字形 (注: 包括同一字符(串) 的多 种字形) 组成的集合范围内, 利用同一种方法对多个字符(串) 的多种字形进行统一编码, 字符(串)字形编码值的确定规则在多个字符(串)之间是统一的。 具体编码方法的确定可 在本发明的 "若干种对同一字符(串)的多种字形进行编码的方法"基础上, 将这些方法按  This encoding method is characterized by a plurality of characters (strings) of a plurality of glyphs (note: a plurality of glyphs including the same character (string)), using the same method for multiple characters (strings) A plurality of glyphs are uniformly coded, and a rule for determining a character (string) glyph code value is uniform among a plurality of characters (strings). The specific encoding method can be determined based on the "several methods for encoding a plurality of glyphs of the same character (string)" of the present invention.
10 照统一编码的规则扩展到多个字符的多种字形上。 10 The rules of unified coding are extended to multiple glyphs of multiple characters.
例如, 如果采用 "基于对独立连通区域个数除以整数后的余数进行编码的方法", 对字 符集合 { "你", "好", "!" }的字形进行编码, 假设整数取 2, 则图 4中所示的字形与编码值 的映射规则满足本项编码方法的要求。 由于整数取 2, 等价于对独立连通区域个数的奇偶性 进行编码, 又假设独立连通区域个数为奇数的字符字形对应的编码为 " 1 ", 独立连通区域个 数为偶数的字符字形对应的编码为 "0", 如图 4所示: 字形 (400) (402) 的独立连通区域 个数为 4, 字形(410) (412) 的独立连通区域个数为 2, 字形(440)、 (442) 的独立连通区 域个数也为 2, 这些字形包含的独立连通区域个数除以 2后为余数 0, 即这些字形包含的独 立连通区域个数均为偶数, 所以它们的编码是相同的, 均为 "0"。 字形 (401 ) (403) 的独 立连通区域个数为 5, 字形(411 ) (413) 的独立连通区域个数为 1, 字形(441)、 (443) 的 独立连通区域个数也为 1, 这些字形包含的独立连通区域个数除以 2后为余数 1, 即这些字 形包含的独立连通区域个数均为奇数, 它们的编码是相同的, 均为 "1 "。本例中, 特别应该 注意的是不同字符之间字形与编码值的映射规则是一致的,余数与字形编码值的映射关系在 不同字符之间不能变化。 由此可知, 采用 "基于对独立连通区域个数除以整数后的余数进行 编码的方法", 字符集合{ "你", "好", "!" }如图 4的字形编码值的确定方法满足本项编码 方法的要求。  For example, if the method of encoding based on the remainder of dividing the number of independent connected regions by an integer is used, the glyphs of the character set {"you", "good", "!"} are encoded, and the integer is assumed to be 2, Then, the mapping rules of the glyphs and the encoded values shown in FIG. 4 satisfy the requirements of the encoding method of this item. Since the integer is 2, it is equivalent to encoding the parity of the number of independent connected regions, and it is assumed that the code corresponding to the character glyph with an odd number of independent connected regions is "1", and the number of independent connected regions is even. The corresponding code is "0", as shown in Figure 4: The number of independent connected areas of the glyphs (400) (402) is 4, and the number of independent connected areas of the glyphs (410) (412) is 2, glyphs (440) The number of independent connected regions of (442) is also 2. The number of independent connected regions included in these glyphs is divided by 2 and the remainder is 0. That is, the number of independent connected regions included in these glyphs is even, so their encoding is The same, all are "0". The number of independent connected areas of the glyphs (401) (403) is 5, the number of independent connected areas of the glyphs (411) (413) is 1, and the number of independent connected areas of the glyphs (441) and (443) is also 1. The number of independent connected regions included in these glyphs is divided by 2 and the remainder is 1. That is, the number of independent connected regions included in these glyphs is odd, and their codes are the same, all being "1". In this example, it should be noted that the mapping rules between glyphs and coded values between different characters are consistent, and the mapping relationship between the remainder and the glyph code values cannot be changed between different characters. From this, it can be seen that the method of encoding based on the remainder after dividing the number of independent connected regions by an integer, the character set {"you", "good", "!" } is determined by the method of determining the glyph code value of FIG. Meet the requirements of this coding method.
相反,如果仍然采用 "基于独立连通区域个数的编码方法",对图 4中的字符集合 {"你", "好 ", "!" }的字形进行编码, 图 4中所示的字形与编码值的映射规则不满足本项编码方法 的要求。这是因为编码值为 0的字符 "你"的字形(400)、 (402)的独立连通区域个数为 4, 而编码值为 0的字符 "好"的字形 (410)、 (412)与字符 "!"的字形 (440)、 (442) 的独 立连通区域个数均为 2, 字符 "你"的字形与编码值的映射规则同字符 "好"、 "!"的映射规 则不一致。同样,对于编码值为 "1"的字符字形(401 )、 (403)、 (411) (413)、 (441)、 (443) 来说, 字符 "你"同字符 "好" (或 "!") 的字形与编码值的映射规则也不一致。 由此可知, 采用 "基于独立连通区域个数的编码方法", 字符集合{ "你", "好 ", "!" }如图 4的字形编 码值的确定方法不满足本项编码方法的要求。  On the contrary, if the "encoding method based on the number of independent connected regions" is still adopted, the glyphs of the character set {"you", "good", "!"} in Fig. 4 are encoded, and the glyphs shown in Fig. 4 are The mapping rules of the encoded values do not meet the requirements of this encoding method. This is because the number of independent connected regions of the glyphs (400) and (402) of the character "0" with an encoded value of 0 is 4, and the glyphs (410), (412) with the character "good" with a value of 0 are The number of independent connected areas of the glyphs (440) and (442) of the character "!" is 2, and the mapping rules of the characters "you" and the encoded values are inconsistent with the mapping rules of the characters "good" and "!". Similarly, for character glyphs (401), (403), (411) (413), (441), (443) with an encoding value of "1", the character "you" is the same as the character "good" (or "! The glyph of ") is also inconsistent with the mapping rules of the encoded values. It can be seen that the "encoding method based on the number of independent connected regions", the character set {"you", "good", "!" }, the method of determining the glyph code value of Fig. 4 does not satisfy the requirements of the encoding method of this item. .
由上可知, 对于同样字符集合 { "你", "好", "!" }的如图 4所示的字形来说, 不能采 用 "基于独立连通区域个数的编码方法"对该集合进行统一编码, 而可以采用 "基于对独立 连通区域个数除以 2后的余数进行编码的方法"对该集合进行统一编码。  As can be seen from the above, for the glyphs shown in Figure 4 of the same character set {"you", "good", "!" }, the collection cannot be unified by the "encoding method based on the number of independent connected regions" Encoding, and the set can be uniformly coded by "method of encoding based on the remainder of dividing the number of independent connected regions by two."
基于对多个字符 (串) 的多种字形分别编码的文本数字水印技术的具体实施方式: A specific implementation of a text digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings):
图 6以示例的方式显示了本项数字水印技术的水印信息加载与检测的原理。  Figure 6 shows, by way of example, the principle of watermark information loading and detection of the digital watermarking technique.
如图 6所示, 从框图 (600)、 (601 )到(610), 再到框图 (620)、 (630), 最后到框图 (621 )、 (631 ) 的流程表示水印信息加载的流程。 该流程表示将数字水印信息 "0101100" (600)加载到文本 "你好, 妈妈!" (601 ) 中。 首先, 根据文本 "你好, 妈妈!" (601 ) 中 各字符的语义信息査询如图 4所示的表, 确定每一个字符携带水印信息的长度 (位数), 得 到字符 "你"、 "好"、 "!"携带水印信息的位数为一位; 字符 "妈"携带水印信息的位数为 两位; 字符 ","没有携带水印信息的能力。 然后利用 (610)所示的方法将水印信息进行分 割, 如框图 (610)所示, 字符 "你"对应的水印信息为 "0"; 字符 "好"对应的水印信息 为 " 1 "; 字符 ",,,不对应任何水印信息(因为如图 4所示, ","没有携带水印信息的能力), 前一个字符 "妈"对应的水印信息为 "01 "; 后一个字符 "妈"对应的水印信息为 " 10"; 字 符 "!"对应的水印信息为 "0"。 接下来再查询图 4中的表, 査找各字符字形编码等于该字 符对应的水印信息的字形。 如图 4所示, 在字符 "你"的多种字形中, (400)、 (402) 的编 码为 0, 对应水印信息 "0"; 在字符 "好"的多种字形中, (411 )、 (413) 的编码为 1, 对应 水印信息" 1 ";在字符 "妈"的多种字形中,(431)、 (435)的编码为 01,对应水印信息 "01 ", (432)、 (436)的编码为 10,对应水印信息 " 10";在字符 "!"的多种字形中,(440)、 (442) 的编码为 0, 对应水印信息 " 0"。 字符字形与水印信息的对应结果如框图 (620)、 (630)所 示。 图 6中, 每个字符都有两个字形的编码等于该字符对应的水印信息, 且这两个字形分别 属于不同的字体风格: 宋体和隶书。最后将相同字体风格的多个字符的字形组合起来, 得到 携带了水印信息 " 0101100" ( 600) 的文本字符串 (621 )、 (631 ), 其中字符串 (621 ) 的字 体风格为隶书, 字符串 (631 ) 的字体风格为宋体。 由于字符串 (621 )、 (631 ) 内部各字符 之间的字体风格统一, 加载数字水印信息 " 0101100" ( 600)后给人视觉上带来的影响很小。 As shown in FIG. 6, the flow from block diagrams (600), (601) to (610), to block diagrams (620), (630), and finally to block diagrams (621), (631) represents the flow of watermark information loading. The flow represents loading the digital watermark information "0101100" (600) into the text "Hello, Mom!" (601). First, query the table shown in Figure 4 according to the semantic information of the characters in the text "Hello, Mom!" (601), determine the length (number of bits) of each character carrying the watermark information, and get the character "you", "Good", "!" The number of digits carrying the watermark information is one digit; the character "Mom" carries the watermark information with two digits; the character "," has no ability to carry watermark information. Then, the watermark information is segmented by the method shown in (610). As shown in the block diagram (610), the watermark information corresponding to the character "you" is "0"; the watermark information corresponding to the character "good" is "1";",,, does not correspond to any watermark information (because, as shown in Figure 4, "," does not have the ability to carry watermark information), the watermark information corresponding to the previous character "mother" is "01"; the latter character "mother" corresponds The watermark information is "10"; the character "! "The corresponding watermark information is "0". Next, the table in Fig. 4 is queried to find the glyph of each character glyph code equal to the watermark information corresponding to the character. As shown in Fig. 4, various glyphs in the character "you" Among them, the codes of (400) and (402) are 0, corresponding to the watermark information "0"; among the various glyphs of the characters "good", the codes of (411) and (413) are 1, corresponding to the watermark information "1" In the various glyphs of the character "mother", the codes of (431) and (435) are 01, corresponding to the watermark information "01", the codes of (432) and (436) are 10, corresponding to the watermark information "10"; In the character "! " Among the various glyphs, (440), (442) The code is 0, corresponding to the watermark information "0". Corresponding results of character glyphs and watermark information are shown in block diagrams (620) and (630). In Figure 6, each character has a two-character code equal to the watermark information corresponding to the character, and the two glyphs belong to different font styles: Song and Lishu. Finally, the glyphs of multiple characters of the same font style are combined to obtain a text string (621), (631) carrying the watermark information "0101100" (600), wherein the font style of the string (621) is a librarian, a character The font style of the string (631) is Song. Due to the uniform font style between the characters in the strings (621) and (631), the visual impact on the digital watermark information "0101100" (600) is small.
图 8显示了本项技术的数字水印信息检测的过程。带有数字水印信息的载体文件(800) 通过字符语义识别系统(810)识别出原载体电子文件(820)。 在此基础上, 字符字形识别 系统(830)对带有数字水印信息的载体文件(800)进行字符(串)字形编码识别, 识别出 各字符(串)字形的编码后,组合载体文件中各字符(串)的编码就得到数字水印信息(840)。 字符字形识别系统(830)需要利用字符语义识别系统(810) 的识别结果进行字符(串)字 形编码识别的原因在于:在本项技术中,载体文件中的各个字符(串)的编码方法可以不同, 水印信息检测过程首先需要明确载体文件中各字符(串)的特定的字形编码方法。 因此, 需 要对载体文件中的各字符(串)进行语义识别, 通过字符(串)的语义信息査找如图 4所示 的编码表, 确定各字符(串)特定的字形编码方法, 从而检测各编码方法对应的字形特征, 进一步确定各字符(串)字形的编码。此外, 也可直接利用原载体电子文件作为模板进行水 印信息的检测。 原载体电子文件模板(850 )提供载体文件中字符(串)各自的语义信息, 结合字符字形识别系统(830)可进行字符(串)特征及编码的识别, 此时的水印信息检测 过程是一个非盲水印检测过程。  Figure 8 shows the process of digital watermark information detection by this technique. The carrier file (800) with digital watermark information identifies the original carrier electronic file (820) by the character semantic recognition system (810). On the basis of this, the character glyph recognition system (830) performs character (string) glyph coding recognition on the carrier file (800) with digital watermark information, and recognizes the coding of each character (string) glyph, and then combines the carrier files. The encoding of the character (string) yields digital watermark information (840). The character glyph recognition system (830) needs to use the recognition result of the character semantic recognition system (810) to perform character (string) glyph coding recognition because in the present technology, the encoding method of each character (string) in the carrier file can be Differently, the watermark information detection process first needs to clarify the specific glyph coding method of each character (string) in the carrier file. Therefore, it is necessary to perform semantic recognition on each character (string) in the carrier file, find the coding table shown in FIG. 4 by the semantic information of the character (string), and determine a specific glyph coding method for each character (string), thereby detecting each The glyph feature corresponding to the encoding method further determines the encoding of each character (string) glyph. In addition, the original carrier electronic file can be directly used as a template for the detection of watermark information. The original carrier electronic file template (850) provides the semantic information of the characters (strings) in the carrier file, and the character font recognition system (830) can perform character (string) feature and code recognition. The watermark information detection process at this time is a Non-blind watermark detection process.
例如, 如果需要从图 6所示的字符串 (621 )或(631 ) 中检测出数字水印信息, 检测 系统需要知道组成字符串 (621 )或(631 )的各字符的特定的字形编码方式。 所以, 检测系 统首先应该识别出各字符的语义信息 (包括人工识别), 或者直接得到原载体电子文件, 利 用原载体电子文件作为模板提供字符的语义信息, 然后, 通过字符的语义信息获取各字符特 定的编码方法。本例中字符 "你"、 "好"、 "!"采用的是 "基于独立连通区个数的编码方法", 字符 "妈"采用的是 "基于独立连通区个数与独立封闭区域个数的组合集合的编码方法", 字符 ", "不携带水印信息。 然后, 检测系统根据各字符的特定字形编码方法, 检测编码方 法对应的字形特征, 例如, 应检测字符 "你"、 "好"、 "!"的字形对应的独立连通区个数的 字形特征, 检测字符 "妈"的字形对应的独立连通区域个数与独立封闭区域个数形成的组合 集合的字形特征。检测字符字形特征的结果是得到相应的字符字形编码,字符串(621 ). (631 ) 中各字符字形与编码的对应关系如框图(620)、 (630)所示。 组合各字符字形的编码, 得到 字符串 (621 )、 ( 631 )携带的数字水印信息为 "0101100" ( 600)。  For example, if digital watermark information needs to be detected from the character string (621) or (631) shown in Fig. 6, the detection system needs to know the specific glyph coding mode of each character constituting the character string (621) or (631). Therefore, the detection system should first identify the semantic information of each character (including manual recognition), or directly obtain the original carrier electronic file, use the original carrier electronic file as a template to provide the semantic information of the character, and then obtain each character through the semantic information of the character. Specific coding method. In this example, the characters "you", "good", "!" are based on "encoding method based on the number of independent connected areas", and the character "mother" is based on "number of independent connected areas and number of independent closed areas" The encoding method of the combined set ", character", "does not carry watermark information. Then, the detecting system detects the glyph features corresponding to the encoding method according to the specific glyph encoding method of each character. For example, the glyph features of the number of independent connected regions corresponding to the glyphs of the characters "you", "good", and "!" should be detected. The glyph feature of the combined set of the number of independent connected regions corresponding to the glyph of the character "mother" and the number of independent closed regions is detected. The result of detecting the character glyph feature is to obtain the corresponding character glyph encoding, and the character string (621). The correspondence between each character glyph and the encoding in (631) is as shown in the block diagrams (620) and (630). The encoding of each character glyph is combined to obtain the digital watermark information carried by the character strings (621) and (631) as "0101100" (600).
需要说明的是本发明的编码方法对应的字形特征的检测方法, 为现有成熟技术。例如, 对字符(串)字形映射的 "图"的同构性判断, 对字符(串)字形包含的独立连通区域个数、 独立封闭区域个数的计算, 利用现有成熟技术可完成, 这些技术不包含在本发明的范围内。 基于对多个字符(串) 的多种字形统一编码的文本数字水印技术的具体实施方式- 图 7以示例的方式显示了本项数字水印技术的水印信息的加载与检测的原理。  It should be noted that the method for detecting glyph features corresponding to the coding method of the present invention is a prior art mature technology. For example, the isomorphism judgment of the "graph" of the character (string) glyph mapping, the calculation of the number of independent connected regions included in the character (string) glyph, and the number of independent closed regions can be completed by using existing mature techniques. Techniques are not included in the scope of the invention. A specific implementation of a textual digital watermarking technique based on a plurality of glyphs for a plurality of characters (strings) - Figure 7 shows, by way of example, the principle of loading and detecting watermark information of the present digital watermarking technique.
如图 7所示, 从框图 (700)、 ( 701 )到 ( 710), 再到框图 ( 720)、 ( 730), 最后到框图 ( 721 )、 ( 731 )的流程表示水印信息加载的流程。 该流程表示将数字水印信息 "010" ( 700) 加载到文本 "你好 I " ( 701 ) 中。 如图 4所示, 文本字符串 "你好!" ( 701 ) 中的各字符的编 码方法相同(都釆用 "基于对独立连通区域个数除以 2后的余数进行编码的方法"), 且各字 符携带的水印信息长度(位数)相同, 不需要特别的水印信息分割处理, 只需按顺序、 等长 度(位数)进行水印信息对应。如框图(710)所示,每个字符顺序对应水印信息 "010" ( 700) 中的一位信息。然后再査询图 4中的表, 査找各字符字形编码等于该字符对应的水印信息的 字形。 如图 4所示, 在字符 "你"的多种字形中, (400)、 (402) 的编码为 0, 对应水印信  As shown in FIG. 7, the flow from block diagrams (700), (701) to (710), to block diagrams (720), (730), and finally to block diagrams (721), (731) represents the flow of watermark information loading. This flow means loading the digital watermark information "010" (700) into the text "Hello I" (701). As shown in Fig. 4, the encoding method of each character in the text string "Hello!" ( 701 ) is the same (using the method of encoding based on the remainder after dividing the number of independent connected regions by 2), Moreover, the length (number of bits) of the watermark information carried by each character is the same, and no special watermark information segmentation processing is required, and only the watermark information is required to be sequentially and equal in length (number of bits). As shown in block (710), each character order corresponds to one bit of information in the watermark information "010" (700). Then query the table in Figure 4 to find the glyph of each character glyph code equal to the watermark information corresponding to the character. As shown in Figure 4, in the various glyphs of the character "you", the encoding of (400) and (402) is 0, corresponding to the watermark letter.
12 息 "0"; 在字符 "好"的多种字形中, (411 )、 (413) 的编码为 1 , 对应水印信息 " 1 "; 在 字符 "!"的多种字形中, (440)、 (442) 的编码为 0, 对应水印信息 "0"。 字符字形与水印 信息的对应结果如框图 (720)、 (730)所示。 图 7中, 每个字符都有两个字形的编码等于该 字符対应的水印信息, 且这两个字形分别属于不同的字体风格: 宋体和隶书。最后将相同字 体风格的多个字符的字形组合起来,得到携带了水印信息 "010" (700)的文本字符串(721 )、 (731 ), 其中字符串 (721 ) 的字体风格为隶书, 字符串 (731 ) 的字体风格为宋体。 由于字 符串 (721 )、 (731 ) 内部各字符之间的字体风格统一, 加载数字水印信息 "010" (700)后 给人视觉上带来的影响很小。 12 In the various glyphs of the character "good", the codes of (411) and (413) are 1, corresponding to the watermark information "1"; among the various glyphs of the character "!", (440), The code of (442) is 0, corresponding to the watermark information "0". Corresponding results of character glyphs and watermark information are shown in block diagrams (720) and (730). In Figure 7, each character has a two-character code equal to the watermark information of the character, and the two glyphs belong to different font styles: Song and Lishu. Finally, the glyphs of multiple characters of the same font style are combined to obtain a text string (721), (731) carrying the watermark information "010" (700), wherein the font style of the string (721) is a librarian, a character The font style of string (731) is Song. Due to the uniform font style between the characters in the strings (721) and (731), the visual effect of loading the digital watermark information "010" (700) is small.
假设采用 "基于对独立连通区域个数与独立封闭区域个数之和除以整数后的余数进行 编码的方法", 对图 3中的句子 (350) 中的单词 (字符串)进行统一编码, 整数取 2, 等价 于对独立连通区域个数与独立封闭区域个数之和的奇偶性进行统一编码。又假设和数为奇数 的字符串字形对应的编码为 " 1",和数为偶数的字符串字形对应的编码为" 0",则句子(350) 携带数字水印信息的情况是这样的: 单词(3501 )的独立连通区域个数与独立封闭区域个数 之和为 4, (3502) 为 13, (3503)为 11, (3504)为 2, (3505)为 4, (3506)为 2, (3507) 为 4, (3508)为 7。 可知单词 (3501)、 (3504)、 (3505)、 (3506)、 (3507)独立连通区域个 数与独立封闭区域个数之和为偶数, 编码为 "0", 单词 (3502)、 (3503)、 (3508)独立连通 区域个数与独立封闭区域个数之和为奇数, 编码为 " 1 "。 这样, 句子 (350) 中的单词 (字 符串)按从左到右的顺序对应的编码为 "01100001 ",而二进制数字 "01100001 "对应的 ASC 码为 "a",这相当于句子(350)携带了数字水印信息 "a" (或二进制水印信息 "01100001")。  Assuming that "the method of encoding based on the sum of the number of independent connected regions and the number of independent closed regions divided by an integer" is used, the words (strings) in the sentence (350) in FIG. 3 are uniformly encoded, The integer is taken as 2, which is equivalent to uniformly coding the parity of the number of independent connected regions and the number of independent closed regions. It is also assumed that the code corresponding to the odd-numbered string glyph is "1", and the code corresponding to the even-numbered string glyph is "0", and the sentence (350) carries the digital watermark information as follows: The sum of the number of independent connected areas (3501) and the number of independent closed areas is 4, (3502) is 13, (3503) is 11, (3504) is 2, (3505) is 4, and (3506) is 2. (3507) is 4, (3508) is 7. It can be seen that the sum of the number of independent connected areas of words (3501), (3504), (3505), (3506), (3507) and the number of independent closed areas is even, coded as "0", words (3502), (3503 (3508) The sum of the number of independent connected areas and the number of independent closed areas is an odd number, and the code is "1". Thus, the words (strings) in the sentence (350) correspond to the code "01100001" in the order from left to right, and the ASC code corresponding to the binary number "01100001" is "a", which is equivalent to the sentence (350). The digital watermark information "a" (or binary watermark information "01100001") is carried.
图 9显示了本项技术的数字水印信息检测的过程。字符字形识别系统(910)对带有数 字水印信息的载体文件(900)直接进行字符 (串)字形编码识别。 由于带有数字水印信息 的载体文件(900) 中的各字符 (串)有共同的字形编码方法, 可以直接检测与该编码方法 对应的各字符(串) 的字形特征, 进一步确定各字符(串)字形的编码, 组合载体文件中各 字符(串)的编码就得到数字水印信息(920)。整个水印的检测过程是一个盲水印检测过程, 不需要获得原载体电子文件的模板, 或者进行字符(串)语义识别。 例如, 如果需要从图 7 所示的字符串 (721 )或(731 ) 中检测出数字水印信息, 字符字形识别系统(910)根据统 一的字形编码方法进行字符字形识别。 例子中的字符 "你"、 "好"、 "!"都釆用了 "基于对 独立连通区域个数除以 2后的余数进行编码的方法"(等价于 "基于对独立连通区域个数的 奇偶性进行编码的方法"), 所以, 字符字形识别系统(910)可以直接判断字符串 (721 )或 Figure 9 shows the process of digital watermark information detection by this technique. The character glyph recognition system (910) directly performs character (string) glyph code recognition on the carrier file (900) with digital watermark information. Since each character (string) in the carrier file (900) with digital watermark information has a common glyph encoding method, the glyph features of each character (string) corresponding to the encoding method can be directly detected, and each character (string) is further determined. The encoding of the glyphs, the encoding of each character (string) in the carrier file, results in digital watermark information (920). The whole watermark detection process is a blind watermark detection process, which does not require obtaining a template of the original carrier electronic file, or performing character (string) semantic recognition. For example, if it is necessary to detect digital watermark information from the character string (721) or (731) shown in Fig. 7, the character font recognition system (910) performs character glyph recognition in accordance with a unified glyph encoding method. The characters "you", "good", and "!" in the example use the "method based on the number of independent connected regions divided by 2" (equivalent to "based on the number of independent connected regions" The method of encoding the parity is "), so the character glyph recognition system (910) can directly judge the string (721) or
(731 )中各字符字形包含的独立连通区域个数的奇偶性,奇数编码'为 " 1 ",偶数编码为" 0"。 这个规则对各个字符都是相同的, 从而可直接检测出字符字形对应的编码, 不需要知道各个 字符的语义信息,字符串(721 )、 (731 )中各字符字形与编码的对应关系如框图(720)、 (730) 所示。 组合各字符字形的编码, 得到字符串 (721 )、 (731 )携带的数字水印信息为 "010"(731) The parity of the number of independent connected regions included in each character glyph, the odd code 'is '1', and the even code is '0'. This rule is the same for each character, so that the code corresponding to the character glyph can be directly detected. It is not necessary to know the semantic information of each character, and the correspondence between each character glyph and the code in the characters (721) and (731) is as shown in the block diagram. (720), (730) are shown. Combine the encoding of each character glyph to obtain the digital watermark information carried by the characters (721) and (731) as "010"
(700)。 (700).
本项数字水印技术与 "基于对多个字符(串) 的多种字形分别编码的文本数字水印技 术"相比,本质的区别在于: 在本项技术中,携带数字水印信息的载体文件中的各字符(串) 需要采用共同的字形编码方法, 而对于 "基于对多个字符(串) 的多种字形分别编码的文本 数字水印技术", 载体文件中的各字符(串)可以采用不同的字形编码方法。  The essential difference between this digital watermarking technology and the "text digital watermarking technique based on the encoding of multiple fonts of multiple characters (strings)" is: In this technology, in the carrier file carrying the digital watermark information Each character (string) needs to adopt a common glyph encoding method, and for "text digital watermarking technology based on a plurality of glyphs for multiple characters (strings) respectively", each character (string) in the carrier file can be different. Glyph encoding method.
13 13

Claims

权利 要 求 书 Claim
1、 一种隐藏数据通信方法, 其特征是: 利用字符或字符串的不同拓扑结构的字形来携 带隐藏信息。 A hidden data communication method, characterized in that: a glyph of a different topological structure of a character or a character string is used to carry hidden information.
2、 将同一字符或字符串设计成多种字形的方法, 其特征是: 通过改变字符或字符串的 拓扑结构, 从而得到同一字符或字符串的多种外形, 用于携带隐藏信息。  2. A method of designing the same character or string into a plurality of glyphs, and the feature is: by changing the topology of the character or the string, thereby obtaining various shapes of the same character or string for carrying hidden information.
3、 如权利要求 2所述的将同一字符或字符串设计成多种字形的方法, 其特征是: 通过 改变组成字符或字符串的各笔划之间的连断关系来改变字符或字符串的拓扑结构,从而得到 同一字符或字符串的多种外形。  3. A method of designing the same character or character string into a plurality of glyphs as claimed in claim 2, wherein: changing the character or character string by changing a concatenation relationship between the strokes constituting the character or the character string. Topology to get multiple shapes of the same character or string.
4、 对同一字符或字符串的多种字形进行编码的方法, 其特征是: 相同拓扑结构的字符 或字符串字形有相同的编码, 不同拓扑结构的字符或字符串字形的编码不能完全相同, 即至 少有两个不同的编码。  4. A method for encoding multiple glyphs of the same character or a string, the feature is: characters or string glyphs of the same topology have the same encoding, and characters of different topological structures or string glyphs cannot be identical. That is, there are at least two different encodings.
5、 对同一字符或字符串的多种字形进行编码的方法, 其步骤是:  5. A method of encoding multiple glyphs of the same character or string, the steps of which are:
( 1 )将字符或字符串字形映射为数学学科 "图论"中定义的 "图 ";  (1) Mapping character or string glyphs to "graphs" defined in the "Graphics" of mathematics;
(2) 同构的 "图"对应的字符或字符串字形有相同的编码, 不同构的 "图"对应的字符或 字符串字形的编码不能完全相同, 即至少有两个不同的编码。  (2) The characters or string glyphs corresponding to the isomorphic "graph" have the same encoding. The encoding of the characters or string glyphs corresponding to different "maps" cannot be identical, that is, there are at least two different encodings.
6、 对同一字符或字符串的多种字形进行编码的方法, 其特征是: 针对字符或字符串字 形包含的独立连通区域的个数进行编码,独立连通区域个数相同的字符或字符串字形有相同 的编码, 独立连通区域个数不同的字符或字符串字形的编码不能完全相同, 即至少有两个不 同的编码。  6. A method for encoding a plurality of glyphs of the same character or a character string, wherein: the number of independent connected regions included in the character or string glyph is encoded, and the characters or string glyphs having the same number of independent connected regions are respectively. With the same encoding, the encoding of characters or string glyphs with different numbers of independent connected regions cannot be identical, that is, there are at least two different encodings.
7、 对同一字符或字符串的多种字形进行编码的方法, 其特征是: 针对字符或字符串字 形包含的独立连通区域个数与字符或字符串字形包含的独立封闭区域个数两者的组合集合 进行编码, 相同的组合集合对应的字符或字符串字形有相同的编码, 不同的组合集合对应的 字符或字符串字形的编码不能完全相同, 即至少有两个不同的编码。  7. A method for encoding a plurality of glyphs of the same character or a character string, wherein: the number of independent connected regions included in the character or string glyph and the number of independent closed regions included in the character or string glyph. The combined set is encoded, and the characters or string glyphs corresponding to the same combination set have the same encoding, and the encoding of the corresponding character or string glyph of different combination sets cannot be identical, that is, there are at least two different encodings.
8、 对同一字符或字符串的多种字形进行编码的方法, 其特征是: 针对字符或字符串字 形包含的独立连通区域个数与字符或字符串字形包含的独立封闭区域个数之和进行编码,两 者之和相同的字符或字符串字形有相同的编码,两者之和不同的字符或字符串字形的编码不 能完全相同, 即至少有两个不同的编码。  8. A method for encoding a plurality of glyphs of the same character or a character string, wherein: the sum of the number of independent connected regions included in the character or string glyph and the number of independent closed regions included in the character or string glyph. Encoding, the same character or string glyph has the same encoding, and the encoding of the different characters or string glyphs cannot be identical, that is, there are at least two different encodings.
9、 对同一字符或字符串的多种字形进行编码的方法, 其特征是: 针对字符或字符串字 形包含的独立连通区域个数除以整数后的余数进行编码,余数相同的字符或字符串字形有相 同的编码, 余数不同的字符或字符串字形有不同的编码。  9. A method for encoding a plurality of glyphs of the same character or a character string, wherein: the character or string having the same remainder is encoded by dividing the number of independent connected regions included in the character or string glyph by the remainder of the integer. Glyphs have the same encoding, and different numbers of characters or string glyphs have different encodings.
10、对同一字符或字符串的多种字形进行编码的方法, 其特征是: 针对字符或字符串字 形包含的独立连通区域的个数与字符或字符串字形包含的独立封闭区域个数之和除以整数 后的余数进行编码, 余数相同的字符或字符串字形有相同的编码, 余数不同的字符或字符串 字形有不同的编码。  10. A method of encoding a plurality of glyphs of the same character or a character string, wherein: the sum of the number of independent connected regions included in the character or string glyph and the number of independent closed regions included in the character or string glyph Divided by the remainder after the integer, the same number of characters or string glyphs have the same encoding, and the remainder of the different characters or string glyphs have different encodings.
11、 组合利用如权利要求 4、 5、 6、 7、 8、 9或 10所述的对同一字符或字符串的多种字 形进行编码的方法对同一字符或字符串的多种字形进行编码的方法。  11. Combining the use of a plurality of glyphs of the same character or character string as encoded in claim 4, 5, 6, 7, 8, 9, or 10 to encode a plurality of glyphs of the same character or character string method.
12、 采用如权利要求 4、 5、 6、 7、 8、 9、 10或 11所述的对同一字符或字符串的多种字 形进行编码的方法对多个字符或字符串的多种字形进行编码的方法, 其特征是: 在多个字符 或字符串的多种字形组成的集合范围内, 对多个字符或字符串的多种字形进行统一编码,字 符或字符串字形编码值的确定规则在多个字符或字符串之间是一致的。  12. A method of encoding a plurality of glyphs of the same character or a character string as claimed in claim 4, 5, 6, 7, 8, 9, 10 or 11 for a plurality of glyphs of a plurality of characters or character strings The encoding method is characterized in that: in a collection of a plurality of characters or a plurality of glyphs of a character string, a plurality of glyphs of a plurality of characters or a character string are uniformly encoded, and a character or string glyph code value is determined. Consistent across multiple characters or strings.
13、 对字符或字符串的多种字形进行编码的方法, 其特征是: 针对将字符或字符串字形 包含的独立连通区域的个数、 独立封闭区域个数作为参数的数学运算结果进行编码。  13. A method of encoding a plurality of glyphs of a character or a character string, wherein: the mathematical operation result of the number of independent connected regions included in the character or string glyph and the number of independent closed regions is used as a parameter.
14、 文本数字水印嵌入与检测方法, 其特征是:  14. Text digital watermark embedding and detecting method, which is characterized by:
14 (1 )采用如权利要求 2或 3所述的方法设计合适的字符或字符串字形,并用如权利要 求 4〜11所述的方法分别对载体文件多个字符或字符串的多种字形进行编码。数字水印信息 嵌入到载体文件各字符或字符串的多种字形中,字符或字符串字形的编码用来表示数字水印 信息。 14 (1) designing a suitable character or character string glyph according to the method of claim 2 or 3, and encoding a plurality of glyphs of a plurality of characters or character strings of the carrier file by the method according to claims 4 to 11, respectively. . The digital watermark information is embedded in a plurality of glyphs of characters or strings of the carrier file, and the encoding of the character or string glyph is used to represent the digital watermark information.
(2)针对载体文件中的各字符或字符串在 (1 ) 中确定的各自的编码方法, 分别检测 载体文件中各字符或字符串的字形特征, 以确定载体文件各字符或字符串字形的编码, 从而 检测出数字水印信息。  (2) for each character or string in the carrier file in the respective encoding method determined in (1), respectively detecting the glyph features of each character or string in the carrier file to determine the character or string glyph of the carrier file. Encoding to detect digital watermark information.
15、 文本数字水印嵌入与检测方法, 其特征是:  15. A text digital watermark embedding and detecting method, which is characterized by:
( 1 )采用如权利要求 2或 3所述的方法设计合适的字符或字符串字形, 并用如权利要 求 12所述的方法对载体文件中多个字符或字符串的多种字形进行统一编码。 数字水印信息 嵌入到载体文件各字符或字符串的多种字形中,字符或字符串字形的编码用来表示数字水印 信息。  (1) A suitable character or string glyph is designed by the method of claim 2 or 3, and a plurality of glyphs of a plurality of characters or character strings in the carrier file are uniformly encoded by the method according to claim 12. The digital watermark information is embedded in a plurality of glyphs of characters or strings of the carrier file, and the code of the character or string glyph is used to represent the digital watermark information.
(2)针对载体文件中的各字符或字符串在(1 ) 中确定的共同的编码方法, 统一检测载 体文件中各字符或字符串的字形特征, 以确定载体文件各字符或字符串字形的编码, 从而检 测出数字水印信息。  (2) For the common encoding method determined in (1) for each character or character string in the carrier file, uniformly detect the glyph features of each character or string in the carrier file to determine the character or string glyph of the carrier file. Encoding to detect digital watermark information.
15 15
PCT/CN2005/001703 2004-10-18 2005-10-17 Hidden data communication method and the application thereof in text digital watermark technology WO2006042460A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN200410040853.4 2004-10-18
CN 200410040853 CN1601956A (en) 2004-10-18 2004-10-18 Text digital Watermark tech using character's features for carrying watermark information
CN 200510065893 CN1684115B (en) 2004-10-18 2005-04-20 Text digital water printing technology based on character topoloical structure
CN200510065893.9 2005-04-20

Publications (1)

Publication Number Publication Date
WO2006042460A1 true WO2006042460A1 (en) 2006-04-27

Family

ID=35263434

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2005/001703 WO2006042460A1 (en) 2004-10-18 2005-10-17 Hidden data communication method and the application thereof in text digital watermark technology

Country Status (2)

Country Link
CN (1) CN1684115B (en)
WO (1) WO2006042460A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011021114A1 (en) 2009-08-20 2011-02-24 Nds Limited Electronic book security features
US8762828B2 (en) 2011-09-23 2014-06-24 Guy Le Henaff Tracing an electronic document in an electronic publication by modifying the electronic page description of the electronic document

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239753B (en) * 2014-07-03 2017-05-03 东华大学 Tamper detection method for text documents in cloud storage environment
CN107423629B (en) * 2017-04-12 2020-10-27 北京溯斐科技有限公司 Method and system for file information output anti-disclosure and tracing
CN108830772A (en) * 2018-05-25 2018-11-16 珠海奔图电子有限公司 Watermark encoder conversion method and device
CN111294340B (en) * 2020-01-17 2022-05-17 河南芯盾网安科技发展有限公司 Encryption information steganography method based on zero-width characters
CN116824598B (en) * 2023-08-24 2023-10-31 强企宝典(山东)信息科技有限公司 Method and device for protecting copyright of digital written works

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001053954A (en) * 1999-08-17 2001-02-23 Ntt Data Corp Device and method for embedding information and reading information, digital watermark system and recording medium
CN1344462A (en) * 2000-01-19 2002-04-10 皇家菲利浦电子有限公司 Invisable coding of element information
JP2002232679A (en) * 2001-01-30 2002-08-16 Canon Inc Method and device for image processing, computer program, and storage medium
CN1404298A (en) * 2001-09-03 2003-03-19 佳能株式会社 Image processing apparatus and image processing method and program and storage media

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001053954A (en) * 1999-08-17 2001-02-23 Ntt Data Corp Device and method for embedding information and reading information, digital watermark system and recording medium
CN1344462A (en) * 2000-01-19 2002-04-10 皇家菲利浦电子有限公司 Invisable coding of element information
JP2002232679A (en) * 2001-01-30 2002-08-16 Canon Inc Method and device for image processing, computer program, and storage medium
CN1404298A (en) * 2001-09-03 2003-03-19 佳能株式会社 Image processing apparatus and image processing method and program and storage media

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011021114A1 (en) 2009-08-20 2011-02-24 Nds Limited Electronic book security features
WO2011021113A1 (en) 2009-08-20 2011-02-24 Nds Limited Electronic book security features
WO2011021111A1 (en) 2009-08-20 2011-02-24 Nds Limited Hindering optical character recognition of a displayed text
WO2011021112A1 (en) 2009-08-20 2011-02-24 Nds Limited Electronic book security features
WO2011021110A1 (en) 2009-08-20 2011-02-24 Nds Limited Electronic book security features
US8791788B2 (en) 2009-08-20 2014-07-29 Cisco Technology Inc. Electronic book security features
US8762828B2 (en) 2011-09-23 2014-06-24 Guy Le Henaff Tracing an electronic document in an electronic publication by modifying the electronic page description of the electronic document
US9606967B2 (en) 2011-09-23 2017-03-28 Guy Le Henaff Tracing a document in an electronic publication

Also Published As

Publication number Publication date
CN1684115A (en) 2005-10-19
CN1684115B (en) 2011-03-23

Similar Documents

Publication Publication Date Title
WO2006042460A1 (en) Hidden data communication method and the application thereof in text digital watermark technology
US5761686A (en) Embedding encoded information in an iconic version of a text image
CN109977861B (en) Off-line handwriting mathematical formula recognition method
Nagy Twenty years of document image analysis in PAMI
CN1607540B (en) System and method for detecting hand-drawn objects inputted by ink
US7942341B2 (en) Two dimensional dot code, and decoding apparatus and method for a two dimensional dot code
CN108182349B (en) Word document watermark copyright information protection device and method
Shirali-Shahreza et al. Arabic/Persian text steganography utilizing similar letters with different codes
JPH10334272A (en) Information embedding method and system for three-dimensional shape model
JP2003528388A (en) Document processing
CN104143200B (en) The frame type coding and intelligent identification Method of a kind of additional information of images
CN106462981A (en) Contour encryption and decryption
CN103593677A (en) Near-duplicate image detection method
CN106845593A (en) A kind of rectangle fixes dot matrix information encoding-decoding method
CN100498834C (en) Digital water mark embedding and extracting method and device
CN101692288A (en) Digital watermark embedding and detecting method of CAD model indicated on basis of NURBS
CN111488732A (en) Deformed keyword detection method, system and related equipment
CN103310134B (en) Vector data watermark anti-counterfeiting method based on geographical semantics support
CN105975643B (en) A kind of realtime graphic search method based on text index
CN115455955A (en) Chinese named entity recognition method based on local and global character representation enhancement
CN101777172A (en) Cellular automata-based blind watermark implementing method
CN1812321A (en) Hidden communication method for mutual independence of character graphic and code
CN114817873A (en) Watermark generating and reading method and device based on deformation
Kise et al. Backgrounds as information carriers for printed documents
CN102456215B (en) Embedding and extraction method of visible watermark carrying hidden information and system thereof

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 05802022

Country of ref document: EP

Kind code of ref document: A1

WWW Wipo information: withdrawn in national office

Ref document number: 5802022

Country of ref document: EP