CN104361268A - Watermark embedding and reading method, device and system - Google Patents

Publication number
CN104361268A
Authority
CN
China
Prior art keywords
font
watermark
text document
pixel
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410705665.2A
Other languages
Chinese (zh)
Inventor
刘创招
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Original Assignee
Guangzhou Shiyuan Electronics Thecnology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Shiyuan Electronics Thecnology Co Ltd
Priority to CN201410705665.2A
Publication of CN104361268A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/10 Protecting distributed programs or content, e.g. vending or licensing of copyrighted material; Digital rights management [DRM]
    • G06F21/16 Program or content traceability, e.g. by watermarking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/109 Font handling; Temporal or kinetic typography

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Technology Law (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a watermark embedding method comprising the steps of: modifying any number of pixels of each font in an original font library, the modified fonts together forming a feature font library; setting the information contained in a watermark and converting the information into a binary sequence; and, according to the binary sequence, replacing the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, so as to obtain a text document containing the watermark. Correspondingly, the invention further provides a watermark embedding device, a reading method, a reading device and a processing system. In the embodiments of the invention, the watermark is embedded into the text document by modifying the pixels of fonts; the approach is simple and convenient, does not affect the user's reading experience, and offers high security.

Description

Watermark embedding and reading method, device and system
Technical field
The present invention relates to the field of information security, and in particular to a watermark embedding and reading method, device and system.
Background
Digital watermarking has developed rapidly in recent years as an important technology for intellectual property protection and information security. Its basic principle is to embed hidden digital watermark information with a definite meaning into a multimedia data carrier (such as an image, text, audio or video). The embedded watermark information does not affect the use of the original carrier data, and it is transmitted and used together with that data. After the watermark information has been embedded, a dedicated watermark detection device can read it back out, which supports applications such as copyright protection, tamper localization, data integrity checking, broadcast monitoring, content authentication, usage control and covert communication. Text watermarking, as one branch of digital watermarking, is attracting increasing attention and has become a focus of digital watermarking research.
There are two main existing schemes for embedding a watermark into the text of a web page. The first converts the text into a picture, adds a picture watermark to that picture, and then displays the watermarked picture in the web page. The drawbacks of this scheme are that the watermark is visible to the naked eye, which impairs the user's reading experience, and that the program must process the watermark specially, which makes the method cumbersome.
The second adds invisible random characters to the source code of the web page. The page does not display these characters, so the reading experience is not affected, and when a user copies text from the page the random characters are copied along with it. The drawback of this scheme is that, because the page renders normally, a screenshot of the page contains no watermark information, so the source of the screenshot cannot be determined and security is poor.
Summary of the invention
The object of the present invention is to provide a watermark embedding and reading method, device and system with which the watermark information in a screenshot of a text document can be obtained without affecting the user's reading experience, so that the source of the screenshot can be determined and security is high.
An embodiment of the present invention provides a watermark embedding method, comprising:
modifying any number of pixels of each font in an original font library, the fonts so modified together forming a feature font library;
setting the information contained in the watermark, and converting the information into a binary sequence;
according to the binary sequence, replacing the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
In one embodiment, the step of modifying the pixels of each font in the original font library to form the feature font library specifically comprises: removing any one pixel from each font in the original font library, the fonts so modified together forming the feature font library.
In another embodiment, the step of modifying the pixels of each font in the original font library to form the feature font library specifically comprises: adding any one pixel to each font in the original font library, the fonts so modified together forming the feature font library.
Further, after the step of setting the information contained in the watermark and converting the information into a binary sequence, the method also comprises: encrypting the binary sequence.
In that case, the step of replacing the corresponding fonts in the text document according to the binary sequence specifically comprises: according to the encrypted binary sequence, replacing the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
Further, the information contained in the watermark is the user name of the user browsing the text document.
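The conversion of the watermark information into a binary sequence can be sketched as follows. The patent does not specify the conversion rule, so the UTF-8 encoding used here and the function name `text_to_bits` are assumptions made for illustration only.

```python
def text_to_bits(info: str) -> str:
    """Encode info as UTF-8 and return its bits as a string of '0'/'1' characters.

    Assumption: the patent leaves the conversion rule open; UTF-8 with
    8 bits per byte is just one plausible choice.
    """
    return "".join(f"{byte:08b}" for byte in info.encode("utf-8"))


# Example: a user name used as watermark information.
bits = text_to_bits("Zhang San")
print(len(bits), bits[:8])
```

Under this rule a nine-letter ASCII user name needs 72 characters of carrier text, which is why a real deployment might prefer a shorter identifier.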
Correspondingly, an embodiment of the present invention also provides a watermark reading method, comprising:
scanning a text document containing a watermark, the text document having been obtained by applying the watermark embedding method described above;
determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by modifying any number of pixels of an original font;
obtaining a binary text stream of the text document from the font types so determined;
parsing the binary text stream to obtain the watermark information in the text document.
In one embodiment, the step of determining the font type of each font in the text document by comparison specifically comprises:
determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by removing any one pixel from an original font.
In another embodiment, the step of determining the font type of each font in the text document by comparison specifically comprises:
determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by adding any one pixel to an original font.
Further, before the binary text stream is parsed, the method also comprises: decrypting the binary text stream.
In that case, the step of parsing the binary text stream to obtain the watermark information specifically comprises: parsing the decrypted binary text stream to obtain the watermark information in the text document.
An embodiment of the present invention also discloses a watermark embedding device, comprising:
a feature font library construction unit, configured to modify any number of pixels of each font in an original font library, the fonts so modified together forming a feature font library;
a watermark generation and conversion unit, configured to set the information contained in the watermark and convert the information into a binary sequence;
a watermark embedding unit, configured to replace, according to the binary sequence, the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
Further, the feature font library construction unit modifies the pixels of each font in the original font library by removing any one pixel from, or adding any one pixel to, each font.
In another embodiment, the watermark embedding device also comprises:
an encryption unit, configured to encrypt the binary sequence;
the watermark embedding unit then replaces, according to the encrypted binary sequence, the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
An embodiment of the present invention also discloses a watermark reading device, comprising:
a scanning unit, configured to scan a text document containing a watermark;
a font comparison unit, configured to determine, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by modifying any number of pixels of an original font;
a binary conversion unit, configured to obtain a binary text stream of the text document from the font types so determined;
a parsing unit, configured to parse the binary text stream to obtain the watermark information in the text document.
In another embodiment, the watermark reading device also comprises:
a decryption unit, configured to decrypt the binary text stream;
the parsing unit then parses the decrypted binary text stream to obtain the watermark information in the text document.
An embodiment of the present invention also provides a watermark processing system, comprising the watermark embedding device and the watermark reading device.
The watermark embedding and reading method, device and system provided by the embodiments of the present invention embed watermark information into a text document by replacing, according to the watermark information, fonts in the text document with fonts whose pixels have been modified. This avoids the conventional method's need to convert the text into a picture and is convenient to use. A watermark embedded in this way in the text document of a web page cannot be recognized by the naked eye, so the user's reading experience is not affected. The watermark information in a text document in which a watermark has been embedded by the above embedding method can be obtained by the watermark reading method. When the embedding and reading methods are applied to a text document in a web page, the embedded watermark information can be the user name of the user browsing the text document; if the text document is leaked and a screenshot of it is obtained, the person responsible for the leak can be identified by reading the watermark information. This overcomes the problem that the leaker cannot be traced, and thereby protects information security and prevents information leakage. In addition, the watermark information can be encrypted before it is embedded in the text document; when the watermark is read, the plaintext of the watermark information cannot be obtained without the decryption algorithm, which improves the confidentiality of the watermark.
Brief description of the drawings
Fig. 1 is a schematic flow chart of one embodiment of the watermark embedding method provided by the present invention;
Fig. 2 is a schematic flow chart of another embodiment of the watermark embedding method provided by the present invention;
Fig. 3 is a schematic flow chart of one embodiment of the watermark reading method provided by the present invention;
Fig. 4 is a schematic flow chart of another embodiment of the watermark reading method provided by the present invention;
Fig. 5 is a schematic diagram of an embodiment of the original font library in the watermark embedding method provided by the present invention;
Fig. 6 is a schematic diagram of an embodiment of the feature font library with a pixel removed in the watermark embedding method provided by the present invention;
Fig. 7 is a schematic diagram of an embodiment of the feature font library with a pixel added in the watermark embedding method provided by the present invention;
Fig. 8 is a schematic diagram of an embodiment of the text document into which a watermark is to be embedded in the watermark embedding method provided by the present invention;
Fig. 9 is a schematic diagram of one embodiment of the text document with an embedded watermark in the watermark embedding method provided by the present invention;
Fig. 10 is a schematic diagram of another embodiment of the text document with an embedded watermark in the watermark embedding method provided by the present invention;
Fig. 11 is a structural block diagram of one embodiment of the watermark embedding device provided by the present invention;
Fig. 12 is a structural block diagram of another embodiment of the watermark embedding device provided by the present invention;
Fig. 13 is a structural block diagram of one embodiment of the watermark reading device provided by the present invention;
Fig. 14 is a structural block diagram of another embodiment of the watermark reading device provided by the present invention;
Fig. 15 is a structural block diagram of one embodiment of the watermark processing system provided by the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
See Fig. 1, which is a schematic flow chart of one embodiment of the watermark embedding method provided by the present invention.
This embodiment of the present invention provides a watermark embedding method comprising steps S101 to S103, as follows:
S101, modifying any number of pixels of each font in an original font library, the fonts so modified together forming a feature font library;
In step S101, there are at least two ways of modifying the pixels. In one embodiment, step S101 specifically comprises: removing any one pixel from each font in the original font library, the fonts so modified together forming the feature font library.
In another embodiment, step S101 specifically comprises: adding any one pixel to each font in the original font library, the fonts so modified together forming the feature font library.
S102, setting the information contained in the watermark, and converting the information into a binary sequence;
Optionally, the information contained in the watermark is the user name of the user browsing the text document.
It should be understood that a person skilled in the art may, without departing from the principle of the present invention, set the information contained in the watermark to the physical address of a computer, a company name or other information, and such variations are also regarded as falling within the scope of protection of the present invention.
S103, according to the binary sequence, replacing the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
The watermark embedding method provided by this embodiment embeds watermark information into a text document by replacing, according to the watermark information, fonts in the text document with fonts whose pixels have been modified. This avoids the conventional method's need to convert the text into a picture and is convenient to use. A watermark embedded in this way in the text document of a web page cannot be recognized by the naked eye, does not affect the user's reading experience, and offers high security.
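Steps S101 to S103 can be sketched end to end as below. The 3x3 bitmaps, the library names and the function `embed_watermark` are hypothetical stand-ins chosen for illustration; a real font library would hold far larger glyphs, and the patent does not prescribe any data structure.

```python
# Hypothetical 3x3 bitmaps standing in for real glyphs (1 = inked pixel).
ORIGINAL_LIBRARY = {
    "a": ((0, 1, 0), (1, 1, 1), (1, 0, 1)),
    "b": ((1, 0, 0), (1, 1, 0), (1, 1, 0)),
}
# S101: the feature library holds the same glyphs with one pixel removed.
FEATURE_LIBRARY = {
    "a": ((0, 1, 0), (1, 1, 1), (1, 0, 0)),
    "b": ((1, 0, 0), (1, 1, 0), (1, 0, 0)),
}


def embed_watermark(text, bits):
    """S103: pick each character's bitmap from the feature library for bit '1'
    and from the original library for bit '0'.

    Assumption: characters beyond the end of the bit sequence stay original.
    """
    if len(bits) > len(text):
        raise ValueError("text has fewer characters than watermark bits")
    glyphs = []
    for index, char in enumerate(text):
        bit = bits[index] if index < len(bits) else "0"
        library = FEATURE_LIBRARY if bit == "1" else ORIGINAL_LIBRARY
        glyphs.append(library[char])
    return glyphs


# S102 is assumed to have produced the binary sequence "0110".
rendered = embed_watermark("abab", "0110")
```

The returned list is what a renderer would draw: the second and third characters come from the feature library, the first and fourth from the original one.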
See Fig. 2, which is a schematic flow chart of another embodiment of the watermark embedding method provided by the present invention.
This embodiment of the present invention provides a watermark embedding method comprising steps S201 to S204, as follows:
S201, modifying any number of pixels of each font in an original font library, the fonts so modified together forming a feature font library;
In step S201, there are at least two ways of modifying the pixels. In one embodiment, step S201 specifically comprises: removing any one pixel from each font in the original font library, the fonts so modified together forming the feature font library.
In another embodiment, step S201 specifically comprises: adding any one pixel to each font in the original font library, the fonts so modified together forming the feature font library.
S202, setting the information contained in the watermark, and converting the information into a binary sequence;
Optionally, the information contained in the watermark is the user name of the user browsing the text document.
It should be understood that a person skilled in the art may, without departing from the principle of the present invention, set the information contained in the watermark to the physical address of a computer, a company name or other information, and such variations are also regarded as falling within the scope of protection of the present invention.
S203, encrypting the binary sequence;
S204, according to the encrypted binary sequence, replacing the corresponding fonts in the text document into which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
The watermark embedding method provided by this embodiment achieves the beneficial effects of the first embodiment of the watermark embedding method. In addition, the watermark information is encrypted before it is embedded in the text document, so that when the watermark is read the plaintext of the watermark information cannot be obtained without the decryption algorithm, which improves the confidentiality of the watermark.
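The extra encryption step S203 can be illustrated as follows. The patent does not name an encryption algorithm; the XOR with a repeating key used here is only a placeholder and is not secure. The names `xor_bits` and `embedding_plan` are likewise hypothetical.

```python
def xor_bits(bits, key):
    """XOR a bit string with a repeating key bit string.

    Placeholder for S203 only: the patent leaves the cipher unspecified,
    and a repeating-key XOR offers no real protection.
    """
    return "".join(
        "1" if bit != key[i % len(key)] else "0" for i, bit in enumerate(bits)
    )


def embedding_plan(text, bits, key):
    """S203 + S204: encrypt the bits, then mark which characters take a feature font."""
    encrypted = xor_bits(bits, key)
    if len(encrypted) > len(text):
        raise ValueError("text has fewer characters than watermark bits")
    return [
        (char, i < len(encrypted) and encrypted[i] == "1")
        for i, char in enumerate(text)
    ]


plan = embedding_plan("abcd", "0110", "1010")
print(plan)
```

With the key "1010" the sequence "0110" becomes "1100", so the first two characters are drawn from the feature font library instead of the middle two.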
See Fig. 3, which is a schematic flow chart of one embodiment of the watermark reading method provided by the present invention.
Correspondingly, this embodiment of the present invention provides a watermark reading method comprising steps S301 to S304, as follows:
S301, scanning a text document containing a watermark, the text document having been obtained by applying the watermark embedding method described above;
S302, determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by modifying any number of pixels of an original font;
In step S302, there are at least two ways in which the pixels may have been modified. In one embodiment, step S302 specifically comprises: determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by removing any one pixel from an original font.
In another embodiment, step S302 specifically comprises: determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by adding any one pixel to an original font.
S303, obtaining a binary text stream of the text document from the font types so determined;
S304, parsing the binary text stream to obtain the watermark information in the text document.
The watermark reading method provided by this embodiment compares the font type of each font in the text document and can therefore read out the watermark information in the text document quickly.
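The comparison in steps S302 and S303 can be sketched as an exact bitmap match. This is a simplified illustration that assumes the characters have already been recognized and that each scanned glyph has been cropped to the same size as the library bitmaps; the libraries and the function `read_bits` are hypothetical.

```python
# Hypothetical 3x3 bitmaps (1 = inked pixel); the feature glyphs differ
# from the originals by exactly one removed pixel.
ORIGINAL_LIBRARY = {
    "a": ((0, 1, 0), (1, 1, 1), (1, 0, 1)),
    "b": ((1, 0, 0), (1, 1, 0), (1, 1, 0)),
}
FEATURE_LIBRARY = {
    "a": ((0, 1, 0), (1, 1, 1), (1, 0, 0)),
    "b": ((1, 0, 0), (1, 1, 0), (1, 0, 0)),
}


def read_bits(text, glyphs):
    """S302/S303: classify each scanned glyph and emit '1' for a feature font,
    '0' for an original font."""
    bits = []
    for char, glyph in zip(text, glyphs):
        if glyph == FEATURE_LIBRARY[char]:
            bits.append("1")
        elif glyph == ORIGINAL_LIBRARY[char]:
            bits.append("0")
        else:
            raise ValueError(f"glyph for {char!r} matches neither library")
    return "".join(bits)


# Glyphs as they might be cropped from a screenshot of the text "abab".
scanned = [
    ORIGINAL_LIBRARY["a"],
    FEATURE_LIBRARY["b"],
    FEATURE_LIBRARY["a"],
    ORIGINAL_LIBRARY["b"],
]
stream = read_bits("abab", scanned)
```

Exact matching is the simplest possible comparison; a screenshot that has been scaled or compressed would need a tolerance-based match, which the patent text does not describe.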
See Fig. 4, which is a schematic flow chart of another embodiment of the watermark reading method provided by the present invention.
Correspondingly, this embodiment of the present invention provides a watermark reading method comprising steps S401 to S405, as follows:
S401, scanning a text document containing a watermark, the text document having been obtained by applying the watermark embedding method described above;
S402, determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by modifying any number of pixels of an original font;
In step S402, there are at least two ways in which the pixels may have been modified. In one embodiment, step S402 specifically comprises: determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by removing any one pixel from an original font.
In another embodiment, step S402 specifically comprises: determining, by comparison, the font type of each font in the text document, the font type being either original font or feature font, where a feature font is a font obtained by adding any one pixel to an original font.
S403, obtaining a binary text stream of the text document from the font types so determined;
S404, decrypting the binary text stream;
S405, parsing the decrypted binary text stream to obtain the watermark information in the text document.
It should be noted that this embodiment is the watermark reading method with a decryption step, and corresponds to the embodiment of the watermark embedding method with an encryption step; the decryption algorithm in this embodiment corresponds to the encryption algorithm in that embodiment of the watermark embedding method.
The watermark reading method provided by this embodiment achieves the beneficial effects of the first embodiment of the watermark reading method, and can also read out the watermark information quickly when the watermark information embedded in the text document is encrypted.
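Steps S404 and S405 can be illustrated as below. Both the cipher and the parsing rule are assumptions: the patent names neither, so a repeating-key XOR (not secure) stands in for decryption and UTF-8 decoding stands in for parsing. All function names are hypothetical.

```python
def xor_bits(bits, key):
    """XOR a bit string with a repeating key; applying it twice restores the input.

    Placeholder cipher only: the patent does not specify the algorithm.
    """
    return "".join(
        "1" if bit != key[i % len(key)] else "0" for i, bit in enumerate(bits)
    )


def bits_to_text(bits):
    """S405 under an assumed rule: group the bits into bytes and decode as UTF-8."""
    if len(bits) % 8 != 0:
        raise ValueError("bit string length must be a multiple of 8")
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")


def read_watermark(stream, key):
    """S404 then S405: decrypt the binary text stream, then parse it."""
    return bits_to_text(xor_bits(stream, key))


# "11101011" is the letter "A" (01000001) XORed with the repeating key "1010".
recovered = read_watermark("11101011", "1010")
```

The reader must use the same key and the same conversion rule as the embedder, which mirrors the correspondence between the encryption and decryption algorithms noted above.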
The watermark embedding and reading methods provided by the embodiments of the present invention are described in detail below with reference to Fig. 5 to Fig. 10.
See Fig. 5, which is a schematic diagram of an embodiment of the original font library in the watermark embedding method provided by the present invention.
This embodiment provides an original font library for the watermark embedding method, comprising two characters, "水" ("water") and "印" ("print");
The typeface in the original font library may be regular script (Kai), Song or Hei, or any other typeface, and the characters in the original font library may be any Chinese characters, English letters, Arabic numerals, punctuation marks and so on.
It should be noted that this embodiment is described with an original font library containing only "水" and "印" as an example; the original font library also contains all other characters, numerals, symbols and so on.
See Fig. 6, which is a schematic diagram of an embodiment of the feature font library with a pixel removed in the watermark embedding method provided by the present invention.
This embodiment provides a pixel-removed feature font library for the watermark embedding method, comprising the two characters "水" and "印". Any one pixel is removed from the original font of each of the two characters to form the pixel-removed feature fonts of "水" and "印", and these feature fonts together form the pixel-removed feature font library.
It should be noted that this embodiment is described with a feature font library containing only "水" and "印" as an example; the feature font library also contains all other characters, numerals, symbols and so on.
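Building a pixel-removed feature font library of the kind shown in Fig. 6 can be sketched as follows. The 4x4 bitmaps, the keys "water" and "print", and the function `remove_one_pixel` are hypothetical illustrations, not the glyph data of the patent.

```python
import random


def remove_one_pixel(glyph, rng):
    """Return a copy of glyph (rows of 0/1) with one randomly chosen inked pixel cleared."""
    inked = [
        (r, c)
        for r, row in enumerate(glyph)
        for c, value in enumerate(row)
        if value == 1
    ]
    if not inked:
        raise ValueError("glyph has no inked pixel to remove")
    r, c = rng.choice(inked)
    modified = [list(row) for row in glyph]
    modified[r][c] = 0
    return modified


# Hypothetical 4x4 bitmaps standing in for the two glyphs of Fig. 5.
ORIGINAL_LIBRARY = {
    "water": [[0, 1, 1, 0], [1, 1, 1, 1], [0, 1, 1, 0], [1, 0, 0, 1]],
    "print": [[1, 1, 0, 1], [1, 0, 0, 1], [1, 1, 0, 1], [0, 0, 0, 1]],
}
rng = random.Random(0)  # fixed seed so the sketch is repeatable
FEATURE_LIBRARY = {
    name: remove_one_pixel(glyph, rng) for name, glyph in ORIGINAL_LIBRARY.items()
}
```

Each feature glyph differs from its original in exactly one position, which is the property the reading method relies on when it compares font types.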
See Fig. 7, which is a schematic diagram of an embodiment of the feature font library with a pixel added in the watermark embedding method provided by the present invention.
This embodiment provides a pixel-added feature font library for the watermark embedding method, comprising the two characters "水" ("water") and "印" ("print"). Any one pixel is added to the original font of each of the two characters to form the pixel-added feature fonts of "水" and "印", and these feature fonts together form the pixel-added feature font library.
It should be noted that this embodiment is described with a feature font library containing only "水" and "印" as an example; the feature font library also contains all other characters, numerals, symbols and so on.
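The pixel-added variant of Fig. 7 can be sketched in the same spirit. Again the bitmaps, the keys "water" and "print", and the function `add_one_pixel` are hypothetical; the patent only says that any one pixel is added, so no constraint on where the new pixel lands is assumed here.

```python
import random


def add_one_pixel(glyph, rng):
    """Return a copy of glyph (rows of 0/1) with one randomly chosen blank pixel inked."""
    blank = [
        (r, c)
        for r, row in enumerate(glyph)
        for c, value in enumerate(row)
        if value == 0
    ]
    if not blank:
        raise ValueError("glyph has no blank pixel to ink")
    r, c = rng.choice(blank)
    modified = [list(row) for row in glyph]
    modified[r][c] = 1
    return modified


# Hypothetical 4x4 bitmaps standing in for the two glyphs of Fig. 5.
ORIGINAL_GLYPHS = {
    "water": [[0, 1, 1, 0], [1, 1, 1, 1], [0, 1, 1, 0], [1, 0, 0, 1]],
    "print": [[1, 1, 0, 1], [1, 0, 0, 1], [1, 1, 0, 1], [0, 0, 0, 1]],
}
rng = random.Random(1)  # fixed seed so the sketch is repeatable
ADDED_PIXEL_GLYPHS = {
    name: add_one_pixel(glyph, rng) for name, glyph in ORIGINAL_GLYPHS.items()
}
```

A practical implementation might restrict the added pixel to positions next to an existing stroke so that it is less noticeable, but that is a design choice outside the patent text.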
See Fig. 8, which is a schematic diagram of an embodiment of the text document into which a watermark is to be embedded in the watermark embedding method provided by the present invention.
This embodiment provides a text document into which watermark information is to be embedded. The text document comprises four characters, "水印水印" ("watermark watermark"), and the fonts in the text document are fonts from the original font library.
It should be noted that this embodiment is described with the text "水印水印" only as an example; the text document may also contain any other characters, numerals or symbols.
See Fig. 9, it is the schematic diagram of an embodiment of the text document of embed watermark in the embedding grammar of watermark provided by the invention.
In order to obtain the text document of the embed watermark shown in Fig. 9, as follows to the idiographic flow of the embedding grammar of the watermark that the embodiment of the present invention provides:
S501, raw font storehouse comprises two words " water " and " print " (see Fig. 5), the raw font of above-mentioned two words is removed respectively a pixel arbitrarily, form the feature font of the removal pixel of " water " and " print ", be made up of the feature fontlib (see Fig. 6) removing pixel the feature font of the removal pixel of " water " and " print ";
S502, the information that setting watermark comprises is the user name " Zhang San " of the user browsing the text document needing embed watermark, and user name " Zhang San " is converted to binary sequence " 0110 " according to certain rule;
It should be noted that, need the number of words of the text document of embed watermark to be no less than digital number that the watermark that need embed text document converts binary sequence to.
It should be noted that, when needs are encrypted watermark information, also comprise after step S502: described binary sequence " 0110 " is encrypted;
S503, the text document of embed watermark is needed to comprise four words, for " watermark watermark " (see Fig. 8), by each word in text document and binary sequence " 0110 " one_to_one corresponding, in text document " watermark watermark ", the word corresponding with the numeral of two in binary sequence 1 is respectively first " print " word and second " water " word, word corresponding with the numeral of two in binary sequence 0 in text document is respectively first " water " word and second " print " word, the font removed in the feature fontlib of pixel is replaced with by with 1 corresponding word digital in binary sequence, do not replace with 0 corresponding word digital in binary sequence, the font in the feature fontlib removing pixel is replaced with by first " print " word in text document and second " water " word, obtain the text document " watermark watermark " (see Fig. 9) of embed watermark.
It should be noted that, when the watermark information needs to be encrypted, placing each character in the text document in one-to-one correspondence with the binary sequence "0110" specifically comprises: placing each character in the text document in one-to-one correspondence with the encrypted binary sequence.
It should be noted that the font replacement scheme is described in this embodiment only by way of the example "a character corresponding to a 1 digit in the binary sequence is replaced with a font from the feature font library, and a character corresponding to a 0 digit is not replaced". The following scheme may also be adopted: "a character corresponding to a 0 digit in the binary sequence is replaced with a font from the feature font library, and a character corresponding to a 1 digit is not replaced", or another font replacement scheme may be used.
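The embedding flow of steps S501 to S503 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the invention: the 3x3 bitmaps and the names ORIGINAL_LIB, FEATURE_LIB, remove_one_pixel and embed are hypothetical, and clearing the first ink pixel found is only one possible choice of "arbitrary" pixel.

```python
# Hypothetical 3x3 bitmaps (1 = ink) standing in for the original fonts of Fig. 5.
ORIGINAL_LIB = {
    "water": [[0, 1, 0], [1, 1, 1], [0, 1, 0]],
    "print": [[1, 1, 0], [1, 0, 1], [1, 1, 0]],
}


def remove_one_pixel(glyph):
    """S501: return a copy of the glyph with one ink pixel cleared.

    The source allows any pixel; clearing the first one found is an assumption.
    """
    copy = [row[:] for row in glyph]
    for row in copy:
        for x, value in enumerate(row):
            if value == 1:
                row[x] = 0
                return copy
    raise ValueError("glyph has no ink pixel to remove")


# Pixel-removed feature font library built from the original library.
FEATURE_LIB = {ch: remove_one_pixel(g) for ch, g in ORIGINAL_LIB.items()}


def embed(characters, bits):
    """S503: use the feature glyph for '1' digits and the original glyph for '0' digits."""
    if len(characters) < len(bits):
        raise ValueError("text is shorter than the binary sequence")
    rendered = []
    for index, ch in enumerate(characters):
        marked = index < len(bits) and bits[index] == "1"
        rendered.append((FEATURE_LIB if marked else ORIGINAL_LIB)[ch])
    return rendered


document = ["water", "print", "water", "print"]
rendered = embed(document, "0110")
```

In this sketch, rendered holds the original glyph for the first and fourth characters and the pixel-removed glyph for the second and third, mirroring the binary sequence "0110".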
To read the watermark information in the watermarked text document shown in Fig. 9, the watermark reading method provided by this embodiment of the present invention proceeds as follows:
S601: Scan the text document "watermark watermark" (see Fig. 9), in which the watermark was embedded by removing font pixels;
S602: Determine by comparison the font type of each character in the text document: the first "water" character and the second "print" character are in the original font, and the second "water" character and the first "print" character are in the pixel-removed feature font.
S603: Map each character in the text document to the binary digit "0" or "1". A character in the original font corresponds to the digit "0", and a character in the pixel-removed feature font corresponds to the digit "1". The first "water" character and the second "print" character therefore correspond to "0", and the second "water" character and the first "print" character correspond to "1", which gives the binary text stream "0110" of the text document.
It should be noted that, when the watermark information needs to be decrypted, the method further comprises, after step S603: decrypting the binary text stream "0110";
It should be noted that the way of deriving the binary text stream from the font type is described in this embodiment only by way of the example "a character in the original font corresponds to the digit 0, and a character in the pixel-removed feature font corresponds to the digit 1". Depending on the font replacement scheme used in the watermark embedding method, the mapping may instead be set to "a character in the original font corresponds to the digit 1, and a character in the pixel-removed feature font corresponds to the digit 0", or another way of deriving the binary text stream may be adopted.
S604: Parse the binary text stream "0110" to obtain the watermark information "Zhang San" in the text document.
It should be noted that, when the watermark information needs to be decrypted, parsing the binary text stream "0110" to obtain the watermark information "Zhang San" in the text document specifically comprises: parsing the decrypted binary text stream to obtain the watermark information "Zhang San" in the text document.
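The reading flow of steps S601 to S604 might be sketched as below. The sketch assumes that scanning has already produced, for each character, its identity and an aligned bitmap; the bitmaps, the names font_type and read_watermark, and the WATERMARKS lookup table (standing in for the unspecified "certain rule") are all hypothetical.

```python
# Hypothetical 3x3 bitmaps (1 = ink); each feature glyph has one pixel removed.
ORIGINAL_LIB = {
    "water": ((0, 1, 0), (1, 1, 1), (0, 1, 0)),
    "print": ((1, 1, 0), (1, 0, 1), (1, 1, 0)),
}
FEATURE_LIB = {
    "water": ((0, 0, 0), (1, 1, 1), (0, 1, 0)),
    "print": ((0, 1, 0), (1, 0, 1), (1, 1, 0)),
}
# Assumed lookup table standing in for the conversion rule, which the source leaves open.
WATERMARKS = {"0110": "Zhang San"}


def font_type(ch, glyph):
    """S602: compare a scanned glyph against both libraries."""
    if glyph == ORIGINAL_LIB[ch]:
        return "original"
    if glyph == FEATURE_LIB[ch]:
        return "feature"
    raise ValueError(f"glyph for {ch!r} matches neither library")


def read_watermark(scanned):
    """S603 and S604: original font -> '0', feature font -> '1', then parse."""
    bits = "".join(
        "1" if font_type(ch, glyph) == "feature" else "0" for ch, glyph in scanned
    )
    return bits, WATERMARKS.get(bits)


scanned = [
    ("water", ORIGINAL_LIB["water"]),
    ("print", FEATURE_LIB["print"]),
    ("water", FEATURE_LIB["water"]),
    ("print", ORIGINAL_LIB["print"]),
]
bits, user_name = read_watermark(scanned)
```

Exact bitmap equality is used here only for brevity; a real scan or screenshot would likely need alignment and some tolerance before comparison.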
See Figure 10, which is a schematic diagram of another embodiment of a watermarked text document in the watermark embedding method provided by the present invention.
To obtain the watermarked text document shown in Figure 10, the watermark embedding method provided by this embodiment of the present invention proceeds as follows:
S701: The original font library contains two characters, "water" and "print" (see Fig. 5). One arbitrary pixel is added to the original font of each of the two characters to form the pixel-added feature fonts of "water" and "print", and these feature fonts together make up the pixel-added feature font library (see Fig. 7);
S702: The information carried by the watermark is set to the user name "Zhang San" of the user browsing the text document in which the watermark is to be embedded, and the user name "Zhang San" is converted into the binary sequence "0110" according to a certain rule;
It should be noted that the number of characters in the text document in which the watermark is to be embedded must be no less than the number of digits in the binary sequence into which the watermark is converted.
It should be noted that, when the watermark information needs to be encrypted, the method further comprises, after step S702: encrypting the binary sequence "0110";
S703: The text document in which the watermark is to be embedded contains four characters, "watermark watermark" (see Fig. 8), that is, "water", "print", "water", "print". Each character in the text document is placed in one-to-one correspondence with a digit of the binary sequence "0110". The characters corresponding to the two 1 digits are the first "print" character and the second "water" character; the characters corresponding to the two 0 digits are the first "water" character and the second "print" character. A character corresponding to a 1 digit is replaced with its font from the pixel-added feature font library, and a character corresponding to a 0 digit is not replaced. The first "print" character and the second "water" character are therefore replaced with fonts from the pixel-added feature font library, yielding the watermarked text document "watermark watermark" (see Figure 10).
It should be noted that, when the watermark information needs to be encrypted, placing each character in the text document in one-to-one correspondence with the binary sequence "0110" specifically comprises: placing each character in the text document in one-to-one correspondence with the encrypted binary sequence.
It should be noted that the font replacement scheme is described in this embodiment only by way of the example "a character corresponding to a 1 digit in the binary sequence is replaced with a font from the feature font library, and a character corresponding to a 0 digit is not replaced". The following scheme may also be adopted: "a character corresponding to a 0 digit in the binary sequence is replaced with a font from the feature font library, and a character corresponding to a 1 digit is not replaced", or another font replacement scheme may be used.
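As a rough illustration of this embodiment, the sketch below builds a pixel-added feature glyph and shows both replacement schemes mentioned above by way of a marked_bit parameter. The bitmap, the function names add_one_pixel and choose_fonts, and the choice of the first blank pixel are assumptions made for the example.

```python
def add_one_pixel(glyph):
    """S701: return a copy of the glyph with one blank pixel set to ink.

    The source allows any pixel; setting the first blank one is an assumption.
    """
    copy = [row[:] for row in glyph]
    for row in copy:
        for x, value in enumerate(row):
            if value == 0:
                row[x] = 1
                return copy
    raise ValueError("glyph has no blank pixel to fill")


def choose_fonts(bits, marked_bit="1"):
    """S703: decide per digit whether the feature font or the original font is used.

    marked_bit="1" is the scheme of the embodiment; marked_bit="0" is the
    alternative scheme in which 0 digits trigger the replacement.
    """
    if marked_bit not in ("0", "1"):
        raise ValueError("marked_bit must be '0' or '1'")
    return ["feature" if bit == marked_bit else "original" for bit in bits]


water = [[0, 1, 0], [1, 1, 1], [0, 1, 0]]  # hypothetical original glyph
water_feature = add_one_pixel(water)

default_scheme = choose_fonts("0110")
inverse_scheme = choose_fonts("0110", marked_bit="0")
```

Whichever scheme is chosen at embedding time has to be mirrored when the watermark is read, otherwise every digit of the recovered stream is inverted.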
To read the watermark information in the watermarked text document shown in Figure 10, the watermark reading method provided by this embodiment of the present invention proceeds as follows:
S801: Scan the text document "watermark watermark" (see Figure 10), in which the watermark was embedded by adding font pixels;
S802: Determine by comparison the font type of each character in the text document: the first "water" character and the second "print" character are in the original font, and the second "water" character and the first "print" character are in the pixel-added feature font.
S803: Map each character in the text document to the binary digit "0" or "1". A character in the original font corresponds to the digit "0", and a character in the pixel-added feature font corresponds to the digit "1". The first "water" character and the second "print" character therefore correspond to "0", and the second "water" character and the first "print" character correspond to "1", which gives the binary text stream "0110" of the text document.
It should be noted that, when the watermark information needs to be decrypted, the method further comprises, after step S803: decrypting the binary text stream "0110";
It should be noted that the way of deriving the binary text stream from the font type is described in this embodiment only by way of the example "a character in the original font corresponds to the digit 0, and a character in the pixel-added feature font corresponds to the digit 1". Depending on the font replacement scheme used in the watermark embedding method, the mapping may instead be set to "a character in the original font corresponds to the digit 1, and a character in the pixel-added feature font corresponds to the digit 0", or another way of deriving the binary text stream may be adopted.
S804: Parse the binary text stream "0110" to obtain the watermark information "Zhang San" in the text document.
It should be noted that, when the watermark information needs to be decrypted, parsing the binary text stream "0110" to obtain the watermark information "Zhang San" in the text document specifically comprises: parsing the decrypted binary text stream to obtain the watermark information "Zhang San" in the text document.
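The source does not say how the comparison in the font-type step is carried out. One possible approach, sketched here purely as an assumption, is to count the pixels that differ between a scanned glyph and the original glyph: zero differences indicates the original font, and exactly one difference indicates a feature font, with the ink count telling a pixel-added glyph from a pixel-removed one. The names differing_pixels, ink and classify are hypothetical.

```python
def differing_pixels(a, b):
    """Count positions where two equally sized bitmaps disagree."""
    return sum(
        pa != pb for row_a, row_b in zip(a, b) for pa, pb in zip(row_a, row_b)
    )


def ink(glyph):
    """Number of ink pixels in a bitmap (1 = ink)."""
    return sum(sum(row) for row in glyph)


def classify(scanned, original):
    """Classify a scanned glyph relative to the original glyph of the same character."""
    diff = differing_pixels(scanned, original)
    if diff == 0:
        return "original"
    if diff == 1:
        return "feature-added" if ink(scanned) > ink(original) else "feature-removed"
    return "unknown"


original = [[0, 1, 0], [1, 1, 1], [0, 1, 0]]  # hypothetical original glyph
added = [[1, 1, 0], [1, 1, 1], [0, 1, 0]]     # one extra ink pixel
removed = [[0, 0, 0], [1, 1, 1], [0, 1, 0]]   # one ink pixel cleared
noisy = [[1, 1, 1], [1, 1, 1], [0, 1, 0]]     # two pixels differ

results = [classify(g, original) for g in (original, added, removed, noisy)]
```

The sketch assumes bitmaps of identical size that are already aligned; scaled or compressed screenshots would need normalisation first, which is outside what the source describes.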
See Figure 11, which is a structural block diagram of one embodiment of the watermark embedding device provided by the present invention. The watermark embedding device comprises:
A feature font library construction unit 101, configured to modify one arbitrary pixel of each font in the original font library, the fonts so modified together forming the feature font library;
Wherein, the feature font library construction unit modifies one arbitrary pixel of each font in the original font library by removing one arbitrary pixel from, or adding one arbitrary pixel to, each font in the original font library.
A watermark generation and conversion unit 102, configured to set the information carried by the watermark and convert the information into a binary sequence;
Preferably, the information carried by the watermark is the user name of the user browsing the text document.
A watermark embedding unit 103, configured to replace, according to the binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
See Figure 12, which is a structural block diagram of another embodiment of the watermark embedding device provided by the present invention. The watermark embedding device comprises:
A feature font library construction unit 201, configured to modify one arbitrary pixel of each font in the original font library, the fonts so modified together forming the feature font library;
Wherein, the feature font library construction unit modifies one arbitrary pixel of each font in the original font library by removing one arbitrary pixel from, or adding one arbitrary pixel to, each font in the original font library.
A watermark generation and conversion unit 202, configured to set the information carried by the watermark and convert the information into a binary sequence;
Preferably, the information carried by the watermark is the user name of the user browsing the text document.
An encryption unit 203, configured to encrypt the binary sequence; and
A watermark embedding unit 204, configured to replace, according to the encrypted binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
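The source does not name an encryption algorithm for the encryption unit. Purely to show where encryption sits between conversion and embedding, the sketch below uses a repeating-key XOR over the binary sequence. This is a stand-in chosen for brevity, not the cipher of the invention and not a secure one; the function name xor_bits and the key "1011" are hypothetical.

```python
from itertools import cycle


def xor_bits(bits, key):
    """XOR a binary string with a repeating binary key.

    Applying the function twice with the same key restores the input, so the
    same routine serves as a placeholder for both encryption and decryption.
    """
    if not key:
        raise ValueError("key must not be empty")
    if set(bits) - {"0", "1"} or set(key) - {"0", "1"}:
        raise ValueError("bits and key must contain only '0' and '1'")
    return "".join(str(int(b) ^ int(k)) for b, k in zip(bits, cycle(key)))


plain_sequence = "0110"   # binary sequence from the conversion unit
key = "1011"              # hypothetical shared key
encrypted_sequence = xor_bits(plain_sequence, key)   # what the embedding unit would receive
recovered_sequence = xor_bits(encrypted_sequence, key)
```

Because the encrypted sequence has the same length as the plain one, the requirement that the text contain at least as many characters as the sequence has digits is unchanged in this sketch.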
See Figure 13, which is a structural block diagram of one embodiment of the watermark reading device provided by the present invention. The watermark reading device comprises:
A scanning unit 301, configured to scan a text document containing a watermark;
A font comparison unit 302, configured to determine by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by modifying one arbitrary pixel of the original font;
Preferably, the feature font library construction unit modifies one arbitrary pixel of each font in the original font library by removing one arbitrary pixel from, or adding one arbitrary pixel to, each font in the original font library.
A binary conversion unit 303, configured to obtain the binary text stream of the text document according to the font type of each font after comparison;
A parsing unit 304, configured to parse the binary text stream to obtain the watermark information in the text document. Preferably, the watermark information obtained is a user name.
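The rule that converts a user name into a binary sequange and back is left open in the source ("a certain rule"). One conceivable rule, shown here only as an assumption, is to use the bits of the UTF-8 encoding of the name; the function names name_to_bits and bits_to_name are hypothetical.

```python
def name_to_bits(name):
    """Conversion side: encode a user name as a string of '0'/'1' digits (UTF-8, 8 bits per byte)."""
    return "".join(format(byte, "08b") for byte in name.encode("utf-8"))


def bits_to_name(bits):
    """Parsing side: rebuild the user name from the binary text stream."""
    if len(bits) % 8 != 0:
        raise ValueError("bit stream length must be a multiple of 8")
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")


stream = name_to_bits("Zhang San")
parsed = bits_to_name(stream)
```

Under this assumed rule "Zhang San" occupies 72 digits rather than the four-digit "0110" of the worked example, so the text document would need at least 72 characters to carry it.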
See Figure 14, which is a structural block diagram of another embodiment of the watermark reading device provided by the present invention. The watermark reading device comprises:
A scanning unit 401, configured to scan a text document containing a watermark;
A font comparison unit 402, configured to determine by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by modifying one arbitrary pixel of the original font;
Preferably, the feature font library construction unit modifies one arbitrary pixel of each font in the original font library by removing one arbitrary pixel from, or adding one arbitrary pixel to, each font in the original font library.
A binary conversion unit 403, configured to obtain the binary text stream of the text document according to the font type of each font after comparison;
A decryption unit 404, configured to decrypt the binary text stream;
A parsing unit 405, configured to parse the decrypted binary text stream to obtain the watermark information in the text document. Preferably, the watermark information obtained is a user name.
See Figure 15, which is a structural block diagram of one embodiment of the watermark processing system provided by the present invention. The watermark processing system comprises a watermark embedding device 151 and a watermark reading device 152. The watermark embedding device 151 is the watermark embedding device of the embodiment of Figure 11 or Figure 12 above, and the watermark reading device 152 is the watermark reading device of the embodiment of Figure 13 or Figure 14 above.
The watermark embedding and reading methods, devices and system provided by the embodiments of the present invention embed watermark information in a text document by replacing, according to the watermark information, fonts in the text document with fonts in which a pixel has been modified. This overcomes the need in conventional methods to convert the text into a picture, and is convenient to use. A watermark embedded in the text document of a web page by this method is not discernible to the naked eye and does not affect the user's reading experience, which overcomes the problem of poor reading experience. The watermark information in a text document can be obtained by the watermark reading method, the text document being one in which a watermark was embedded by the above watermark embedding method. When the watermark embedding and reading methods are applied to a text document in a web page, the embedded watermark information may be the user name of the user browsing the text document. If the text document is leaked, a screenshot of the text document is obtained and the person responsible for the leak can be identified by reading the watermark information. This overcomes the inability to trace the person responsible for a leak, and thereby protects information security and prevents information leakage. In the process of embedding the watermark information in the text document, the watermark information is first encrypted and the encrypted watermark information is then embedded in the text document. When the watermark is read, the plaintext of the watermark information cannot be obtained without the decryption algorithm, which improves the confidentiality of the watermark.
Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments may be implemented by a computer program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, may include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The above are preferred embodiments of the present invention. It should be noted that those of ordinary skill in the art may make improvements and refinements without departing from the principles of the present invention, and such improvements and refinements are also considered to fall within the protection scope of the present invention.

Claims (13)

1. A watermark embedding method, characterized by comprising:
Modifying one arbitrary pixel of each font in an original font library, the fonts so modified together forming a feature font library;
Setting the information carried by the watermark and converting the information into a binary sequence;
Replacing, according to the binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
2. The watermark embedding method according to claim 1, characterized in that modifying one arbitrary pixel of each font in the original font library, the fonts so modified together forming the feature font library, specifically comprises:
Removing one arbitrary pixel from each font in the original font library, the fonts so modified together forming the feature font library; or
Adding one arbitrary pixel to each font in the original font library, the fonts so modified together forming the feature font library.
3. The watermark embedding method according to claim 1, characterized in that, after setting the information carried by the watermark and converting the information into a binary sequence, the method further comprises: encrypting the binary sequence;
Replacing, according to the binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark, specifically comprises: replacing, according to the encrypted binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
4. A watermark reading method, characterized by comprising:
Scanning a text document containing a watermark;
Determining by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by modifying one arbitrary pixel of the original font;
Obtaining the binary text stream of the text document according to the font type of each font after comparison;
Parsing the binary text stream to obtain the watermark information in the text document.
5. The watermark reading method according to claim 4, characterized in that determining by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by modifying one arbitrary pixel of the original font, specifically comprises:
Determining by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by removing one arbitrary pixel from, or adding one arbitrary pixel to, the original font.
6. The watermark reading method according to claim 4, characterized in that, before parsing the binary text stream, the method further comprises: decrypting the binary text stream;
Parsing the binary text stream to obtain the watermark information in the text document specifically comprises: parsing the decrypted binary text stream to obtain the watermark information in the text document.
7. A watermark embedding device, characterized by comprising:
A feature font library construction unit, configured to modify one arbitrary pixel of each font in an original font library, the fonts so modified together forming a feature font library;
A watermark generation and conversion unit, configured to set the information carried by the watermark and convert the information into a binary sequence;
A watermark embedding unit, configured to replace, according to the binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
8. The watermark embedding device according to claim 7, characterized in that the feature font library construction unit modifies one arbitrary pixel of each font in the original font library by removing one arbitrary pixel from, or adding one arbitrary pixel to, each font in the original font library.
9. The watermark embedding device according to claim 7, characterized by further comprising:
An encryption unit, configured to encrypt the binary sequence;
The watermark embedding unit replaces, according to the encrypted binary sequence, the corresponding fonts in the text document in which the watermark is to be embedded with fonts from the feature font library, thereby obtaining a text document containing the watermark.
10. A watermark reading device, characterized by comprising:
A scanning unit, configured to scan a text document containing a watermark;
A font comparison unit, configured to determine by comparison the font type of each font in the text document, the font type being original font or feature font, wherein the feature font is a font obtained by modifying one arbitrary pixel of the original font;
A binary conversion unit, configured to obtain the binary text stream of the text document according to the font type of each font after comparison;
A parsing unit, configured to parse the binary text stream to obtain the watermark information in the text document.
11. The watermark reading device according to claim 10, characterized in that the feature font is a font obtained by removing one arbitrary pixel from, or adding one arbitrary pixel to, the original font.
12. The watermark reading device according to claim 10, characterized by further comprising:
A decryption unit, configured to decrypt the binary text stream;
The parsing unit parses the decrypted binary text stream to obtain the watermark information in the text document.
13. A watermark processing system, characterized by comprising:
The watermark embedding device according to any one of claims 7 to 9; and
The watermark reading device according to any one of claims 10 to 12.
CN201410705665.2A 2014-11-28 2014-11-28 Watermark embedding and reading method, device and system Pending CN104361268A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410705665.2A CN104361268A (en) 2014-11-28 2014-11-28 Watermark embedding and reading method, device and system

Publications (1)

Publication Number Publication Date
CN104361268A true CN104361268A (en) 2015-02-18

Family

ID=52528527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410705665.2A Pending CN104361268A (en) 2014-11-28 2014-11-28 Watermark embedding and reading method, device and system

Country Status (1)

Country Link
CN (1) CN104361268A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010068421A (en) * 2008-09-12 2010-03-25 Mitsubishi Electric Corp Digital watermark apparatus and digital watermark method
CN103500296A (en) * 2013-09-29 2014-01-08 北京溯源鸿业科技有限公司 Inlaying method and device of digital watermarks in text documents
CN103559280A (en) * 2013-11-08 2014-02-05 厦门亿联网络技术股份有限公司 Method for flexibly storing and displaying icons

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108345774A (en) * 2018-01-26 2018-07-31 四川中环法智互联网科技有限公司 A kind of transfer approach and transfer management system of secure content
CN108491697A (en) * 2018-01-26 2018-09-04 四川中环法智互联网科技有限公司 File content is divulged a secret management system and retroactive method of divulging a secret
CN109558378A (en) * 2018-11-28 2019-04-02 泰康保险集团股份有限公司 File management method, device, equipment and storage medium
US11080671B2 (en) 2019-05-20 2021-08-03 Advanced New Technologies Co., Ltd. Identifying copyrighted material using embedded copyright information
US11227351B2 (en) 2019-05-20 2022-01-18 Advanced New Technologies Co., Ltd. Identifying copyrighted material using embedded copyright information
US11409850B2 (en) 2019-05-20 2022-08-09 Advanced New Technologies Co., Ltd. Identifying copyrighted material using embedded copyright information
CN111279338A (en) * 2019-05-20 2020-06-12 阿里巴巴集团控股有限公司 Identifying copyrighted material using embedded copyright information
US11062000B2 (en) 2019-05-20 2021-07-13 Advanced New Technologies Co., Ltd. Identifying copyrighted material using embedded copyright information
US11037469B2 (en) 2019-05-20 2021-06-15 Advanced New Technologies Co., Ltd. Copyright protection based on hidden copyright information
US11042612B2 (en) 2019-05-20 2021-06-22 Advanced New Technologies Co., Ltd. Identifying copyrighted material using embedded copyright information
US11056023B2 (en) 2019-05-20 2021-07-06 Advanced New Technologies Co., Ltd. Copyright protection based on hidden copyright information
CN110619590A (en) * 2019-08-22 2019-12-27 杭州名淘教育科技有限公司 Online education resource recommendation system based on social media
CN111191414A (en) * 2019-11-11 2020-05-22 苏州亿歌网络科技有限公司 Page watermark generation method, identification method, device, equipment and storage medium
CN110955889A (en) * 2019-12-18 2020-04-03 合肥灵蓄信息技术有限公司 Electronic document tracing method based on digital fingerprints
CN112417390A (en) * 2020-12-03 2021-02-26 北京指掌易科技有限公司 File processing method, device, equipment and storage medium
WO2023155712A1 (en) * 2021-11-10 2023-08-24 北京字节跳动网络技术有限公司 Page generation method and apparatus, page display method and apparatus, and electronic device and storage medium
CN116828127A (en) * 2023-08-30 2023-09-29 北京点聚信息技术有限公司 Fingerprint encryption storage method combined with document data
CN116828127B (en) * 2023-08-30 2023-10-27 北京点聚信息技术有限公司 Fingerprint encryption storage method combined with document data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150218