CN101593247A - Utilize the literal body characteristics to carry the text digital water mark technology of watermark information - Google Patents

Utilize the literal body characteristics to carry the text digital water mark technology of watermark information Download PDF

Info

Publication number
CN101593247A
CN101593247A CNA2008100284741A CN200810028474A CN101593247A CN 101593247 A CN101593247 A CN 101593247A CN A2008100284741 A CNA2008100284741 A CN A2008100284741A CN 200810028474 A CN200810028474 A CN 200810028474A CN 101593247 A CN101593247 A CN 101593247A
Authority
CN
China
Prior art keywords
literal
chinese character
piracy
watermark information
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008100284741A
Other languages
Chinese (zh)
Inventor
朱烽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA2008100284741A priority Critical patent/CN101593247A/en
Publication of CN101593247A publication Critical patent/CN101593247A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Image Processing (AREA)
  • Editing Of Facsimile Originals (AREA)

Abstract

A kind of text digital water mark technology that utilizes the literal body characteristics to carry watermark information.The technical problem to be solved in the present invention is: under the prerequisite that not too influences reading habit; protected content is closely connected with anti-piracy content is in the same place; the bootlegger is difficult on the image of electronic publication cuts apart both; preserve the coded message in the electronic publication; thereby the user of electronic publication is revealed in identification, reaches to stop piracy to be spread unchecked.Specifically be to represent 1 and 0, the sequence number that can discern the user is encoded to the article random site with the variation of body by several bodies that utilize Chinese character.

Description

Utilize the literal body characteristics to carry the text digital water mark technology of watermark information
Technical field
The invention belongs to command, control, communications, and information engineering field, the coding that is specifically related to data with raise the price, digital watermark technology, focus on and solve the anti-pirate technical matters of domestic network novel VIP chapters and sections
Background technology
Along with popularizing of the network life, the business model of electronic publishing obtains businessman and user's favor, it particularly is the rise of the domestic novel website of representative with the starting point, the reader can subscribe to fictitious single piece (being the VIP chapters and sections) with cheap price by network, need not put in order local the purchase, and the author can obtain payment for an article or book written by the network writing, and the novel website can obtain to be divided into, and this is one three business model of winning.
But some little websites utilize the VIP account number of having registered, and see that the VIP chapters and sections that are over get off sectional drawing afterwards, are put into own website and go, and obtain flow and sell advertisement acquisition income.It is the regular large-scale novel website of representative that this way equals to colonize in the starting point, has encouraged pirate general mood, has a strong impact on the operation of regular large-scale novel website.
Traditional digital watermark technology and theoretical copy right piracy in the face of these a kind of reality, protection seems and is pale and weak.
The guard method of two big classes is arranged substantially: the one, the novel article that protect is made into picture, utilize the image digital watermark technology, encrypt and in image, add as shading and hide Info, the 2nd, before being made into picture, protected article utilizes the text digital watermark.Represent article " research of two-value text digital watermark technology and emulation " (the system emulation journal, author: Wang Huiqin, Li Renhou) and " based on the design and the realization of the digital watermarking algorithm of content of text " (computer engineering and design, author: after relaxing, Yang Chao, He Wei, Du Juan).Network novel website is that comprehensive this two big class protection way realizes anti-pirate technology basically now.
By the anti-piracy technique of network novel VIP chapters and sections before analyzing and relevant digital watermark technology, find between protected content and the anti-piracy content it is " separating ", promptly removed anti-piracy content, remaining is exactly protected content.As long as on the image of network novel VIP chapters and sections, find the distinguishing condition of two kinds of contents, get rid of anti-piracy content, the bootlegger can equally with legal user read works.
Be embodied as two aspects " the separating " between protected content and the anti-piracy content:
First aspect is more directly perceived, be exactly protected content images and anti-piracy encoded content image be separately.Such as in VIP chapters and sections image, adding visual coding shading, perhaps add color-code combined spot that naked eyes can not discern etc.Because protection content---the document literal is fairly simple, the color of image of document literal and background can be thought bianry image, anti-piracy encoded content color of image essence is the 3rd value, it can near or equal background color, but definitely can not, perhaps can not large tracts of land near and equal the document text color, not so document can become not readable, so because have distance between document text color and the encoded content color of image, so just there are separately both technological means;
Second more abstract point in aspect, be exactly the reading of protected content images and the reading of anti-piracy encoded content image be separately.The readers ' reading novel is abstracted into a naive model: " seeing the shape of literal-identification literal meaning ", the readers ' reading novel simply says to be exactly to repeat the before model, and the look like formation that links up of the literal of identification is imagined.The reading of the protected content images of saying previously is exactly " shape-identification literal meaning of seeing literal " this model.Some are arranged in the conventional digital digital watermark is to utilize some and preceding surface model irrelevant document unit such as the topological structure of paragraph spacing, word space, literal or space usually to encode, and the reading model of the encoded content image that simple abstract is anti-piracy is " display image → recognition coding information of seeing non-legible shape ".Because there be " distance " in two models, so also just there are separately both technological means.
Except top two big class protection ways; also have some little means, adopted recently at inessential paragraph, unshowy paragraph as starting point and added significant words and expressions, can be included into second class protection way on this is general; but this way is easy to be discovered, and is not permanent way.Be based on the text digital water mark technology of content in addition in addition, the solution that does not but also have ripe content-based embed digital watermark, have only some reduction procedures, as utilize the synonym or the phonetically similar word of Chinese character, carry information shortcoming little and that penetrated easily but exist.
Summary of the invention
The technical problem to be solved in the present invention is: under the prerequisite that not too influences reading habit; preserve the coded message in the network novel VIP chapters and sections; when after protected network novel VIP chapters and sections are by sectional drawing, illegally sharing; can from pirate picture, read coded message; thereby the user of electronic publication is revealed in identification, reaches to stop piracy to be spread unchecked.
Deficiency at former technology; the present invention proposes a solution; under the prerequisite that not too influences reading habit, protected content is closely connected with anti-piracy content is in the same place, the bootlegger is difficult on the image of network novel VIP chapters and sections both are cut apart.This solution is utilizes the literal body characteristics to carry the text digital water mark technology of watermark information.
Because the present invention focuses on that to solve network novel VIP chapters and sections anti-pirate, particularly domestic technical matters adds to the present invention relates to the literal body, so following characteristic according to Chinese character designs and illustrate solution.
Ultimate principle is by suitably selecting several bodies of literal, and the body characteristics of literal is encoded, and utilizes the body of literal to change and carries digital watermark information.
The body of Chinese character comprises font and two aspects of font.
The body of Chinese character changes very abundant.Different fonts such as person in servitude, pattern, row, grass can be write in same Chinese character, has the block letter and the branch block letter of handwritten form that different font sizes are arranged with a kind of font, and the handwritten form style varies with each individual again.In addition, though Chinese character is a Chinese characters, the printing and write in different-styles such as long body, flat body, italic are arranged again.As seen, the body of Chinese character has level of freedom.
The body of Chinese character is an aspect of Chinese character, be the demonstration aspect of Chinese character specifically, and fictitious literal (Chinese character) will convey to the meaning aspect that the reader is a Chinese character, the readers ' reading process simply says to be exactly by seeing the demonstration of Chinese character, receive the meaning of Chinese character, get up continuously, thereby form the whole meaning.
Give the specific meaning different modes that Chinese character shows, the display mode combination of several Chinese characters can be represented a respective user account's sequence number, so just encoded sequence number in the article with the body variation of Chinese character, again article is become the picture form, so protected content images is the same with anti-piracy encoded content image, all is the display mode of Chinese character; And the reading of the reading of protected content images and anti-piracy encoded content image also is the same; the object that both read all is the demonstration aspect of Chinese character; different is the former obtains this Chinese character from the demonstration aspect of Chinese character the meaning, and the latter is the display mode combination acquisition coded message from Chinese character.
The bootlegger is not before having image recognition to go out the article literal, and the body that cannot eliminate Chinese character changes, and also equals to remove coding, so just means the bootlegger when scattering out article image, also will expose the user account of oneself.
The article image self that literal body coding forms has certain interference to OCR (character image recognition technology), Fig. 1 of accompanying drawing is the fontlib document received font body coding that carries with WORD, and the instrument Document Imaging program OCR that carries with Office discerns the literal accuracy less than 70%.
Degree problem as for the influence reading, can see very intuitively from Fig. 1 (demonstration document) of accompanying drawing, do not having much affect aspect the reading property with the article behind the literal body coding, this depends on Chinese character is the plane literal, Chinese character by one or above radical with two-dimensional approach (the Europe family of languages is the one dimension literal) in specific space, be configured in the square block and form.This video that Chinese character produces focuses on and allows the reader do the image impression, has cultivated like this and has used the crowd of Chinese character to possess stronger vivid feeling ability.So in article, the variation of adopting Chinese character form is for the crowd who uses Chinese character, very influence is not read.
Compare with the similar techniques scheme.
" based on the text digital water mark technology of character topology ", (small-sized microcomputer system, author and inventor: Liu Dong) patent applied for, the patent No.: 200410040853.4.(utilizing the font style characteristic of character to carry the text digital water mark technology of watermark information)
Though above-mentioned patent, the author has used " font " this noun, but the meaning of his essence is meant the topological structure of character, and (i.e. " based on the text digital water mark technology of character topology ") just replaces " font " with character topology in the paper in his later stage.
And the claim 1 in the claim literary composition of patent is mentioned: " a kind of will with carry digital watermark information will be with the method that is designed to multiple font with character (string); it is characterized in that: the disconnected relation change the topological structure of character (string) by the company that changes between each stroke of forming character (string), thereby obtain multiple character (string) profile of semantically identical same character (string).”;
And in the expository writing of patent, mention: " ultimate principle of the present invention is become to form disconnected relation of company between each stroke of character (string), design semantically identical same character (string) multiple character (string) profile,,, ".
Two places can find out that patentee's invention is based on the text digital water mark technology of character topology.
This technical solution is different with the present invention to be: the former utilizes the topological structure coding of literal, the body coding of The latter literal; Illustrate as the front; after the former technical solution is implemented; the reading of the reading of protected content images and anti-piracy encoded content image separates; after the latter's technical solution was implemented, the reading of the reading of protected content images and anti-piracy encoded content image was consistent
Description of drawings
Fig. 1 is for realizing the demonstration document sectional drawing of literal body coding;
Fig. 2 is literal body coding process flow diagram
Embodiment
2 to 3 kinds of bodies with Chinese character represent 1 and 0, the realization result that shows of accompanying drawing 1 for example, be exactly to represent 0 with the Song typeface, roman, No. four words of Chinese character, the Song typeface, italic, little No. three words with Chinese character represent 1, " one " word is more special, represent 0 literal body as before, but represent 1 literal body, with Chinese row pattern, roman, young waiter in a wineshop or an inn's word of Chinese character.
All Chinese characters of the article that will protect are used the body form of 1 and 0 correspondence in the body coding at random, form at random and upset, advance article for serial number codes and do protection.
Several 1 and 0 form a sequence number, are sequence number that unit is divided into some groups with some figure places, readjust continuous some Chinese characters on the article random site according to the body form of 1 and 0 correspondence of one group of some figure place, realize serial number codes in article.The realization result that accompanying drawing 1 is showed is exactly to be unit with the four figures, is divided into 5 groups, and 20 figure places altogether can be discerned the user of 2 20 powers.Readjust continuous four Chinese characters on the article random site according to each body form of organizing 1 and 0 correspondence of four figures, every group of code is respectively at different local the codings three times of article, formation redundancy protecting.
When electronic publication the time, can read sequence number by difference, thereby determine which buyer illegally reveals this electronic publication from article ad-hoc location body by pirate (refering in particular to the image screenshotss).
The realization result that accompanying drawing 1 is showed is one piece sequence number " D439A " embedded the part of article with the mode of literal body coding, below form be the corresponding relation of encoding.
Cycle tests number D439A The sequence number grouping Corresponding 2 carry system codes The corresponding literal body of encoding
D First group 1101 Tiltedly positively biased
4 Second group 0100 Positively biased just
3 The 3rd group 0011 Positively biased oblique
9 The 4th group 1001 Tiltedly positively biased
A The 5th group 1010 Tiltedly just positively biased

Claims (1)

1, a kind of body by literal is encoded and is carried the method for digital watermark information, it is characterized in that: by suitably selecting several bodies of literal, and the body characteristics of literal is encoded, utilize the body of literal to change and carry digital watermark information.
CNA2008100284741A 2008-06-01 2008-06-01 Utilize the literal body characteristics to carry the text digital water mark technology of watermark information Pending CN101593247A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008100284741A CN101593247A (en) 2008-06-01 2008-06-01 Utilize the literal body characteristics to carry the text digital water mark technology of watermark information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008100284741A CN101593247A (en) 2008-06-01 2008-06-01 Utilize the literal body characteristics to carry the text digital water mark technology of watermark information

Publications (1)

Publication Number Publication Date
CN101593247A true CN101593247A (en) 2009-12-02

Family

ID=41407900

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008100284741A Pending CN101593247A (en) 2008-06-01 2008-06-01 Utilize the literal body characteristics to carry the text digital water mark technology of watermark information

Country Status (1)

Country Link
CN (1) CN101593247A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102142073A (en) * 2010-12-27 2011-08-03 成都网安科技发展有限公司 System for preventing and identifying disclosure of paper documents based on hidden watermarks
CN103678957A (en) * 2012-09-12 2014-03-26 上海聚力传媒技术有限公司 Method, device and equipment for generating picture information and obtaining identity coded information
CN104980617A (en) * 2014-04-14 2015-10-14 华为技术有限公司 Anti-photographing auditing method and device
CN105139334A (en) * 2015-10-10 2015-12-09 上海中信信息发展股份有限公司 Multiline text watermark production device
CN108090329A (en) * 2018-01-17 2018-05-29 上海海笛数字出版科技有限公司 A kind of method and device that digital watermarking encipherment protection is carried out to content of text
CN108711131A (en) * 2018-04-28 2018-10-26 北京溯斐科技有限公司 Water mark method based on Image Feature Matching and device
CN110968847A (en) * 2019-11-27 2020-04-07 北京北信源软件股份有限公司 File watermark hiding and analyzing method, device, equipment and storage medium

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102142073A (en) * 2010-12-27 2011-08-03 成都网安科技发展有限公司 System for preventing and identifying disclosure of paper documents based on hidden watermarks
CN103678957A (en) * 2012-09-12 2014-03-26 上海聚力传媒技术有限公司 Method, device and equipment for generating picture information and obtaining identity coded information
CN104980617A (en) * 2014-04-14 2015-10-14 华为技术有限公司 Anti-photographing auditing method and device
CN104980617B (en) * 2014-04-14 2018-05-04 华为技术有限公司 Anti- take pictures auditing method and device
CN105139334A (en) * 2015-10-10 2015-12-09 上海中信信息发展股份有限公司 Multiline text watermark production device
CN105139334B (en) * 2015-10-10 2018-02-06 上海中信信息发展股份有限公司 The preparation method of multline text watermark
CN108090329A (en) * 2018-01-17 2018-05-29 上海海笛数字出版科技有限公司 A kind of method and device that digital watermarking encipherment protection is carried out to content of text
CN108090329B (en) * 2018-01-17 2022-02-22 上海海笛数字出版科技有限公司 Method and device for carrying out digital watermark encryption protection on text content
CN108711131A (en) * 2018-04-28 2018-10-26 北京溯斐科技有限公司 Water mark method based on Image Feature Matching and device
CN108711131B (en) * 2018-04-28 2022-08-16 北京数科网维技术有限责任公司 Watermark method and device based on image feature matching
CN110968847A (en) * 2019-11-27 2020-04-07 北京北信源软件股份有限公司 File watermark hiding and analyzing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN101593247A (en) Utilize the literal body characteristics to carry the text digital water mark technology of watermark information
CN106529633B (en) Generation method, coding/decoding method and the device of two dimensional code
Lee et al. A new approach to covert communication via PDF files
AU2013204619B2 (en) Methods, apparatus and articles of manufacture to encode auxilary data into text data and methods, apparatus, and articles of manufacture to obtain encoded data from text data
CN102096787B (en) Method and device for hiding information based on word2007 text segmentation
CN102968654A (en) Method and system for producing information recognizable by naked eyes in plane of two-dimensional (2D) code and 2D code
CN105706107A (en) Two dimensional barcode and method of authentication of such barcode
HUP0304080A2 (en) Method of invisibly embedding and hiding data into soft-copy text documents
CN104428778B (en) Method for being labelled to digital book
US9934457B2 (en) Method of securing a two-dimensional barcode
CN105205674A (en) Product anti-counterfeiting method based on two-dimensional code
CN102490512B (en) Paper book with functions of electronic book
CN102027526A (en) Method and system for embedding covert data in a text document using space encoding
Alginahi et al. An enhanced Kashida-based watermarking approach for Arabic text-documents
CN110322386A (en) A kind of insertion of digital text watermarking and detection method and device
Singh et al. A survey on text based steganography
Taleby Ahvanooey et al. An innovative technique for web text watermarking (AITW)
CN102855423A (en) Tracking method and device of literary works
CN109754037A (en) A kind of one yard of commodity counterfeit prevention of an object and traceability system based on digital watermarking two dimensional code
Thabit et al. A comparative analysis of Arabic text steganography
CN104050400A (en) Webpage link protection method based on control character coding and steganography
Mandal et al. A new approach of text Steganography based on mathematical model of number system
WO2005001675A3 (en) Algorithmic generation of afu calligraphy
CN110097488A (en) The generation of stealthy digital watermarking and extracting method and device
CN110689360A (en) Agricultural product two-dimensional code anti-counterfeiting inspection method based on watermark library

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20091202