CN1404298A - Image processing apparatus and image processing method and program and storage media - Google Patents

Image processing apparatus and image processing method and program and storage media Download PDF

Info

Publication number
CN1404298A
CN1404298A CN02141996A CN02141996A CN1404298A CN 1404298 A CN1404298 A CN 1404298A CN 02141996 A CN02141996 A CN 02141996A CN 02141996 A CN02141996 A CN 02141996A CN 1404298 A CN1404298 A CN 1404298A
Authority
CN
China
Prior art keywords
character
mentioned
image processing
characters
radicals
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN02141996A
Other languages
Chinese (zh)
Other versions
CN1226860C (en
Inventor
金田北洋
田中哲臣
池田裕章
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2001386137A external-priority patent/JP3848150B2/en
Priority claimed from JP2002226588A external-priority patent/JP3833154B2/en
Application filed by Canon Inc filed Critical Canon Inc
Publication of CN1404298A publication Critical patent/CN1404298A/en
Application granted granted Critical
Publication of CN1226860C publication Critical patent/CN1226860C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Image Processing (AREA)

Abstract

To ensure the accuracy and amount of information embedding above a certain level while suppressing the degradation of the font at a minimum. SOLUTION: The character which appears most frequently is decomposed into radicals and their reference values are obtained. In this case, a reference value is represented by a relative distance among coordinates of four edges of a character image. The relative position of each radical of the second or later character which is selected in the step S412 and corresponds to each specified bit of document access control information to be embedded is changed according to the embedding information in consideration of the reference value.

Description

Image processing apparatus and image processing method and program and medium
Technical field
The present invention relates to the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries carry out the embedding of eletric watermark image processing apparatus, the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is extracted the image processing apparatus of the eletric watermark that embeds and image processing method, program, medium.
Background technology
Form in the device in digital pictures such as printer, photocopiers in recent years, its picture quality significantly improves, and can easily obtain high-quality printed article.Be anyone image processing, can both obtain desired printed article by being undertaken by high performance scanner, printer, photocopier and computer.Therefore, file takes place in regular meeting wrongfully problem such as duplicates, distorts, and in order to prevent or suppress the generation of this type of phenomenon, in recent years access control information is embedded the work very active (eletric watermark) of printed article itself.
As the method that realizes such requirement, generally be to make naked eyes invisibly access control information be embedded in the printed article now, perhaps will embed the blank space of file corresponding to the bitmap graphics of access control information, perhaps the scramble password is added in the document image.Wherein, making naked eyes embed the method for access control information invisibly, generally is to adopt following forms to realize: by the space amount between the control English character string, embed the form of information; By the rotation amount of control character, embed the form of information; By the amplification reduction volume of control character, embed the form of information etc.
Fig. 9 is the space amount between the explanation control English character string, carries out the figure of method of the embedding of information.Here, be called the space with 801~804.In addition, the interval in space 801 is set at p, the interval in space 802 is set at s.Under this state, if the position of the information that embeds is 0, then interval p, the s in space 801,802 changed to (p+s)/2 of (p+s)/2 of p ← (1+p), s ← (1-p),, then change to (p+s)/2 of (p+s)/2 of p ← (1-p), s ← (1+p) if the position of the information that embeds is 1.This can be applicable to space 803,804 equally.
Figure 10 is the rotation amount of explanation control character, carries out the figure of method of the embedding of information.Here, the state before this figure left side expression rotation, postrotational state is represented on this figure right side.The anglec of rotation of 901 expression characters.Identical with method shown in Figure 9, make of the position variation of the angle of its rotation corresponding to the information that embeds.
Figure 11 is the amount that explanation is dwindled by the amplification of control character, embeds the figure of the method for information.The original size of 1001 expressions.Size after 1002 expressions are amplified.Identical with method shown in Figure 9, make of the position variation of the amount of its amplification corresponding to the information that embeds.The situation of dwindling too.
; though embedding the method that access control information loses with seeing, above-mentioned naked eyes help maintaining secrecy; but in the document image that the redundancy of information embedded images is few (being generally the diadic image); and inharmonious sense to character, space can take place, it is very showy that the deterioration of original copy grade becomes.In addition, a little less than the anti-printing of so in general image (to the confining force of paper output back information) also.
The present invention finishes in view of above problem, and purpose is that degradation inhibiting with font in Min., guarantees that simultaneously to a certain degree above information embeds precision and embedded quantity.
Summary of the invention
In order to reach purpose of the present invention, image processing apparatus of the present invention comprises following structure.
That is, a kind of image processing apparatus that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is carried out the embedding of eletric watermark is characterized in that comprising: the draw-out device that extracts character from above-mentioned document image; In the character that selection is extracted by above-mentioned draw-out device, the character radicals by which characters are arranged in traditional Chinese dictionaries get the choice device of the character of predetermined structure; And by according to the file access control information, make the change in location of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of selecting by above-mentioned choice device, above-mentioned file access control information is embedded flush mounting in the above-mentioned character as eletric watermark.
In addition, in order to reach purpose of the present invention, image processing apparatus of the present invention comprises following structure.
That is, a kind of the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is extracted the image processing apparatus of the eletric watermark that is embedded into, it is characterized in that comprising: the character extraction device that from above-mentioned document image, extracts character; In the character that selection is extracted by above-mentioned character extraction device, the character radicals by which characters are arranged in traditional Chinese dictionaries get the choice device of the character of predetermined structure; And the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of selecting according to above-mentioned choice device, extract the bit string that is embedded in this character, according to the bit string that extracts, the eletric watermark draw-out device that above-mentioned eletric watermark is restored as the file access control information.
In addition, in order to reach purpose of the present invention, image processing method of the present invention may further comprise the steps.
That is, a kind of image processing method that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is carried out the embedding of eletric watermark is characterized in that comprising: the extraction step that extracts character from above-mentioned document image; The selection step of the character of be chosen in the character that extracts in the above-mentioned extraction step, predetermined structure being got in the character radicals by which characters are arranged in traditional Chinese dictionaries; And, make the change in location of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of in above-mentioned selection step, selecting according to the file access control information, above-mentioned file access control information is embedded embedding step in the above-mentioned character as eletric watermark.
In addition, in order to reach purpose of the present invention, image processing method of the present invention may further comprise the steps.
That is, a kind of the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is extracted the image processing method of the eletric watermark that is embedded into, it is characterized in that comprising: the character extraction step that from above-mentioned document image, extracts character; The selection step of be chosen in the character that extracts in the above-mentioned character extraction step, the character of predetermined structure being got in the character radicals by which characters are arranged in traditional Chinese dictionaries; And, extract the bit string that is embedded in this character according to the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of in above-mentioned selection step, selecting, and according to the bit string that extracts, the eletric watermark extraction step that above-mentioned eletric watermark is restored as the file access control information.
Description of drawings
Fig. 1 is the block diagram of basic structure of electronic watermark embedded device that document image is carried out the embedding of eletric watermark of expression first form of implementation of the present invention.
Fig. 2 is that expression has embedded the block diagram of the basic structure of the eletric watermark draw-out device that extracts eletric watermark the document image of eletric watermark from utilizing electronic watermark embedded device shown in Figure 1.
Fig. 3 is the flow chart that processor 4 embeds eletric watermark the processing in the document image.
Fig. 4 is processor 24 extracts eletric watermark from 1 document image that embeds eletric watermark a process chart.
Fig. 5 is the detail flowchart of the processing among step S206, the S306.
Fig. 6 is the schematic diagram that the embedding processing of the file access control information of carrying out in step S412 is described.
Fig. 7 is the figure of explanation with the processing in 9 the actual embedding of the information file.
Fig. 8 is the figure that expression has a plurality of Chinese modes of the above radicals by which characters are arranged in traditional Chinese dictionaries of some.
Fig. 9 be explanation by the space amount between the control English character string, carry out the figure of method of the embedding of information.
Figure 10 is the rotation amount of explanation by control character, carries out the figure of method of the embedding of information.
Figure 11 is the amplification reduction volume of explanation by control character, carries out the figure of method of the embedding of information.
Figure 12 is the figure of expression embedding information example.
Figure 13 is that the schematic diagram of handling according to the embedding of the file access control information of the fiducial value of trying to achieve with the first form of implementation diverse ways is used in explanation.
Embodiment
Below, with reference to accompanying drawing, describe the present invention in detail according to preferred implementing form.
[first form of implementation]
Fig. 1 is the block diagram of basic structure of electronic watermark embedded device that document image is carried out the embedding of eletric watermark of this form of implementation of expression.
Among this figure, the 2nd, input embed eletric watermark obj ect file by scanner, camera, or the input part that constitutes such as file reader unit, the 4th, the processor that carries out various processing, the 6th, to the keyboard of processor 4 input commands, the 8th, preserve embedding information or the dish of the document image that reads in, the 10th, temporarily store data etc. in order in processor 4, to carry out various processing, or the memory of the document image that reads in by input part 2 of storage, the 12nd, show the order input that processor 4 is carried out and the display of treatment state, the 14th, output embedded access control information document image by printer, or internet, the efferent that network interfaces such as LAN constitute.
On the other hand, Fig. 2 is that expression has embedded the basic block diagram that extracts the eletric watermark draw-out device of eletric watermark the document image of eletric watermark from utilizing electronic watermark embedded device shown in Figure 1.
In the figure, the 22nd, the input embedded eletric watermark file by scanner, camera, or file reader unit, the input part that network interface etc. constitute, the 24th, the processor that carries out various processing, the 26th, to the keyboard of processor 24 input commands, the 28th, preserve the document image that reads in, or the dish of the original document of the file that reads in retrieval usefulness, the 30th, temporarily store data etc. in order in processor 24, to carry out various processing, or the memory of the document image that reads in by input part 22 of storage, the 32nd, show the order input that processor 24 is carried out and the display of treatment state, 34,36 is respectively the network interface that the utilization file access control information of reading in is used, printer.
In addition, in this form of implementation, though electronic watermark embedded device and eletric watermark draw-out device are used as independent device separately, but be not limited thereto, also these can be installed (electronic watermark embedded device, eletric watermark draw-out device) and use as the eletric watermark Embedded Division in the device, eletric watermark extracting part.
Below, illustrate that eletric watermark embeds the rough flow process of handling.At first,, obtain the electronic document image that is embedded into, in memory 10, launch from input part 2 according to order from keyboard 6 inputs.From keyboard 6 or coil 8 input embedding information (file access control information), this information is embedded in the memory 10 in the document image that launches again by processor 4.The document image that has embedded predetermined file access control information is exported as the file that embeds eletric watermark from efferent 14.
Below, the rough flow process that extracts the processing of eletric watermark from the file that embeds eletric watermark of efferent 14 outputs is described.At first,, import the file that has embedded eletric watermark, in memory 30, launch by input part 22 according to order from keyboard 26 inputs.Secondly by the document image of processor 24, read the file access control information of embedding, the processing of being scheduled to according to its indication from memory 30, launching.So-called predetermined processing, be for example to find under the wrongful situation about reading, to outside circular, to inner disk 28 or the outside is carried out the retrieval of original document, perhaps printout attribute information etc. uses network I/F34, printer 36 in order to carry out these processing.
Below, describe processor 4 in detail with the processing method in document image of eletric watermark embedding.The flow process of this processing has been shown among Fig. 3.
In step S200, read in file from input part 2, give memory 10 as the electronic image transfer of data.In addition in this step, pre-treatments such as the direction of the file that reads in, tilt correction.In step S202, the document image that launches in memory 10 in step S200 is carried out zone identification, characters in images piece (text) is all extracted.This work can application examples such as the Japanese Patent Application Publication spy open the piece selection technology of putting down in writing in the flat 6-068301 communique and wait and realize.In step S204, the character to comprising in the alphabet piece that extracts in step S202 carries out character recognition, generates the character code as character identification result.
In step S206, in the character that from the character block that among step S202, extracts, comprises, extract the object character that embeds the file access control information.The object character of supposing extraction is the character of the word size that is predetermined.The back will explain the processing method in this step.In step S208, the file access control information that embeds in the character that input is extracted in step S206.Here, the called file access control information for example is a copy limit information, distort the information of preventing, original document management information etc.
In step S210, the file access control information that will import in step S208 is embedded in the character that extracts among the step S206.With the processing that describes in detail in the back in this step.In step S212, output has embedded the document image of file access control information in step S210.
Below, describe processor 24 extracts eletric watermark from a document image that has embedded eletric watermark processing method in detail.This handling process is shown among Fig. 4.
In step S300, be taken into the file that embeds eletric watermark from input part 22, give memory 30 as the electronic image data delivery.Processing in this step is identical with step S200, also comprises the pre-treatment such as direction, tilt correction of the file that reads in.
In step S302, the document image that embeds eletric watermark that launches in memory 30 in step S300 is carried out zone identification, the character block in the document image is all extracted.Processing in similarly carry out this step with the processing among the step S202.In step S304, the alphabet piece to extracting in step S302 carries out character recognition.Processing in similarly carry out this step with the processing among the step S204.
In step S306, in the character that from the character block that among step S302, extracts, comprises, only extract the character that embeds the file access control information.The back will explain the processing method in this step.In step S308, from the character that among step S306, extracts, read the file access control information.The back will explain the processing method in this step.
In step S310, according to the file access control information of in step S308, reading, carry out expectant control, for example duplicate and forbid processing, document retrieval processing etc.
Fig. 5 is the detail flowchart of the processing among step S206~S210 and the S306.In step S400, will flow to the character extraction working storage in the memory 10 based on the character code of character identification result.In step S402, judge whether that the alphabet sign indicating number that will comprise in the file has flowed to the character extraction working storage.Carrying under the situation about being all over, transfer among the step S404 and handle, under situation about not being all over, transfer among the step S400 and handle.
In step S404, use to flow to the character code of character extraction, to the predefined character of each character count with working storage.Here, what is called preestablishes, and is to preestablish the employing Chinese character of complicated radicals by which characters are arranged in traditional Chinese dictionaries structure in a way, for example has the word size " the formation radicals by which characters are arranged in traditional Chinese dictionaries are the Chinese character more than 3 " of 10 points such.In other words, in step S404, counting is fed to the number of character extraction with character code identical with the character code of predefined character in the character code of working storage.By carrying out such setting, can not embed the above information of some boldly and reliably.The back will describe this point in detail.
In step S406,, classify by the character that counting is counted in step S404.In step S408, judge whether counts reaches to a certain degree more than, promptly whether the character that occurrence frequency is high in the file reaches more than certain number of times.This is the embedding precision in order to ensure eletric watermark, gets the above character as object of certain number of times, and same information is embedded in the same character and the measure of taking repeatedly.This also is the extraction precision in order to ensure eletric watermark in addition.Here more than said certain number of times, though the many more precision of number of times are high more, even for example also can for twice.The back will describe this point in detail.
Here, be judged as under the situation that does not have certain above object number of characters, be judged as and embed/extract predetermined amount of information, handling and transfer to step S414, otherwise step S410 is transferred in processing.
In step S410, the character of occurrence frequency maximum in the select File in embedding/extraction object character is calculated and is used to embed/fiducial value of extraction operation.The back will describe this fiducial value in detail.
In step S412, except the character of having obtained fiducial value, from sorting result among step S406, obtaining above-mentioned occurrence frequency is more than second later characters, the embedding/extraction operation of carrying out the file access control information.The back will illustrate concrete method.
Step S414, be judged as that in step S408 embedding/extraction object character is few, under the situation that can not embed/extract, the step of the processing of being scheduled to.So-called predetermined processing is the processing that shows the warning that for example can not embed/extract etc. on display 12 or 32.
Fig. 6 is the schematic diagram that the embedding processing of the file access control information of carrying out in step S412 is described, Fig. 7 illustrates the figure that in fact 9 information is embedded the processing in the file.Fig. 6 is the figure of method that explanation is asked the method for fiducial value and embedded 3 information (8 kinds of information) respectively.In the following description, though the file access control information is decided to be 9 information, be not limited to this.In addition, in Fig. 6,7, illustrate that the number of radicals by which characters are arranged in traditional Chinese dictionaries (character radicals by which characters are arranged in traditional Chinese dictionaries) is for example 3, and for example " type " is such, little radicals by which characters are arranged in traditional Chinese dictionaries have two on top, and big radicals by which characters are arranged in traditional Chinese dictionaries have the pattern of one character in the bottom.In addition, describe figure to such an extent that more or less exaggerate for explanation.
At first, the character picture (being the image of Chinese character " type " among Fig. 6) that will extract from character block in step S206 resolves into each radicals by which characters are arranged in traditional Chinese dictionaries, asks its fiducial value.Method as character being resolved into each radicals by which characters are arranged in traditional Chinese dictionaries is not particularly limited, so adopt general disclosed method to get final product.So-called fiducial value is a most important value when embedding in the file with the invisible form of naked eyes the file access control information.Here said fiducial value as defining among Fig. 6, is sat up straight target relative distance K, P, M, N with four of character picture and is represented.
Concrete information embedding method considers to use four fiducial value K, P, M, the N that defined just now here, embeds 3 information at each character.In step S410, obtain fiducial value K, P the reliability maximum, that be the highest character of occurrence frequency, M, N (being equivalent to the 3rd step among Fig. 7).Corresponding, the file access control information that preparation should embed (9), by per 3,, carry out shown in Figure 6 certain and handle (processing of the relative position of each radicals by which characters are arranged in traditional Chinese dictionaries of change character) (being equivalent to the 4th step among Fig. 7) the more than second later characters of in step S412, selecting.Specifically, when embedding per 3 information in more than the second later characters, for example in the character with initial 3 embeddings (classification results) more than second.In the character with next 3 embeddings more than the 3rd, in the character with 3 last embeddings more than the 4th., be not limited to this order, also can carry out on the contrary, that is, for example in the character with 3 initial embeddings more than the 4th.In the character with next 3 embeddings more than the 3rd, in the character with 3 last embeddings more than second.
Generally speaking, the information (embedding information) that per 3 information has been embedded in which many character deposits in the memory 10.The example of the information of this embedding has been shown among Figure 12.In the figure, embedding information 1201 is deposited in the memory 10, and embedding information 1201 is made of the classified order that has embedded 3 initial character, the classified order that has embedded second 3 character, the information of classified order that has embedded the 3rd 3 character.
In the extraction of eletric watermark is handled,, during from per 3 information of each character extraction, can specificly which kind of rearrange these per 3 information in proper order according to original 9 file access control information is restored by information with reference to this embedding.The back will explain the extraction of eletric watermark and handle.
When embedding per 3 information in each character, as mentioned above, making the change in location of the radicals by which characters are arranged in traditional Chinese dictionaries of character, the pattern of this variation according to the information that embeds is a certain corresponding in the changing pattern of each information shown in Figure 6 (000,001,010,011,100,101,110,111).
In addition, from above explanation as can be known,, can keep the embedding/extraction precision of eletric watermark and the balance that embeds figure place by adjusting the number (minimum occurrence frequency) of object character.
K ' among Fig. 6, P ', M ', N ' are the relative distance of four ends after the change in location.
In order to prevent the deterioration of character, make maximum radicals by which characters are arranged in traditional Chinese dictionaries, be that the radicals by which characters are arranged in traditional Chinese dictionaries of lower end do not change in the case.Order according to above explanation can embed information arbitrarily.
On the other hand, same as the above-mentioned method when extracting eletric watermark, obtain fiducial value, be that the relative position of each radicals by which characters are arranged in traditional Chinese dictionaries of the character more than second below is compared with fiducial value with occurrence frequency, extract an arrangement that embeds in each character.In addition at this moment as mentioned above owing to embed information stores in memory 10, so with reference to this embedding information, the embedding of specific extraction the character arranged of position be which character, recover original file access control information.
By above explanation as can be known, adopt the image processing apparatus and the image processing method of this form of implementation, on the basis of carrying out zone identification, character recognition, use the relative position of each complicated character radicals by which characters are arranged in traditional Chinese dictionaries in the radicals by which characters are arranged in traditional Chinese dictionaries structure to change dexterously, can guarantee that simultaneously to a certain degree above information embeds precision, quantity (can use according to the classification of occurrence frequency and be controlled) with the degradation inhibiting of font in Min..In addition, when extracting eletric watermark, also can realize the eletric watermark that the noise resistance performance is high.In addition, on principle, there is not the dependence of word size fully, so, obviously be a kind of effective method even for the few original copy of character quantity yet.
[second form of implementation]
In first form of implementation, the radicals by which characters are arranged in traditional Chinese dictionaries structure that adopts as embedded object is a single pattern shown in Figure 6, but is not limited thereto, and as shown in Figure 8, also can set a plurality of Chinese modes with the above radicals by which characters are arranged in traditional Chinese dictionaries of some simultaneously.In the case, can adopt the method for using in first form of implementation, further increase the embedding amount of information character with various radicals by which characters are arranged in traditional Chinese dictionaries structures.
[the 3rd form of implementation]
In first form of implementation, each character has embedded 3, but is not limited thereto, if the figure place in the possible combination of the Move Mode of radicals by which characters are arranged in traditional Chinese dictionaries then can freely be set.But, embed figure place if increase, then the deformation extent of character increases.
[the 4th form of implementation]
In first form of implementation, watermark information has been embedded in the Chinese character, but be not to be defined in this, if the character that constitutes by a plurality of structural elements (character radicals by which characters are arranged in traditional Chinese dictionaries), for example then can fully similarly embed in Korea S's literal, the Thailand's literal etc.
[the 5th form of implementation]
Use fiducial value for example shown in Figure 6, under the situation in " 000 " bit string embedding Chinese character " type ", make N '=N, but under the medium situation that has an interference effect of the superimposed document image that has embedded bit string of noise, the processing of from the Chinese character " type " that has embedded this bit string, extracting bit string " 000 " difficulty that will become.This be because, even in extract handling, ask N ', be not N '=N strictly sometimes, its result often can not extract bit string " 000 ".
Therefore, in extracting processing, can have certain width ground and change by comparing N ' and N.If promptly satisfy | N '-N |<ε then is judged as N '=N.This processing also can be applicable to other fiducial values, if other is for example satisfied | M '-M |<ε then is judged as M '=M.
[the 6th form of implementation]
In first form of implementation, the object character as embedding the file access control information has adopted the character that identical word size is arranged.This is because by the file access control information is embedded in each object character, makes the amount of movement of mobile radicals by which characters are arranged in traditional Chinese dictionaries certain, and radicals by which characters are arranged in traditional Chinese dictionaries move the back makes character between each object character the roughly certain cause of balance.
, even the word size of each object character is inequality, if but each word size is pre-determined the amount of movement of mobile radicals by which characters are arranged in traditional Chinese dictionaries, also can move the back and between each object character, make the balance of character roughly certain at radicals by which characters are arranged in traditional Chinese dictionaries.In addition, also can when embedding, ask the amount of movement of the radicals by which characters are arranged in traditional Chinese dictionaries of each word size.In the case, for example suppose in the character of 10 points that the amount of movement of radicals by which characters are arranged in traditional Chinese dictionaries is c, then under the situation of 12 points, can obtain amount of movement by calculating (c * (12 character sizes)/(10 character sizes)).
[the 7th form of implementation]
In first form of implementation, as the character of asking fiducial value to use, adopted the character of occurrence frequency maximum, but be not limited to this, also can be in each radicals by which characters are arranged in traditional Chinese dictionaries pattern for example, in advance according to making of stroke number, radicals by which characters are arranged in traditional Chinese dictionaries etc., even be divided into visually also inconspicuous group of mobile radicals by which characters are arranged in traditional Chinese dictionaries and showy group, set as embedded object character, benchmark character respectively, with the character in a plurality of benchmark character group that in fact occur as the benchmark character.The fiducial value of this situation can be passed through, if for example by mobile radicals by which characters are arranged in traditional Chinese dictionaries then each character that comprises in the visually showy group asks the mean value of fiducial value to obtain.
[the 8th form of implementation]
In first form of implementation, the character as asking fiducial value to use has adopted the character of occurrence frequency maximum, but has been not to be defined in this, even for example adopt the initial character of file or character block also passable.In this case, when initial character and predetermined radicals by which characters are arranged in traditional Chinese dictionaries pattern are inconsistent, control the feasible for example character late that adopts.
[the 9th form of implementation]
In first form of implementation, the character as asking fiducial value to use has adopted the character of occurrence frequency maximum, but has been not to be defined in this, for example also can be with occurrence frequency more than second with down to the inferior character of pre-determined bit as the benchmark character.The fiducial value of this situation, the mean value of fiducial value that can be by asking each benchmark character obtains.
[the tenth form of implementation]
In first form of implementation, shown in the definition among Fig. 6, sit up straight target relative distance K, P, M, N with four of character picture and represent fiducial value K, P, M, N.But be not limited to this, figure 13 illustrates other examples of fiducial value.
Figure 13 is that the figure according to the embedding handling principle of the file access control information of the fiducial value of trying to achieve with the first form of implementation diverse ways is used in explanation.Fiducial value in this form of implementation with the decomposition of character picture each radicals by which characters are arranged in traditional Chinese dictionaries width, highly the ratio of absolute altitude, width is represented.If further specify, that is exactly said here each radicals by which characters are arranged in traditional Chinese dictionaries, and top has only two radicals by which characters are arranged in traditional Chinese dictionaries, does not define its magnitude proportion about the radicals by which characters are arranged in traditional Chinese dictionaries of bottom.This is that then relative fiducial value can change because if define with whole radicals by which characters are arranged in traditional Chinese dictionaries, and in addition, the distortion of word is also big, and it is very showy that deterioration becomes, so surplus so next topmost radicals by which characters are arranged in traditional Chinese dictionaries, (in the case) only uses other two radicals by which characters are arranged in traditional Chinese dictionaries.
Then, similarly use fiducial value K, P, M, the N that tries to achieve like this, carry out the embedding of file access control information and handle with first form of implementation.
[other forms of implementation]
In addition, the present invention is not only limited to the device and method of realizing that above-mentioned form of implementation is used, the procedure code of realizing the software that above-mentioned form of implementation is used is supplied with said system or the interior computer (CPU or MPU) of device, the computer of said system or device is according to this procedure code, make above-mentioned each device work, realize that the situation of above-mentioned form of implementation is included in the category of the present invention.
In addition in this case, itself will realize the function of above-mentioned form of implementation the procedure code of above-mentioned software, this procedure code itself and with this procedure code supply with device that computer uses, the medium of specifically storing the said procedure sign indicating number is also contained in the category of the present invention.
As the medium of such store program codes, for example can use floppy (registered trade mark) dish, hard disk, CD, photomagneto disk, CD-ROM, tape, Nonvolatile memory card, ROM etc.
In addition, be not only the aforementioned calculation machine according to the procedure code of supplying with, control various devices, realize the situation of the function of above-mentioned form of implementation, and the OS (operating system) that in computer, works with the said procedure sign indicating number or with co-operation such as other application software, realize the relevant procedure code of situation of above-mentioned form of implementation, be also contained in the category of the present invention.
In addition, after the procedure code of this supply is stored in the function expansion card of computer or connects in the memory that has on computers the functional expansion unit, indication according to this procedure code, the CPU that has in this function expansion card or the function memory cell etc. carries out part or all of actual treatment, handles the situation that realizes above-mentioned form of implementation by this and is also contained in the category of the present invention.
As mentioned above, adopt the present invention, can guarantee that to a certain degree above information embeds precision, embeds quantity with the degradation inhibiting of font in Min..

Claims (28)

1. image processing apparatus is the image processing apparatus that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is carried out the embedding of eletric watermark, it is characterized in that comprising:
From above-mentioned document image, extract the draw-out device of character;
In the character that selection is extracted by above-mentioned draw-out device, the character radicals by which characters are arranged in traditional Chinese dictionaries get the choice device of the character of predetermined structure; And
By according to the file access control information, make the change in location of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of selecting by above-mentioned choice device, above-mentioned file access control information is embedded flush mounting in the above-mentioned character as eletric watermark.
2. image processing apparatus according to claim 1 is characterized in that:
Above-mentioned draw-out device also comprises
From above-mentioned document image, extract the character block draw-out device of character block; And
The character that contains in the character block that is extracted by above-mentioned character block draw-out device is carried out character recognition, generate character code, from above-mentioned character block, extract the character recognition device of the image of above-mentioned character as recognition result.
3. image processing apparatus according to claim 1 and 2 is characterized in that:
The counting device of the number of the character that above-mentioned choice device comprises in the character that each character count is extracted by above-mentioned draw-out device, predetermined structure got in the character radicals by which characters are arranged in traditional Chinese dictionaries,
Under the number that the counts of being undertaken by above-mentioned counting device reaches the character more than the predetermined counts was situation more than the some, above-mentioned flush mounting embedded above-mentioned file access control information in the character of being selected by above-mentioned choice device.
4. image processing apparatus according to claim 3 is characterized in that:
Above-mentioned counting device uses this character code to count the number that the character of predetermined structure got in character radicals by which characters are arranged in traditional Chinese dictionaries to each character.
5. according to claim 3 or 4 described image processing apparatus, it is characterized in that:
Under the little situation of the predetermined counts of the counts ratio that is undertaken by above-mentioned counting device, on predetermined display unit, show the warning of the impossible embedding of eletric watermark.
6. according to any described image processing apparatus in the claim 1 to 5, it is characterized in that:
Above-mentioned flush mounting also comprises
Use is calculated the calculation element of fiducial value by the character that determines according to occurrence frequency in the character of above-mentioned choice device selection; And
According to the said reference value, make in the character of selecting by above-mentioned choice device, changeable device beyond the selecteed character, that change corresponding to the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of the information of the pre-determined bit of each above-mentioned file access control information in order to calculate fiducial value.
7. according to any described image processing apparatus in the claim 1 to 5, it is characterized in that:
Above-mentioned flush mounting also comprises
Utilization is calculated the calculation element of fiducial value by predetermined character in the character of above-mentioned choice device selection; And
According to the said reference value, make in the character of selecting by above-mentioned choice device, changeable device beyond the selecteed character, that change corresponding to the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of the information of the pre-determined bit of each above-mentioned file access control information in order to calculate fiducial value.
8. according to claim 6 or 7 described image processing apparatus, it is characterized in that:
Target relative distance sat up straight in four of aforementioned calculation device calculating selecteed character in order to calculate fiducial value, as the said reference value.
9. according to claim 6 or 7 described image processing apparatus, it is characterized in that:
The aforementioned calculation device calculates the width of each radicals by which characters are arranged in traditional Chinese dictionaries of selecteed character in order to calculate fiducial value, highly to the width of this character, the ratio of height, as the said reference value.
10. according to any described image processing apparatus in the claim 6 to 9, it is characterized in that:
Above-mentioned changeable device also generates expression the information of each pre-determined bit of above-mentioned file access control information is embedded information in which character.
11. an image processing apparatus is the image processing apparatus that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is extracted the eletric watermark that is embedded into, and it is characterized in that comprising:
From above-mentioned document image, extract the character extraction device of character;
In the character that selection is extracted by above-mentioned character extraction device, the character radicals by which characters are arranged in traditional Chinese dictionaries get the choice device of the character of predetermined structure; And
The bit string that is embedded in this character is extracted in the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of selecting according to above-mentioned choice device, according to the bit string that extracts, and the eletric watermark draw-out device that above-mentioned eletric watermark is restored as the file access control information.
12. image processing apparatus according to claim 11 is characterized in that:
Above-mentioned character extraction device also comprises
From above-mentioned document image, extract the character block draw-out device of character block; And
The character that contains in the character block that is extracted by above-mentioned character block draw-out device is carried out character recognition, generate character code, from above-mentioned character block, extract the character recognition device of the image of above-mentioned character as recognition result.
13., it is characterized in that according to claim 11 or 12 described image processing apparatus:
The counting device that above-mentioned choice device comprises in the character that each character count is extracted by above-mentioned character extraction device, the character number of predetermined structure got in the character radicals by which characters are arranged in traditional Chinese dictionaries,
Under the number that the counts of being undertaken by above-mentioned counting device reaches the character more than the predetermined counts was situation more than the some, above-mentioned character extraction device extracted above-mentioned file access control information from the character of being selected by above-mentioned choice device.
14. image processing apparatus according to claim 13 is characterized in that:
Above-mentioned counting device uses these character code counting character radicals by which characters are arranged in traditional Chinese dictionaries to get the character number of predetermined structure to each character.
15., it is characterized in that according to claim 13 or 14 described image processing apparatus:
Under the little situation of the predetermined counts of the counts ratio that is undertaken by above-mentioned counting device, on predetermined display unit, show the warning of the impossible extraction of eletric watermark.
16. image processing apparatus according to claim 11 is characterized in that:
Above-mentioned eletric watermark draw-out device restores above-mentioned file access control information by with reference to expression the information of each pre-determined bit of above-mentioned file access control information having been embedded information in which character.
17. any described image processing apparatus according in the claim 11 to 16 is characterized in that:
Above-mentioned eletric watermark draw-out device also comprises
Utilization is calculated the calculation element of fiducial value by the character that determines according to occurrence frequency in the character of above-mentioned choice device selection; And
According to the position and the said reference value of character radicals by which characters are arranged in traditional Chinese dictionaries beyond the selecteed character in the character of selecting by above-mentioned choice device, in order to calculate fiducial value, character, the specific device of the bit string that embeds in specific this character.
18. any described image processing apparatus according in the claim 11 to 16 is characterized in that:
Above-mentioned eletric watermark draw-out device also comprises
Utilization is calculated the calculation element of fiducial value by predetermined character in the character of above-mentioned choice device selection; And
According to the position and the said reference value of character radicals by which characters are arranged in traditional Chinese dictionaries beyond the selecteed character in the character of selecting by above-mentioned choice device, in order to calculate fiducial value, character, the specific device of the bit string that embeds in specific this character,
According to by the specific bit string of above-mentioned specific device, above-mentioned file access control information is restored.
19., it is characterized in that according to claim 17 or 18 described image processing apparatus:
Target relative distance sat up straight in four of aforementioned calculation device calculating selecteed character in order to calculate fiducial value, as the said reference value.
20., it is characterized in that according to claim 17 or 18 described image processing apparatus:
The aforementioned calculation device calculates the width of each radicals by which characters are arranged in traditional Chinese dictionaries of selecteed character in order to calculate fiducial value, highly to the width of this character, the ratio of height, as the said reference value.
21. any described image processing apparatus according in the claim 1 to 20 is characterized in that:
Above-mentioned file access control information comprises copy limit information, distorts the information of preventing, original document management information.
22. any described image processing apparatus according in the claim 1 to 21 is characterized in that:
The character that comprises in the above-mentioned document image comprises Chinese character, Korea S's literal, Thailand's literal.
23. any described image processing apparatus according in the claim 1 to 22 is characterized in that:
Above-mentioned character radicals by which characters are arranged in traditional Chinese dictionaries comprise the radicals by which characters are arranged in traditional Chinese dictionaries of Chinese character.
24. an image processing method is the image processing method that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is carried out the embedding of eletric watermark, it is characterized in that comprising:
From above-mentioned document image, extract the extraction step of character;
The selection step of the character of be chosen in the character that extracts in the above-mentioned extraction step, predetermined structure being got in the character radicals by which characters are arranged in traditional Chinese dictionaries; And
According to the file access control information, make the change in location of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of in above-mentioned selection step, selecting, above-mentioned file access control information is embedded embedding step in the above-mentioned character as eletric watermark.
25. an image processing method is the image processing method that the document image that comprises the character that a character is made of a plurality of character radicals by which characters are arranged in traditional Chinese dictionaries is extracted the eletric watermark that embeds, and it is characterized in that comprising:
From above-mentioned document image, extract the character extraction step of character;
The selection step of be chosen in the character that extracts in the above-mentioned character extraction step, the character of predetermined structure being got in the character radicals by which characters are arranged in traditional Chinese dictionaries; And
According to the position of the character radicals by which characters are arranged in traditional Chinese dictionaries of the character of in above-mentioned selection step, selecting, extract the bit string that is embedded in this character, according to the bit string that extracts, the eletric watermark extraction step that above-mentioned eletric watermark is restored as the file access control information.
26. a program is characterized in that: make computer have function as any described image processing apparatus in the claim 1 to 23.
27. a program is characterized in that: this program is to be used to make the computer enforcement of rights to require the program of the image processing method described in 24 or 25.
28. a recording medium is characterized in that: the program described in the storage claim 26 or 27.
CNB021419965A 2001-09-03 2002-09-02 Image processing apparatus and image processing method and program and storage media Expired - Fee Related CN1226860C (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
JP2001266436 2001-09-03
JP266436/2001 2001-09-03
JP2001386137A JP3848150B2 (en) 2001-12-19 2001-12-19 Image processing apparatus and method
JP386137/2001 2001-12-19
JP2002226588A JP3833154B2 (en) 2001-09-03 2002-08-02 Image processing apparatus, image processing method, program, and storage medium
JP226588/2002 2002-08-02

Publications (2)

Publication Number Publication Date
CN1404298A true CN1404298A (en) 2003-03-19
CN1226860C CN1226860C (en) 2005-11-09

Family

ID=27347433

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021419965A Expired - Fee Related CN1226860C (en) 2001-09-03 2002-09-02 Image processing apparatus and image processing method and program and storage media

Country Status (2)

Country Link
KR (1) KR100485554B1 (en)
CN (1) CN1226860C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006042460A1 (en) * 2004-10-18 2006-04-27 Dong Liu Hidden data communication method and the application thereof in text digital watermark technology
CN1326383C (en) * 2004-06-30 2007-07-11 佳能株式会社 Image processing apparatus, image processing method, computer program and computer readable storage medium
US7613317B2 (en) 2004-06-30 2009-11-03 Canon Kabushinki Kaisha Image processing apparatus, image processing method, computer program and computer readable storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3573009B2 (en) * 1999-08-11 2004-10-06 日本電気株式会社 Digital watermark insertion system, digital watermark characteristic table generation system, and digital watermark characteristic parameter table generation system
JP3643509B2 (en) * 1999-09-30 2005-04-27 株式会社東芝 Digital watermark embedding method and apparatus, and digital watermark detection method and apparatus
KR20010070865A (en) * 2001-06-14 2001-07-27 최종욱 Apparatus for preventing duplication and forgery/alternation of document and authenticating it

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1326383C (en) * 2004-06-30 2007-07-11 佳能株式会社 Image processing apparatus, image processing method, computer program and computer readable storage medium
US7613317B2 (en) 2004-06-30 2009-11-03 Canon Kabushinki Kaisha Image processing apparatus, image processing method, computer program and computer readable storage medium
WO2006042460A1 (en) * 2004-10-18 2006-04-27 Dong Liu Hidden data communication method and the application thereof in text digital watermark technology
CN1684115B (en) * 2004-10-18 2011-03-23 刘�东 Text digital water printing technology based on character topoloical structure

Also Published As

Publication number Publication date
KR20030020250A (en) 2003-03-08
CN1226860C (en) 2005-11-09
KR100485554B1 (en) 2005-04-27

Similar Documents

Publication Publication Date Title
CN1195280C (en) Method and system for inserting information into piles
Amano et al. A feature calibration method for watermarking of document images
US7532738B2 (en) Print medium quality adjustment system, inspection watermark medium output device for outputting watermark medium to undergo inspection, watermark quality inspection device, adjusted watermark medium output device, print medium quality adjustment method and inspection watermark medium to undergo inspection
US7411702B2 (en) Method, apparatus, and computer program product for embedding digital watermark, and method, apparatus, and computer program product for extracting digital watermark
JP4758461B2 (en) Text direction determination method and system in digital image, control program, and recording medium
US20090021793A1 (en) Image processing device, image processing method, program for executing image processing method, and storage medium for storing program
CN1542656A (en) Information processing apparatus, method, storage medium and program
JP2003230001A (en) Apparatus for embedding electronic watermark to document, apparatus for extracting electronic watermark from document, and control method therefor
US8391607B2 (en) Image processor and computer readable medium
CN1719865A (en) Image processing system and image processing method
JP2009003937A (en) Method and system for identifying text orientation in digital image, control program and recording medium
JP2008035491A (en) Image processing apparatus, image processing method, and image processing program
CN1704990A (en) Information embedding device, information detecting device, information embedding and detecting system, information embedding method, information detecting method, information embedding program, infor
CN1226860C (en) Image processing apparatus and image processing method and program and storage media
CN1945622A (en) Digital water mark embedding and extracting method and device
JP3980983B2 (en) Watermark information embedding method, watermark information detecting method, watermark information embedding device, and watermark information detecting device
JP5598120B2 (en) Image processing device
EP2529331B1 (en) Parallel test payload
CN1771513A (en) Method of detecting watermarks
Langley et al. Google Books: Making the public domain universally accessible
CN1497525A (en) Technique for setting printing width of outline character
CN1084010C (en) Word generating device
JP4179977B2 (en) Stamp processing apparatus, electronic approval system, program, and recording medium
Davarzani et al. Farsi text watermarking based on character coding
Simske Low-resolution photo/drawing classification: metrics, method and archiving optimization

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20051109

Termination date: 20140902

EXPY Termination of patent right or utility model