AU650876B2 - Document marking system - Google Patents
Document marking system Download PDFInfo
- Publication number
- AU650876B2 AU650876B2 AU77282/91A AU7728291A AU650876B2 AU 650876 B2 AU650876 B2 AU 650876B2 AU 77282/91 A AU77282/91 A AU 77282/91A AU 7728291 A AU7728291 A AU 7728291A AU 650876 B2 AU650876 B2 AU 650876B2
- Authority
- AU
- Australia
- Prior art keywords
- document
- marking
- text
- spacing
- code
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Landscapes
- Record Information Processing For Printing (AREA)
Description
0191-1 1 650876--tF) PATENTS ACT 1952 COMPLETE SPECIFICATION
(ORIGINAL)
FOR OFFICE USE 0* 0 a 0 0* .0 0t *0 0 0 0 00 0 *0* 0* 0 .0#
S
t 0 t
C
Ct t C L C CC C C Short Title: Int. CI: Application Number: Lodged: Complete Specif icati on- lodged: Accepted: Lapsed: Published: Prio r ity: Related Art: Name of Applicant: Address of Applicant: Peter Samuel Vogel 28 Adeline Street, Faulconbridge, NSW 2776 Actual Inventor: Peter Samuel Vogel Add ress for Service: -2-8-Ade-l-i-ne-t-r-eetFau--conb-ri; az -NSW-2-7--6
RA
4 Complete Specification for the invention entitled: DOCUMENT MARKING SYSTEM The following statement is a full description of this invention, including the best method of performing it known to me.
wppoa300191 -2- This application is an application for a Patent of Addition to Australian Patent Application number 31811/89.
TECHNICAL FIELD The present invention relates to methods of, and apparatus for, generating documents which, as well as presenting the usual alphanumeric text information, contain a distinctive marking. More particularly, the marking scheme of the invention can convey identifying information which, in the event of a document coming into the possession of an unauthorised person. allows a particular copy of a S S document to be traced to its source.
BACKGROUND ART A common problem faced by authors of confidential documents is the difficulty of preventing unauthorised persons from gaining access to them. A related problem is breach of copyright of printed texts. These problems have become particularly acute with the commonplace availability of photocopiers as it is a simple matter for a person legitimately possessing a certain document to copy it and pass it on to other, perhaps unauthorised, persons. To date, preventicn of such transgressions has been mainly by physical restriction of availability, fo. example by denying access to all but a trustworthy few, and relying on punitive measures to prevent breach of confidence. As a further deterrent confidential documents are sometimes stamped with the words "SECRET" or "CONFIDENTIAL". Such techniques have little effect and efforts have been made to mark documents in such a way that each copy bears a distinctive feature so that in the event of "leaking" the source of the leak can be identified. One such technique involves subtly changing the text of each copy distributed, for example by introducing spelling mistakes, and recording the details of changes made in the case of each recipient. This technique is cumbersome, requires undesirable alteration of the text, and relies on recovery of the whole or a large part of the text to be effective as a means of identifying the source. Other marking schemes utilise a pattern of special markings which must cover the whole page area if the possibility of erasure is to be avoided.
IIIIIICL
wppoa300191 Some document marking schemes rely on subtly variations of character formation, for example by changing the typeface of selected characters.
One such system, described in Patent Cooperation treaty International Publication WO 85/05713 of Millet, used small variations in the formation of characters produced by a dot-matrix printer to uniquely mark documents. For example, Millet's system can selectively omit dots of the usual matrix to create deliberately mis-formed characters. This scheme is only effective in a limited range of applications, since the subtle variations are easily lost if the document is blurred by photocopying, particularly if multiple generations of copy are made.
An improved document marking scheme has been published under the Patent Cooperation treaty International Publication number WO 1 89/07819 of Vogel. This publication discloses a scheme which marks documents by varying the spacing between words. According to this scheme the pattern of space variations conveys marking information which is unique to each copy of a document printed and can be used to trace the source of a particular copy in the case of a leak.
Although the scheme of WO 89/07819 is useful, the improved marking scheme of the present invention is responsive to the content of a document so that the system of marking is optimised according to the format of the text. For example, where text is in tabular format, spacing between lines is varied for marking rather than spacing between words, thereby maintaining columnar alignment.
One shortcoming of prior-art document marking schemes is that they do not solve the problem of marking unmarked documents copied using a photocopier. For example, there is a strong need for a system which will deter people from copying books in libraries, an act which could infringe copyright.
The present invention is directed to providing an improved document marking scheme which uniquely identifies each of a potentially large number of copies of a document. A further novel and useful feature of the invention is the ability to convey within the marking textual information, such as the name of the intended recipient of each copy.
The invention can also be used to encode long passages of text within a document in a way that is not readily visible but can be decoded to l .r wppoa300191 -4reveal the original encoded text. Furthermore the marking scheme of the invention does not significantly detract from the visual quality of the document, and the marking is maintained in spite of photocopying, enlarging or reducing. Another feature of the present invention is that it can be effectively used to mark text of a wide variety of formats, including tabular text.
The invention can also be practised in the form of a photocopier which uniquely marks copies produced, so that, for example, if a person copies a book in a library, the name of the person making the copy is marked into the copy made. The invention can be realised by a suitable arrangement of conventional digital data processing components. Other 1advantages and objects of this invention will be more fully appreciated from the description of the invention which follows.
DISCLOSURE OF INVENTION According to one aspect of the invention there is provided an improved document marking method comprising the steps of inputting a document, inputting a marking code to be applied to the document, and altering the spacing between groups of text elements according to a transform of the marking code. The document and the marking code can be input by scanning a printed document, or by entering from a keyboard, or by receiving data representative of a document in electronic form. The invention is further adapted so that the classes of text elements between which the spacing is varied is selected according to which type of variation is most applicable to the format of text appearing at the particular point in the document to which the marking is being applied. The type of variation being used for marking can be controlled by directives within the document or it can be selected automatically according to the formatting of text in the document. For example, the invention can be arranged so that spacing between words is used to convey the marking except where columns of text appear in the document, in which case spacing between characters or spacing between lines is used, so that the columnar alignment of text is not adversely affected.
wppoa30019 According to another aspect of the invention the marking code is generated automatically and the step of entering a marking code is not required. In this case, the marking code can be generated as an ascending number sequence, a pseudo-random number sequence, or other function. Alternatively, the marking code can be automatically derived from information stored within the data processor, including data representing the user's name, user's password, date, time or document name.
According to yet another aspect of the invention there is provided a method of generating multiple uniquely-marked documents, employing the steps of the previous paragraphs repeatedly, using different o 0 identifying data each time.
S 15 The invention also consists in a method of decoding the marking Soencoded into a document using the above methods, comprising the steps of examining the spacing between printed elements of a document, classifying said spaces as large or small, and matching the resulting sequence of large and small spaces with predetermined sequences. Such predetermined sequences can include sequences representing alpha- *numeric characters, a sequence number, or a pseudo-random sequence.
to According to a further aspect of the invention there is provided improved document marking means comprising means for inputting a document and a marking code to be applied to the document, and spacealtering means for altering the spacing between groups of text elements according to a transform of the marking code. The space-altering means e is adapted so that the classes of text elements between which the spacing is varied is selected according to which type of variation is most applicable to the format of text appearing at the particular point in the document to which the marking is being applied.
According to another aspect of the invention, further means are provided for automatically generating the marking code and means for entering a marking code are not required. In this case, the marking code can be generated as an ascending number sequence, a pseudorandom number sequence, or other function. Alternatively, means are provided for deriving the marking code artomatically from information
_I
1.~ wppoa300191 -6stored within the data processor, including data representing the user's name, user's password, date, time or document name.
According to yet another aspect of the invention there is provided means for generating multiple uniquely-marked documents, employing the document marking methods of the invention repeatedly, using different marking data each time.
Another useful extension of the invention provides a photocopier which performs a marking function, comprising document scanner means, marking code input means, processing means for grouping text elements of the scanned document into classes and marking the document by varying the spacing between text elements according to the t marking code, and means for printing the marked document. The t 15 photocopier can further comprise locking means adapted to enable i copying only if a marking code has been entered. The locking means i can be further adapted to accept only marking codes which pass a i predetermined test for validity.
The invention also consists in means for decoding the marking encoded Ii into a document using the above methods, comprising means for examining the spacing between printed elements of a document, classifying said spaces as large or small, and matching the resulting sequence of large and small spaces with predetermincd sequences.
BRIEF DESCRIPTION OF THE DRAWINGS The foregoing and other objects, features, extensions, and advantages of the invention will be apparent from the following description of some exemplary embodiments of the invention, as illustrated in the accompanying drawings in which: Fig. 1 is a schematic representation of a multiple document generating scheme according to the invention; Fig. 2 is an illustration of an original piece of text, including a paragraph of text and a table, before marking; wppoa300191 7- Fig. 3 is an illustration of the text of Fig. 2 after marking using interword spacing and inter-line spacing; Fig. 4 is an illustration of the text of Fig. 2 after marking using interword spacing and inter-character spacing; Fig. 5 is a flow diagram of a generalised system used for creating multiple uniquely-marked copies of a document; Fig. 6 is a flow diagram of a routine suitable for the input file marking function of the flow diagram of Fig. 5 in the case of inter-word space marking; Fig. 7 is a program listing of a routine for modifying inter-word spacings without changing line length; Fig. 8 is a flow diagram of a routine suitable for the input file marking function of the flow diagram of Fig. 5 in the case of inter-line space marking; Fig. 9 is a flow diagram of a routine suitable for the input file marking function of the flow diagram of Fig. '5 in the case of inter-character space marking; S 25 Fig. 10 is a flow diagram of document marking system utilising encryption of the marking string; Fig. 11 is a schematic illustration of an embodiment of the invention adapted to encode a sub-text into the marking of a document; and Fig. 12 is a schematic illustratioi- of an anebodiment of the invention adapted to mark documents as part of a photocopying process.
I
wppoa300191 -8" SMODES FOR CARRYING OUT THE INVENTION
;:II
,f t f ta t r 5 A text processing system for carrying out the invention will now be described, by way of example only, in which a piogrammed digital computer is used to perform word-processing functions as well as the document marking function of the invention. This embodiment is useful for producing multiple copies of a document, each carrying a distinctive marking within the formatting of the text. For convenience, in this embodiment the marking codes correspond to alpha-numeric strings, such as persons' names, although this is not an essential characteristic of the invention. Referring to Fig. 1, a schematic representation of this multiple-document producing scheme is shown. Text file 5 is a digital 15 representation of the document to be marked. Marking string file 6 is a set of data representing a table of alpha-numeric strings, each designating a recipient of the document. These files can reside in the memory of the computer or on a storage medium or can be received serially from another source and are created using well-known textentry techniques, typically by typing on a keyboard. Marking function 7 is a set of programme instructions which when executed by the computer causes the data of the text file to be altered so that when printed, the spacing between characters, words, lines or other class of similar text elements is increased or decreased according to the marking algorithms.
Within marking function 7 are routines which perform a transformation upon the marking strings, that is, the strings are input character by character and for each character a unique series of "longs" and "shorts" is generated. In this embodiment the transformation is a simple lookup of a 10-bit code, although in other embodiments more complex transformations are performed. For example, an encryption key can be included as a term in applications requiring security of marking.
This embodiment of the invention produces a number of copies of a document, each uniquely marked according to the contents of a marking string file. By way of example, the marking string file can contain a number of records, each of which is the name of an intended recipient of a document. In this example, if there are ten recipients, the i I wppoa300191 9arrangement of Fig. 1 will produce a first marked file 1 from text file using the first record of marking string file 6, then a second rmarked file 2 from text file 5 using tile second record of marking string file 6, and so on for each of the ten records. Assuming each recipient's name is different, the result is ten uniquely-marked documents.
It is of course possible to direct the output of marking function 7 directly to a printer, so that the marked text files are not required, or the files can be stored temporarily prior to printing, or permanently on disk or other medium.
Marking function 7 in this embodiment of the invention uses a 10 bit x| ,marking code, where each bit defines the length of a corresponding space in the marked document. The codes corresponding to each alpha- I 15 numeric character have been selected to minimise the occurrence of S long series of long or short spaces, so that the undesirable degradation I of the appearance of the marked document is minimised.
The marking code corresponding to each character is shown in the following table (table in which S represents a Short space, L represents a Long space.
h t t wppoa3 00191-1- TABLE 1. TABLE OF CHARACTER CODES 9 S 0* Se S
S*
**S
S
.55 I
SLSLSLSLSL
SLSLSLSLLS
SLSLSLLSSL
SLSLSLLSLS
SLSLLSSLSL
SLSLLSSLLS
SLSLLSLSSL
SLSLLSLSLS
SLLSSLSLSL
SLLSSLSLLS
SLLSSLLSSL
SLLSSLLSLS
SLLSLSSLSL
SLLSLSSLLS
SLLSLSLSSL
SLLSLSLSLS
LSSLSLSLSL
LSSLSLSLLS
LSSLSLLSSL
LSSLSLLSLS
LSSLLSSLSL
LSSLLSSLLS
LSSLLSLSSL
LSSLLSLSL-S
LSLSSLSLSL
LSLSSLSLLS
LSLSSLLSSL
LSLSSLLSLS
LSLSLSSLSL
LSLSLSSLLS
LSLSLSLSSL
LSLSLSLSLS
SSSLSLSLSL
SSSLSLSLLS
SSSLSLLSSL
SSSLSVLLSLS
SSSLLSSLSL
SSSLLSSLLS
SSSLLSLSSL
SSSLLSLSLS
LLLSSLSLSL
LLLSSLSLLS
LLLSSLLSSL
LLLSLSLSLS
SPACE
wppoa300191 11- As can be seen from table 1, the marking codes for most characters have been chosen to ensure a maximum of two consecutive long or short spaces, except that the characters 0-9, and space begin with three long spaces or three short spaces. This scheme allows the beginning of the characters starting with three long spaces or three short spaces to be located unambiguously even if the starting position of the coding is not known, for example when only a fragment of the document has been recovered. For this reason it is desirable that marking strings include at least one space or numeric character.
The marking can be applied by means of varying the spaces between words, characters or lines of text, according to tLie nature of the S. document. For example, for plain text in the usual paragraph form, varying spacing between words is generally the most visually acceptable S means of marking. However where text takes the form of tabulated columns, such variations can spoil the appearance of the text, and in such circumstances it may be desired to use variation of another class of text elements, such as inter-character spacing or inter-line spacing.
The selection of which type of space variations are used can be made responsive to the content of the surrounding document. For example, inter-line space variation can be selected automatically if the line of text being marked contains tabulation characters or if the positioning of words on the page is determined to be characteristic of tabular text.
25 Alternatively, selection of class of text elements to which spacing variations are applied can be controlled by directives embedded in the document.
9 9 In the case of varying inter-word spacing, this embodiment of the invention is arranged so that spaces following punctuation are ignored by the marking function, this being desirable to avoid ambiguity arising from multiple spaces commonly used after punctuation.
By way of illustration, Figs. 3 and 4 show the appearance of the different types of marking provided by this invention when applied to the sample document shown in Fig. 2. As seen in Fig. 2, the sample document comprises a paragraph of text followed by a table. This document is typical of that produced by a conventional word-processor, wherein the spacing between characters, words and lines are constant, wppoa300191 12- As seen in Fig. 3, the document has been maiked using the invention.
The paragraph of text has been marked by varying the inter-word spacing, and the table has been marked by varying the inter-line spacing. As seen in Fig. 4, the paragraph of text has been marked by varying the inter-word spacing, and the table has been marked by varying the inter-character spacing.
Operation of this embodiment of the invention will now be further explained with reference to a number of flow diagrams, in which the following abbreviations are used: I.F. Input file M.S.F. Marking striIg file S, t E.O.R. End of record a II CHAR Current character PREV Previous character St E.O.T. End of text PATTERN 10 bit marking code pattern BIT COUNT Number of bits of PATTERN remaining to be used DEC Decrement M.S.B. Most significant bit L.F. Line feed character Referring to Fig. 5, the process begins with the creation of the first output file. The first record of the marking string file is then S" 25 read. Each record of the M.S.F. corresponds to one string that is to be marked into each copy of the document output, that is, the number of records in the M.S.F. determines the number of marked copies that will S'be generated. The input file is ten read, marked with the S, current marking string, and output as a marked file. This process is repeated until the M.S.F. reahed end of file Within the process shown as "Mark are a number of sub-processes arranged to vary the type of marking applied to the document according to user instructions or according to the format of the text being marked.
For example, in one preferred embodiment, variation of spacing between words, characters or lines of text can be used to convey the marking, the most appropriate method being controlled by directives placed within the text. Examples of such directives are: turn marking on wppoa300191 13turn marking off <<char>> use inter-character spacing for marking <<word>> use inter-word spacing for marking <<line>> use inter-line spacing for marking These directives are not written to the output file, but instead are used as directives to the marking routines to switch to the requested mode from the point at which the directive occurs in the text.
Details of the various marking processes of this embodiment will now be given with reference to Figs. 6-9.
The flow diagram of Fig. 6 corresponds to the marking process which varies the inter-word spacing of the text. The M.S.F contains a number of records each of which corresponds to a text string to be marked into a copy of the document. Each ASCII character of the marking strings j c corresponds to a 10 bit marking pattern, as described above.
The marking process is performed on a charar.ter-by-character basis.
As each character is read from the input file, a test performed to determine whether the character read (CHAR) is the space character. If not, CHAR is stored in a temporary location PREV, CHAR is written to the output file, and a test is performed to see if the input file is at end of text. If not, the next character is read from the input file, and S' 25 this process is repeated until a space character is encountered. When a space is encountered, PREV is tested to see whether the previous character was a punctuation character. If it was, the space is ignored.
J j If not, a test is performed to see if BIT COUNT has reached zero, indicating that the end of the 10 bit marking pattern has been reached.
If not, the most significant bit of PATTERN is then tested. If it is set a large space is written to the output, if it is clear a short space is written to the output.
The BIT COUNT is then decremented and the PATTERN shifted one bit to the left, in preparation for the next iteration. If, on encountering a space in the input file, the test BIT COUNT=0 is true, the next byte of t; the marking string file is read. If the marking string file is at the end of a record, the file is reset to the beginning of the same record, so that the I marking string will be repeated throughout the pass of the input file. If wppoa300191 -14it is not at the end of the record, the byte read from the marking string file is used to obtain a 10 bit marking pattern using a lookup table containing the data of Table 1, and BITCOUNT is reset to When the test AT returns true, the M.S.F. is set to the beginning of the next record, so that the next marking string will be used to mark the next file generated. A test is then performed to see if bie M.S.F. is at E.O.T. A true result indicates that all required marked copies have been generated, in which case the process ends. If M.S.F. is not at another output is created, and the process repeats, generating an output file with the markings dictated by the next marking string of the M.S.F.
In this embodiment, the inter-word space modifying routine is arranged so that the overall line length is not changed by the marking function.
This is particularly important in the case of right-justified text, where it is desirable to maintain a straight right margin. The system by which this is achieved will now be described with reference to Fig. 7, which is a program listing in the BASIC language.
This routine is called once a line of text has been processed by the Smarking scheme described above, which has nominated long or short spaces for each of the inter-word spaces in the line and built an array (modarray) of space-size indicators, in this example for long or "S" for short. The routine of Fig. 7 first builds an array of space sizes (spacearray) which contains the size of each space in the line of the input file (current_space) and calculates the total size of space in the line (total_space) by summing each element of spacearray. Next the modulation of spaces is performed by increasing the size of spaces where the corresponding element of modarray is and decreasing the size of spaces where the corresponding element of modarray is The amount of change introduced is determined by the constant "factor", which in this embodiment can be selected by the user depending on the required degree of modulation of spaces. Values for' "factor" can range from 0, which results in no modulation, to 1, which results in words touching each other where the space between them is "short". Using this embodiment, a factor of 0.3 has been found to yield good results.
JLvy.UlVlE.i 1 IVIi.ISJUIN U aII I Va) fIAV The following statement is a full description of this invention, including the best method of performing it known to me.
wppoa30191 The process described so far will in most cases changed the total line length, and it is necessary to further adjust the spaces to return the total amount of space, and hence the line length, to the original value. This is achieved by the last two steps of Fig. 7. First, the change of line length is calculated by subtracting the original total space size from the new total space size. This error is then divided by the number of spaces in the line and the result added to each space, so that the correction is distributed evenly throughout the spaces of the line.
This space-modulating process is applicable to text processing systems which allow spaces between words to be finely controlled, for example using a page description language. In other cases, such as where only fixed-size spaces are available, the long space can be generated by using two consecutive space characters.
When it is desired to mark the document by varying inter-line spacing, the scheme shown diagrammatically in Fig. 8 is used. The process described in Fig. 8 is similar to that described above in relation to Fig.
6, except that detection of line feed characters are used to invoke the marking routines, instead of space characters. Also, it is not necessary in this case to test whether the character prior to the line feed was a punctuation character. Modulation that does not effect the overall page length can be achieved in a method similar to that described above in relation to inter-word space modulation.
I When it is desired to mark the document by varying inter-character spacing, the scheme shown diagrammatically in Fig. 9 is used. The process described in Fig. 9 is similar to tha: described above in relation to Fig. 6, except that detection space modulation is performed after every alpha-numeric character Modulation that does not effect the overall page length can be achieved in a method similar to that described above in relation to inter-word space modulation.
In some embodiments of the invention, certain additional features are provided for the purpose of streamlining the production of multiple marked documents. One such feature is additional software which provides the ability to command the computer to automatically use a recipient list (marking string file of the above-described embodiment) as a source of names to be inserted into a prescribed merge field of a wppoa300191 16document. This is useful for automatically annotating each marked copy with an identifying message. For example, it may be desired to print the message "THIS DOCUMENT IS MARKED AND UNIQUE TO J. SMITH" at the head of each version. In this case, the name (J.
SMITH) would take the form of a merge field in the original document, the name being automatically inserted in the output text as part of the marking process of the invention.
Another useful extension of the invention is its application to wordprocessing systems in which a plurality of users may have access to a document. One example of such an application is a multi-terminal word-processing system incorporating electronic mail facilities whereby a document can be circulated to a number of people in electronic form, that is, without printing on paper. Another example is a computer to which a number of users have access. In these and other cases, the invention can be used to deter those with access to the document from printing a copy and disclosing it to unauthorised persons. To achieve this, the invention can be adapted so that text files used by the wordprocessing system carry within. them an indicator that the document wvhich they represent is to be marked when printed, this indicator being accessible only to the author of the document. The printing software of the data processing system is arranged so that if any person instructs that the document be printed, the document will be marked, for example with the name of the person requesting the print. The name can for convenience be automatically retrieved from a file containing a correspondence between the password of a user of the system and that user's name. Alternatively, the marking can be determined by a marking string designated by the original author of the document at the time it is electronically mailed to each person, the string and the indicator that the document is to be marked when printed being linked to the file when mailed.
Although the effectiveness of the present invention is generally not diminished by the ease with which anyone can detect and decode the marking of a document it has generated, it is desirable in some cases to prevent unauthorised persons decoding a marking. In some cases it may also be desirable to prevent unauthorised persons making a marked document with a marking identical to that of another. For example, it is possible that a person possessing the present invention could generate a I i: wppoa300191 17document marked with someone's name and then use that copy improperly, with the result that the person whose name was used would be blamed. To overcome these potential abuses, the invention can be further extended to encrypt the marking using an encryption key known only to the authorised user. This modification is shown diagrammatically in Fig. 10. This flow diagram shows the basic document marking scheme, as described in relation to Fig. 5 above, and includes the further step of encrypting each record of the marking string file prior to applying the marking to the document. The encryption system can be any of the schemes well known in the art. The encryption key is known only by the authorised user, and must be applied to the decoded spacing variation data to recover the correct marking information from a marked document. It will be understood that this scheme of encrypting is only one of many that will achieve the desired result. For example the encryption could also be applied with good results to the "look up pattern" step of Fig. 6.
In another embodiment of the invention, the basic marking scheme of the invention is used to encode a sub-text of arbitrary length within the spaces between words of a document. Such an embodiment is shown schematically in Fig. 11. Referring to Fig. 11, a marking function 23, as described in the context of the embodiment above, is applied to a text file 21 to produce a marked document 24, except that in this case the marking corresponds to the characters of sub-text file 22. Sub-text file 22 contains a message of arbitrary length which is encoded into the marked document, instead of individual short strings used in the marking string file 6 of Fig. 1. Using this or similar embodiments, the invention can be used to convey information within the spaces between Swords, at a density of approximately one character per ten words.
Although the invention is very useful when realised as a wordprocessing system, it is also envisaged that the inventive concept can be adapted for use with other document processing or document reproducing devices. For example, in many cases it is desirable to provide a system for marking documents which have already been printed as hard copy. Another useful embodiment of the invention which achieves this object will now be described with reference to Fig.
12, which shows in schematic form a photocopier adapted to perform document marking according to the invention. Referring now to Fig.
wppoa300191 8- 12, copier 31 is a document copying machine of the type now commonly referred to as a "digital copier". Digital copiers commonly comprises electro-optical scanning means coupled to an electronically controlled print engine, using a scanning laser, light-emitting diode array or liquid crystal array to form an image on a drum which is then transferred to paper. In this embodiment of the present invention, as well as comprising scanner 32 and print engine 36, copier 31. is equipped with image processor 33 which in turn comprises text element detector 34 and marking processor 35. When a marked copy of a document is to be made, the image data output by scanner 32, which usually feeds print engine 35, is routed instead through image processor 33. Within image processor 33, the image data is first acted on by text element detector 34, which is arranged to group objects on the scanned V 1 page into elements such as characters, words and lines. There are many techniques for achieving this, well known to the optical character Cm recognition art. For best results, a scheme which works irrespective of L iorientation of the document relative to the scanner is used. Once the elements of interest have been identified, a marking process is performed by marking processor 35, according to the methods of the present invention. For example, if inter-word spacing modulation has been selected as the marking method for a particular document, text V telement detector 34 is arranged to group elements into words and marking processor 35 then moves the words slightly relative to each other in the plane of the lines of text to achieve the lengthened and shortened spaces required. The choice of class of text elements to which space variation will be applied can be effected manually by an operator or software which determines the most suitable form of marking do, according to the format of the document being copied can be provided within image processor 33. When marking is complete, the image data is transmitted from image processor 33 to print engine 36, and the image is printed on paper. To aid in identifying a particular copy, the marking text can also be printed in a convenient position on the document, for example, at the foot of each page.
The marking string used by this embodiment of the invention can be input in a number of ways. For example, the operator can key in a text string or serial number using a keyboard, or load in list of recipients, to be used as marking strings, from a floppy disk. Another useful variant i is to provide further processing means within the copier of this ,l A i provided for deriving the marking code ar tomatically from information wppoa300191 19embodiment so that a list of recipients can be entered for use by marking processor 35 by scanning a printed list on the copier. In this case optical character recognition software within image processor 33 reads the recipient list off the document in bitmapped image form, converts the image to text, and uses this text as the marking strings.
One adaptation of this embodiment of the invention is directed at the problem of preventing breach of copyright in places such as libraries, where people have access to books and although it is common to provide photocopier facilities for uses which constitute fair use, it is desirable to curtail use of the photocopier in ways which would breach copyright.
For such applications, the present invention is provided with an electronic locking system that allows the copier to operate only after a code number or word is entered by the user. The information entered is then used as the marking string, according to the scheme described above. One way of entering the code is to use a coded card, such as a magnetic-stripe credit card, which carries identification unique to the cardholder. Alternatively, the locking system can be adapted so that codes must conform with certain requirements to be accepted as valid, such valid codes being issued by an authorised person at the library and keyed in by the user or remotely entered by the authorised person. For example, the generation of valid codes can be accomplished by an algorithm which combines the user's name with an encryption key and produces a large but finite set of valid outputs, so that the copier's controller can detect entered codes which have not been legitimately generated.
In another useful variant of the embodiment of Fig. 12, image processor 33 is further adapted to apply pseudo-random variations in spacing between elements of text so that each copy produced is distinctively marked. The pseudo-random algorithm is arranged so that aesthetic constraints are not violated, for example, total line lengths are maintained and tables or columns are not unduly disturbed. This embodiment has an advantage in circumstances it is not desirable or practical to enter a recipient list into the device. In this case, to enable documents to be traced to their source, the system can be arranged to generate two identical copies of each pseudo-randomly marked copy.
When the documents are distributed, the name of each recipient is L ii tl- I nrm I wppoa300191 written on the duplicate and the duplicates are filed in case it is later necessary to match them up to a particular marked version.
Alternatively, the invention can be adapted to mark copies with a unique serial number, the number being generated automatically and output so that a record of to whom a copy with a given serial number was given can be kept.
As well as application to photocopiers, the present invention is of great value when applied to other document processing or reproducing systems, including facsimile machines.
In many cases, the benefits of the markhig scheme extend beyond the ability to monitor distribution of confidential or copyright documents.
For example, when used with facsimile transmissions, the marking can be used to authenticate documents. In this case, encrypted marking can be used for extra security.
Irrespective of the embodiment of the present invention used to mark the document, there are a number of schemes which can be used to decode the marking of a particular document.
In the simplest case, the marking can be decoded manually, by observing the pattern of long and short spaces in the document and looking up the corresponding character codes as per Table 1. The process can be partially automated by providing a means for decoding the marking information from spacing information determined visually and input into a processing device programmed to perform the inverse of the marking process.
Alternatively, the source of a marked document can be identified by optically comparing the document with a set of copies made prior to circulating the document and identifying the one which matches. For identification, the copies should be labelled with the name of the recipient. For better security, it may be desired not to keep copies of the circulated documents, but to generate a new marked set for comparison purposes should the need for identification arise. Visual comparison can be aided by producing a transparency of the document which can be used to overlay the copy to be compared.
wppoa300191 -21- If desired, the decoding process can be automated, using a document scanner to input the document to be decoded to a computer which can then decode the marking, by ascertaining the spacing between words, characters or lines as appropriate. The embodiment of Fig. 12 can readily be adapted for decoding markings, thereby providing a photocopier which outputs a page printed with the text of the decoded marking when presented with a page of a marked document.
While the invention has been described with reference to particular embodiments thereof, it will be understood by those skilled in the art that changes in the form and detail may be made without departing from ll' the scope and spirit of the invention.
The marking process of the invention is independent of the nature of the document originating means and document printing means and it is anticipated that the invention can be realised in many ways other than those specifically mentioned herein. In particular, the invention can be realised as an integral part of a word-processing system, by adding suitable software to the word-processing software, or it can be ralised as a stand-alone device interposed between a source of text data, such as a word processor, and a printer, or it can be realised as a printer adapted to carry out marking according to the invention.
It will also be understood that the scheme for relating a given inarking code to a given sequence of space variations utilised by the embodiments described above are exemplary only and many other schemes, obvious to those skilled in the art, can be used without departing from the scope of the invention.
It is also envisaged that in cases where it is desired to make it readily apparent that a document has been marked, one or more printed characters can be used instead of or as well as variations of spacing between words. For example, in the case of the embodiment described above in which two consecutive spaces are used to generate a long space, a space followed by an asterisk can be used, yielding a marked document with asterisks distributed in a distinctive pattern throughout.
Although this technique significantly affects the appearance of the text, .t is nevertheless useful in cases where it is desirable that the marking be wppoa300191 22highly apparent, easy to decode, and unlikely to be obscured through tampering, blurring or otherwise distorting the image.
It is further envisaged that whereas the embodiments described above utilise marking information provided by an operator, other adaptations of the invention can be provided whereby the marking codes can be automatically generated by the invention, for example by forming an ascending number sequence, or a sequence of random numbers, thereby assuring that each copy of a document is distinctively marked, without requiring the operator to provide specific marking information. In such cases it is desirable to maintain a set of duplicates of the documents before circulation for identification purposes. Other embodiments are possible in which the marking information is taken from a data field already serving another purpose within the memory of the data processing apparatus. For example the invention can be made to use the time and date information commonly resident in memory as the marking information, with the result that each copy of a marked document generated will be marked with the time and date at which the document was generated. Other fields suci as the name of the author of the document, operator's password, or addressee's name can also be used in like manner.
It will also be understood that whereas the exemplary embodiments described herein refer to the marking process taking place ir mnediately prior to printing a document, the invention can also be beneficially applied for marking documents in electronic form, that is, documents in the form of files of data which may or may not be printed to form hard-copy at a later time.
Whereas the invention is described herein in relation to marking documents comprising characters, words and other text objects, the invention can equally well be used to mark documents comprising any indicia, including icons, special symbols, pictures, glyphs and the like.
Claims (19)
1. A method of document marking comprising the steps of: inputting a document; receiving a marking code to be applied to said document; grouping text symbols of the input document into classes of text elements selected from the group consisting of characters, words, and lines; selecting, according to the content of the document, at least one of said classes of text elements as the class to which space adjustments are to be applied; and adjusting the spacing between text elements of the selected class according to a function of said marking code.
2. The document marking method of claim 1 wherein said received marking code is input by an operator.
3. The document marking method of ciaiin 1 wherein: said marking code comprises binary data representing text symbols; and the step of adjusting the spacing between text elements 1. comprises the sub-steps of: for each text element of the document which is of a class to 25 which space adjustments are to be applied, testing a corresponding bit of said binary data and, if said bit is of a first logic state, increasing the spacing between the current text element and an adjacent one, and if said bit is of a second logic state, decreasing the spacing between the current text element and an adjacent one. The document marking method of claim 3 wherein the step of inputting a document comprises the sub-step of receiving a file comprising digital data words, each data word corresponding to a text symbol. s <s< u .s1 77282/91 Substitute page 24 The document marking method of claim 3 wherein the step of inputting a document comprises the sub-steps of: optically scanning a printed documenrt: -e,nd forming, in memory means, digital data representative of the scanned image.
6. The document marking method of claim 1 wherein said received marking code is a calculated function of data stored within data processing means.
7. The document marking method of claim 6 wherein the step of inputting a document comprises the sub-step of receiving 0 file comprising data words, each data word corresponding to a text symbol.
8. The document marking method of claim 6 wherein the step of inputting a document comprises the sub-steps of: optically scanning a printed document; and S 20 forming, in memory means, digital data representative of the scanned imag.. t 9. The document marking method of any of claims 1 to 8 wherein the a0 step of adjusting spacing between text elements is arranged so that 25 the alignment of columns of text and overall lengths of lines of text of the marked document are the same as those of the input document.
10. A method of producing multiple documents each of which is uniquely I marked, according to the method of any of claims 1 to 8 and including the further steps of receiving a second and subsequent marking code and for each marking code received repeating the method of any of tclaims 1 to 8.
11. A document marking method of any of claims 1 to 8 which minimises visual detraction of the marking method, wherein the class of text elements to which adjustments of spacing are applied is responsive to directives within the document. -7 77282/91 Substitute page
12. The document marking method of any of claims 2 to 5 wherein the marking code is received by electro-optically scanning a printed document bearing indicia representative of a marking code.
13. A method of decoding a marked document comprising the steps of: electro-optically scanning said document to generate a signal representative of the image thereon; classifying spaces between text elements as long or short by processing said signal to measure the displacement of text elements which have been displaced from their normal position by the marking process; forming a series of binary bits according to said classification; forming data representative of the marking borne by said document according to said series of binary bits.
14. Document marking means comprising: means for inputting a document; means for receiving a marking code to be applied to said document; means for grouping text symbols of the input document into classes of text elements selected from the group consisting of characters, words, and lines; means responsive to the content of the document for selecting at least one of said classes of text elements as the class to which 25 space adjustments are to be applied; and means for adjusting the spacing between text elements of the selected class according to a function of said marking code. rtr The document marking means of claim 14 wherein sa-, received marking code is input by an operator. 3noi I_ ~i i II_~ 77282/91 Substitute page 26
16. The document marking means of claim 14 wherein: said marking code comprises binary data representing text symbols; and means for adjusting the spacing between text elements comprises spacing processor means adapted to: for each text element of the document which is of a class to which space adjustments are to be applied, testing a corresponding bit of said binary data and, if said bit is of a first logic state, increasing the spacing between the current text element and an adjacent one, and if said bit is of a second logic strte, decreasing the spacing between the current text element and an adjacent one.
17. The document marking means of claim 16 wherein means for inputting a document comprises means for receiving a file comprising digital data words, each data word corresponding to a text symbol.
18. The document marking means of claim 16 wherein means for :inputting a document comprises an optical document scanner. 1 The document marking means of claim 14 further comprising marking code originating means adapted to send marking codes to 25 said marking code receiving means. 0
20. The document marking means of claim 19 wherein means for inputting a document comprises means for receiving a file comprising data words, each data word corresponding to a text symbol. "7 21. The document marking means of claim 19 or claim 20 wherein said CC: marking code originating means comprises processor means adapted C to form a marking code representative of identifying data resident within memory means, said identifying data being selected from the group consisting of a user password, user name, addressee's name, password, date, time or document name. A 22. The docunent marking means of claim 19 wherein means for inputting a document comprises an optical document scanner. 77282/91 Substitute page 27
23. The document marking means of any of claims 14 to 22 and further comprising means for printing a marked document.
24. The document marking means of any of claims 14 to 18 and further comprising: means for scanning a document bearing text to be used as a marking string; and optical character recognition means adapted to receive signals from said scanning means and send data representative of characters recognised to said marking code receiving means. The document marking means of any of claims 14 to 22, further comprising means responsive to directives within the document for selecting the class of text elements to which adjustments of spacing are applied.
26. The document marking means of any of claims 14 to 22, further comprising: document printing means; and locking means adapted to pr,9vent operation of said document marking means unless a marking code has been received. .lt tI 27. The document marking means of any of claims 14 to 22, further 25 comprising: document printing means; marking code entry means; discriminator means adapted to determine if an entered marking code meets pre-determined validity rules; and 30 locking means responsive to the output of said discriminator c means and adapted to prevent operation of said document ~printing means unless a valid marking code has been entered. i ii 77282/91 Substitute page 28
28. Means for decoding a marking from a document comprising: an electro-optical scanner for generating a first signal representative of the image thereon; discriminator means for receiving said first signal and forming a second signal of a first type if the size of spaces between text elements is larger t, ,ian a certain value and of a second type if the size of spaces between text elements is smaller than a certain value; and means for processing said second signal to form a third signal representative of the marking borne by said document. ter. S. t I t t C cttc t K
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU77282/91A AU650876B2 (en) | 1991-05-22 | 1991-05-22 | Document marking system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU77282/91A AU650876B2 (en) | 1991-05-22 | 1991-05-22 | Document marking system |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU31811/89A Addition AU623895B2 (en) | 1988-02-11 | 1989-02-08 | Document marking system |
Publications (2)
Publication Number | Publication Date |
---|---|
AU7728291A AU7728291A (en) | 1992-12-03 |
AU650876B2 true AU650876B2 (en) | 1994-07-07 |
Family
ID=3757838
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU77282/91A Ceased AU650876B2 (en) | 1991-05-22 | 1991-05-22 | Document marking system |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU650876B2 (en) |
-
1991
- 1991-05-22 AU AU77282/91A patent/AU650876B2/en not_active Ceased
Also Published As
Publication number | Publication date |
---|---|
AU7728291A (en) | 1992-12-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5467447A (en) | Document marking system employing context-sensitive embedded marking codes | |
US5388194A (en) | System for altering elements of a text file to mark documents | |
Maxemchuk | Electronic document distribution | |
US5568550A (en) | Method and system for identifying documents generated by an unauthorized software copy | |
EP0629972A2 (en) | Method and apparatus for embedding identification codes in printed documents | |
US5699427A (en) | Method to deter document and intellectual property piracy through individualization | |
Brassil et al. | Hiding information in document images | |
US6351815B1 (en) | Media-independent document security method and apparatus | |
US6072871A (en) | Method and system for identifying the source of documents generated by software and documents generated thereby | |
EP1956823B1 (en) | A method and device for embedding digital watermark into a text document and detecting it | |
EP1432234B1 (en) | Systems and methods for providing hardcopy secure documents and for validation of such documents | |
JP3219251B2 (en) | Electronic document processing device | |
US20040001606A1 (en) | Watermark fonts | |
CA2377685C (en) | Automatic barcode creation for data transfer and retrieval | |
US20030145206A1 (en) | Document authentication and verification | |
US20070003341A1 (en) | Image processing device, image processing method, program, and recording medium | |
JP2004240969A (en) | Storage system for document digitally created and signed | |
US20100208938A1 (en) | Method and system for producing certified documents and the like | |
US20060147082A1 (en) | Method for robust asymmetric modulation spatial marking with spatial sub-sampling | |
KR100771133B1 (en) | Forming apparatus for image and restricting method of copy thereof | |
EP0895691B1 (en) | Method and system for identifying documents generated by an unauthorized software copy | |
WO2015140562A1 (en) | Steganographic document alteration | |
AU650876B2 (en) | Document marking system | |
Brassil et al. | Marking text features of document images to deter illicit dissemination | |
JP3545782B2 (en) | How to keep confidential documents confidential |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |