AU757593B2 - Image communications - Google Patents
Image communications Download PDFInfo
- Publication number
- AU757593B2 AU757593B2 AU72225/00A AU7222500A AU757593B2 AU 757593 B2 AU757593 B2 AU 757593B2 AU 72225/00 A AU72225/00 A AU 72225/00A AU 7222500 A AU7222500 A AU 7222500A AU 757593 B2 AU757593 B2 AU 757593B2
- Authority
- AU
- Australia
- Prior art keywords
- facsimile
- image
- text
- circuit
- graphics
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Landscapes
- Facsimiles In General (AREA)
Description
P/00/011 28/5/91 Regulation 3.2
AUSTRALIA
Patents Act 1990 COMPLETE SPECIFICATION STANDARD PATENT
(ORIGINAL)
Name of Applicant: Actual Inventor: Address for Service: Inmarsat Ltd, of 99 City Road, London EC1Y 1AX, United Kingdom Eyal TRACHTMAN DAVIES COLLISON CAVE, Patent Attorneys, of 1 Little Collins Street, Melbourne, Victoria 3000, Australia.
Invention Title: "Image communications" The following statement is a full description of this invention, including the best method of performing it known to us: P:OPER\DBW\73238-96 div.doc-13 Dmbr. 2000 2 IMAGE COMMUNICATIONS The present invention relates to apparatus and methods for image communications, particularly but not exclusively for facsimile images.
Facsimile image communication has developed to a high level of reliability. Facsimile image communications are highly standardised. The signalling stage of communication is specified in, amongst other things, Recommendation (CCITT recommendation T30 "Procedure for document transmission in the general Switched Telephone Network", Volume VII, fascicle VII.3, Melbourne 1988, pages 77-167), and the image encoding is dealt within Recommendation T4 (CCITT recommendation T4 "Standardisation of group 3 facsimile apparatus for document transmission", Red Book, Volume VII, fascicle VII.3, Malaga-Torremolinos 1984, pages 20 16-31).
These standards were originally developed for operation between two facsimile machines over a circuit connection through the terrestrial wired public switch telephone network. Considerable difficulties arise when the same standards are employed over satellite communication links, e because the relatively long transmission time can prevent effective functioning of the T30 signalling protocols.
S
is These difficulties have been alleviated by the provision of facsimile interface units, as described in GB-A-2286739 (for example) A known facsimile interface unit has a port for connection to a facsimile unit (if necessary via a PSTN or other terrestrial network), and a port for connection to a satellite ground station. Each facsimile interface unit (FIU) emulates a facsimile machine, so that the exchange of protocol signals takes place locally between each facsimile unit and its facsimile interface unit rather than being carried over the satellite transmission link.
Facsimile interface units also find application in other radio transmission systems where delays may occur; for example, in terrestrial cellular radio systems such as GSM systems.
IT has been proposed to provide compression of the facsimile image at the FIU prior to transmission over the satellite channel. In "Compression of facsimile graphics for transmission over digital mobile satellite circuits", Dimolitsas and Corcoran, Milcom '91, pages 30.1.1-30.1.4 (0644-0647), 1991 IEEE, a compression scheme based on selective removal of pixeis to increase run lengths is proposed.
However, some satellite links (for example the relatively low data rate Inmarsat-C link) cannot be 2 25 r= P:\OPER\RAB\722254X) -pl-nc I.d.-22/I 1112 -4used with this type of compression because the data rate after compression is still too high.
Many proposals for compressing image signals in general are known. For example, US-A-4410916 proposes a code book compression system in which, once a symbol (for example a printed character) is encountered on a document, an entry in a code book is created and subsequent occurrences of that symbol in the document are merely replaced by a reference to the entry in the code book. EP-A0112991 teaches 10 substantially the same idea, but extended further to provide code book entries for whole words as well as characters.
However, these schemes are intended to be provided within a facsimile transmitter (and corresponding receiver) to replace CCITT recommendation T4, rather than being provided at an interface which can communicate with standard facsimile terminals.
In accordance with the present invention there is provided a method of interfacing a facsimile signal to a radio transmission system, comprising the steps of: S 20 receiving a call from an originating facsimile station; signalling back to said originating facsimile station to select the highest available resolution thereof; decoding the facsimile image signal to provide an image in said highest available resolution; compressing said image to provide a compressed image signal; and transmitting said compressed image signal via said radio transmission system.
P:\OPER\RAB\722254X)rcp n, I.doc-22/1 I)2 The present invention also provides a facsimile interface unit having a first port for connection to a terrestrial network and a second port for connection to a satellite earth station, the facsimile interface unit comprising a compressor for compressing a facsimile signal received at said first port for transmission at said second port, and a selector circuit for selecting the compression applied by said compressor circuit in dependence upon the identity of at least one of the called and calling parties.
Si.: *0 *o *go *ooo *oo *oooo P:OPERUDBW73238-96 div.doc- 13 Dcmber. 2000 6 For instance, the present invention can provide a facsimile interface unit which receives a conventional facsimile image signal (encoded, for example, according to T4) and compresses it by performing optical character recognition using standard, stored templates. Thus, the image signal is replaced by a stream of character symbols, which occupy very much less bandwidth. The bandwidth required may be reduced still further by subsequently performing text compression on the stream of symbols. A corresponding receiver FIU can receive the character symbols and reconstructs a facsimile image signal for transmission on to the receiving facsimile.
The FIU may send not only the characters but data defining the text area in which the characters are located.
Thus, somewhat of the general layout of the document can be preserved even though only the textual matter is transmitted.
The FIU may also be arranged to perform compression of a signal, in which the amount of compression is selected by 20 the sending or the receiving party, and preferably the latter. Thus, the receiving party (who may be paying for the satellite link) can decide, for example, to receive only the text part of a document compressed to a high degree using optical character recognition according to the above S- 25 aspects; or may receive text and simple graphics, or may a. 0 0receive a document compressed to a much lower degree (which consequently requires longer transmission time) 0. Preferred embodiments of the present invention are hereinafter described, by way of example only, with reference to the accompanying drawings, wherein: Figure 1 is a block diagram showing the elements of a facsimile store and forward system incorporating a first P:\OPER\DBW\73238-96 div.doc-13 Decembr, 2000 7 embodiment of the present invention; Figure 2 is a schematic block diagram showing in greater detail components of a facsimile apparatus, a facsimile interface unit and an earth station according to the embodiment of Figure 1; Figure 3 is a schematic block diagram showing in greater detail the components of a portion of the facsimile interface unit of Figure 2; Figure 4 (comprising Figures 4a-4c) is a flow diagram showing the process of operation of the apparatus of Figure 3; Figure 5 is a block diagram showing a component of a facsimile interface unit according to the embodiment of Figure 2; Figure 6 (comprising Figures 6a and 6b) is a flow diagram showing schematically the process of operation of the apparatus of Figure Figure 7 is a block diagram showing in greater detail the elements of an optical character recognition circuit 20 forming part of Figure 3; o Figure 8 is a flow diagram illustrating the operation of the apparatus of Figure 7; Figure 9 is a block diagram illustrating schematically the components of a graphics coder forming part of the embodiment of Figure 2; Figure 10a and 10b are illustrative diagrams demonstrating the operation of the graphics coder of Figure 9; Figure 11 is a diagram illustrating the format of a signal output via a formatter forming part of the embodiment of Figure 3; Figure 12 is a flow diagram showing schematically the operation of a reception control circuit forming part of the embodiment of Figure 2; and Figure 13 illustrates a facsimile image comprising text, graphics and image data regions.
Referring to Figure 1, a mobile facsimile terminal 10 is connected to a mobile earth station 14 by means of a first facsimile interface unit (FIU) 12.
20 A fixed facsimile terminal 24 is connected, via a PSTN oo 22, to a second facsimile interface unit 20, which in turn is connected to a fixed earth station 18. The fixed earth station 18 is arranged to communicate with the mobile earth station 14 via a satellite 16, which 25 may be placed in geostationary orbit or in a nongeostationary orbit (for example a low earth orbit, an intermediate orbit a 6 hour circular orbit), or a high elliptical orbit).
For the purposes of the present invention, the mobile facsimile terminal 10 will be described as the called terminal and the fixed facsimile terminal 24 as the calling terminal, although it will be appreciated that in general each may perform either function.
As shown in Figure 2, the calling facsimile terminal 24 comprises an input device 30, such as a scanner for scanning a document or an input port from a personal computer for sending facsimile data, and a facsimile transmission microprocessor 32 which encodes signals from the input device 30 according to a predetermined algorithm. The facsimile transmission microprocessor 32 also controls the operation of facsimile transmission, including call set-up, premessage procedure, message transmission, post-message procedure and call release. The output of the facsimile transmission microprocessor 32, in the form 20 of digital data, is modulated by a facsimile transmission modulator 34 to produce an analogue output suitable for transmission through a public service telephone network.
The analog output of the facsimile transmission modulator 34 is connected, either or thoug modulator 34 is connected, either directly or through a a telephone circuit, to the calling FIU 20, which demodulates the analog output to recover the digital facsimile data. The calling FIU 20 comprises a demodulator 36, which converts the modulated signal to digital data, a transmission control unit 38 which encodes the data and an output buffer 40 from which encoded data is transferred to the fixed earth station 18. In the fixed earth station 18, the data is modulated by a radio frequency (RF) modulator 42 connected to an RF transmitter 44, which transmits the signal to the satellite 16 by means cf an antenna directed at the satellite 16. The calling FIU 20 may be integrated with the fixed earth station 18.
Although not germane to the present invention, it is mentioned that the fixed earth station 18 may further comprise an RF receiver 46 for receiving RF signals from the satellite 16, in this case RF signals transmitted by the called facsimile terminal 10. The received signal is demodulated by an RF demodulator 48 to produce a digital signal which is stored in an input buffer 50 in the calling FIU 20. The digital signal is decoded by an FIU receiving control unit 52 and transferred to an FIU receiving modulator 54 which modulates the decoded data to produce an analog output 25 signal suitable for reception by the calling facsimile terminal 24.
ii The received signal is then demodulated by a facsimile receiving demodulator 56 in the calling facsimile terminal 24 to produce digital data, which is decoded by a facsimile receiving microprocessor 58.
The facsimile receiving microprocessor 58 controls an output device 60 such as a printer to print a hard copy of the received facsimile, or an output port to a personal computer for receiving facsimile data.
The facsimile terminal 24 and earth station 18 are known per se, and the above description thereof is merely illustrative.
The structures of the mobile facsimile terminal mobile facsimile interface unit 12 and mobile earth station are equivalent to those described above.
Compression Referring to Figure 3, the structure of the transmission control unit 38 will now be discussed in greater detail.
The transmission control unit 38 in this embodiment comprises a reception signalling control unit 100 which receives T30 control signal from the demodulator 36 and transmits back signals via the modulator 54 to set up the call from the calling facsimile terminal 24. It further comprises a 25 facsimile image decoder 102 arranged to receive facsimile image signals from the demodulator 36, encoded according to the T4 standard, and to write a corresponding black/white image into a frame store 104 dimensioned to contain an entire page of an image encoded at high resolution.
Connected to read the frame store 104, in parallel, are an optical character recognition (OCR) circuit 106 followed by a text compression circuit 107; a graphics coder circuit 108; and an image coder circuit 110. Each may be provided by a suitably programmed microprocessor, microcontroller or digital signal processing (DSP) chip. Alternatively, one or more may be provided by a dedicated chip or chip set, or one or more may be provided by a single suitably programmed microprocessor or microcontroller.
Also connected to the frame store 104; the OCR circuit 106; the graphics coder circu:i 108; and the image coder circuit 110 is a segmenter circuit 112 operable to designate segments of the image in the frame store address ranges therein) to be operated on by the OCR circuit, the graphics coder and the image coder circuit.
The outputs of the text compression circuit 107, S".the graphics coder circuit 108 and the image coder circuit 110 are supplied to a formatter device 114 25 which combines all three into a frame or packet format for supply to the output buffer 40 or a store a 13 large hard disk) 116, the destination being selected by a transmission control circuit 118, which is also selectively operable to couple the store 116 to the output buffer Finally, a customer data signalling circuit 120 is provided, which is operable to access a remote database (for example a central database) storing customer data to be described in greater detail below, and to receive therefrom predetermined customer data for controlling the operation of the receive control circuit 100 or transmit control circuit 118.
The operation of the apparatus of this embodiment will now be disclosed in greater detail with reference to Figure 4. In step 202, if the receive control circuit 100 detects an incoming call, during the initial call set-up signalling the receive control circuit 100 signals back to the calling facsimile 24 n a step 204 to request the highest available resolution on the calling facsimile 24. In this embodiment, preferably, details of common facsimile apparatus are stored in the receive control circuit 100 so that if the calling facsimile 24 supports non standard features, any such features which improve the resolution of the incoming facsimile image are 25 accepted.
o"e Inn sters 206 and 208, the receive ccntrol circuit 100 reads the calling and called telephone numbers (or, in general, ID data).
In step 210, the fax image decoder circuit 102 receives the incoming run length encoded facsimile bit stream and constructs a corresponding frame image (1 bit per pixel) in the frame store 104.
In step 212, the customer data signalling circuit 120 transmits the called party ID to a remote database 300 which comprises a record for each mobile terminal 10. In reply the customer data signalling circuit 120 receives a message from the database 300 indicating, firstly, any restriction of the type of compression required- by the terminal 10 (specifically, a restriction to text only compression or text and line compression) and any specific instructions for facsimile forwarding which may have been stored for the terminal 10 (specifically, one of the following: S1. forward at time X 2. attempt to forward at intervals of X hours 3. attempt to forward only Y times 4. forward on registration store facsimile permanently).
If (step 214) the compression mode is specified as text only, in step 216 the segmenter 112, in S25 conjunction with the optical character recogniser
*O.
circuit 106, segments out any text areas of the image and the optical character recogniser 106 extracts a corresponding stream of characters in step 218, which (after compression by the compressor 107) are supplied as corresponding digital data
ASCII
representation for Roman characters) to the formatter 114.
If the mode is set, in step 214, to text and line images only, then in a step 220 the segmenter 112, in conjunction with the OCR circuit 106 and graphics coder 108, segments out any text area and any line areas of the image in the frame store 104. Then, in step 222 (as in step 218) the OCR circuit 106 ex:racts digital data corresponding to text characters from the text areas, which are then text-compressed. In a step 224, the graphics coder 108 encodes the line graphics areas identified by the segmenter 112 and generates corresponding line output data to the formatter 114.
S If the compression mode is unspecified, or is specified as text, line and image compression, in step o* 20 226 the segmenter 112 (in conjunction with the OCR circuit 106, graphics coder 108 and image coder 110) segments the image into text areas, line areas and image areas. In step 228 (as in steps 218 or 222) the o OCR circuit 106 and text compression circuit 107 generate a stream of compressed digital data presenting text characters encoding the text areas; representing text characters encoding the text areas; 16 in step 230 (as in step 224) the graphic coder 108 generates a stream of graphics codes to represent the graphic areas; and in a step 232, the image coder generates a stream of digital image data representing the image areas of the facsimile. All three streams of data are supplied to the formatter 114.
Figure 13 illustrates a facsimile image, comprising text regions Tl, T2 and T3, comprising English text of different sizes; simple graphics regions G1 (a graph) and G2 (handwritten text); and image regions 11 and 12.
Referring to Figure 4b, the one, two or three streams of data are formatted in step 234 by the formatter circuit 114. The transmit control circuit 118 signals, in a step 236, to the called terminal via the satellite 1C, mobile earth staion 14, and mobile FIU 12. In the event that the connection is :unsuccessful (step 238), the formatted compressed image signal stream from the formatter 114 is supplied to the store 116 in a step 240, together with the called terminal forwarding information determined in step 212.
Then, a signal is transmitted back to the calling oo'* terminal in step 246 indicating that the message has 25 been stored, as disclosed in greater detail below.
If a connection is possible (step 238), the compressed image data from the formatter 118 is forwarded to the output buffer 40, and then transmitted via the fixed earth station 18, satellite 16, and mobile earth station 14, to the mobile FIU 12 (step 242). When confirmation of receipt is received from the mobile FIU 12, an acknowledgment is transmitted back to the calling terminal 24 (step 244).
Referring to Figure 4a, when no incoming call is detected in step 202, in step 245 the transmit controller circuit 118 reads the store 116 to determine whether the forwarding information stored for any srored message indicates that an attempt should now be made to transmit that message. For example, it may be noted that four hours have elapsed since the last recorded attempt to transmit the o" message; or the time may now be approximately equal to the time for transmission which has been requested by the called terminal In the event that one of these conditions is met, transmission event occurs and control passes to the subroutine of Figure 4c. In step 247 (as in step 236), the transmit control circuit 118 signals to the 9 called terminal 10, and in step 248, the transmission 25 control circuit 118 determines (as in step 238) whether the called terminal 10 is available and, if 18 so, in step 250 (as in step 242) the message is transmitted and, upon acknowledgment of successful transmission from the mobile FIU 12, the stored message is deleted from the store 116 in a step 252.
If it is not possible to connect to the called terminal 10 in step 248, the transmission control circuit 118 determines whether or not the stored message should be deleted (step 254) in accordance with the forwarding information determined in step 212, either because a predetermined time has elapsed since the stored message was initially received or because a predetermined number of unsuccessful attempts rn forward the message have been made. If the file is determined to be deleted, step 252 is performed; if not, or after deletion in step 252, control return returns to step 202 of Figure 4a.
Decomression :Referring to Figures 5 and 6, the corresponding decompression process will now be described.
Figure 5 indicates the components of the receive control circuit 52 of the mobile FIU 12 (the same components may also be present in the fixed FIU The receive controller comprises a deformatting circuit 400, operable to reverse the formatting of the 25 formatter circuit 114 a reception control circuit 402 for negotiating the call set up via the satellite 16 with the fixed FIU 20; a transmit control circuit 404 for negotiating the call set up with the mobile facsimile terminal 10; a frame store 406 for storing a binary (thresholded) black/white image of a received page of facsimile; and a facsimile coder circuit 408 for encoding the frame image held in the frame store 406 according to the T4 transmission protocol, connected to supply the fax image to the output buffer 54.
Also provided are a text decompression circuit 409; a font library store 410 storing, for each of a set of text characters (for example the ASCII character set) a bit map font corresponding to an image of the character; a vector to raster converter circuit 412 for receiving the data end points, length and angle or spline control points) of a line represented as a vector and constructing an equivalent raster image; and an image decoder circuit 414 operable to perform the inverse decoding process to that 20 applied by the image encoder 410. The font library store, vector/raster converter 412 and image decoder 414 are each arranged to write image data to the frame store 406 under control of an address circuit 416.
Referring to Figure 6, the operation of the 25 apparatus of Figures 2 and 5 will now be described in greater detail greater detail.
In step 500 when an incoming facsimile call is detected via the satellite 16 and earth station 14, the deformatter circuit 400 supplies the compressed text to the decompressor 409, which outputs the text character codes making up the or each text area to the input ports of the font library store, which functions essentially as a look up table, to output the corresponding character image to the area of the frame store 406 determined by the address circuit 416, in steps 504 and 506. When all character data has thus been written to the frame store 406, in step 508, Iine data present is passed to the vector to raster converter circuit 412, which calculates, for each line segment, a corresponding raster image and writes the raster image to the frame store 406 in steps 510 and 512.
After all such line segments have been reconstructed in the frame store, each block of image data is passed from the deformatter 400 in step 514, to the image decoder circuit 414, where it is decoded in step 516 and written to the frame store 406 in step 518.
After all character, line and image data have obeen reconstructed in the frame store 406, referring 25 to Figure 6b, the fax image coder 403 encodes the image in the frame score 406 in step 520, and in steD 522 it is transmitted to the called facsimile device under control of the transmission control circuit 404.
The transmission control circuit 404 determines whether the transmission has been successful and, in the event of success, transmits back (via the receive control circuit 402) a success acknowledgment message in step 526 via the satellite 16 and earth stations 14, 18 to the fixed FIU In the event that the transmission is unsuccessful (for example because the called facsimile device 10 is not functioning or has run out of paper) a failure message is sent in step 528, via the mobile earth station 14, satellite 16 and fixed earth station 18 to the fixed FIU 20. Thereafter, the step 500 is reDeated.
DETAILS OF COMPRESSION OCR Circuit 106 (Fic. 3) Referring to Figures 7 and 8, the OCR circuit 106 comprises a central processor 600; a number of character memories 602a-602c; and a number of word memories 604a-604c. Each of the character memories 602 stores bitmap template representations of the characters of a particular language in one or more 25 fonts; for example, 602a may contain the Roman character set used in English; 602b may contain the 22 Japanese Kanji, Katakana and/or Hiragana character sets and 602c may contain image representations of the Chinese character set.
Each of the word stores 604 contains a dictionary of commonly occurring words in a language; each word store is therefore associated with one of the character stores 602 by use of the same language.
In general terms, the central processor unit 600 is operable to read (step 700) the entire image held in the frame store 104, and to compare portions of the image with the characters held in the character store 602a (step 704). The CPU 600 may be a conventional microprocessor operating in accordance with a stored program. For example, for recognition of the English language, the stored program may be the Wordscan plus (TM) program supplied by Calera Recognition Systems Inc. 475 Potrero Avenue, Sunnvvale, CA 94086, USA, which includes files of data providina the contents of the character store 602a and word store 604a (which i' 20 are provided by areas of a single RAM or disc memory device) For positions of the image where a character is recognised, the CPU 600 generates the following data: 1. a code an ASCII code for Latin characters) indicating the identity of the character; 2. a confidence factor indicating the degree of 23 similarity between the area of the image and the stored representation or template of the character in the character store 602; 3. an X, Y position within the image of the character; 4. (preferably) an indication of the size (i.e.
pitch) of the character; (optionally) an indication of the font of the character.
Where the average level of confidence in character recognition is low, this may indicate that the language is not one to which the character templates stored in the template store 602a correspond. In this case (step 706) the CPU 600 selects a different character store 602b (step 708) which utilises a different set of characters (for example, the Japanese alphabet).
Once a relatively high level of confidence has *e* been found, from the above data, the CPU 600 is arranged, by utilising the character pitch and position data, to determine the association of characters in lines defining words, lines and paragraphs (step 710). The CPU 600 is then operable to compare each group of characters recognised as a 0 25 word with the words stored in the word store 604a, and to count the number of misrecognised words as a percentage of the total number of words.
If (step 712) the proportion of misrecognised words is high, it is likely that the wrong language is being used, although the character set is largely correct. For instance, the word store may be applying English words to a text which is in Swedish. If this is the case, in step 714 a new character set store 602b is selected, which overlaps substantially with the first character set Swedish or French is selected rather than English), and step 704 is repeated.
On the other hand, if there is a high percentage of recognised words, the language is assumed to be correct. The CPU 600 then determines the boundaries of the rectangular boxes surrounding each text area; specifically, the boundaries surrounding each paragraph of text and preferably additionally each line of text. These are then supplied to the segmenter 112 as an initial estimate of the boundaries of the text areas.
However, it is possible that the optical character recognition circuit 106 will recognise graphic characters as text; for example, a horizontal straight line may be recognised as a series of underlined spaces, or a vertical straight line may be recognised as a vertical sequence of "I"s.
Accordingly, the text areas boundaries determined in step 716 are reviewed by the segmenter, as described in greater detail below, which supplies revised text boundaries to the CPU 600 in step 720.
The CPU 600 then determines whether any previously recognised characters now lie wholly or partially outside the new text boundaries, and if so discards those characters in step 722. Then, in step 724, the CPU 600 supplies character data to the formatter 114, comprising the sequence of character codes; the coordinates of paragraph rectangles encompassing the characters; and, preferably, pitch and spacing information for each character.
In this embodiment, the CPU 600 also signals the recognised language to the customer data signalling circuit 120, from which the recognised language is signalled to the database station 300.
Text Compression Circuit 107 (FiQ. 3) The text compressor 107 comprises, in this embodiment, a central processing unit 610) applying a text compression algorithm such as the well known Lempel-Ziv or LZW algorithm using the PKZIP program available freely) to reduce the redundancy in areas of text, and output a series of 25 symbols representing the text in compressed form.
2G Graphics Coder 108 (Fia. 3) Referring to Figure 9, the graphics encoder 108 comprises a central processing unit 610 such as a suitably programmed microprocessor or microcontroller.
Referring in Figure 10a, in one embodiment the CPU 610 is arranged to receive a raster image of a portion of the contents of the frame store 104, and to approximate lines in the image by a sequence of straight vectors defined by a vector length and a vector angle (or, alternatively, by vertical and horizonal offset distances). Thus, the sigma shape shown in Figure 10a is replaced by three vectors.
Further details of a suitable processing algorithm are to be found in "A fast parallel algorithm for thinning digital patterns" T.Y. Zhang and C.Y. Suen, Communications of the ACMr, Volume 27, No. 3, March 1984, pages 236-239.
In a further embodiment, the graphics coder 108 is arranged instead to fit a spline curve, such as a 20 Bezier curve, defined by knots consisting of pairs of ~control points X 2
Y
2 as shown in figure Details are to be found in, for example, "An introduction to splines for use in computer graphics and geometric modelling", Bartels, Beatty and Barsky, 25 published by Morgan Kaufman, ISBN 0-934613-27-3.
Imace Coder 110 (Fig. 3) The image coder 110, comprises, in this embodiment, a transform encoder applying a two dimensional spacial transform such as the discrete cosine transform (DCT) or Hadamard or Walsh transforms.
Conveniently, in this embodiment, the image coder comprises a digital signal processor device arranged to execute the joint picture expert group (JPEG) compression algorithm, in which blocks of the image contained in the frame store 104 are subjected to a discrete cosine transform, and the transform coefficients are then quantised and run length encoding is performed to encode the runs between non zero transform coefficients; circuits for performing the JPEG algorithm are widely commercially available.
Seomenter 112 (Fig. 3) The segmenter 112 performs the task of allocating S. areas of the image in the frame store 104 to be coded by the optical character recognition circuit 106, the graphics coder 108 and the image coder 110. The segmenter therefore comprises a suitably programmed microprocessor device, operable to segment the image into text, simple (line) graphics and complex graphics areas with the following criteria in mind, in order of 25 importance: 1. the text areas should include all recognisable text; 2 the text areas should not include simple graphics, such as the lines making up boxes and tables with which text is associated; 3. the text, simple graphics and complex graphics ares should be partitioned so as, to the greatest extent possible, the areas are separated by boundaries of white space.
The optical character recognition circuit 106 will correctly recognise text, but may (as discussed above) also misrecognise graphics or images as text.
The graphics encoder 108 will encode text as simple graphics (although at a lower compression efficiency than the OCR circuit 106), but will not efficiently encode complex images.
The image encoder 110 will encode text and simple :i graphics, but with lower compression efficiencies than the optical character recognition circuit 106 or the .graphics coder 108.
Thus, in general, the segmenter 112 permits the 20 coding circuits to attempt to encode areas of the image in order of their coding efficiency the OCR coder first, the graphics coder second and the image coder third in this embodiment).
To segment between simple graphics areas to be 25 encoded by the graphics coder 108 graphs) and complex graphics areas to be encoded by the image coder 110 photographs), a simple test based on the image density may be employed; areas of the image which consist largely of white background with relatively few black pixels are relatively likely to be efficiently encoded by the graphics coder 108, whereas areas with black pixel content of, say, 30% or higher are likely to be more efficiently encoded by the image coder 110.
The segmenter 112 may perform this test simply by examining the run length codes -received prior to decoding by the fax image decoder 102, or may do so on the basis of an examination of the image stored in the frame store 104 (for example by creating vertical and horizontal density histograms), or may do so by permitting the image coder 110 to encode the entire image in the frame store 104 and examining the Stransform coefficients for each block of the image to determine whether the lower order transform "coefficients (corresponding broadly to the overall 20 image density) have relatively high values.
In determining the boundaries between text areas and simple line) graphics areas, the segmenter 112 adopts, in the first instance, the text area boundaries supplied by the OCR circuit 106. The 25 secmenter 112 then performs the following additional tests: 1. The confidence factors for character and word recognition at positions just inside and just outside the text boundaries are reviewed, relative to the average confidential factor across the text boundary.
If low character confidence factors are found just within one boundary, the segmenter 112 determines whether, by moving the boundary in by one character pitch, the word confidence factors at that boundary are improved, and if so, the boundaries move in.
If a large number of words are no: recognised at one boundary, the segmenter 112 moves the boundary outward by one character width, to accept further characters which had previously been rejected because of low recognition confidence. If this leads to an increase in the number of recognised words at the boundary, the segmenter 112 retains the increased boundary.
2. In order to prevent the mis-recognition of a vertical line (for example forming the wall of a box 20 or table enclosing text) as characters, the segmenter 0* 112 reviews the character codes produced by the OCR circuit 106, and detects any occurrence of or 1 characters disposed vertically or approximately vertically above each other in the image ("vertically" 25 here refers to the orientation of the text detected by the OCR circuit 106, rather than to the dimensions of the image itself).
In the event of detection of such a sequence of characters, the segmenter 112 instructs the graphics coder 108 to encode at least the corresponding area of the image (and, conveniently, the whole image) as graphics. The segmenter 112 determines, from the output of the graphics coder 108, whether the graphics coder has encoded a continuous line or series of connected lines at the same position as the OCR circuit has recognised a vertical sequence of characters and, in the event that the graphics coder output indicates a continuous vertical line, the segmenter 112 instructs the OCR circuit 106 to break the text area into two subsidiary text areas, one at 15 either side of the recognised vertical sequence of characters, and to delete the vertical sequence of characters themselves, which are then designated as part of a graphics area lying between the two text areas, to be encoded by the graphics coder 108.
20 3. A possible confusion can arise between long underlinings in a text area, and horizontal lines close to the text area. Therefore, the segmenter 112 reviews the characters produced by the OCR circuit 106 to detect: lengthy sequences of underline characters which are underlining a blank space; and lower case characters which cross an underline v, p, q).
The former category are likely to denote a graphics line to be encoded by the graphics coder 108, whereas the latter are likely to indicate that the line is probably an underline, which should not be encoded by the graphics coder 108. Accordingly, if the segmenter notes underlined lower case, line crossing characters, then any horizontal lines at the same vertical position in the image ("vertical" again referring to the co-ordinate axes of the text determined by the OCR circuit 106) are treated as underlines and processed by the OCR circuit as part of the text area. On the other hand, where the segmenter 15 circuit 112 notes lengthy underlines in free space, without other characters crossing the line or being detected as in the same vertical position as the line, the segmenter circuit 112 assumes that the line is to be encoded as an image (for example because it is a 20 horizontal demarcation line of a table) and instructs the OCR circuit 106 to split the text area into two around the underline characters. The underline characters are then deleted from the text stream and the area between the two text areas is treated as a graphics area and encoded by the graphics encoder 108.
Thus, in sum, the segmenter 112 applies specific rules to deal with potentially or partially overlapping areas of the simple graphics and text, so as (insofar as possible) to ensure that tables and boxes are treated as graphics with text areas within.
Where the mode is determined to be text only, the segmenter 112 may omit these text/graphic discrimination operations, and may merely determine the size of the text areas by examining the confidence factors and word areas around the boundaries of the text areas determined by the OCR circuit 106 as described above.
Formatter 114 (Fic. 3) The formatter 114 is operable :o receive the text, graphics and image data from the OCR circuit 106 and text compressor 107; graphics coder 108 and image coder 110, and to generate a serial data stream as shown in Figure 11, comprising a header portion 750 (consisting of conventional facsimile signalling data, identifying the message as being a compressed 20 facsimile message, and identifying the type of compression used text only; text and graphics; or text, graphics and image).
Also provided is a text portion 760 (or, more normally, one text portion 760 for each of several areas of text identified in the document) comprising a text header portion identifying the co-ordinates of 34 the text area, and a character portion 764 comprising the sequence of characters occupying the text area.
In text and graphics, or text, graphics and image modes, a graphics portion 770 is also present, comprising co-ordinate data portion 772 defining the co-ordinates of the graphics area, and vector graphics data 774 providing cartesian or polar co-ordinates for recreating line vectors as described above. Finally, one or more image data fields 780 are present in text, graphics and image compression mode, comprising image area co-ordinates 782 and image data 784 consisting, in this embodiment, of run length coded sets of transform co-ordinates.
Preferably, the header portion 750 in this Si: 15 embodiment specifies the number of text, graphics and image areas in the remainder of the message. The message is produced by the formatter 114 as a single message in this embodiment, but it could occupy a frame, cell or packet format in other applications.
S 20 Customer Data Sinallinc Circuit 120 ?Fic. 3) The customer data signalling circuit comprises a signalling link circuit (for example using signalling system 7 (SS7)) connected via a signalling link to the database 300, which sends data request messages, and receives messages specifying one or more the following: 1. preferred language(s) 2. preferred compression mode 3. forwarding time(s) 4. forwarding interval(s) 5. message erasure time.
Transmission control circuit 118 (FiQ. 3) The transmission controller 118 comprises a suitably programmed microprocessor, which is operable to perform the process of steps 236-252 (Figs 4b 4c).
Receiver controller 100 (Fic. 3) The receiver controller 100 may be a suitably programmed microcontroller. In addition to performing the usual facsimile signalling functions necessary to 15 set up a call (available on a variety of commercially sold chip products), the receiver control circuit 100 is arranged also to transmit back the message to the calling facsimile machine in step 246 when the compressed facsimile image has been stored. The message comprises a stored facsimile message, into which are inserted facsimile image data corresponding to the following: 1. the date and time of reception of the facsimile message at the FIU 2. the called party number 3. the compression mode, and 36 4. Any status in-ormation that is available on the called Party.
For example, when received, such a message might read as follows: YOUR FACSIMILE M~ESSAGE TO 00 44 171 222 4660 ON 12 OCTOBER 1995 AT 15.45 PM HAS ERED A FACSIMILE STORE AND FORWARD SERVICE, RATHER THAN BEING DIRECTLY TRANSMITTED TO THE RE-CIPI-ET.
AT pRESENT, THE RECIPIENT1 IS UNAVAILABLE.
HAS REQUESTED THAT~ THIS FACSIMILE BE COMPRESSED So THAT ONLY TEXT WILL BE
TRANSMITTED.
T.HrE NEXT_ ATTEMPT TO 7RANSM-T THIS3 FACSIMILE WILL BE AT 15.46 ON 12 OCTOBER 1995.
Fi-gure 12 shows the process of operation of the receiver controller 100.
Insten 8OC, -his message 'Js generated4 and premared for transmission.
in sten 802, rather than disconnecting the line 20 after the end of the fax message has been received from tne calling terminal 24, in the post message procedures specif:ied in CCITT.! standard T30, the rece-ve Control ci.rcuit: 100 sends back a -orocedural 4nzer-_uz mositive (PIP) sianal to the facsimile machi-ne 24, which- causes thefacsimi4le machine 24 to re-enter the pre-message procedure of CCITT standard (step 802).
In step 804, the receive control circuit 100 determines whether the calling facsimile apparatus 24 has re-entered the pre-message procedure; this is necessarily the case since some few facsimile machines do not fully comply with specification In the event that the facsimile apparatus 24 does not re-enter the pre-message procedure, in step 806 the receive control circuit 100 terminates the call, and determines whether, in step 206, a caller ID signal was received from the network. A caller ID signal will be available in most of North America and Western Europe, and comprises a 300 bit/s frequency shift keyed (FSK) signal between the first and second rings during the initial ringing cycle (step 808) Where no caller ID was received on call set up, the receive controller 100 determines whether during 9** the call set up procedure the calling facsimile apparatus 24 supplied a transmitting subscriber identification (TSI) field. This field is optionally according to recommendation T30, but where it is filled it contains the telephone number of the calling party (although this telephone number is not 25 necessarily up to date or correct).
On the first positive outcome of step 808 or 810, in step 812 the receive controller 100 sets up a new call (step 812) to the calling facsimile apparatus 24, and in step 814 transmits the reply facsimile.
Likewise, if in step 804 the calling facsimile apparatus 24 successfully re-initiates the call set up procedure, without the previous call being disconnected, the receive controller 100 sends the reply facsimile in step 814.
Thus, in this embodiment, to sum up, the receive control first attempts to transmits a reply message to the calling facsimile without terminating the call from the calling facsimile; and if this is unsuccessful, it firstly attempts to make use of caller ID information on the called number supplied by 15 the telecommunications network 22 (which, where 0.: available, is reliable) and, failing this, makes use of the calling telephone number recorded in the TSI field (which is not always present and may not always be reliable) 20 Thus, the transmitting facsimile 24 is alerted to the fact that the fax message has not been forwarded directly to the destination facsimile DETAILS OF DECOMPRESSION *6 The operation of the decompressor circuit in
S
detail will be obvious from the foregoing. In particular, the co-ordinate data received in the field 39 762, 772, 782 is used by the address circuit 416 to allocate the correct addresses in the frame store 406.
The character data received in the text portion 764 is decompressed and supplied to the font library store 410, the character representations of which are then written to the correct addresses in the frame store 406 addressed by the address circuit 416.
The text decompressor 409 operates the inverse of the compression algorithm (for example, it may comprise a programmed CPU operating the PKUNZIP decompression program, which is widely available).
The vector/raster converter circuit 412 comprises a programmed microprocessor which receives the coordinate data in field 774 and creates a raster image by setting pixels on the line defined by the coordinate data, and resetting all other pixels.
The image decoder 414 comprises, firstly, a transform decoder operating the reverse algorithm to the transform coder circuit the JEG decoder algorithm) to reconstitute transform co-efficients, and then perform the inverse transform to recreate an array of pixel values.
Secondly, since these pixel values are in general not binary valued but multi bit, the image decoder 414 is then arranged to perform a thresholding process on each pixel value to generate a one or zero value, which is used to set the corresponding pixel in the frame store 406 selected by the address circuit 416.
The thresholding process may simply se: the value of each pixel depending on whether it is above or below a predetermined threshold, but preferably a more sophisticated thresholding process is used such as dither processing or error diffusion processing (details of which are well known in the crinter art and need not be discussed here further), in which different pixels have different thresholds, and the thresholds vary either randomly, or in dependence upon the values of surrounding pixels.
Other erbcdiments or modifications In the above embodiment, the OCR circuit 106 applies languages in a predetermined order; generally, English first, and then if certain characers are recognised, other languages sharing a sub set of the English alphabet (in a predetermined order); and if not, other languages with dissimilar charac er sets in a predetermined order (for example, Japanese and then Chinese) However, in a further embodiment, the customer database 300 may contain a field indicating an order of languages specified by the customer, in particular 25 the customer associated with the called party facsimile so that if a particular customer communicaces principally or exclusively in Japanese, then Jacanese is the first 4i language to be attempted. This order of preference for languages is then communicated to the FIU 20 via the customer data signalling circuit 120.
The entry in the database 300 may be created either directly (on a request from the user of the facsimile apparatus 10 or an electric signal therefrom) or may be kept updated on the basis of use.
In the later case, the customer data signalling circuit 120 signals, for each received facsimile call, the language which the OCR circuit 106 has determined to be the main language of the received facsimile, and the customer database station 300 maintains a running ranging of the languages thus received from facsimile interface units 20 in order of their frequency.
15 In a modification of this embodiment, the customer database station 300 may even contain information supplied by the customer in the form of a particular supplemental dictionary of words commonly used by the customer, all characters or symbols 20 commonly used by the customer or person attempting to communicate with him. These may then be downloaded :i via the customer signalling circuit 120 to the FIU on reception of a facsimile destined for the customer in question. The same naturally applies to any customers associated with callinc facsimile terminals 24 who register the order of preference of languages 4 2 they themselves transmit in.
In the above described embodiments, the data compression is performed at a facsimile interface unit which is separate of an earth station. However, it will be apparent that the facsimile interface unit could be integrated into the earth station 18.
Alternatively, any data compression could be performed centrally, separately of the facsimile interface units 12 which would in this case merely forward a received facsimile call to the central store and forwarding station which would encode the call and attempt to call the mobile facsimile terminal Although separate components have been described in the above embodiment, it will be apparent that many 15 of the processing functions could be combined into a smaller number of processors or even a single suitably programmed digital processor performing all the functions of the apparatus of Figures 2, 3 and/or Although vector compression has been described 20 for graphics processing in the above described embodiment, it will be apparent that other methods will be used; for example, the graphics coder 108 could perform the process of modification of the width .of the probably density function of the facsimile signal which is described in "Facsimile compression for transmission over 800 bit/s to 2400 bit/s digital 43 mobile channels", Dimolitsas and Corcoran, 1990 MILCOM IEEE, pages 0502-0505, (23.2.1.-23.2.4) or in "Compression of facsimile graphics for transmission over digital mobile satellite circuits" by the same authors, in MILCOM '91, 1991 IEEE, pages 0644-0647 (30.1.1-30.1.4).
Although in the above described embodiments an attempt is made to transmit to the called terminal at periodic points in time, in a further embodiment, a network control centre (not shown) monitors any attempt by the remote terminal 10 to initiate a communication, and on detection of such an attempt, sets up a call to the remote terminal 10 to transmit a message indicating that stored facsimile messages are awaiting transmission. The network control centre may then, either on receipt of a signal from the remote terminal or automatically, set up a connection to the remote terminal to forward the stored facsimile message(s) thereto.
20 Many other modifications or embodiments will be apparent to the skilled reader. The present invention encompasses any and all such modifications substitutions and alternatives. Furthermore, it will be understood that protection is sought hereby for any and all novel subject matter contained herein, and sub combinations thereof.
P:\OPER\DBW\73238-96 div.doc-13 Decmber. 2000 44 Reference is made to co-pending International application (Agent's Reference J23600 WO, claiming priority from GB 9522487.9 and GB 9604864.0), the contents of which are incorporated herein by reference.
Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
The reference to any prior art in this specification is not, and should not be taken as, an acknowledgment or any form of suggestion that that prior art forms part of the common general knowledge in Australia.
o
Claims (4)
- 2. A facsimile interface unit having a first port for connection to a terrestrial network and a second port for connection to a satellite earth station, the facsimile interface unit comprising a compressor for compressing a facsimile signal received at said first port for an a eetrcici o transmission at said second port, and a selector circuit for 20 selecting the compression applied by said compressor circuit in dependence upon the identity of at least one of the called and calling parties.
- 3. A unit according to claim 2, in which the selector circuit comprises a signalling circuit for communication with a database.
- 4. A unit according to claim 2, in which the selector circuit comprises a database containing records of called and/or calling parties. 4 P:\OPER\RAB\72225-OOrespons l.doc06/I2M2 -46- A method of interfacing a facsimile signal to a radio transmission system substantially as hereinbefore described with reference to the accompanying drawings.
- 6. A facsimile interface unit substantially as hereinbefore described with reference to the accompanying drawings. DATED this 6 th day of December 2002 Inmarsat Ltd By its Patent Attorneys DAVIES COLLISON CAVE
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU72225/00A AU757593B2 (en) | 1995-11-02 | 2000-12-13 | Image communications |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9522487 | 1995-11-02 | ||
AU73238/96A AU724136B2 (en) | 1995-11-02 | 1996-11-04 | Image communications |
AU72225/00A AU757593B2 (en) | 1995-11-02 | 2000-12-13 | Image communications |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU73238/96A Division AU724136B2 (en) | 1995-11-02 | 1996-11-04 | Image communications |
Publications (2)
Publication Number | Publication Date |
---|---|
AU7222500A AU7222500A (en) | 2001-03-15 |
AU757593B2 true AU757593B2 (en) | 2003-02-27 |
Family
ID=3755645
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU72225/00A Ceased AU757593B2 (en) | 1995-11-02 | 2000-12-13 | Image communications |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU757593B2 (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB256558A (en) * | 1925-01-30 | 1926-07-30 | Adrian Francis Sykes | Improvements in methods for producing and duplicating sound records |
EP0483969A2 (en) * | 1990-09-28 | 1992-05-06 | Canon Kabushiki Kaisha | Image communication system |
WO1995027363A1 (en) * | 1994-04-04 | 1995-10-12 | Motorola Inc. | Facsimile communication with selective call receivers |
-
2000
- 2000-12-13 AU AU72225/00A patent/AU757593B2/en not_active Ceased
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB256558A (en) * | 1925-01-30 | 1926-07-30 | Adrian Francis Sykes | Improvements in methods for producing and duplicating sound records |
EP0483969A2 (en) * | 1990-09-28 | 1992-05-06 | Canon Kabushiki Kaisha | Image communication system |
WO1995027363A1 (en) * | 1994-04-04 | 1995-10-12 | Motorola Inc. | Facsimile communication with selective call receivers |
Also Published As
Publication number | Publication date |
---|---|
AU7222500A (en) | 2001-03-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0682839B1 (en) | Apparatus for transmitting and receiving encoded data | |
JP2844923B2 (en) | Facsimile communication with selective call receiver | |
EP0858707B1 (en) | Communication of images via a radio transmission channel | |
AU728954B2 (en) | Image communications | |
WO1996003769A2 (en) | Method and apparatus for fax messaging in a selective call receiver system using multiple code-book data compression | |
US5710639A (en) | Scan line compressed facsimile communication system | |
JP2611012B2 (en) | Character separation encoding method | |
US6125207A (en) | Encoded facsimile communication with a selective system and method therefor | |
JPH0537700A (en) | Facsimile equipment | |
US5253936A (en) | Data communication apparatus and data communication method | |
AU757593B2 (en) | Image communications | |
AU724136B2 (en) | Image communications | |
US6819455B2 (en) | Image transmission apparatus | |
JP3376095B2 (en) | Image communication device | |
WO1992011615A1 (en) | Facsimile paging system | |
JPH08125869A (en) | Method and device for image communication | |
KR0128667B1 (en) | Mh code decoding device | |
WO1997007626A1 (en) | Facsimile communication with a selective call system and method therefor | |
JP2578414B2 (en) | Data communication device | |
JPH06205197A (en) | Facsimile equipment | |
JPH03259663A (en) | Facsimile equipment | |
JPH03210823A (en) | Coding/decoding method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGA | Letters patent sealed or granted (standard patent) |