EP1106014A1 - Method and apparatus in transmission of images - Google Patents

Method and apparatus in transmission of images

Info

Publication number
EP1106014A1
EP1106014A1 EP99931683A EP99931683A EP1106014A1 EP 1106014 A1 EP1106014 A1 EP 1106014A1 EP 99931683 A EP99931683 A EP 99931683A EP 99931683 A EP99931683 A EP 99931683A EP 1106014 A1 EP1106014 A1 EP 1106014A1
Authority
EP
European Patent Office
Prior art keywords
image
receiver
bit stream
regions
psl
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99931683A
Other languages
German (de)
French (fr)
Inventor
Magnus JÄNDEL
Mathias Larsson
Charilaos Christopoulos
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/167 Position within a video image, e.g. region of interest [ROI]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/37 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability with arrangements for assigning different transmission priorities to video input data or to video coded data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/63 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets

Definitions

  • the present invention relates to a method and to an arrangement for coding and extracting regions of interest (ROI) in the transmission of still images and video images.
  • the method and the arrangement are particularly well suited for transform-based coders, such as wavelets and DCT.
  • In transmission of digitized still images from a transmitter to a receiver, the image is usually coded in order to reduce the number of bits required for transmitting the image.
  • bit quantity is usually reduced, because the capacity of the channel used is limited.
  • a digitized image, however, consists of a very large number of bits.
  • transmission times will be unacceptably long for the majority of applications if it is necessary to transmit every bit of the image.
  • Lossless methods, i.e. methods exploiting the redundancy in the image in such a manner as to enable the image to be reconstructed by the receiver without loss of information.
  • Lossy methods, i.e. methods that exploit the fact that not all bits are equally important to the receiver.
  • the image received is not identical to the original but looks sufficiently like the original image to the human eye, for instance.
  • certain parts of the transmitted image are of more interest than the remainder of the image, and better visual quality of these parts of the image is therefore desired.
  • such a part is usually called the region of interest (ROI).
  • Applications in which this can be useful include, for example, medical databases or the transmission of satellite images.
  • the present invention addresses the aforesaid problem of defining and transmitting regions of interest and background regions of mutually different qualities in the transmission of images.
  • the basic concept of the invention in solving the problem is to transform the image and to define in said transform a mask that corresponds to the regions of interest and to the background regions.
  • the region definition and the image transform are transmitted to a receiver capable of recreating the image with the quality desired in the predetermined regions.
  • the solution involves dividing the image into the desired regions.
  • the image is then transformed to some type of transform coefficients.
  • a mask corresponding to the separate regions in the image is defined in the transform domain and the coefficients classified and assigned to different segments in accordance with the mask definition.
  • the segments thus belong to the corresponding regions in the image.
  • the segments and the coefficients are transmitted in a compressed state to a receiver that is capable of reproducing regions in the image on the one hand and of reproducing the actual image on the other hand with the desired image quality in the various regions.
  • One advantage afforded by the invention is that several different regions of interest can be defined.
  • Another advantage is that different regions can have several different degrees of image quality. Still another advantage is that only those parts of the image that are of vital interest to the user need be decoded, while avoiding decoding of the whole of the image.
  • Figure 1 is a block schematic illustrating an inventive arrangement.
  • Figure 2 is a flow chart illustrating part of an inventive method.
  • Figure 3 is a flow chart illustrating a further part of an inventive method.
  • Figure 4 is a diagram illustrating classification of transform coefficients.
  • Figure 5 is a diagram for interlinking image segments in a bit stream.
  • Figure 6 is a view of an image with objects.
  • Figure 7 is a graphic representation of the topology in Figure 6.
  • Figure 1 is an overview of an arrangement for coding and transmitting images.
  • An image 3 of an object is stored in digital form in a digital camera 1, and the image is presented on a screen 4.
  • the screen is connected to a computer 2 which is programmed to divide the image 3 into objects or regions, of which a background region R1 and regions of interest R2 and Rn are shown.
  • An image coder 5 in the computer 2 wavelet-transforms the image, while simultaneously compressing the image, and generates a compressed bit stream PS1.
  • An operator at the image screen 4 defines the regions of interest R2 and Rn.
  • the image coder includes means for creating a mask PS2 in accordance with the regions and defines separate parts, segments, of the bit streams with respect to the corresponding regions R1, R2 and Rn, with the aid of said mask.
  • the definition also enables the regions R1, R2, Rn in the form of said separate segments in the bit stream PS1 to be coded to different degrees of accuracy.
  • a transmitter 6 sends the bit stream, including the definition of the positions and shapes of the regions R2 and Rn, to a receiver 7 which is connected to a computer that includes an image decoder 8.
  • the decoder decodes the bit stream PS1 and reproduces the mask definition PS2 and presents the image on an image display screen 9.
  • the accuracy of the background R1 is relatively poor, whereas the regions R2 and Rn each have a higher degree of accuracy.
  • a segment is defined here as all of the coefficients in the transform domain that belong to a given object or the background in the image. The segment can then be divided further into subsets.
  • a subset is defined here as a number of coefficients in a part of the transform domain (e.g. a subband in the case of the wavelet transform) which is required for the reconstruction and which belongs to a segment in the digitized image, see Figure 4.
  • the coefficients are classified and can be assigned to individual segments.
  • the segments are coded independently of one another to different levels of accuracy, which yields a bit stream for each segment. These segments are then joined together.
  • the inventive encoding method will be described with reference to Figure 2.
  • the digitized image 3 to be transmitted presents the background R1 and the regions of interest R2 and Rn.
  • the following procedural steps are carried out:
  • Perform a transformation of the image 3 according to step 21.
  • this transformation is performed with a wavelet transform or with a discrete cosine transform (DCT).
  • Create a mask according to step 22 with the aid of information as to how the digitized image 3 shall be divided into the background R1 and the objects R2 and Rn.
  • the techniques described in Swedish Patent Applications SE 9703690-9 and SE 9800088-8 can be used to this end.
  • the mask is created in the transform domain and describes which coefficients are required to reconstruct the different objects or the background. Different segments SG1, SG2 and SGn correspond to the background R1 and the objects R2 and Rn.
  • Code the segments independently of one another, according to step 24. This gives the number of bits needed for each subset.
  • Concatenate the subset streams together with necessary substream information and header information, according to step 26. This requires a bit stream description, given below.
  • the method enables the receiver to have immediate access to any parts of the image when so desired, as shown in Figure 3. This is possible because the information as to where different parts are found in the bit stream is known.
  • Find and decode the required segment information, step 32.
  • a pointer is a set of symbols that defines the position of a bit or a byte in a bit stream or a file. Many ways of defining a pointer have been defined in computer science. Any one of these methods can be used here. A pointer can be defined implicitly by a specific bit stream composition rule. A pointer can be defined relative to an explicitly or implicitly determined position. A simple way of defining a pointer is to determine the number of bits between the requested position and a known reference point, such as the first bit in the bit stream, for instance.
  • the topology descriptor, TOP, is a set of symbols that defines the topological relationship between numbered objects and shapes. This is illustrated in Figure 6, in which four objects O1, O2, O3, O4 and four shapes S1, S2, S3 and S4 are shown.
  • the topology of the image can be represented, e.g., as a tree graph as shown in Figure 7.
  • the nodes and the edges of the tree graph can be coded in a data structure using well known methods.
  • p_TOP is a pointer to a topology descriptor.
  • a shape descriptor, Si, defines the appearance of a closed boundary line of an object.
  • the shape number, i, is given by a topology descriptor.
  • Many different shape coding techniques can be used. Examples of such methods are chain coding and shape coding methods in MPEG-4.
  • Shape descriptors can be decoded independently of one another once their respective positions in the bit stream are known.
  • p_Si is a pointer to a shape descriptor.
  • a segment descriptor, Ti, is a compressed set of symbols that encodes a segment as described above.
  • the segment includes an ordered set of subsets.
  • the object number, i, is given by a topology descriptor.
  • p_Ti is a pointer to a segment descriptor.
  • a subset descriptor, Bij, is an independently decodable subset, j, of a segment descriptor, Ti, which describes, e.g., the coefficients that belong to a given subband, j, as described above.
  • p_Bij is a pointer to a subset descriptor.
  • several segment descriptors, {Ti, Tj, Tk ...}, can be multiplexed into a common data structure MT(i,j,k).
  • the data structure, MT, is called a multiplexed segment descriptor.
  • p_MT is a pointer to a multiplexed segment descriptor.
  • MT(i,j,k) = {Bi0, Bj0, Bk0, Bi1, Bj1, Bk1, Bi2, Bj2, Bk2 ...}
  • the order of the symbols corresponds to the order in the bit stream 51, with symbols on the left being sent first. Subsets in a multiplexed stream may be excluded if they are known by the decoder.
  • the stored bit stream or file structure should preferably include at least the following components:
  • a group of segment descriptors with index {k,l,m...} can optionally be replaced with a multiplexed segment descriptor MT(k,l,m...).
  • N is the number of stored objects.
  • the background is the object with index 0.
  • a server receives a request for sending image data to a client.
  • the image is stored with the server in the format described in the preceding passage.
  • Part of the stored data structures may have already been sent to the receiving terminal.
  • This section of the description describes a procedure for composing a bit stream with the server that handles the request.
  • a simple request contains the following information:
  • TOP is sent in response to a first request for image information.
  • Subset descriptors that describe the objects requested to the defined accuracy.
  • Subset descriptors that are already known to the decoder need not be sent. For instance, the user is aware of the subsets {Bk0, Bk1, Bk2, Bk3} belonging to segment k.
  • Subset descriptors {Bk5, Bk6, Bk7} must be sent when object k is requested to accuracy 7.
  • the original image is transformed with a wavelet transform.
  • a mask is then created in the transform domain.
  • This mask describes the coefficients that are required in the transform domain in order to reconstruct the region R52 and the background R53.
  • the created mask is then used to classify the coefficients in the transform domain in two segments, one segment for the region and one segment for the background.
  • the two segments are built up by a number of subsets. In the illustrated case, the number of subsets is the same as the number of subbands in the transform domain. The situation on hand is thus:
  • a segment descriptor Tr {Br,0, Br,1, ..., Br,no_subbands} and a set of subset pointers {p_Br,0, p_Br,1, ..., p_Br,no_subbands}.
  • a segment descriptor Tb {Bb,0, Bb,1, ..., Bb,no_subbands} and a set of subset pointers {p_Bb,0, ...
  • bit stream 51 4.
  • the subsets are combined in the manner shown in the upper part of Figure 5, with the sub-bit streams 52 of the region being taken alternately with the sub-bit streams of the background.
  • the TOP field is not required when the receiver is aware of the order in which the various parts of the image are set.
  • the combined bit stream is then sent to the receiver.
  • the decoder is now able to create the same mask as that described above.
  • the decoder creates the segments with the underlying subsets.
  • the decoder commences with decoding the combined bit stream and filling in the transmitted transform coefficients in the corresponding subsets.
  • the image is transmitted and reconstructed.
  • the aforedescribed is one way of using the proposed method.
  • Other methods may be to combine (mix) the bit streams in another way. For instance, as shown in the bottom part of Figure 5, the region R52 may be transmitted first, followed by the background R53.
  • Another example is one in which more than one region is found, as described with reference to Figure 6, wherewith these regions are combined in a number of different ways.
  • the proposed method has the added advantage of enabling shape information to be sent only when needed.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)

Abstract

An image (3) in digitized form shall be transmitted over a channel between a transmitter and a receiver. The channel has a limited bandwidth and the image has a less important background (R1) and also regions of particular importance, i.e. regions of interest (R2, Rn). The image is transformed into transform coefficients and compressed (21), and a mask corresponding to the regions (R1, R2, Rn) is defined in the transform domain (22). The transform coefficients are classified (23) and assigned to different segments (SG1, SG2, SGn) in accordance with the mask definition. These segments (24) are coded independently of one another to different degrees of accuracy, depending on the importance of corresponding regions (R1, R2, Rn) in the image (3). Coding results in sub-bit streams (25) which are linked together (26) with the image header (271, 272) to form a bit stream (27), which is sent to the receiver. The receiver decodes the image header and the segment information and reconstructs the mask in the transform domain, including shapes and positions of the regions (R1, R2, Rn). The image is then recreated with the aid of the mask to desired degrees of accuracy in respective regions. It is possible to define several regions (R2, Rn) with different degrees of image quality, and only those parts of the image that are of interest need be decoded.

Description

Method and Apparatus in Transmission of Images.
FIELD OF INVENTION
The present invention relates to a method and to an arrangement for coding and extracting regions of interest (ROI) in the transmission of still images and video images. The method and the arrangement are particularly well suited for transform-based coders, such as wavelets and DCT.
DESCRIPTION OF THE BACKGROUND ART
In transmission of digitized still images from a transmitter to a receiver, the image is usually coded in order to reduce the number of bits required for transmitting the image.
The number of bits is usually reduced because the capacity of the channel used is limited. A digitized image, however, consists of a very large number of bits. When such an image is transmitted over a channel of limited bandwidth, transmission times will be unacceptably long for the majority of applications if it is necessary to transmit every bit of the image.
Consequently, in recent years research has been directed to coding methods and techniques for digitized images with the object of reducing the number of bits necessary to transmit the images.
These methods can be divided into two groups:
Lossless methods, i.e. methods exploiting the redundancy in the image in such a manner as to enable the image to be reconstructed by the receiver without loss of information.
Lossy methods, i.e. methods that exploit the fact that not all bits are equally important to the receiver. Hence, the image received is not identical to the original but looks sufficiently like the original image to the human eye, for instance.
In some applications, certain parts of the transmitted image are of more interest than the remainder of the image, and better visual quality of these parts of the image is therefore desired. Such a part is usually called the region of interest (ROI). Applications in which this can be useful include, for example, medical databases or the transmission of satellite images. In some cases, it is also desired, or necessary, to transmit the region of interest loss-free, while the quality of the remainder of the image is of less importance. There are also occasions when it is required to extract the regions of interest from the bit stream and decode these regions of interest without needing to decode the image as a whole.
Swedish Patent Applications SE 9703690-9 and SE 9800088-8 both describe how a mask can be calculated for delimiting such a region of interest (ROI).
SUMMARY OF THE INVENTION
The present invention addresses the aforesaid problem of defining and transmitting regions of interest and background regions of mutually different qualities in the transmission of images.
The basic concept of the invention in solving the problem is to transform the image and to define in said transform a mask that corresponds to the regions of interest and to the background regions. The region definition and the image transform are transmitted to a receiver capable of recreating the image with the quality desired in the predetermined regions.
More specifically, the solution involves dividing the image into the desired regions. The image is then transformed to some type of transform coefficients. A mask corresponding to the separate regions in the image is defined in the transform domain and the coefficients classified and assigned to different segments in accordance with the mask definition. The segments thus belong to the corresponding regions in the image. The segments and the coefficients are transmitted in a compressed state to a receiver that is capable of reproducing regions in the image on the one hand and of reproducing the actual image on the other hand with the desired image quality in the various regions.
One advantage afforded by the invention is that several different regions of interest can be defined.
Another advantage is that different regions can have several different degrees of image quality. Still another advantage is that only those parts of the image that are of vital interest to the user need be decoded, while avoiding decoding of the whole of the image.
Yet another advantage is that the segments can be coded independently of each other.
The invention will now be described in more detail with reference to preferred embodiments thereof and also with reference to the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a block schematic illustrating an inventive arrangement.
Figure 2 is a flow chart illustrating part of an inventive method.
Figure 3 is a flow chart illustrating a further part of an inventive method.
Figure 4 is a diagram illustrating classification of transform coefficients.
Figure 5 is a diagram for interlinking image segments in a bit stream.
Figure 6 is a view of an image with objects.
Figure 7 is a graphic representation of the topology in Figure 6.
DESCRIPTION OF PREFERRED EMBODIMENTS
Figure 1 is an overview of an arrangement for coding and transmitting images. An image 3 of an object is stored in digital form in a digital camera 1, and the image is presented on a screen 4. The screen is connected to a computer 2 which is programmed to divide the image 3 into objects or regions, of which a background region R1 and regions of interest R2 and Rn are shown. An image coder 5 in the computer 2 wavelet-transforms the image, while simultaneously compressing the image, and generates a compressed bit stream PS1. An operator at the image screen 4 defines the regions of interest R2 and Rn. The image coder includes means for creating a mask PS2 in accordance with the regions and defines separate parts, segments, of the bit streams with respect to the corresponding regions R1, R2 and Rn, with the aid of said mask. The definition also enables the regions R1, R2, Rn in the form of said separate segments in the bit stream PS1 to be coded to different degrees of accuracy. A transmitter 6 sends the bit stream, including the definition of the positions and shapes of the regions R2 and Rn, to a receiver 7 which is connected to a computer that includes an image decoder 8. The decoder decodes the bit stream PS1 and reproduces the mask definition PS2 and presents the image on an image display screen 9. The accuracy of the background R1 is relatively poor, whereas the regions R2 and Rn each have a higher degree of accuracy.
The following definitions are given in order to assist in describing the inventive method:
- A segment is defined here as all of the coefficients in the transform domain that belong to a given object or the background in the image. The segment can then be divided further into subsets.
- A subset is defined here as a number of coefficients in a part of the transform domain (e.g. a subband in the case of the wavelet transform) which is required for the reconstruction and which belongs to a segment in the digitized image, see Figure 4.
As mentioned above, the coefficients are classified and can be assigned to individual segments. When this classification is made, the segments are coded independently of one another to different levels of accuracy, which yields a bit stream for each segment. These segments are then joined together.
The inventive encoding method will be described with reference to Figure 2. The digitized image 3 to be transmitted presents the background R1 and the regions of interest R2 and Rn. The following procedural steps are carried out:
1. Perform a transformation of the image 3 according to step 21. In the illustrated case, this transformation is performed with a wavelet transform or with a discrete cosine transform (DCT).
2. Create a mask according to step 22 with the aid of information as to how the digitized image 3 shall be divided into the background R1 and the objects R2 and Rn. The techniques described in Swedish Patent Applications SE 9703690-9 and SE 9800088-8 can be used to this end. The mask is created in the transform domain and describes which coefficients are required to reconstruct the different objects or the background. Different segments SG1, SG2 and SGn correspond to the background R1 and the objects R2 and Rn.
3. Use the mask to classify the transform coefficients as belonging to the different segments SG1, SG2, SGn, according to step 23.
4. Code the segments independently of one another, according to step 24. This gives the number of bits needed for each subset.
5. Concatenate the subset streams together with necessary substream information and header information, according to step 26. This requires a bit stream description, given below.
6. Send the concatenated bit streams 27. This includes shape data 271, bit stream information 272, subband 0 referenced 273 and subband 1 referenced 274.
The method enables the receiver to have immediate access to any parts of the image when so desired, as shown in Figure 3. This is possible because the information as to where different parts are found in the bit stream is known.
One method of how the decoder may work is described below with reference to Figure 3.
1. Receive the bit stream 27 and decode the header information required, according to step 31.
2. Find and decode the required segment information, step 32.
3. Create a mask in the transform domain, for instance with the aid of the technique described in said Patent Applications SE 9703690-9 and SE 9800088-8; step 33. The mask describes those coefficients that are required to reconstruct the desired objects or background.
4. Decode requisite segment data from the bit stream; step 34.
5. Reconstruct the requisite segments; step 35.
6. Decode and show the image; step 36.
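The encoding steps of Figure 2 and the decoding steps of Figure 3 can be illustrated with a deliberately small sketch. It is not the coder of the patent: it assumes a one-dimensional signal, a single-level Haar wavelet, one region of interest given as a sample interval, and a plain quantization step per segment as the "accuracy". The mask rule and all function names are hypothetical choices made for the illustration.

```python
def haar_forward(x):
    """One-level 1-D Haar transform; returns the (low, high) subbands."""
    pairs = [(x[2 * k], x[2 * k + 1]) for k in range(len(x) // 2)]
    return [(a + b) / 2 for a, b in pairs], [(a - b) / 2 for a, b in pairs]


def haar_inverse(low, high):
    """Inverse of haar_forward."""
    out = []
    for a, d in zip(low, high):
        out += [a + d, a - d]
    return out


def transform_mask(n_samples, roi):
    """Mask in the transform domain: coefficient k is needed for the region
    of interest if it depends on a sample inside roi = (start, stop)."""
    start, stop = roi
    return [any(start <= s < stop for s in (2 * k, 2 * k + 1))
            for k in range(n_samples // 2)]


def encode(x, roi, steps):
    """Transform, build the mask, classify coefficients into segments and
    quantize each segment with its own step size (assumed accuracy model)."""
    mask = transform_mask(len(x), roi)
    segments = {"roi": [], "background": []}
    for band in haar_forward(x):          # one subset per subband
        for name, wanted in (("roi", True), ("background", False)):
            subset = [round(c / steps[name])
                      for c, m in zip(band, mask) if m == wanted]
            segments[name].append(subset)
    return {"n": len(x), "shape": roi, "steps": steps, "segments": segments}


def decode(stream):
    """Rebuild the mask from the shape data, put each decoded coefficient
    back in its place and invert the transform."""
    mask = transform_mask(stream["n"], stream["shape"])
    bands = []
    for b in range(2):                    # low band, then high band
        readers = {name: iter(subsets[b])
                   for name, subsets in stream["segments"].items()}
        bands.append([next(readers["roi" if m else "background"])
                      * stream["steps"]["roi" if m else "background"]
                      for m in mask])
    return haar_inverse(*bands)


image = [10, 12, 200, 202, 204, 206, 14, 16]
stream = encode(image, roi=(2, 6), steps={"roi": 1, "background": 16})
restored = decode(stream)
```

In this run the samples inside the region of interest come back unchanged, while the background samples are only approximated, because the background segment was quantized with a much coarser step.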
BIT STREAM DESCRIPTION
A description will now be given of those components in the bit stream 27 that are required when applying the described technique.
Data structures and pointers
Pointer
A pointer is a set of symbols that defines the position of a bit or a byte in a bit stream or a file. Many ways of defining a pointer have been defined in computer science. Any one of these methods can be used here. A pointer can be defined implicitly by a specific bit stream composition rule. A pointer can be defined relative to an explicitly or implicitly determined position. A simple way of defining a pointer is to determine the number of bits between the requested position and a known reference point, such as the first bit in the bit stream, for instance.
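As an illustration of the simplest pointer form mentioned above, the sketch below treats a pointer as a count of bits from the first bit of the stream and reads a field at that position. The function name, the most-significant-bit-first ordering and the sample bytes are assumptions made for the example, not part of the described format.

```python
def read_bits(bit_stream: bytes, pointer: int, count: int) -> int:
    """Read `count` bits starting `pointer` bits after the first bit of the
    stream, most significant bit first (an assumed bit order)."""
    value = 0
    for pos in range(pointer, pointer + count):
        bit = (bit_stream[pos // 8] >> (7 - pos % 8)) & 1
        value = (value << 1) | bit
    return value


bit_stream = bytes([0b10110010, 0b01111000])

# Absolute pointer: 4 bits after the first bit of the stream.
field = read_bits(bit_stream, 4, 8)

# Relative pointer: an offset counted from a known reference position.
reference = 8
nibble = read_bits(bit_stream, reference + 1, 4)
```

A relative pointer only needs enough bits to cover the distance from its reference point, which may be why the text allows both forms.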
Topology descriptor
The topology descriptor, TOP, is a set of symbols that defines the topological relationship between numbered objects and shapes. This is illustrated in Figure 6, in which four objects O1, O2, O3, O4 and four shapes S1, S2, S3 and S4 are shown. The topology of the image can be represented, e.g., as a tree graph as shown in Figure 7. The nodes and the edges of the tree graph can be coded in a data structure using well known methods. p_TOP is a pointer to a topology descriptor.
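One well-known way to code such a tree is a parent table, sketched below. The nesting used here is hypothetical and is not taken from Figure 6 or Figure 7; the only assumption carried over from the text is that the background is object 0.

```python
# Hypothetical nesting (NOT the one in Figure 6): object number -> number of
# the object that directly encloses it. Object 0 is the background.
parent = {1: 0, 2: 0, 3: 2, 4: 3}


def children_of(parent):
    """Turn the parent table into a node -> list-of-children tree."""
    tree = {}
    for node, up in sorted(parent.items()):
        tree.setdefault(up, []).append(node)
    return tree


def enclosing_objects(parent, node):
    """Objects that contain `node`, from the innermost out to the background."""
    chain = []
    while node in parent:
        node = parent[node]
        chain.append(node)
    return chain


tree = children_of(parent)
outer = enclosing_objects(parent, 4)
```

A decoder holding such a table can tell which shapes it needs in order to place a requested object without decoding the others.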
Shape descriptor
A shape descriptor, Si, defines the appearance of a closed boundary line of an object. The shape number, i, is given by a topology descriptor. Many different shape coding techniques can be used. Examples of such methods are chain coding and shape coding methods in MPEG-4. Shape descriptors can be decoded independently of one another once their respective positions in the bit stream are known. p_Si is a pointer to a shape descriptor.
Segment descriptor
A segment descriptor, Ti, is a compressed set of symbols that encodes a segment as described above. The segment includes an ordered set of subsets. The object number, i, is given by a topology descriptor. p_Ti is a pointer to a segment descriptor.
Subset descriptor
A subset descriptor, Bij, is an independently decodable subset, j, of a segment descriptor, Ti, which describes, e.g., the coefficients that belong to a given subband, j, as described above. p_Bij is a pointer to a subset descriptor.
Multiplexed segment descriptor
Several segment descriptors, {Ti, Tj, Tk ...}, can be multiplexed into a common data structure MT(i,j,k). This is done normally for the purpose of simultaneous progressive transmission of a set of objects. The data structure, MT, is called a multiplexed segment descriptor. Several multiplexing methods can be used. p_MT is a pointer to a multiplexed segment descriptor.
Segment multiplexing methods
Examples of multiplexing methods are shown in Figure 5. A simple method is to interleave subsets 52 belonging to the component segments so that:
MT(i,j,k)={Bi0, Bj0, Bk0, Bi1, Bj1, Bk1, Bi2, Bj2, Bk2, ...}
In this case, the order of the symbols corresponds to the order in the bit stream 51, with symbols on the left being sent first. Subsets in a multiplexed stream may be excluded if they are known by the decoder.
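The interleaving rule above can be sketched in Python as follows. This is a minimal illustration rather than the coder of the description: the function name `multiplex`, the dictionary input and the `known` set (used to leave out subsets the decoder already holds) are assumptions introduced for the example.

```python
from itertools import zip_longest


def multiplex(segments, known=frozenset()):
    """Interleave subsets: subset 0 of every segment, then subset 1, and so on.

    segments: dict mapping an object label to its ordered list of subsets.
    known: set of (label, subset index) pairs already held by the decoder;
           these subsets are excluded from the multiplexed stream.
    Returns a list of (label, subset index, subset) in transmission order.
    """
    order = list(segments)
    stream = []
    rows = zip_longest(*(segments[label] for label in order))
    for level, row in enumerate(rows):
        for label, subset in zip(order, row):
            # None marks a segment that has no subset at this level.
            if subset is not None and (label, level) not in known:
                stream.append((label, level, subset))
    return stream


segments = {
    "i": ["Bi0", "Bi1", "Bi2"],
    "j": ["Bj0", "Bj1", "Bj2"],
    "k": ["Bk0", "Bk1", "Bk2"],
}
mt = [subset for _, _, subset in multiplex(segments)]
```

Here `mt` lists the subsets in the same left-to-right order as the expression for MT(i,j,k), and passing a non-empty `known` set drops the corresponding subsets from the stream.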
Bit stream storage format
In order to obtain immediate access to any object whatsoever in the image, the stored bit stream or file structure should preferably include at least the following components:
In the image header, if required:
Topology descriptor TOP
Pointers to shape descriptors {p_S1, p_S2...p_SN}
Pointers to segment descriptors {p_T0, p_T1, ...p_TN}
Optional pointers to subset descriptors: for each k=[0,N], {p_Bk0, p_Bk1, ...p_BkN}
In the actual stored bit stream if needed:
Shape descriptors {S1, S2, ...SN}
Segment descriptors {T0, T1, ...TN}
A group of segment descriptors with index {k,l,m...} can optionally be replaced with a multiplexed segment descriptor MT(k,l,m...).
N is the number of stored objects. The background is the object with index 0.
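To show how a header of pointers gives immediate access to a single object, the following Python sketch packs a topology descriptor, shape descriptors and segment descriptors into one byte string and then fetches one segment descriptor directly. The byte layout (little-endian 32-bit fields, pointers stored as offset and length pairs) and the names `pack_image` and `read_segment` are assumptions chosen for the example; the description does not prescribe a concrete layout.

```python
import struct


def pack_image(top, shapes, segments):
    """Return header + body. Pointers are (offset, length) pairs into the body.

    Assumed layout: three uint32 counts, the TOP bytes, one pointer per shape
    descriptor, one pointer per segment descriptor, then the descriptors.
    """
    body = b""
    pointers = []
    for blob in list(shapes) + list(segments):
        pointers.append((len(body), len(blob)))
        body += blob
    header = struct.pack("<III", len(top), len(shapes), len(segments)) + top
    for offset, length in pointers:
        header += struct.pack("<II", offset, length)
    return header + body


def read_segment(data, index):
    """Fetch segment descriptor T_index without parsing any other descriptor."""
    top_len, n_shapes, n_segments = struct.unpack_from("<III", data, 0)
    table = 12 + top_len                        # start of the pointer table
    body = table + 8 * (n_shapes + n_segments)  # start of the stored descriptors
    entry = table + 8 * (n_shapes + index)      # pointer p_T(index)
    offset, length = struct.unpack_from("<II", data, entry)
    return data[body + offset: body + offset + length]


data = pack_image(b"TOP", [b"S1", b"S2"], [b"T0-bg", b"T1", b"T2xx"])
```

Calling `read_segment(data, 2)` touches only the fixed-size header fields and the requested bytes, which is the property the storage format is meant to provide.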
PROGRESSIVE TRANSMISSION WITH IMMEDIATE ACCESS TO OPTIONAL OBJECTS
A server receives a request for sending image data to a client. The image is stored with the server in the format described in the preceding passage. Part of the stored data structures (topological data, shapes, segments and subsets) may have already been sent to the receiving terminal. This section of the description describes a procedure for composing a bit stream with the server that handles the request.
Example
Request from user
A simple request contains the following information:
Send objects with numbers k, l, m, ... with respective accuracies nk, nl, nm, ..., where the accuracy is the index of the highest subset that is sent for each object number.
Several primitive requests may be sent. They will be served in the order in which they are received or in an otherwise specified order.
Procedure for serving a request (details)
Send topological information if needed. TOP is sent in response to a first request for image information.
Send all shape descriptors that are necessary to describe the boundaries of the objects requested. It is not necessary to send shape descriptors that are already known to the decoder. When using the topological tree structure in Figure 7, it is found that not all shape descriptors on the same branch as the object or on the same or lower hierarchical level need be sent. The server knows the state of the decoder and will send solely those shape descriptors that are unknown to the decoder.
Send (multiplexed) subset descriptors that describe the objects requested to the defined accuracy. Subset descriptors that are already known to the decoder need not be sent. For instance, the user is aware of the subsets {Bk0, Bk1, Bk2, Bk3} belonging to segment k. Subset descriptors {Bk5, Bk6, Bk7} must be sent when object k is requested to accuracy 7.
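The bookkeeping in the last step can be sketched as follows in Python. The sketch assumes that the server tracks, per object, the set of subset indices already delivered; the function name `serve_request` and the data shapes are hypothetical, and the object numbers in the usage lines are arbitrary. Topology and shape descriptors are left out to keep the example short.

```python
def serve_request(request, decoder_state):
    """Return the (object, subset index) pairs that still have to be sent.

    request: dict mapping object number -> requested accuracy, i.e. the index
             of the highest subset wanted for that object.
    decoder_state: dict mapping object number -> set of subset indices the
             decoder already holds; it is updated in place.
    """
    to_send = []
    for obj, accuracy in request.items():
        known = decoder_state.setdefault(obj, set())
        for j in range(accuracy + 1):
            if j not in known:
                to_send.append((obj, j))
                known.add(j)  # the server now regards this subset as delivered
    return to_send


state = {7: {0, 1, 2}}                          # decoder already holds B7,0 to B7,2
first = serve_request({7: 5, 2: 1}, state)      # only the missing subsets are listed
second = serve_request({7: 5}, state)           # repeating the request sends nothing
```

The returned pairs are listed object by object; a multiplexing method such as interleaving can then decide their order in the transmitted bit stream.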
EXAMPLES
In this section of the description, examples are given with respect to situations in which the proposed method can be applied.
Assume, according to Figure 5, that in the centre of the image R51 there is an encircled region R52 whose quality must be better than the quality of the region R53 outside the circle, this latter region being referred to hereinafter as the background. However, both the background R53 and the region R52 shall be transmitted simultaneously. The following then takes place:
1. The original image is transformed with a wavelet transform.
2. A mask is then created in the transform domain. This mask describes the coefficients that are required in the transform domain in order to reconstruct the region R52 and the background R53. The created mask is then used to classify the coefficients in the transform domain in two segments, one segment for the region and one segment for the background. The two segments are built up by a number of subsets. In the illustrated case, the number of subsets is the same as the number of subbands in the transform domain. The situation on hand is thus:
2.1 In respect of the region segment belonging to the region R52:
{{r_{0,1}, r_{0,2}, ..., r_{0,i}}, ..., {r_{no_subbands,1}, r_{no_subbands,2}, ..., r_{no_subbands,j}}} where i, j are the numbers of coefficients in the different subsets.
2.2 In respect of the background segment belonging to the background R53:
{{b_{0,1}, b_{0,2}, ..., b_{0,p}}, ..., {b_{no_subbands,1}, b_{no_subbands,2}, ..., b_{no_subbands,q}}} where p, q are the numbers of coefficients in the different subsets.
3. The two segments are then coded as follows:
3.1 In respect of the region segment:
A segment descriptor Tr={Br,0, Br,1, ..., Br,no_subbands} and a set of subset pointers {p_Br,0, p_Br,1, ..., p_Br,no_subbands}.
3.2 In respect of the background segment:
A segment descriptor Tb={Bb,0, Bb,1, ..., Bb,no_subbands} and a set of subset pointers {p_Bb,0, p_Bb,1, ..., p_Bb,no_subbands}.
4. The two segments are then combined into a single bit stream, bit stream 51, in the following manner:
<image header><TOP><Sr><{p_Bb,0, p_Br,0, p_Bb,1, ..., p_Bb,no_subbands, p_Br,no_subbands}><MT(b,r)={Bb,0, Br,0, Bb,1, Br,1, ..., Bb,no_subbands, Br,no_subbands}>
In this case, the subsets are combined in the manner shown in the upper part of Figure 5, with the sub-bit streams 52 of the region being taken alternately with the sub-bit streams of the background. It will be noted that the TOP field is not required when the receiver is aware of the order in which the various parts of the image are set. The first part of the array, from <image header> to ...p_B...}> is, in other words, a definition of where the different image regions are placed in the remainder of the compressed bit stream <MT(b,r)={...B...}>.
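The relationship between the pointer list and the multiplexed part of bit stream 51 can be illustrated with a short Python sketch. It is a simplification under stated assumptions: pointers are byte offsets into the multiplexed payload, the image header, TOP and Sr fields are omitted, and the names `compose` and `extract` are invented for the example.

```python
def compose(background, region):
    """Interleave background and region subsets and record where each starts.

    Returns (pointers, payload). pointers[n] is the byte offset of the n-th
    subset in payload, in the order Bb,0, Br,0, Bb,1, Br,1, ...
    """
    if len(background) != len(region):
        raise ValueError("both segments need one subset per subband")
    pointers = []
    payload = b""
    for b_subset, r_subset in zip(background, region):
        for subset in (b_subset, r_subset):
            pointers.append(len(payload))
            payload += subset
    return pointers, payload


def extract(pointers, payload, which, level):
    """Return a single subset; which is 0 for background, 1 for region."""
    n = 2 * level + which
    end = pointers[n + 1] if n + 1 < len(pointers) else len(payload)
    return payload[pointers[n]:end]


pointers, payload = compose([b"bb0", b"b1"], [b"r0", b"rrr1"])
region_level_1 = extract(pointers, payload, which=1, level=1)
```

Because every subset start is recorded, a decoder interested only in the region can skip the background subsets instead of decoding them.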
5. The combined bit stream is then sent to the receiver. The following takes place on the decoder side:
6. The image header together with the topology, shape information and pointers are read.
7. The decoder is now able to create the same mask as that described above.
8. The decoder creates the segments with the underlying subsets.
9. The decoder commences with decoding the combined bit stream and filling in the transmitted transform coefficients in the corresponding subsets.
10. An inverse transform is used.
11. The image is transmitted and reconstructed.
The aforedescribed is one way of using the proposed method. Other methods may be to combine (mix) the bit streams in another way. For instance, as shown in the bottom part of Figure 5, the region R52 may be transmitted first, followed by the background R53. Another example is one in which more than one region is found, as described with reference to Figure 6, wherewith these regions are combined in a number of different ways.
In addition to the earlier mentioned advantages, the proposed method has the added advantage of enabling shape information to be sent only when needed.

Claims

1. A method of transmitting an image (3) between a transmitter (2, 5, 6) and a receiver (7, 8), comprising the steps of: dividing the image (3) into at least two image regions (R1, R2, Rn); coding the image regions (R1, R2, Rn) into a coded symbol stream (21), said coding utilising a symbolic representation and having predetermined accuracy levels in said image regions; and compressing the coded symbol stream into a compressed bit stream (PS1, 27), characterised in that the method includes the further steps of: generating (22) a definition (PS2) of the different image regions in the compressed bit stream; transmitting said definition (PS2) to the receiver (7); transmitting the compressed bit stream (PS1, 27) to the receiver (7, 8); and decoding (33, 34) in the receiver predetermined parts of the compressed bit stream (PS1, 27) with the aid of said definition.
2. An arrangement for transmitting an image (3), comprising: a transmitter (2, 5, 6) and a receiver (7, 8); means (4, 5) for dividing the image (3) into at least two image regions (R1, R2, Rn); a coding device (5) for coding the image regions (R1, R2, Rn) into a coded symbol stream, said coding device utilising a symbolic representation and having predetermined accuracy levels in said regions; a compressing device for compressing the coded symbol stream into a compressed bit stream (PS1, 27); and means in the transmitter (2, 5, 6) for transmitting said compressed bit stream (PS1, 27) to the receiver (7, 8), characterised in that the arrangement also includes: means (5) for generating (22) a definition (PS2) of the different image regions (R1, R2, Rn) in the compressed bit stream (PS1, 27); means in the transmitter (2, 5, 6) for transmitting said definition (PS2) to the receiver (7, 8); and a decoder (8) in the receiver for decoding (34, 35) predetermined parts of the compressed bit stream (PS1, 27) with the aid of said definition (PS2).
AMENDED CLAIMS
[received by the International Bureau on 9 November 1999 (09.11.99); original claims 1-2 replaced by amended claims 1-18 (5 pages)]
1. A method of transmitting an image (3) between a transmitter (2, 5, 6) and a receiver (7, 8), comprising the steps of: dividing the image (3) into at least two image regions (R1, R2, Rn); coding the image regions (R1, R2, Rn) into a coded symbol stream (21), said coding utilising a symbolic representation and having predetermined accuracy levels in said image regions; and compressing the coded symbol stream into a compressed bit stream (PS1, 27), characterised in that the method includes the further steps of: generating (22) a definition (PS2) of an outer boundary line (Si) of at least one of the image regions (R2, Rn); transmitting said definition (PS2) to the receiver (7); transmitting the compressed bit stream (PS1, 27) to the receiver (7, 8); and decoding (33, 34) in the receiver with the aid of said definition.
2. The method of claim 1, characterised in that two different of the image regions (R2,Rn) are coded to have said predetermined accuracy levels independently of each other.
3. A method of transmitting an image (3) between a transmitter (2, 5, 6) and a receiver (7, 8), comprising the steps of: dividing the image (3) into at least two image regions (R1, R2, Rn); coding the image regions (R1, R2, Rn) into a coded symbol stream (21), said coding utilising a symbolic representation and having predetermined accuracy levels in said image regions; and compressing the coded symbol stream into a compressed bit stream (PS1, 27), characterised in that the method includes the further steps of: generating (22) a definition (PS2) of a mask (PS2) for at least one of the image regions (R2, Rn), two different of the image regions (R2, Rn) being encoded to have said predetermined accuracy levels independently of each other; transmitting said definition (PS2) to the receiver (7); transmitting the compressed bit stream (PS1, 27) to the receiver (7, 8); and decoding (33, 34) in the receiver with the aid of said definition.
4. The method of claim 1, 2 or 3, characterised in that only predetermined parts of the compressed bit stream (PS1, 27) are decoded.
5. The method of any of the claims 1, 2, 3 or 4, characterised by generating a topology description, defining the topological relationship between objects (O1, O2, O3, O4) and shapes (S1, S2, S3, S4) in the image.
6. The method of any of the claims 1, 2, 3 or 4, characterised by generating a shape description, defining the appearance of the closed boundary line (Si) of an object (O1, O2, O3, O4) in the image.
7. The method of any of the claims 1, 2, 3 or 4, characterised by generating a segment description, defining which transform coefficients that belong to respective segment.
8. The method of claim 7, characterised by generating a subset description, defining which transform coefficients that belong to an independently decodable part of a segment.
9. The method of any of the claims 5, 6, 7 or 8, characterised by generating of a pointer, defining a position in the bit stream (27) for the respective one of the above mentioned descriptions.
10. An arrangement for transmitting an image (3), comprising: a transmitter (2, 5, 6) and a receiver (7, 8); means (4, 5) for dividing the image (3) into at least two image regions (R1, R2, Rn); a coding device (5) for coding the image regions (R1, R2, Rn) into a coded symbol stream, said coding device utilising a symbolic representation and having predetermined accuracy levels in said regions; a compressing device for compressing the coded symbol stream into a compressed bit stream (PS1, 27); and means in the transmitter (2, 5, 6) for transmitting said compressed bit stream (PS1, 27) to the receiver (7, 8), characterised in that the arrangement also includes: means (5) for generating (22) a definition (PS2) of an outer boundary line (Si) of at least one of the image regions (R2, Rn); means in the transmitter (2, 5, 6) for transmitting said definition (PS2) to the receiver (7, 8); and a decoder (8) in the receiver for decoding (34, 35) of the compressed bit stream (PS1, 27) with the aid of said definition (PS2).
11. The arrangement of claim 10, characterised in that the coding device is arranged to encode (24) two different of the image regions (R2, Rn) to have the predetermined accuracy levels independently of each other.
12. An arrangement for transmitting an image (3), comprising: a transmitter (2, 5, 6) and a receiver (7, 8); means (4, 5) for dividing the image (3) into at least two image regions (R1, R2, Rn); a coding device (5) for coding the image regions (R1, R2, Rn) into a coded symbol stream, said coding device utilising a symbolic representation and having predetermined accuracy levels in said regions; a compressing device for compressing the coded symbol stream into a compressed bit stream (PS1, 27); and means in the transmitter (2, 5, 6) for transmitting said compressed bit stream (PS1, 27) to the receiver (7, 8), characterised in that the arrangement also includes: means (5) for generating (22) a definition (PS2) of a mask (PS2) for at least one of the image regions (R2, Rn), the coding device (5) being arranged to encode (24) two different of the image regions (R2, Rn) to have said predetermined accuracy levels independently of each other; means in the transmitter (2, 5, 6) for transmitting said definition (PS2) to the receiver (7, 8); and a decoder (8) in the receiver for decoding (34, 35) of the compressed bit stream (PS1, 27) with the aid of said definition (PS2).
13. The arrangement of claim 10, 11 or 12, characterised in that the decoder (8) is arranged to decode only predetermined parts of the compressed bit stream (PS1, 27).
14. The arrangement of claim 10, 11, 12 or 13, characterised in that the transmitter (2, 5, 6) has means for generating a topology description, defining the topological relationship between objects (O1, O2, O3, O4) and shapes (S1, S2, S3, S4) in the image.
15. The arrangement of claim 10, 11, 12 or 13, characterised in that the transmitter (2, 5, 6) has means for generating a shape description, defining the appearance of the closed boundary line (Si) of an object (O1, O2, O3, O4) in the image.
16. The arrangement of claim 10, 11, 12 or 13, characterised in that the transmitter (2, 5, 6) has means for generating a segment description, defining which transform coefficients that belong to respective segment.
17. The arrangement of claim 16, characterised in that the transmitter (2, 5, 6) has means for generating a subset description, defining which transform coefficients that belong to an independently decodable part of a segment.
18. The arrangement of claim 14, 15, 16 or 17, characterised in that the transmitter (2, 5, 6) has means for generating a pointer, defining a position in the bit stream (27) for the respective one of the above mentioned descriptions.
STATEMENT UNDER ARTICLE 19
In the amended independent method claim 1, which in the main corresponds to earlier claim 1, a feature defining an outer boundary line is inserted. This is supported by the description at page 9, lines 22 and 23. Dependent claim 2 defines that different regions are coded independently, supported by the description at page 7, lines 12-14. The new independent method claim 3, also in the main corresponding to earlier claim 1, defines a mask for the image regions and that the image regions are coded independently of each other. Support in the description is to be found at page 3, lines 14-19; page 6, line 29 to page 7, line 10; and page 7, lines 12-14.
The scope of earlier claim 1 has been broadened in one respect. The feature of decoding predetermined parts of the bit stream has been removed from claim 1 and is instead defined in the new dependent claim 4.
The new dependent method claims 5-9 define a number of descriptions and an associated pointer for the transmitted image. Support is in the description at page 8, line 20 to page 10, line 14.
The earlier independent device claim 2 is amended in a corresponding way as claim 1 and is now claim 10. The new device claim 12 corresponds to the new method claim 3. The new dependent device claims 11 and 13-18 correspond to the respective claims 2 and 4-9.
EP99931683A 1998-06-18 1999-06-10 Method and apparatus in transmission of images Withdrawn EP1106014A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
SE9802193 1998-06-18
SE9802193A SE521021C2 (en) 1998-06-18 1998-06-18 Method and apparatus for transmitting images
PCT/SE1999/001024 WO2000001153A1 (en) 1998-06-18 1999-06-10 Method and apparatus in transmission of images

Publications (1)

Publication Number Publication Date
EP1106014A1 (en) 2001-06-13

Family

ID=20411773

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99931683A Withdrawn EP1106014A1 (en) 1998-06-18 1999-06-10 Method and apparatus in transmission of images

Country Status (8)

Country Link
EP (1) EP1106014A1 (en)
JP (1) JP2002519953A (en)
KR (1) KR20010052710A (en)
CN (1) CN1135848C (en)
AU (1) AU753304B2 (en)
CA (1) CA2335022A1 (en)
SE (1) SE521021C2 (en)
WO (1) WO2000001153A1 (en)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6553150B1 (en) * 2000-04-25 2003-04-22 Hewlett-Packard Development Co., Lp Image sequence compression featuring independently coded regions
EP1287493A4 (en) * 2000-05-29 2006-08-16 Canon Kk A method for enabling file format compatibility
JP4701448B2 (en) * 2000-09-19 2011-06-15 株式会社メガチップス Region of interest encoding method
JP5388393B2 (en) * 2001-04-27 2014-01-15 キヤノン株式会社 Image processing apparatus, image processing method, and control program
FR2827409B1 (en) * 2001-07-10 2004-10-15 France Telecom WAVELLET-BASED IMAGE CODING PROCESS ALLOWING ADAPTIVE TRANSMISSION OF WAVELLET COEFFICIENTS, SYSTEM SIGNAL AND CORRESPONDING DEVICES
US6882755B2 (en) * 2001-10-19 2005-04-19 Hewlett-Packard Development Company, L.P. Image transmission for low bandwidth with region of interest
US7428338B2 (en) 2002-01-10 2008-09-23 Ricoh Co., Ltd. Header-based processing of images compressed using multi-scale transforms
CN1426221A (en) * 2002-12-27 2003-06-25 王锡宁 Three-dimensional communication unit
JP2005223852A (en) * 2004-02-09 2005-08-18 Sanyo Electric Co Ltd Image-encoding device and method, and image-decoding device and method
CN1655620B (en) 2004-02-09 2010-09-22 三洋电机株式会社 Image display apparatus
US7720295B2 (en) 2004-06-29 2010-05-18 Sanyo Electric Co., Ltd. Method and apparatus for coding images with different image qualities for each region thereof, and method and apparatus capable of decoding the images by adjusting the image quality
JP4656912B2 (en) 2004-10-29 2011-03-23 三洋電機株式会社 Image encoding device
EP1830573A1 (en) * 2006-03-02 2007-09-05 Thomson Licensing Method and apparatus for determining in picture signal encoding the bit allocation for groups of pixel blocks in a picture
CN101309259B (en) * 2008-06-24 2012-07-11 广东威创视讯科技股份有限公司 Distributed image display method
US9143783B2 (en) 2011-01-19 2015-09-22 Telefonaktiebolaget L M Ericsson (Publ) Indicating bit stream subsets
CN102438144B (en) * 2011-11-22 2013-09-25 苏州科雷芯电子科技有限公司 Video transmission method
US20140112589A1 (en) * 2012-10-22 2014-04-24 Gurulogic Microsystems Oy Encoder, decoder and method
US10514541B2 (en) 2012-12-27 2019-12-24 Microsoft Technology Licensing, Llc Display update time reduction for a near-eye display
JP6308976B2 (en) * 2015-05-19 2018-04-11 キヤノン株式会社 Diagnosis support apparatus, operation method of diagnosis support apparatus, and diagnosis support system
US20170180758A1 (en) * 2015-12-22 2017-06-22 Vallabhajosyula S. Somayazulu Tiled Wireless Display
CN106060544B (en) * 2016-06-29 2020-04-28 华为技术有限公司 Image coding method, related equipment and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5757974A (en) * 1996-04-15 1998-05-26 The United States Of America As Represented By The Secretary Of The Navy System and method for data compression

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0001153A1 *

Also Published As

Publication number Publication date
AU753304B2 (en) 2002-10-17
CN1305684A (en) 2001-07-25
AU4812099A (en) 2000-01-17
SE9802193L (en) 1999-12-19
WO2000001153A1 (en) 2000-01-06
SE521021C2 (en) 2003-09-23
SE9802193D0 (en) 1998-06-18
CN1135848C (en) 2004-01-21
CA2335022A1 (en) 2000-01-06
KR20010052710A (en) 2001-06-25
JP2002519953A (en) 2002-07-02

Similar Documents

Publication Publication Date Title
EP1106014A1 (en) Method and apparatus in transmission of images
Muthukumaran et al. The performances analysis of fast efficient lossless satellite image compression and decompression for wavelet based algorithm
AU2002215709B2 (en) Method and apparatus for scalable compression of video
US5966465A (en) Compression/decompression using reversible embedded wavelets
CN100511282C (en) Data compression
CN1254977C (en) Video coder with fine granular scalability
KR100550105B1 (en) Method and apparatus for compressing ad decompressing image
US7634145B2 (en) Compression and decompression with wavelet style and binary style including quantization by device-dependent parser
US6345126B1 (en) Method for transmitting data using an embedded bit stream produced in a hierarchical table-lookup vector quantizer
JP2527874B2 (en) Device for imparting robustness to local image loss to compressed image data.
JPH09139940A (en) Decoder for video delivery system capable of expansion and contraction between terminals
CN103841424B (en) The system and method for compressed data in random access memory
KR100959837B1 (en) Adaptive method and system for mapping parameter values to codeword indexes
KR20060106930A (en) Method and apparatus for compressing and decompressing images
ES2351880T3 (en) CODING AND DECODING SENSITIVE TO THE CONTEXT OF A VIDEO DATA FLOW.
EP1175785A1 (en) Networked delivery of profiled media files to clients
JP3466080B2 (en) Digital data encoding / decoding method and apparatus
CN1554190A (en) System and method for compressing data
RU2327301C2 (en) Method of transmission of additional data in combined use of vector quantisation and fractal conding of images allowing for classifications of domains and units from code book
Al-Janabi Highly scalable single list set partitioning in hierarchical trees image compression
EP1465350A2 (en) Embedded multiple description scalar quantizers for progressive image transmission
Pawadshetty et al. JPEG 2000 region of interest coding methods
Minas et al. Compression of an AVI video file using fractal system
CN111264062A (en) Encoder, encoding system and encoding method
Zhu et al. Adaptive context based coding for lossless color image compression

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20001213

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): DE GB IT

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20061024