USRE45306E1 - Image processing method and device thereof - Google Patents
Image processing method and device thereof
- Publication number
- USRE45306E1 (application US13/436,710)
- Authority
- US
- United States
- Prior art keywords
- frames
- image processing
- areas
- mode
- pull down
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/01—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
- H04N7/0112—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level one of the standards corresponding to a cinematograph film standard
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/445—Receiver circuitry for the reception of television signals according to analogue transmission standards for displaying additional information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/01—Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
Definitions
- Taiwan application serial no. 94113398 filed on Apr. 27, 2005. All disclosure of the Taiwan application is incorporated herein by reference.
- This invention generally relates to an image processing method and device thereof, and especially to an image processing method and device thereof for processing images with running captions.
- a conventional film mode, for example the common recording mode of a movie film, has 24 complete frames per second; the frame rate is therefore 24 frames/s (a playback frequency of 24 Hz). Some other film modes record 30 complete frames per second, a playback frequency of 30 Hz.
- the conventional broadcasting methods for visual signals, such as cable television and wireless television, generally use broadcasting modes such as NTSC (National Television System Committee) and PAL (Phase Alternating Line).
- the broadcasting frequency of NTSC is 60 Hz, which means that 60 interlaced frames per second are received at the end-user terminal from the television station.
- the broadcasting frequency of PAL is 50 Hz.
- in the interlaced frames, for example, the odd-number frames display images only on scan lines 1, 3, 5, etc. (the odd-number scan lines).
- the even-number scan lines do not display any images.
- in the even-number frames, only scan lines 2, 4, 6, etc. (the even-number scan lines) display images and the odd-number scan lines do not, or vice versa.
- FIG. 1 is a drawing schematically showing a view of a conventional frame converting format of a 30 Hz film mode with 2:2 pull down.
- each frame 101 , 102 to 130 represents a complete image frame with 30 frames per second.
- Frame 101 is divided into a frame 101e, in which only the even-number scan lines appear, and a frame 101o, in which only the odd-number scan lines appear. The other frames are processed in the same way.
- the frames with the suffix “e” are those in which only even-number scan lines appear
- the frames with the suffix “o” are those in which only odd-number scan lines appear (the same notation is used below).
- the NTSC standard at a broadcasting frequency of 60 Hz can thus be met.
- FIG. 2 is a drawing schematically showing a view of conventional frame converting format of 24 Hz film mode with 3:2 pull down.
- the frames 201 , 202 , 203 to 224 represent the complete image frames with 24 frames per second.
- the first frame 201 is divided into the interlaced frames 201e and 201o.
- the second frame 202 is divided into the frames 202e, 202o and 202e.
- the third frame 203 is divided into the frames 203e and 203o.
- the other frames are processed in the same way.
- 60 frames per second, 201e, 201o, 202e, 202o to 224e and 224o, are obtained, and the NTSC standard at a broadcasting frequency of 60 Hz can be met.
- FIG. 3 is a drawing schematically showing a view of the conventional frame converting format of a 24 Hz film mode with 2:2 pull down.
- the frames 301 , 302 to 324 represent the complete image frames with 24 frames per second.
- the first frame 301 is divided into the interlaced frames 301 e and 301 o.
- the other frames are processed in the same way. Therefore, the frames 301e, 301o, 302e, 302o to 324e and 324o at 48 frames per second are obtained; with other processes, the PAL standard at a broadcasting frequency of 50 Hz can be met.
- a higher quality playback mode of an image playback device of an end-user terminal for example, a High Definition TV (HDTV)
- a High Definition TV will first detect whether the received frames are the interlaced frames processed with 2:2 pull down or 3:2 pull down; further, by the Inverse Telecine (IVTC) process, the interlaced frames will be converted into complete frames before they are played back.
- IVTC Inverse Telecine
- the frames 201e and 201o are combined to obtain the complete frame 201; the frames 202e and 202o are combined to obtain the complete frame 202; and the other frames are processed in the same way.
- FIG. 4 is a drawing schematically showing a view of a conventional playback mode of a film after the IVTC process for 3:2 pull down. According to the mode of FIG. 4, the complete frame 201 is played two times to obtain the frames 201p and 201p; the frame 202 is played three times to obtain the frames 202p, 202p and 202p; and the other frames are processed in the same way. Finally, 60 frames per second can be played with the playback mode shown in FIG. 4. Compared with the playback mode of a conventional TV, in which 60 interlaced frames per second are played, the frames played with the playback mode shown in FIG. 4 are smooth, without the effects of dropped frames, image displacement or image delay.
- running captions are frequently added.
- the running captions are added directly to the interlaced broadcasting frames, for example, added directly to the interlaced frames 101 e, 101 o, 102 e, 102 o to 130 e and 130 o as shown in FIG. 1 , or added directly to the interlaced frames as shown in FIG. 2 or FIG. 3 .
- the interlaced frames with the added running captions are transmitted to the end-user terminal.
- FIG. 5 is a drawing schematically showing a view of a conventional frame process mode with an interlace mode after adding running captions.
- the conventional image playback device will combine the interlaced frames 201e and 201o into a complete frame 551, and combine the interlaced frames 201o and 202e into a complete frame 552; the other frames are processed in the same way.
- the display quality of the running captions achieved by the mode is excellent.
- the definition of the whole frame is deteriorated.
- the complete frame 552 is composed of the interlaced frames 201o and 202e, which belong to two different complete frames 201 and 202 as shown in FIG. 2. Consequently, many ragged saw-tooth artifacts occur at the intersections of the images in the frame 552. Therefore, a more suitable method and device for processing frames with added running captions is needed.
- An object of the present invention is to provide an image processing method, for separating the received frame to at least two different areas, further respectively performing different image processes to the different areas, to obtain a better resulted frame.
- Another object of the present invention is to provide an image processing device, for separating the received frame to at least two different areas, further separately performing different image processes to the different areas, to obtain a better resulted frame.
- the present invention provides an image processing method, which comprises the following steps. First, a plurality of frames is received. Further, at least two adjacent frames are compared to obtain an area selection signal. Further, each of the frames is separated to at least two areas according the area selection signal. Next, an image process is performed respectively to each of the areas. Finally, the corresponding processed areas are combined to obtain a resulted frame.
- the present invention provides an image processing device comprising, for example, a capture device and a processor.
- the capture device is adopted for receiving a plurality of frames, and for comparing at least two adjacent frames to obtain an area selection signal according to a differential value there-between.
- the processor is connected to the capture device for receiving the area selection signal and to separate each of the adjacent frames into at least two areas according to the area selection signal. Each of the areas is performed by an image processing step respectively. Moreover, the corresponding processed areas are combined to obtain a resulted frame.
- the image processing step comprises that, for example, according to the area selection signal, an inverse telecine (IVTC) process mode is performed to the area, when one of the areas is created by a pull down mode.
- the inverse telecine process mode comprises, for example, performing an inverse process mode of the pull down mode.
- one of the areas obtained by the pull down mode comprises a frame which is created from a frame of film mode by the pull down mode.
- the pull down mode comprises 2:2 pull down mode, 3:2 pull down mode or other pull down modes with any proportion.
- the method of respectively performing the image processing steps to each of the areas comprises that, for example, according to the area selection signal, a motion adaptive process mode is performed to the area when one of the areas is created by an interlace mode.
- the motion adaptive process mode comprises combining each two of the adjacent frames to obtain a complete frame.
- one of the areas, which is created by the interlace mode comprises a running caption.
- a method of comparing at least two adjacent frames comprises: each corresponding vertical line of the two adjacent frames is scanned; further, the changes of the image positions of the corresponding vertical lines in the two adjacent frames are compared to obtain an area selection signal.
- a method of comparing at least two frames of the adjacent frames comprises: each corresponding specific area block of the two frames of the adjacent frames is scanned; further, the changes of the corresponding image positions of the corresponding specific area blocks are compared to obtain an area selection signal.
- a method of obtaining the area selection signal comprises: after comparing at least the two frames of the adjacent frames, the area selection signal is decided according to a differential value of the frames.
- FIG. 1 is a drawing schematically showing a view of a conventional frame converting format of 30 Hz film mode with 2:2 pull down.
- FIG. 2 is a drawing schematically showing a view of a conventional frame converting format of 24 Hz film mode with 3:2 pull down.
- FIG. 3 is a drawing schematically showing a view of a conventional frame converting format of 24 Hz film mode with 2:2 pull down.
- FIG. 4 is a drawing schematically showing a view of a conventional playback mode after an IVTC process performed with 3:2 pull down.
- FIG. 5 is a drawing schematically showing a view of a conventional frame process mode with interlace mode after adding running captions.
- FIG. 6 is a drawing schematically showing a flow chart of an image processing method according to an embodiment of the present invention.
- FIG. 7 is a drawing schematically showing a view of a detecting frame mode according to an embodiment of the present invention.
- FIG. 8 is a drawing schematically showing a view of a frame processing mode according to an embodiment of the present invention.
- FIG. 9 is a drawing schematically showing a view of an image processing device according to an embodiment of the present invention.
- first, a plurality of frames is received (step 602).
- two or more adjacent frames are compared (step 604 ).
- an area selection signal 732 can be obtained from the comparing result (step 606 ) to indicate the position of the running caption area 714 .
- the received frame 702 is divided into two areas, for example, an image area 722 and a running caption area 724 .
- the method of comparing two or more frames of the adjacent frames comprises, for example, a scanning method.
- the scanning method for example, includes horizontal scanning, vertical scanning or area block scanning etc.
- the vertical scanning method comprises: first, each corresponding vertical line of at least two frames of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel of the corresponding vertical lines; the differential values obtained from the subtraction of each pixel in each vertical line are accumulated; the differential value of the accumulated value of each vertical line is compared.
- the horizontal scanning method comprises: first, each corresponding horizontal line of at least two frames of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel in the corresponding horizontal lines; the differential values obtained from the subtraction of each pixel in each horizontal line are accumulated; the differential value of the accumulated value of each horizontal line is compared.
- the area block scanning method comprises: first, each corresponding specific area block of at least two frames of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel in the corresponding specific area blocks; the differential values obtained from the subtraction of each pixel in each specific area block are accumulated; the differential value of the accumulated value of each specific area block is compared.
- an area selection signal 732 can be obtained as shown in FIG. 7 . Therefore, the frame can be separated to two areas according to the area selection signal 732 . Further, in step 610 , different image processes can be performed to different areas. Furthermore, in step 612 , the processed areas are combined to obtain a complete resulted frame.
- FIG. 8 is a drawing schematically showing a view of a frame processing method according to an embodiment of the present invention.
- the frames received in the present invention are the interlaced frames 201e to 224o shown in FIG. 2, with the running captions 501e and 501o to 530e and 530o.
- the area selection signal 732 is obtained as shown in FIG. 7, by which the frame area and the running caption area are separated.
- a motion adaptive mode is applied to the running caption area; for example, the running captions 501e and 501o are combined into a running caption 531, the running captions 501o and 502e are combined into a running caption 532, and so on.
- high definition running captions 531 to 580 at 60 frames per second can be obtained.
- the present invention detects that the interlaced frames 201e to 224o are formed by the 3:2 pull down mode.
- an inverse telecine (IVTC) mode can be utilized to combine the frames 201e and 201o into a complete frame 801 and the frames 202e and 202o into a complete frame 802; the other frames are processed in the same way.
- complete frames 801, 802 to 824 at 24 frames per second can be obtained.
- the obtained complete frame 801 is repeated two times, the frame 802 is repeated three times, and the other frames are processed in the same way.
- the image frames 801, 801, 802, 802, 802 to 824, 824, 824 and the running caption frames 531 to 580 are combined to obtain complete frames 831 to 880 at 60 frames per second.
- the image processing method of the present invention not only maintains high definition of film mode in the original frame, but also obtains clearly and easily distinguishable running captions.
- the effects of the saw-tooth phenomenon occurred in each frame or at the intersection areas between the running captions and the frames can be avoided.
- the frame is not necessarily separated to two areas.
- the received frame can be separated to at least two different areas, for example, to at least the image areas formed by the pull down mode and the running caption area formed by the interlace mode; further, the different image processes are respectively performed to the areas formed by different modes.
- the area selection signal 732 as shown in FIG. 7 will not be obtained by the image processing method of the present invention. Therefore, in FIG. 8, the processing mode for the running caption area will not be performed; only the inverse telecine process will be performed on the image area. The complete frame is then played by the corresponding pull down mode.
- the area selection signal 732 as shown in FIG. 7 will be obtained first by the image processing method of the present invention.
- a step-by-step process is performed to the running caption area, and the inverse telecine process mode and the corresponding pull down mode are performed to the image area.
- the image area and the running caption area are combined to obtain a relatively high definition frame.
- FIG. 9 is a drawing schematically showing a view of an image processing device according to an embodiment of the present invention.
- the image processing device 900 comprises, for example, a capture device 902 and a processor 904 .
- the capture device 902 is utilized to receive a plurality of frames, to compare at least two adjacent frames and to output an area selection signal which indicates frame difference.
- the processor 904, connected to the capture device 902, receives the area selection signal output from the capture device 902, separates the received frame into at least two different areas according to the area selection signal, and performs different image process modes on the respective areas. The processed areas are then combined to obtain a resulted frame, which is output to the display device 906.
- an area selection signal is obtained according to the received frames; the frame is separated into at least two different areas according to the source mode of the received frame, the source mode being detected according to the area selection signal; the different image processes are then respectively performed on the different areas. Therefore, the image processing method of the present invention maintains the higher definition of the film mode in the original frame and obtains clear and easily distinguishable running captions. In addition, the saw-tooth phenomenon occurring in each frame or at the intersection areas between the running captions and the frames is avoided.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Television Systems (AREA)
- Studio Devices (AREA)
- Image Processing (AREA)
- Closed-Circuit Television Systems (AREA)
- Apparatus For Radiation Diagnosis (AREA)
Abstract
An image processing method and device thereof are provided. The device includes a capture device and a processor. The capture device is adopted for receiving a plurality of frames and comparing at least two adjacent frames to obtain an area selection signal according to a differential value there-between. The processor is connected to the capture device for receiving the area selection signal and separating each of the adjacent frames into at least two areas. An image processing step is performed on each of the areas of the adjacent frames respectively, and the images of the areas are then combined to obtain a resulted frame.
Description
This application claims the priority benefit of Taiwan application serial no. 94113398, filed on Apr. 27, 2005. All disclosure of the Taiwan application is incorporated herein by reference.
1. Field of the Invention
This invention generally relates to an image processing method and device thereof, and especially to an image processing method and device thereof for processing images with running captions.
2. Description of Related Art
A conventional film mode, for example the common recording mode of a movie film, has 24 complete frames per second; the frame rate is therefore 24 frames/s (a playback frequency of 24 Hz). Some other film modes record 30 complete frames per second, a playback frequency of 30 Hz. The conventional broadcasting methods for visual signals, such as cable television and wireless television, generally use broadcasting modes such as NTSC (National Television System Committee) and PAL (Phase Alternating Line). The broadcasting frequency of NTSC is 60 Hz, which means that 60 interlaced frames per second are received at the end-user terminal from the television station. The broadcasting frequency of PAL is 50 Hz. In the interlaced frames, for example, the odd-number frames display images only on scan lines 1, 3, 5, etc. (the odd-number scan lines), and the even-number scan lines do not display any images. In the even-number frames, only scan lines 2, 4, 6, etc. (the even-number scan lines) display images and the odd-number scan lines do not, or vice versa.
Therefore, for a film in the 30 Hz or 24 Hz film mode to be transmitted via the NTSC standard at 60 Hz, a 2:2 pull down or 3:2 pull down process should be performed before the transmission. FIG. 1 is a drawing schematically showing a view of a conventional frame converting format of a 30 Hz film mode with 2:2 pull down. As shown in FIG. 1, each of the frames 101, 102 to 130 represents a complete image frame at 30 frames per second. Frame 101 is divided into a frame 101e, in which only the even-number scan lines appear, and a frame 101o, in which only the odd-number scan lines appear. The other frames are processed in the same way. Therefore, 60 interlaced frames per second, 101e, 101o, 102e, 102o to 130e and 130o, are obtained. The frames with the suffix “e” are those in which only even-number scan lines appear, and the frames with the suffix “o” are those in which only odd-number scan lines appear (the same notation is used below). Thus, the NTSC standard at a broadcasting frequency of 60 Hz can be met.
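The pull down conversion described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `pull_down` and the tuple representation of a field are hypothetical, each frame is taken to start with its "e" field as in the frame labels of the text, and the 3:2 cadence is assumed to give two fields to the first frame and three to the next.

```python
from typing import Iterable, List, Sequence, Tuple

# A field is modeled as (source frame number, "e" or "o"); hypothetical representation.
Field = Tuple[int, str]


def pull_down(frame_ids: Iterable[int], cadence: Sequence[int]) -> List[Field]:
    """Expand complete frames into interlaced fields.

    cadence lists how many fields successive frames contribute,
    e.g. (2, 2) for 2:2 pull down and (2, 3) for 3:2 pull down.
    """
    fields: List[Field] = []
    for index, frame_id in enumerate(frame_ids):
        count = cadence[index % len(cadence)]
        for k in range(count):
            # Alternate e, o, e within one frame, following the labels in the text.
            fields.append((frame_id, "e" if k % 2 == 0 else "o"))
    return fields


# 30 complete frames with 2:2 pull down, and 24 complete frames with 3:2 pull down.
ntsc_from_30 = pull_down(range(101, 131), (2, 2))
ntsc_from_24 = pull_down(range(201, 225), (2, 3))
print(len(ntsc_from_30), ntsc_from_30[:4])
print(len(ntsc_from_24), ntsc_from_24[:5])
```

Both calls yield 60 fields for one second of film, which is the field count the 60 Hz NTSC transmission requires.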
In addition, for an original film in the 24 Hz film mode to be transmitted with the PAL standard at 50 Hz, a 2:2 pull down process should be performed before the transmission. FIG. 3 is a drawing schematically showing a view of the conventional frame converting format of a 24 Hz film mode with 2:2 pull down. As shown in FIG. 3, the frames 301, 302 to 324 represent the complete image frames at 24 frames per second. The first frame 301 is divided into the interlaced frames 301e and 301o. The other frames are processed in the same way. Therefore, the frames 301e, 301o, 302e, 302o to 324e and 324o at 48 frames per second are obtained; with other processes, the PAL standard at a broadcasting frequency of 50 Hz can be met.
Currently, an image playback device with a higher quality playback mode at the end-user terminal, for example a High Definition TV (HDTV), will, in order to obtain better display definition, first detect whether the received frames are interlaced frames processed with 2:2 pull down or 3:2 pull down; then, by the Inverse Telecine (IVTC) process, the interlaced frames are converted into complete frames before they are played back. For example, after the interlaced frames 201e to 224o shown in FIG. 2 are received, the frames 201e and 201o are combined to obtain the complete frame 201, the frames 202e and 202o are combined to obtain the complete frame 202, and the other frames are processed in the same way. Accordingly, the complete frames 201, 202 to 224 at 24 frames per second are obtained. FIG. 4 is a drawing schematically showing a view of a conventional playback mode of a film after the IVTC process for 3:2 pull down. According to the mode of FIG. 4, the complete frame 201 is played two times to obtain the frames 201p and 201p, the frame 202 is played three times to obtain the frames 202p, 202p and 202p, and the other frames are processed in the same way. Finally, 60 frames per second can be played with the playback mode shown in FIG. 4. Compared with the playback mode of a conventional TV, in which 60 interlaced frames per second are played, the frames played with the playback mode shown in FIG. 4 are smooth, without the effects of dropped frames, image displacement or image delay.
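The IVTC and repeat-playback steps above can be illustrated as follows. This is a minimal sketch, not the method of any particular television: the names `weave`, `inverse_telecine`, `repeat_for_display` and `make_field` are hypothetical, the pull down cadence is assumed to be already detected and aligned with the start of the input, and the input is assumed to contain only whole cadence groups.

```python
from typing import List, Sequence, Tuple

Field = Tuple[int, str, List[str]]  # (frame number, "e" or "o", scan lines); hypothetical
Frame = Tuple[int, List[str]]       # (frame number, all scan lines in display order)


def weave(even_lines: List[str], odd_lines: List[str]) -> List[str]:
    """Interleave odd-number lines (1, 3, 5, ...) with even-number lines (2, 4, 6, ...)."""
    lines: List[str] = []
    for odd_line, even_line in zip(odd_lines, even_lines):
        lines.append(odd_line)
        lines.append(even_line)
    return lines


def inverse_telecine(fields: List[Field], cadence: Sequence[int] = (2, 3)) -> List[Frame]:
    """Rebuild one complete frame from each cadence group of fields."""
    frames: List[Frame] = []
    pos, index = 0, 0
    while pos < len(fields):
        group = fields[pos:pos + cadence[index % len(cadence)]]
        even = next(f for f in group if f[1] == "e")
        odd = next(f for f in group if f[1] == "o")
        frames.append((even[0], weave(even[2], odd[2])))
        pos += len(group)
        index += 1
    return frames


def repeat_for_display(frames: List[Frame], cadence: Sequence[int] = (2, 3)) -> List[Frame]:
    """Play each complete frame two or three times to return to the display rate."""
    shown: List[Frame] = []
    for index, frame in enumerate(frames):
        shown.extend([frame] * cadence[index % len(cadence)])
    return shown


def make_field(frame_id: int, parity: str, height: int = 4) -> Field:
    """Build a toy field whose lines are labeled with frame and line number."""
    start = 2 if parity == "e" else 1
    return (frame_id, parity, [f"{frame_id}:L{n}" for n in range(start, height + 1, 2)])


fields = [make_field(201, "e"), make_field(201, "o"),
          make_field(202, "e"), make_field(202, "o"), make_field(202, "e")]
frames = inverse_telecine(fields)
shown = repeat_for_display(frames)
print([frame_id for frame_id, _ in shown])
```

Because both fields of a rebuilt frame come from the same film frame, the woven result contains no lines from a neighboring frame.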
However, in the conventional broadcasting mode of the visual signal, running captions are frequently added. In general, the running captions are added directly to the interlaced broadcasting frames, for example, added directly to the interlaced frames 101e, 101o, 102e, 102o to 130e and 130o as shown in FIG. 1 , or added directly to the interlaced frames as shown in FIG. 2 or FIG. 3 . Next, the interlaced frames with the added running captions are transmitted to the end-user terminal.
An object of the present invention is to provide an image processing method for separating the received frame into at least two different areas and respectively performing different image processes on the different areas, to obtain a better resulted frame.
Another object of the present invention is to provide an image processing device for separating the received frame into at least two different areas and separately performing different image processes on the different areas, to obtain a better resulted frame.
The present invention provides an image processing method comprising the following steps. First, a plurality of frames is received. Next, at least two adjacent frames are compared to obtain an area selection signal. Each of the frames is then separated into at least two areas according to the area selection signal. Next, an image process is performed on each of the areas respectively. Finally, the corresponding processed areas are combined to obtain a resulted frame.
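The sequence of steps above can be laid out as a small Python skeleton. It is a sketch under assumptions rather than the patented implementation: the area selection signal is modeled as one boolean per row, the threshold is an assumed parameter, and `keep` and `negate` are stand-in processes that only make the two processing paths visible.

```python
from typing import Callable, Dict, List

Frame = List[List[int]]     # rows of pixel values; hypothetical representation
Area = Dict[int, List[int]] # row index -> row, so areas can be recombined


def area_selection_signal(previous: Frame, current: Frame, threshold: int) -> List[bool]:
    """Compare two adjacent frames; flag rows whose accumulated difference is large."""
    return [
        sum(abs(a - b) for a, b in zip(row_prev, row_curr)) > threshold
        for row_prev, row_curr in zip(previous, current)
    ]


def separate(frame: Frame, signal: List[bool]):
    """Split a frame into two areas according to the area selection signal."""
    unselected = {y: row for y, row in enumerate(frame) if not signal[y]}
    selected = {y: row for y, row in enumerate(frame) if signal[y]}
    return unselected, selected


def combine(*areas: Area) -> Frame:
    """Put the processed areas back together in row order."""
    merged: Area = {}
    for area in areas:
        merged.update(area)
    return [merged[y] for y in sorted(merged)]


def process_frame(previous: Frame, current: Frame, threshold: int,
                  process_unselected: Callable[[Area], Area],
                  process_selected: Callable[[Area], Area]) -> Frame:
    signal = area_selection_signal(previous, current, threshold)
    unselected, selected = separate(current, signal)
    return combine(process_unselected(unselected), process_selected(selected))


def keep(area: Area) -> Area:
    """Stand-in for the process applied to the unselected area."""
    return dict(area)


def negate(area: Area) -> Area:
    """Stand-in for the process applied to the selected area."""
    return {y: [-v for v in row] for y, row in area.items()}


previous = [[1, 1, 1], [2, 2, 2], [5, 5, 5]]
current = [[1, 1, 1], [2, 2, 2], [9, 0, 9]]
result = process_frame(previous, current, 3, keep, negate)
print(result)
```

Only the last row changes between the two frames, so only that row is routed through the second process before the areas are recombined.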
According to an embodiment, the present invention provides an image processing device comprising, for example, a capture device and a processor. The capture device is adopted for receiving a plurality of frames, and for comparing at least two adjacent frames to obtain an area selection signal according to a differential value there-between. The processor is connected to the capture device for receiving the area selection signal and separating each of the adjacent frames into at least two areas according to the area selection signal. An image processing step is performed on each of the areas respectively. Moreover, the corresponding processed areas are combined to obtain a resulted frame.
In accordance with an embodiment of the present invention, the image processing step comprises, for example, performing an inverse telecine (IVTC) process mode on an area, according to the area selection signal, when that area is created by a pull down mode. The inverse telecine process mode comprises, for example, performing the inverse of the pull down mode. The area created by the pull down mode comprises a frame which is created from a film-mode frame by the pull down mode. The pull down mode comprises a 2:2 pull down mode, a 3:2 pull down mode or another pull down mode with any proportion.
In accordance with an embodiment of the present invention, the method of respectively performing the image processing steps on each of the areas comprises, for example, performing a motion adaptive process mode on an area, according to the area selection signal, when that area is created by an interlace mode. The motion adaptive process mode comprises combining each two adjacent frames to obtain a complete frame. The area created by the interlace mode comprises a running caption.
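The pairing step of the motion adaptive process mode, combining each two adjacent interlaced frames into a complete frame, can be sketched as a sliding window over the field sequence. The names `weave_pair` and `combine_adjacent` are hypothetical, the fields are assumed to alternate strictly between "e" and "o", and only the pairing described in the text is modeled, with no motion detection.

```python
from typing import List, Tuple

Field = Tuple[str, List[str]]  # ("e" or "o", scan lines); hypothetical representation


def weave_pair(first: Field, second: Field) -> List[str]:
    """Combine two adjacent fields of opposite parity into one complete frame."""
    if first[0] == second[0]:
        raise ValueError("adjacent fields must have opposite parity")
    odd = first if first[0] == "o" else second
    even = first if first[0] == "e" else second
    lines: List[str] = []
    for odd_line, even_line in zip(odd[1], even[1]):
        lines.extend([odd_line, even_line])  # lines 1, 2, then 3, 4, ...
    return lines


def combine_adjacent(fields: List[Field]) -> List[List[str]]:
    """Slide over the fields one at a time, weaving every adjacent pair."""
    return [weave_pair(a, b) for a, b in zip(fields, fields[1:])]


caption_fields = [
    ("e", ["a2", "a4"]),
    ("o", ["a1", "a3"]),
    ("e", ["b2", "b4"]),
]
caption_frames = combine_adjacent(caption_fields)
print(caption_frames)
```

Each new field produces a new complete frame, so the output keeps roughly the field rate of the input, which suits a caption that moves on every field.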
In accordance with an embodiment of the present invention, the method of comparing at least two adjacent frames comprises scanning each corresponding vertical line of the two adjacent frames, and comparing the changes of the image positions of the corresponding vertical lines in the two adjacent frames to obtain the area selection signal.
In accordance with another embodiment of the present invention, the method of comparing at least two of the adjacent frames comprises scanning each corresponding specific area block of the two adjacent frames, and comparing the changes of the image positions of the corresponding specific area blocks to obtain the area selection signal.
In accordance with another embodiment of the present invention, the method of obtaining the area selection signal comprises deciding the area selection signal according to a differential value of the frames after at least two of the adjacent frames are compared.
The above is a brief description of some deficiencies in the prior art and advantages of the present invention. Other features, advantages and embodiments of the invention will be apparent to those skilled in the art from the following description, accompanying drawings and appended claims.
In accordance with an embodiment of the present invention, the method of comparing two or more of the adjacent frames comprises, for example, a scanning method. The scanning method includes, for example, horizontal scanning, vertical scanning or area block scanning. In an embodiment of the present invention, the vertical scanning method comprises: first, each corresponding vertical line of at least two of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel of the corresponding vertical lines; the differential values obtained from the subtraction of each pixel in each vertical line are accumulated; and the accumulated differential value of each vertical line is compared.
In accordance with another embodiment of the present invention, the horizontal scanning method comprises: first, each corresponding horizontal line of at least two of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel in the corresponding horizontal lines; the differential values obtained from the subtraction of each pixel in each horizontal line are accumulated; and the accumulated differential value of each horizontal line is compared.
In accordance with another embodiment of the present invention, the area block scanning method comprises: first, each corresponding specific area block of at least two of the adjacent frames is scanned; a subtraction is performed between each corresponding pixel in the corresponding specific area blocks; the differential values obtained from the subtraction of each pixel in each specific area block are accumulated; and the accumulated differential value of each specific area block is compared.
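By way of illustration only, the three scanning methods can be sketched in Python as follows. The function names, the representation of a frame as a list of rows of pixel intensities, and the use of the absolute value of each subtraction (so that positive and negative differences do not cancel when accumulated) are assumptions of this sketch and are not stated in the description.

```python
def line_differences(frame_a, frame_b, direction="vertical"):
    """Accumulate pixel differences per line between two equal-sized frames.

    A frame is a list of rows, each row a list of pixel intensities.
    "vertical" accumulates over each column, "horizontal" over each row.
    Assumption: absolute differences are accumulated.
    """
    rows, cols = len(frame_a), len(frame_a[0])
    if direction == "horizontal":
        return [sum(abs(frame_a[r][c] - frame_b[r][c]) for c in range(cols))
                for r in range(rows)]
    if direction == "vertical":
        return [sum(abs(frame_a[r][c] - frame_b[r][c]) for r in range(rows))
                for c in range(cols)]
    raise ValueError("direction must be 'vertical' or 'horizontal'")


def block_differences(frame_a, frame_b, block_h, block_w):
    """Accumulate pixel differences per area block.

    Returns a dict keyed by (block_row, block_col). The block size is a
    hypothetical parameter; the description only speaks of a specific area block.
    """
    totals = {}
    for r, (row_a, row_b) in enumerate(zip(frame_a, frame_b)):
        for c, (a, b) in enumerate(zip(row_a, row_b)):
            key = (r // block_h, c // block_w)
            totals[key] = totals.get(key, 0) + abs(a - b)
    return totals
```

For two frames that differ only in their bottom row, the horizontal accumulation is non-zero only for that row, while the vertical accumulation is non-zero only for the columns containing the changed pixels.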
When the differential value obtained from any of the above mentioned scanning methods exceeds a predetermined value, for example, an area selection signal 732 can be obtained as shown in FIG. 7. Therefore, the frame can be separated into two areas according to the area selection signal 732. Further, in step 610, different image processes can be performed on the different areas. Furthermore, in step 612, the processed areas are combined to obtain a complete resulted frame.
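The thresholding, separating and combining steps can be sketched as follows. This is a minimal sketch under stated assumptions: the area selection signal is modelled as the index of the first horizontal line whose accumulated difference exceeds the predetermined value, the lines from that index to the bottom of the frame are treated as the second area, and all function names are hypothetical.

```python
def area_selection_signal(line_diffs, threshold):
    """Return the index of the first line whose accumulated difference
    exceeds the threshold, or None when no line does."""
    for index, value in enumerate(line_diffs):
        if value > threshold:
            return index
    return None


def split_frame(frame, boundary_row):
    """Separate a frame (list of rows) into an upper and a lower area."""
    return frame[:boundary_row], frame[boundary_row:]


def process_frame(frame, boundary_row, process_upper, process_lower):
    """Process each area with its own function, then recombine the areas."""
    if boundary_row is None:
        # No signal obtained: the whole frame is handled as a single area.
        return process_upper(frame)
    upper, lower = split_frame(frame, boundary_row)
    return process_upper(upper) + process_lower(lower)
```

For example, `area_selection_signal([0, 0, 0, 8], 4)` returns `3`, and a four-row frame is then split into rows 0 to 2 and row 3 before each part is processed and the results are concatenated.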
Further, as shown in FIG. 8, for the image area, the present invention detects that the interlaced frames 201e to 224o are formed by the 3:2 pull down mode. An inverse telecine (IVTC) mode can be utilized to combine the frames 201e and 201o into a complete frame 801; the frames 202e and 202o are combined into a complete frame 802; and the other frames are processed in the same way. Thus, complete frames 801, 802 to 824 at 24 frames per second can be obtained. Next, in accordance with the 3:2 pull down mode, the obtained complete frame 801 is repeated two times, the frame 802 is repeated three times, and the other frames are processed in the same way. Further, the image frames 801, 801, 802, 802, 802 to 824, 824, 824 and the running caption frames 531 to 580 are combined to obtain complete frames 831 to 880 at 60 frames per second.
Therefore, the image processing method of the present invention not only maintains the high definition of film mode in the original frame, but also obtains clear and easily distinguishable running captions. In addition, the saw-tooth phenomenon occurring in each frame or at the intersection areas between the running captions and the frames can be avoided.
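The combining and repeating steps for the image area can be sketched as follows. The sketch assumes that the even and odd fields belonging to the same film frame have already been paired, as in FIG. 8, and that the even field carries the first line of the complete frame; a practical inverse telecine process would additionally have to detect the pull down cadence. The function names are hypothetical.

```python
def weave(even_field, odd_field):
    """Interleave the lines of an even field and an odd field into one frame."""
    frame = []
    for even_line, odd_line in zip(even_field, odd_field):
        frame.append(even_line)
        frame.append(odd_line)
    return frame


def repeat_3_2(frames):
    """Repeat frames alternately two and three times (3:2 pull down)."""
    output = []
    for index, frame in enumerate(frames):
        output.extend([frame] * (2 if index % 2 == 0 else 3))
    return output
```

Applied to 24 complete frames, `repeat_3_2` yields 12 × 2 + 12 × 3 = 60 frames, which matches the conversion from 24 to 60 frames per second.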
It is important to note that although the illustrated embodiment herein serves to explain the present invention, it is to be understood that the embodiment is presented by way of example and not by way of limitation. In other embodiments of the present invention, for example, the frame is not necessarily separated into two areas. Instead, according to the source mode, the received frame can be separated into at least two different areas, for example, into at least the image areas formed by the pull down mode and the running caption area formed by the interlace mode; further, the different image processes are respectively performed on the areas formed by the different modes.
Further, in accordance with another embodiment of the present invention, for example, when all the received images are formed with one same mode as shown in FIG. 2, that is, formed with the 3:2 pull down mode without the running caption area, the area selection signal 732 as shown in FIG. 7 will not be obtained by the image processing method of the present invention. Therefore, in FIG. 8, the processing mode for the running caption area will not be performed; only the inverse telecine process will be performed on the image area. Further, the complete frame is played by the corresponding pull down mode.
Further, in accordance with another embodiment of the present invention, for example, when the received images are formed with other modes, for example, as shown in FIG. 1 or FIG. 3 or other modes, and a running caption is added, the area selection signal 732 as shown in FIG. 7 will first be obtained by the image processing method of the present invention. Further, in a manner similar to that shown in FIG. 8, a step-by-step process is performed on the running caption area, and the inverse telecine process mode and the corresponding pull down mode are performed on the image area. Furthermore, the image area and the running caption area are combined to obtain a relatively high definition frame.
Further, the present invention provides an image processing device. FIG. 9 is a drawing schematically showing a view of an image processing device according to an embodiment of the present invention. As shown in FIG. 9, the image processing device 900 comprises, for example, a capture device 902 and a processor 904. The capture device 902 is utilized to receive a plurality of frames, to compare at least two adjacent frames and to output an area selection signal which indicates the frame difference. The processor 904, connected to the capture device 902, receives the area selection signal outputted from the capture device 902, separates the received frame into at least two different areas according to the area selection signal, and respectively performs different image process modes on the frame of each area. Further, the frames of all the processed areas are combined to obtain a resulted frame, and the resulted frame is outputted to the display device 906.
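The arrangement of FIG. 9 can be sketched as two cooperating components that operate on a stream of frames. The class names, the row-index form of the area selection signal and the use of a plain list in place of the display device are assumptions made for illustration only.

```python
class CaptureDevice:
    """Receives frames and compares each frame with the previous one."""

    def __init__(self, threshold):
        self.threshold = threshold  # hypothetical predetermined value
        self.previous = None

    def receive(self, frame):
        """Return the first row whose accumulated difference from the
        previous frame exceeds the threshold, or None."""
        signal = None
        if self.previous is not None:
            for row_index, (old_row, new_row) in enumerate(zip(self.previous, frame)):
                difference = sum(abs(a - b) for a, b in zip(old_row, new_row))
                if difference > self.threshold:
                    signal = row_index
                    break
        self.previous = frame
        return signal


class Processor:
    """Separates a frame by the signal, processes each area, recombines them."""

    def __init__(self, image_process, caption_process):
        self.image_process = image_process
        self.caption_process = caption_process

    def process(self, frame, signal):
        if signal is None:
            return self.image_process(frame)
        return self.image_process(frame[:signal]) + self.caption_process(frame[signal:])


def run_device(frames, capture, processor, display):
    """Feed every frame through the capture device and the processor."""
    for frame in frames:
        signal = capture.receive(frame)
        display.append(processor.process(frame, signal))
```

Keeping the comparison in the capture component and the per-area processing in the processor mirrors the division of work between elements 902 and 904; the two process functions are supplied by the caller and stand in for the process modes chosen for each area.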
In summary, in the image processing method and device of the present invention, an area selection signal is obtained according to the received frame; the frame is separated into at least two different areas according to the source mode of the received frame, the source mode being detected according to the area selection signal; further, the different image processes are respectively performed on the different areas. Therefore, the image processing method of the present invention maintains the higher definition of film mode in the original frame and obtains clear and easily distinguishable running captions. In addition, the saw-tooth phenomenon occurring in each frame or at the intersection areas between the running captions and the frames is avoided.
The above description provides a full and complete description of the preferred embodiments of the present invention. Various modifications, alternate constructions, and equivalents may be made by those skilled in the art without changing the scope or spirit of the invention. Accordingly, the above description and illustrations should not be construed as limiting the scope of the invention, which is defined by the following claims.
Claims (20)
1. An image processing method adapted to an image processing device, the image processing method comprising:
receiving a plurality of frames;
comparing at least two adjacent frames to obtain an area selection signal;
separating each of the frames to at least two areas according to the area selection signal, wherein the area selection signal is used for indicating a position of one of the two areas;
respectively performing an image process step to each of the areas; and
combining the corresponding processed areas to obtain a resulted frame.
2. The image processing method of claim 1 , wherein the method of respectively performing the image process step to each of the areas comprises:
according to the area selection signal, an inverse telecine (IVTC) process mode is performed to the area when one of the areas is created by a pull down mode.
3. The image processing method of claim 2 , wherein the inverse telecine process mode comprises performing an inverse process mode of the pull down mode.
4. The image processing method of claim 2 , wherein one of the areas created by the pull down mode comprises a frame which is created from a frame of film mode by the pull down mode.
5. The image processing method of claim 2 , wherein the pull down mode comprises 2:2 pull down mode, 3:2 pull down mode or pull down modes with any proportion.
6. The image processing method of claim 1 , wherein the method of respectively performing the image process step to each of the areas comprises:
according to the area selection signal, a motion adaptive process mode is performed to the area when one of the areas is created by an interlace mode.
7. The image processing method of claim 6, wherein the motion adaptive process mode comprises combining each of the two adjacent frames to obtain a complete frame.
8. The image processing method of claim 1 , wherein one of the areas created by the interlace mode comprises a running caption.
9. The image processing method of claim 1 , wherein a method of comparing at least two frames of the adjacent frames comprises:
scanning each corresponding vertical line of at least the two frames of the adjacent frames;
subtracting each corresponding pixels in at least two of the corresponding vertical lines; and
detecting distribution situation of saw-tooth images.
10. The image processing method of claim 1 , wherein a method of comparing at least two frames of the adjacent frames comprises:
scanning each corresponding horizontal line of at least the two frames of the adjacent frames;
comparing change of image position of the corresponding horizontal line in at least the two adjacent frames; and
detecting distribution situation of saw-tooth image.
11. The image processing method of claim 1 , wherein a method of comparing at least the two adjacent frames comprises:
scanning each corresponding specific area block of at least the two frames of the adjacent frames;
subtracting each corresponding pixel in at least two of the corresponding specific area blocks; and
recording distribution situation of saw-tooth image in each of the specific area blocks.
12. The image processing method of claim 1 , wherein a method of obtaining the area selection signal comprises:
after comparing at least the two frames of the adjacent frames, deciding the area selection signal according to the distribution situation of saw tooth image.
13. An image processing device, comprising:
a capture device, for receiving a plurality of frames, comparing at least two adjacent frames to obtain an area selection signal according to the distribution situation of saw tooth image; and
a processor, connecting to the capture device, for receiving the area selection signal, separating each of the frames of at least the two adjacent frames to at least two areas according to the area selection signal which is used for indicating a position of one of the two areas, respectively performing an image process step to each of the areas, and combining the areas correspondingly processed to obtain a resulted frame.
14. The image processing device of claim 13 , wherein the image process step comprises:
according to the area selection signal, an inverse telecine (IVTC) process mode is performed to the area when one of the areas is created by a pull down mode.
15. The image processing device of claim 14 , wherein the inverse telecine process mode comprises performing an inverse process mode of the pull down mode.
16. The image processing device of claim 14 , wherein one of the areas created by the pull down mode comprises a frame which is created from a frame of film mode by the pull down mode.
17. The image processing device of claim 14 , wherein the pull down mode comprises 2:2 pull down mode, 3:2 pull down mode or pull down modes with any proportion.
18. The image processing device of claim 13 , wherein the method of respectively performing the image process step to each of the areas comprises:
according to the area selection signal, a motion adaptive process mode is performed to the area when one of the areas is created by an interlace mode.
19. The image processing device of claim 18 , wherein the motion adaptive process mode comprises combining each two of the adjacent frames to obtain a complete frame according to the pixel value difference of each corresponding pixel.
20. The image processing device of claim 18 , wherein one of the areas created by the interlace mode comprises a running caption.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/436,710 USRE45306E1 (en) | 2005-04-27 | 2012-03-30 | Image processing method and device thereof |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW094113398A TWI280061B (en) | 2005-04-27 | 2005-04-27 | Image processing method and device thereof |
TW94113398A | 2005-04-27 | ||
US11/161,529 US7688388B2 (en) | 2005-04-27 | 2005-08-08 | Image processing method and device thereof |
US13/436,710 USRE45306E1 (en) | 2005-04-27 | 2012-03-30 | Image processing method and device thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/161,529 Reissue US7688388B2 (en) | 2005-04-27 | 2005-08-08 | Image processing method and device thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
USRE45306E1 true USRE45306E1 (en) | 2014-12-30 |
Family
ID=37234047
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/161,529 Expired - Fee Related US7688388B2 (en) | 2005-04-27 | 2005-08-08 | Image processing method and device thereof |
US13/436,710 Active 2028-08-19 USRE45306E1 (en) | 2005-04-27 | 2012-03-30 | Image processing method and device thereof |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/161,529 Expired - Fee Related US7688388B2 (en) | 2005-04-27 | 2005-08-08 | Image processing method and device thereof |
Country Status (3)
Country | Link |
---|---|
US (2) | US7688388B2 (en) |
KR (1) | KR100753619B1 (en) |
TW (1) | TWI280061B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7705913B2 (en) * | 2005-12-20 | 2010-04-27 | Lsi Corporation | Unified approach to film mode detection |
TWI338502B (en) | 2007-05-15 | 2011-03-01 | Realtek Semiconductor Corp | Interpolation method for image picture and image processing apparatus thereof |
KR20100016741A (en) * | 2008-08-05 | 2010-02-16 | 삼성전자주식회사 | Image processing apparatus and control method thereof |
KR101869735B1 (en) * | 2008-12-19 | 2018-06-21 | 가부시키가이샤 한도오따이 에네루기 켄큐쇼 | Electronic device |
JP6377768B2 (en) * | 2015-01-07 | 2018-08-22 | オリンパス株式会社 | Spectral image acquisition device |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR940003201A (en) | 1992-07-30 | 1994-02-21 | 강진구 | Split Image Coding Method and Apparatus |
US5398071A (en) * | 1993-11-02 | 1995-03-14 | Texas Instruments Incorporated | Film-to-video format detection for digital television |
KR200190466Y1 (en) | 1997-08-30 | 2000-08-01 | 윤종용 | Packing vessel |
US6201577B1 (en) * | 1997-10-10 | 2001-03-13 | Faroudja Laboratories, Inc. | Film source video detection |
KR20010090466A (en) | 2000-03-17 | 2001-10-18 | 니시무로 타이죠 | Method of generating object area information and apparatus thereof, and method of generating approximation polygon and apparatus thereof |
US20040085480A1 (en) * | 2002-09-24 | 2004-05-06 | Sven Salzer | Method and video processing unit for processing a video signal |
US20050253964A1 (en) * | 2004-04-30 | 2005-11-17 | Frank Janssen | Ticker processing in video sequences |
US7170562B2 (en) * | 2003-05-19 | 2007-01-30 | Macro Image Technology, Inc. | Apparatus and method for deinterlace video signal |
US7239353B2 (en) * | 2002-12-20 | 2007-07-03 | Samsung Electronics Co., Ltd. | Image format conversion apparatus and method |
US7446817B2 (en) * | 2004-02-18 | 2008-11-04 | Samsung Electronics Co., Ltd. | Method and apparatus for detecting text associated with video |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR940003201B1 (en) | 1991-03-13 | 1994-04-16 | 황철호 | Apparatus for collecting the sands or the gravels in the river |
-
2005
- 2005-04-27 TW TW094113398A patent/TWI280061B/en active
- 2005-08-08 US US11/161,529 patent/US7688388B2/en not_active Expired - Fee Related
- 2005-09-08 KR KR1020050083625A patent/KR100753619B1/en not_active IP Right Cessation
-
2012
- 2012-03-30 US US13/436,710 patent/USRE45306E1/en active Active
Also Published As
Publication number | Publication date |
---|---|
KR100753619B1 (en) | 2007-08-29 |
US7688388B2 (en) | 2010-03-30 |
TWI280061B (en) | 2007-04-21 |
US20060244823A1 (en) | 2006-11-02 |
KR20060113330A (en) | 2006-11-02 |
TW200638776A (en) | 2006-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100303728B1 (en) | Deinterlacing method of interlaced scanning video | |
US7961253B2 (en) | Method of processing fields of images and related device for data lines similarity detection | |
CN101953150A (en) | Method, apparatus and machine-readable medium for describing video processing | |
JP2011015068A (en) | Still subtitle detection apparatus, visual display equipment for displaying image including still subtitle, and method for processing image including still subtitle | |
US6266093B1 (en) | Color video camera method and system | |
JP3514063B2 (en) | Receiver | |
USRE45306E1 (en) | Image processing method and device thereof | |
US8115867B2 (en) | Image processing device | |
US6947094B2 (en) | Image signal processing apparatus and method | |
EP1531625A1 (en) | Image processing device and method, video display device, and recorded information reproduction device | |
US7898598B2 (en) | Method and apparatus for video mode judgement | |
JP4510874B2 (en) | Composite image detector | |
US7502071B2 (en) | Video information processing apparatus and video information processing method | |
JP2009111442A (en) | Video transmission system and method | |
US8243814B2 (en) | Combing artifacts detection apparatus and combing artifacts detection method | |
US20110298977A1 (en) | Video processing device | |
US8237860B2 (en) | Poor video editing detection system | |
WO2004002148A1 (en) | Motion vector detection device, detection method, motion compensation device, and motion compensation method | |
JPWO2008038442A1 (en) | Color difference signal IP conversion method | |
JP2008061067A (en) | Image display system, reproducing apparatus, and display apparatus | |
JPH07288780A (en) | Television signal processing method | |
US8373798B2 (en) | Text protection device and related motion adaptive de-interlacing device | |
KR20100111467A (en) | Video apparatus for detecting type of video data and method for processing video applying thereto | |
JP2006121568A (en) | Image processor and image processing method | |
JP2010213181A (en) | Telecine video signal detector, video processor, method of detecting telecine video signal, and computer program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); Year of fee payment: 8 |