WO2002104033A1 - Remote video recognition system - Google Patents

Remote video recognition system Download PDF

Info

Publication number
WO2002104033A1
WO2002104033A1 PCT/JP2002/005849 JP0205849W WO02104033A1 WO 2002104033 A1 WO2002104033 A1 WO 2002104033A1 JP 0205849 W JP0205849 W JP 0205849W WO 02104033 A1 WO02104033 A1 WO 02104033A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
recognition system
video recognition
remote video
imaging
Prior art date
Application number
PCT/JP2002/005849
Other languages
French (fr)
Inventor
Norimitu Shirato
Original Assignee
Usc Co., Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Usc Co., Limited filed Critical Usc Co., Limited
Priority to IL15934802A priority Critical patent/IL159348A0/en
Priority to EP02736070A priority patent/EP1400121A1/en
Priority to US10/480,248 priority patent/US20040189876A1/en
Priority to CA002450875A priority patent/CA2450875A1/en
Publication of WO2002104033A1 publication Critical patent/WO2002104033A1/en

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19617Surveillance camera constructional details
    • G08B13/19626Surveillance camera constructional details optical details, e.g. lenses, mirrors or multiple lenses
    • G08B13/19628Surveillance camera constructional details optical details, e.g. lenses, mirrors or multiple lenses of wide angled cameras and camera groups, e.g. omni-directional cameras, fish eye, single units having multiple cameras achieving a wide angle view
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19678User interface
    • G08B13/19684Portable terminal, e.g. mobile phone, used for viewing video remotely
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/698Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
    • H04N7/183Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a single remote source

Definitions

  • the present invention relates to a remote video recognition system applied, for example, to a surveillance system and capable of outputting and displaying a video produced by an imaging means, including projection lenses such as a fish-eye lens and hemispherical reflection mirrors such as a convex mirror, in any desired form at a site remote from where the imaging means is installed.
  • an imaging means including projection lenses such as a fish-eye lens and hemispherical reflection mirrors such as a convex mirror, in any desired form at a site remote from where the imaging means is installed.
  • the present invention has been accomplished after vigorous study and research and provides a remote video recognition system which allows the user at a remote place to select a desired range of a video produced by an imaging means to enlarge or reduce or track a particular person, for example.
  • the present invention provides a remote video recognition system which comprises : an imaging means installed at an arbitrary position; a transmission means for transmitting a video shot by the imaging means; and an output means installed at a position remote from where the imaging means is installed, the output means being adapted to receive video information transmitted from the transmission means and display an image according to the video information; wherein the output means is provided with an input device and an arbitrary range of the displayed image can be selected through the input means and enlarged or reduced for display.
  • the imaging means has an imaging device capable of capturing a curved image with a wide viewing angle.
  • the transmission means can transmit the image information taken in by the imaging means to the output means through a wired or wireless network or the Internet.
  • At least one of the imaging means, the transmission means and the output means has a transformation means for transforming the image information into a plane image
  • the output means has a monitor to which the plane image transformed by the transformation means is output
  • the transformation means calculates sampling points on the curved image according to a projection characteristic of the imaging device and transforms the curved image into the plane image .
  • the transformation means may build a spherical polygon model according to the projection characteristic of the imaging device, match the sampling points on the curved image to vertices of a plurality of triangles making up the polygon model , transform the sampling points into a camera viewing system by a geometry transformation, and perform various projection conversions to transform the curved image produced by the imaging device into the plane image .
  • the transformation means may be provided in a server on the wired or wireless network or the Internet, the image information taken in by the imaging means may be transmitted by the transmission means to the server, stored there and transformed into a plane image, and the output means may connect to the wired or wireless network including the Internet to receive the image and output it to a monitor or the like.
  • the projection characteristic of the imaging device may include parameters associated with a radius of curvature of the imaging device; the projection lens may use a fish-eye lens; and the fish-eye lens may be replaced with a hemispherical reflection mirror such as a convex or concave mirror.
  • the output means may use a computer, a cell phone (including a PHS) , or even a television receiver.
  • Fig. 1 is a schematic block diagram showing a first embodiment of the present invention .
  • Fig. 2 is a block diagram showing a detail of a CCD device .
  • Fig. 3 is a schematic diagram showing a first half of an. action performed by a transformation means.
  • Fig. 4 is a schematic diagram showing a second half of the action performed by the transformation means .
  • Fig. 5A illustrates a curved image produced by a fish- eye lens
  • Fig. 5B illustrates a plane image converted from the curved image by a conversion program.
  • Fig. 6 is a schematic block diagram showing a variation of the first embodiment of the present invention .
  • Fig. 7 is a schematic block diagram showing a second embodiment of the present invention.
  • FIG. 1 shows an example embodiment of the present invention as applied to a surveillance system.
  • the surveillance system of this embodiment includes a CCD device 4 installed at a site being monitored, a transmission means 5 for transmitting a video shot by the CCD device 4, a computer 7 installed in a monitoring center 6 remote from the site where the CCD device 4 is installed, and intended to receive video information transmitted from the transmission means 5 (in this invention the "computer” includes devices with computer functions ; this applies in the following descriptions) , and a monitor 8 making up the computer and displaying the video information received.
  • the CCD device 4 corresponds to the imaging means defined in claims .
  • the word "computer” is intended to include general computers and other devices that incorporate CPU and MPU or the like and perform computer functions .
  • the CCD device 4 as shown in Fig. 2, has a fish-eye lens 1 or an imaging device, an optical filter 2 , an optical lens 3 , and a CCD camera 4.
  • An image produced by the fish- eye lens 1 (curved image) is taken into the CCD device 4 through the optical filter 2 and the optical lens 3.
  • the CCD device 4 is connected to a first controller (CPU) not shown, and thus the curved image taken into the CCD device 4 is sent to the first controller.
  • the curved image refers to an image produced by the fish-eye lens 1.
  • the curved image refers to an image produced by the convex mirror, concave mirror or wide-angle lens .
  • the transmission means 5 is allowed by a second controller (CPU) not shown to connect to the Internet 9 via a telephone line 11, so that image information taken in from the CCD device 4 is transmitted to a server (including ASP) 10 on the Internet 9.
  • the server 10 refers to a service in general that provides software functions via the Internet.
  • the computer in the monitoring center 6 can connect to the server 10 through a communication device including a terminal adapter (TA) .
  • the computer 7 receives the image information from the server 10 and displays it on the monitor 8 that makes up the computer 7.
  • a conversion means (not shown) to transform the image information is incorporated in a computer not shown which is installed in the server 10.
  • a LAN and a wireless communication may be used in addition to the public switched telephone network .
  • the conversion means is a conversion program to transform the curved image taken into the CCD device 4 into a plane image.
  • the plane image refers to an image that is presented to user's eye.
  • the conversion program is intended to transform the curved image produced by the fish-eye lens 1 into a plane image by calculating sampling points on the curved image based on projection characteristics of the fish-eye lens 1. That is, the process of converting a curved image formed by the imaging device into a plane image involves building a spherical polygon model based on projection characteristics including those associated with a radius of curvature of the fish-eye lens 1, matching the sampling points on the curved image to the vertices of a plurality of triangles making up the polygon model, converting them into a camera viewing field system by a geometry conversion and then performing a variety of projection conversions.
  • the projection characteristic of the fish-eye lens 1 has a hemispherical or pyramidal shape, as shown in Fig. 3.
  • a sampling model marked with sampling points is imaged by the fish-eye lens 1.
  • the sampling model imaged by the fish-eye lens 1 is shot as by the CCD device to determine destination points on the plane image (two-dimensional image) that are projected from the sampling points .
  • a polygon model with the same or approximately the same shape as the sampling model is built on the computer.
  • the sampling points on the sampling model imaged by the fish-eye lens 1 are matched to corresponding vertices on the polygon model. In this way the projection characteristic of the fish-eye lens 1 is obtained.
  • This projection characteristic need only be determined once. If the projection characteristic of the fish-eye lens 1 is already known, the vertices on the polygon model that match the corresponding sampling points are calculated without using the sampling model .
  • the vertices of a plurality of triangles on the polygon model are geometry-transformed into a camera coordinate system where they are subjected to various projection processing such as parallel projection and perspective projection to determine destination pixels on the plane that are projected from the vertices.
  • various projection processing such as parallel projection and perspective projection to determine destination pixels on the plane that are projected from the vertices.
  • triangular sampling areas on the curved image that correspond to the triangular areas on the plane determined as described above are deformed as required. That is, for each pixel in the triangular areas on the plane, a pixel on the curved image that should be referenced is determined.
  • the sampling points on the sampling model imaged by the fish-eye lens 1 are matched to the corresponding vertices of a plurality of triangles making up the polygon model and then geometry- transformed into a camera viewing field system where they are subjected to the above-described projection processing. That is, if a virtual camera with a line of sight, a viewing angle, a clipping plane and a banking angle is assumed to be placed in a three-dimensional space and one looks through the camera positioned at an origin of the polygon model, then he or she can see a plane image that was converted from the curved image.
  • this conversion procedure consists in determining destination points on the two-dimensional image that are projected from the sampling points through the virtual camera and, as shown in Fig. 4, filling the triangular area of each sampling point without a gap by the affine trans ormation.
  • This process obviates the need for performing complex calculations for each of a large number of pixels as required by the conventional method, and enables a fast conversion.
  • the plane image produced by the method of this embodiment is an approximated image. It can be made close to an actual image as by increasing the number of polygons in the polygon model to make it more resemble the sampling model or improving the density of the polygon model, as required.
  • the fish-eye lens 1 produces an image (curved image) as shown in Fig. 5A.
  • This curved image is taken in by the CCD device 4 which transfers the image to the computer 7 in the server 10 through the Internet 9.
  • the curved image is then transformed into a plane image by the conversion program in the computer.
  • the burden on the computer can be alleviated by performing the curved image-to-plane image transformation through a predetermined angle (for example, 90 degrees) at a time. As described earlier, this transformation is processed at higher speed than in the conventional method.
  • the computer 7 serving as the output means connects to the server 10 via the Internet 9 to receive the transformed plane image .
  • the plane image transferred to the computer 7 is then displayed on the monitor 8 of the computer 7.
  • Fig. 5B shows a plane image displayed on the monitor 8.
  • the user can select any desired range of the plane image output to the monitor 8 by using an input means of the computer 7 , such as a mouse , and enlarge or reduce the selected range for display.
  • an input means of the computer 7 such as a mouse
  • This configuration can be realized by adopting a conventionally known method.
  • a desired range of the plane image, which was selected by the input means such as a mouse and displayed on the monitor 8 can be tracked. This tracking can be performed, for example, by transmitting the data of the selected range to the computer 7 on the server 10 and by having the computer select a portion of the curved image corresponding to the selected range of image data and transmit it to the computer 7.
  • the computer 7 on the server 10 is provided with a storage means (e.g., a hard disk or semiconductor memory) to store the image data supplied and with a decision program that determines and selects that portion of the curved image transmitted successively from the transmission means 5 which matches the selected range of the image data.
  • a storage means e.g., a hard disk or semiconductor memory
  • This decision program may be chosen from those available on the market in the form of software stored in a RAM (Random Access Memory) .
  • a video of almost the entire area around the CCD device 4 which was captured by a single fish-eye lens or by the fish-eye lens 1 and the single CCD device 4 can be displayed on the monitor 8 Hence, there is no need to install a large number of surveillance cameras as required in the conventional surveillance system. Further, in this embodiment, since the video is taken in via the Internet 9 , if the monitoring center 6 is located in a place or country remote from where the CCD device 4 is installed, the surveillance can be performed with ease .
  • the image taken in by the CCD device 4 can be subjected to additional processing by the computer in the server 10, for example, to make the face of a suspicious man more identifiable. This widens the range of applications.
  • the ability to enlarge or reduce a selected range of the video and to track a selected range (e.g., a particular person) as described above can improve the surveillance function.
  • This invention may also be applied to the broadcast of baseball and soccer games, whereby a viewer can choose a particular player (e.g., a forward) and enjoy a video showing that player always at the center.
  • a so-called one-source-multiscreen system may be built whereby, although the same original video is received, a receiver on the viewer side can pick up a desired image portion so that each viewer can enjoy his or her own image or multiple, different images at the same time. This can realize a personal image system with ease and at low cost without requiring a huge facility investment.
  • the transformation means has been described to be installed in the computer on the server 10.
  • the transformation means (conversion program) may be provided in the first controller making up the CCD device 4 , or the second controller making up the transmission means 5, or the computer 7 making up the output means .
  • the conversion program may be installed in the first controller forming the CCD device 4 or in the second controller forming the transmission means 5. It may of course be installed in the computer 7 forming the output means .
  • the configuration shown in Fig. 6 represents a case where the conversion program is installed in the computer 7 forming the output means.
  • the image data (curved image) transmitted via the transmission means 5 to the computer 7 is transformed by the computer 7 into a plane image which is then output to the monitor 8.
  • the conversion program may be installed in any of the devices 4, 5, 7 making up this system.
  • Fig. 7 shows a second embodiment of the present invention.
  • the monitoring center 6 of the first embodiment is not provided and the video captured by the CCD device 4 is displayed on a cell phone or mobile terminal 12.
  • the cell phone or mobile terminal 12 must be able to connect to the Internet.
  • Other configurations and operations are similar to those of the first embodiment.
  • the CCD device 4 may be installed in a kindergarten or home so that a mother working in an office can watch her child in the kindergarten or home by using her cell phone 12.
  • the CCD device 4 may be installed in a kindergarten or home so that a mother working in an office can watch her child in the kindergarten or home by using her cell phone 12.
  • a memory in the computer at home or a server on an internet service provider side it is possible not only to realize the realtime observation but also to view a recorded and stored video by fast-feeding it.
  • this embodiment has an excellent versatility.
  • the cell phone 12 may be replaced with a portable computer or PDA. It is also possible to use a PHS or even a TV receiver in the place of the cell phone.
  • a desired portion of the image is selected from the plane image output to the monitor 8.
  • This invention is not limited to this arrangement, and the desired portion of the image may be selected from the curved image captured by the imaging means such as the CCD device 4 or from the image transmitted by the transmission means. Further, the selected portion of the image does not need to be transformed into a plane image and may be recorded and stored as is . The selected portion of the image may also be tracked and then recorded and stored without converting it into the plane image .
  • the present invention is configured and operates as described above, a portion of the image that is selected using an input means can be enlarged or reduced, or tracked down .
  • This not only improves the surveillance function but also allows the application of the invention to the television broadcast whereby a viewer can select a particular person and watch the TV showing that person at the center of the screen.
  • This invention can also realize a one-source- multiscreen system, thus offering new ways to enjoy the TV broadcasting.
  • the use of a curved image shooting means can realize fast image transformation processing. This invention therefore can be applied at low cost to a variety of equipment including a surveillance system.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Image Processing (AREA)
  • Studio Devices (AREA)

Abstract

A remote video recognition system is provided which realizes a new, one-source-multiscreen image reception system. The system comprises an imaging means such as a CCD device (4), a transmission means (5) for transmitting a video shot by the imaging means, and a display means (8) installed in a remote place (6) to receive video information transmitted from the transmission means and display the video information. The imaging means has a fish-eye lens (1) that can capture a curved image with a wide viewing angle. The transmission means can transmit the video shot by the imaging means to a desired location via the Internet (9). The imaging means, the transmission means or the display means (8) is provided with an input device that can select and specify an arbitrary range or area of the image. The selected arbitrary range or area of the image is automatically tracked.

Description

DESCRIPTION
Remote Video Recognition System
BACKGROUND OF THE INVENTION Technical Field of the Invention
The present invention relates to a remote video recognition system applied, for example, to a surveillance system and capable of outputting and displaying a video produced by an imaging means, including projection lenses such as a fish-eye lens and hemispherical reflection mirrors such as a convex mirror, in any desired form at a site remote from where the imaging means is installed. Background of the Invention
Significant advances have been made in the computer- related technologies in recent years. Particularly, technologies (software) associated with computer graphics have progressed remarkably as the processing speed of the computer itself and the storage capacities are increasing. The use of graphics processing software allows figures or images taken into the computer to be not only enlarged or reduced but also deformed arbitrarily. Deforming or morphing an image taken into the computer currently involves breaking down the image into pixels and performing complicated computations on each of the pixels to produce a desired deformed image.
However, since the image deformation processing described above requires performing complex calculations for each pixel, its practical application has been limited to drawing. When such image deformation processing is to be applied, however, the conventional processing software, which must perform complex calculations on each pixel, inevitably increases the amount of time required. That is why it has found only a limited use in everyday life. If the image deformation processing can be done quickly, an image shot by a hemispherical reflection mirror such as a convex mirror or a fish-eye lens, which can cover a wider viewing area than ordinary lenses, can be converted into a plane image for display. It is thus considered possible to build a surveillance system that can monitor a wide observation area with a smaller number of imaging operations .
Based on this technology, it is also considered possible to build another technology for easily monitoring sites from a remote place. In a field of broadcast, too, based on the above technology it is further considered possible with a limited number of components to track and display only a particular person or enlarge or reduce an output view of a particular area being monitored.
Under these circumstances the present invention has been accomplished after vigorous study and research and provides a remote video recognition system which allows the user at a remote place to select a desired range of a video produced by an imaging means to enlarge or reduce or track a particular person, for example.
SUMMARY OF THE INVENTION
To achieve the above objective, the present invention provides a remote video recognition system which comprises : an imaging means installed at an arbitrary position; a transmission means for transmitting a video shot by the imaging means; and an output means installed at a position remote from where the imaging means is installed, the output means being adapted to receive video information transmitted from the transmission means and display an image according to the video information; wherein the output means is provided with an input device and an arbitrary range of the displayed image can be selected through the input means and enlarged or reduced for display.
In another aspect of this invention, after the arbitrary range of the image information obtained is selected through the input means, a portion of the image that is within the selected range is automatically tracked down and displayed.
In still another aspect of the invention, the imaging means has an imaging device capable of capturing a curved image with a wide viewing angle. The transmission means, as claimed in claim 4 , can transmit the image information taken in by the imaging means to the output means through a wired or wireless network or the Internet.
In a further aspect of this invention, at least one of the imaging means, the transmission means and the output means has a transformation means for transforming the image information into a plane image, the output means has a monitor to which the plane image transformed by the transformation means is output, and the transformation means calculates sampling points on the curved image according to a projection characteristic of the imaging device and transforms the curved image into the plane image .
In a further aspect of this invention, the transformation means may build a spherical polygon model according to the projection characteristic of the imaging device, match the sampling points on the curved image to vertices of a plurality of triangles making up the polygon model , transform the sampling points into a camera viewing system by a geometry transformation, and perform various projection conversions to transform the curved image produced by the imaging device into the plane image .
In a further aspect of this invention, rather than being provided in at least one of the imaging means, the transmission means and the output means, the transformation means may be provided in a server on the wired or wireless network or the Internet, the image information taken in by the imaging means may be transmitted by the transmission means to the server, stored there and transformed into a plane image, and the output means may connect to the wired or wireless network including the Internet to receive the image and output it to a monitor or the like.
In a further aspect of this invention, the projection characteristic of the imaging device may include parameters associated with a radius of curvature of the imaging device; the projection lens may use a fish-eye lens; and the fish-eye lens may be replaced with a hemispherical reflection mirror such as a convex or concave mirror.
In a further aspect of this invention, the output means may use a computer, a cell phone (including a PHS) , or even a television receiver.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a schematic block diagram showing a first embodiment of the present invention .
Fig. 2 is a block diagram showing a detail of a CCD device .
Fig. 3 is a schematic diagram showing a first half of an. action performed by a transformation means.
Fig. 4 is a schematic diagram showing a second half of the action performed by the transformation means .
Fig. 5A illustrates a curved image produced by a fish- eye lens and Fig. 5B illustrates a plane image converted from the curved image by a conversion program.
Fig. 6 is a schematic block diagram showing a variation of the first embodiment of the present invention .
Fig. 7 is a schematic block diagram showing a second embodiment of the present invention.
BEST MODE FOR CARRYING OUT THE INVENTION Fig. 1 shows an example embodiment of the present invention as applied to a surveillance system. The surveillance system of this embodiment includes a CCD device 4 installed at a site being monitored, a transmission means 5 for transmitting a video shot by the CCD device 4, a computer 7 installed in a monitoring center 6 remote from the site where the CCD device 4 is installed, and intended to receive video information transmitted from the transmission means 5 (in this invention the "computer" includes devices with computer functions ; this applies in the following descriptions) , and a monitor 8 making up the computer and displaying the video information received. The CCD device 4 corresponds to the imaging means defined in claims . In this specification, the word "computer" is intended to include general computers and other devices that incorporate CPU and MPU or the like and perform computer functions .
The CCD device 4, as shown in Fig. 2, has a fish-eye lens 1 or an imaging device, an optical filter 2 , an optical lens 3 , and a CCD camera 4. An image produced by the fish- eye lens 1 (curved image) is taken into the CCD device 4 through the optical filter 2 and the optical lens 3. The CCD device 4 is connected to a first controller (CPU) not shown, and thus the curved image taken into the CCD device 4 is sent to the first controller. The curved image refers to an image produced by the fish-eye lens 1. It is noted, however, that where a convex mirror, concave mirror or wide-angle lens is used in place of the fish-eye lens 1 as described later, the curved image refers to an image produced by the convex mirror, concave mirror or wide-angle lens .
The transmission means 5 is allowed by a second controller (CPU) not shown to connect to the Internet 9 via a telephone line 11, so that image information taken in from the CCD device 4 is transmitted to a server (including ASP) 10 on the Internet 9. The server 10 refers to a service in general that provides software functions via the Internet. The computer in the monitoring center 6 can connect to the server 10 through a communication device including a terminal adapter (TA) . The computer 7 receives the image information from the server 10 and displays it on the monitor 8 that makes up the computer 7. In this embodiment, a conversion means (not shown) to transform the image information is incorporated in a computer not shown which is installed in the server 10. For a connection with the Internet 9, a LAN and a wireless communication may be used in addition to the public switched telephone network .
The conversion means is a conversion program to transform the curved image taken into the CCD device 4 into a plane image. The plane image refers to an image that is presented to user's eye.
The conversion program is intended to transform the curved image produced by the fish-eye lens 1 into a plane image by calculating sampling points on the curved image based on projection characteristics of the fish-eye lens 1. That is, the process of converting a curved image formed by the imaging device into a plane image involves building a spherical polygon model based on projection characteristics including those associated with a radius of curvature of the fish-eye lens 1, matching the sampling points on the curved image to the vertices of a plurality of triangles making up the polygon model, converting them into a camera viewing field system by a geometry conversion and then performing a variety of projection conversions.
The projection characteristic of the fish-eye lens 1 has a hemispherical or pyramidal shape, as shown in Fig. 3. A sampling model marked with sampling points is imaged by the fish-eye lens 1. The sampling model imaged by the fish-eye lens 1 is shot as by the CCD device to determine destination points on the plane image (two-dimensional image) that are projected from the sampling points . Based on the destination points, a polygon model with the same or approximately the same shape as the sampling model is built on the computer. Then, the sampling points on the sampling model imaged by the fish-eye lens 1 are matched to corresponding vertices on the polygon model. In this way the projection characteristic of the fish-eye lens 1 is obtained. This projection characteristic need only be determined once. If the projection characteristic of the fish-eye lens 1 is already known, the vertices on the polygon model that match the corresponding sampling points are calculated without using the sampling model .
Based on the projection characteristic of the fish-eye lens 1 thus obtained, the vertices of a plurality of triangles on the polygon model are geometry-transformed into a camera coordinate system where they are subjected to various projection processing such as parallel projection and perspective projection to determine destination pixels on the plane that are projected from the vertices. Next, triangular sampling areas on the curved image that correspond to the triangular areas on the plane determined as described above are deformed as required. That is, for each pixel in the triangular areas on the plane, a pixel on the curved image that should be referenced is determined. These processing are not precise calculations but approximations, which can improve the processing speed.
In other words , as shown in Fig . 3 , the sampling points on the sampling model imaged by the fish-eye lens 1 are matched to the corresponding vertices of a plurality of triangles making up the polygon model and then geometry- transformed into a camera viewing field system where they are subjected to the above-described projection processing. That is, if a virtual camera with a line of sight, a viewing angle, a clipping plane and a banking angle is assumed to be placed in a three-dimensional space and one looks through the camera positioned at an origin of the polygon model, then he or she can see a plane image that was converted from the curved image. In practice, this conversion procedure consists in determining destination points on the two-dimensional image that are projected from the sampling points through the virtual camera and, as shown in Fig. 4, filling the triangular area of each sampling point without a gap by the affine trans ormation. This process obviates the need for performing complex calculations for each of a large number of pixels as required by the conventional method, and enables a fast conversion. The plane image produced by the method of this embodiment is an approximated image. It can be made close to an actual image as by increasing the number of polygons in the polygon model to make it more resemble the sampling model or improving the density of the polygon model, as required.
This embodiment constructed as described above operates as follows. The fish-eye lens 1 produces an image (curved image) as shown in Fig. 5A. This curved image is taken in by the CCD device 4 which transfers the image to the computer 7 in the server 10 through the Internet 9. The curved image is then transformed into a plane image by the conversion program in the computer. The burden on the computer can be alleviated by performing the curved image-to-plane image transformation through a predetermined angle (for example, 90 degrees) at a time. As described earlier, this transformation is processed at higher speed than in the conventional method. The computer 7 serving as the output means connects to the server 10 via the Internet 9 to receive the transformed plane image . The plane image transferred to the computer 7 is then displayed on the monitor 8 of the computer 7. Fig. 5B shows a plane image displayed on the monitor 8.
In this embodiment, the user can select any desired range of the plane image output to the monitor 8 by using an input means of the computer 7 , such as a mouse , and enlarge or reduce the selected range for display. This configuration can be realized by adopting a conventionally known method. Further, a desired range of the plane image, which was selected by the input means such as a mouse and displayed on the monitor 8, can be tracked. This tracking can be performed, for example, by transmitting the data of the selected range to the computer 7 on the server 10 and by having the computer select a portion of the curved image corresponding to the selected range of image data and transmit it to the computer 7. For this purpose, the computer 7 on the server 10 is provided with a storage means (e.g., a hard disk or semiconductor memory) to store the image data supplied and with a decision program that determines and selects that portion of the curved image transmitted successively from the transmission means 5 which matches the selected range of the image data. This decision program may be chosen from those available on the market in the form of software stored in a RAM (Random Access Memory) .
As can be seen from Fig. 5, in this embodiment, a video of almost the entire area around the CCD device 4 which was captured by a single fish-eye lens or by the fish-eye lens 1 and the single CCD device 4 can be displayed on the monitor 8 Hence, there is no need to install a large number of surveillance cameras as required in the conventional surveillance system. Further, in this embodiment, since the video is taken in via the Internet 9 , if the monitoring center 6 is located in a place or country remote from where the CCD device 4 is installed, the surveillance can be performed with ease .
The image taken in by the CCD device 4 can be subjected to additional processing by the computer in the server 10, for example, to make the face of a suspicious man more identifiable. This widens the range of applications.
Further, the ability to enlarge or reduce a selected range of the video and to track a selected range (e.g., a particular person) as described above can improve the surveillance function. This invention may also be applied to the broadcast of baseball and soccer games, whereby a viewer can choose a particular player (e.g., a forward) and enjoy a video showing that player always at the center. Alternatively, a so-called one-source-multiscreen system may be built whereby, although the same original video is received, a receiver on the viewer side can pick up a desired image portion so that each viewer can enjoy his or her own image or multiple, different images at the same time. This can realize a personal image system with ease and at low cost without requiring a huge facility investment.
Although in the above embodiment, a fish-eye lens is used for the imaging device, a hemispherical reflection mirror such as a concave or convex mirror can be employed. Further, in the first embodiment described above, the transformation means has been described to be installed in the computer on the server 10. In this invention, the transformation means (conversion program) may be provided in the first controller making up the CCD device 4 , or the second controller making up the transmission means 5, or the computer 7 making up the output means . In other words , rather than being provided in the computer on the server 10 as described above, the conversion program may be installed in the first controller forming the CCD device 4 or in the second controller forming the transmission means 5. It may of course be installed in the computer 7 forming the output means . When this configuration is adopted, there is no need to provide the server 10 as shown in Fig. 6. The configuration shown in Fig. 6 represents a case where the conversion program is installed in the computer 7 forming the output means. In this configuration, the image data (curved image) transmitted via the transmission means 5 to the computer 7 is transformed by the computer 7 into a plane image which is then output to the monitor 8. As mentioned earlier, in this invention the conversion program may be installed in any of the devices 4, 5, 7 making up this system.
Fig. 7 shows a second embodiment of the present invention. In this embodiment the monitoring center 6 of the first embodiment is not provided and the video captured by the CCD device 4 is displayed on a cell phone or mobile terminal 12. Hence , the cell phone or mobile terminal 12 must be able to connect to the Internet. Other configurations and operations are similar to those of the first embodiment.
With this embodiment, a variety of applications are conceivable. For example, the CCD device 4 may be installed in a kindergarten or home so that a mother working in an office can watch her child in the kindergarten or home by using her cell phone 12. In this case, by using a memory in the computer at home or a server on an internet service provider side, it is possible not only to realize the realtime observation but also to view a recorded and stored video by fast-feeding it. Further, at kindergartens and homes, it is possible to build an observation system by simply installing the CCD device 4 and necessary software. Therefore, this embodiment has an excellent versatility.
In this invention, the cell phone 12 may be replaced with a portable computer or PDA. It is also possible to use a PHS or even a TV receiver in the place of the cell phone.
In the above embodiment, we have described an example case where a desired portion of the image is selected from the plane image output to the monitor 8. This invention is not limited to this arrangement, and the desired portion of the image may be selected from the curved image captured by the imaging means such as the CCD device 4 or from the image transmitted by the transmission means. Further, the selected portion of the image does not need to be transformed into a plane image and may be recorded and stored as is . The selected portion of the image may also be tracked and then recorded and stored without converting it into the plane image .
INDUSTRIAL APPLICABILITY Since the present invention is configured and operates as described above, a portion of the image that is selected using an input means can be enlarged or reduced, or tracked down . This not only improves the surveillance function but also allows the application of the invention to the television broadcast whereby a viewer can select a particular person and watch the TV showing that person at the center of the screen. This invention can also realize a one-source- multiscreen system, thus offering new ways to enjoy the TV broadcasting. Further, the use of a curved image shooting means can realize fast image transformation processing. This invention therefore can be applied at low cost to a variety of equipment including a surveillance system.

Claims

1. A remote video recognition system comprising: an imaging means with a wide viewing angle installed at an arbitrary position; a transmission means for transmitting a video shot by the imaging means ; and an output means installed at a position remote from where the imaging means is installed, the output means being adapted to receive video information transmitted from the transmission means and display an image according to the video information; wherein the output means is provided with an input device and an arbitrary range or area of the displayed image can be selected through the input means and enlarged or reduced for display.
2. A remote video recognition system according to claim 1, wherein, after the arbitrary range or area of the image information obtained is selected through the input means , a portion of the image that is within the selected range is manually or automatically tracked down by a means such as a mouse or cursor and displayed.
3. A remote video recognition system according to claim 1 or 2 , wherein the imaging means has an imaging device capable of capturing a curved image with a wide viewing angl ,
4. A remote video recognition system according to any one of claims 1 to 3, wherein the transmission means can transmit the image information taken in by the imaging means to the output means through a wired or wireless network.
5. A remote video recognition system according to any one of claims 1 to 4 , wherein at least one of the imaging means, the transmission means and the output means has a transformation means for transforming the image information into a plane image, and the transformation means calculates sampling points on the curved image according to a projection characteristic of the imaging device and transforms the curved image into the plane image.
6. A remote video recognition system according to claim 5 , wherein the transformation means builds a spherical polygon model according to the projection characteristic of the imaging device, matches the sampling points on the curved image to vertices of a plurality of triangles making up the polygon model, transforms the sampling points into a camera viewing system by a geometry transformation, and performs various projection conversions and rasterization to transform the curved image produced by the imaging device into the plane image .
7. A remote video recognition system according to claim 5 or 6, wherein, rather than being provided in at least one of the imaging means, the transmission means and the output means , the transformation means is provided in a
- IS server on the wired or wireless network, the image information taken in by the imaging means can be stored in the server and transformed into a plane image, and the output means connects to the wired or wireless network including the Internet to receive the image and output it to a monitor or the like .
8. A remote video recognition system according to any one of claims 1 to 7, wherein the projection characteristic of the imaging device includes parameters associated with a radius of curvature of the imaging device .
9. A remote video recognition system according to any one of claims 1 to 8 , wherein the imaging device is a hemispherical reflection mirror or a projection lens.
10. A remote video recognition system according to claim 9, wherein the projection lens is a fish-eye lens.
11. A remote video recognition system according to claim 9, wherein the hemispherical mirror is a convex mirror or a concave mirror .
12. A remote video recognition system according to any one of claims 1 to 11, wherein the output means is a computer.
13. A remote video recognition system according to any one of claims 1 to 11 , wherein the output means is a cell phone .
14. A remote video recognition system according to any one of claims 1 to 11, wherein the output means is a television receiver,
PCT/JP2002/005849 2001-06-13 2002-06-12 Remote video recognition system WO2002104033A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
IL15934802A IL159348A0 (en) 2001-06-13 2002-06-12 Remote video recognition system
EP02736070A EP1400121A1 (en) 2001-06-13 2002-06-12 Remote video recognition system
US10/480,248 US20040189876A1 (en) 2001-06-13 2002-06-12 Remote video recognition system
CA002450875A CA2450875A1 (en) 2001-06-13 2002-06-12 Remote video recognition system

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2001-178524 2001-06-13
JP2001178524 2001-06-13
JP2002-167364 2002-06-07
JP2002167364A JP2003069990A (en) 2001-06-13 2002-06-07 Remote video recognition system

Publications (1)

Publication Number Publication Date
WO2002104033A1 true WO2002104033A1 (en) 2002-12-27

Family

ID=26616831

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2002/005849 WO2002104033A1 (en) 2001-06-13 2002-06-12 Remote video recognition system

Country Status (6)

Country Link
US (1) US20040189876A1 (en)
EP (1) EP1400121A1 (en)
JP (1) JP2003069990A (en)
CA (1) CA2450875A1 (en)
IL (1) IL159348A0 (en)
WO (1) WO2002104033A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3950776B2 (en) * 2002-09-30 2007-08-01 株式会社日立国際電気 Video distribution system and video conversion device used therefor
WO2004066191A2 (en) * 2003-01-16 2004-08-05 Intelitrac, Inc. A method and or system to perform automated facial recognition and comparison using multiple 2d facial images parsed from a captured 3d facial image
US7505610B2 (en) * 2003-08-07 2009-03-17 Intelitroc Inc. Integrated portable identification and verification device
JP2005303796A (en) * 2004-04-14 2005-10-27 Kazumasa Sasaki Broadcast system and image reproducing device
CN100440971C (en) * 2005-10-31 2008-12-03 北京中星微电子有限公司 Camera device based on Internet
ES2452366T3 (en) * 2007-02-14 2014-04-01 Photint Venture Group Inc. Method and system to join images
US8649573B1 (en) * 2010-06-14 2014-02-11 Adobe Systems Incorporated Method and apparatus for summarizing video data
GB2500417B8 (en) 2012-03-21 2017-06-07 Sony Computer Entertainment Europe Ltd Camera device, system and method of imaging
KR101586249B1 (en) 2013-12-24 2016-01-18 (주)에프엑스기어 Apparatus and method for processing wide viewing angle image
JP6564939B2 (en) * 2015-05-29 2019-08-21 エフエックスギア インコーポレイティッドFxgear Inc. Wide viewing angle image processing apparatus and method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5185667A (en) * 1991-05-13 1993-02-09 Telerobotics International, Inc. Omniview motionless camera orientation system
EP0637168A1 (en) * 1993-07-27 1995-02-01 Canon Kabushiki Kaisha Control device for image input apparatus
US5790182A (en) * 1996-08-05 1998-08-04 Interval Research Corp. System and method for panoramic imaging using concentric spherical mirrors
EP0920211A2 (en) * 1997-12-01 1999-06-02 Lsi Card Corporation A method of forming a panoramic image
WO2000008889A1 (en) * 1998-08-07 2000-02-17 Be Here, Inc. Method and apparatus for electronically distributing motion panoramic images
WO2001005154A1 (en) * 1999-06-25 2001-01-18 Matsushita Electric Industrial Co., Ltd. All-around video output method and device

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675386B1 (en) * 1996-09-04 2004-01-06 Discovery Communications, Inc. Apparatus for video access and control over computer network, including image correction
JPH0955930A (en) * 1995-08-17 1997-02-25 Omunibiyuu Syst:Kk Image monitor device
US6157747A (en) * 1997-08-01 2000-12-05 Microsoft Corporation 3-dimensional image rotation method and apparatus for producing image mosaics
JP3126955B2 (en) * 1999-02-12 2001-01-22 株式会社アドバネット Arithmetic unit for image conversion
US6728763B1 (en) * 2000-03-09 2004-04-27 Ben W. Chen Adaptive media streaming server for playing live and streaming media content on demand through web client's browser with no additional software or plug-ins
JP2002064737A (en) * 2000-08-23 2002-02-28 Rekoode Onkyo:Kk Automated exploration/tracking camera system
US6486943B1 (en) * 2000-09-19 2002-11-26 The Schepens Eye Research Institute, Inc. Methods and apparatus for measurement and correction of optical aberration

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5185667A (en) * 1991-05-13 1993-02-09 Telerobotics International, Inc. Omniview motionless camera orientation system
EP0637168A1 (en) * 1993-07-27 1995-02-01 Canon Kabushiki Kaisha Control device for image input apparatus
US5790182A (en) * 1996-08-05 1998-08-04 Interval Research Corp. System and method for panoramic imaging using concentric spherical mirrors
EP0920211A2 (en) * 1997-12-01 1999-06-02 Lsi Card Corporation A method of forming a panoramic image
WO2000008889A1 (en) * 1998-08-07 2000-02-17 Be Here, Inc. Method and apparatus for electronically distributing motion panoramic images
WO2001005154A1 (en) * 1999-06-25 2001-01-18 Matsushita Electric Industrial Co., Ltd. All-around video output method and device

Also Published As

Publication number Publication date
US20040189876A1 (en) 2004-09-30
CA2450875A1 (en) 2002-12-27
IL159348A0 (en) 2004-06-01
EP1400121A1 (en) 2004-03-24
JP2003069990A (en) 2003-03-07

Similar Documents

Publication Publication Date Title
CN113038111B (en) Method, system and medium for image capture and processing
US8723951B2 (en) Interactive wide-angle video server
CN109194923B (en) Video image processing device, system and method based on local non-uniform resolution
US6219089B1 (en) Method and apparatus for electronically distributing images from a panoptic camera system
EP1028389A2 (en) Arithmetic unit for image transformation
WO2020063547A1 (en) Spherical image processing method and apparatus, and server
CN103562791A (en) Apparatus and method for panoramic video imaging with mobile computing devices
JP2003536160A (en) Method and apparatus for mapping images and videos to create operable and virtual reality videos and images
JP2004282162A (en) Camera, and monitoring system
WO2017118982A1 (en) Remotely controlled communicated image resolution
CN110969060A (en) Neural network training method, neural network training device, neural network tracking method, neural network training device, visual line tracking device and electronic equipment
US20040189876A1 (en) Remote video recognition system
CN107018386A (en) A kind of video flowing multiresolution observation system
WO2002019270A1 (en) Curved image conversion method and record medium where this method for converting curved image is recorded
WO2007060497A2 (en) Interactive wide-angle video server
CN114442805A (en) Monitoring scene display method and system, electronic equipment and storage medium
CN113286138A (en) Panoramic video display method and display equipment
JP2003111041A (en) Image processor, image processing system, image processing method, storage medium and program
JP3054146B1 (en) Panoramic image device
JP2002344962A (en) Image communication equipment and portable telephone
CN110738066A (en) code scanning method, device and computer storage medium
CN115278064A (en) Panoramic image generation method and device, terminal equipment and storage medium
JP2004282163A (en) Camera, monitor image generating method, program, and monitoring system
KR100343836B1 (en) Panorama video surveillance system and controlling method therefore
JP2002152719A (en) Monitor method and monitor device utilizing curved surface image

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CN CR CU CZ DM DZ EE GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 159348

Country of ref document: IL

WWE Wipo information: entry into national phase

Ref document number: 2450875

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 10480248

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2002736070

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2002736070

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2002736070

Country of ref document: EP