WO2012078027A1 - Network and process for web-based video surveillance - Google Patents

Network and process for web-based video surveillance Download PDF

Info

Publication number
WO2012078027A1
WO2012078027A1 PCT/MY2011/000126 MY2011000126W WO2012078027A1 WO 2012078027 A1 WO2012078027 A1 WO 2012078027A1 MY 2011000126 W MY2011000126 W MY 2011000126W WO 2012078027 A1 WO2012078027 A1 WO 2012078027A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
web
data representation
data
images
Prior art date
Application number
PCT/MY2011/000126
Other languages
French (fr)
Inventor
Remon Redika
Hock Woon Hon
Rohaizan Roosley
Original Assignee
Mimos Berhad
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mimos Berhad filed Critical Mimos Berhad
Publication of WO2012078027A1 publication Critical patent/WO2012078027A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4782Web browsing, e.g. WebTV
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • GPHYSICS
    • G08SIGNALLING
    • G08BSIGNALLING OR CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
    • G08B13/00Burglar, theft or intruder alarms
    • G08B13/18Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
    • G08B13/189Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
    • G08B13/194Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
    • G08B13/196Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
    • G08B13/19654Details concerning communication with a camera
    • G08B13/19656Network used to communicate with a camera, e.g. WAN, LAN, Internet

Definitions

  • the present invention generally relates to surveillance technologies, and more particularly to a network and process for web-based video surveillance using a video image acquisition and processing system in a server and a video retrieval and display system in a browser.
  • the current web-based video surveillance networks have many deficiencies.
  • the majority of the networks rely on manual monitoring to detect suspicious event especially when the distance between the monitored site and monitoring centre is far apart.
  • the majority of IP cameras employed by video surveillance networks are not capable of processing video images, displaying the video images smoothly at web browsers, and providing accesses to multiple end users.
  • the network comprises a web server having web access and a designated IP address; a video image acquisition and processing system embedded in the web server; wherein the video image acquisition and processing system acquires and processes raw video image data; and a video image retrieval and display system embedded in a web access browser; wherein the video image retrieval and display system retrieves the processed video image data and displays the processed video image data on a browser of an end user; whereby the web-based video surveillance network processes the raw video image data on web, displays the processed video image data on browsers, and provides web accesses to multiple end users.
  • the process comprises an operation of acquisition and processing comprising capturing video images from IP cameras; processing the captured video images; translating the processed video images into data representation; compressing the data representation; and saving the compressed data representation in a temporary storage or memory of the * server; and an operation of retrieval and display comprising requesting video images from a web browser by sending a camera identifier and VA identifier; providing the IP address and port of data representation storage to the browser; downloading the raw data of data representation; uncompressing the downloaded raw data of data representation; re-packaging the uncompressed data representation to sequential video images; and displaying the sequential video images at the browser.
  • FIG 1 is an architecture diagram of one typical surveillance network that simply connects a camera IP via web to end users in the prior art.
  • FIG 2 is an architecture diagram of a prior art web-based video surveillance network without any capacity of video image processing.
  • FIG 3 is a functional block diagram of the video image acquisition and processing system embedded in the web server in accordance with one embodiment of the present invention.
  • FIG 4 is a functional block diagram of the video image retrieval and display system in accordance with one embodiment of the present invention.
  • FIG 5 shows one exemplary layout of the web-based video surveillance network in accordance with one embodiment of the present invention.
  • FIG 6 shows a functional flowchart showing the operation of the acquisition and processing of captured images from IP cameras in accordance with one embodiment of the present invention.
  • FIG 7 shows a functional flowchart showing the operation of retrieval and display of video images in accordance with one embodiment of the present invention.
  • One aspect of the present invention provides a web-based video surveillance network, where the web-based video surveillance network comprises a video image acquisition and processing system embedded in a server that processes video images on the web as if on a local machine, and a video image retrieval and display system embedded in the browsers of end users that provide multiple end users with web access to the processed video images stored in the web server and enable the browsers of end users to display the video images smoothly.
  • the web-based video surveillance network comprises a video image acquisition and processing system embedded in a server that processes video images on the web as if on a local machine, and a video image retrieval and display system embedded in the browsers of end users that provide multiple end users with web access to the processed video images stored in the web server and enable the browsers of end users to display the video images smoothly.
  • the prior art web-based video surveillance network comprises at least one IP camera for capturing video images, where the at least one IP camera is able to output the captured images via internet, and a web server with a web (i.e.. internet) interface, where the web server has a designated IP address.
  • the web server is connected via web with the cameras for capturing video images and is accessible via web to the browsers of end users.
  • the web server of the present invention is embedded with a video image acquisition and processing system and a video image retrieval and display system, where the video image acquisition and processing system can also be embedded in any server, and the video image retrieval and display system can be run at the end user's browser by plug-ins software.
  • the video image acquisition and processing system working together with the video image retrieval and display system enables efficiently streaming video images from IP cameras to the browsers of end users.
  • the web server is capable of receiving, storing and forwarding information such as email, SMS and the like over the internet.
  • the video image acquisition and processing system 10 comprises a video acquisition and decoder module 11, a video processing module 12, a video translation module 13, a video data representation storage 14, a video server manager module 15. an aggregator 16, and a remote parameter adjuster module 17.
  • the video image acquisition and processing system 10 acquires video images from image capture devices such as IP cameras and processes the acquired video images into data representation packages that are accessible via web by the browsers of end users.
  • the video acquisition and decoder module 11 is enabled to acquire raw video image data via web from various imaging sources that are web accessible.
  • the video acquisition and decoder module 11 acquires raw video image data from an analogous camera through a video image server that has an IP address and is accessible via web.
  • the video acquisition and decoder module 11 acquires raw video image data from an IP imaging device for example an IP camera.
  • the video acquisition and decoder module 11 acquires the raw data of video images simultaneously from IP cameras based on their IP configurations stored in the web server, and decodes the acquired raw video images into frames of images. The frames of images are sent to the video processing module 12 for being processed.
  • the video processing module 12 is electronically coupled with the video image acquisition and decoder module 11 for receiving and processing the frames of images.
  • the video processing module 12 comprises a video processing interface and a plurality of video processing components (VP) that is in nature of plugged-in, where the identifiers and configurations of the video processing components are stored in the video processing interface.
  • the video processing components include linear processing component and video content processing components including video analytics functionalities for example intrusion detection, loitering detection.
  • the video processing interface assigns one identifier to each batch of received frames of images and then sends the frames of images to one of the video processing components for being processed.
  • the video processing interface will trigger the aggregator 16 to send an alert signal such as SMS and email to a designated user.
  • the snapshot images of the event are saved in a predefined storage for example the video data representation storage 14.
  • the video translation module 13 is electronically coupled with the video processing module 12 for receiving the processed frames of images.
  • the video translation module 13 translates the image data format of the processed f ames of images into one of the data representation that is predefined, compresses and packages the formatted data representation into data representation packages, and assigns an identifier to each one of the data representation packages.
  • the video translation module 13 offers the flexibility of generating the same video content with different video processing capability and viewing the video content separately at the browser of an end user. For example, different output data representation packages can be generated from the same camera using different video processing.
  • the configuration information of the data representation packages is stored by default in the video data representation storage relating to buffer sizes and video lengths.
  • data representation can contain one or more image binary codes (i.e., 25 images).
  • data representation contains meta-data of video data (i.e., speed of frame rate, bounding box, event identification etc). This data representation is saved in temporary storage for example memory or hard drive.
  • the data representation also has an identifier to identify the camera source and video processing source so that it can be easily sent to multiple users for displaying at the browser end.
  • the video data representation storage 14 is electronically coupled with the video translation module 13 for receiving the compressed data representation packages.
  • the video data representation storage 14 stores the compressed data representation packages with their uniquely assigned identifiers temporarily in a local server storage or memory storage.
  • the video server manager module 15 is electronically coupled with the video data representation storage 14. and is accessible via web by the browsers of end users.
  • the video server manager module 15 manages the IDs of data representation packages to be fetched from the video data representation storage 14 by a browser of an end user, responds to an end user's request and transmits the requested data representation packages to the end user.
  • the present invention utilizes TCP protocol for the transmission of data representation packages to end users by a predefined and enabled listening port of TCP protocol at the server. These protocols are verified by the video server manager module 40.
  • the video server manager module 40 When an end user at the browser starts to request of data representation packages, the video server manager module 40 first searches the database and then informs the user about the availability of data representation that has been requested, where the browser of the end user must be installed the plug-ins software to capable browser to run the video retrieval and display system 20 as described in detail hereinbelow. If the requested data representation package is not available, the transmission process stops. If the requested data representation package is available, the video server manager module 40 sends to the viewer the name and size of the requested data representation package to the video retrieval and display system 20.
  • the data representation packages comprise at least image binary code.
  • the contents of the data representation packages can be extended or improved depending on information needed. For example, when an event is detected, the information of frame rate, bounding box, and alarm can be included in the data representation packages. After the viewer receives the data representation packages completely, the image binary code will be converted back to video images at the end user's side. The images will be updated as fast as the frame rate defined in the data representation packages.
  • the aggregator 16 is electronically coupled with the video processing module 12. The aggregator module 16 outputs an alert signal to the designated end user either by email or SMS when a suspicious event such as intrusion is detected.
  • the remote parameter adjuster module 17 is web accessible, and electronically coupled with the video processing components of the video processing module 12.
  • the Remote parameter adjuster module 17 allows an end user to adjust the parameters of the video processing components in the video processing module according to the requirements of the end user.
  • the video image retrieval and display system 20 comprises a data representation receiver module 21 and an image sequence refresher module 22.
  • the video image retrieval and display system 20 is capable of receiving data representation packages from the video data representation storage and efficiently "displaying smooth videos at the browser of the end user.
  • the data representation receiver module 21 is web-access enabled.
  • the video server manager module 15 will inform the end user of the availability of the requested data representation packages; if the requested information is available, the information such as size and metadata will be sent. Then, the data representation receiver module 21 prepares the necessary buffer size in the client browser. Transmissions data will be run simultaneously and always transfer the latest data representation packages. When the first data representation package is received completely (buffer size overload at the end user's side), the end user can start to play video image and so on.
  • the image sequence refresher module 22 is electronically coupled with the data representation receiver module 21 for receiving the data representation packages for displaying.
  • the image sequence refresher module 22 ensures smooth display of sequential images from the data representation packages.
  • the images will be updated by the image sequence refresher module 22 as fast as the frame rate defined in the data representation packages.
  • Another aspect of the present invention provides a process of web-based video surveillance.
  • the process comprises two separate but integral operations, one for acquiring and processing of video images in a server and the other for retrieving and displaying video images in the browser of an end user.
  • the process is able to efficiently process video images, display the video images smoothly and provide web accesses to multiple end users at the same time.
  • the operation of acquisition and processing 100 comprises capturing video images from IP cameras 110, processing the captured video images 120, translating the processed video images into data representation 130, compressing the data representation 140, and saving the compressed data representation in a temporary storage or memory of the server 150.
  • the operation of acquiring and procession in the web server comprises reading the raw image data that have been captured by an acquisition decoder module; sending the raw image data to a specific video processing component to process the raw image data; translating the processed image data to binary code and packaging them in data representation format into data representation packages, where the bitmap data are converted to data representation with any meta-data information; defining the frame rate per seconds for each of data representation packages; compressing data representation package; and storing data representation packages temporarily in a data representation storage of the server.
  • the operation of retrieval and display 200 comprises requesting video images from a web browser by sending a camera identifier and VA identifier 210, providing the IP address and port of data representation storage to the browser 220, downloading the raw data of data representation 230, uncompressing the downloaded raw data of data representation 240, re-packaging the uncompressed data representation to sequential video images 250, and displaying the sequential video images at the browser 260.
  • the operation of retrieving and displaying in the browser of an end user comprises connecting to the web server; sending a request with camera identifier and video processing identifier to the web server in HTTP protocol: receiving the requested data representation packages from the web server through TCP protocol specified by a video server manager module; buffering the received data representation packages at the client browser; uncompressing the data representation packages; re-packaging the uncompressed data representation packages into a sequence of images; and showing the image on the display until the end of image is reached at a predetermined refresh rate for one data representation.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present invention provides a web-based video surveillance network with video streaming capability, where the network comprises a web server having web access and a designated IP address; a video image acquisition and processing system embedded in the web server; wherein the video image acquisition and processing system acquires and processes raw video image data; and a video image retrieval and display system embedded in a web server; wherein the video image retrieval and display system retrieves the processed video image data and displays the processed video image data on a browser of an end user; whereby the web-based video surveillance network processes the raw video image data on web, displays the processed video image data on browsers, and provides web accesses to multiple end users. The present invention also provides a process of web-based video surveillance.

Description

NETWORK AND PROCESS FOR WEB-BASED VIDEO SURVEILLANCE
Field of the Invention [0001] The present invention generally relates to surveillance technologies, and more particularly to a network and process for web-based video surveillance using a video image acquisition and processing system in a server and a video retrieval and display system in a browser. Background of the Invention
[0002] The need for intelligent surveillance at home is growing, and most of the intelligent surveillances are done locally using DVR for recording. Many in the industry have attempted to create a web-based intelligent surveillance network for offering web- based surveillance to end users or home users (hereinafter, "end users"). One typical web- based surveillance network simply directs a camera IP to a user program at the user end as shown in FIG 1, where the end users via the web are connected with an IP camera for directly retrieving video images from the IP camera.
[0003] However, the current web-based video surveillance networks have many deficiencies. For example, the majority of the networks rely on manual monitoring to detect suspicious event especially when the distance between the monitored site and monitoring centre is far apart. In addition, the majority of IP cameras employed by video surveillance networks are not capable of processing video images, displaying the video images smoothly at web browsers, and providing accesses to multiple end users.
Summary of the Invention
[0004] One aspect of the present invention provides a web-based video surveillance network with video streaming capability. In one embodiment, the network comprises a web server having web access and a designated IP address; a video image acquisition and processing system embedded in the web server; wherein the video image acquisition and processing system acquires and processes raw video image data; and a video image retrieval and display system embedded in a web access browser; wherein the video image retrieval and display system retrieves the processed video image data and displays the processed video image data on a browser of an end user; whereby the web-based video surveillance network processes the raw video image data on web, displays the processed video image data on browsers, and provides web accesses to multiple end users.
[0005] Another aspect of the present invention provides a process of web-based video surveillance. In one embodiment, the process comprises an operation of acquisition and processing comprising capturing video images from IP cameras; processing the captured video images; translating the processed video images into data representation; compressing the data representation; and saving the compressed data representation in a temporary storage or memory of the* server; and an operation of retrieval and display comprising requesting video images from a web browser by sending a camera identifier and VA identifier; providing the IP address and port of data representation storage to the browser; downloading the raw data of data representation; uncompressing the downloaded raw data of data representation; re-packaging the uncompressed data representation to sequential video images; and displaying the sequential video images at the browser.
[0006] The objectives and advantages of the invention will become apparent from the following detailed description of preferred embodiments thereof in connection with the accompanying drawings.
Brief Description of the Drawings
[0007] Preferred embodiments according to the present invention will now be described with reference to the Figures, in which like reference numerals denote like elements.
[0008] FIG 1 is an architecture diagram of one typical surveillance network that simply connects a camera IP via web to end users in the prior art.
[0009] FIG 2 is an architecture diagram of a prior art web-based video surveillance network without any capacity of video image processing.
[0010] FIG 3 is a functional block diagram of the video image acquisition and processing system embedded in the web server in accordance with one embodiment of the present invention. [0011] FIG 4 is a functional block diagram of the video image retrieval and display system in accordance with one embodiment of the present invention.
[0012] FIG 5 shows one exemplary layout of the web-based video surveillance network in accordance with one embodiment of the present invention.
[0013] FIG 6 shows a functional flowchart showing the operation of the acquisition and processing of captured images from IP cameras in accordance with one embodiment of the present invention.
[0014] FIG 7 shows a functional flowchart showing the operation of retrieval and display of video images in accordance with one embodiment of the present invention.
Detailed Description of the Invention
[0015] The present invention may be understood more readily by reference to the following detailed description of certain embodiments of the invention.
[0016] Throughout this application, where publications are referenced, the disclosures of these publications are hereby incorporated by reference, in their entireties, into this application in order to more fully describe the state of art to which this invention pertains.
[0017] One aspect of the present invention provides a web-based video surveillance network, where the web-based video surveillance network comprises a video image acquisition and processing system embedded in a server that processes video images on the web as if on a local machine, and a video image retrieval and display system embedded in the browsers of end users that provide multiple end users with web access to the processed video images stored in the web server and enable the browsers of end users to display the video images smoothly.
[0018] Now referring to FIG 2, there is provided an architecture diagram of a prior art web-based video surveillance network without any capability of video image processing, where the end users request video images from the camera without any processing. As shown in FIG 2. the prior art web-based video surveillance network comprises at least one IP camera for capturing video images, where the at least one IP camera is able to output the captured images via internet, and a web server with a web (i.e.. internet) interface, where the web server has a designated IP address. The web server is connected via web with the cameras for capturing video images and is accessible via web to the browsers of end users. In contrast, as described in detail hereinbelow, the web server of the present invention is embedded with a video image acquisition and processing system and a video image retrieval and display system, where the video image acquisition and processing system can also be embedded in any server, and the video image retrieval and display system can be run at the end user's browser by plug-ins software. The video image acquisition and processing system working together with the video image retrieval and display system enables efficiently streaming video images from IP cameras to the browsers of end users. In addition, the web server is capable of receiving, storing and forwarding information such as email, SMS and the like over the internet.
[0019] Now referring to FIG 3, there is provided a functional block diagram of the video image acquisition and processing system in accordance with one embodiment of the present invention. As shown in FIG 3, the video image acquisition and processing system 10 comprises a video acquisition and decoder module 11, a video processing module 12, a video translation module 13, a video data representation storage 14, a video server manager module 15. an aggregator 16, and a remote parameter adjuster module 17. The video image acquisition and processing system 10 acquires video images from image capture devices such as IP cameras and processes the acquired video images into data representation packages that are accessible via web by the browsers of end users.
[0020] The video acquisition and decoder module 11 is enabled to acquire raw video image data via web from various imaging sources that are web accessible. In one embodiment, the video acquisition and decoder module 11 acquires raw video image data from an analogous camera through a video image server that has an IP address and is accessible via web. In another embodiment, the video acquisition and decoder module 11 acquires raw video image data from an IP imaging device for example an IP camera. The video acquisition and decoder module 11 acquires the raw data of video images simultaneously from IP cameras based on their IP configurations stored in the web server, and decodes the acquired raw video images into frames of images. The frames of images are sent to the video processing module 12 for being processed.
[0021] The video processing module 12 is electronically coupled with the video image acquisition and decoder module 11 for receiving and processing the frames of images. The video processing module 12 comprises a video processing interface and a plurality of video processing components (VP) that is in nature of plugged-in, where the identifiers and configurations of the video processing components are stored in the video processing interface. The video processing components include linear processing component and video content processing components including video analytics functionalities for example intrusion detection, loitering detection. The video processing interface assigns one identifier to each batch of received frames of images and then sends the frames of images to one of the video processing components for being processed. If a suspicious event such as intrusion and loitering is detected, the video processing interface will trigger the aggregator 16 to send an alert signal such as SMS and email to a designated user. At the same time, the snapshot images of the event are saved in a predefined storage for example the video data representation storage 14.
[0022] The video translation module 13 is electronically coupled with the video processing module 12 for receiving the processed frames of images. The video translation module 13 translates the image data format of the processed f ames of images into one of the data representation that is predefined, compresses and packages the formatted data representation into data representation packages, and assigns an identifier to each one of the data representation packages. The video translation module 13 offers the flexibility of generating the same video content with different video processing capability and viewing the video content separately at the browser of an end user. For example, different output data representation packages can be generated from the same camera using different video processing.
[0023] To translate the processed images into data representation packages, the configuration information of the data representation packages is stored by default in the video data representation storage relating to buffer sizes and video lengths. For example. data representation can contain one or more image binary codes (i.e., 25 images). In addition, data representation contains meta-data of video data (i.e., speed of frame rate, bounding box, event identification etc). This data representation is saved in temporary storage for example memory or hard drive. The data representation also has an identifier to identify the camera source and video processing source so that it can be easily sent to multiple users for displaying at the browser end.
[0024] The video data representation storage 14 is electronically coupled with the video translation module 13 for receiving the compressed data representation packages. The video data representation storage 14 stores the compressed data representation packages with their uniquely assigned identifiers temporarily in a local server storage or memory storage.
[0025] The video server manager module 15 is electronically coupled with the video data representation storage 14. and is accessible via web by the browsers of end users. The video server manager module 15 manages the IDs of data representation packages to be fetched from the video data representation storage 14 by a browser of an end user, responds to an end user's request and transmits the requested data representation packages to the end user.
[0026] Even though the request of viewing of content of data representation packages is through web browser, but the transmission of data representation packages is not through standard application layer protocols likes (HTTP, FTP, etc) but using programmable application layer protocols which is TCP as a transport layer protocol. Thus, the present invention utilizes TCP protocol for the transmission of data representation packages to end users by a predefined and enabled listening port of TCP protocol at the server. These protocols are verified by the video server manager module 40. When an end user at the browser starts to request of data representation packages, the video server manager module 40 first searches the database and then informs the user about the availability of data representation that has been requested, where the browser of the end user must be installed the plug-ins software to capable browser to run the video retrieval and display system 20 as described in detail hereinbelow. If the requested data representation package is not available, the transmission process stops. If the requested data representation package is available, the video server manager module 40 sends to the viewer the name and size of the requested data representation package to the video retrieval and display system 20.
[0027] The data representation packages comprise at least image binary code. The contents of the data representation packages can be extended or improved depending on information needed. For example, when an event is detected, the information of frame rate, bounding box, and alarm can be included in the data representation packages. After the viewer receives the data representation packages completely, the image binary code will be converted back to video images at the end user's side. The images will be updated as fast as the frame rate defined in the data representation packages. [0028] The aggregator 16 is electronically coupled with the video processing module 12. The aggregator module 16 outputs an alert signal to the designated end user either by email or SMS when a suspicious event such as intrusion is detected.
[0029] The remote parameter adjuster module 17 is web accessible, and electronically coupled with the video processing components of the video processing module 12. The Remote parameter adjuster module 17 allows an end user to adjust the parameters of the video processing components in the video processing module according to the requirements of the end user.
[0030] Now referring to FIG 4, there is provided a functional block diagram of the video image retrieval and display system in accordance with one embodiment of the present invention. As shown in FIG 4, the video image retrieval and display system 20 comprises a data representation receiver module 21 and an image sequence refresher module 22. When an end user browses the web interface, the video image retrieval and display system 20 is capable of receiving data representation packages from the video data representation storage and efficiently "displaying smooth videos at the browser of the end user.
[0031] The data representation receiver module 21 is web-access enabled. When an end user through a browser sends a request to the video image acquisition and processing system 10 using the identifiers of IP cameras, the video server manager module 15 will inform the end user of the availability of the requested data representation packages; if the requested information is available, the information such as size and metadata will be sent. Then, the data representation receiver module 21 prepares the necessary buffer size in the client browser. Transmissions data will be run simultaneously and always transfer the latest data representation packages. When the first data representation package is received completely (buffer size overload at the end user's side), the end user can start to play video image and so on.
[0032] The image sequence refresher module 22 is electronically coupled with the data representation receiver module 21 for receiving the data representation packages for displaying. The image sequence refresher module 22 ensures smooth display of sequential images from the data representation packages. The images will be updated by the image sequence refresher module 22 as fast as the frame rate defined in the data representation packages. [0033] Now referring to FIG 5, there is provided one exemplary layout of the web- based video surveillance network in accordance with one embodiment of the present invention. The details of relevant components in the network have been provided above.
[0034] Another aspect of the present invention provides a process of web-based video surveillance. The process comprises two separate but integral operations, one for acquiring and processing of video images in a server and the other for retrieving and displaying video images in the browser of an end user. The process is able to efficiently process video images, display the video images smoothly and provide web accesses to multiple end users at the same time.
[0035] Now referring to FIG 6, there is provided a functional flowchart showing the operation of the acquisition and processing of captured images from IP cameras in accordance with one embodiment of the present invention. As shown in FIG 6. the operation of acquisition and processing 100 comprises capturing video images from IP cameras 110, processing the captured video images 120, translating the processed video images into data representation 130, compressing the data representation 140, and saving the compressed data representation in a temporary storage or memory of the server 150.
[0036] In one embodiment, the operation of acquiring and procession in the web server comprises reading the raw image data that have been captured by an acquisition decoder module; sending the raw image data to a specific video processing component to process the raw image data; translating the processed image data to binary code and packaging them in data representation format into data representation packages, where the bitmap data are converted to data representation with any meta-data information; defining the frame rate per seconds for each of data representation packages; compressing data representation package; and storing data representation packages temporarily in a data representation storage of the server.
[0037] Now referring to FIG 7, there is provided a functional flowchart showing the operation of retrieval and display of video images in accordance with one embodiment of the present invention. As shown in FIG 7, the operation of retrieval and display 200 comprises requesting video images from a web browser by sending a camera identifier and VA identifier 210, providing the IP address and port of data representation storage to the browser 220, downloading the raw data of data representation 230, uncompressing the downloaded raw data of data representation 240, re-packaging the uncompressed data representation to sequential video images 250, and displaying the sequential video images at the browser 260.
[0038] In one embodiment, the operation of retrieving and displaying in the browser of an end user comprises connecting to the web server; sending a request with camera identifier and video processing identifier to the web server in HTTP protocol: receiving the requested data representation packages from the web server through TCP protocol specified by a video server manager module; buffering the received data representation packages at the client browser; uncompressing the data representation packages; re-packaging the uncompressed data representation packages into a sequence of images; and showing the image on the display until the end of image is reached at a predetermined refresh rate for one data representation.
[0039] While the present invention has been described with reference to particular embodiments, it will be understood that the embodiments are illustrative and that the invention scope is not so limited. Alternative embodiments of the present invention will become apparent to those having ordinary skill in the art to which the present invention pertains. Such alternate embodiments are considered to be encompassed within the scope of the present invention. Accordingly, the scope of the present invention is defined by the appended claims and is supported by the foregoing description.

Claims

What is claimed is: 1. A web-based video surveillance network with video streaming capability, comprising:
a web server having web access and a designated IP address;
a video image acquisition and processing system embedded in the web server; wherein the video image acquisition and processing system acquires and processes raw video image data; and
a video image retrieval and display system embedded in a web server and being enabled to run on a browser of an end user by plug-ins software in the browser; wherein the video image retrieval and display system retrieves the processed video image data and displays the processed video image data on the browser of the end user;
whereby the web-based video surveillance network processes the raw video image data on web, displays the processed video image data on browsers, and provides web accesses to multiple end users.
2. The network of claim 1. further comprises at least one IP camera for capturing the raw video image data, where the at least one IP camera is able to output the captured image data via web to the web server.
3. The network of claim of claim 1, wherein the video image acquisition and processing system comprises:
a video acquisition and decoder module acquiring raw video image data via web from various imaging sources that are web accessible and decoding the acquired raw video image data into frames of images;
a video processing module electronically coupled with the video image acquisition and decoder module for receiving and processing the frames of images;
a video translation module electronically coupled with the video processing module for receiving the processed frames of images, wherein the frames of images are converted into data presentation that is compressed and packaged into data representation packages; a video data representation storage electronically coupled with the video translation module for receiving the compressed data representation packages and storing the compressed data representation packages with their uniquely assigned identifiers temporarily;
a video server manager module electronically coupled with the video data representation storage, and being accessible via web by the browser of the end users; wherein the video server manager module manages the IDs of data representation packages to be fetched from the video data representation storage by the browser of the end user, responds to the end user's request and transmits the requested data representation packages to the end user;
an aggregator electronically coupled with the video processing module, outputting an alert signal to the designated end user when a suspicious event is detected; and
a remote parameter adjuster module being web accessible and electronically coupled with the video processing components of the video processing module, wherein the remote parameter adjuster module allows the end user to adjust the parameters of the video processing components in the video processing module according to the requirements of the end user
4. The network of claim 3, wherein the imaging source is an analogous camera through a video image server that has an IP address and is accessible via web.
5. The network of claim 3. wherein the imaging source is an EP imaging device.
6. The network of claim 3, wherein the video processing module comprises a video processing interface and a plurality of video processing components (VP) that are in nature of plugged-in and assigned with identifiers by the video processing interface; wherein the identifiers and configurations of the video processing components are stored in the video processing interface; and wherein the video processing interface assigns one identifier to each batch of received frames of images and then sends the frames of images to one of the video processing components for being processed.
7. The network of claim 6. wherein the video processing components include linear processing component and video content processing components for intrusion detection and loitering detection.
8. The network of claim 3, wherein the data representation package contain one or more image binary codes (i.e., 25 images), and meta-data including speed of frame rate, bounding box, and event identification.
9. The network of claim 3, wherein the transmission of data representation packages is through programmable application layer protocols which is TCP as transport protocol.
10. The network of claim 3, wherein the video image retrieval and display system comprises:
a data representation receiver module being web-access enabled; whereby when the end user through a browser sends a request to the video image acquisition and processing system using the identifiers of IP cameras, the video server manager module informs the end user of the availability of the requested data representation packages; if the requested information is available, the information such as size and meta-data is sent; then, the data representation receiver module prepares the necessary buffer size in the browser; and when the first data representation package is received completely, the end user can start to play video images; and
an image sequence refresher module electronically coupled with the data representation receiver module for receiving the data representation packages for displaying.
11. A process of web-based video surveillance, said process comprising:
an operation of acquisition and processing comprising:
capturing video images from IP cameras:
processing the captured video images;
translating the processed video images into data representation; compressing the data re resentation; and saving the compressed data representation in a temporary storage or memory of the server; and
an operation of retrieval and display comprising:
requesting video images ftom a web browser by sending a camera identifier and VA identifier;
providing the IP address and port of data representation storage to the browser;
downloading the raw data of data representation;
uncompressing the downloaded raw data of data representation;
re-packaging the uncdmpressed data representation to sequential video images; and
displaying the sequential video images at the browser.
PCT/MY2011/000126 2010-12-10 2011-06-22 Network and process for web-based video surveillance WO2012078027A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
MYPI2010005906 2010-12-10
MYPI2010005906A MY159120A (en) 2010-12-10 2010-12-10 Network and process for web-based video surveillance

Publications (1)

Publication Number Publication Date
WO2012078027A1 true WO2012078027A1 (en) 2012-06-14

Family

ID=46207365

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/MY2011/000126 WO2012078027A1 (en) 2010-12-10 2011-06-22 Network and process for web-based video surveillance

Country Status (2)

Country Link
MY (1) MY159120A (en)
WO (1) WO2012078027A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014086223A1 (en) * 2012-12-06 2014-06-12 小米科技有限责任公司 Video communication method and apparatus

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2389937A (en) * 2002-06-13 2003-12-24 Silent Witness Entpr Ltd Internet video surveillance camera system which utilises a server to carry out user access requests
WO2006006081A2 (en) * 2004-07-09 2006-01-19 Emitall Surveillance S.A. Smart video surveillance system ensuring privacy
US20070070185A1 (en) * 2005-09-27 2007-03-29 Dy Lady C System and method for remote display of security video images
US20080106597A1 (en) * 1999-10-12 2008-05-08 Vigilos, Inc. System and method for storing and remotely retrieving surveillance video images
US7834904B2 (en) * 2003-10-22 2010-11-16 Sam Systems, Inc. Video surveillance system
GB2470822A (en) * 2009-06-05 2010-12-08 Anthony Clive Morgan Security system with CCTV internet access for the general public to report crime

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080106597A1 (en) * 1999-10-12 2008-05-08 Vigilos, Inc. System and method for storing and remotely retrieving surveillance video images
GB2389937A (en) * 2002-06-13 2003-12-24 Silent Witness Entpr Ltd Internet video surveillance camera system which utilises a server to carry out user access requests
US7834904B2 (en) * 2003-10-22 2010-11-16 Sam Systems, Inc. Video surveillance system
WO2006006081A2 (en) * 2004-07-09 2006-01-19 Emitall Surveillance S.A. Smart video surveillance system ensuring privacy
US20070070185A1 (en) * 2005-09-27 2007-03-29 Dy Lady C System and method for remote display of security video images
GB2470822A (en) * 2009-06-05 2010-12-08 Anthony Clive Morgan Security system with CCTV internet access for the general public to report crime

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"H264 WebCam 3.2", H264SOFT.COM, 1 March 2009 (2009-03-01), Retrieved from the Internet <URL:http://web.archive.org/web/20090130065713/http://h264soft.com/index.html> [retrieved on 20111014] *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014086223A1 (en) * 2012-12-06 2014-06-12 小米科技有限责任公司 Video communication method and apparatus
US9591256B2 (en) 2012-12-06 2017-03-07 Xiaomi Inc. Methods and devices for video communication

Also Published As

Publication number Publication date
MY159120A (en) 2016-12-15

Similar Documents

Publication Publication Date Title
US10021318B2 (en) Method and apparatus in a motion video capturing system
US10972519B2 (en) Real-time video streaming to client video element
CA2638165C (en) Proxy video server for video surveillance
US20110311095A1 (en) Content fingerprinting
US20130307990A1 (en) System and method for real-time video processing for alarm monitoring systems
US20110050901A1 (en) Transmission apparatus and processing apparatus
US11496779B2 (en) Gateway for networked video management system
CN108965883A (en) The system and method that video content is encoded using virtual intra frame
US10356302B2 (en) Transmission apparatus, reception apparatus, transmission and reception system, transmission apparatus control method, reception apparatus control method, transmission and reception system control method, and program
WO2014137241A1 (en) Method and system for prompt video-data message transfer to personal devices
CN102204248B (en) Video data processing method, video image displaying method and device thereof
US20080080776A1 (en) Multi-media apparatus with jpeg 2000 compression and autofocus
US20230362418A1 (en) System and method for converting adaptive stream to downloadable media
CN106791703B (en) The method and system of scene is monitored based on panoramic view
US7834904B2 (en) Video surveillance system
GB2426652A (en) Transmission of video frames having given characteristics
US8456532B1 (en) Internet protocol camera transcode avoidance
CN112822435A (en) Security method, device and system allowing user to easily access
JP3933589B2 (en) Video conversion device and monitoring system
US11272243B2 (en) Cloud recording system, cloud recording server and cloud recording method
JP2000032437A (en) Image transmission system
WO2012078027A1 (en) Network and process for web-based video surveillance
JP2020072461A (en) Transmission device, server device, transmission method, and program
CN1701624A (en) Remote control apparatus, remote control method and remotely controlled apparatus
US20110161515A1 (en) Multimedia stream recording method and program product and device for implementing the same

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11847088

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11847088

Country of ref document: EP

Kind code of ref document: A1