WO2021171295A1 - Identity-concealing motion detection and portraying device - Google Patents
Identity-concealing motion detection and portraying device Download PDFInfo
- Publication number
- WO2021171295A1 WO2021171295A1 PCT/IL2021/050214 IL2021050214W WO2021171295A1 WO 2021171295 A1 WO2021171295 A1 WO 2021171295A1 IL 2021050214 W IL2021050214 W IL 2021050214W WO 2021171295 A1 WO2021171295 A1 WO 2021171295A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- frames
- diff
- section
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING SYSTEMS, e.g. PERSONAL CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B13/00—Burglar, theft or intruder alarms
- G08B13/18—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength
- G08B13/189—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems
- G08B13/194—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems
- G08B13/196—Actuation by interference with heat, light, or radiation of shorter wavelength; Actuation by intruding sources of heat, light, or radiation of shorter wavelength using passive radiation detection systems using image scanning and comparing systems using television cameras
- G08B13/19678—User interface
- G08B13/19686—Interfaces masking personal details for privacy, e.g. blurring faces, vehicle license plates
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/23—Recognition of whole body movements, e.g. for sport training
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/765—Interface circuits between an apparatus for recording and another apparatus
- H04N5/77—Interface circuits between an apparatus for recording and another apparatus between a recording apparatus and a television camera
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
- H04N7/183—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a single remote source
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20172—Image enhancement details
- G06T2207/20192—Edge enhancement; Edge preservation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
- G06T2207/30201—Face
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30232—Surveillance
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING SYSTEMS, e.g. PERSONAL CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B17/00—Fire alarms; Alarms responsive to explosion
- G08B17/12—Actuation by presence of radiation or particles, e.g. of infrared radiation or of ions
- G08B17/125—Actuation by presence of radiation or particles, e.g. of infrared radiation or of ions by using a video camera to detect fire or smoke
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING SYSTEMS, e.g. PERSONAL CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B21/00—Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
- G08B21/02—Alarms for ensuring the safety of persons
- G08B21/04—Alarms for ensuring the safety of persons responsive to non-activity, e.g. of elderly persons
- G08B21/0407—Alarms for ensuring the safety of persons responsive to non-activity, e.g. of elderly persons based on behaviour analysis
- G08B21/043—Alarms for ensuring the safety of persons responsive to non-activity, e.g. of elderly persons based on behaviour analysis detecting an emergency event, e.g. a fall
-
- G—PHYSICS
- G08—SIGNALLING
- G08B—SIGNALLING SYSTEMS, e.g. PERSONAL CALLING SYSTEMS; ORDER TELEGRAPHS; ALARM SYSTEMS
- G08B21/00—Alarms responsive to a single specified undesired or abnormal condition and not otherwise provided for
- G08B21/02—Alarms for ensuring the safety of persons
- G08B21/04—Alarms for ensuring the safety of persons responsive to non-activity, e.g. of elderly persons
- G08B21/0438—Sensor means for detecting
- G08B21/0476—Cameras to detect unsafe condition, e.g. video cameras
Definitions
- the invention is in the field of video analysis for observation and surveillance, and in particular relates to a device that detects and portrays motion captured in video image frames while concealing the identities of subjects in the images.
- Smart motion detectors employing video cameras, are used in public locations and in private residences to alert of illegal intrusion, the presence of unauthorized people, and hazards.
- the detectors include motion analysis and object classification, using morphology and other known technologies.
- US patent 5,969,755 discloses a method to provide automatic content-based video indexing from object motion.
- Moving objects in video from a surveillance camera are detected in a video sequence using motion segmentation methods.
- Objects are tracked through segmented data.
- a symbolic representation of the video is generated in the form of annotated graphics describing the objects and their movement.
- a motion analyzer analyzes results of object tracking and annotates the graph motion with indices describing several events.
- the graph is then indexed using a rule-based classification scheme to identify events of interest such as appearance/disappearance, deposit/removal, entrance/exit, and motion/rest of objects.
- Clips of the video identified by spatio-temporal, event, and object-based queries are recalled to view the desired video.
- US patent 6,049,363 discloses object detection for scene change analysis, performed by a statistical test applied to data extracted from two images taken from the same scene from identical viewpoints. It is assumed that a single change region corresponding to an object that is present in one image but absence in the other is given.
- the test In the case of TV data, the test consists of measuring the coincidence of edge pixels in each image with the boundary of the change region.
- the tests In the case of IR data, the tests consist of measuring the pixel intensity variance within the change region in each image.
- So-called “stupid” motion detectors such as those employing passive infrared (PIR) sensors, do not disclose the identity of detected subjects (persons and objects). They are therefore allowed to be used almost everywhere.
- PIR passive infrared
- existing “smart” motion detection and alerting devices are based on cameras, which present legal or regulatory conflicts in many countries, as they violate the privacy of the photographed subjects.
- Smart motion detectors are focused on the analyses of the motion in the video frames, its detection and the classifying of the detected objects. They provide automatic and fast alert (by humans or machines). However, existing smart detectors are not bothered by issues of privacy or with the limitations where the disclosing of the pictured location is prohibited and/or unwanted.
- An effective alert is an alert which has zero false alarms and zero misses of real alarms.
- An effective alert is an alert which has zero false alarms and zero misses of real alarms.
- the best possible alert is probably one that transfers in real time the picture of the alert- causing event, it seems like there is no way of having an optimal alerting device without violating the privacy of the pictured subjects and the location itself (the detailed picture of which most people would not be happy to share).
- the current invention relates to an identity-concealing motion detection and portraying device.
- the device prevents any leaking of detailed images or video — thereby preventing privacy violations — by discarding imagery data, which is acquired by the device during the processing of the data, while saving and transmitting only the portrayal of the motion.
- an identity-concealing motion detection and portraying device does not enable the pass-through of any imagery, such as video information.
- the imagery is used for the processing of the motion detection and then discarded by deletion or erasure from the device’s memory.
- the imagery cannot be accessed for viewing or transmission. Only the processed data of the motion of the moving objects, which are monitored by the field-of-view of the device, is stored and can be shared by the device.
- the invention therefore provides an identity-concealing motion detecting and portraying device, for privacy-preserving monitoring and/or surveillance by concealing the identity of detected moving subjects and their observed location and denying access to original video frames;
- the device comprises: a. a video camera; b. a volatile memory, stored thereon a video buffer, the video camera configured to store a stream of video frames in the video buffer; c. a processor configured, for each pair of successive video frames in the video buffer, to i. compute the diff frames of the pair; ii. erase the first video frame of the pair from the volatile memory; iii. output the diff frames as portrayed motion video.
- the invention further provides the above device, wherein the processor is further configured to smooth edges of the portrayed motion in the diff frames, present a symbolic graphic illustration of a moving subject, or a combination thereof.
- the invention further provides any one of the above devices, further comprising a wireless communication module (WiFi or cellular 3G/4G/5G etc.), configured to transmit any combination of a. real-time alerts of detected moving object or objects; b. the diff frames; and c. symbolic graphic illustrations of moving subjects.
- a wireless communication module WiFi or cellular 3G/4G/5G etc.
- the invention further provides any one of the above devices, configured for setup enabling aiming the device to a desired field-of-view without revealing the location's actual image during the setup.
- the invention further provides any one of the above devices, further configured to dynamically vary the frame rate of the analyzed video frames by constantly comparing motion estimation of the same video sequence applied simultaneously to pairs of frames spanning short and long time intervals, and adjusting the frame rate accordingly when comparably fast or slow motions are detected.
- the invention further provides any one of the above devices, further configured to applying pixel acceleration motion detection, wherein each pixel value is replaced by its appropriate acceleration measure as estimated by taking the second derivative of the interpolation curve obtained from previous N frames.
- the invention further provides any one of the above devices, wherein the video camera is separate from the rest of the device, and connected via any wired or wireless communication such as USB or MIPI.
- the invention further provides any one of the above devices, further comprising a video analytics module configured to detect events computed from the diff frames, the video frames, or a combination thereof.
- the invention further provides any one of the above devices, wherein the events comprise presence of an intruder, a fire alert, a facial recognition, a fall, a violent activity, or any combination thereof.
- the invention further provides any one of the previous two devices, further configured to send alerts of the events to external devices.
- the invention further provides the previous device, wherein the communication path said between said analytics module(s) and said alerting module includes a unidirectional waterfall data link.
- the invention further provides any one of the previous four devices, further comprising an audio enhancement device, comprising a. a microphone; b. an audio buffer configured to store an audio signal collected by the microphone; c. an audio stamp database, storing audio stamps of event sounds; d. an audio analytics module configured to identify an audio event stored in the audio buffer by comparison with the audio stamps.
- an audio enhancement device comprising a. a microphone; b. an audio buffer configured to store an audio signal collected by the microphone; c. an audio stamp database, storing audio stamps of event sounds; d. an audio analytics module configured to identify an audio event stored in the audio buffer by comparison with the audio stamps.
- the invention further provides the previous device, wherein results of the audio analytics module is correlated with results of the video analytics module.
- the invention further provides an identity-concealing motion detecting and portraying device, for privacy-preserving monitoring and/or surveillance by concealing the identity of detected moving subjects and their observed location and denying access to original video frames, the device comprising a. a video camera configured to collect video frame images of an area; b. a first section comprising i. a first video buffer, the video camera configured to store a stream of the video frames in the video buffer; ii. a processor configured, for each pair of successive video frames in the video buffer, to a) compute diff frames of the pair; and b) output the diff frames as portrayed motion video; c. a second section comprising i.
- a second video buffer configured to store a stream of the diff frames; ii. a second processor configured to transfer the diff frames to a video encoder, the video encoder configured to encrypt the diff stream and output to an external network; wherein a unidirectional waterfall link carries the diff frames from the first section to the second section.
- the invention further provides the previous device, wherein the waterfall link comprises one or more of a. a transmitter of the first section and a receiver of the second section; b. a unidirectional serial connection; c. a unidirectional optical fiber; and d. an analog video link.
- the invention further provides any of the abovementioned devices with a waterfall link, wherein the first processor is further configured to erase the first the video frame of the pair from the first video buffer.
- the invention further provides any of the abovementioned devices with a waterfall link, wherein a. the first section further comprises a first analytics module, configured to detect events computed from the video frames; and/or b. the second section further comprises a second analytics module, configured to detect events computed from the diff frames; and c. the second section further comprises an alerts module, configured to send alerts of the events to external devices.
- the invention further provides the previous device, wherein the communication path between the analytics module(s) and the alerting module includes one or more additional unidirectional waterfall data links.
- the invention further provides any of the abovementioned devices with a waterfall link, wherein the first section receives software updates via a non-wireless connection.
- the invention further provides an identity-concealing motion detecting and portraying method, for privacy-preserving monitoring and/or surveillance by concealing the identity of detected moving subjects and their observed location and denying access to original video frames, the method comprising steps of a. a video camera collecting video frame images in an area; b. storing a stream of the video frame images in a video buffer of a volatile memory; c. for each pair of successive video frames in the video buffer, i. computing the diff frames of the pair; ii. erasing the first video frame of the pair from the volatile memory; iii. outputting the diff frames as portrayed motion video.
- the invention provides the above method, further comprising one or more steps of smoothing edges of the portrayed motion in the diff frames, presenting a symbolic graphic illustration of a moving subject, or a combination thereof.
- the invention further provides any one of the above methods, further comprising a step of a wireless communication module transmitting any combination of a. real-time alerts of detected moving object or objects; b. the diff frames; and c. symbolic graphic illustrations of moving subjects.
- the invention further provides any one of the above methods, further comprising a setup of aiming the device to a desired field-of-view without revealing the location's actual image during the setup.
- the invention further provides any one of the above methods, further comprising a step of dynamically varying the frame rate of the analyzed video frames by constantly comparing motion estimation of the same video sequence applied simultaneously to pairs of frames spanning short and long time intervals, and adjusting the frame rate accordingly when comparably fast or slow motions are detected.
- the invention further provides any one of the above methods, further comprising a step of applying pixel acceleration motion detection, wherein each pixel value is replaced by its appropriate acceleration measure as estimated by taking the second derivative of the interpolation curve obtained from previous N frames.
- the invention further provides any one of the above methods, further comprising a step of providing the video camera is separate from the rest of the device, and connected via a wired or wireless communication.
- the invention further provides any one of the above methods, further comprising a step of a video analytics module detecting events computed from the diff frames, the video frames, or a combination thereof.
- the invention further provides the previous method, wherein the events comprise presence of an intruder, a fire alert, a facial recognition, a fall, a violent activity, or any combination thereof.
- the invention further provides any one of the previous two methods, further comprising a step of sending alerts of the events to external devices.
- the invention further provides any one of the previous three methods, further comprising a step of providing an audio enhancement device, comprising a. a microphone; b. an audio buffer configured to store an audio signal collected by the microphone; c. an audio stamp database, storing audio stamps of event sounds; d. an audio analytics module configured to identify an audio event stored in the audio buffer by comparison with the audio stamps.
- an audio enhancement device comprising a. a microphone; b. an audio buffer configured to store an audio signal collected by the microphone; c. an audio stamp database, storing audio stamps of event sounds; d. an audio analytics module configured to identify an audio event stored in the audio buffer by comparison with the audio stamps.
- the invention provides the previous method, further comprising a step of correlating results of the audio analytics module with results of the video analytics module.
- Fig. 1 is a clear portrayal of the contours of a moving subject in the device’s field-of- view, produced according to some embodiments of the invention.
- Fig. 2 is a portrayal in which segments of a recognized moving subject are replaced with a symbolic graphic illustration.
- Fig. 3 is a functional block diagram an of an identity-concealing motion detecting and portraying device, according to some embodiments of the invention.
- Fig. 4 illustrates a process in which the earlier of the two video frames producing a differential frame is erased from the RAM after the diff frame is computed.
- Figs. 5 and 6 show, respectively, an image of a location with no moving subjects and a non-disclosing view of the image produced according to some embodiments of the invention.
- Fig. 7 is a functional block diagram of an of an identity-concealing motion detecting and portraying device with AI analytics, according to some embodiments of the invention.
- Fig. 8 is a functional block diagram of an identity-concealing motion detecting and portraying device in which AI analysis of specific events and features is based exclusively on diff images, according to some embodiments of the invention.
- Fig. 9 shows a comparison between images from a scene of a falling subject and identity-concealing motion portrayals of the scene at the time of the images.
- Fig. 10 shows morphological signatures of a dog, a woman, and a man, which can be compared with identity-concealing motion portrayals which can be used to determine the type of moving subject in an identity-concealing motion portrayal.
- Fig. 11 is a functional block diagram of an identity-concealing motion detecting and portraying device, where AI analysis of specific events and features is made from the full visual data of video frames and sequences, according to some embodiments of the invention.
- Fig. 12 is a functional block diagram of an audio enhancement 500 usable with an identity-concealing motion detecting and portraying device, according to some embodiments of the invention.
- Fig. 13 is a functional block diagram of an of an identity-concealing motion detecting and portraying device 600 with a unidirectional “waterfall” data link, according to some embodiments of the invention.
- motion portrayal refers to providing images facilitating detection of motion (by human or machine).
- the provided images accentuate the edges of moving objects.
- full visual video data or simply “full visual data” refers to unprocessed video frames as initially acquired by a video camera or repetitive still camera.
- Visual data can refer as well to video frames that have been processed to extract only an outline of moving subjects, as further described herein.
- Non-visual data refers to data about motion in a video (full-visual or visual) extracted from the video frames.
- the current invention comprises an identity-concealing motion detecting and portraying device that does not store acquired visual and/or IR video data in any externally-accessible memory.
- the acquired video data is stored temporarily for processing on the device’s random-access memory (RAM), and it is cleared from the RAM immediately after being processed.
- RAM random-access memory
- N video frames are stored in the device’s RAM, while the Nth frame is compared with a successive frame (e.g., the N+lst, N+2nd, or N+nth frame) are compared using motion estimation and image comparison technologies, detecting the edges of any present moving objects.
- a successive frame e.g., the N+lst, N+2nd, or N+nth frame
- Video cameras sample their field-of-view a few times per second, typically from 24 to 60 frames per second (FPS).
- the device computes a simple difference between successive video frames.
- a differential frame is called a diff frame (or simply a diff).
- the device computes a simple difference between successive frames — whether consecutive frames N+l or by skipping to N+2 or N+n — creating a clear portrayal of the contours of a moving subject in the device’s field-of-view, as demonstrated in Fig. 1.
- the portraying of the motion in terms of thickness of the contours, in pixels, is correlated to the distance the moving subject passed between the frames. The greater the time interval between the subtracted frames, the thicker will be the contour line of the difference image of the moving subject. The thickness is also affected by the speed of the motion. The quicker the motion, the thicker will be the contour of the moving object.
- a lower frame rate of 1 to 12 FPS is typically sufficient.
- a large interval of time e.g., more than 100 ms
- Enhancing the contrast and/or reducing the brightness of the diff image could be one way of discarding such details for concealing the identity of the moving objects.
- Another way could be to reduce the color depth to 8, 4, or even 2 colors.
- smoothing the contours and thinning the lines by one of many edge detection techniques known in the art, and/or even vectorizing the diff images for reduced data size/rate for optimized transmission.
- FIG. 2 Another way of concealing identity is by replacing parts of a recognized moving object with a symbolic graphic illustration 50 of a moving subject.
- the identity-concealing motion detecting and portraying device does not enable the pass-through of any imagery.
- Video information is used for processing of the motion detection is discarded and deleted/erased from the device’s memory, and cannot be accessed for viewing or transmission. Only the processed data of the motion of the moving objects, which are monitored by the field-of-view of the device, is stored and can be shared by the device.
- the device 100 comprises a video camera 105, one or more volatile memories (RAM) 110, one or more processors 115 (collectively, “the processor”), a video encoder 130, a communication means 135, and one or more storage media 140.
- RAM volatile memories
- processors 115 collectively, “the processor”
- the video camera 105 is a camera sensor attached to the required hardware for acquiring video frames and feeding them to a video buffer 120 in the RAM 105, and typically nothing more.
- the video camera 105 is positioned to capture an image of a surveilled area or area under observation. Typically, the video camera 105 acquires image frames at frame rates of 1-12 FPS.
- the video camera 105 can be an integral component of the device 100 according to the current invention, although it is possible to implement the current invention by connecting an off-the-shelf video camera (or still camera capable of acquiring video), to an independent device designed according to the current invention.
- the processor 115 and the video encoder 130 which encodes the diff stream 125 to produce the output stream 132, are part of different devices.
- the video encoder 130 does not have any access to the original video frames. The only data the video encoder 130 can access is that of the diff images from the diff stream 125. The diff images are sufficiently obscure to maintain the privacy of the location and objects within the field-of-view of the device 100.
- the processor 115 includes the video encoder 130 (e.g., the function of encoding the diff stream 125 to produce the output stream 132 is done by the processor). Referring to Fig.
- the processor 115 implements a process in which the earlier of the two video frames producing the diff frame is erased from the RAM 110 after the diff frame is computed. This process denies the video encoder 130 any access to the original video frames. Once the diff images are produced, the original video frames are erased from the video buffer 120. Upon creation of the diff stream 125, the original video frames data can no longer be accessed, because it no longer exists.
- the diff frames can be coded (at their original or at reduced resolution) and wrapped as a video stream, which can be transmitted over wire or wirelessly for remote monitoring of the location.
- the current invention enables a simple and effective alert whenever a certain level of motion is detected.
- the processor 115 counts the number of pixels measuring different light intensity between successive video frames in the video buffer 120.
- the processor reduces diff frames to 1-bit color depth and counts the number of white or black pixels in the diff frames.
- contours of the moving objects can be classified based on simple morphology, discriminating between pets, birds, humans, etc. by any means known in the art.
- the device according to the current invention cannot output an image of the acquired video, but only the difference between successive frames (or any other portrayal of the contours of moving objects in the device’s field-of-view).
- the current invention enables simply setting the device’s field-of-view without disclosing an identifiable image of the location.
- the device is set up mode so that acquired frames are synthetically shifted a few pixels (either horizontally or vertically or both), as if the whole scene is moving.
- Such synthetic motion enables output a diff image or diff video stream of the contours of the objects in the device’s field-of-view.
- Fig. 5 presents a location with no moving subjects.
- a static device according to the current invention will present a blank image (black, gray, white, or any other color) of the device’s field-of-view and will not display any image.
- the resulting absence of reference features presents a problem for setting the device for monitoring a desired field-of-view.
- the processor implements a non-disclosing view of the scene.
- the view is achieved by shifting the successive acquired frames a few pixels horizontally and a few pixels vertically from frame N to frame N+l and the same shift is applied again shifting N+3 from N+2 and so on (one frame in its original position, with the following frame shifted).
- N+3 one frame in its original position, with the following frame shifted.
- Fig. 7 a functional block diagram of an of an identity- concealing motion detecting and portraying device 200 with a video analytics module 222, according to some embodiments of the invention.
- the analytics module 222 may employ artificial intelligence (AI), as shown.
- AI artificial intelligence
- the video analytics module 222 is introduced between the video buffer 220 and the diff stream 225, maintaining the elements and functionality of the system described in relation to Fig. 3. Frames acquired by the video camera 205 are fed to the video buffer 220. Before the frames are compared for creating the privacy-protecting diff images, the video analytics module 222 analyzes video frames and short video sequences, using the state-of-the-art methods for face recognition, fall detection, lack of motion, and/or other hazardous situations.
- the video analytics module 222 stores the results of the analysis are non-visual data.
- non-visual data may include for example, when dealing with face recognition, only the 2D and/or 3D geometric ratios and relative angles of the facial features (eyes, nose, nostrils, forehead, eyebrows, ears, chin, hair-line etc.) of the analyzed subjects.
- Fig. 8 a functional block diagram of an identity-concealing motion detecting and portraying device 300, where video analysis 322 of specific events and features is based exclusively on diff images.
- Event detection from the diff images can be implemented by any means known in the art.
- the diff images may ease the effort of analyzing the data, as subjects are separated from the static scene in which they are located. For example, if the colors of the objects and the background are similar, it may be easier to analyze motion based on edge detection of pure diff between frames, because the static background in such cases is not present in the diff images. (In other cases, however, diff image analysis could be more complex due to the lack of detailed visual data. For example, detecting falls and human postures for example may be done more accurately when the full visual data is available, as described herein in relation to Fig. 11)
- any means known in the art may be employed for filtering noise from the diff images and summing the number of groups of adjacent pixels (blobs) of above a threshold pixel number as they move into the scene.
- Frames are fed from a video camera 305 to the video buffer 320 of the device’ s volatile memory.
- the frames are compared and frames of standard diff images or enhanced diff images are created in the diff stream 325.
- the original video frames in the video buffer 320 are erased (and written over).
- sequences of such diff frames are processed, analyzing the quantity of moving pixels, which are contained in groups of adjacent pixels (blobs), and the vector of the motion on such groups of pixels, compared to the motion of such groups of pixels (if exists) in the previous frames.
- detecting a fall may be implemented by a known technique, such as calculating the acceleration of moving diff pixels vertically, while a rectangular enveloping such pixels changes the ratio between its horizontal and vertical dimensions significantly.
- a known technique such as calculating the acceleration of moving diff pixels vertically, while a rectangular enveloping such pixels changes the ratio between its horizontal and vertical dimensions significantly.
- an alert could be verified by a morphological comparison of the signatures of the moving objects in the scene (man, woman, child pet, or some pre-defined object), as illustrated in Fig. 10.
- the techniques for performing comparisons and matching such morphological stamps can be implemented by any means known in the art.
- Fig. 11 a functional block diagram of an identity- concealing motion detecting and portraying device 400, where analysis 422 of specific events and features is made from the full visual data of the video frames and sequences, while stored in the video buffer 420, according to some embodiments of the invention, before creating the diff frames and discarding the full visual data.
- the full-video analytics module 422 may be the sole analytical component of the device 400, or it can be used as an assisting decision-making component, which is used in combination with the analytics of the diff frames, as described herein (e.g., diff-frame analytics module 322 in Fig. 8). Verifying positive identification of triggering events by correlating the analytics of the full visual frames with the analytics of the diff frames, could lead to more accurate results, minimizing the percentage of false positive and false negative identifications.
- FIG. 12 a functional block diagram an audio enhancement device 500 usable with an identity-concealing motion detecting and portraying device.
- the current invention conceals the identity of the monitored location and people by preventing the streaming of audio from the device.
- the device 500 is designed to identify specific events by comparing outlying audio signals to stored audio stamps 565 stored in the device 500.
- stored audio stamps 565 may include, among other stamps, various sounds of falls, triggering sounds and/or words, which can be also added by recording the user/s.
- Fig. 13 a functional block diagram of an of an identity- concealing motion detecting and portraying device 600 with a one-way “waterfall” data link 670, according to some embodiments of the invention.
- the device 600 is divided into a first section 602 and a second section 604.
- the first section 602 performs the first (initial) phase of acquiring the video frames.
- a processor 615 of the first section computes diff frames.
- the first section may comprise an analytics module 622 that performs analytics on the full visual video frames.
- Diff frames are fed to the frame buffer 620' of the second section 604 as the source video frames of the second section 604.
- the diff frames are fed over a unidirectional “waterfall” link 670.
- the waterfall link 670 is implemented by a single transmitter 672 of the first section 602 and a single receiver 674 of the second section 604.
- the waterfall link may be implemented by a unidirectional serial connection, over a unidirectional optical fiber, as analog video (converted from digital to analog, sent over cable and then digitized, coded and broadcast when required), and/or similar unidirectional means.
- a video encoder 630 of the second section 604 encodes the diff images as a video stream 632 and broadcasts the stream 632 when required.
- the second section 604 further issues alerts according to the full-visual video analytics (by an analytics module 622 of the first section 602) and/or diff images analytics (by an analytics module 622' of the second section 604).
- the diff frames are the only visual output of the first section 602 to the second section 604. Furthermore, because the waterfall link 670 is unidirectional and the external network 635 is connected only to the second section 604, the first section 602 is unable to receive external requests from the network 635 for the full visual data. (Only portions of the device 600 downstream from the waterfall link 670 may be externally accessed.) Therefore a hacker has no way to access and steal the full visual data; cyber-privacy is thereby preserved.
- both the first section 602 and the second section 604 each possess an independent memory, including video buffers 620, 620' and independent processors 615, 615'.
- a video encoder 630 encrypts the output video of the second section 604, thereby requiring decryption at the client’s end.
- the analytics module 622 of the first section 602 is connected to the first analytics module 622 and receives software updates via a non-wireless connection, such as an SD card or USB, thereby obviating the need to be connected to a network.
- a non-wireless connection such as an SD card or USB
- the first section analytics module 622 uses encrypted files.
- a waterfall link 670 is placed at the connection carrying the diff frames.
- unidirectional waterfall links may be placed at the connections transmitting the video stream 632 and/or the alerts 645 to the external network 635.
- the video encoder 630 may have only a WiFi transmitter to the external network 635 and no receiver.
- the alert module 645 may, for example, receive, over a waterfall link, embedded video signals such as colored pixels, such as macroblocks of 8x8 pixels in 1 of e.g., 16 or 24 colors, each one of the colors associated with a specific event. Only the alert module of the device can possibly be accessed from external devices 650 (or their networks). The portions of the device upstream from the waterfall link carrying the alerts are isolated from external access.
- embedded video signals such as colored pixels, such as macroblocks of 8x8 pixels in 1 of e.g., 16 or 24 colors, each one of the colors associated with a specific event.
- the waterfall link carrying the alerts may be implemented, for example, by a serial transmission cable.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Closed-Circuit Television Systems (AREA)
- Image Analysis (AREA)
- Television Signal Processing For Recording (AREA)
- Studio Devices (AREA)
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/802,320 US20230088660A1 (en) | 2020-02-25 | 2021-02-25 | Identity-concealing motion detection and portraying device |
| EP21760879.3A EP4111430A4 (en) | 2020-02-25 | 2021-02-25 | DEVICE FOR DETECTING AND DISPLAYING AN IDENTIFICATION MASKING MOVEMENT |
| JP2022576240A JP2023515278A (ja) | 2020-02-25 | 2021-02-25 | 識別秘匿動き検出および描写装置 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202062981061P | 2020-02-25 | 2020-02-25 | |
| US62/981,061 | 2020-02-25 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021171295A1 true WO2021171295A1 (en) | 2021-09-02 |
Family
ID=77490758
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/IL2021/050214 Ceased WO2021171295A1 (en) | 2020-02-25 | 2021-02-25 | Identity-concealing motion detection and portraying device |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20230088660A1 (https=) |
| EP (1) | EP4111430A4 (https=) |
| JP (1) | JP2023515278A (https=) |
| WO (1) | WO2021171295A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024014278A1 (ja) * | 2022-07-11 | 2024-01-18 | ソニーセミコンダクタソリューションズ株式会社 | 撮像装置およびデータ出力方法 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE102020209025A1 (de) * | 2020-07-20 | 2022-01-20 | Robert Bosch Gesellschaft mit beschränkter Haftung | Verfahren zur Bestimmung einer auffälligen Teil-Sequenz einer Überwachungs-Bildersequenz |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160171852A1 (en) * | 2014-12-12 | 2016-06-16 | Andy Lin | Real-time video analysis for security surveillance |
| US20170289504A1 (en) * | 2016-03-31 | 2017-10-05 | Ants Technology (Hk) Limited. | Privacy Supporting Computer Vision Systems, Methods, Apparatuses and Associated Computer Executable Code |
| US20180330591A1 (en) * | 2015-11-18 | 2018-11-15 | Jörg Tilkin | Protection of privacy in video monitoring systems |
| US20190289261A1 (en) * | 2016-07-21 | 2019-09-19 | Gl D&If Inc. | Network separation device and video surveillance system employing the same |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2004021495A (ja) * | 2002-06-14 | 2004-01-22 | Mitsubishi Electric Corp | 監視システムおよび監視方法 |
| JP4412323B2 (ja) * | 2006-12-28 | 2010-02-10 | 株式会社日立製作所 | 映像処理装置及び映像表示装置 |
| JP2009033564A (ja) * | 2007-07-27 | 2009-02-12 | Sanyo Electric Co Ltd | 表示装置及び表示プログラム |
| JP2012160972A (ja) * | 2011-02-01 | 2012-08-23 | Canon Inc | 画像処理装置及び画像処理方法 |
| US8848978B2 (en) * | 2011-09-16 | 2014-09-30 | Harman International (China) Holdings Co., Ltd. | Fast obstacle detection |
| IL250010B (en) * | 2016-02-14 | 2020-04-30 | Waterfall Security Solutions Ltd | Secure connection with protected facilities |
| JP6377672B2 (ja) * | 2016-02-23 | 2018-08-22 | 株式会社 ミックウェア | システム |
| US10370102B2 (en) * | 2016-05-09 | 2019-08-06 | Coban Technologies, Inc. | Systems, apparatuses and methods for unmanned aerial vehicle |
| US10417745B2 (en) * | 2016-06-28 | 2019-09-17 | Raytheon Company | Continuous motion scene based non-uniformity correction |
| WO2019206239A1 (en) * | 2018-04-27 | 2019-10-31 | Shanghai Truthvision Information Technology Co., Ltd. | Systems and methods for detecting a posture of a human object |
| IL268485B (en) * | 2018-08-13 | 2022-04-01 | Waterfall Security Solutions Ltd | Automatic security response using one-way links |
| WO2020073172A1 (en) * | 2018-10-08 | 2020-04-16 | Huawei Technologies Co., Ltd. | Methods and devices for capturing high-speed and high-definition videos |
| US10853953B2 (en) * | 2019-04-12 | 2020-12-01 | Logitech Europe S.A. | Video content activity context and regions |
-
2021
- 2021-02-25 JP JP2022576240A patent/JP2023515278A/ja active Pending
- 2021-02-25 EP EP21760879.3A patent/EP4111430A4/en active Pending
- 2021-02-25 WO PCT/IL2021/050214 patent/WO2021171295A1/en not_active Ceased
- 2021-02-25 US US17/802,320 patent/US20230088660A1/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20160171852A1 (en) * | 2014-12-12 | 2016-06-16 | Andy Lin | Real-time video analysis for security surveillance |
| US20180330591A1 (en) * | 2015-11-18 | 2018-11-15 | Jörg Tilkin | Protection of privacy in video monitoring systems |
| US20170289504A1 (en) * | 2016-03-31 | 2017-10-05 | Ants Technology (Hk) Limited. | Privacy Supporting Computer Vision Systems, Methods, Apparatuses and Associated Computer Executable Code |
| US20190289261A1 (en) * | 2016-07-21 | 2019-09-19 | Gl D&If Inc. | Network separation device and video surveillance system employing the same |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4111430A4 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024014278A1 (ja) * | 2022-07-11 | 2024-01-18 | ソニーセミコンダクタソリューションズ株式会社 | 撮像装置およびデータ出力方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2023515278A (ja) | 2023-04-12 |
| US20230088660A1 (en) | 2023-03-23 |
| EP4111430A1 (en) | 2023-01-04 |
| EP4111430A4 (en) | 2024-07-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6774905B2 (en) | Image data processing | |
| KR101215948B1 (ko) | 신체정보 및 얼굴인식에 기반한 감시 시스템의 영상정보 마스킹 방법 | |
| US9288451B2 (en) | Image processing apparatus and image processing method | |
| KR101570339B1 (ko) | 영상분석 서버를 통한 미래 발생범죄 예측 시스템 | |
| US20110128150A1 (en) | System and method for electronic surveillance | |
| TW201722136A (zh) | 保全系統及方法 | |
| KR101513215B1 (ko) | 객체 행동패턴 cctv 영상 분석서버 | |
| CN109711318B (zh) | 一种基于视频流的多人脸检测与跟踪方法 | |
| KR101084914B1 (ko) | 차량번호 및 사람 이미지의 인덱싱 관리시스템 | |
| KR102297575B1 (ko) | 지능형 영상 감시 시스템 및 방법 | |
| US12175848B2 (en) | Computer implemented method, apparatus and computer program for privacy masking video surveillance data | |
| KR101492473B1 (ko) | 사용자 기반 상황 인지형 씨씨티비 통합관제시스템 | |
| KR101921868B1 (ko) | 고해상도 카메라를 이용한 지능형 영상 감시 시스템 및 그 방법 | |
| US20230088660A1 (en) | Identity-concealing motion detection and portraying device | |
| EP1266525B1 (en) | Image data processing | |
| KR101340897B1 (ko) | 스쿨존 cctv 감시장치 성능개선 시스템 | |
| KR101547255B1 (ko) | 지능형 감시 시스템의 객체기반 검색방법 | |
| KR101926510B1 (ko) | 광각카메라를 이용한 안면인식 기반의 광역 감시 시스템 | |
| KR101473971B1 (ko) | 다기능 방범카메라 통합방범 시스템 | |
| KR101513467B1 (ko) | 스쿨존 cctv 감시장치 성능개선 시스템 | |
| KR101688910B1 (ko) | 다중 레벨 얼굴 특징을 이용한 얼굴 마스킹 방법 및 그 장치 | |
| CN110519562B (zh) | 移动侦测方法、装置和系统 | |
| KR101926435B1 (ko) | 시간압축방식을 이용한 객체 추적 시스템 | |
| KR102058723B1 (ko) | 영상에서 추출된 객체를 암호화하고 데이터베이스를 구축하는 시스템의 제어방법 | |
| JP2001094969A (ja) | 映像処理装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21760879 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2022576240 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2021760879 Country of ref document: EP Effective date: 20220926 |