WO2016184314A1 - Dispositif et procédé permettant d'établir des informations d'image vidéo structurées - Google Patents

Dispositif et procédé permettant d'établir des informations d'image vidéo structurées Download PDF

Info

Publication number
WO2016184314A1
WO2016184314A1 PCT/CN2016/081149 CN2016081149W WO2016184314A1 WO 2016184314 A1 WO2016184314 A1 WO 2016184314A1 CN 2016081149 W CN2016081149 W CN 2016081149W WO 2016184314 A1 WO2016184314 A1 WO 2016184314A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
file
structured
video image
video
Prior art date
Application number
PCT/CN2016/081149
Other languages
English (en)
Chinese (zh)
Inventor
杜晓通
王伟
邢大天
Original Assignee
山东大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 山东大学 filed Critical 山东大学
Publication of WO2016184314A1 publication Critical patent/WO2016184314A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings

Definitions

  • the present invention relates to the field of video image information processing technologies, and in particular, to an apparatus and method for constructing structured video image information.
  • the video information intelligent analysis technology is derived from computer vision technology and pattern recognition technology, which can establish a one-to-one mapping relationship between image and image content description, so that the computer can understand the specific content in the video image through digital image analysis. . It is an important means of mining valuable things from massive video and image resources.
  • the main algorithms of intelligent video analysis technology to realize real-time detection, recognition and multi-target tracking of mobile targets are divided into the following five categories: target detection, target tracking, target recognition, behavior analysis, content-based video retrieval and data fusion. Wait.
  • the object of the present invention is to solve the above problems, and an apparatus and method for constructing structured video image information are proposed.
  • the information collected by a plurality of different sensors is integrated.
  • text description information is created for the video image, and the retrieval speed and utilization efficiency of the video image are improved.
  • An apparatus for constructing structured video image information comprising: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface;
  • the CCD/CMOS image sensor module is connected to a video processor, and the wireless sensor network receiving module is connected to the CPU processor, and the outputs of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module and the Ethernet Network/WiFi interface connection;
  • the video processor is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information Conformed with other data to form structured video image information.
  • the wireless sensor network module uses the ISM frequency band for data transmission, and has a standard sensor input interface, and supports 0-5v, 0-10v standard sensor signal access.
  • the apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
  • a method of constructing a structured video image information device comprising:
  • the specific method of the step (1) is:
  • the original unstructured JPEG image file is constructed into a structured JPEG.
  • File image information + text information. The text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
  • the tag code consists of two bytes.
  • the previous byte is fixed at 0xFF to indicate the start of the tag code.
  • the different values of the latter byte represent different meanings.
  • the file is parsed from 0xFFD8, and the end of 0xFFD9 parsing ends.
  • the specific method of the step (2) is:
  • the description file and the video file are associated as two parts of the structured information, and a corresponding description file is created for the video file of each time period to supplement the text of the single video image;
  • the sensor data or manually input text information is fused, and the packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission.
  • the method for storing the structured JPEG image file is:
  • segmentation or sub-folder storage according to the content of the text information.
  • the method for storing the structured video image information is:
  • An index file is constructed for each field of the description file in the structured video image information, and the index file is divided into several levels, and the received text information is stored according to the content of the corresponding level.
  • the method for searching the structured JPEG image file is:
  • the text information length is obtained by parsing the JPEG file data, and then all the text information is extracted according to the length, and compared with the search condition, if the search condition is met, the JPEG file is the image file we are looking for, otherwise it is performed. The comparison of the next file.
  • the method for searching the structured video image information is:
  • Search from the storage directory enter the corresponding first-level search directory, view the index files of all levels to select the qualified directory, put it into the search queue, and obtain the next-level search file from the search queue after the search directory of the level is completed.
  • the path is searched until the corresponding description file location is retrieved; after the description file is retrieved, the corresponding video file is obtained.
  • the device for constructing structured video image information in the invention can first realize the functions of capturing, encoding and transmitting video and images of a conventional camera, and is fully compatible with existing cameras conforming to international standards. On this basis, by adding new device modules and structured information processing algorithms, it is possible to use the other information (text description, standard sensor data) to create explicit description information for videos and images on the video and image information collection end. .
  • the invention attaches text information (generally sensor information) to a JPEG file to make it a new file with a sensor information label.
  • the sensor information can describe the specific environment information when the JPEG image file is taken, and can make the JPEG file better. Reproduce the scene when shooting. Since the invention does not destroy the structure and content of the original JPEG file, it does not prevent the existing software from reading and displaying the JPEG file, and also protects the security of the sensor information and prevents tampering by the existing software.
  • FIG. 1 is a structural diagram of an apparatus for constructing structured video image information according to the present invention
  • FIG. 2 is a schematic diagram of structured JPEG image information according to the present invention.
  • FIG. 3 is a schematic diagram showing the macroscopic angle of structured video information according to the present invention.
  • FIG. 4 is a schematic diagram of a microscopic angle of structured video information according to the present invention.
  • FIG. 5 is a schematic diagram of storing a picture file according to the present invention.
  • FIG. 6 is a schematic diagram of storing a multi-index video file according to the present invention.
  • FIG. 7 is a video image file retrieval process of the present invention.
  • the device structure for constructing structured video image information in the present invention is as shown in FIG. 1, and includes: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface. ;
  • the CCD/CMOS image sensor module is connected to the video processor, the wireless sensor network receiving module is connected to the CPU processor, the output of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module is connected with the Ethernet/WiFi interface.
  • the device of the invention also designs a wireless sensor network receiving module capable of accepting standard sensors such as text description, temperature, humidity, illumination, pressure and the like.
  • the general video processor (chip) is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data reception or directly connect the standard sensors such as temperature, humidity, illumination, pressure, etc., and simulate the information. Digitizing.
  • the information structure algorithm of the present invention is implemented on a general purpose CPU, and the video image information is fused with other data to form structured video image information.
  • the video processor supports full HD encoding function, adopts pixel high-definition video and image sensor of 130w or more for video acquisition, and the wireless sensor network module uses ISM frequency band for data transmission, and has a standard sensor input interface and supports 0-5v, 0-10v standard sensor signal access.
  • the video processor is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information It is combined with other data to form structured video image information, and the two types of information are combined and uploaded to the server for processing.
  • the number of sensors is set at the time of fusion to calibrate how many sensors collect new data at this time.
  • the structured information packet format is set as follows:
  • the apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
  • a method of constructing a structured video image information device comprising:
  • Figure 2 is a schematic diagram of a JPEG image file structuring method.
  • the information such as text information, standard sensor data, and voice recognition data is encrypted and attached to the original JPEG file, and the original unstructured JPEG image file is structured into a structure by adding information after the JPEG file mark code EOI.
  • JPEG file image information + text information.
  • the text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
  • the sensor information can describe the specific environmental information when the JPEG image file is taken, enabling better reproduction of JPEG files.
  • JPEG File Interchange Format JPEG File Interchange Format
  • EXIF Exchange Image File Format
  • Tag code Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values
  • Tag code format significance SOI (Start Of Image) 0xFFD8 Image start APP0 (Application0) 0xFFE0 Application retention tag 0 SOFO (Start Of Frame) 0XFFC0 Frame image begins EOI(End Of Image) 0xFFD9 End of image, end of image file
  • Compressed data The tag code is followed by compressed data, which records the details of the image file.
  • the software parses the file from 0xFFD8 and ends the parsing at 0xFFD9. If we insert the relevant text information into the position of 0xFFD9 in the file, the software will not parse the segment information. This avoids the impact of textual information on image content and image quality.
  • the file is integrated with the text information. From the information point of view, the whole unstructured image data and structured text information constitute a piece of structured information, that is, we will use JPEG image files and various sources. The text information is associated and the text information is inserted into the JPEG file. At the same time, because the current viewing software does not view the content behind the 0xFFD9 markup code, the security and concealment of our information is guaranteed.
  • Text information Only when we need to query and parse the information behind 0xFFD9 can we get the correct image correspondence. Text information. This kind of information not only helps us to accurately describe the specific details of the image, but also can be used as a retrieval condition to retrieve the corresponding image file by retrieving the structured text information.
  • the video stream structuring method in the present invention encapsulates the original video stream data packet, and the new data packet is a video stream data packet and standard sensor data received by the Internet of Things sensor or
  • the information such as manually input text information is merged, and the data packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission.
  • the format definition of the new data packet is shown in Table 2.
  • the server After receiving the data packet, the server parses it to generate the corresponding video file and its description file. The video file is structured and encapsulated in this way.
  • the structured video image information device is configured to collect sensor information and video information, and suffix the encoded video information and the encrypted sensor information into a JPEG image file or form a corresponding description file for the video file to form a standard data stream. Output according to standard communication protocols (usually TCP/IP).
  • Figure 3 is a macro perspective view of structured video information. Macroscopically, a description file for video files is added, and description files and video files are associated as two parts of structured information. A textual supplement to a single video image is created by creating its corresponding profile for the video file for each time period. Make the abstract video picture more specific and more substantial.
  • Figure 4 is a schematic diagram of the microscopic angle of structured video information.
  • the description file records the location of the video capture, the absolute time of the shot, and the relative time at which the video file began, as well as the textual information generated during the capture.
  • the information can be encrypted and stored, and needs to be viewed. It can be viewed by decrypting it with a private key.
  • the image file storage and retrieval method of the present invention obtains the length of the text information by parsing the 4-byte data behind the JPEG file tail 0xFFD9, and then extracts all the text information according to the length, and compares with the search condition, if the conditions are met Explain that this JPEG file is the image file we are looking for, otherwise we will compare the next file.
  • the principle of “B+tree” can be used to store segments according to the content of text information in the process of storage, and the folder where the conditions are located is first searched during retrieval. And then search in a subfolder to reduce the number of files viewed.
  • Figure 5 is a schematic diagram of image file storage (taking temperature as an example).
  • the image file is "B+tree" for intelligent classification and storage.
  • searching first determine which temperature segment the retrieval condition is in, and enter the corresponding directory for retrieval. .
  • the subdirectory is retrieved, the image that meets the criteria is output.
  • the video storage and retrieval method of the present invention uses the database "B + tree” idea to construct an index file for each field in the description file, and divides the index file into "engineering", "installation location", "year”, “month”, “day”
  • the received text information is stored in several levels. Search from the storage directory at the time of retrieval, enter the corresponding project directory, view the index files of all levels to select the qualified directory, put it into the retrieval queue, and obtain the next-level search file from the retrieval queue after the retrieval of the retrieval directory at this level. The path is retrieved until the corresponding profile location is retrieved. After the description file is retrieved, the corresponding video file can be obtained.
  • Figure 6 is a schematic diagram of multi-index video file storage.
  • Index files at various levels record the number of devices, the number of files, conditional flags, and so on.
  • the index file is generated by "bottom-up”. After receiving the text information, the index file is updated upwards in turn.
  • Engineing Index File records the number of devices included in the current project and the installation location of each device, project creation time and deadline, project file storage directory and number of files stored.
  • “Location index file” records the start time and stop time of the current device shooting, the number of video files captured, the storage address of the "year value” directory, and the text information identification and data content generated during the shooting of the device. Wait.
  • the “Annual Index File” records the month in which the device was used normally, the number of video files captured, the storage address of the “monthly value” directory, and the textual information identifier and its data content generated during the shooting of the device.
  • the “monthly index file” records the number of days the device is used normally in the month, the number of video files captured, the storage address of the “date” directory, and the text information identifier and its data content generated during the shooting of the device.
  • the "date index file” records the number of video files taken on the day of the device, the file name of each video file and the file name of the description file, and the text information identifier and its data content contained in each video file.
  • Figure 7 is a video image file retrieval process.
  • the detailed search process is as follows:
  • step 4 Dequeue the date in the date queue, enter the corresponding date directory, find the matching file, and output the file name. Repeat step 4 until the date queue is empty.
  • step 3 Repeat step 3 until the month value queue is empty.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Television Signal Processing For Recording (AREA)
  • Multimedia (AREA)

Abstract

L'invention concerne un dispositif et un procédé permettant d'établir des informations d'image vidéo structurées. Le dispositif est caractérisé en ce que : un module de capteur d'image CCD/CMOS est connecté à un processeur vidéo, un module de réception de réseau de capteurs sans fil est connecté à un processeur d'unité centrale de traitement (CPU), des sorties du processeur vidéo et du processeur CPU sont connectées respectivement à un module de fusion d'informations, et le module de fusion d'informations est connecté à une interface Ethernet/WiFi. Le procédé consiste : à fusionner des informations d'image vidéo avec d'autres données, ce qui forme des informations d'image vidéo structurées. Des informations de description de texte sont établies pour une image vidéo par fusion d'informations collectées par divers capteurs et utilisation d'informations des différents capteurs et des avantages d'un mode de collecte des différents capteurs, ce qui permet d'augmenter la vitesse d'extraction et le taux d'utilisation de l'image vidéo.
PCT/CN2016/081149 2015-05-20 2016-05-05 Dispositif et procédé permettant d'établir des informations d'image vidéo structurées WO2016184314A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510260225.5A CN104899261B (zh) 2015-05-20 2015-05-20 一种构建结构化视频图像信息的设备和方法
CN201510260225.5 2015-05-20

Publications (1)

Publication Number Publication Date
WO2016184314A1 true WO2016184314A1 (fr) 2016-11-24

Family

ID=54031924

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/081149 WO2016184314A1 (fr) 2015-05-20 2016-05-05 Dispositif et procédé permettant d'établir des informations d'image vidéo structurées

Country Status (2)

Country Link
CN (1) CN104899261B (fr)
WO (1) WO2016184314A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110751065A (zh) * 2019-09-30 2020-02-04 北京旷视科技有限公司 训练数据的采集方法及装置
CN111783404A (zh) * 2020-06-18 2020-10-16 上海华力集成电路制造有限公司 数据处理方法及系统
CN113515649A (zh) * 2020-11-19 2021-10-19 阿里巴巴集团控股有限公司 数据的结构化方法、系统、装置、设备和存储介质
CN117633297A (zh) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 基于注释的视频检索方法、装置、系统及介质

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899261B (zh) * 2015-05-20 2018-04-03 杜晓通 一种构建结构化视频图像信息的设备和方法
CN105847254B (zh) * 2016-03-23 2018-10-16 司南 数据分享方法及装置
CN106294600A (zh) * 2016-07-29 2017-01-04 易心悦 一种数码照片的多媒体编辑方法、展示方法及系统
CN106803937B (zh) * 2017-02-28 2020-03-17 兰州理工大学 一种具有文本日志的双摄像头视频监控方法、系统和监控装置
CN107025292A (zh) * 2017-04-14 2017-08-08 国网江苏省电力公司无锡供电公司 面向变电站内视频与异构传感器的结构化描述方法
CN107222583A (zh) * 2017-08-08 2017-09-29 江苏优闼数据科技有限公司 一种融合结构化数据和非结构化数据的数据传输方法
CN107749963A (zh) * 2017-10-17 2018-03-02 北京工商大学 一种多感知源信息融合视频方法
CN113115069A (zh) * 2021-02-19 2021-07-13 深圳市麦谷科技有限公司 一种行车记录仪的视频存储方法和系统
CN113114968A (zh) * 2021-04-13 2021-07-13 中国建设银行股份有限公司 一种视频处理方法、装置、设备及存储介质
CN113656364B (zh) * 2021-08-05 2024-02-20 福瑞泰克智能系统有限公司 传感器数据处理方法、装置和计算机可读存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030031260A1 (en) * 2001-07-16 2003-02-13 Ali Tabatabai Transcoding between content data and description data
CN101630324A (zh) * 2009-08-18 2010-01-20 北京航空航天大学 一种在多媒体资源中存取地理位置信息的方法
CN101783881A (zh) * 2010-03-05 2010-07-21 公安部第三研究所 一种具有视频结构化描述功能的智能网络摄像机
CN102387346A (zh) * 2011-10-17 2012-03-21 上海交通大学 可管、可查、可巡视的监控系统的智能前端
CN103635954A (zh) * 2011-02-08 2014-03-12 隆沙有限公司 基于地理和可视信息增强可视数据流的系统
CN104899261A (zh) * 2015-05-20 2015-09-09 杜晓通 一种构建结构化视频图像信息的设备和方法
CN204795392U (zh) * 2015-06-26 2015-11-18 山东大学 一种构建特种设备异构信息结构化的设备

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4685465B2 (ja) * 2005-02-01 2011-05-18 パナソニック株式会社 監視記録装置
CN101420595B (zh) * 2007-10-23 2012-11-21 华为技术有限公司 一种描述和捕获视频对象的方法及设备
CN101293349A (zh) * 2008-06-05 2008-10-29 广州大学 一种基于Wi-Fi的机器人
CN201830388U (zh) * 2010-10-13 2011-05-11 成都创烨科技有限责任公司 一种视频内容采集及处理装置
CN103186634A (zh) * 2011-12-31 2013-07-03 无锡物联网产业研究院 一种智能交通监控视频的检索方法及装置
CN103595968B (zh) * 2013-11-22 2017-02-22 武汉大学 一种基于地理位置的视频传感器接入方法

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030031260A1 (en) * 2001-07-16 2003-02-13 Ali Tabatabai Transcoding between content data and description data
CN101630324A (zh) * 2009-08-18 2010-01-20 北京航空航天大学 一种在多媒体资源中存取地理位置信息的方法
CN101783881A (zh) * 2010-03-05 2010-07-21 公安部第三研究所 一种具有视频结构化描述功能的智能网络摄像机
CN103635954A (zh) * 2011-02-08 2014-03-12 隆沙有限公司 基于地理和可视信息增强可视数据流的系统
CN102387346A (zh) * 2011-10-17 2012-03-21 上海交通大学 可管、可查、可巡视的监控系统的智能前端
CN104899261A (zh) * 2015-05-20 2015-09-09 杜晓通 一种构建结构化视频图像信息的设备和方法
CN204795392U (zh) * 2015-06-26 2015-11-18 山东大学 一种构建特种设备异构信息结构化的设备

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110751065A (zh) * 2019-09-30 2020-02-04 北京旷视科技有限公司 训练数据的采集方法及装置
CN111783404A (zh) * 2020-06-18 2020-10-16 上海华力集成电路制造有限公司 数据处理方法及系统
CN111783404B (zh) * 2020-06-18 2024-01-09 上海华力集成电路制造有限公司 数据处理方法及系统
CN113515649A (zh) * 2020-11-19 2021-10-19 阿里巴巴集团控股有限公司 数据的结构化方法、系统、装置、设备和存储介质
CN113515649B (zh) * 2020-11-19 2024-03-01 阿里巴巴集团控股有限公司 数据的结构化方法、系统、装置、设备和存储介质
CN117633297A (zh) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 基于注释的视频检索方法、装置、系统及介质
CN117633297B (zh) * 2024-01-26 2024-04-30 江苏瑞宁信创科技有限公司 基于注释的视频检索方法、装置、系统及介质

Also Published As

Publication number Publication date
CN104899261A (zh) 2015-09-09
CN104899261B (zh) 2018-04-03

Similar Documents

Publication Publication Date Title
WO2016184314A1 (fr) Dispositif et procédé permettant d'établir des informations d'image vidéo structurées
US20200250218A1 (en) System and method for signature-enhanced multimedia content searching
US8270684B2 (en) Automatic media sharing via shutter click
US11417074B2 (en) Methods and apparatus for identifying objects depicted in a video using extracted video frames in combination with a reverse image search engine
US20170185675A1 (en) Fingerprinting and matching of content of a multi-media file
KR102434374B1 (ko) 인공 지능에 관한 장치 및 방법
CN101374234A (zh) 一种基于内容的视频拷贝监测方法及装置
US10380267B2 (en) System and method for tagging multimedia content elements
CN103870574A (zh) 基于h.264密文云视频存储的标签制作及索引方法
CN112364201A (zh) 一种视频数据检索方法及系统
CN113114968A (zh) 一种视频处理方法、装置、设备及存储介质
US9524754B2 (en) Video playback device and video recording device
US20210014540A1 (en) Method and system for codec of visual feature data
US8896708B2 (en) Systems and methods for determining, storing, and using metadata for video media content
Kim et al. Photo cube: an automatic management and search for photos using mobile smartphones
CN103198162B (zh) 一种图像浏览交互方法
CN102270228B (zh) 视频检索方法及前端设备、后端服务器
NO20140958A1 (no) Fremgangsmåte og system for søk i digitalt innhold
Mo Design and Implementation of Video Surveillance Identification System in Smart Campus
CN202652389U (zh) 执法记录仪
US20170286434A1 (en) System and method for signature-based clustering of multimedia content elements
CN117290389A (zh) 数据标注方法、装置、电子设备及存储介质
Brut et al. Integrating heterogeneous metadata into a distributed multimedia information system
CN114564614A (zh) 一种视频片段自动搜索方法、系统、装置及可读存储介质
CN117216308A (zh) 基于大模型的搜索方法、系统、设备及介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16795801

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16795801

Country of ref document: EP

Kind code of ref document: A1