WO2016184314A1 - Device and method for establishing structured video image information - Google Patents

Device and method for establishing structured video image information Download PDF

Info

Publication number
WO2016184314A1
WO2016184314A1 PCT/CN2016/081149 CN2016081149W WO2016184314A1 WO 2016184314 A1 WO2016184314 A1 WO 2016184314A1 CN 2016081149 W CN2016081149 W CN 2016081149W WO 2016184314 A1 WO2016184314 A1 WO 2016184314A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
file
structured
video image
video
Prior art date
Application number
PCT/CN2016/081149
Other languages
French (fr)
Chinese (zh)
Inventor
杜晓通
王伟
邢大天
Original Assignee
山东大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 山东大学 filed Critical 山东大学
Publication of WO2016184314A1 publication Critical patent/WO2016184314A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings

Definitions

  • the present invention relates to the field of video image information processing technologies, and in particular, to an apparatus and method for constructing structured video image information.
  • the video information intelligent analysis technology is derived from computer vision technology and pattern recognition technology, which can establish a one-to-one mapping relationship between image and image content description, so that the computer can understand the specific content in the video image through digital image analysis. . It is an important means of mining valuable things from massive video and image resources.
  • the main algorithms of intelligent video analysis technology to realize real-time detection, recognition and multi-target tracking of mobile targets are divided into the following five categories: target detection, target tracking, target recognition, behavior analysis, content-based video retrieval and data fusion. Wait.
  • the object of the present invention is to solve the above problems, and an apparatus and method for constructing structured video image information are proposed.
  • the information collected by a plurality of different sensors is integrated.
  • text description information is created for the video image, and the retrieval speed and utilization efficiency of the video image are improved.
  • An apparatus for constructing structured video image information comprising: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface;
  • the CCD/CMOS image sensor module is connected to a video processor, and the wireless sensor network receiving module is connected to the CPU processor, and the outputs of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module and the Ethernet Network/WiFi interface connection;
  • the video processor is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information Conformed with other data to form structured video image information.
  • the wireless sensor network module uses the ISM frequency band for data transmission, and has a standard sensor input interface, and supports 0-5v, 0-10v standard sensor signal access.
  • the apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
  • a method of constructing a structured video image information device comprising:
  • the specific method of the step (1) is:
  • the original unstructured JPEG image file is constructed into a structured JPEG.
  • File image information + text information. The text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
  • the tag code consists of two bytes.
  • the previous byte is fixed at 0xFF to indicate the start of the tag code.
  • the different values of the latter byte represent different meanings.
  • the file is parsed from 0xFFD8, and the end of 0xFFD9 parsing ends.
  • the specific method of the step (2) is:
  • the description file and the video file are associated as two parts of the structured information, and a corresponding description file is created for the video file of each time period to supplement the text of the single video image;
  • the sensor data or manually input text information is fused, and the packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission.
  • the method for storing the structured JPEG image file is:
  • segmentation or sub-folder storage according to the content of the text information.
  • the method for storing the structured video image information is:
  • An index file is constructed for each field of the description file in the structured video image information, and the index file is divided into several levels, and the received text information is stored according to the content of the corresponding level.
  • the method for searching the structured JPEG image file is:
  • the text information length is obtained by parsing the JPEG file data, and then all the text information is extracted according to the length, and compared with the search condition, if the search condition is met, the JPEG file is the image file we are looking for, otherwise it is performed. The comparison of the next file.
  • the method for searching the structured video image information is:
  • Search from the storage directory enter the corresponding first-level search directory, view the index files of all levels to select the qualified directory, put it into the search queue, and obtain the next-level search file from the search queue after the search directory of the level is completed.
  • the path is searched until the corresponding description file location is retrieved; after the description file is retrieved, the corresponding video file is obtained.
  • the device for constructing structured video image information in the invention can first realize the functions of capturing, encoding and transmitting video and images of a conventional camera, and is fully compatible with existing cameras conforming to international standards. On this basis, by adding new device modules and structured information processing algorithms, it is possible to use the other information (text description, standard sensor data) to create explicit description information for videos and images on the video and image information collection end. .
  • the invention attaches text information (generally sensor information) to a JPEG file to make it a new file with a sensor information label.
  • the sensor information can describe the specific environment information when the JPEG image file is taken, and can make the JPEG file better. Reproduce the scene when shooting. Since the invention does not destroy the structure and content of the original JPEG file, it does not prevent the existing software from reading and displaying the JPEG file, and also protects the security of the sensor information and prevents tampering by the existing software.
  • FIG. 1 is a structural diagram of an apparatus for constructing structured video image information according to the present invention
  • FIG. 2 is a schematic diagram of structured JPEG image information according to the present invention.
  • FIG. 3 is a schematic diagram showing the macroscopic angle of structured video information according to the present invention.
  • FIG. 4 is a schematic diagram of a microscopic angle of structured video information according to the present invention.
  • FIG. 5 is a schematic diagram of storing a picture file according to the present invention.
  • FIG. 6 is a schematic diagram of storing a multi-index video file according to the present invention.
  • FIG. 7 is a video image file retrieval process of the present invention.
  • the device structure for constructing structured video image information in the present invention is as shown in FIG. 1, and includes: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface. ;
  • the CCD/CMOS image sensor module is connected to the video processor, the wireless sensor network receiving module is connected to the CPU processor, the output of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module is connected with the Ethernet/WiFi interface.
  • the device of the invention also designs a wireless sensor network receiving module capable of accepting standard sensors such as text description, temperature, humidity, illumination, pressure and the like.
  • the general video processor (chip) is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data reception or directly connect the standard sensors such as temperature, humidity, illumination, pressure, etc., and simulate the information. Digitizing.
  • the information structure algorithm of the present invention is implemented on a general purpose CPU, and the video image information is fused with other data to form structured video image information.
  • the video processor supports full HD encoding function, adopts pixel high-definition video and image sensor of 130w or more for video acquisition, and the wireless sensor network module uses ISM frequency band for data transmission, and has a standard sensor input interface and supports 0-5v, 0-10v standard sensor signal access.
  • the video processor is used to complete the encoding function of the video image
  • the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information It is combined with other data to form structured video image information, and the two types of information are combined and uploaded to the server for processing.
  • the number of sensors is set at the time of fusion to calibrate how many sensors collect new data at this time.
  • the structured information packet format is set as follows:
  • the apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
  • a method of constructing a structured video image information device comprising:
  • Figure 2 is a schematic diagram of a JPEG image file structuring method.
  • the information such as text information, standard sensor data, and voice recognition data is encrypted and attached to the original JPEG file, and the original unstructured JPEG image file is structured into a structure by adding information after the JPEG file mark code EOI.
  • JPEG file image information + text information.
  • the text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
  • the sensor information can describe the specific environmental information when the JPEG image file is taken, enabling better reproduction of JPEG files.
  • JPEG File Interchange Format JPEG File Interchange Format
  • EXIF Exchange Image File Format
  • Tag code Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code.
  • main tag codes Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values
  • Tag code format significance SOI (Start Of Image) 0xFFD8 Image start APP0 (Application0) 0xFFE0 Application retention tag 0 SOFO (Start Of Frame) 0XFFC0 Frame image begins EOI(End Of Image) 0xFFD9 End of image, end of image file
  • Compressed data The tag code is followed by compressed data, which records the details of the image file.
  • the software parses the file from 0xFFD8 and ends the parsing at 0xFFD9. If we insert the relevant text information into the position of 0xFFD9 in the file, the software will not parse the segment information. This avoids the impact of textual information on image content and image quality.
  • the file is integrated with the text information. From the information point of view, the whole unstructured image data and structured text information constitute a piece of structured information, that is, we will use JPEG image files and various sources. The text information is associated and the text information is inserted into the JPEG file. At the same time, because the current viewing software does not view the content behind the 0xFFD9 markup code, the security and concealment of our information is guaranteed.
  • Text information Only when we need to query and parse the information behind 0xFFD9 can we get the correct image correspondence. Text information. This kind of information not only helps us to accurately describe the specific details of the image, but also can be used as a retrieval condition to retrieve the corresponding image file by retrieving the structured text information.
  • the video stream structuring method in the present invention encapsulates the original video stream data packet, and the new data packet is a video stream data packet and standard sensor data received by the Internet of Things sensor or
  • the information such as manually input text information is merged, and the data packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission.
  • the format definition of the new data packet is shown in Table 2.
  • the server After receiving the data packet, the server parses it to generate the corresponding video file and its description file. The video file is structured and encapsulated in this way.
  • the structured video image information device is configured to collect sensor information and video information, and suffix the encoded video information and the encrypted sensor information into a JPEG image file or form a corresponding description file for the video file to form a standard data stream. Output according to standard communication protocols (usually TCP/IP).
  • Figure 3 is a macro perspective view of structured video information. Macroscopically, a description file for video files is added, and description files and video files are associated as two parts of structured information. A textual supplement to a single video image is created by creating its corresponding profile for the video file for each time period. Make the abstract video picture more specific and more substantial.
  • Figure 4 is a schematic diagram of the microscopic angle of structured video information.
  • the description file records the location of the video capture, the absolute time of the shot, and the relative time at which the video file began, as well as the textual information generated during the capture.
  • the information can be encrypted and stored, and needs to be viewed. It can be viewed by decrypting it with a private key.
  • the image file storage and retrieval method of the present invention obtains the length of the text information by parsing the 4-byte data behind the JPEG file tail 0xFFD9, and then extracts all the text information according to the length, and compares with the search condition, if the conditions are met Explain that this JPEG file is the image file we are looking for, otherwise we will compare the next file.
  • the principle of “B+tree” can be used to store segments according to the content of text information in the process of storage, and the folder where the conditions are located is first searched during retrieval. And then search in a subfolder to reduce the number of files viewed.
  • Figure 5 is a schematic diagram of image file storage (taking temperature as an example).
  • the image file is "B+tree" for intelligent classification and storage.
  • searching first determine which temperature segment the retrieval condition is in, and enter the corresponding directory for retrieval. .
  • the subdirectory is retrieved, the image that meets the criteria is output.
  • the video storage and retrieval method of the present invention uses the database "B + tree” idea to construct an index file for each field in the description file, and divides the index file into "engineering", "installation location", "year”, “month”, “day”
  • the received text information is stored in several levels. Search from the storage directory at the time of retrieval, enter the corresponding project directory, view the index files of all levels to select the qualified directory, put it into the retrieval queue, and obtain the next-level search file from the retrieval queue after the retrieval of the retrieval directory at this level. The path is retrieved until the corresponding profile location is retrieved. After the description file is retrieved, the corresponding video file can be obtained.
  • Figure 6 is a schematic diagram of multi-index video file storage.
  • Index files at various levels record the number of devices, the number of files, conditional flags, and so on.
  • the index file is generated by "bottom-up”. After receiving the text information, the index file is updated upwards in turn.
  • Engineing Index File records the number of devices included in the current project and the installation location of each device, project creation time and deadline, project file storage directory and number of files stored.
  • “Location index file” records the start time and stop time of the current device shooting, the number of video files captured, the storage address of the "year value” directory, and the text information identification and data content generated during the shooting of the device. Wait.
  • the “Annual Index File” records the month in which the device was used normally, the number of video files captured, the storage address of the “monthly value” directory, and the textual information identifier and its data content generated during the shooting of the device.
  • the “monthly index file” records the number of days the device is used normally in the month, the number of video files captured, the storage address of the “date” directory, and the text information identifier and its data content generated during the shooting of the device.
  • the "date index file” records the number of video files taken on the day of the device, the file name of each video file and the file name of the description file, and the text information identifier and its data content contained in each video file.
  • Figure 7 is a video image file retrieval process.
  • the detailed search process is as follows:
  • step 4 Dequeue the date in the date queue, enter the corresponding date directory, find the matching file, and output the file name. Repeat step 4 until the date queue is empty.
  • step 3 Repeat step 3 until the month value queue is empty.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Television Signal Processing For Recording (AREA)
  • Multimedia (AREA)

Abstract

A device and method for establishing structured video image information. The device is characterized in that: a CCD/CMOS image sensor module is connected to a video processor, a wireless sensor network receiving module is connected to a CPU processor, outputs of the video processor and the CPU processor are respectively connected to an information fusion module, and the information fusion module is connected to an Ethernet/WiFi interface. The method comprises: fusing video image information with other data, thus forming structured video image information. Text description information is established for a video image by fusing information collected by various sensors and utilizing information of the different sensors and the advantages of a collecting mode of the different sensors, thereby increasing the retrieval speed and the utilization rate of the video image.

Description

一种构建结构化视频图像信息的设备和方法Apparatus and method for constructing structured video image information 技术领域Technical field
本发明涉及视频图像信息处理技术领域,尤其涉及一种构建结构化视频图像信息的设备和方法。The present invention relates to the field of video image information processing technologies, and in particular, to an apparatus and method for constructing structured video image information.
背景技术Background technique
物联网技术给当前社会的工作和生活方式带来了根本性的改变,它也因此被视为科技界的一次革命。安防技术与物联网技术的融合也是发展的趋势,两者结合后发挥的重大作用也在日益凸显。Internet of Things technology has brought about fundamental changes in the work and lifestyle of the current society, and it has therefore been regarded as a revolution in the scientific community. The integration of security technology and Internet of Things technology is also a development trend, and the important role played by the combination of the two is also becoming increasingly prominent.
随着视频监控数据量越来越大,数据重要性越来越突出,视频信息智能分析技术成为当前研究的主流。视频信息智能分析技术源自计算机视觉技术和模式识别技术,它能够在图像及图像内容的描述之间建立一对一的映射关系,从而使计算机能够通过数字图像解析来理解视频画面中的具体内容。它是实现从海量视频和图像资源中挖掘有价值东西的重要手段。目前,智能视频分析技术实现对移动目标的实时检测、识别以及多目标跟踪等功能的主要算法分为以下五类:目标检测、目标跟踪、目标识别、行为分析、基于内容的视频检索和数据融合等。As the amount of video surveillance data increases, the importance of data becomes more and more prominent, and video information intelligent analysis technology has become the mainstream of current research. The video information intelligent analysis technology is derived from computer vision technology and pattern recognition technology, which can establish a one-to-one mapping relationship between image and image content description, so that the computer can understand the specific content in the video image through digital image analysis. . It is an important means of mining valuable things from massive video and image resources. At present, the main algorithms of intelligent video analysis technology to realize real-time detection, recognition and multi-target tracking of mobile targets are divided into the following five categories: target detection, target tracking, target recognition, behavior analysis, content-based video retrieval and data fusion. Wait.
然而当前的视频分析技术的分析重点仍旧停留在视频图像本身,并没有将视频图像与其他信息进行关联。这些视频图像所蕴含的信息尽管丰富,但很难进行量化,对其内容只能进行抽象的分析,无法有效完成文本语言的描述。这些抽象的内容和视频图像信息一样都属于非结构化信息的范畴,无法通过细化分解的方法进行解析。当信息量极大时,其存储和检索都将耗费大量的系统资源和时间。同时由于智能视频分析技术依赖于模式识别技术,因此,其分析结果的准确性和消耗的时间,将会随识别算法的优劣而变化。目前,识别算法的基础还是基于几十年前的理论,由于理论上没有实现大的突破,致使基于内容的视频图像检索技术始终没有质的变化,这就造成了在视频监控领域没有真正的解决海量视频图像数据的快速定位和准确检索问题。However, the focus of current video analytics technology remains on the video image itself, and the video image is not associated with other information. Although the information contained in these video images is rich, it is difficult to quantify, and the content can only be abstractly analyzed, and the description of the text language cannot be effectively completed. These abstract content, like video image information, belong to the category of unstructured information and cannot be parsed by refinement. When the amount of information is extremely large, its storage and retrieval will consume a lot of system resources and time. At the same time, because the intelligent video analysis technology relies on the pattern recognition technology, the accuracy and time of the analysis results will vary with the quality of the recognition algorithm. At present, the basis of the recognition algorithm is based on the theory of decades ago. Because there is no big breakthrough in theory, the content-based video image retrieval technology has not changed qualitatively, which has caused no real solution in the field of video surveillance. Rapid positioning and accurate retrieval of massive video image data.
发明内容Summary of the invention
本发明的目的就是为了解决上述问题,提出了一种构建结构化视频图像信息的设备和方法,通过将物联网技术与安防监控技术的密切配合,将多种不同传感器采集到的信息进行融合,利用不同传感器信息和其采集方式的优点,为视频图像建立文本描述信息,提高视频图像的检索速度和其利用效率。 The object of the present invention is to solve the above problems, and an apparatus and method for constructing structured video image information are proposed. By closely cooperating with the Internet of Things technology and the security monitoring technology, the information collected by a plurality of different sensors is integrated. Using the advantages of different sensor information and its collection method, text description information is created for the video image, and the retrieval speed and utilization efficiency of the video image are improved.
为了实现上述目的,本发明采用如下技术方案:In order to achieve the above object, the present invention adopts the following technical solutions:
一种构建结构化视频图像信息的设备,包括:CCD/CMOS图像传感器模块、无线传感器网络接收模块、视频处理器、CPU处理器、信息融合模块和以太网/WiFi接口;An apparatus for constructing structured video image information, comprising: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface;
所述CCD/CMOS图像传感器模块与视频处理器连接,无线传感器网络接收模块与CPU处理器连接,所述视频处理器与CPU处理器的输出分别连接至信息融合模块,所述信息融合模块与以太网/WiFi接口连接;The CCD/CMOS image sensor module is connected to a video processor, and the wireless sensor network receiving module is connected to the CPU processor, and the outputs of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module and the Ethernet Network/WiFi interface connection;
利用视频处理器完成视频图像的编码功能,利用无线传感器网络接收模块完成文本描述信息或2进制数据的接收或者直接连接温度、湿度、照度、压力标准传感器,将模拟信息数字化;将视频图像信息与其他数据融合在一起,形成结构化视频图像信息。The video processor is used to complete the encoding function of the video image, and the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information Conformed with other data to form structured video image information.
所述无线传感器网络模块采用ISM频段进行数据传输,同时具有标准传感器输入接口,支持0-5v,0-10v标准传感器信号的接入。The wireless sensor network module uses the ISM frequency band for data transmission, and has a standard sensor input interface, and supports 0-5v, 0-10v standard sensor signal access.
本发明设备为完成视频图像信息结构化提供支撑,其中构建结构化视频图像信息的方法包括构建结构化的JPEG图像文件的方法和通过描述文件构建结构化视频信息的方法。并且基于结构化后的视频图像信息,优化了其存储及检索方法。The apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
一种构建结构化视频图像信息设备的方法,包括:A method of constructing a structured video image information device, comprising:
(1)构建结构化的JPEG图像文件;(1) constructing a structured JPEG image file;
(2)构建结构化的视频图像信息;(2) constructing structured video image information;
(3)分别对结构化的JPEG图像文件和视频图像信息进行存储;(3) storing structured JPEG image files and video image information respectively;
(4)分别对结构化的JPEG图像文件和视频图像信息进行检索。(4) Retrieving structured JPEG image files and video image information, respectively.
所述步骤(1)的具体方法为:The specific method of the step (1) is:
将文本信息、标准传感器数据、语音识别后的数据等信息加密后附着在原有的JPEG文件上,通过在JPEG文件标记码后面添加信息,将原有的非结构化JPEG图像文件构建成结构化JPEG文件(图像信息+文本信息)。在解析时需要将文本信息部分进行解密显示,图像则不受任何影响。Encrypt the text information, standard sensor data, and voice-recognition data and attach it to the original JPEG file. By adding information to the JPEG file tag code, the original unstructured JPEG image file is constructed into a structured JPEG. File (image information + text information). The text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
标记码由两字节构成,前一个字节固定为0xFF代表标记码开始,后一个字节不同值代表不同含义;在图像解析过程中,从0xFFD8开始对文件进行解析,到0xFFD9解析结束。The tag code consists of two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. In the image parsing process, the file is parsed from 0xFFD8, and the end of 0xFFD9 parsing ends.
所述步骤(2)的具体方法为:The specific method of the step (2) is:
将描述文件和视频文件作为结构化信息的两部分进行关联,为每个时间段的视频文件创建其对应的描述文件来对单一的视频图像进行文本化的补充;The description file and the video file are associated as two parts of the structured information, and a corresponding description file is created for the video file of each time period to supplement the text of the single video image;
对原有的视频流数据包进行封装,将原有视频流数据包和物联网传感器接收到的标准传 感器数据或手动输入的文本信息进行融合,将数据包长度、视频流数据包、其他信息长度及其他信息合并成新的数据包进行传输。Encapsulate the original video stream data packet, and pass the original video stream data packet and the standard transmission received by the Internet of Things sensor. The sensor data or manually input text information is fused, and the packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission.
所述步骤(3)中,对结构化的JPEG图像文件进行存储的方法为:In the step (3), the method for storing the structured JPEG image file is:
在存储的过程中按照文本信息内容进行分段或者分文件夹存储。In the process of storage, segmentation or sub-folder storage according to the content of the text information.
所述步骤(3)中,对结构化的视频图像信息进行存储的方法为:In the step (3), the method for storing the structured video image information is:
为结构化的视频图像信息中描述文件的各字段构建索引文件,将索引文件分为若干级,将接收到的文本信息按照相应级的内容进行存储。An index file is constructed for each field of the description file in the structured video image information, and the index file is divided into several levels, and the received text information is stored according to the content of the corresponding level.
所述步骤(4)中,对结构化的JPEG图像文件进行检索的方法为:In the step (4), the method for searching the structured JPEG image file is:
在检索时首先定位条件所在的文件夹,然后在子文件夹中进行检索;When searching, first locate the folder where the condition is located, and then search in the subfolder;
通过解析JPEG文件数据获得文本信息长度,然后根据该长度将全部的文本信息提取出来,通过与检索条件进行匹配对比,如果检索条件符合则说明这个JPEG文件就是我们所要找的图片文件,否则就进行下一文件的对比。The text information length is obtained by parsing the JPEG file data, and then all the text information is extracted according to the length, and compared with the search condition, if the search condition is met, the JPEG file is the image file we are looking for, otherwise it is performed. The comparison of the next file.
所述步骤(4)中,对结构化的视频图像信息进行检索的方法为:In the step (4), the method for searching the structured video image information is:
从存储目录开始查找,进入对应第一级检索目录,查看各级索引文件选择符合条件的目录,放入检索队列,在本级检索目录查询完毕后,从检索队列中获取下一级检索文件的路径进行检索直到检索到对应的描述文件位置为止;检索到描述文件后,得到其对应的视频文件。Search from the storage directory, enter the corresponding first-level search directory, view the index files of all levels to select the qualified directory, put it into the search queue, and obtain the next-level search file from the search queue after the search directory of the level is completed. The path is searched until the corresponding description file location is retrieved; after the description file is retrieved, the corresponding video file is obtained.
本发明的有益效果是:The beneficial effects of the invention are:
本发明中构建结构化视频图像信息的设备首先能够实现传统摄像机的视频、图像的采集、编码、传输功能,与现有的符合国际标准的摄像机完全兼容。在此基础上,通过增加新的设备模块和结构化信息的处理算法,能够在视频、图像信息采集端就利用其他信息(文本描述、标准传感器数据)为视频、图像建立起属性明确的描述信息。The device for constructing structured video image information in the invention can first realize the functions of capturing, encoding and transmitting video and images of a conventional camera, and is fully compatible with existing cameras conforming to international standards. On this basis, by adding new device modules and structured information processing algorithms, it is possible to use the other information (text description, standard sensor data) to create explicit description information for videos and images on the video and image information collection end. .
本发明通过将文本信息(一般为传感器信息)附着在JPEG文件后面使其成为一个带有传感器信息标签的新文件,传感器信息可以描述JPEG图像文件拍摄时的具体环境信息,能够使JPEG文件更好的再现拍摄时的场景。由于本发明没有破坏原有的JPEG文件的结构和内容,因此不妨碍现有的软件对JPEG文件进行读取显示,同时也保护了传感器信息的安全性,防止被现有软件篡改。The invention attaches text information (generally sensor information) to a JPEG file to make it a new file with a sensor information label. The sensor information can describe the specific environment information when the JPEG image file is taken, and can make the JPEG file better. Reproduce the scene when shooting. Since the invention does not destroy the structure and content of the original JPEG file, it does not prevent the existing software from reading and displaying the JPEG file, and also protects the security of the sensor information and prevents tampering by the existing software.
附图说明DRAWINGS
图1为本发明构建结构化视频图像信息的设备结构图;1 is a structural diagram of an apparatus for constructing structured video image information according to the present invention;
图2为本发明结构化JPEG图像信息示意图;2 is a schematic diagram of structured JPEG image information according to the present invention;
图3为本发明结构化视频信息宏观角度示意图; 3 is a schematic diagram showing the macroscopic angle of structured video information according to the present invention;
图4为本发明结构化视频信息微观角度示意图;4 is a schematic diagram of a microscopic angle of structured video information according to the present invention;
图5为本发明图片文件存储示意图;5 is a schematic diagram of storing a picture file according to the present invention;
图6为本发明多索引视频文件存储示意图;6 is a schematic diagram of storing a multi-index video file according to the present invention;
图7为本发明视频图像文件检索流程。FIG. 7 is a video image file retrieval process of the present invention.
具体实施方式detailed description
下面结合附图与实施例对本发明做进一步说明:The present invention will be further described below in conjunction with the accompanying drawings and embodiments:
本发明中构建结构化视频图像信息的设备结构如图1所示,包括:CCD/CMOS图像传感器模块、无线传感器网络接收模块、视频处理器、CPU处理器、信息融合模块和以太网/WiFi接口;The device structure for constructing structured video image information in the present invention is as shown in FIG. 1, and includes: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface. ;
CCD/CMOS图像传感器模块与视频处理器连接,无线传感器网络接收模块与CPU处理器连接,视频处理器与CPU处理器的输出分别连接至信息融合模块,信息融合模块与以太网/WiFi接口连接。The CCD/CMOS image sensor module is connected to the video processor, the wireless sensor network receiving module is connected to the CPU processor, the output of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module is connected with the Ethernet/WiFi interface.
本发明设备除了利用通用CCD/CMOS传感器完成图像信息采集外,还并行设计了能够接受文本描述、温度、湿度、照度、压力等标准传感器的无线传感器网络接收模块。利用通用的视频处理器(芯片)完成视频图像的编码功能,利用无线传感器网络接收模块完成文本描述信息或2进制数据的接收或直接连接温度、湿度、照度、压力等标准传感器,将模拟信息数字化。在通用CPU上实施本发明的信息结构算法,视频图像信息与其他数据融合在一起,形成结构化视频图像信息。In addition to the general CCD/CMOS sensor for image information acquisition, the device of the invention also designs a wireless sensor network receiving module capable of accepting standard sensors such as text description, temperature, humidity, illumination, pressure and the like. The general video processor (chip) is used to complete the encoding function of the video image, and the wireless sensor network receiving module is used to complete the text description information or the binary data reception or directly connect the standard sensors such as temperature, humidity, illumination, pressure, etc., and simulate the information. Digitizing. The information structure algorithm of the present invention is implemented on a general purpose CPU, and the video image information is fused with other data to form structured video image information.
本发明中视频处理器支持全高清编码功能,采用130w以上的像素高清视频、图像传感器用于视频采集,无线传感器网络模块采用ISM频段进行数据传输,同时具有标准传感器输入接口,支持0-5v,0-10v标准传感器信号的接入。In the invention, the video processor supports full HD encoding function, adopts pixel high-definition video and image sensor of 130w or more for video acquisition, and the wireless sensor network module uses ISM frequency band for data transmission, and has a standard sensor input interface and supports 0-5v, 0-10v standard sensor signal access.
利用视频处理器完成视频图像的编码功能,利用无线传感器网络接收模块完成文本描述信息或2进制数据的接收或者直接连接温度、湿度、照度、压力标准传感器,将模拟信息数字化;将视频图像信息与其他数据融合在一起,形成结构化视频图像信息,并将两类信息融合打包后上传至服务器端进行处理。The video processor is used to complete the encoding function of the video image, and the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information It is combined with other data to form structured video image information, and the two types of information are combined and uploaded to the server for processing.
由于视频流的速度是每秒种25帧,而环境参数改变并没有这么快,因此在融合时设置传感器数量来标定此时有多少个传感器采集到新的数据。Since the speed of the video stream is 25 frames per second, and the environmental parameters are not changed so fast, the number of sensors is set at the time of fusion to calibrate how many sensors collect new data at this time.
结构化信息数据包格式设置如下:The structured information packet format is set as follows:
表1结构化信息数据包格式Table 1 Structured Information Packet Format
内容content 长度length
数据包长度Packet length 6字节6 bytes
传感器数量Number of sensors 1字节1 byte
视频信息长度Video message length 6字节6 bytes
视频信息Video information 由“视频信息长度”确定Determined by "video information length"
传感器信息长度Sensor information length 4字节4 bytes
传感器信息Sensor information 由“传感器信息长度”确定Determined by "sensor information length"
本发明设备为完成视频图像信息结构化提供支撑,其中构建结构化视频图像信息的方法包括构建结构化的JPEG图像文件的方法和通过描述文件构建结构化视频信息的方法。并且基于结构化后的视频图像信息,优化了其存储及检索方法。The apparatus of the present invention provides support for accomplishing the structuring of video image information, wherein the method of constructing structured video image information includes a method of constructing a structured JPEG image file and a method of constructing structured video information by describing a file. And based on the structured video image information, the storage and retrieval methods are optimized.
一种构建结构化视频图像信息设备的方法,包括:A method of constructing a structured video image information device, comprising:
(1)构建结构化的JPEG图像文件;(1) constructing a structured JPEG image file;
(2)构建结构化的视频图像信息;(2) constructing structured video image information;
(3)分别对结构化的JPEG图像文件和视频图像信息进行存储;(3) storing structured JPEG image files and video image information respectively;
(4)分别对结构化的JPEG图像文件和视频图像信息进行检索。(4) Retrieving structured JPEG image files and video image information, respectively.
如图2所示是JPEG图像文件结构化方法示意图。将文本信息、标准传感器数据、语音识别后的数据等信息加密后附着在原有的JPEG文件上,通过在JPEG文件标记码EOI后面添加信息,将原有的非结构化JPEG图像文件构建成结构化JPEG文件(图像信息+文本信息)。在解析时需要将文本信息部分进行解密显示,图像则不受任何影响。Figure 2 is a schematic diagram of a JPEG image file structuring method. The information such as text information, standard sensor data, and voice recognition data is encrypted and attached to the original JPEG file, and the original unstructured JPEG image file is structured into a structure by adding information after the JPEG file mark code EOI. JPEG file (image information + text information). The text information part needs to be decrypted and displayed during parsing, and the image is not affected at all.
通过将文本信息(一般为传感器信息)附着在JPEG文件后面使其成为一个带有传感器信息标签的新文件,传感器信息可以描述JPEG图像文件拍摄时的具体环境信息,能够使JPEG文件更好的再现拍摄时的场景。由于本发明没有破坏原有的JPEG文件的结构和内容,因此不妨碍现有的软件对JPEG文件进行读取显示,同时也保护了传感器信息的安全性,防止被现有软件篡改。By attaching text information (generally sensor information) to a JPEG file to make it a new file with a sensor information tag, the sensor information can describe the specific environmental information when the JPEG image file is taken, enabling better reproduction of JPEG files. The scene when shooting. Since the invention does not destroy the structure and content of the original JPEG file, it does not prevent the existing software from reading and displaying the JPEG file, and also protects the security of the sensor information and prevents tampering by the existing software.
JPEG文件格式存储格式有很多种,目前最常用的是JFIF(JPEG File Interchange Format)和EXIF(Exchange Image File Format)两种格式,它们遵守JIF(JPEG Interchange Format)。它大体分为两部分:There are many storage formats for JPEG file formats. Currently, the most commonly used formats are JFIF (JPEG File Interchange Format) and EXIF (Exchange Image File Format), which comply with JIF (JPEG Interchange Format). It is roughly divided into two parts:
标记码:两字节构成,前一个字节固定为0xFF代表标记码开始,后一个字节不同值代表不同含义。当出现连续的多个0xFF时,理解为一个0xFF也表示标记码开始。下面介绍几种主要的标记代码:Tag code: Two bytes. The previous byte is fixed at 0xFF to indicate the start of the tag code. The different values of the latter byte represent different meanings. When a plurality of consecutive 0xFFs appear, it is understood that a 0xFF also indicates the start of the tag code. Here are a few of the main tag codes:
表2 JPEG标记码 Table 2 JPEG markup code
标记代码Tag code 格式format 意义significance
SOI(Start Of Image)SOI (Start Of Image) 0xFFD80xFFD8 图像开始Image start
APP0(Application0)APP0 (Application0) 0xFFE00xFFE0 应用程序保留标记0Application retention tag 0
SOFO(Start Of Frame)SOFO (Start Of Frame) 0XFFC00XFFC0 帧图像开始Frame image begins
EOI(End Of Image)EOI(End Of Image) 0xFFD90xFFD9 图像结束,图像文件结束End of image, end of image file
压缩数据:标记码后面是压缩数据,记录了图像文件的详细信息。Compressed data: The tag code is followed by compressed data, which records the details of the image file.
在图像解析过程中,软件从0xFFD8开始对文件进行解析,到0xFFD9解析结束。如果我们把相关的文本信息插入到文件中0xFFD9的位置后面,软件不会对该段信息进行解析。这样就避免了文本信息对图像内容和图像质量造成影响。然而文件却与文本信息形成了一个整体,从信息角度看,这个整体便是非结构化的图像数据和结构化的文本信息构成了一段结构化信息,也即我们将JPEG图片文件和各种来源的文本信息进行了关联,并将文本信息插入进了JPEG文件。同时由于目前的看图软件不会查看0xFFD9标记码后面的内容,我们的信息的安全性和隐蔽性得到保证,只有在我们需要时通过对0xFFD9后面的信息进行查询和解析才能得到正确的图像对应的文本信息。这种信息不仅有助于我们准确的描述图像的具体细节,也可以作为一种检索条件,通过检索结构化的文本信息检索到对应的图像文件。In the image parsing process, the software parses the file from 0xFFD8 and ends the parsing at 0xFFD9. If we insert the relevant text information into the position of 0xFFD9 in the file, the software will not parse the segment information. This avoids the impact of textual information on image content and image quality. However, the file is integrated with the text information. From the information point of view, the whole unstructured image data and structured text information constitute a piece of structured information, that is, we will use JPEG image files and various sources. The text information is associated and the text information is inserted into the JPEG file. At the same time, because the current viewing software does not view the content behind the 0xFFD9 markup code, the security and concealment of our information is guaranteed. Only when we need to query and parse the information behind 0xFFD9 can we get the correct image correspondence. Text information. This kind of information not only helps us to accurately describe the specific details of the image, but also can be used as a retrieval condition to retrieve the corresponding image file by retrieving the structured text information.
如图3和图4所示,本发明中视频流结构化方法则是对原有的视频流数据包进行封装,新的数据包是视频流数据包和物联网传感器接收到的标准传感器数据或手动输入的文本信息等信息的融合,将数据包长度、视频流数据包、其他信息长度、其他信息合并成新的数据包进行传输,新数据包的格式定义见表2。服务端接收到数据包后将其解析生成对应的视频文件及其描述文件。通过这种方式对视频文件进行结构化封装。As shown in FIG. 3 and FIG. 4, the video stream structuring method in the present invention encapsulates the original video stream data packet, and the new data packet is a video stream data packet and standard sensor data received by the Internet of Things sensor or The information such as manually input text information is merged, and the data packet length, video stream data packet, other information length, and other information are combined into a new data packet for transmission. The format definition of the new data packet is shown in Table 2. After receiving the data packet, the server parses it to generate the corresponding video file and its description file. The video file is structured and encapsulated in this way.
构建结构化视频图像信息设备负责采集传感器信息和视频信息,并将编码后的视频信息和加密后的传感器信息后缀到JPEG图像文件或对视频文件形成对应的描述文件并形成标准的数据码流,按照标准的通信协议(通常为TCP/IP)输出。The structured video image information device is configured to collect sensor information and video information, and suffix the encoded video information and the encrypted sensor information into a JPEG image file or form a corresponding description file for the video file to form a standard data stream. Output according to standard communication protocols (usually TCP/IP).
如图3所示是结构化视频信息的宏观角度示意图。从宏观上,增加了对于视频文件的描述文件,将描述文件和视频文件作为结构化信息的两部分进行关联。通过为每个时间段的视频文件创建其对应的描述文件来对单一的视频图像进行一个文本化的补充。使得抽象的视频画面变的更为具体,内容更加充实。Figure 3 is a macro perspective view of structured video information. Macroscopically, a description file for video files is added, and description files and video files are associated as two parts of structured information. A textual supplement to a single video image is created by creating its corresponding profile for the video file for each time period. Make the abstract video picture more specific and more substantial.
如图4所示是结构化视频信息的微观角度示意图。从微观上,描述文件中记录了视频拍摄的地点、拍摄的绝对时间和相对视频文件开始的相对时间,以及拍摄过程中产生的文本信息。为了保证文本化信息中的内容的安全性,可以将信息进行加密编码后存储,在需要查看 的时候通过私有的密钥进行解密方能进行查看。Figure 4 is a schematic diagram of the microscopic angle of structured video information. Microscopically, the description file records the location of the video capture, the absolute time of the shot, and the relative time at which the video file began, as well as the textual information generated during the capture. In order to ensure the security of the content in the textual information, the information can be encrypted and stored, and needs to be viewed. It can be viewed by decrypting it with a private key.
本发明图片文件存储及检索方法,通过解析JPEG文件尾部0xFFD9后面的4字节数据获得文本信息长度,然后根据该长度将全部的文本信息提取出来,通过与检索条件进行匹配对比,如果条件符合则说明这个JPEG文件就是我们所要找的图片文件,否则就进行下一文件的对比。在对比过程中为了提高检索的效率和减少对比次数,借鉴“B+树”原理可以在存储的过程中按照文本信息内容进行分段(分文件夹)存储,在检索时首先定位条件所在的文件夹,然后在子文件夹中进行检索,可减少查看的文件数量。The image file storage and retrieval method of the present invention obtains the length of the text information by parsing the 4-byte data behind the JPEG file tail 0xFFD9, and then extracts all the text information according to the length, and compares with the search condition, if the conditions are met Explain that this JPEG file is the image file we are looking for, otherwise we will compare the next file. In the process of comparison, in order to improve the efficiency of retrieval and reduce the number of comparisons, the principle of “B+tree” can be used to store segments according to the content of text information in the process of storage, and the folder where the conditions are located is first searched during retrieval. And then search in a subfolder to reduce the number of files viewed.
如图5是图片文件存储示意图(以温度为例),对图片文件采取“B+树”的方式进行智能化分类存储,在检索时,首先判断检索条件处于哪个温度段,进入对应的目录进行检索。当子目录检索完毕后输出符合条件的图片。Figure 5 is a schematic diagram of image file storage (taking temperature as an example). The image file is "B+tree" for intelligent classification and storage. When searching, first determine which temperature segment the retrieval condition is in, and enter the corresponding directory for retrieval. . When the subdirectory is retrieved, the image that meets the criteria is output.
本发明视频存储及检索方法是采用数据库“B+树”思想为描述文件中各字段构建索引文件,将索引文件分为“工程”、“安装地点”、“年”、“月”、“日”几级,将接收到的文本信息按照几级进行存储。在检索时从存储目录开始查找,进入对应工程目录,查看各级索引文件选择符合条件的目录,放入检索队列,在本级检索目录查询完毕后,从检索队列中获取下一级检索文件的路径进行检索直到检索到对应的描述文件位置为止。在检索到描述文件后,可以得到其对应的视频文件。The video storage and retrieval method of the present invention uses the database "B + tree" idea to construct an index file for each field in the description file, and divides the index file into "engineering", "installation location", "year", "month", "day" At several levels, the received text information is stored in several levels. Search from the storage directory at the time of retrieval, enter the corresponding project directory, view the index files of all levels to select the qualified directory, put it into the retrieval queue, and obtain the next-level search file from the retrieval queue after the retrieval of the retrieval directory at this level. The path is retrieved until the corresponding profile location is retrieved. After the description file is retrieved, the corresponding video file can be obtained.
如图6是多索引视频文件存储示意图。各级索引文件记录了设备数量,文件数量,条件标示等。索引文件的生成方式是“自下而上”生成的,在接收到文本信息后,依次向上更新索引文件。Figure 6 is a schematic diagram of multi-index video file storage. Index files at various levels record the number of devices, the number of files, conditional flags, and so on. The index file is generated by "bottom-up". After receiving the text information, the index file is updated upwards in turn.
1.“工程索引文件”记录了当前工程包含的设备数量及各设备的安装地点,工程创建时间和截止时间,工程文件存储目录和存储的文件数量等。1. "Engineering Index File" records the number of devices included in the current project and the installation location of each device, project creation time and deadline, project file storage directory and number of files stored.
2.“地点索引文件”记录了当前设备拍摄的起始时间和停止使用时间,拍摄的视频文件数量,“年值”目录的存储地址,以及设备拍摄过程中产生的文本信息标识及其数据内容等。2. "Location index file" records the start time and stop time of the current device shooting, the number of video files captured, the storage address of the "year value" directory, and the text information identification and data content generated during the shooting of the device. Wait.
3.“年值索引文件”记录了设备当年正常使用的月份,拍摄的视频文件数量,“月值”目录的存储地址,以及设备拍摄过程中产生的文本信息标识及其数据内容等。3. The “Annual Index File” records the month in which the device was used normally, the number of video files captured, the storage address of the “monthly value” directory, and the textual information identifier and its data content generated during the shooting of the device.
4.“月值索引文件”记录了设备当月正常使用的天数,拍摄的视频文件数量,“日期”目录的存储地址,以及设备拍摄过程中产生的文本信息标识及其数据内容等。4. The “monthly index file” records the number of days the device is used normally in the month, the number of video files captured, the storage address of the “date” directory, and the text information identifier and its data content generated during the shooting of the device.
5.“日期索引文件”记录了设备当天拍摄的视频文件数量,各视频文件的文件名及其描述文件的文件名以及各视频文件中包含的文本信息标识及其数据内容等。5. The "date index file" records the number of video files taken on the day of the device, the file name of each video file and the file name of the description file, and the text information identifier and its data content contained in each video file.
如图7是视频图像文件检索流程。详细的检索流程如下: Figure 7 is a video image file retrieval process. The detailed search process is as follows:
(1)检索工程目录下的索引文件,查看检索条件存在的设备IP,将符合条件的IP进入设备队列,若没有符合条件的设备,则直接退出检索。(1) Retrieve the index file under the project directory, check the device IP of the search condition, and enter the qualified IP into the device queue. If there is no device that meets the conditions, the search will be directly exited.
(2)将设备队列中的IP依次出队,进入设备目录查找,查找年值索引文件,将符合的月份入的月值队列。(2) Dequeue the IPs in the device queue, enter the device directory search, find the annual value index file, and enter the monthly value queue for the matching month.
(3)将月值队列中的月份出列,进入对应月份目录检索,查找月值索引文件,将符合的日期入日期队列。(3) Dequeue the month in the monthly value queue, enter the corresponding month directory search, find the monthly value index file, and enter the matching date into the date queue.
(4)将日期队列中的日期出队,进入对应的日期目录,查找符合的文件,将文件名输出。重复第4步,直到日期队列为空。(4) Dequeue the date in the date queue, enter the corresponding date directory, find the matching file, and output the file name. Repeat step 4 until the date queue is empty.
(5)重复第3步,直到月值队列为空。(5) Repeat step 3 until the month value queue is empty.
(6)重复2步,直到设备队列为空,检索完成。(6) Repeat 2 steps until the device queue is empty and the retrieval is completed.
上述虽然结合附图对本发明的具体实施方式进行了描述,但并非对本发明保护范围的限制,所属领域技术人员应该明白,在本发明的技术方案的基础上,本领域技术人员不需要付出创造性劳动即可做出的各种修改或变形仍在本发明的保护范围以内。 The above description of the specific embodiments of the present invention has been described with reference to the accompanying drawings, but it is not intended to limit the scope of the present invention. Those skilled in the art should understand that the skilled in the art does not require the creative work on the basis of the technical solutions of the present invention. Various modifications or variations that can be made are still within the scope of the invention.

Claims (10)

  1. 一种构建结构化视频图像信息的设备,其特征是,包括:CCD/CMOS图像传感器模块、无线传感器网络接收模块、视频处理器、CPU处理器、信息融合模块和以太网/WiFi接口;An apparatus for constructing structured video image information, comprising: a CCD/CMOS image sensor module, a wireless sensor network receiving module, a video processor, a CPU processor, an information fusion module, and an Ethernet/WiFi interface;
    所述CCD/CMOS图像传感器模块与视频处理器连接,无线传感器网络接收模块与CPU处理器连接,所述视频处理器与CPU处理器的输出分别连接至信息融合模块,所述信息融合模块与以太网/WiFi接口连接;The CCD/CMOS image sensor module is connected to a video processor, and the wireless sensor network receiving module is connected to the CPU processor, and the outputs of the video processor and the CPU processor are respectively connected to the information fusion module, and the information fusion module and the Ethernet Network/WiFi interface connection;
    利用视频处理器完成视频图像的编码功能,利用无线传感器网络接收模块完成文本描述信息或2进制数据的接收或者直接连接温度、湿度、照度、压力标准传感器,将模拟信息数字化;将视频图像信息与其他数据融合在一起,形成结构化视频图像信息。The video processor is used to complete the encoding function of the video image, and the wireless sensor network receiving module is used to complete the text description information or the binary data receiving or directly connect the temperature, humidity, illuminance, pressure standard sensor, digitize the analog information; and the video image information Conformed with other data to form structured video image information.
  2. 如权利要求1所述的一种构建结构化视频图像信息的设备,其特征是,所述无线传感器网络模块采用ISM频段进行数据传输,同时具有标准传感器输入接口,支持0-5v,0-10v标准传感器信号的接入。The device for constructing structured video image information according to claim 1, wherein the wireless sensor network module uses the ISM frequency band for data transmission, and has a standard sensor input interface, and supports 0-5v, 0-10v. Access to standard sensor signals.
  3. 一种如权利要求1所述的构建结构化视频图像信息设备的方法,其特征是,包括:A method of constructing a structured video image information device according to claim 1, comprising:
    (1)构建结构化的JPEG图像文件;(1) constructing a structured JPEG image file;
    (2)构建结构化的视频图像信息;(2) constructing structured video image information;
    (3)分别对结构化的JPEG图像文件和视频图像信息进行存储;(3) storing structured JPEG image files and video image information respectively;
    (4)分别对结构化的JPEG图像文件和视频图像信息进行检索。(4) Retrieving structured JPEG image files and video image information, respectively.
  4. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(1)的具体方法为:The method for constructing a structured video image information device according to claim 3, wherein the specific method of the step (1) is:
    将文本信息、标准传感器数据、语音识别后的数据信息加密后附着在原有的JPEG文件上,在JPEG文件标记码后面添加信息,将原有的非结构化JPEG图像文件构建成结构化JPEG文件;在解析时将文本信息部分进行解密显示,图像信息不受任何影响。The text information, the standard sensor data, and the voice-recognition data information are encrypted and attached to the original JPEG file, and the information is added after the JPEG file mark code, and the original unstructured JPEG image file is constructed into a structured JPEG file; The text information portion is decrypted and displayed during parsing, and the image information is not affected at all.
  5. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述标记码两字节构成,前一个字节固定为0xFF代表标记码开始,后一个字节不同值代表不同含义;A method for constructing a structured video image information device according to claim 3, wherein said mark code is composed of two bytes, the previous byte is fixed to 0xFF for the start of the mark code, and the latter byte has a different value. Representing different meanings;
    在图像解析过程中,从0xFFD8开始对文件进行解析,到0xFFD9解析结束。In the image parsing process, the file is parsed from 0xFFD8, and the parsing ends at 0xFFD9.
  6. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(2)的具体方法为:The method for constructing a structured video image information device according to claim 3, wherein the specific method of the step (2) is:
    将描述文件和视频文件作为结构化信息的两部分进行关联,为每个时间段的视频文件创建其对应的描述文件来对单一的视频图像进行文本化的补充;The description file and the video file are associated as two parts of the structured information, and a corresponding description file is created for the video file of each time period to supplement the text of the single video image;
    对原有的视频流数据包进行封装,将原有视频流数据包和物联网传感器接收到的标准传感器数据或手动输入的文本信息进行融合,将数据包长度、视频流数据包、其他信息长度及 其他信息合并成新的数据包进行传输。Encapsulate the original video stream data packet, and fuse the original video stream data packet with the standard sensor data received by the Internet of Things sensor or the manually input text information, and the data packet length, video stream data packet, and other information length. And Other information is merged into new packets for transmission.
  7. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(3)中,对结构化的JPEG图像文件进行存储的方法为:The method for constructing a structured video image information device according to claim 3, wherein in the step (3), the method for storing the structured JPEG image file is:
    在存储的过程中按照文本信息内容进行分段或者分文件夹存储。In the process of storage, segmentation or sub-folder storage according to the content of the text information.
  8. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(3)中,对结构化的视频图像信息进行存储的方法为:The method for constructing a structured video image information device according to claim 3, wherein in the step (3), the method for storing the structured video image information is:
    为结构化的视频图像信息中描述文件的各字段构建索引文件,将索引文件分为若干级,将接收到的文本信息按照相应级的内容进行存储。An index file is constructed for each field of the description file in the structured video image information, and the index file is divided into several levels, and the received text information is stored according to the content of the corresponding level.
  9. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(4)中,对结构化的JPEG图像文件进行检索的方法为:The method for constructing a structured video image information device according to claim 3, wherein in the step (4), the method for searching the structured JPEG image file is:
    在检索时首先定位条件所在的文件夹,然后在子文件夹中进行检索;When searching, first locate the folder where the condition is located, and then search in the subfolder;
    通过解析JPEG文件数据获得文本信息长度,然后根据该长度将全部的文本信息提取出来,通过与检索条件进行匹配对比,如果检索条件符合则说明这个JPEG文件就是我们所要找的图片文件,否则就进行下一文件的对比。The text information length is obtained by parsing the JPEG file data, and then all the text information is extracted according to the length, and compared with the search condition, if the search condition is met, the JPEG file is the image file we are looking for, otherwise it is performed. The comparison of the next file.
  10. 如权利要求3所述的一种构建结构化视频图像信息设备的方法,其特征是,所述步骤(4)中,对结构化的视频图像信息进行检索的方法为:The method for constructing a structured video image information device according to claim 3, wherein in the step (4), the method for searching the structured video image information is:
    从存储目录开始查找,进入对应第一级检索目录,查看各级索引文件选择符合条件的目录,放入检索队列,在本级检索目录查询完毕后,从检索队列中获取下一级检索文件的路径进行检索直到检索到对应的描述文件位置为止;检索到描述文件后,得到其对应的视频文件。 Search from the storage directory, enter the corresponding first-level search directory, view the index files of all levels to select the qualified directory, put it into the search queue, and obtain the next-level search file from the search queue after the search directory of the level is completed. The path is searched until the corresponding description file location is retrieved; after the description file is retrieved, the corresponding video file is obtained.
PCT/CN2016/081149 2015-05-20 2016-05-05 Device and method for establishing structured video image information WO2016184314A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510260225.5A CN104899261B (en) 2015-05-20 2015-05-20 A kind of apparatus and method for building structuring video image information
CN201510260225.5 2015-05-20

Publications (1)

Publication Number Publication Date
WO2016184314A1 true WO2016184314A1 (en) 2016-11-24

Family

ID=54031924

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/081149 WO2016184314A1 (en) 2015-05-20 2016-05-05 Device and method for establishing structured video image information

Country Status (2)

Country Link
CN (1) CN104899261B (en)
WO (1) WO2016184314A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110751065A (en) * 2019-09-30 2020-02-04 北京旷视科技有限公司 Training data acquisition method and device
CN111783404A (en) * 2020-06-18 2020-10-16 上海华力集成电路制造有限公司 Data processing method and system
CN113515649A (en) * 2020-11-19 2021-10-19 阿里巴巴集团控股有限公司 Data structuring method, system, device, equipment and storage medium
CN117633297A (en) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899261B (en) * 2015-05-20 2018-04-03 杜晓通 A kind of apparatus and method for building structuring video image information
CN105847254B (en) * 2016-03-23 2018-10-16 司南 Data sharing method and device
CN106294600A (en) * 2016-07-29 2017-01-04 易心悦 The multimedia editing method of a kind of digital photograph, methods of exhibiting and system
CN106803937B (en) * 2017-02-28 2020-03-17 兰州理工大学 Double-camera video monitoring method, system and monitoring device with text log
CN107025292A (en) * 2017-04-14 2017-08-08 国网江苏省电力公司无锡供电公司 The description method of video and heterogeneous sensor in towards transformer station
CN107222583A (en) * 2017-08-08 2017-09-29 江苏优闼数据科技有限公司 A kind of data transmission method of fusion structure data and unstructured data
CN107749963A (en) * 2017-10-17 2018-03-02 北京工商大学 A kind of source information that perceives merges video method more
CN113115069A (en) * 2021-02-19 2021-07-13 深圳市麦谷科技有限公司 Video storage method and system of automobile data recorder
CN113114968A (en) * 2021-04-13 2021-07-13 中国建设银行股份有限公司 Video processing method, device, equipment and storage medium
CN113656364B (en) * 2021-08-05 2024-02-20 福瑞泰克智能系统有限公司 Sensor data processing method, device and computer readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030031260A1 (en) * 2001-07-16 2003-02-13 Ali Tabatabai Transcoding between content data and description data
CN101630324A (en) * 2009-08-18 2010-01-20 北京航空航天大学 Method for accessing geographic position information in multimedia resource
CN101783881A (en) * 2010-03-05 2010-07-21 公安部第三研究所 Intelligent web camera with video structural description function
CN102387346A (en) * 2011-10-17 2012-03-21 上海交通大学 Intelligent front end of manageable, findable and inspectable monitoring system
CN103635954A (en) * 2011-02-08 2014-03-12 隆沙有限公司 A system to augment a visual data stream based on geographical and visual information
CN104899261A (en) * 2015-05-20 2015-09-09 杜晓通 Device and method for constructing structured video image information
CN204795392U (en) * 2015-06-26 2015-11-18 山东大学 Found equipment of different structure information structure ization of special equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4685465B2 (en) * 2005-02-01 2011-05-18 パナソニック株式会社 Monitoring and recording device
CN101420595B (en) * 2007-10-23 2012-11-21 华为技术有限公司 Method and equipment for describing and capturing video object
CN101293349A (en) * 2008-06-05 2008-10-29 广州大学 Robot based on Wi-Fi
CN201830388U (en) * 2010-10-13 2011-05-11 成都创烨科技有限责任公司 Video content collecting and processing device
CN103186634A (en) * 2011-12-31 2013-07-03 无锡物联网产业研究院 Method and device for retrieving intelligent traffic monitoring video
CN103595968B (en) * 2013-11-22 2017-02-22 武汉大学 Video sensor access method based on geographical position

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030031260A1 (en) * 2001-07-16 2003-02-13 Ali Tabatabai Transcoding between content data and description data
CN101630324A (en) * 2009-08-18 2010-01-20 北京航空航天大学 Method for accessing geographic position information in multimedia resource
CN101783881A (en) * 2010-03-05 2010-07-21 公安部第三研究所 Intelligent web camera with video structural description function
CN103635954A (en) * 2011-02-08 2014-03-12 隆沙有限公司 A system to augment a visual data stream based on geographical and visual information
CN102387346A (en) * 2011-10-17 2012-03-21 上海交通大学 Intelligent front end of manageable, findable and inspectable monitoring system
CN104899261A (en) * 2015-05-20 2015-09-09 杜晓通 Device and method for constructing structured video image information
CN204795392U (en) * 2015-06-26 2015-11-18 山东大学 Found equipment of different structure information structure ization of special equipment

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110751065A (en) * 2019-09-30 2020-02-04 北京旷视科技有限公司 Training data acquisition method and device
CN111783404A (en) * 2020-06-18 2020-10-16 上海华力集成电路制造有限公司 Data processing method and system
CN111783404B (en) * 2020-06-18 2024-01-09 上海华力集成电路制造有限公司 Data processing method and system
CN113515649A (en) * 2020-11-19 2021-10-19 阿里巴巴集团控股有限公司 Data structuring method, system, device, equipment and storage medium
CN113515649B (en) * 2020-11-19 2024-03-01 阿里巴巴集团控股有限公司 Data structuring method, system, device, equipment and storage medium
CN117633297A (en) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation
CN117633297B (en) * 2024-01-26 2024-04-30 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation

Also Published As

Publication number Publication date
CN104899261A (en) 2015-09-09
CN104899261B (en) 2018-04-03

Similar Documents

Publication Publication Date Title
WO2016184314A1 (en) Device and method for establishing structured video image information
US20200250218A1 (en) System and method for signature-enhanced multimedia content searching
US8270684B2 (en) Automatic media sharing via shutter click
US11417074B2 (en) Methods and apparatus for identifying objects depicted in a video using extracted video frames in combination with a reverse image search engine
US20170185675A1 (en) Fingerprinting and matching of content of a multi-media file
KR102434374B1 (en) Apparatus and method for artificial intelligence
CN101374234A (en) Method and apparatus for monitoring video copy base on content
US10380267B2 (en) System and method for tagging multimedia content elements
CN103870574A (en) Label manufacturing and indexing method based on H. 264 ciphertext cloud video storage
CN112364201A (en) Video data retrieval method and system
CN113114968A (en) Video processing method, device, equipment and storage medium
US9524754B2 (en) Video playback device and video recording device
US20210014540A1 (en) Method and system for codec of visual feature data
US8896708B2 (en) Systems and methods for determining, storing, and using metadata for video media content
Kim et al. Photo cube: an automatic management and search for photos using mobile smartphones
CN103198162B (en) A kind of picture browsing exchange method
CN102270228B (en) Video search method, front-end equipment and rear-end server
NO20140958A1 (en) Digital content search method and system
Mo Design and Implementation of Video Surveillance Identification System in Smart Campus
CN202652389U (en) Law-enforcing recorder
US20170286434A1 (en) System and method for signature-based clustering of multimedia content elements
CN117290389A (en) Data labeling method, device, electronic equipment and storage medium
Brut et al. Integrating heterogeneous metadata into a distributed multimedia information system
CN114564614A (en) Automatic searching method, system and device for video clip and readable storage medium
CN117216308A (en) Searching method, system, equipment and medium based on large model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16795801

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16795801

Country of ref document: EP

Kind code of ref document: A1