WO2018103359A1 - 一种视频处理方法及系统 - Google Patents

一种视频处理方法及系统 Download PDF

Info

Publication number
WO2018103359A1
WO2018103359A1 PCT/CN2017/096477 CN2017096477W WO2018103359A1 WO 2018103359 A1 WO2018103359 A1 WO 2018103359A1 CN 2017096477 W CN2017096477 W CN 2017096477W WO 2018103359 A1 WO2018103359 A1 WO 2018103359A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
block
video
video processing
block image
Prior art date
Application number
PCT/CN2017/096477
Other languages
English (en)
French (fr)
Inventor
黄治斌
Original Assignee
威创集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 威创集团股份有限公司 filed Critical 威创集团股份有限公司
Publication of WO2018103359A1 publication Critical patent/WO2018103359A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements

Definitions

  • the present invention relates to the field of video processing, and more particularly to a video processing method and system.
  • IP video refers to the transmission of video through the network.
  • IP video has been widely used in video surveillance. Compared with traditional analog video, it has the advantages of convenient deployment and good sharing, but it also has the disadvantage of relatively large delay. In the field of video surveillance, the slight delay has little effect, but the application of IP video technology to the field of computer control has strict delay requirements.
  • network KVM In the field of computer control, there is a technology called network KVM, which uses the network to transmit computer output images and computer control commands such as mouse and keyboard. Since the operation of the mouse and keyboard directly leads to changes in the computer output image, the user needs to rely on The image displays the content to determine the operation, so the delay in controlling the computer image is generally within 40 milliseconds.
  • the traditional video processing process generally includes signal acquisition, video encoding, video transmission, video decoding, and finally to the video display process. As the signal resolution becomes larger and larger, under the same processing method, the processing time of each module will be The corresponding increase, resulting in a corresponding increase in the total delay.
  • a solution in the prior art is that the client obtains a to-be-processed video file, splits the to-be-processed video file into a plurality of fragment files, and compresses the file to be uploaded to the server.
  • the server receives and decompresses the fragment file and deposits it.
  • Distributed file system download each fragment file from the distributed file system for compression and decoding and store it in the distributed file system, and obtain new fragment files from the distributed file system and merge them into one complete video file.
  • the video file is split into large files by large files, and the fragments are uploaded in parallel.
  • the process of video processing includes not only video transmission and video decoding, but also video capture and video display. Each processing process has processing time, and each process will generate a delay. Therefore, this process The technical solution still has room to reduce the delay. Moreover, after the technical solution splits the video file, a merge process is required to be performed. After all the fragment files of the video file to be processed are decoded and compressed, a corresponding merge task is generated and placed in the merge queue. The merge processing server retrieves the tasks from the merge queue, merges all new shard files into one complete video file, and stores them in the distributed file system. This merging process needs to wait for all the fragment file to be decoded and compressed before it is performed, which increases the video processing delay to some extent.
  • the present invention provides a video processing method with low delay in order to overcome at least one of the above-mentioned drawbacks (deficiencies) of the prior art.
  • the present invention also provides a video processing system with low latency.
  • a video processing method comprising:
  • the video data is subjected to block acquisition to obtain a block image, and then the block image is separately encoded, transmitted, and decoded, and the decoded block image is placed in the off-screen buffer according to the position of the block image in the original image.
  • the method of the invention adopts a block processing strategy for the whole process of the acquisition, encoding, transmission and decoding of the video processing, and extends the strategy of the block to the process of collecting the video data, and the decoded image data is according to the block image.
  • the positions in the original image are placed in parallel into the off-screen buffer, so that the splicing operation of the block images is completed at the same time in the decoding process, which directly saves the merging processing steps of the block images, compared to the prior art only for the video.
  • the method of encoding, transmitting, and decoding of data performs block processing and also needs to combine processing steps, and the method of the present invention further reduces the total delay of video processing, thereby achieving lower video capture, encoding, transmission, and decoding processes. Delayed processing improves the real-time video processing and user experience.
  • a video processing system includes an acquisition module, an encoding module, a transmission module, and a decoding module; and an acquisition module, configured to perform block-by-block acquisition of the video data to obtain a block image;
  • the encoding module and the transmission module are respectively used for encoding and transmitting the block image
  • a decoding module configured to decode the block image and place the decoded image data into the off-screen buffer according to the position of the block image in the original image.
  • the block processing strategy is adopted in the acquisition module, the coding module, and the decoding module, and the block strategy is extended to the video data acquisition process, and the decoding module divides the decoded image data according to the block image.
  • the position in the original image is placed in parallel into the off-screen buffer, so that the decoding module has the decoding function and simultaneously completes the splicing function of the block image, which directly saves the merged processing module after the block image decoding, compared with the current
  • only the encoding, transmission, and decoding processes of video data are processed in blocks, and the merging processing module is also needed.
  • the system of the present invention further reduces the total delay of video processing, and realizes video acquisition, encoding, and transmission.
  • the lower latency processing in the decoding process improves the real-time video processing and user experience.
  • FIG. 1 is a flowchart of a specific embodiment of a video processing method according to the present invention.
  • FIG. 3 is a block diagram of a specific embodiment of a video processing system in accordance with the present invention.
  • first and second are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implied number of technical features indicated. .
  • the defined “first”, “second” features may include one or more of the features, either explicitly or implicitly.
  • a plurality means two or more unless otherwise stated.
  • mounting and “connecting” are to be understood broadly, and may be, for example, a fixed connection, a detachable connection, or an integral, unless otherwise explicitly stated and defined.
  • Ground connection it can be a mechanical connection or an electrical connection; it can be directly connected, or it can be indirectly connected through an intermediate medium, and it can be said that the internal connection of the two elements.
  • the specific meaning of the above terms in the present invention can be understood in the specific circumstances by those skilled in the art.
  • FIG. 1 is a flowchart of a specific embodiment of a video processing method according to the present invention.
  • a video processing method in this embodiment specifically includes the following steps:
  • S104 Receive the block image and perform decoding, and place the decoded block image into the off-screen buffer according to the position of the block image in the original image.
  • the blocking strategy of the video processing is extended to the step of collecting the video data.
  • the video data is acquired, the video data is block-acquired to obtain a block image, and then the block images are separately encoded and transmitted.
  • the block image is received and decoded at the receiving end, and the block image obtained after decoding is placed in the off-screen buffer according to the position of the block image in the original image, and the block image is directly spliced by the storage process of the off-screen buffer.
  • the block processing method is adopted in the whole process of video processing, and the decoding operation is completed in the decoding process, and the step of splicing the block images is completed, the delay of the video processing process is greatly reduced, and the real-time performance of the video data is improved.
  • the method of the specific embodiment is more applicable to an application field such as IP video that has a delay requirement.
  • step S101 include:
  • the preset unit is a row pixel or a column pixel, and the image is scanned in units of rows or columns of the image to obtain a block image of 1*N or N*1 size, and N represents the size of the tile image.
  • the video data is collected by using the line of the image as a unit, and the strategy of the block is performed by 1*N, that is, the block is not divided in the horizontal direction, and is vertical. Blocking in the direction, N is the number of blocks.
  • step S101 when the video data is segmented, the frame number and/or the timestamp information and the block image are recorded for each block image obtained. Coordinate area information in the image.
  • step S102 the encoding step directly encodes each of the captured block images, and each block image is independently performed during encoding, and the encoding of the plurality of tile images is performed in parallel, and the encoding process uses the acquisition time. Recorded frame number and/or timestamp information.
  • step S103 the encoded block image data is transmitted through the network, such as to the receiving end.
  • Each tile image is also transmitted independently during transmission and can be performed in parallel.
  • step S104 after receiving the respective block images in parallel, the decoding is performed in parallel and stored in the off-screen buffer in parallel.
  • the frame number and/or timestamp of each block image the frame number and/or timestamp information of each block image and the coordinate area information of the block image in the original image are stored in the off-screen buffer. Placed in the off-screen buffer, the position of the tiled image in the original image is obtained from the coordinate region information of each tiled image in the original image.
  • the splicing and storage of the block image is completed at the same time, which saves the merge process of the block image.
  • the method of this embodiment further includes the displaying step:
  • the video data is not subjected to block processing, and all the video data are uniformly collected, encoded, transmitted, and decoded, and Tc, Te, Tt, and Td respectively represent the time of acquiring, encoding, transmitting, and decoding an image. Then the delay of the whole process of processing an image by a conventional video processing scheme is at least Tc+Te+Tt+Td.
  • the delay is reduced to about (Tc+Te+Tt+Td)/N+(N-1)*max(Tc/N, Te/N, Tt/N, Td/N), where max(Tc/N, Te/N, Tt/N, Td/N) represents the largest value among Tc/N, Te/N, Tt/N, Td/N.
  • the steps are simplified to perform block processing for the encoding, transmission, and decoding processes (from the background art, the video processing process also includes other steps of data storage in the distributed file system twice), and the delay includes at least Tc, Te. /N, Tt/N, Td/N, should also include the time Th of the image merging process, so the total delay is at least Tc + Te / N + Tt / N + Td / N + Th, see the method of the present invention There is still a significant reduction in latency over this solution.
  • FIG. 3 is a structural diagram of a video processing system according to the present invention.
  • a video processing system specifically includes an acquisition module 201, an encoding module 202, a transmission module 203, and a decoding module 204.
  • the collecting module 201 is configured to perform block processing on the video data to obtain a block image.
  • the encoding module 202 is configured to separately encode the block images
  • a transmission module 203 configured to transmit the encoded block image
  • the decoding module 204 is configured to receive the block image, decode the block image, and place the decoded image data in the off-screen buffer according to the position of the block image in the original image.
  • the block processing strategy of the video processing is extended to the video data acquisition module 201.
  • the acquisition module 201 performs block-by-block acquisition of the video data to obtain a block image, and then the encoding module 202 and the transmission.
  • the module 203 separately encodes and blocks the block images
  • the decoding module 204 receives the block images at the receiving end and performs decoding, and the block images obtained after the decoding are placed in the off-screen buffer according to the position of the block images in the original image.
  • the off-screen buffer storage process directly completes the splicing of the block image.
  • the block processing mode is adopted in the whole process of the video processing, and the decoding module 204 completes the decoding operation in the decoding process and completes the splicing of the block image.
  • the delay of the video processing process is greatly reduced, and the real-time performance of the video data is improved.
  • the method of the specific embodiment is more applicable to the application field of the IP video and the like.
  • the collection module 201 is specifically configured to:
  • Obtain image data from the video data source and scan the image data in units of image line pixels or column pixels to obtain a block image of 1*N or N*1 size, where N represents the largest column pixel or row pixel in the image.
  • the acquisition of the video data by the acquisition module 201 is performed by using the row of the image as a unit, and the strategy of the segmentation is performed by using 1*N, that is, in the horizontal direction.
  • Block block in the vertical direction.
  • the acquisition module 201 records the frame number and/or time stamp information and the tile image in the original for each block image obtained when the video data is segmented. Coordinate area information in the image.
  • each of the collected block images is directly encoded, and each block image is independent when encoding.
  • the frame number and/or time stamp information recorded by the acquisition module 201 is used in the encoding process, and the encoding of the plurality of tile images is performed in parallel.
  • the encoded block image data is transmitted through the network, such as to the receiving end.
  • Each tile image is also transmitted independently during transmission and can be performed in parallel.
  • the decoding module 204 after receiving the respective block images in parallel, the decoding is performed in parallel and stored in the off-screen buffer in parallel. According to the frame number and/or timestamp of each block image, the frame number and/or timestamp information of each block image and the coordinate area information of the block image in the original image are stored in the off-screen buffer. Placed in the off-screen buffer, the position of the tiled image in the original image is obtained from the coordinate region information of each tiled image in the original image. In this module, when the block image is stored in the off-screen buffer, the splicing and storage of the block image is completed at the same time, and the merged module after decoding is saved.
  • system of this embodiment further includes a display module 205, configured to switch the image data buffered in the off-screen buffer to the current screen for display.
  • the video data is not subjected to block processing, and all the video data are uniformly collected, encoded, transmitted, and decoded, and the acquisition module, the encoding module, the transmission module, and the decoding module are respectively processed by Tc, Te, Tt, and Td.
  • the time of an image, then the delay of the entire process of processing an image by a conventional video processing scheme is at least Tc+Te+Tt+Td.
  • the delay is reduced to about (Tc+Te+Tt+Td)/N+(N-1)*max(Tc/N, Te/N, Tt/N, Td/N), where max(Tc/N, Te/N, Tt/N, Td/N) represents the largest value among Tc/N, Te/N, Tt/N, Td/N.

Abstract

本发明涉及一种视频处理方法及系统。所述方法包括对视频数据进行分块采集获得分块图像,然后将分块图像分别进行编码、传输、解码,将解码获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中。本发明对视频处理的采集、编码、传输、解码全过程均采用分块处理策略,将分块的策略延伸到视频数据的采集过程中,且将解码后的图像数据按照分块图像在原始图像中的位置并行放入到离屏缓存中,使得在解码的过程中同时完成了分块图像的拼接操作,直接节省了分块图像的合并处理步骤,将视频处理的总延时进一步降低了,实现了视频采集、编码、传输、解码过程中更低延时的处理,提高了视频处理实时性和用户体验。

Description

一种视频处理方法及系统 技术领域
本发明涉及视频处理领域,更具体地,涉及一种视频处理方法及系统。
背景技术
IP视频是指通过网络来传输视频,近年来IP视频在视频监控被广泛应用,它相比传统的模拟视频来说具有部署方便、利于共享等优点,但是同样也存在延时比较大的缺点,这在视频监控领域里,轻微的延时影响并不大,但是IP视频技术应用到计算机控制领域中则对其的延时要求严格很多。
在计算机控制领域中有一种技术叫做网络KVM,它采用网络来传输计算机的输出图像和鼠标键盘等计算机控制指令,由于鼠标键盘的操作直接会导致计算机输出图像的变化,而使用者又需要依赖于图像显示内容来决定操作,因此一般需要控制计算机图像的延时在40毫秒以内。传统的视频处理过程一般包括信号采集、视频编码、视频传输、视频解码,最后到视频显示的过程,随着信号分辨率的日益变大,在相同的处理方法下,每个模块的处理时间都会相应增加,从而导致总的延时也相应增加。
现有技术中有一种解决方案是客户端获取待处理视频文件,将待处理视频文件拆分为若干个分片文件并压缩后上传至服务端,服务端接收并解压缩分片文件,存入分布式文件系统;从分布式文件系统中下载各分片文件进行压缩解码后存入分布式文件系统,从分布式文件系统中获取新的分片文件合并为一个完整的视频文件。此方案在客户端将视频文件由大文件拆分为小文件,分片并行上传,虽然可以提高传输效率,减少网络流量耗费,而在服务端将多个小文件并行解码压缩处理,减少处理时间,但是视频处理的过程不单单包括视频的传输和视频的解码,还包括了视频的采集和视频的显示等过程,每个处理过程都存在处理时间,每个过程都会产生延时,因此,此技术方案仍存在减少时延的空间。而且此技术方案将视频文件拆分后,到最后还需要增加一个合并的过程,将待处理视频文件的所有分片文件完成解码压缩处理后,生成一相应的合并任务放入合并队列中,视频合并处理服务器从合并队列取得任务,将所有新的分片文件合并为一个完整的视频文件后存入分布式文件系统中。此合并过程需要等待所有分片文件解码压缩处理后才进行,在一定程度上增加了视频处理时延。
发明内容
本发明为克服上述现有技术所述的至少一种缺陷(不足),提供一种时延低的视频处理方法。
本发明还提供一种时延低的视频处理系统。
为解决上述技术问题,本发明的技术方案如下:
一种视频处理方法,包括:
对视频数据进行分块采集获得分块图像,然后将分块图像分别进行编码、传输、解码,将解码获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中。
本发明的方法对视频处理的采集、编码、传输、解码全过程均采用分块处理策略,将分块的策略延伸到视频数据的采集过程中,而且将解码后的图像数据按照分块图像在原始图像中的位置并行放入到离屏缓存中,使得在解码的过程中同时完成了分块图像的拼接操作,直接节省了分块图像的合并处理步骤,相对于现有技术中仅仅对视频数据的编码、传输、解码过程进行分块处理以及还需要合并处理步骤来说,本发明的方法将视频处理的总延时进一步降低了,实现了视频采集、编码、传输、解码过程中更低延时的处理,提高了视频处理实时性和用户体验。
一种视频处理系统,包括采集模块、编码模块、传输模块、解码模块;采集模块,用于对视频数据进行分块采集获得分块图像;
编码模块、传输模块分别用于对分块图像进行编码和传输;
解码模块,用于将分块图像进行解码并将解码获得的图像数据按照分块图像在原始图像中的位置放到离屏缓存中。
在本发明的系统中,采集模块、编码模块、解码模块中均采用分块处理策略,将分块的策略延伸到视频数据的采集过程中,而且解码模块将解码后的图像数据按照分块图像在原始图像中的位置并行放入到离屏缓存中,使得解码模块具备解码功能的同时还同时完成了分块图像的拼接功能,直接节省了分块图像解码后的合并处理模块,相对于现有技术中仅仅对视频数据的编码、传输、解码过程进行分块处理以及还需要合并处理模块来说,本发明的系统将视频处理的总延时进一步降低了,实现了视频采集、编码、传输、解码过程中更低延时的处理,提高了视频处理实时性和用户体验。
附图说明
图1为本发明一种视频处理方法具体实施例的流程图。
图2为本发明在1*2分块方式下的耗时分析图。
图3为发明一种视频处理系统具体实施例的架构图。
具体实施方式
附图仅用于示例性说明,不能理解为对本专利的限制;
为了更好说明本实施例,附图某些部件会有省略、放大或缩小,并不代表实际产品的尺寸;
对于本领域技术人员来说,附图中某些公知结构及其说明可能省略是可以理解的。
在本发明的描述中,需要理解的是,此外,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或隐含所指示的技术特征的数量。由此,限定的“第一”、“第二”的特征可以明示或隐含地包括一个或者更多个该特征。在本发明的描述中,除非另有说明,“多个”的含义是两个或两个以上。
在本发明的描述中,需要说明的是,除非另有明确的规定和限定,术语“安装”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以是通过中间媒介间接连接,可以说两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明的具体含义。
下面结合附图和实施例对本发明的技术方案做进一步的说明。
实施例1
如图1所示,为本发明一种视频处理方法具体实施例的流程图。参见图1,本具体实施例一种视频处理方法具体包括如下步骤:
S101.对视频数据进行分块采集获得分块图像;
S102.将分块图像分别进行编码;
S103.将编码后的分块图像分别进行传输;
S104.接收分块图像并进行解码,将解码获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中。
在本具体实施例中,视频处理的分块策略延伸到视频数据的采集步骤中,在获取视频数据时,对视频数据进行分块采集获得分块图像,然后将分块图像分别进行编码后传输,在接收端接收分块图像并进行解码,解码后获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中,离屏缓存的存储过程直接完成分块图像的拼接,此方式在视频处理的全过程都采用分块处理方式,而且在解码过程中完成解码操作的同时还完成分块图像拼接的步骤,大大减少视频处理过程的时延,提高了视频数据的实时性,对于IP视频等对时延要求的应用领域,本具体实施例的方法更加适用。
在具体实施过程中,步骤S101的具体步骤包括:
从视频数据源中获取图像数据,获取时以预设单位对图像数据进行扫描获得预设单位大 小的分块图像。
通常,预设单位为行像素或列像素,扫描时以图像的行或列为单位对图像进行扫描获得1*N或N*1大小的分块图像,N表示分块图像的大小。在一种优选的实施方式中,为了适应设备,对视频数据的采集是以图像的行作为单位进行扫描,分块的策略采用1*N的方式进行,即在水平方向上不分块,垂直方向上分块,N为分块的数量。
为了对分块图像进行标记,方便后续对分块图像进行拼接,步骤S101中对视频数据进行分块时为获得的每个分块图像记录帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息。
在步骤S102中,编码步骤直接对采集后的各分块图像进行编码,编码时每个分块图像是独立进行的,而且多个分块图像的编码时并行进行的,编码过程中使用采集时记录的帧编号和/或时间戳信息。
在步骤S103中,编码后的分块图像数据通过网络传输出去,如传输到接收端。传输时每个分块图像也是独立进行的,而且可以是并行进行的。
在步骤S104中,并行接收到各个分块图像后,并行进行解码并并行存放到离屏缓存中。解码时根据各个分块图像的帧编号和/或时间戳进行,存入离屏缓存时根据每个分块图像的帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息放到离屏缓存,分块图像在原始图像中的位置是根据每个分块图像在原始图像中的坐标区域信息得到的。在此步骤中,分块图像存入离屏缓存中时同时完成了分块图像的拼接和存放,节省了分块图像的合并过程。
在具体实施过程中,本具体实施例的方法还包括显示步骤:
S105.将离屏缓存中缓存的图像数据切换到到当前屏幕来显示。
传统视频处理方式中,视频数据不进行分块处理,全部视频数据统一进行采集、编码、传输和解码,用Tc、Te、Tt、Td分别表示采集、编码、传输、解码一幅图像的时间,那么传统视频处理方案处理一幅图像整个过程的延时至少为Tc+Te+Tt+Td。
而采用本发明的方案,如图2所示,则延时降成约(Tc+Te+Tt+Td)/N+(N-1)*max(Tc/N,Te/N,Tt/N,Td/N),其中max(Tc/N,Te/N,Tt/N,Td/N)表示取Tc/N、Te/N、Tt/N、Td/N中最大的一个值。
假设对图像进行1*2的分块来处理,为了简化说明,假设Tc、Te、Tt、Td的值相同,则一幅图像从采集到显示中间耗时为(Tc+Te+Tt+Td)+Td/2,会明显优于传统视频处理方案的Tc+Te+Tt+Td。
此外,如背景技术中所提及的现有技术一种解决方案,为了方便进行耗时计算,在此将 其步骤简化为编码、传输、解码过程进行分块处理(从背景技术看,该视频处理过程还包括先后两次在分布式文件系统进行数据存储等其他步骤),其延时至少包括Tc、Te/N、Tt/N、Td/N,还应当包括图像合并处理的时间Th,因此其总的延时至少为Tc+Te/N+Tt/N+Td/N+Th,可见本发明的方法相对于该解决方案,在延时上仍有大幅度的降低。
实施例2
在实施例1的基础上,本发明还提供一种视频处理系统。如图3所示,为本发明一种视频处理系统的架构图。参见图2,本具体实施例一种视频处理系统具体包括采集模块201、编码模块202、传输模块203、解码模块204;
采集模块201,用于对视频数据进行分块采集获得分块图像;
编码模块202,用于将分块图像分别进行编码;
传输模块203,用于将编码好的分块图像进行传输;
解码模块204用于接收分块图像,将分块图像进行解码并将解码获得的图像数据按照分块图像在原始图像中的位置并放到离屏缓存中。
在本具体实施例中,视频处理的分块策略延伸到视频数据的采集模块201中,采集模块201在获取视频数据时,对视频数据进行分块采集获得分块图像,然后编码模块202和传输模块203将分块图像分别进行编码后传输,解码模块204在接收端接收分块图像并进行解码,解码后获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中,离屏缓存的存储过程直接完成分块图像的拼接,此方式在视频处理的全过程都采用分块处理方式,而且解码模块204在解码过程中完成解码操作的同时还完成分块图像的拼接,大大减少视频处理过程的时延,提高了视频数据的实时性,对于IP视频等对时延要求的应用领域,本具体实施例的方法更加适用。
在具体实施过程中,采集模块201具体用于:
从视频数据源中获取图像数据,获取时以图像行像素或列像素为单位对图像数据进行扫描获得1*N或N*1大小的分块图像,N表示图像中最大的列像素或者行像素。在一种优选的实施方式中,为了适应设备,采集模块201对视频数据的采集是以图像的行作为单位进行扫描,分块的策略采用1*N的方式进行,即在水平方向上不分块,垂直方向上分块。
为了对分块图像进行标记,方便后续对分块图像进行拼接,采集模块201对视频数据进行分块时为获得的每个分块图像记录帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息。
在编码模块202中,直接对采集后的各分块图像进行编码,编码时每个分块图像是独立 进行的,而且多个分块图像的编码时并行进行的,编码过程中使用采集模块201记录的帧编号和/或时间戳信息。
在传输模块203中,将编码后的分块图像数据通过网络传输出去,如传输到接收端。传输时每个分块图像也是独立进行的,而且可以是并行进行的。
在解码模块204中,并行接收到各个分块图像后,并行进行解码并并行存放到离屏缓存中。解码时根据各个分块图像的帧编号和/或时间戳进行,存入离屏缓存时根据每个分块图像的帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息放到离屏缓存,分块图像在原始图像中的位置是根据每个分块图像在原始图像中的坐标区域信息得到的。在此模块中,分块图像存入离屏缓存中时同时完成了分块图像的拼接和存放,节省解码后的合并模块
在具体实施过程中,本具体实施例的系统还包括显示模块205,用于将离屏缓存中缓存的图像数据切换到到当前屏幕来显示。
传统视频处理系统中,视频数据不进行分块处理,全部视频数据统一进行采集、编码、传输和解码,用Tc、Te、Tt、Td分别表示采集模块、编码模块、传输模块、解码模块来处理一幅图像的时间,那么传统视频处理方案处理一幅图像整个过程的延时至少为Tc+Te+Tt+Td。
而采用本发明的方案,如图2所示,则延时降成约(Tc+Te+Tt+Td)/N+(N-1)*max(Tc/N,Te/N,Tt/N,Td/N),其中max(Tc/N,Te/N,Tt/N,Td/N)表示取Tc/N、Te/N、Tt/N、Td/N中最大的一个值。
假设对图像进行1*2的分块来处理,为了简化说明,假设Tc、Te、Tt、Td的值相同,则一幅图像从采集到显示中间耗时为(Tc+Te+Tt+Td)+Td/2,会明显优于传统视频处理方案的Tc+Te+Tt+Td。
相同或相似的标号对应相同或相似的部件;
附图中描述位置关系的用于仅用于示例性说明,不能理解为对本专利的限制;
显然,本发明的上述实施例仅仅是为清楚地说明本发明所作的举例,而并非是对本发明的实施方式的限定。对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明权利要求的保护范围之内。

Claims (10)

  1. 一种视频处理方法,其特征在于,包括:
    对视频数据进行分块采集获得分块图像,然后将分块图像分别进行编码、传输、解码,将解码获得的分块图像按照分块图像在原始图像中的位置放到离屏缓存中。
  2. 根据权利要求1所述的视频处理方法,其特征在于,对视频数据进行分块采集获得分块图像的具体步骤包括:
    从视频数据源中获取图像数据,获取时以预设单位对图像数据进行扫描获得预设单位大小的分块图像。
  3. 根据权利要求2所述的视频处理方法,其特征在于,所述预设单位是行像素或列像素,扫描时以行或列为单位对图像进行扫描获得1*N或N*1大小的分块图像,N表示分块图像的大小。
  4. 根据权利要求1所述的视频处理方法,其特征在于,视频数据进行分块时为获得的每个分块图像记录帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息。
  5. 根据权利要求4所述的视频处理方法,其特征在于,分块图像按照分块图像在原始图像中的位置放到离屏缓存时是根据每个分块图像的帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息进行的。
  6. 根据权利要求1-5任一项所述的视频处理方法,其特征在于,所述方法还包括显示步骤:将离屏缓存中缓存的图像数据切换到到当前屏幕来显示。
  7. 一种视频处理系统,包括采集模块、编码模块、传输模块、解码模块;其特征在于,
    采集模块,用于对视频数据进行分块采集获得分块图像;
    编码模块、传输模块分别用于对分块图像进行编码和传输;
    解码模块,用于将分块图像进行解码并将解码获得的图像数据按照分块图像在原始图像中的位置放到离屏缓存中。
  8. 根据权利要求7所述的视频处理系统,其特征在于,所述采集模块具体用于:
    从视频数据源中获取图像数据,获取时以行像素或列像素为单位对图像数据进行扫描获得1*N或N*1大小的分块图像,N表示分块图像的大小。
  9. 根据权利要求8所述的视频处理系统,其特征在于,采集模块对视频数据进行分块时为获得的每个分块图像记录帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息。
  10. 根据权利要求9所述的视频处理系统,其特征在于,解码模块将分块图像按照分块图像在原始图像中的位置放到离屏缓存时是根据每个分块图像的帧编号和/或时间戳信息以及分块图像在原始图像中的坐标区域信息进行的。
PCT/CN2017/096477 2016-12-08 2017-08-08 一种视频处理方法及系统 WO2018103359A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201611124548.2A CN106792270A (zh) 2016-12-08 2016-12-08 一种视频处理方法及系统
CN201611124548.2 2016-12-08

Publications (1)

Publication Number Publication Date
WO2018103359A1 true WO2018103359A1 (zh) 2018-06-14

Family

ID=58877439

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/096477 WO2018103359A1 (zh) 2016-12-08 2017-08-08 一种视频处理方法及系统

Country Status (2)

Country Link
CN (1) CN106792270A (zh)
WO (1) WO2018103359A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111654644A (zh) * 2020-05-15 2020-09-11 西安万像电子科技有限公司 图像传输方法及系统

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106792270A (zh) * 2016-12-08 2017-05-31 广东威创视讯科技股份有限公司 一种视频处理方法及系统
CN107197369B (zh) * 2017-06-06 2020-01-14 清华大学 一种多子流协同的视频流媒体并行解码方法
CN109688314B (zh) * 2019-02-19 2021-03-09 福州视驰科技有限公司 低延时少缓存且数据输出方式可控的摄像头系统及方法
CN113160026B (zh) * 2020-01-07 2024-03-05 北京地平线机器人技术研发有限公司 图像处理方法、装置、介质以及电子设备
CN111447339A (zh) * 2020-03-26 2020-07-24 西安万像电子科技有限公司 图像传输方法和系统
CN111556076B (zh) * 2020-05-15 2020-12-29 杭州玖城网络科技有限公司 一种多路径网络实时视频传输的方法
CN112954438B (zh) * 2021-02-05 2022-02-11 摩尔线程智能科技(北京)有限责任公司 一种图像处理方法以及装置
WO2022188021A1 (zh) * 2021-03-09 2022-09-15 深圳市大疆创新科技有限公司 数据存储方法、数据读取方法、装置、系统及存储介质
CN113766235B (zh) * 2021-08-30 2023-10-17 聚好看科技股份有限公司 一种全景视频传输方法及设备
CN114302147A (zh) * 2022-01-17 2022-04-08 周毅 视频流数据的录放方法、系统、存储介质及电子装置

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101309259A (zh) * 2008-06-24 2008-11-19 广东威创视讯科技股份有限公司 一种分布式图像显示方法
CN101383951A (zh) * 2008-09-25 2009-03-11 杭州爱威芯科技有限公司 画面分割装置
CN102256127A (zh) * 2011-07-20 2011-11-23 东南大学 一种多模式无线通信网络的实时视频传输系统
US9497457B1 (en) * 2012-10-18 2016-11-15 Altia Systems Inc Repacking of panoramic video based on standard image formats
CN106792270A (zh) * 2016-12-08 2017-05-31 广东威创视讯科技股份有限公司 一种视频处理方法及系统

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101754021B (zh) * 2009-12-18 2011-09-28 江苏省烟草公司常州市公司 基于改进小波变换图像压缩方法实现手机移动门户的方法
EP2472867A1 (en) * 2010-12-30 2012-07-04 Advanced Digital Broadcast S.A. Coding and decoding of multiview videos
CN102724492B (zh) * 2012-06-28 2015-06-03 广东威创视讯科技股份有限公司 视频图像传输、播放方法和系统
CN103049238A (zh) * 2012-12-14 2013-04-17 广东威创视讯科技股份有限公司 影像数据传输方法及装置
CN203352678U (zh) * 2013-01-30 2013-12-18 深圳市振华微电子有限公司 六摄像头全景图像硬件成像装置
CN103763556B (zh) * 2014-01-29 2018-03-06 广东威创视讯科技股份有限公司 视频图像编解码装置及其方法、传输系统及其方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101309259A (zh) * 2008-06-24 2008-11-19 广东威创视讯科技股份有限公司 一种分布式图像显示方法
CN101383951A (zh) * 2008-09-25 2009-03-11 杭州爱威芯科技有限公司 画面分割装置
CN102256127A (zh) * 2011-07-20 2011-11-23 东南大学 一种多模式无线通信网络的实时视频传输系统
US9497457B1 (en) * 2012-10-18 2016-11-15 Altia Systems Inc Repacking of panoramic video based on standard image formats
CN106792270A (zh) * 2016-12-08 2017-05-31 广东威创视讯科技股份有限公司 一种视频处理方法及系统

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111654644A (zh) * 2020-05-15 2020-09-11 西安万像电子科技有限公司 图像传输方法及系统

Also Published As

Publication number Publication date
CN106792270A (zh) 2017-05-31

Similar Documents

Publication Publication Date Title
WO2018103359A1 (zh) 一种视频处理方法及系统
CN103905741B (zh) 超高清全景视频实时生成与多通道同步播放系统
US10911781B2 (en) Image processing device and image processing method for encoding/decoding omnidirectional images at different resolutions
US8374444B2 (en) Method and apparatus for providing higher resolution images in an embedded device
WO2017219652A1 (zh) 一种头戴显示器、视频输出设备和视频处理方法、系统
US20040042547A1 (en) Method and apparatus for digitizing and compressing remote video signals
US10051271B2 (en) Coding structure
CN101883140B (zh) 基于远程显示的编码系统及方法、服务器
TW201526617A (zh) 影像處理方法與系統、解碼方法、編碼器與解碼器
RU2010127311A (ru) Система и способ сжигания интерактивного потокового видео
US9264663B2 (en) Ultra low latency video communication
WO2013113191A1 (zh) 一种视频图像快速传输的方法及系统
WO2018032696A1 (zh) 桌面截屏控制方法及系统
CN104602095A (zh) 一种组合桌面采集与同步显示方法及系统
US10674163B2 (en) Color space compression
CN103475877A (zh) 视频传输方法和系统
CN111406404B (zh) 获得视频文件的压缩方法、解压缩方法、系统及存储介质
US8798379B2 (en) Image processing method, encoding device, decoding device, and image processing apparatus
US20100049832A1 (en) Computer program product, a system and a method for providing video content to a target system
WO2020168501A1 (zh) 图像的编码方法、解码方法及所适用的设备、系统
CN107318021B (zh) 一种远程显示的数据处理方法及系统
CN110798700B (zh) 视频处理方法、视频处理装置、存储介质与电子设备
CN110753229A (zh) 基于h.265编码的视频采集装置及方法
WO2024051328A1 (zh) 图像编解码方法、装置、编码器、解码器和系统
WO2024051329A1 (zh) 图像编解码方法、装置、编码器、解码器和系统

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17878257

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 17.10.2019)

122 Ep: pct application non-entry in european phase

Ref document number: 17878257

Country of ref document: EP

Kind code of ref document: A1