CN112351308A - Method for realizing rapid transcoding based on local transcoding technology - Google Patents

Method for realizing rapid transcoding based on local transcoding technology Download PDF

Info

Publication number
CN112351308A
CN112351308A CN202011192092.XA CN202011192092A CN112351308A CN 112351308 A CN112351308 A CN 112351308A CN 202011192092 A CN202011192092 A CN 202011192092A CN 112351308 A CN112351308 A CN 112351308A
Authority
CN
China
Prior art keywords
transcoding
audio
video
template
local
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011192092.XA
Other languages
Chinese (zh)
Inventor
吴奕刚
张盼
纪亭
李宏元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Arcvideo Technology Co ltd
Original Assignee
Hangzhou Arcvideo Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Arcvideo Technology Co ltd filed Critical Hangzhou Arcvideo Technology Co ltd
Priority to CN202011192092.XA priority Critical patent/CN112351308A/en
Publication of CN112351308A publication Critical patent/CN112351308A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234309Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4 or from Quicktime to Realvideo
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a method for realizing rapid transcoding based on a local transcoding technology. The method specifically comprises the following steps: (1) an editor previews an original video stream file, dots a stripped video file, and issues a local transcoding task to a background server; (2) a background server schedules a transcoding task, namely, the incomplete GOP fragments in a video block to be edited are subjected to targeted transcoding processing, and other complete GOP fragments are subjected to transcoding strategy adjustment and load balancing according to a transparent coding mode, and finally a task is issued to a local transcoding background, and a transcoding result is fed back to be edited at the front end; (3) and calling a core transcoding Transcoder engine, executing a local transcoding task, and outputting a finished product video file. The invention has the beneficial effects that: the method meets the requirement of fast frame accurate coding and transcoding, improves the rate and resource occupancy rate of a transcoding system, has high accuracy of transcoding frames and low error rate, can realize efficient video coding and transcoding and realize fast content online of programs.

Description

Method for realizing rapid transcoding based on local transcoding technology
Technical Field
The invention relates to the technical field related to audio and video coding and transcoding, in particular to a method for realizing rapid transcoding based on a local transcoding technology.
Background
In modern society, watching videos become an indispensable part of people's lives, and the videos are required to be edited before being formally pushed to end users. The news video usually has higher requirements on the video editing time due to the requirement on timeliness, and if the video clip coding time can be shortened, the news report video can be displayed on the user terminal equipment more quickly.
Generally, a video file needs to be quickly transcoded and output after being edited, and the following two methods are currently used:
the method comprises the following steps: and carrying out ordinary transcoding through a transcoding system. The accurate frame encoding and transcoding can be realized, and the time lengths of the transcoded files are consistent. However, when the video source is 4K/HDR and has a high bit rate, the performance requirement on the transcoding server is high, the transcoding efficiency is low, the time consumption is long, and the like.
The second method comprises the following steps: and carrying out transparent transmission transcoding through a coding and transcoding system. The transcoding efficiency is high, the speed is high, but the file output by transparent transmission often has a series of problems such as frame loss, popping, screen splash and the like, and the requirements of the auditing program on video quality such as video accuracy and the like cannot be met.
The two methods have defects and cannot meet the requirement of rapid frame accurate coding and transcoding.
Disclosure of Invention
The invention provides a method for realizing rapid transcoding based on a local transcoding technology, which can meet the requirements of rapid frame accurate coding and transcoding, and aims to overcome the defects in the prior art.
In order to achieve the purpose, the invention adopts the following technical scheme:
a method for realizing rapid transcoding based on a local transcoding technology specifically comprises the following steps:
(1) an editor previews an original video stream file, dots a stripped video file, and issues a local transcoding task to a background server;
(2) a background server schedules a transcoding task, namely, the incomplete GOP fragments in a video block to be edited are subjected to targeted transcoding processing, and other complete GOP fragments are subjected to transcoding strategy adjustment and load balancing according to a transparent coding mode, and finally a task is issued to a local transcoding background, and a transcoding result is fed back to be edited at the front end;
(3) and calling a core transcoding Transcoder engine, executing a local transcoding task, and outputting a finished product video file.
The invention is based on video coding format, common video is based on GOP mode, that is, pictures are organized in sequence, one sequence is a data stream after a section of picture coding, starting with an I frame and ending with the next I frame. While the entire video is composed of a series of GOP concatenations. A GOP sequence is a series of data streams generated by encoding pictures with less than significant content differences. When the motion change is less, a sequence can be long, because the less motion change represents that the content change of the image picture is small, an I frame can be coded, and then P frames and B frames are always coded. When the motion changes much, it is possible that a sequence is shorter, e.g. containing one I-frame and 3, 4P-frames. The GOP is used as a processing unit, video blocks needing to be edited and incomplete GOP segments are subjected to targeted encoding and transcoding processing, other large number of video blocks and complete GOP segments are processed in a penetrating encoding mode and recombined into a new video, so that the requirement of rapid frame accurate encoding and transcoding is met, the rate and the resource occupancy rate of a transcoding system are improved, the accuracy of transcoding frames is high, the error rate is low, multiple segment files are combined, the problems of popping, frame accuracy, audio and video synchronization and the like are solved, a common audio and video file format is widely supported, offline/recording and other types of files are supported, a 4K/HDR transcoding technology is supported, the encoding of the video can be efficiently completed, and the rapid content transcoding of programs is realized.
Preferably, the specific operation method comprises the following steps: through the ffprobe component, the first key frame time keyframe1 at the beginning of the slice and the last key frame time keyframe2 at the end of the slice are searched as time points for distinguishing the transcoding fragment and the transparent transmission fragment, and the source stream is split into video1, video2, video3 and the whole audio fragment.
Preferably, in step (2), Mediainfo information of a source video is acquired by a Transcoder component to acquire resolution, bitrate, frame rate and audio/video format, as transcoding template configuration parameters, according to key frame point information of keyframe1 and keyframe2, time lengths of segments of video1, video2 and video3 are calculated, according to the acquired transcoding template configuration parameters, built-in template default values are modified to generate four types of transcoding templates including a no-audio transcoding template, a no-audio transparent transmission template, a no-video transparent transmission template and a combined transcoding template, and the Transcoder component transcodes video1 and video3 and video2 and audio/video fragments are generated by video transparent transmission.
Preferably, in step (3), the Transcoder component performs the following tasks based on the key frame point location information of keyframe1 and keyframe2 in step (1) and the four types of transcoding templates generated in step (2):
(31) no audio transcoding template: the template is configured with no audio transcoding, and audio files with no audio at the head and the tail, namely video1[ start, keyframe1] and video3[ keyframe2, end ], are output;
(32) the non-audio transparent transmission template: the template is configured with audio-free transparent transmission, and outputs an intermediate audio-free file, namely video2[ keyframe1, keyframe2 ];
(33) no video transparent transmission template: the template is configured with non-video transparent transmission, and outputs the whole section of non-video file, namely audio [ start, end ];
(34) merging transcoding templates: the template is configured with a plurality of sections of audio and video merging and transcoding, video1, video2, video3 and audio are merged, and finished video start and end is output.
The invention has the beneficial effects that: the method meets the requirement of fast frame accurate coding and transcoding, improves the rate and resource occupancy rate of a transcoding system, has high accuracy of transcoding frames and low error rate, can realize efficient video coding and transcoding and realize fast content online of programs.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
The invention is further described with reference to the following figures and detailed description.
In the embodiment shown in fig. 1, a method for implementing fast transcoding based on a partial transcoding technology specifically includes the following steps:
(1) an editor previews an original video stream file, dots a stripped video file, and issues a local transcoding task to a background server; the specific operation method comprises the following steps: through the ffprobe component, the first key frame time keyframe1 at the beginning of the slice and the last key frame time keyframe2 at the end of the slice are searched as time points for distinguishing the transcoding fragment and the transparent transmission fragment, and the source stream is split into video1, video2, video3 and the whole audio fragment.
(2) A background server schedules a transcoding task, namely, the incomplete GOP fragments in a video block to be edited are subjected to targeted transcoding processing, and other complete GOP fragments are subjected to transcoding strategy adjustment and load balancing according to a transparent coding mode, and finally a task is issued to a local transcoding background, and a transcoding result is fed back to be edited at the front end; acquiring Mediainfo information of a source video by a Transcoder component to acquire resolution, code rate, frame rate and audio/video formats, using the Mediainfo information as a transcoding template configuration parameter, calculating the duration of each segment of video1, video2 and video3 according to key frame point location information of keyframe1 and keyframe2, modifying a built-in template default value according to the acquired transcoding template configuration parameter to generate four types of transcoding templates including a no-audio transcoding template, a no-audio transparent transmission template, a no-video transparent transmission template and a merging transcoding template, transcoding the Transcoder component transcodes video1 and video3, and the video2 and audio transparent transmission generate audio/video segments.
(3) And calling a core transcoding Transcoder engine, executing a local transcoding task, and outputting a finished product video file. The Transcoder component performs the following tasks based on the key frame point location information of keyframe1 and keyframe2 in step (1) and the four types of transcoding templates generated in step (2):
(31) no audio transcoding template: the template is configured with no audio transcoding, and audio files with no audio at the head and the tail, namely video1[ start, keyframe1] and video3[ keyframe2, end ], are output;
(32) the non-audio transparent transmission template: the template is configured with audio-free transparent transmission, and outputs an intermediate audio-free file, namely video2[ keyframe1, keyframe2 ];
(33) no video transparent transmission template: the template is configured with non-video transparent transmission, and outputs the whole section of non-video file, namely audio [ start, end ];
(34) merging transcoding templates: the template is configured with a plurality of sections of audio and video merging and transcoding, video1, video2, video3 and audio are merged, and finished video start and end is output.
And the fourth step is to merge and transcode the audio and video clips output in the first three steps, and finally output the fragmented video files. Note that: the consistency of transcoding parameters of the transcoding template and the transparent template video is ensured, and the problem of video quality after merging is prevented.
The invention is based on video coding format, common video is based on GOP mode, that is, pictures are organized in sequence, one sequence is a data stream after a section of picture coding, starting with an I frame and ending with the next I frame. While the entire video is composed of a series of GOP concatenations. A GOP sequence is a series of data streams generated by encoding pictures with less than significant content differences. When the motion change is less, a sequence can be long, because the less motion change represents that the content change of the image picture is small, an I frame can be coded, and then P frames and B frames are always coded. When the motion changes much, it is possible that a sequence is shorter, e.g. containing one I-frame and 3, 4P-frames. The GOP is used as a processing unit, the video blocks to be edited and incomplete GOP fragments are subjected to targeted coding and transcoding processing, and other large number of video blocks and complete GOP fragments are processed according to a through coding mode and recombined into a new video.

Claims (4)

1. A method for realizing rapid transcoding based on a local transcoding technology is characterized by comprising the following steps:
(1) an editor previews an original video stream file, dots a stripped video file, and issues a local transcoding task to a background server;
(2) a background server schedules a transcoding task, namely, the incomplete GOP fragments in a video block to be edited are subjected to targeted transcoding processing, and other complete GOP fragments are subjected to transcoding strategy adjustment and load balancing according to a transparent coding mode, and finally a task is issued to a local transcoding background, and a transcoding result is fed back to be edited at the front end;
(3) and calling a core transcoding Transcoder engine, executing a local transcoding task, and outputting a finished product video file.
2. The method for realizing fast transcoding based on local transcoding technique as claimed in claim 1, wherein in step (1), the specific operation method is as follows: through the ffprobe component, the first key frame time keyframe1 at the beginning of the slice and the last key frame time keyframe2 at the end of the slice are searched as time points for distinguishing the transcoding fragment and the transparent transmission fragment, and the source stream is split into video1, video2, video3 and the whole audio fragment.
3. The method as claimed in claim 2, wherein in step (2), the Transcoder component obtains Mediainfo information of a source video to obtain resolution, bitrate, frame rate and audio/video format, as transcoding template configuration parameters, calculates duration of each segment of video1, video2 and video3 according to key frame point information of keyframe1 and keyframe2, modifies a built-in template default value according to the obtained transcoding template configuration parameters, and generates four types of transcoding templates including no audio transcoding template, no audio transparent template, no video transparent template and merging transcoding template, and the Transcoder component generates audio/video segments by video1 and video3, video2 and audio transparent transcoding.
4. The method of claim 3, wherein in step (3), the Transcoder component performs the following tasks based on the key frame point location information of keyframe1 and keyframe2 in step (1) and the four types of transcoding templates generated in step (2):
(31) no audio transcoding template: the template is configured with no audio transcoding, and audio files with no audio at the head and the tail, namely video1[ start, keyframe1] and video3[ keyframe2, end ], are output;
(32) the non-audio transparent transmission template: the template is configured with audio-free transparent transmission, and outputs an intermediate audio-free file, namely video2[ keyframe1, keyframe2 ];
(33) no video transparent transmission template: the template is configured with non-video transparent transmission, and outputs the whole section of non-video file, namely audio [ start, end ];
(34) merging transcoding templates: the template is configured with a plurality of sections of audio and video merging and transcoding, video1, video2, video3 and audio are merged, and finished video start and end is output.
CN202011192092.XA 2020-10-30 2020-10-30 Method for realizing rapid transcoding based on local transcoding technology Pending CN112351308A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011192092.XA CN112351308A (en) 2020-10-30 2020-10-30 Method for realizing rapid transcoding based on local transcoding technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011192092.XA CN112351308A (en) 2020-10-30 2020-10-30 Method for realizing rapid transcoding based on local transcoding technology

Publications (1)

Publication Number Publication Date
CN112351308A true CN112351308A (en) 2021-02-09

Family

ID=74356222

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011192092.XA Pending CN112351308A (en) 2020-10-30 2020-10-30 Method for realizing rapid transcoding based on local transcoding technology

Country Status (1)

Country Link
CN (1) CN112351308A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285361B1 (en) * 1996-11-15 2001-09-04 Futuretel, Inc. Method and apparatus for clipping video segments from an audiovisual file
CN1344106A (en) * 2000-09-15 2002-04-10 北京算通数字技术研究中心有限公司 Edition method for non-linear edition system based on MPEG-2 code stream
CN1668097A (en) * 2005-04-07 2005-09-14 北京北大方正电子有限公司 A method for quick generation of video file
CN105357531A (en) * 2015-12-15 2016-02-24 北京网博视界科技股份有限公司 Rapid cutting packaging method based on video local encoding
CN105611317A (en) * 2016-01-18 2016-05-25 北京流金岁月文化传播股份有限公司 Video-audio transcoding device and method capable of supporting audio unvarnished transmission
CN106803992A (en) * 2017-02-14 2017-06-06 北京时间股份有限公司 Video clipping method and device
US20170188071A1 (en) * 2015-12-28 2017-06-29 Synchronoss Technologies, Inc. Method and apparatus for just-in-time transcoding
CN107197372A (en) * 2017-06-30 2017-09-22 北京金山安全软件有限公司 Method and device for shearing batch vertical screen videos and electronic equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285361B1 (en) * 1996-11-15 2001-09-04 Futuretel, Inc. Method and apparatus for clipping video segments from an audiovisual file
CN1344106A (en) * 2000-09-15 2002-04-10 北京算通数字技术研究中心有限公司 Edition method for non-linear edition system based on MPEG-2 code stream
CN1668097A (en) * 2005-04-07 2005-09-14 北京北大方正电子有限公司 A method for quick generation of video file
CN105357531A (en) * 2015-12-15 2016-02-24 北京网博视界科技股份有限公司 Rapid cutting packaging method based on video local encoding
US20170188071A1 (en) * 2015-12-28 2017-06-29 Synchronoss Technologies, Inc. Method and apparatus for just-in-time transcoding
CN105611317A (en) * 2016-01-18 2016-05-25 北京流金岁月文化传播股份有限公司 Video-audio transcoding device and method capable of supporting audio unvarnished transmission
CN106803992A (en) * 2017-02-14 2017-06-06 北京时间股份有限公司 Video clipping method and device
CN107197372A (en) * 2017-06-30 2017-09-22 北京金山安全软件有限公司 Method and device for shearing batch vertical screen videos and electronic equipment

Similar Documents

Publication Publication Date Title
US6989868B2 (en) Method of converting format of encoded video data and apparatus therefor
US20180109824A1 (en) Codec Techniques for Fast Switching
US9286940B1 (en) Video editing with connected high-resolution video camera and video cloud server
CN1254952C (en) Efficient transmission and playback of digital information
US6871003B1 (en) Edit decision list for identifying the pull down phase of a video signal
KR100896974B1 (en) Method, devices, system and software product for compressed-domain video editing
CN109348309B (en) Distributed video transcoding method suitable for frame rate up-conversion
US11064245B1 (en) Piecewise hybrid video and audio synchronization
CN112653904B (en) Rapid video clipping method based on PTS and DTS modification
CN104284241A (en) Video editing method and device
CN109769142B (en) Video cutting method and system for urban media wall light show
US20140380394A1 (en) Device and method for composing programmes from different sources in baseband
CN109640162B (en) Code stream conversion method and system
GB2358539A (en) Data processing method which separates parameter data from coded data
US20170302900A1 (en) Controlling modes of sub-title presentation
CN106817588A (en) Transcoding control method and device, net cast method and system
CN112087642B (en) Cloud guide playing method, cloud guide server and remote management terminal
CN110602522A (en) Multi-path real-time live webRTC stream synthesis method
US7269184B2 (en) Data transmission method and apparatus and data receiving method and apparatus
Wieckowski et al. A complete end to end open source toolchain for the versatile video coding (VVC) standard
CN113382278B (en) Video pushing method and device, electronic equipment and readable storage medium
US10299009B2 (en) Controlling speed of the display of sub-titles
CN112351308A (en) Method for realizing rapid transcoding based on local transcoding technology
CN114339316A (en) Video stream coding processing method based on live video
CN111601113B (en) Live broadcast source tracing method based on block chain

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210209

RJ01 Rejection of invention patent application after publication