WO2023116254A1

WO2023116254A1 - Live video recording method, apparatus and system, and terminal device

Info

Publication number: WO2023116254A1
Application number: PCT/CN2022/131510
Authority: WO
Inventors: 杨柳
Original assignee: Oppo广东移动通信有限公司
Priority date: 2021-12-22
Filing date: 2022-11-11
Publication date: 2023-06-29
Also published as: CN114173150A

Abstract

A live video recording method, apparatus and system, and a terminal device. The method comprises: processing live stream data to obtain original video compressed data and decoded audio data; copying the original video compressed data and the decoded audio data, and encoding the copied decoded audio data into audio compressed data in a specified format, which can be synthesized with the video compressed data; and then synthesizing the copied original video compressed data and the audio compressed data in the specified format into a video file. Further provided in the embodiments of the present disclosure are an apparatus and system for implementing the method, and a terminal device. The embodiments of the present disclosure can realize high-definition recording and storage of a live video, and have universality.

Description

A live video recording method, device, system and terminal equipment

Cross-reference

This application claims priority to Chinese patent application No. 202111583166.7, titled "A Live Video Recording Method, Device, System, and Terminal Equipment", filed with the China Patent Office on December 22, 2021, the entire content of which is incorporated into this application by reference.

Technical field

Embodiments of the present disclosure relate to but are not limited to video technology, and more specifically, relate to a live video recording method, device, system, and terminal equipment.

Background

Video combines rich elements such as images, text, and sound, and has gradually become the mainstream form of expression on the Internet. Live video broadcasting relies on the Internet and streaming media technology. When a user device sends a live broadcast request, the server sends the broadcast address of the requested live channel to the user device, and the user device joins the corresponding multicast group according to the broadcast address to receive the live stream data. The live stream data can be played and recorded, but the quality of the recorded video needs to be improved.

Summary of the invention

The following is an overview of the subject matter described in detail in this document. This summary is not intended to limit the scope of the claims.

An embodiment of the present disclosure provides a live video recording method, including the following recording process:

processing live stream data to obtain original video compressed data and decoded audio data;

copying the original video compressed data and the decoded audio data, and encoding the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data; and

synthesizing the copied original video compressed data and the audio compressed data in the specified format into a video file.

An embodiment of the present disclosure also provides a live video recording system, including a live stream player and a video recorder, wherein:

The live stream player is configured to extract original video compressed data and original audio compressed data from the live stream data, decode the original video compressed data and the original audio compressed data respectively to obtain decoded video data and decoded audio data, and play the decoded video data and the decoded audio data synchronously;

The video recorder is configured to copy the original video compressed data and the decoded audio data, encode the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data, and synthesize the copied original video compressed data and the audio compressed data in the specified format into a video file.

An embodiment of the present disclosure also provides a live video recording device, including a memory and a processor, wherein a computer program is stored in the memory, and when the processor executes the computer program, the live video recording method described in any embodiment of the present disclosure can be implemented.

An embodiment of the present disclosure also provides a terminal device, including a processor, a memory connected to the processor through a bus, a display device, an audio device, an input device, and a network interface. The memory stores a live stream receiving program, a live stream playing program, and a live video recording program. When the processor executes the live stream receiving program, live stream data can be received through the network interface; when it executes the live stream playing program, the live stream data can be processed and played through the display device and the audio device; and when it executes the live video recording program, the live video recording method described in any embodiment of the present disclosure can be implemented according to instructions from the input device.

An embodiment of the present disclosure also provides a non-transitory computer-readable storage medium. The computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the live video recording method described in any embodiment of the present disclosure can be implemented.

Other aspects will become apparent upon reading and understanding the drawings and the detailed description.

Figure overview

The accompanying drawings provide an understanding of the embodiments of the present disclosure and constitute a part of the description. Together with the embodiments of the present disclosure, they are used to explain the technical solutions of the present disclosure and do not limit those technical solutions.

FIG. 1 is a flow chart of a live video recording method according to an embodiment of the present disclosure;

FIG. 2 is a schematic diagram of a live video recording system according to an embodiment of the present disclosure;

FIG. 3 is a block diagram of a live video recording system according to an embodiment of the present disclosure;

FIG. 4 is a schematic structural diagram of a live video recording device according to an embodiment of the present disclosure;

FIG. 5 is a schematic structural diagram of a terminal device according to an embodiment of the present disclosure.

Detailed description

The present disclosure describes various embodiments, but the description is exemplary rather than restrictive, and it is obvious to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the embodiments described in the present disclosure.

In the description of the present disclosure, words such as "exemplary" or "for example" are used to indicate an example, instance, or illustration. Any embodiment described in this disclosure as "exemplary" or "for example" should not be construed as preferred or advantageous over other embodiments. "And/or" herein describes the relationship between associated objects and indicates that three relationships are possible; for example, A and/or B can mean: A exists alone, A and B exist simultaneously, or B exists alone. "A plurality" means two or more. In addition, to clearly describe the technical solutions of the embodiments of the present disclosure, words such as "first" and "second" are used to distinguish identical or similar items with basically the same function and effect. Those skilled in the art can understand that words such as "first" and "second" do not limit quantity or execution order, and do not necessarily indicate that the items are different.

In describing representative exemplary embodiments, the specification may have presented a method and/or process as a particular sequence of steps. However, to the extent the method or process is not dependent on the specific order of steps described herein, the method or process should not be limited to the specific order of steps described. Other sequences of steps are also possible, as will be appreciated by those of ordinary skill in the art. Therefore, the specific order of the steps set forth in the specification should not be construed as a limitation on the claims. Furthermore, claims directed to the method and/or process should not be limited to performing their steps in the order written, as those skilled in the art will readily understand that such order can be varied and still remain within the spirit and scope of the disclosed embodiments.

When a user watches a live broadcast, he sometimes wishes to record a video of interest so that he can watch it at any time. In one embodiment, the live broadcast is realized through a built-in player in the browser, and online recording can be realized by means of the screen recording function of the browser or special screen recording software. However, the video files obtained by screen recording have a loss of definition, which cannot meet the user's demand for video quality.

To this end, an embodiment of the present disclosure provides a live video recording method, as shown in FIG. 1 , including:

Step 110, processing the live streaming data to obtain original video compression data and decoded audio data;

Step 120, copying the original video compressed data and decoded audio data, and encoding the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data;

Step 130, synthesizing the copied original video compressed data and the audio compressed data in the specified format into a video file.
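As a minimal sketch of the recording process above, the following Python code copies the two inputs, re-encodes the audio copy, and combines both tracks. The names RecordedFile, record, and fake_aac_encode are hypothetical and do not come from the disclosure; the placeholder encoder only tags each frame and performs no real compression, and the output object merely stands in for a real container file.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class RecordedFile:
    """Hypothetical stand-in for the synthesized video file."""
    video_track: List[bytes] = field(default_factory=list)
    audio_track: List[bytes] = field(default_factory=list)


def record(video_packets: List[bytes],
           pcm_frames: List[bytes],
           encode_audio: Callable[[bytes], bytes]) -> RecordedFile:
    """Copy both inputs, re-encode the audio copy, then combine the tracks."""
    # Copy step: duplicate the original compressed video and the decoded
    # audio so the playback path keeps its own data untouched.
    video_copy = [bytes(p) for p in video_packets]
    audio_copy = [bytes(f) for f in pcm_frames]
    # Encoding step: turn the copied decoded audio into the specified format.
    encoded_audio = [encode_audio(f) for f in audio_copy]
    # Synthesis step: the video packets are passed through unchanged,
    # which is why no picture quality is lost.
    return RecordedFile(video_track=video_copy, audio_track=encoded_audio)


def fake_aac_encode(pcm: bytes) -> bytes:
    """Placeholder encoder (assumption): tags the payload, no real compression."""
    return b"AAC:" + pcm
```

Only the audio passes through an encoder in this sketch; the video bytes in the output are identical to the input bytes.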

In a video file recorded by the live video recording method of the embodiment of the present disclosure, the video compressed data is the video compressed data carried in the live stream data (called the original video compressed data), and the audio compressed data is obtained by transcoding the audio compressed data carried in the live stream data (called the original audio compressed data). The method can therefore record live video in high definition, avoid loss of definition, and achieve a video quality far exceeding that of screen recording.

The live video recording method of the embodiment of the present disclosure first decodes the original audio compressed data in the live stream data and then encodes the decoded audio data into audio compressed data in a specified format. Because audio compressed data in the specified format can be synthesized with video compressed data into a video file, the audio can be synthesized with the original video compressed data into a common video file regardless of the format of the original audio compressed data in the live stream data, so that the live video is recorded. The embodiments of the present disclosure are therefore universal and can be applied to recording live video in various formats.

In actual application scenarios, the format of the original video compressed data is usually known (it can also be obtained by decoding). When specifying the format of the audio compressed data obtained by transcoding, that is, the above-mentioned "specified format", a format that can be synthesized with the original video compressed data can be chosen. Of course, the specified format may also be one that can be synthesized with video compressed data in other formats.

In an exemplary embodiment of the present disclosure, the original video compressed data is obtained by decapsulating the received live stream data.

In an exemplary embodiment of the present disclosure, the decoded audio data is obtained by decapsulating the received live stream data to obtain original audio compressed data, and then decoding the original audio compressed data.

Decapsulation separates input live stream data in an encapsulation format into compressed and encoded audio stream data (audio compressed data for short) and compressed and encoded video stream data (video compressed data for short). Encapsulation formats include, for example, MP4, MKV, RMVB, TS, FLV, and AVI. Encapsulation puts compressed and encoded video data and audio data together in a certain format. MP4 is the container format of MPEG-4, a set of compression coding standards for audio and video information formulated by the Moving Picture Experts Group (MPEG) under the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). MKV is the media file format of Matroska, a multimedia encapsulation format also known as a multimedia container. RMVB (RealMedia Variable Bitrate) is a variable bit rate (VBR) extension of the RealMedia multimedia digital container format developed by RealNetworks. TS, short for "Transport Stream" and in full MPEG2-TS, is an encapsulation format used, for example, by high-definition cameras. FLV (Flash Video) is an encapsulation format introduced by Adobe, mainly used in streaming media systems. AVI (Audio Video Interleave) is a multimedia file format introduced by Microsoft.
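The separation performed by decapsulation can be illustrated with a toy example. The sketch below assumes a hypothetical container model in which each packet is a pair of track kind and compressed payload; real formats such as MP4 or FLV have far more elaborate layouts, and the name demux is chosen only for this illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical packet model: (track kind, compressed payload).
Packet = Tuple[str, bytes]


def demux(container: Iterable[Packet]) -> Dict[str, List[bytes]]:
    """Split interleaved packets into one list of payloads per track."""
    tracks: Dict[str, List[bytes]] = defaultdict(list)
    for kind, payload in container:
        # Payloads stay compressed; demuxing only sorts them by track.
        tracks[kind].append(payload)
    return dict(tracks)


interleaved = [("video", b"v0"), ("audio", b"a0"), ("video", b"v1")]
tracks = demux(interleaved)
```

Packet order within each track is preserved, which matters because compressed video frames generally depend on earlier frames.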

The decoding can decode video compressed data into uncompressed video data (decoded video data), and decode audio compressed data into uncompressed audio data (decoded audio data).

Audio compression coding standards include G.711, G.726, AAC, MP3, AC-3, and so on. G.711 is an audio coding method formulated by the Telecommunication Standardization Sector of the International Telecommunication Union (ITU-T), also known as ITU-T G.711. G.726 is an audio coding algorithm defined by ITU-T. MP3 is an audio compression technology whose full name is Moving Picture Experts Group Audio Layer III. AAC (Advanced Audio Coding) is an audio coding technology based on MPEG-2, jointly developed by Fraunhofer IIS, Dolby Laboratories, AT&T, Sony, and other companies to replace the MP3 format. AC-3 (Dolby Digital AC-3) is a multi-channel digital audio system for home theater developed by Dolby, where AC (Audio Coding) refers to digital audio coding. Decoding outputs the compressed and encoded audio data as uncompressed audio sample data, such as PCM data. PCM (Pulse Code Modulation) is an uncompressed raw encoding format.
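As one concrete instance of decoding compressed audio into PCM samples, the sketch below expands G.711 bytes into 16-bit little-endian PCM. It assumes the mu-law variant of G.711 (the disclosure does not say whether mu-law or A-law is used), and the function names are hypothetical.

```python
import struct


def ulaw_to_linear(byte: int) -> int:
    """Expand one G.711 mu-law byte into a signed linear PCM sample."""
    u = ~byte & 0xFF                 # mu-law bytes are stored bit-inverted
    sign = u & 0x80                  # top bit: sign
    exponent = (u >> 4) & 0x07       # next three bits: segment number
    mantissa = u & 0x0F              # low four bits: step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def decode_ulaw(data: bytes) -> bytes:
    """Decode a mu-law byte string into 16-bit little-endian PCM."""
    samples = [ulaw_to_linear(b) for b in data]
    return struct.pack("<%dh" % len(samples), *samples)


pcm = decode_ulaw(b"\xff\x00\x80")   # silence, most negative, most positive
```

Each input byte becomes two output bytes, so the decoded PCM is twice the size of the G.711 data before it is re-encoded.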

Video compression coding standards include H.264/Advanced Video Coding (AVC), H.265/High Efficiency Video Coding (HEVC), H.266/Versatile Video Coding (VVC), MPEG (Moving Picture Experts Group), VC-1 (Video Codec 1), and so on. VC-1 is a video codec developed by Microsoft. Decoding outputs the compressed and encoded video data as uncompressed color data, such as YUV420P or RGB.
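To give a sense of the size of decoded video output, the following sketch computes the byte size of a single YUV420P frame. It assumes 8 bits per sample and one pair of chroma samples per 2x2 block of luma samples; the function name is hypothetical.

```python
def yuv420p_frame_bytes(width: int, height: int) -> int:
    """Size in bytes of one 8-bit YUV420P frame."""
    luma = width * height                      # one Y sample per pixel
    chroma_w = (width + 1) // 2                # U and V are subsampled 2x
    chroma_h = (height + 1) // 2               # in both directions
    return luma + 2 * chroma_w * chroma_h      # Y plane + U plane + V plane


frame_size = yuv420p_frame_bytes(1920, 1080)   # 3110400 bytes per frame
```

At 1920x1080 this is about 3 MB per frame, which is why a recorder that stores the already compressed video packets handles far less data than one that captures decoded frames.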

In an exemplary embodiment of the present disclosure, the live stream data is transmitted over a network, and protocol parsing (de-protocol) is performed on the live stream data before it is decapsulated. Protocol parsing converts streaming media protocol data into data in a standard encapsulation format. When audio and video data is transmitted over a network, streaming media protocols are often used, such as HTTP (Hypertext Transfer Protocol), RTMP (Real-Time Messaging Protocol), or MMS (Microsoft Media Server Protocol). While transmitting audio and video data, these protocols also transmit some signaling data, which includes playback control (play, pause, stop) or descriptions of network status. During protocol parsing, the signaling data is removed and only the audio and video data is kept.

In an exemplary embodiment of the present disclosure, the format of the original video compressed data is H.264, H.265, or H.266, the specified format is the AAC format, and the video file is in the MP4 format. Suppose, in one example, that the format of the original audio compressed data is G.711. Audio compressed data in the G.711 format cannot be directly synthesized with the video compressed data into the video file, so direct recording is not possible. With this embodiment, the audio compressed data in the G.711 format is decoded into audio data in the PCM format, and the PCM audio data is then encoded into audio compressed data in the AAC format, which can be synthesized into the video, for example together with video compressed data in the H.264, H.265, or H.266 format into a video file in the MP4 format. The MP4 format is a universal format that can be played by various players. The method of the embodiment of the present disclosure can therefore be applied to recording live stream data in various formats while it is being played.

In an exemplary embodiment of the present disclosure, the live video recording method further includes:

After receiving the start recording instruction input by the user or when the pre-configured start recording condition is met, start the recording process;

The recording process is stopped after receiving a recording stop instruction input by a user or when a pre-configured condition for stopping recording is met.

Some live video recording methods detect the start time and end time of an object entering the projection area and automatically obtain the video of that time period from the server. Such methods cannot effectively record every scene. In contrast, the embodiments of the present disclosure can start and stop recording based on user instructions: the user can actively record a given video segment, with start and end times defined by the user, which achieves free recording. In scenarios where user input is inconvenient, for example when the user is not in front of the playback device, recording can be started and/or stopped through pre-configured conditions. For example, recording may start automatically when a preset first moment is reached and stop after the recording time reaches a predetermined length. As another example, in a configured anti-theft mode, recording starts when a human face is detected in the video image and stops when no human face has been detected for a set period of time.
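The combination of user instructions and pre-configured conditions can be sketched as a small controller. The class below is a hypothetical illustration covering only the timed case (start at a preset moment, stop after a preset length); the field and method names are assumptions, and a face-detection condition would need an additional input that is not modeled here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RecordingController:
    start_at: Optional[float] = None       # preset start moment, in seconds
    max_duration: Optional[float] = None   # preset recording length, in seconds
    recording: bool = False
    started_at: Optional[float] = None

    def on_user_start(self, now: float) -> None:
        """User pressed 'start recording'."""
        self._start(now)

    def on_user_stop(self) -> None:
        """User pressed 'stop recording'."""
        self.recording = False

    def tick(self, now: float) -> bool:
        """Evaluate the pre-configured conditions; return the current state."""
        if (not self.recording and self.started_at is None
                and self.start_at is not None and now >= self.start_at):
            self._start(now)               # preset start moment reached
        elif (self.recording and self.max_duration is not None
                and now - self.started_at >= self.max_duration):
            self.recording = False         # preset length reached
        return self.recording

    def _start(self, now: float) -> None:
        if not self.recording:
            self.recording = True
            self.started_at = now
```

A controller created with `RecordingController(start_at=10.0, max_duration=5.0)` starts on its own at time 10 and stops at time 15, while one created without arguments records only between explicit user start and stop calls.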

An embodiment of the present disclosure also provides a live video recording system, as shown in FIG. 2 , including a live stream player 1 and a video recorder 2, wherein:

The live stream player 1 is configured to extract the original video compressed data and the original audio compressed data from the live stream data, decode the original video compressed data and the original audio compressed data respectively to obtain the decoded video data and the decoded audio data, and play the decoded video data and the decoded audio data synchronously;

The video recorder 2 is configured to copy the original video compressed data and the decoded audio data, encode the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data, and synthesize the copied original video compressed data and the audio compressed data in the specified format into a video file.

The video file synthesized in this embodiment can be stored in the memory for the user to play at any time.

In a video file recorded by the live video recording system of the embodiment of the present disclosure, the video compressed data is the original video compressed data in the live stream data, and the audio compressed data is obtained by transcoding the original audio compressed data in the live stream data. The system can therefore record live video in high definition, avoid loss of definition, and achieve a video quality far exceeding that of screen recording. In addition, regardless of the format of the original audio compressed data in the live stream data, once the live video recording system of the embodiment of the present disclosure has transcoded it into audio compressed data in the specified format, it can be synthesized with the original video compressed data into a common video file. The system is therefore universal and can be applied to recording live video in various formats.

In an exemplary embodiment of the present disclosure, as shown in FIG. 2, the live video recording system further includes a recording control module 3, configured to control the video recorder to start the recording process after receiving a start recording instruction input by the user or when a pre-configured start recording condition is met, and to control the video recorder to stop the recording process after receiving a stop recording instruction input by the user or when a pre-configured stop recording condition is met. This embodiment allows free recording by the user as well as automatic recording through preset conditions, which is convenient and flexible.

In an exemplary embodiment of the present disclosure, as shown in Figure 3:

The live stream player 1 includes:

The media separation module (demuxer) 11 is configured to decapsulate the live stream data to obtain the original video compressed data and original audio compressed data. The decapsulated original audio compressed data is transmitted to the audio decoding module 13 through the first audio link, and the decapsulated original video compressed data is transmitted to the video decoding module 15 through the first video link. In addition, for the live streaming data encapsulated by the streaming media protocol, the media separation module first de-protocols the live streaming data, and then decapsulates the live streaming data.

The audio decoding module 13 is configured to decode the original compressed audio data to obtain decoded audio data. The decoded audio data can be transmitted to the synchronous playback module 17 via the second audio link.

The video decoding module 15 is configured to decode the original compressed video data to obtain decoded video data. The decoded video data can be transmitted to the synchronous playback module 17 via the second video link.

The synchronous playing module 17 is configured to synchronously play the decoded video data and the decoded audio data. For example, after synchronizing the decoded video data and the decoded audio data, the video data is sent to a display device for rendering, and the audio data is sent to an audio device such as a speaker for playback.

The video recorder 2 includes:

The video duplication module 21 is configured to copy the original video compressed data. The video duplication module 21 can be integrated with the media separation module 11, or it can be arranged separately on the first video link. The video duplication module 21 can buffer the original video compressed data and send a copy to the video track of the audio and video encapsulation module 27.

The audio duplication module 23 is configured to copy the decoded audio data and send the copy to the audio encoding module 25. The audio duplication module 23 can be integrated with the audio decoding module 13, or it can be arranged separately on the second audio link. The audio duplication module 23 can buffer the decoded audio data and send a copy to the audio encoding module 25.

The audio encoding module 25 is configured to encode the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data. The audio compressed data in the specified format can be sent to the audio track of the audio and video encapsulation module 27.

The audio and video encapsulation module (media muxer) 27 is configured to synthesize the copied original video compressed data and the audio compressed data of the specified format into a video file.
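The role of the duplication modules can be sketched as a tap placed on a link: every item continues to the playback path, and a copy goes to the recorder while recording is active. The class below is a hypothetical illustration that serves for either the video or the audio link; its names are assumptions, and the buffering mentioned above is omitted for brevity.

```python
from typing import Callable, List


class DuplicationModule:
    """Forwards each item to playback and, while recording, a copy to the recorder."""

    def __init__(self,
                 playback_sink: Callable[[bytes], None],
                 recorder_sink: Callable[[bytes], None]) -> None:
        self._playback_sink = playback_sink
        self._recorder_sink = recorder_sink
        self.recording = False

    def push(self, item: bytes) -> None:
        # Playback always receives the data, so recording never disturbs it.
        self._playback_sink(item)
        if self.recording:
            # The recorder gets its own copy of the same bytes.
            self._recorder_sink(bytes(item))


played: List[bytes] = []
recorded: List[bytes] = []
tap = DuplicationModule(played.append, recorded.append)
tap.push(b"p0")          # before recording starts: playback only
tap.recording = True
tap.push(b"p1")          # while recording: playback and recorder
```

On the video link the recorder sink would be the video track of the encapsulation module; on the audio link it would be the audio encoding module.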

An embodiment of the present disclosure also provides a live video recording device, as shown in FIG. 4, including a memory 50 and a processor 60. A computer program is stored in the memory 50. When the processor 60 executes the computer program, the live video recording method described in any embodiment of the present disclosure can be implemented.

The processor in the embodiment of the present disclosure may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), a microprocessor, or another conventional processor. The processor may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), discrete logic or another programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or other equivalent integrated or discrete logic circuits, or a combination of the above devices. That is, the processor in the above embodiments may be any processing device or combination of devices that implements the methods, steps, and logic block diagrams disclosed in the embodiments of the present invention. If an embodiment of the present disclosure is implemented partially in software, instructions for the software may be stored in a suitable non-transitory computer-readable storage medium and executed in hardware using one or more processors. The instructions thereby implement the methods of the embodiments of the present disclosure. The term "processor," as used herein, may refer to the foregoing structure or any other structure suitable for implementation of the techniques described herein.

An embodiment of the present disclosure also provides a terminal device, as shown in FIG. 5, including a processor 50, a memory 60 connected to the processor through a bus, a display device 20, an audio device 30, an input device 40, and a network interface 10. The memory 60 stores a live stream playing program 61 and a live video recording program 62; the memory 60 also stores other software such as an operating system, which is not described here. When the processor 50 executes the live stream playing program 61, it can process the live stream data received through the network interface 10 or other interfaces and play it through the display device 20 and the audio device 30; when the processor 50 executes the live video recording program 62, it can implement the live video recording method described in any embodiment of the present disclosure according to instructions from the input device 40.

The live video recording system, device, and terminal device of the embodiments of the present disclosure can implement the live video recording method of the embodiments of the present disclosure, record live video in high definition, avoid loss of definition, and achieve a video quality far exceeding that of screen recording. Moreover, the live video recording system in the embodiment of the present disclosure is universal and can be applied to recording live video in various formats.

An embodiment of the present disclosure also provides a non-transitory computer-readable storage medium. The computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the live video recording method described in any embodiment of the present disclosure can be implemented.

An embodiment of the present disclosure provides a live video recording method. In a live broadcast scenario built on ExoPlayer, Google's open-source media player, the audio data is re-encoded and repackaged with the video data to generate a live video recording file that can conveniently be viewed locally. For example, a user has installed a live camera at home. When the user uses an app on a terminal to remotely view the live video captured by the camera and finds that someone has broken into the home, the user can tap to start recording as evidence. Or, when the user sees a child playing a game and wants to keep the moment as a good memory, the user can likewise remotely start high-definition recording and save the current live video.

Referring to FIG. 3, the embodiment of the present disclosure is based on a live broadcast platform built on ExoPlayer, the player open-sourced by Google. After ExoPlayer obtains the live stream data (an HTTP live source), it performs protocol parsing and decapsulation to obtain the original audio compressed data and the original video compressed data. The platform's audio decoder and video decoder then decode them into PCM audio data and YUV video data respectively, which are rendered after audio and video synchronization. On top of this ExoPlayer-based processing and playback of the live stream data, the live video recording method of the embodiment of the present disclosure re-encodes the sound and synthesizes the original video compressed data and the re-encoded audio compressed data into an MP4 file; video files in other formats are also possible. When re-encoding the sound, this embodiment first decodes the audio compressed data obtained by decapsulating the live stream (which can be in various formats such as G.711), then re-encodes the decoded PCM data into the AAC format and synthesizes it with the video compressed data (such as H.264 or H.265) into a video in the MP4 format. This process adds some CPU load, but it offers better universality and preserves the definition of the recorded video.

To record the live video, the embodiment of the present disclosure intercepts the video link and the audio link. That is, before the original video compressed data (video packet) is sent to the video decoder, a copy of the original video compressed data is sent to the video track of the audio and video encapsulation module (media muxer, also called the media synthesizer). After the original audio compressed data (audio packet) is decoded into PCM data by the audio decoder, the PCM data is encoded into audio compressed data in the AAC format and sent to the audio track of the audio and video encapsulation module.

Because a recorded video must start from an I frame, the audio and video encapsulation module waits until the first I frame of the video arrives and only then starts to synthesize the audio data of the audio track (the audio compressed data in the AAC format) with the video data of the video track (the original video compressed data). The audio data and video data subsequently delivered on the audio track and the video track are sent to the audio and video encapsulation module (media muxer) for synthesis, and synthesis stops when the user presses the stop key.
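The wait for the first I frame can be sketched as a gate in front of the muxer. The class below is a hypothetical stand-in, not a real muxer API: it discards audio and video samples until the first key frame arrives and keeps everything after that; the `is_key_frame` flag is assumed to be supplied by the demuxer.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class GatedMuxer:
    started: bool = False
    samples: List[Tuple[str, bytes]] = field(default_factory=list)

    def write_video(self, payload: bytes, is_key_frame: bool) -> None:
        # The first key frame (I frame) opens the gate.
        if not self.started and is_key_frame:
            self.started = True
        if self.started:
            self.samples.append(("video", payload))

    def write_audio(self, payload: bytes) -> None:
        # Audio delivered before the first I frame is dropped as well,
        # so both tracks begin at the same point.
        if self.started:
            self.samples.append(("audio", payload))


muxer = GatedMuxer()
muxer.write_audio(b"a0")                       # dropped: gate still closed
muxer.write_video(b"p0", is_key_frame=False)   # dropped: not an I frame
muxer.write_video(b"i1", is_key_frame=True)    # opens the gate
muxer.write_audio(b"a1")
muxer.write_video(b"p2", is_key_frame=False)
```

Frames that are not I frames depend on earlier frames for decoding, so a file that began with one could not be displayed correctly from its first frame.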

In the embodiment of the present disclosure, the live stream player module (that is, the ExoPlayer module) is mainly responsible for decapsulating, decoding, and rendering the live data; the audio encoding module can use an open-source algorithm to encode PCM files into AAC files; and the audio and video encapsulation module uses a media synthesizer (media muxer) to encapsulate and synthesize the audio and video.

For users who need to record while watching a live broadcast, the embodiments of the present disclosure record and save the original live video in high definition, avoiding loss of definition and achieving a video quality far exceeding that of screen recording, with the video kept in its original form. The user can also freely choose the recording time and record particular segments. After the live video is recorded, the recorded video can be edited, for example by saving specific scenes in the video as pictures, cutting and splicing different video clips into one video, or adding special effects to the recorded video.

In one or more of the above exemplary embodiments, the functions described may be implemented in hardware, software, firmware or any combination thereof. If implemented in software, the functions may be stored on, or transmitted over, a computer-readable medium as one or more instructions or code and executed by a hardware-based processing unit. Computer-readable media may include computer-readable storage media, which correspond to tangible media such as data storage media, or communication media, which include any medium that facilitates transfer of a computer program from one place to another, e.g., according to a communication protocol. In this manner, a computer-readable medium may generally correspond to a non-transitory tangible computer-readable storage medium or to a communication medium such as a signal or carrier wave. Data storage media may be any available media that can be accessed by one or more computers or one or more processors to retrieve instructions, code and/or data structures for implementation of the techniques described in this disclosure. A computer program product may comprise a computer-readable medium.

By way of example and not limitation, such computer-readable storage media may include RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk or other magnetic storage, flash memory, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer. Moreover, any connection may also be termed a computer-readable medium. For example, if instructions are transmitted from a website, server or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. It should be understood, however, that computer-readable storage media and data storage media do not encompass connections, carrier waves, signals, or other transitory media, but are instead directed to non-transitory tangible storage media. As used herein, disk and disc include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk, and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.

In some aspects, the functionality described herein may be provided within dedicated hardware and/or software modules configured for encoding and decoding, or incorporated in a combined codec. Also, the techniques may be fully implemented in one or more circuits or logic elements.

The technical solutions of the embodiments of the present disclosure may be implemented in a wide variety of apparatuses or devices, including a wireless handset, an integrated circuit (IC), or a set of ICs (e.g., a chipset). Various components, modules, or units are described in the disclosed embodiments to emphasize functional aspects of devices configured to perform the described techniques, but they do not necessarily require realization by different hardware units. Rather, as described above, the various units may be combined in a codec hardware unit or provided by a collection of interoperable hardware units (comprising one or more processors as described above) in combination with suitable software and/or firmware.

Claims

1. A live video recording method, comprising the following recording process:

processing live stream data to obtain original video compressed data and decoded audio data;

copying the original video compressed data and the decoded audio data, and encoding the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data; and

synthesizing the copied original video compressed data and the audio compressed data in the specified format into a video file.

2. The live video recording method as claimed in claim 1, wherein:

the original video compressed data is obtained by decapsulating the received live stream data.

3. The live video recording method as claimed in claim 1 or 2, wherein:

the decoded audio data is obtained by decapsulating the received live stream data to obtain original audio compressed data, and then decoding the original audio compressed data.

4. The live video recording method as claimed in claim 3, wherein:

the format of the original video compressed data is H.264, H.265 or H.266, the specified format is the AAC format, and the video file is in MP4 format.

5. The live video recording method as claimed in claim 1, further comprising:

starting the recording process after receiving a start-recording instruction input by a user or when a pre-configured condition for starting recording is met; and

stopping the recording process after receiving a stop-recording instruction input by the user or when a pre-configured condition for stopping recording is met.

6. A live video recording system, comprising a live stream player and a video recorder, wherein:

the live stream player is configured to extract original video compressed data and original audio compressed data from live stream data, decode the original video compressed data and the original audio compressed data respectively to obtain decoded video data and decoded audio data, and play the decoded video data and the decoded audio data synchronously; and

the video recorder is configured to copy the original video compressed data and the decoded audio data, encode the decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data, and synthesize the original video compressed data and the audio compressed data in the specified format into a video file.

7. The live video recording system as claimed in claim 6, wherein:

the live stream player includes:

a media separation module, configured to decapsulate the live stream data to obtain the original video compressed data and the original audio compressed data;

a video decoding module, configured to decode the original video compressed data to obtain the decoded video data;

an audio decoding module, configured to decode the original audio compressed data to obtain the decoded audio data; and

a synchronous playing module, configured to play the decoded video data and the decoded audio data synchronously.

8. The live video recording system as claimed in claim 7, wherein:

the video recorder includes:

a video copying module, integrated with the media separation module or provided separately, configured to copy the original video compressed data;

an audio copying module, integrated with the video decoding module or provided separately, configured to copy the decoded audio data;

an audio encoding module, configured to encode the copied decoded audio data into audio compressed data in a specified format that can be synthesized with video compressed data; and

an audio and video encapsulation module, configured to synthesize the copied original video compressed data and the audio compressed data in the specified format into a video file.

9. The live video recording system as claimed in claim 6, wherein:

the live video recording system further includes a recording control module, configured to control the video recorder to start the recording process after receiving a start-recording instruction input by a user or when a pre-configured condition for starting recording is met, and to control the video recorder to stop the recording process after receiving a stop-recording instruction input by the user or when a pre-configured condition for stopping recording is met.

10. A live video recording device, comprising a memory and a processor, wherein a computer program is stored in the memory, and when the processor executes the computer program, the live video recording method as claimed in any one of claims 1 to 5 can be implemented.

11. A terminal device, comprising a processor, and a memory, a display device, an audio device, an input device and a network interface that are connected to the processor through a bus, wherein the memory stores a live stream playing program and a live video recording program; when the processor executes the live stream playing program, the live stream data received by the network interface can be processed and played through the display device and the audio device; and when the processor executes the live video recording program, the live video recording method as claimed in any one of claims 1 to 5 can be implemented according to an instruction input through the input device.

12. A non-transitory computer-readable storage medium storing a computer program, wherein, when the computer program is executed by a processor, the live video recording method as claimed in any one of claims 1 to 5 can be implemented.