CN109756683A - Panorama audio-video method for recording, device, storage medium and computer equipment - Google Patents

Panorama audio-video method for recording, device, storage medium and computer equipment Download PDF

Info

Publication number
CN109756683A
CN109756683A CN201711062668.9A CN201711062668A CN109756683A CN 109756683 A CN109756683 A CN 109756683A CN 201711062668 A CN201711062668 A CN 201711062668A CN 109756683 A CN109756683 A CN 109756683A
Authority
CN
China
Prior art keywords
panorama
audio
video data
data
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711062668.9A
Other languages
Chinese (zh)
Inventor
詹五洲
李英才
柳振宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Split Stone Video Technology Co Ltd
Original Assignee
Shenzhen Split Stone Video Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Split Stone Video Technology Co Ltd filed Critical Shenzhen Split Stone Video Technology Co Ltd
Priority to CN201711062668.9A priority Critical patent/CN109756683A/en
Publication of CN109756683A publication Critical patent/CN109756683A/en
Pending legal-status Critical Current

Links

Abstract

The present invention relates to a kind of panorama audio-video method for recording, device, storage medium and computer equipments.Original video data captured by multi-path camera mould group is obtained, original video data is spliced into panoramic video data by video-splicing algorithm in real time.Multipath microphone original audio data collected is obtained, original audio data is synthesized into panorama audio data by built-in panorama sound Ambisonic algorithm in real time.Panoramic video data and panorama audio data are subjected to real-time recording, generate panorama audio, video data.It is acquired using sound of the multipath microphone to photographed scene, then panorama audio data is synthesized by audio algorithm to multipath microphone original audio data collected.The panorama audio data of captured scene is thus acquired, then panorama audio data and panoramic video data are synthesized, just generates panorama audio, video data.To realize panorama immersive effects truly.

Description

Panorama audio-video method for recording, device, storage medium and computer equipment
Technical field
The present invention relates to audio-video processing technology fields, more particularly to a kind of panorama audio-video method for recording, device, deposit Storage media and computer equipment.
Background technique
With the development of virtual reality technology, video camera also experienced from single camera normal image and take multi-cam The technological change of panoramic picture shooting.The panorama camera of existing multiple cameras can be realized the shooting of panoramic video, but very It can accomplish the recording of panorama audio less, the recording technology of panorama audio obviously lags behind the technique for taking of panoramic video.Because passing Panorama audio recording is not implemented in the virtual reality scenario of system, so while realizing panoramic video recording, still cannot achieve Panorama immersive effects truly.
Summary of the invention
Based on this, it is necessary in view of the above technical problems, provide a kind of panorama sound view that can be realized panorama audio recording Frequency method for recording, device, storage medium and computer equipment.
A kind of panorama audio-video method for recording, which comprises
Original video data captured by multi-path camera mould group is obtained, the original video data is passed through into video-splicing Algorithm is spliced into panoramic video data in real time;
Multipath microphone original audio data collected is obtained, the original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time;
The panoramic video data and the panorama audio data are subjected to real-time recording, generate panorama audio, video data.
Original video data captured by the acquisition multi-path camera mould group in one of the embodiments, will be described Original video data is spliced into panoramic video data by video-splicing algorithm in real time, comprising:
Original video data captured by multi-path camera mould group is obtained by video FPGA, by the original video data It is spliced into panoramic video data in real time by video-splicing algorithm, the video FPGA is for handling video data.
Acquisition multipath microphone original audio data collected in one of the embodiments, will be described original Audio data synthesizes panorama audio data by built-in panorama sound Ambisonic algorithm in real time, comprising:
Multipath microphone original audio data collected is obtained by audio FPGA, the original audio data is passed through Built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time, and the audio FPGA is for handling audio data.
It is described in one of the embodiments, that the original video data is spliced helped in real time by video-splicing algorithm Scape video data, comprising:
The distortion data in the original video data is corrected by aberration correction algorithm, the view after generating correction Frequency evidence;
Image registration is carried out to the different video data after the correction by image registration algorithm;
The different images after having carried out image registration are merged by Image Fusion, obtain fused panorama Video data.
The multipath microphone includes No. 64 microphones in one of the embodiments, by the panorama sound Ambisonic Algorithm configuration is that level is 7 ranks, is vertically 3 ranks;
Acquisition multipath microphone original audio data collected passes through the original audio data built-in complete Scape sound Ambisonic algorithm synthesizes panorama audio data in real time, comprising:
By No. 64 microphone collected original audio data in real time, by it is described it is built-in, horizontal be 7 ranks, hang down The straight panorama sound Ambisonic algorithm for 3 ranks synthesizes the panorama audio data of vertical 3 rank of 7 rank of level in real time.
The panoramic video data and the panorama audio data are carried out described in one of the embodiments, real-time It records, after generation panorama audio, video data, further includes:
By the panorama audio, video data plug-flow to server, so that the server solves panorama audio, video data After code, and the decoded panorama audio, video data is issued to terminal in real time.
A kind of panorama audio-video recording system, the system comprises: panorama audio-video recording arrangement, server and terminal, Wherein:
The panorama audio-video recording arrangement will for obtaining original video data captured by multi-path camera mould group The original video data is spliced into panoramic video data by video-splicing algorithm in real time;It is collected to obtain multipath microphone The original audio data is synthesized panorama audio by built-in panorama sound Ambisonic algorithm by original audio data in real time Data;The panoramic video data and the panorama audio data are subjected to real-time recording, panorama audio, video data are generated, by institute Panorama audio, video data plug-flow is stated to server;
The server, it is right for receiving the panorama audio, video data of the panorama audio-video recording arrangement plug-flow The panorama audio, video data real-time perfoming transcoding processing, is sent to the end for the panorama audio, video data after transcoding in real time End;
The terminal is straight for the panorama audio, video data after obtaining the transcoding in real time from the server, and in real time Panorama audio, video data after broadcasting the transcoding.
A kind of panorama audio-video record device, described device include:
Panoramic video data splicing module, for obtaining original video data captured by multi-path camera mould group, by institute It states original video data and panoramic video data is spliced by video-splicing algorithm in real time;
Panorama audio data synthesis module, for obtaining multipath microphone original audio data collected, by the original Beginning audio data synthesizes panorama audio data by built-in panorama sound Ambisonic algorithm in real time;
Panorama audio, video data generation module, it is real for carrying out the panoramic video data and the panorama audio data When record, generate panorama audio, video data.
The panoramic video data splicing module includes: in one of the embodiments,
Distortion correction module, for the distortion data in the original video data to be carried out school by aberration correction algorithm Just, the video data after correction is generated;
Image registration module is matched for carrying out image to the different video data after the correction by image registration algorithm It is quasi-;
Image co-registration module, for being melted by Image Fusion to the different images after having carried out image registration It closes, obtains fused panoramic video data.
A kind of computer readable storage medium, is stored thereon with computer program, realization when which is executed by processor Following steps:
Original video data captured by multi-path camera mould group is obtained, the original video data is passed through into video-splicing Algorithm is spliced into panoramic video data in real time;
Multipath microphone original audio data collected is obtained, the original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time;
The panoramic video data and the panorama audio data are subjected to real-time recording, generate panorama audio, video data.
A kind of computer equipment, the computer equipment include memory, processor and are stored on the memory simultaneously The computer program that can be run on the processor, the processor perform the steps of when executing the computer program
Original video data captured by multi-path camera mould group is obtained, the original video data is passed through into video-splicing Algorithm is spliced into panoramic video data in real time;
Multipath microphone original audio data collected is obtained, the original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time;
The panoramic video data and the panorama audio data are subjected to real-time recording, generate panorama audio, video data.
Above-mentioned panorama audio-video method for recording, device, storage medium and computer equipment obtain multi-path camera mould group institute Original video data is spliced into panoramic video data by video-splicing algorithm by the original video data of shooting in real time.It obtains Multipath microphone original audio data collected, original audio data is real by built-in panorama sound Ambisonic algorithm Shi Hecheng panorama audio data.Panoramic video data and panorama audio data are subjected to real-time recording, generate panorama audio-video number According to.Panorama audio-video recording arrangement is acquired the sound of photographed scene using multipath microphone, then to multipath microphone institute The original audio data of acquisition synthesizes panorama audio data by built-in panorama sound Ambisonic algorithm in real time.In panorama sound Built-in panorama sound Ambisonic algorithm, the synthesis of panorama audio can be realized in machine, do not need in video recording device It carries computer equipment and carries out the outer panorama audio synthesis of machine, therefore is convenient and easy.Thus acquire captured scene Panorama audio data, then panorama audio data is subjected to real-time recording with the panoramic video data spliced in real time, it just generates complete Scape audio, video data.From vision and acoustically all realize panorama immersive effects truly.
Detailed description of the invention
Fig. 1 is the applied environment figure of panorama audio-video method for recording in one embodiment;
Fig. 2 is the flow chart of panorama audio-video method for recording in one embodiment;
Fig. 3 is the flow chart of panorama audio-video method for recording in one embodiment;
Fig. 4 is the process that original video data is spliced into panoramic video data method in Fig. 2 by video-splicing algorithm Figure;
Fig. 5 is the flow chart of panorama audio-video method for recording in another embodiment;
Fig. 6 is the structural schematic diagram of panorama audio-video record device in one embodiment;
Fig. 7 is the structural schematic diagram of panoramic video data splicing module in Fig. 6;
Fig. 8 is the structural schematic diagram of panorama audio-video record device in another embodiment.
Specific embodiment
In order to make the foregoing objectives, features and advantages of the present invention clearer and more comprehensible, with reference to the accompanying drawing to the present invention Specific embodiment be described in detail.Many details are explained in the following description in order to fully understand this hair It is bright.But the invention can be embodied in many other ways as described herein, those skilled in the art can be not Similar improvement is done in the case where violating intension of the present invention, therefore the present invention is not limited to the specific embodiments disclosed below.
Unless otherwise defined, all technical and scientific terms used herein and belong to technical field of the invention The normally understood meaning of technical staff is identical.Term as used herein in the specification of the present invention is intended merely to description tool The purpose of the embodiment of body, it is not intended that in the limitation present invention.Each technical characteristic of above embodiments can carry out arbitrary group It closes, for simplicity of description, combination not all possible to each technical characteristic in above-described embodiment is all described, however, As long as there is no contradiction in the combination of these technical features, all should be considered as described in this specification.
Panorama audio-video method for recording provided in an embodiment of the present invention can be applied in environment as shown in Figure 1.With reference to Fig. 1 Shown, panorama audio-video recording arrangement 110 is connect by network with server 120, and terminal 130 is also by network and server 120 connections.Panorama audio-video recording arrangement 110 includes multiple for acquiring multiple cameras of different angle video, comprising using In the multiple microphones for acquiring multiple angle audios.The original video data of acquisition is spliced into aphorama frequency by FPGA in real time According to the original audio data of acquisition is synthesized panoramic video data by FPGA in real time.Again by panoramic video data and panorama audio number It is recorded according to processor is sent to, ultimately generates panorama audio, video data.Panorama audio, video data can be passed through into network (example Such as Ethernet Ethernet) it is uploaded to server 120, terminal 130 obtains panorama audio, video data in real time from server, so that it may To realize online live streaming.Panorama audio, video data can also be stored in memory 140 (such as SD card).
In one embodiment, as shown in Fig. 2, providing a kind of panorama audio-video method for recording, this method comprises:
Step 202, original video data captured by multi-path camera mould group is obtained, original video data is passed through into video Stitching algorithm is spliced into panoramic video data in real time.
Camera module refers to may be implemented the electronic equipment of shooting photo, specifically, can be camera.The present invention Panorama audio-video recording arrangement in embodiment has used multiple cameras to be used to the video data that pans.For example, using 9 Camera is shot, and is furnishing 8 cameras with captured scene horizontal direction, the angle that each camera is put is not Together, some special angle of scene is shot simultaneously respectively, enables the panorama for taking scene.With it is captured Place 1 camera in scene vertical direction, that is, scene top.It thus may be implemented to carry out pan-shot to scene.It refers to Fig. 3, FPGA (Field-Programmable Gate Array, i.e. field programmable gate array) pass through LVDS (Low Voltage Differential Signaling, low-voltage differential signal transmission) each moment multiple camera institutes of interface acquisition The original video data of the original video data of shooting, the shooting of each camera is known as original video data all the way.LVDS is A kind of new technique meeting current high-performance data transmission application.So FPGA obtains multi-path camera in synchronization institute The original video data of shooting, then original video data of the above-mentioned multi-path camera captured by synchronization is spelled by video It connects algorithm to be spliced in real time, just generates the panoramic video data of synchronization.Pass through LVDS interface again for aphorama frequency According to being sent to processor, such as arm processor.
Step 204, multipath microphone original audio data collected is obtained, original audio data is passed through built-in complete Scape sound Ambisonic algorithm synthesizes panorama audio data in real time.
Microphone refers to the energy conversion device that voice signal is converted to electric signal, also referred to as microphone or microphone. In embodiments of the present invention, panorama audio-video recording arrangement takes multiple microphones to obtain multi-path audio-frequency data.For example, using 64 Road microphone is evenly distributed on the surface of sphere.Wherein the sound chamber of each microphone is directed toward outer surface of spheroid away from the centre of sphere, is used for Acquire the audio signal of different directions.Built-in panorama sound Ambisonic algorithm in panorama audio-video recording arrangement.FPGA passes through ADC acquisition chip acquires the multichannel original audio data that multiple microphones of each moment obtain, and original audio data is passed through complete Built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time in scape audio-video recording arrangement, then passes through SDIO Panorama audio data is sent to place by (Secure Digital Input and Output, secure digital input and output) interface Manage device, such as arm processor.VR camera on the market, usually monophonic or the microphone array being made of 4 microphones To carry out audio recording.When carrying out synthesis panorama audio data, VR camera generallys use external computer equipment to carry out Panorama audio data is synthesized, and these external computer equipments are synthesized using 1 rank Ambisonic algorithm. It needs to carry panorama audio outside computer equipment carry out machine in studio to synthesize, it is highly inconvenient.
Therefore, in the embodiment of the present invention, the built-in panorama sound Ambisonic algorithm in panorama audio-video recording arrangement, The external computer equipment with Ambisonic algorithm is not needed, the synthesis of real time panoramic audio can be realized in machine.And it is built-in Be high-order Ambisonic algorithm, for the angle in space, the more high sound field information collected of Ambisonic order more Accurately, also more can completely reduction Target Sound Field acoustic information.Therefore, the Ambisonic algorithm of high-order is compared to low order Ambisonic algorithm has higher spatial resolution.
Step 206, panoramic video data and panorama audio data are subjected to real-time recording, generate panorama audio, video data.
The panoramic video data of synchronization and panorama audio data are sent on processor and are handled in real time by FPGA, It can be arm processor.After the process of processing is specifically, arm processor obtains panoramic video data, using H.264 marking Standard is compressed, and H.264 standard is a high compression digital video coding-coding device standard.Arm processor obtains panorama sound Frequency is compressed, AAC (Advanced Audio Coding, Advanced Audio Coding) standard is after using AAC standard A kind of audio decoding techniques based on MPEG-2.Compressed audio, video data file is all put into MP4 container and generates MP4 text Part.Compressed audio-video document can also be all put into AVI container and generate avi file.It is, of course, also possible to use other lifes At the mode of audio-video document.Referring again to Fig. 3, handled in real time by processor generate panorama audio-video document it Afterwards, panorama audio-video document can be stored in memory (such as SD card), panorama audio, video data can also be passed through into net Network (such as Ethernet Ethernet) is uploaded to server and realizes online live streaming.
In the present embodiment, panorama audio-video recording arrangement is acquired the sound of photographed scene using multipath microphone, Panorama sound is synthesized in real time by built-in panorama sound Ambisonic algorithm to multipath microphone original audio data collected again Frequency evidence.Built-in panorama sound Ambisonic algorithm, can be realized panorama in machine in panorama audio-video recording arrangement Audio synthesis does not need to carry the outer panorama audio synthesis of computer equipment progress machine, therefore convenient and easy.Thus acquire The panorama audio data of captured scene, then panorama audio data recorded in real time with the panoramic video data spliced in real time System, just generates panorama audio, video data.From vision and acoustically all realize panorama immersive effects truly.
In one embodiment, original video data captured by the acquisition multi-path camera mould group, will be described original Video data is spliced into panoramic video data by video-splicing algorithm in real time, comprising:
Original video data captured by multi-path camera mould group is obtained by video FPGA, by the original video data It is spliced into panoramic video data in real time by video-splicing algorithm, the video FPGA is for handling video data.
As shown in figure 3, handling video data and audio data respectively using different FPGA.Because of the number of a FPGA Limited according to processing capacity, in order to avoid same traditional FPGA handles audio and video data simultaneously, a large amount of data are caused Joining quality the problem of being difficult to ensure.Therefore, it is configured with one or more video FPGA for panorama audio-video recording arrangement, specially Door is for handling video data.Also still high quality can be realized with efficient process when the video data volume is larger Splicing in real time.Video FPGA obtains original video data of the multi-path camera captured by synchronization, then by above-mentioned multichannel Original video data of the camera captured by synchronization is spliced in real time by video-splicing algorithm, is just generated same The panoramic video data at moment.Pass through LVDS interface again for panoramic video data transmission to processor, such as arm processor.
In one embodiment, acquisition multipath microphone original audio data collected, by the original audio Data synthesize panorama audio data by built-in panorama sound Ambisonic algorithm in real time, comprising:
Multipath microphone original audio data collected is obtained by audio FPGA, the original audio data is passed through Built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time, and the audio FPGA is for handling audio data.
As shown in figure 3, handling video data and audio data respectively using different FPGA.Because of the number of a FPGA Limited according to processing capacity, in order to avoid same traditional FPGA handles audio and video data simultaneously, a large amount of data are caused Joining quality the problem of being difficult to ensure.Therefore, it is configured with one or more audio FPGA for panorama audio-video recording arrangement, specially Door in the case where reaching input while multi-path audio-frequency data, realizes the real-time splicing of high quality for handling audio data. Audio FPGA acquires the multichannel original audio data that multiple microphones of each moment obtain by ADC acquisition chip, by original sound Frequency synthesizes panorama audio data according to by panorama sound Ambisonic algorithm built-in in panorama audio-video recording arrangement in real time, Pass through SDIO (Secure Digital Input and Output, secure digital input and output) interface again for panorama audio number According to being sent to processor, such as arm processor.
In one embodiment, video-splicing algorithm includes that aberration correction algorithm, image registration algorithm and image co-registration are calculated Method.
The camera of video of panning generally will use fish eye lens, and fish eye lens belongs to one of bugeye lens The range that reaches or can see beyond human eye is made every effort at special lens, its visual angle.The maximum effect of fish eye lens is visual angle model It encloses greatly, visual angle generally can reach 220 ° or 230 °, this is shooting at close range a wide range of Landscape Creation condition.But fish eye lens There is very big distortion in the image of shooting, need that fault image is converted to normal picture by aberration correction algorithm.Distortion school Normal operation method is a kind of method that can be converted into normal picture to the image for having distortion captured by fish eye lens.
Image Fusion is a kind of characteristic point by finding different pictures, is the place with reference to alignment image with characteristic point Reason method.
Image Fusion is the method that can be merged different images.It can will not by Image Fusion Image with the different perspectives of synchronization captured by camera is merged, and a complete panoramic picture is fused into.Figure As blending algorithm includes the related algorithms such as color interpolation technology or multiresolution spline.
In the present embodiment, original video data captured by multi-path camera mould group is carried out by aberration correction algorithm Image rectification corrects the image of deformity.Image is aligned by image registration algorithm again, has found different figures Corresponding relationship as between, different images are aligned.Finally, image is merged by Image Fusion, thus Image captured by different cameras is merged.The full-view video image of high quality is ultimately generated.
In one embodiment, as shown in figure 4, original video data is spliced into panorama by video-splicing algorithm in real time Video data, comprising:
Step 302, the distortion data in original video data is corrected by aberration correction algorithm, after generating correction Video data.
The camera of video of panning generally will use fish eye lens, and fish eye lens belongs to one of bugeye lens The range that reaches or can see beyond human eye is made every effort at special lens, its visual angle.The maximum effect of fish eye lens is visual angle model It encloses greatly, visual angle generally can reach 220 ° or 230 °, this is shooting at close range a wide range of Landscape Creation condition.But fish eye lens There is very big distortion in the image of shooting, fault image can be converted to normal picture by aberration correction algorithm.
Step 304, image registration is carried out to the different video data after correction by image registration algorithm.
To image of the different cameras captured by synchronization, need to find the characteristic point of different images respectively, with Characteristic point is that different images are aligned by reference.Specifically, realizing alignment by image registration algorithm.
Step 306, the different images after having carried out image registration are merged by Image Fusion, is merged Panoramic video data afterwards.
Finally, the splicing seams between image and image are passed through after by the alignment of different images captured by synchronization The Image Fusions such as color interpolation technology or multiresolution spline are merged.It is complete after synchronization fusion to obtain The full-view video image of different moments is played out with certain frame number, just constitutes full-view video image by scape image.
In the present embodiment, original video data captured by multi-path camera mould group is carried out by aberration correction algorithm Image rectification corrects the image of deformity.Image is aligned by image registration algorithm again, has found different figures Corresponding relationship as between, different images are aligned.Finally, image is merged by Image Fusion, thus Image captured by different cameras is merged.The full-view video image of high quality is finally generated in real time.
In one embodiment, multipath microphone includes No. 64 microphones, is water by panorama sound Ambisonic algorithm configuration Put down for 7 ranks, vertically be 3 ranks;
Multipath microphone original audio data collected is obtained, original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time, comprising:
By No. 64 microphones collected original audio data in real time, by it is built-in, horizontal be 7 ranks, be vertically 3 ranks Panorama sound Ambisonic algorithm synthesizes the panorama audio data of vertical 3 rank of 7 rank of level in real time.
Ambisonic system is to propose that it had used X, tri- difference of Y, Z in 1974 by Michael A.Gerzon It is directed toward x, y, for " 8 " the font directional microphone and 1 W in z-axis direction without directional microphone, composition can pick up sound three-dimensional space Between information recording system., under spherical coordinates state, this 4 microphone directional properties have strict difinition
W=0.707
Using it is a certain number of, make the loudspeaker that is evenly arranged around listener and do sound reproduction, and this 4 microphones are picked up The number of winning the confidence carries out the mixing of different proportion, the surrounding sound effect of reproducible three-dimensional space.
Specifically, the surface of sphere is evenly distributed on using No. 64 microphones in embodiments of the present invention, microphone sound Chamber deviates from the centre of sphere and is directed toward outer surface of spheroid, and each microphone is used to acquire sound.FPGA passes through ADC (Analog-to- Digital Converter, analog to digital conversion circuit) acquisition chip acquires microphone data in real time.By Ambisonic algorithm configuration For level be 7 ranks, vertically be 3 ranks, then FPGA will collected 64 tunnel microphone data by Ambisonic algorithm just in real time life At the Fuma format panorama audio data of vertical 3 rank of 7 rank of level.Fuma format is by the collected audio data warp of microphone Cross a kind of audio format generated after Ambsonic algorithm process.In general, the collected audio data of microphone is passed through 1 rank After Ambisonic algorithm process, 4 tunnel audios are exported;9 tunnel audios are then exported after 2 rank Ambisonic algorithm process;By 3 16 tunnel audios are then exported after rank Ambisonic algorithm process.I.e. by Ambisonic algorithm output audio track number be (order+ 1) square.
It can certainly be 3 rank of level, vertical 3 rank by Ambisonic algorithm configuration, then FPGA is by collected 64 tunnel wheat Gram wind data just generates the Fuma format panorama audio data of vertical 3 rank of 3 rank of level by Ambisonic algorithm.It can also be with Ambisonic algorithm is only configured to 3 rank of level, then FPGA calculates collected 64 tunnel microphone data by Ambisonic Method just generates the Fuma format panorama audio data of 3 rank of level.
In the present embodiment, by Ambisonic algorithm configuration be level be 7 ranks, vertically be 3 ranks, then FPGA will be collected 64 tunnel microphone datas the Fuma format panorama audio of vertical 3 rank of 7 rank of level is just generated by Ambisonic algorithm in real time Data.Ambisonic algorithm may be implemented to synthesize panorama audio data in real time to different microphones sound collected.Panorama sound Frequency just generates panorama audio, video data, joined panorama audio data according to being synthesized in real time with panoramic video data again To realize panorama immersive effects truly.
In one embodiment, as shown in figure 5, panoramic video data and panorama audio data are carried out real-time recording, After generation panorama audio, video data, further includes:
Step 208, by panorama audio, video data plug-flow to server, so that server solves panorama audio, video data After code, and decoded panorama audio, video data is issued to terminal in real time.
In the present embodiment, the panorama audio, video data after processor synthesizes in real time is uploaded to service by network On device, because the panorama audio, video data uploaded is the file by coding, server needs to be decoded it, and real When to terminal issue decoded panorama audio, video data, so as to directly play out at the terminal.Terminal is from server On obtain decoded panorama audio, video data in real time, and played in real time at the terminal.Thereby realize panorama sound view The live streaming plug-flow function of frequency.Specifically, terminal can be that by the novel VR equipment of panorama audio-video.
In one embodiment, a kind of panorama audio-video recording system, referring to Figure 1, system include: the record of panorama audio-video Control equipment 110, server 120 and terminal 130, in which:
Panorama audio-video recording arrangement 110 will be former for obtaining original video data captured by multi-path camera mould group Beginning video data is spliced into panoramic video data by video-splicing algorithm in real time;Obtain multipath microphone original sound collected Original audio data is synthesized panorama audio data by built-in panorama sound Ambisonic algorithm by frequency evidence in real time;By panorama Video data and panorama audio data carry out real-time recording, generate panorama audio, video data, extremely by panorama audio, video data plug-flow Server.
Server 120, for receiving the panorama audio, video data of panorama audio-video recording arrangement plug-flow, to panorama audio-video The processing of data real-time perfoming transcoding, is sent to terminal for the panorama audio, video data after transcoding in real time.
Terminal 130, for the panorama audio, video data after obtaining transcoding in real time from server, and after real-time live broadcast transcoding Panorama audio, video data.
In one embodiment, as shown in fig. 6, providing a kind of panorama audio-video record device 600, which includes: Panoramic video data splicing module 602, panorama audio data synthesis module 604 and panorama audio, video data generation module 606.
Panoramic video data splicing module 602 will for obtaining original video data captured by multi-path camera mould group Original video data is spliced into panoramic video data by video-splicing algorithm in real time.
Panorama audio data synthesis module 604 will be original for obtaining multipath microphone original audio data collected Audio data synthesizes panorama audio data by built-in panorama sound Ambisonic algorithm in real time.
Panorama audio, video data generation module 606, for being recorded panoramic video data and panorama audio data in real time System generates panorama audio, video data.
In one embodiment, panoramic video data splicing module 602 is also used to obtain multichannel camera shooting by video FPGA The original video data is spliced into aphorama by video-splicing algorithm by original video data captured by head mould group in real time Frequency evidence, the video FPGA is for handling video data.
In one embodiment, panorama audio, video data generation module 606 is also used to obtain multichannel by audio FPGA and pass Sound device original audio data collected, the original audio data is real-time by built-in panorama sound Ambisonic algorithm Panorama audio data is synthesized, the audio FPGA is for handling audio data.
In one embodiment, as shown in fig. 7, panoramic video data splicing module 602 includes:
Distortion correction module 602a, for the distortion data in original video data to be carried out school by aberration correction algorithm Just, the video data after correction is generated;
Image registration module 602b matches for carrying out image to the different video data after correction by image registration algorithm It is quasi-;
Image co-registration module 602c, for being carried out by Image Fusion to the different images after having carried out image registration Fusion, obtains fused panoramic video data.
In one embodiment, panorama audio data synthesis module 604 is also used to No. 64 microphones are collected in real time Original audio data, by it is built-in, horizontal be 7 ranks, be vertically that the panorama sound Ambisonic algorithms of 3 ranks synthesizes water in real time The panorama audio data of flat vertical 3 rank of 7 ranks.
In one embodiment, as shown in figure 8, additionally providing a kind of panorama audio-video record device 600, which also wraps Live streaming plug-flow module 608 is included, which is used for panorama audio, video data plug-flow to server, so that server regards panorama sound After frequency evidence is decoded, and decoded panorama audio, video data is issued to terminal in real time.
In one embodiment, a kind of computer readable storage medium is additionally provided, computer program is stored thereon with, it should It is performed the steps of when program is executed by processor and obtains original video data captured by multi-path camera mould group, it will be original Video data is spliced into panoramic video data by video-splicing algorithm in real time;Obtain multipath microphone original audio collected Original audio data is synthesized panorama audio data by built-in panorama sound Ambisonic algorithm by data in real time;By aphorama Frequency evidence and panorama audio data carry out real-time recording, generate panorama audio, video data.
In one embodiment, it also performs the steps of when above procedure is executed by processor and is obtained by video FPGA Original video data captured by multi-path camera mould group splices the original video data by video-splicing algorithm in real time At panorama video data, the video FPGA is for handling video data.
In one embodiment, it also performs the steps of when above procedure is executed by processor and is obtained by audio FPGA Multipath microphone original audio data collected calculates the original audio data by built-in panorama sound Ambisonic Method synthesizes panorama audio data in real time, and the audio FPGA is for handling audio data.
In one embodiment, it is also performed the steps of when above procedure is executed by processor and passes through aberration correction algorithm Distortion data in original video data is corrected, the video data after generating correction;By image registration algorithm to school Different video data after just carry out image registration;By Image Fusion to the different images after having carried out image registration into Row fusion, obtains fused panoramic video data.
In one embodiment, it is also performed the steps of when above procedure is executed by processor No. 64 microphones are real-time Collected original audio data, by it is built-in, horizontal be 7 ranks, vertically be 3 ranks panorama sound Ambisonic algorithm it is real-time Synthesize the panorama audio data of vertical 3 rank of 7 rank of level.
In one embodiment, it is also performed the steps of when above procedure is executed by processor by panorama audio, video data Plug-flow is to server, so that after server is decoded panorama audio, video data, and issues in real time to terminal decoded complete Scape audio, video data.
In one embodiment, additionally provide a kind of computer equipment, which includes memory, processor and The computer program that can be run on a memory and on a processor is stored, processor realizes following step when executing computer program It is rapid: original video data captured by multi-path camera mould group is obtained, original video data is real-time by video-splicing algorithm It is spliced into panoramic video data;Multipath microphone original audio data collected is obtained, original audio data is passed through built-in Panorama sound Ambisonic algorithm synthesize panorama audio data in real time;Panoramic video data and panorama audio data are carried out real When record, generate panorama audio, video data.
In one embodiment, it also performs the steps of when above-mentioned processor executes computer program through video FPGA Original video data captured by multi-path camera mould group is obtained, the original video data is real-time by video-splicing algorithm Panoramic video data are spliced into, the video FPGA is for handling video data.
In one embodiment, it also performs the steps of when above-mentioned processor executes computer program through audio FPGA Multipath microphone original audio data collected is obtained, the original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time, and the audio FPGA is for handling audio data.
In one embodiment, it is also performed the steps of when above-mentioned processor executes computer program and passes through distortion correction Distortion data in original video data is corrected by algorithm, the video data after generating correction;Pass through image registration algorithm Image registration is carried out to the different video data after correction;The difference after having carried out image registration is schemed by Image Fusion As being merged, fused panoramic video data are obtained.
In one embodiment, it also performs the steps of when above-mentioned processor executes computer program by No. 64 microphones Real-time collected original audio data, by it is built-in, horizontal be 7 ranks, vertically be 3 ranks panorama sound Ambisonic algorithm The panorama audio data of vertical 3 rank of 7 rank of level is synthesized in real time.
In one embodiment, it also performs the steps of when above-mentioned processor executes computer program by panorama audio-video Data plug-flow is to server, so that after server is decoded panorama audio, video data, and after issuing decoding to terminal in real time Panorama audio, video data.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, program can be stored in a non-volatile computer-readable storage In medium, in the embodiment of the present invention, which be can be stored in the storage medium of computer system, and by the computer system In at least one processor execute, to realize including process such as the embodiment of above-mentioned each method.Wherein, storage medium can be Magnetic disk, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance Shield all should be considered as described in this specification.
The embodiments described above only express several embodiments of the present invention, and the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to protection of the invention Range.Therefore, the scope of protection of the patent of the invention shall be subject to the appended claims.

Claims (10)

1. a kind of panorama audio-video method for recording, which comprises
Original video data captured by multi-path camera mould group is obtained, the original video data is passed through into video-splicing algorithm It is spliced into panoramic video data in real time;
Multipath microphone original audio data collected is obtained, the original audio data is passed through into built-in panorama sound Ambisonic algorithm synthesizes panorama audio data in real time;
The panoramic video data and the panorama audio data are subjected to real-time recording, generate panorama audio, video data.
2. the method according to claim 1, wherein original view captured by the acquisition multi-path camera mould group The original video data is spliced into panoramic video data by video-splicing algorithm by frequency evidence in real time, comprising:
Original video data captured by multi-path camera mould group is obtained by video FPGA, the original video data is passed through Video-splicing algorithm is spliced into panoramic video data in real time, and the video FPGA is for handling video data.
3. the method according to claim 1, wherein the acquisition multipath microphone original audio number collected According to the original audio data is synthesized panorama audio data by built-in panorama sound Ambisonic algorithm in real time, comprising:
Multipath microphone original audio data collected is obtained by audio FPGA, the original audio data is passed through built-in Panorama sound Ambisonic algorithm synthesize panorama audio data in real time, the audio FPGA is for handling audio data.
4. the method according to claim 1, wherein described calculate the original video data by video-splicing Method is spliced into panoramic video data in real time, comprising:
The distortion data in the original video data is corrected by aberration correction algorithm, the video counts after generating correction According to;
Image registration is carried out to the different video data after the correction by image registration algorithm;
The different images after having carried out image registration are merged by Image Fusion, obtain fused panoramic video Data.
5., will be described the method according to claim 1, wherein the multipath microphone includes No. 64 microphones Panorama sound panorama Ambisonic algorithm configuration is that level is 7 ranks, is vertically 3 ranks;
The original audio data is passed through built-in panorama sound by acquisition multipath microphone original audio data collected Panorama Ambisonic algorithm synthesizes panorama audio data in real time, comprising:
By No. 64 microphone collected original audio data in real time, by it is described it is built-in, horizontal be 7 ranks, be vertically 3 The panorama sound panorama Ambisonic algorithm of rank synthesizes the panorama audio data of vertical 3 rank of 7 rank of level in real time.
6. the method according to claim 1, wherein described by the panoramic video data and the panorama sound Frequency according to carry out real-time recording, generate panorama audio, video data after, further includes:
By the panorama audio, video data plug-flow to server, so that the server is decoded panorama audio, video data Afterwards, and in real time the decoded panorama audio, video data is issued to terminal.
7. a kind of panorama audio-video recording system, which is characterized in that the system comprises: panorama audio-video recording arrangement, service Device and terminal, in which:
The panorama audio-video recording arrangement will be described for obtaining original video data captured by multi-path camera mould group Original video data is spliced into panoramic video data by video-splicing algorithm in real time;It is collected original to obtain multipath microphone The original audio data is synthesized panorama audio data by built-in panorama sound Ambisonic algorithm by audio data in real time; The panoramic video data and the panorama audio data are subjected to real-time recording, generate panorama audio, video data, it will be described complete Scape audio, video data plug-flow is to server;
The server, for receiving the panorama audio, video data of the panorama audio-video recording arrangement plug-flow, to described The processing of panorama audio, video data real-time perfoming transcoding, is sent to the terminal for the panorama audio, video data after transcoding in real time;
The terminal, for the panorama audio, video data after obtaining the transcoding in real time from the server, and real-time live broadcast institute Panorama audio, video data after stating transcoding.
8. a kind of panorama audio-video record device, which is characterized in that described device includes:
Panoramic video data splicing module, for obtaining original video data captured by multi-path camera mould group, by the original Beginning video data is spliced into panoramic video data by video-splicing algorithm in real time;
Panorama audio data synthesis module, for obtaining multipath microphone original audio data collected, by the original sound Frequency synthesizes panorama audio data according to by built-in panorama sound Ambisonic algorithm in real time;
Panorama audio, video data generation module, for being recorded the panoramic video data and the panorama audio data in real time System generates panorama audio, video data.
9. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is held by processor The panorama audio-video method for recording as described in any one of power 1 to 6 is realized when row.
10. a kind of computer equipment, the computer equipment includes memory, processor and is stored on the memory and can The computer program run on the processor, which is characterized in that the processor is realized when executing the computer program Panorama audio-video method for recording as described in weighing any one of 1 to 6.
CN201711062668.9A 2017-11-02 2017-11-02 Panorama audio-video method for recording, device, storage medium and computer equipment Pending CN109756683A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711062668.9A CN109756683A (en) 2017-11-02 2017-11-02 Panorama audio-video method for recording, device, storage medium and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711062668.9A CN109756683A (en) 2017-11-02 2017-11-02 Panorama audio-video method for recording, device, storage medium and computer equipment

Publications (1)

Publication Number Publication Date
CN109756683A true CN109756683A (en) 2019-05-14

Family

ID=66398396

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711062668.9A Pending CN109756683A (en) 2017-11-02 2017-11-02 Panorama audio-video method for recording, device, storage medium and computer equipment

Country Status (1)

Country Link
CN (1) CN109756683A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110136392A (en) * 2019-05-31 2019-08-16 深圳中物智建科技有限公司 A kind of construction site safety defense monitoring system and method
CN114513698A (en) * 2020-11-16 2022-05-17 中国联合网络通信集团有限公司 Panoramic sound playing system and method

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006052188A1 (en) * 2004-11-12 2006-05-18 Catt (Computer Aided Theatre Technique) Surround sound processing arrangement and method
US20080266394A1 (en) * 2006-02-23 2008-10-30 Johan Groenenboom Audio Module for a Video Surveillance System, Video Surveillance System and Method for Keeping a Plurality of Locations Under Surveillance
CN102326417A (en) * 2008-12-30 2012-01-18 庞培法布拉大学巴塞隆纳媒体基金会 Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction
CN102510436A (en) * 2011-10-17 2012-06-20 河海大学常州校区 Device and method for detecting high-speed tiny target online in real time by simulating fly vision
US20120162362A1 (en) * 2010-12-22 2012-06-28 Microsoft Corporation Mapping sound spatialization fields to panoramic video
CN103250207A (en) * 2010-11-05 2013-08-14 汤姆逊许可公司 Data structure for higher order ambisonics audio data
CN203193773U (en) * 2013-04-16 2013-09-11 宁波高新区阶梯科技有限公司 Multimedia panoramic recording system
CN103634561A (en) * 2012-08-21 2014-03-12 徐丙川 Conference communication device and system
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN105072557A (en) * 2015-08-11 2015-11-18 北京大学 Loudspeaker environment self-adaptation calibrating method of three-dimensional surround playback system
CN106162206A (en) * 2016-08-03 2016-11-23 北京疯景科技有限公司 Panorama recording, player method and device
CN106210990A (en) * 2016-07-13 2016-12-07 北京时代拓灵科技有限公司 A kind of panorama sound audio processing method
CN106851482A (en) * 2017-03-24 2017-06-13 北京时代拓灵科技有限公司 A kind of panorama sound loudspeaker body-sensing real-time interaction system and exchange method
CN106992959A (en) * 2016-11-01 2017-07-28 深圳市圆周率软件科技有限责任公司 A kind of 3D panoramas audio frequency and video live broadcast system and audio/video acquisition method
CN106993249A (en) * 2017-04-26 2017-07-28 深圳创维-Rgb电子有限公司 A kind of processing method and processing device of the voice data of sound field
CN107026959A (en) * 2016-02-01 2017-08-08 杭州海康威视数字技术股份有限公司 A kind of image-pickup method and image capture device
WO2017181777A1 (en) * 2016-04-19 2017-10-26 北京金山安全软件有限公司 Panoramic live video streaming method, device, system, and video source control apparatus

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006052188A1 (en) * 2004-11-12 2006-05-18 Catt (Computer Aided Theatre Technique) Surround sound processing arrangement and method
US20080266394A1 (en) * 2006-02-23 2008-10-30 Johan Groenenboom Audio Module for a Video Surveillance System, Video Surveillance System and Method for Keeping a Plurality of Locations Under Surveillance
CN102326417A (en) * 2008-12-30 2012-01-18 庞培法布拉大学巴塞隆纳媒体基金会 Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction
CN103250207A (en) * 2010-11-05 2013-08-14 汤姆逊许可公司 Data structure for higher order ambisonics audio data
US20120162362A1 (en) * 2010-12-22 2012-06-28 Microsoft Corporation Mapping sound spatialization fields to panoramic video
CN102510436A (en) * 2011-10-17 2012-06-20 河海大学常州校区 Device and method for detecting high-speed tiny target online in real time by simulating fly vision
CN103634561A (en) * 2012-08-21 2014-03-12 徐丙川 Conference communication device and system
CN203193773U (en) * 2013-04-16 2013-09-11 宁波高新区阶梯科技有限公司 Multimedia panoramic recording system
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN105072557A (en) * 2015-08-11 2015-11-18 北京大学 Loudspeaker environment self-adaptation calibrating method of three-dimensional surround playback system
CN107026959A (en) * 2016-02-01 2017-08-08 杭州海康威视数字技术股份有限公司 A kind of image-pickup method and image capture device
WO2017181777A1 (en) * 2016-04-19 2017-10-26 北京金山安全软件有限公司 Panoramic live video streaming method, device, system, and video source control apparatus
CN106210990A (en) * 2016-07-13 2016-12-07 北京时代拓灵科技有限公司 A kind of panorama sound audio processing method
CN106162206A (en) * 2016-08-03 2016-11-23 北京疯景科技有限公司 Panorama recording, player method and device
CN106992959A (en) * 2016-11-01 2017-07-28 深圳市圆周率软件科技有限责任公司 A kind of 3D panoramas audio frequency and video live broadcast system and audio/video acquisition method
CN106851482A (en) * 2017-03-24 2017-06-13 北京时代拓灵科技有限公司 A kind of panorama sound loudspeaker body-sensing real-time interaction system and exchange method
CN106993249A (en) * 2017-04-26 2017-07-28 深圳创维-Rgb电子有限公司 A kind of processing method and processing device of the voice data of sound field

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110136392A (en) * 2019-05-31 2019-08-16 深圳中物智建科技有限公司 A kind of construction site safety defense monitoring system and method
CN114513698A (en) * 2020-11-16 2022-05-17 中国联合网络通信集团有限公司 Panoramic sound playing system and method
CN114513698B (en) * 2020-11-16 2023-08-22 中国联合网络通信集团有限公司 Panoramic sound playing system and method

Similar Documents

Publication Publication Date Title
CN108702528B (en) Method for transmitting 360 video, method for receiving 360 video, apparatus for transmitting 360 video, and apparatus for receiving 360 video
CN106210703B (en) The utilization of VR environment bust shot camera lenses and display methods and system
US10021301B2 (en) Omnidirectional camera with multiple processors and/or multiple sensors connected to each processor
WO2018082284A1 (en) 3d panoramic audio and video live broadcast system and audio and video acquisition method
KR102221301B1 (en) Method and apparatus for transmitting and receiving 360-degree video including camera lens information
CN207443024U (en) Panorama audio and video recording arrangement and system
CN105376547A (en) Micro video course recording system and method based on 3D virtual synthesis technology
US20170187955A1 (en) Omnidirectional camera with multiple processors and/or multiple sensors connected to each processor
WO2020195232A1 (en) Image processing device, image processing method, and program
CN112235585B (en) Live broadcasting method, device and system for virtual scene
CN104902263A (en) System and method for showing image information
CN106060526A (en) Live broadcast method and device based on two cameras
KR20200065087A (en) Multi-viewpoint-based 360 video processing method and apparatus
CN106210525A (en) For realizing camera and the method for net cast
CN106657719A (en) Intelligent virtual studio system
TW201714163A (en) Information processing device, information processing method, and program
CN108989739A (en) A kind of full view system for live broadcast of video conference and method
CN107172413A (en) Method and system for displaying video of real scene
CN109756683A (en) Panorama audio-video method for recording, device, storage medium and computer equipment
WO2009078909A1 (en) Virtual object rendering system and method
Zheng et al. Research on panoramic stereo live streaming based on the virtual reality
CN110602523A (en) VR panoramic live multimedia processing and synthesizing system and method
JP6091850B2 (en) Telecommunications apparatus and telecommunications method
JP2019050593A (en) Image processing system, image processor, control method, and program
US10075693B2 (en) Embedding calibration metadata into stereoscopic video files

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination