CN108810567A - A kind of matched method in audio & video visual angle, client and server - Google Patents

A kind of matched method in audio & video visual angle, client and server Download PDF

Info

Publication number
CN108810567A
CN108810567A CN201710289042.5A CN201710289042A CN108810567A CN 108810567 A CN108810567 A CN 108810567A CN 201710289042 A CN201710289042 A CN 201710289042A CN 108810567 A CN108810567 A CN 108810567A
Authority
CN
China
Prior art keywords
audio
client
visual angle
fragment
mpd file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710289042.5A
Other languages
Chinese (zh)
Other versions
CN108810567B (en
Inventor
高莹
顾迎节
张尧烨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201710289042.5A priority Critical patent/CN108810567B/en
Publication of CN108810567A publication Critical patent/CN108810567A/en
Application granted granted Critical
Publication of CN108810567B publication Critical patent/CN108810567B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment ; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
    • H04N5/225Television cameras ; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, camcorders, webcams, camera modules specially adapted for being embedded in other devices, e.g. mobile phones, computers or vehicles
    • H04N5/232Devices for controlling television cameras, e.g. remote control ; Control of cameras comprising an electronic image sensor
    • H04N5/23229Devices for controlling television cameras, e.g. remote control ; Control of cameras comprising an electronic image sensor comprising further processing of the captured image without influencing the image pickup process

Abstract

This application discloses a kind of matched method in audio & video visual angle, client and servers, to solve client present in the scheme of existing client terminal playing panoramic video when current visual angle changes, the problem of matching audio file can not be selected to play out, lead to poor user experience.This method is that user end to server sends the first request message of mark for carrying the MPD file of the MPD file for obtaining panoramic video;Receive the MPD file of the server according to the identification feedback of the MPD file, the MPD file includes the mark of at least one audio fragment and its corresponding spatial description information, and the audio space description information is used to describe the associated region of at least one of described MPD file audio fragment;According to the current visual angle range of user and at least one audio space description information, the first audio fragment with the current visual angle commensurate in scope is determined.

Description

A kind of matched method in audio & video visual angle, client and server
Technical field
This application involves a kind of matched method of multimedia technology field more particularly to audio & video visual angle, clients And server.
Background technology
Panoramic video is also referred to as 360 degree of panoramic videos, is to carry out 360 degree entirely to surrounding by the camera positioned at center Scape is shot, and by technologies such as synchronization, splicing, projections, and the image of multiple angle shots is synthesized panoramic picture, and by multiple frames Panoramic picture form panoramic video.
User can arbitrarily change the angle of viewing up and down when watching panoramic video, obtain preferably experience.Panorama Video and the one very big difference of traditional ordinary video are:What user watched at a certain moment is not complete video pictures, only It is a part of region of complete video picture.Usually the content of the currently practical viewing of user residing for panoramic video coordinate system Region is known as current visual angle, and the video pictures that user watches in current visual angle are known as video visual angle in the application.User sees By sliding screen or rotation head (helmet) when seeing, different video visual angles is watched to convert current visual angle.
In current panoramic video application, it is different with the change of user's current visual angle to only considered video visual angle, and Other media component such as audio, subtitle are not accounted for.And in application scenes, when user's current visual angle changes, If it will be that user brings better viewing experience that audio can synchronize matching with video visual angle.For example, when we watch Such as《Father go where》Etc. entertainments when, when multigroup family gathers together, if user's current visual angle be family 1, Indicate that user is interested in family 1, matching at this time can be the relevant audio of 1 member of family.And when user works as When preceding visual angle is switched to family 2, matching should be the relevant audio of 2 member of family.When the family that user does not pay special attention to When in front yard or video pictures including multiple families, matching can be default audio, and still, current panoramic video is answered In, when the current video visual angle of user changes, matching audio file can not be selected to play out, cause to use Family experience is poor.
Invention content
A kind of matched method in audio & video visual angle of the embodiment of the present application offer, client and server, it is existing to solve Client present in the scheme of some client terminal playing panoramic videos can not select therewith when current visual angle changes The problem of audio file matched plays out, leads to poor user experience.
Specific technical solution provided by the embodiments of the present application is as follows:
In a first aspect, the embodiment of the present application provides a kind of matched method in audio & video visual angle, including:
Server receives the first request that the display advertising for obtaining panoramic video that client is sent describes MPD file Message carries the mark of the MPD file in first request message;
The server returns to the MPD file, the MPD texts according to the mark of the MPD file to the client Part includes the mark of at least one audio fragment and its corresponding audio space description information, the audio space description information Associated region for describing at least one audio fragment.
Using the above method, user end to server acquisition request includes the mark and its corresponding audio sky of audio fragment Between description information MPD file so that client can current visual angle range determination after, according to audio space description information meter Calculate associated region of each audio in full-view video image.When some corresponding associated region of audio fragment and user are current When angular field of view matches, so that client is got, precisely matched audio file plays out with video image, to realize The simultaneously match of audio & video image promotes the viewing experience of user.It can be existing to solve by the embodiment of the present application Client terminal playing panoramic video scheme present in client when current visual angle changes, can not select matching Audio file the problem of playing out, leading to poor user experience.
With reference to first aspect, in a kind of possible design, further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
In this design, when the MPD file includes Region Matching condition, the associated region when audio fragment and user When meeting Region Matching condition between current visual angle range, that is, thinks the audio fragment and to your money visual angle be mostly matched. When MPD file includes Multi-audio-frequency matching strategy, when there are the associated regions of at least two audio fragments and user to work as forward sight When meeting Region Matching condition between angular region, determined and the audio of current visual angle commensurate in scope point according to Multi-audio-frequency matching strategy Piece provides more flexible video matching effect to the user.
With reference to first aspect, in a kind of possible design, the method further includes:
The server receives the second request message for obtaining video slicing that the client is sent, and described second The mark of the video slicing is carried in request message;
The server sends the video slicing according to the mark of the video slicing to the client.
With reference to first aspect, in a kind of possible design, the method further includes:
The server receives the acquisition that is used for that the client is sent and divides with matched first audio of the video slicing The third request message of piece carries the mark of the first audio fragment in the third request message;
The server sends first audio point according to the mark of the first audio fragment to the client Piece.
Second aspect, the embodiment of the present application provide a kind of matched method in audio & video visual angle, including:
The first request that user end to server transmission describes MPD file for obtaining the display advertising of panoramic video disappears It ceases, the mark of the MPD file is carried in first request message;
The client receives the MPD file of the server according to the identification feedback of the MPD file, described MPD file includes the mark of at least one audio fragment and its corresponding spatial description information, the audio space description letter Cease the associated region for describing at least one of described MPD file audio fragment;
Current visual angle range and at least one audio space description information of the client according to user, determine with First audio fragment of the current visual angle commensurate in scope.
In the above method, user end to server acquisition request includes the mark of audio fragment and its corresponding audio space The MPD file of description information so that client can calculate after the determination of current visual angle range according to audio space description information Go out associated region of each audio in full-view video image.When some corresponding associated region of audio fragment and user work as forward sight When angular region matches, so that client is got, precisely matched audio file plays out with video image, to realize sound The simultaneously match of frequency and video image, promotes the viewing experience of user.It can be existing to solve by the embodiment of the present application Client present in the scheme of client terminal playing panoramic video can not select matching when current visual angle changes Audio file plays out, the problem of leading to poor user experience.
In conjunction with second aspect, in a kind of possible design, further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
In this design, when the MPD file includes Region Matching condition, the associated region when audio fragment and user When meeting Region Matching condition between current visual angle range, that is, thinks the audio fragment and to your money visual angle be mostly matched. When MPD file includes Multi-audio-frequency matching strategy, when there are the associated regions of at least two audio fragments and user to work as forward sight When meeting Region Matching condition between angular region, determined and the audio of current visual angle commensurate in scope point according to Multi-audio-frequency matching strategy Piece provides more flexible video matching effect to the user.
In conjunction with second aspect, in a kind of possible design, the client is according to the current visual angle range of user and described At least one audio space description information determines the first audio fragment with the current visual angle commensurate in scope, including:
The client obtains at least one of described MPD file according at least one audio space description information Audio fragment is at least one of panoramic video associated region;
The client by least one associated region with the association area that matches within the scope of the current visual angle The corresponding audio fragment in domain is determined as alternative audio fragment;
If only exist an alternative audio fragment, the alternative audio fragment is determined as the first audio fragment;
If there are when at least two alternative audio fragments, according to the matching strategy of the Multi-audio-frequency fragment, the first sound is determined Frequency division piece;
If there is no when alternative audio fragment, the default audio fragment of pre-configuration is set to the first audio fragment.
In this design, by the way that Multi-audio-frequency matching strategy is arranged in MPD file, when multiple associated regions and user are current When angular field of view matches, according to Multi-audio-frequency matching strategy, client can select best audio to carry out matching broadcasting.
In conjunction with second aspect, in a kind of possible design, at least one associated region with the current visual angle model The associated region to match in enclosing is associated region identical with the current visual angle range;Or,
Meet the associated region of the Region Matching condition with the current visual angle range.
In this design, it is associated with what is matched within the scope of the current visual angle at least one associated region Region is arranged different conditions, user can specifically determine according to actual needs at least one associated region whether with work as Preceding angular field of view matching, mode is flexible, improves user experience.
It is described to meet the Region Matching with the current visual angle range in a kind of possible design in conjunction with second aspect The associated region of condition, including:
Fall into the associated region of the current visual angle range;Or,
With the current visual angle range associated region that match degree is greater than the preset threshold.
In this design, by the way that the Region Matching condition of audio fragment is arranged in MPD file, the pass of audio may be implemented Join condition coupling different between region and user's current visual angle, to provide between more flexible audio & video image With effect, further,
In conjunction with second aspect, in a kind of possible design, the method further includes:
At least one audio fragment that the MPD file includes is downloaded to the client local by the client, The client according to user current visual angle range and at least one audio space description information, determination work as with described After the preceding matched first audio fragment of angular field of view, described first is obtained from being downloaded in local at least one audio fragment Audio fragment is decoded broadcasting.
In this design, due to audio fragment data amount and little, multiple audios are downloaded to local by client in advance, Locally the audio fragment is being directly acquired after determining the audio fragment that the region with the current visual angle range of user matches in the middle It is decoded broadcasting, improves the acquisition efficiency of audio, further increases matching efficiency, promotes user experience.
The third aspect, the embodiment of the present application provide a kind of server, including:
Receiving unit, the display advertising for obtaining panoramic video for receiving client transmission describe MPD file First request message carries the mark of the MPD file in first request message;
Processing unit returns to the MPD file for the mark according to the MPD file to the client, described MPD file includes that the mark of at least one audio fragment and its corresponding audio space description information, the audio space are retouched State associated region of the information for describing at least one audio fragment.
In conjunction with the third aspect, in a kind of possible design, further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
In conjunction with the third aspect, in a kind of possible design, the server further includes transmission unit,
The receiving unit, the second request for obtaining video slicing for being additionally operable to receive the client transmission disappear It ceases, the mark of the video slicing is carried in second request message;
The transmission unit sends the video slicing for the mark according to the video slicing to the client.
In conjunction with the third aspect, in a kind of possible design, the receiving unit is additionally operable to receive what the client was sent For obtain with the third request message of the matched first audio fragment of the video slicing, carry in the third request message There is the mark of the first audio fragment;
The transmission unit is additionally operable to the mark according to the first audio fragment, and described the is sent to the client One audio fragment.
Fourth aspect, the embodiment of the present application provide a kind of client, including:
Transmission unit, for sending describe MPD file for obtaining the display advertising of panoramic video first to server Request message carries the mark of the MPD file in first request message;
Receiving unit, the MPD file for receiving the server according to the identification feedback of the MPD file, institute State the mark and its corresponding spatial description information that MPD file includes at least one audio fragment, the audio space description Information is used to describe the associated region of at least one of described MPD file audio fragment;
Processing unit is used for the current visual angle range according to user and at least one audio space description information, really Fixed the first audio fragment with the current visual angle commensurate in scope.
In conjunction with fourth aspect, in a kind of possible design, further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
In conjunction with fourth aspect, in a kind of possible design, the processing unit according to the current visual angle range of user and At least one audio space description information, when determining the first audio fragment with the current visual angle commensurate in scope, specifically For:
At least one of described MPD file audio fragment is obtained according at least one audio space description information to exist At least one of panoramic video associated region;
By sound corresponding with the associated region to match within the scope of the current visual angle at least one associated region Frequency division piece is determined as alternative audio fragment;
If only exist an alternative audio fragment, the alternative audio fragment is determined as the first audio fragment;
If there are when at least two alternative audio fragments, according to the matching strategy of the Multi-audio-frequency fragment, the first sound is determined Frequency division piece;
If there is no when alternative audio fragment, the default audio fragment of pre-configuration is set to the first audio fragment.
In conjunction with fourth aspect, in a kind of possible design, at least one associated region with the current visual angle model The associated region to match in enclosing is associated region identical with the current visual angle range;Or,
Meet the associated region of the Region Matching condition with the current visual angle range.
It is described to meet the Region Matching with the current visual angle range in a kind of possible design in conjunction with fourth aspect The associated region of condition, including:
Fall into the associated region of the current visual angle range;Or,
With the current visual angle range associated region that match degree is greater than the preset threshold.
In conjunction with fourth aspect, in a kind of possible design, the processing unit is additionally operable to:
At least one audio fragment that the MPD file includes is downloaded to the client local, the client In the current visual angle range and at least one audio space description information, determination and the current visual angle range according to user After matched first audio fragment, from be downloaded in local at least one audio fragment obtain the first audio fragment into Row decoding plays.
5th aspect, a kind of server provided by the embodiments of the present application, including memory, processor and communication interface; Wherein,
The memory is for storing computer-readable program;
The processor by running the program in the memory, with complete it is any in first aspect and first aspect can The method that the realization method of energy provides;
The communication interface under the control of the processor for sending and receiving data.
6th aspect, a kind of client provided by the embodiments of the present application, including memory, processor and communication interface; Wherein,
The memory is for storing computer-readable program;
The processor by running the program in the memory, with complete it is any in second aspect and second aspect can The method that the realization method of energy provides;
The communication interface under the control of the processor for sending and receiving data.
7th aspect, the embodiment of the present application provide a kind of computer storage media, and the storage medium is computer-readable Storage medium, it includes instruction that the computer-readable recording medium storage, which has program, program, and described instruction is when by with processor The network equipment so that the network equipment is executed what each possible realization method of above-mentioned first aspect and one side provided Method.
Eighth aspect, the embodiment of the present application provide a kind of computer storage media, and the storage medium is computer-readable Storage medium, it includes instruction that the computer-readable recording medium storage, which has program, program, and described instruction is when by with processor Electronic equipment each possible realization method for making the electronic equipment execute above-mentioned second aspect and second aspect when executing provide Method.
Description of the drawings
Fig. 1 is a kind of network architecture schematic diagram provided by the embodiments of the present application;
Fig. 2 is the content structure schematic diagram of MPD file in the prior art;
Fig. 3 A are the video schematic diagram of full width transmission mode;
Fig. 3 B are the video schematic diagram of block transmission mode;
Fig. 4 is that video pictures switch schematic diagram in the prior art;
Fig. 5 is a kind of structural schematic diagram of server provided by the embodiments of the present application;
Fig. 6 is a kind of structural schematic diagram of client provided by the embodiments of the present application;
Fig. 7 is a kind of matched method flow schematic diagram in audio & video visual angle provided by the embodiments of the present application;
Fig. 8 is the matched method flow schematic diagram in another audio & video visual angle provided by the embodiments of the present application;
Fig. 9 A, Fig. 9 B and Fig. 9 C are the matching process schematic diagram of the associated region and current visual angle of audio fragment;
Figure 10 A, Figure 10 B and the quantity that Figure 10 C are associated region are more than schematic diagram at one;
Figure 11 is the structural schematic diagram of another server provided by the embodiments of the present application;
Figure 12 is the structural schematic diagram of another client provided by the embodiments of the present application.
Specific implementation mode
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application carries out clear, complete Site preparation describes.
A kind of matched method in audio & video visual angle of the embodiment of the present application offer, client and server, it is existing to solve Client present in the scheme of some client terminal playing panoramic videos can not select therewith when current visual angle changes The problem of audio file matched plays out, leads to poor user experience.
Wherein, method and apparatus be based on same inventive concept, since the principle that method and device solves the problems, such as is similar, Therefore the implementation of apparatus and method can be with cross-reference, and overlaps will not be repeated.
The network architecture that technical solution provided by the embodiments of the present application is related to is as shown in Figure 1, include server 101 and client End 102.Server is corresponding with client, provides the program of local service to the user, the invention relates to client Have the function of for user's playing panoramic video, panoramic video player is run in client, which can be mounted in visitor An application on the end of family, can also be a page on browser.Client can be wireless terminal device, can also be Line termination unit.Wireless terminal device can be had the portable equipment of wireless connecting function or be connected to wireless-modulated Other processing equipments of demodulator.Wireless terminal device can through wireless access network (Radio Access Network, RAN) with One or more core nets are communicated, and wireless terminal device can be mobile terminal device, as mobile phone (or be " bee Nest " phone) and computer with mobile terminal device, for example, it may be portable, pocket, hand-held, built-in computer Or vehicle-mounted mobile device, they exchange language and/or data with wireless access network.Line termination unit can be wired Television set, hard wired computer etc..Server refers to the equipment for providing the service of calculating, server can the service at customer in response end ask Ask, server has the function of undertaking service and ensures service, the invention relates to server have be client The function of panoramic video is provided.The composition of server is similar with general computer architecture, generally include to blow afloat, hard disk, memory, System bus etc., example more demanding in processing capacity, reliability, stability, safety, scalability, manageability etc. Such as, server can be PC (Personal Computer, PC) server.Communication between client and server Support the media transmission protocol of general panoramic video, such as real-time transport protocol (Real-Time Protocol, RTP), reality When stream protocol (Real-Time Streaming Protocol, RTSP), hypertext transfer protocol (HyperText Transfer Protocol, HTTP), HTTP dynamic self-adaptings stream (Dynamic Adaptive Streaming over HTTP, DASH) matchmaker Body agreement, HTTP live TV streams stream (HTTP Live Streaming, HLS) media protocol etc..
The invention relates to server and client side can be based on DASH technologies, other technologies can also be based on. For being based on DASH technologies, DASH technologies use different HTTP Streaming Media skills primarily to solving different video distributor Lengthy and tedious problem caused by art in deployment and reception mechanism.The client that is mainly characterized by of DASH technologies can be according to Network status Such as speed of download, caching are how many, select the media slicing of suitable code check, distribution of media quotient passes through according to the selection of client Media slicing is sent to client by http protocol, to ensure the viewing experience of user.
The media exhibition description of existing DASH standards Main Specification (Media Presentation Description, MPD) the format of file and media slicing (Segment).The content structure of existing MPD file is as shown in Fig. 2, MPD file point For the period (Period), adapt to collection (Adaptation Set), description (Representation), fragment (Segment) totally 4 A level.One MPD file is made of one or more continuous period, and a Period indicates a media time section, There are initial time and end time;One period includes one or more Adaptation Set, each Adaptation Set generally corresponds to a kind of Media component, such as audio, video, subtitle.By taking the MPD file of video as an example, video Adaptation Set generally include multiple Representation, and different Representation correspond to different code checks, divide The other features such as resolution, between multiple Representation that the same Adaptation Set include can into Mobile state from Adapt to switching;Each Representation is made of one or more media slicings, and it is the basic unit of MPD to divide media piece, Client can by the uniform resource locator of the media slicing in MPD file (Uniform Resource Locator, URL media slicing) is obtained and handled to server to realize streaming media service.
The invention relates to panoramic video transmitting scenes, and in particular to asks transmission panorama in user end to server The scene of the forward direction server acquisition request MPD file of the video slicing of video.
Panoramic video is also referred to as 360 degree of panoramic videos, panoramic video be by the camera positioned at center to surrounding into Row 360o pan-shots change observation visual angle by sliding screen or the rotation head drive helmet when user watches, play complete The picture of scape video can switch therewith automatically, and user is as being in true environment.
In panoramic video transmitting scene, client obtains the MPD file of panoramic video to server first, it is one Meta data file provides the information how client accesses the media slicing of panoramic video.
Since the data volume of panoramic video is much larger compared with ordinary video, the mode for transmitting panoramic video at present mainly may be used To be divided into two classes:
1) full width is transmitted:It is consistent with ordinary video transmission method, by whole picture panoramic picture using H.264, the videos such as H.265 Coding form carries out coding transmission, and what client received is complete panoramic video content, as shown in Figure 3A.
2) block transmission:Panoramic picture is cut into multiple pieces (tile), every block diagram picture is encoded, is corresponded to per block diagram picture One video slicing is transmitted by the corresponding piecemeal content priority transmission of the current visual angle of user or with high-resolution when transmission. As shown in Figure 3B, entire panoramic picture is divided into 16 blocks, and a video slicing is corresponded to per block diagram picture.
Client can go that corresponding video slicing, the current video of user is asked to regard according to the current video visual angle of user Angle may be fallen on one or more blocks, therefore what client received is the corresponding video slicing of one or more blocks.Assuming that Client needs to ask the corresponding video slicing of four piecemeals in the left side institute diagram such as Fig. 4 respectively according to user's current visual angle. Client is decoded splicing to obtaining 4 video slicings back, renders and plays, and the video pictures of end user's viewing are as schemed Shown in 4 right side.
Motion Picture Experts Group's (Moving Picture Experts Group, MPEG) DASH standards are in MPD texts at present Visual angle (viewpoint) descriptor defined in part, the video and audio content with identical viewpoint values can be broadcast simultaneously It puts.Client can find video and audio fragment list with identical viewpoint values in MPD file, and according to working as Preceding bandwidth obtains the video and audio fragment of suitable code check respectively.Such as the video row for the MPD examples 1 of property illustrated below provided Include altogether 4 AdaptationSet in table, may determine that the first two AdaptationSet corresponds to video from mineType, after Two AdaptationSet correspond to audio, wherein the corresponding video slicings of Representation and id that id is 11 or 12 be The 31 or 32 corresponding audio fragments of Representation can play together, because their viewpoint values are equal to vp1.And the corresponding video slicings of Representation that id is 21 or 22 and Representation pairs that id is 41 or 42 The audio fragment answered can play together, because their viewpoint values are equal to vp2.
MPD examples one
It follows that the visual angle matching relationship between video slicing and audio fragment can only be indicated using the prior art, but When being panoramic video transmission, video slicing and video visual angle are not one-to-one, cannot represent audio well and regard Matching relationship between frequency visual angle.Such as when block transmission, video visual angle may be made of multiple video slicings, according to existing skill Identical viewpoint values should be arranged in the matched audio of these video slicings of art and the visual angle.But the same video point Piece may belong in different video visual angles, especially when the matched audio difference in the two video visual angles, using existing skill Art can not represent the matching relationship between the video slicing and multiple audios at composition different video visual angle.
And when panoramic video full width is transmitted, full image corresponds to a video slicing, wherein may include multiple videos If visual angle can not be represented using the prior art in the same video slicing when the corresponding audio difference in these video visual angles Matching relationship between video visual angle and different audios.
And increasing audio space description information in the embodiment of the present application in MPD file, client can utilize audio empty Between description information calculate the associated region of the audio fragment corresponding to the audio space description information, when user's current visual angle is true After fixed, client can obtain the audio fragment of associated region and user's current visual angle commensurate in scope and play, realize audio and The effect of video visual angle simultaneously match.
Based on the above problem of the existing technology, the embodiment of the present application provides a kind of matched side in audio & video visual angle Method, client and server.Technical solution provided by the embodiments of the present application is described in detail below by specific embodiment, needs Bright, the displaying sequence of embodiment only represents the sequencing of embodiment, does not represent the technical solution that embodiment is provided Quality.
Embodiment one
The embodiment of the present application provides a kind of server, as shown in fig.5, the host 500 where the server includes:Extremely A few processor 501, memory 502 and communication interface 503;At least one processor 501,502 and of the memory The communication interface 503 is connected by bus 504;
The memory 502, for storing computer executed instructions.
At least one processor 501, the computer executed instructions for executing the storage of the memory 502 so that The host 500 carries out data interaction by the host where the communication interface 503 and client to be implemented to execute the application A kind of matched method in audio & video visual angle that example provides.Wherein,
At least one processor 501 reads the program in memory 502, executes following process:
At least one processor 501 is used to obtain for what is sent by the reception client of the communication interface 503 First request message of the MPD file of panoramic video carries the mark of the MPD file in first request message;Root According to the mark of the MPD file, the MPD file is returned to the client, the MPD file includes at least one audio The mark of fragment and its corresponding audio space description information, the audio space description information are described at least one for describing The associated region of audio fragment.
In one possible implementation, in the MPD file further include at least one of MPD file audio The Region Matching condition of fragment and/or the matching strategy of Multi-audio-frequency fragment.
At least one processor 501, is additionally operable to:Receive what the client was sent by the communication interface 503 The second request message for obtaining video slicing carries the mark of the video slicing in second request message;Root According to the mark of the video slicing, the video slicing is sent to the client by the communication interface 503.
At least one processor 501, is additionally operable to:Receive what the client was sent by the communication interface 503 For obtain with the third request message of the matched first audio fragment of the video slicing, carry in the third request message There is the mark of the first audio fragment;According to the mark of the first audio fragment, by the communication interface 503 to described Client sends the first audio fragment.
In the present embodiment, at least one processor 501 may include different types of processor 501, or including The processor 501 of same type;Processor 501 can be below any:Central processing unit (Central Processing Unit, CPU), it is microprocessor, field programmable gate array (Field Programmable Gate Array, FPGA), special Processor etc. has the device of calculation processing ability.A kind of optional embodiment, at least one processor 501 can also collect As many-core processor.
The memory 502 can be below any or any combination:Random access memory (Random Access Memory, RAM), read-only memory (read only memory, ROM), nonvolatile memory (non- Volatile memory, NVM), solid state disk (Solid State Drives, SSD), mechanical hard disk, disk, disk permutation Equal storage mediums.
The communication interface 503 carries out data friendship for host 500 and other equipment (such as host where client) Mutually.Communication interface 503 can be below any or any combination:Network interface (such as Ethernet interface), wireless network The device with network access facility such as card.
The bus 504 may include address bus, data/address bus, controlling bus etc., and for ease of indicating, Fig. 5 is with one Thick line indicates the bus.The bus 504 can be below any or any combination:Industry standard architecture (Industry Standard Architecture, ISA) bus, peripheral component interconnection (Peripheral Component Interconnect, PCI) bus, expanding the industrial standard structure (Extended Industry Standard Architecture, EISA) wired data transfers such as bus device.
An embodiment of the present invention provides a kind of clients, as shown in fig.6, the host 600 where the client includes:Extremely A few processor 601, memory 602 and communication interface 603;At least one processor 601,602 and of the memory The communication interface 603 is connected by bus 604;
The memory 602, for storing computer executed instructions.
At least one processor 601, the computer executed instructions for executing the storage of the memory 602 so that The host 600 carries out data interaction by the host where the communication interface 603 and client to be implemented to execute the application A kind of matched method in audio & video visual angle that example provides.Wherein,
At least one processor 601 reads the program in memory 602, executes following process:
At least one processor 601, for being sent to server for obtaining panorama by the communication interface 603 First request message of the MPD file of video carries the mark of the MPD file in first request message;Pass through institute It states communication interface 603 and receives the server according to the MPD file of the identification feedback of the MPD file, the MPD file Include the mark of at least one audio fragment and its corresponding spatial description information, the audio space description information is for retouching State the associated region of at least one of described MPD file audio fragment;According to the current visual angle range of user and it is described at least One audio space description information determines the first audio fragment with the current visual angle commensurate in scope.
In one possible implementation, in the MPD file further include at least one of MPD file audio The Region Matching condition of fragment and/or the matching strategy of Multi-audio-frequency fragment.
At least one processor 601 is retouched according to the current visual angle range and at least one audio space of user Information is stated, when determining the first audio fragment with the current visual angle commensurate in scope, is specifically used for:
At least one of described MPD file audio fragment is obtained according at least one audio space description information to exist At least one of panoramic video associated region;By at least one associated region within the scope of the current visual angle The corresponding audio fragment of associated region to match is determined as alternative audio fragment;If only exist an alternative audio fragment, The alternative audio fragment is determined as the first audio fragment;If there are when at least two alternative audio fragments, according to described more The matching strategy of audio fragment determines the first audio fragment;If there is no when alternative audio fragment, by the default audio of pre-configuration Fragment is set to the first audio fragment.
In one possible implementation, at least one associated region with phase within the scope of the current visual angle The associated region matched is associated region identical with the current visual angle range;Or, with described in current visual angle range satisfaction The associated region of Region Matching condition.
In one possible implementation, the pass that the Region Matching condition is met with the current visual angle range Join region, including:Fall into the associated region of the current visual angle range;Or, being more than with the matching degree of the current visual angle range The associated region of predetermined threshold value.
At least one processor 601 is additionally operable at least one audio fragment for including by the MPD file and downloads Local to the client, the client is described according to the current visual angle range and at least one audio space of user Information, determine with after the first audio fragment of the current visual angle commensurate in scope, from being downloaded to local at least one audio The first audio fragment is obtained in fragment is decoded broadcasting.
In the present embodiment, at least one processor 601 may include different types of processor 601, or including The processor 601 of same type;Processor 601 can be below any:CPU, arm processor, FPGA, application specific processor Deng the device with calculation processing ability.A kind of optional embodiment, at least one processor 601 can also be integrated into crowd Core processor.
The memory 602 can be below any or any combination:RAM, ROM, NVM, SSD, mechanical hard disk, The storage mediums such as disk, disk permutation.
The communication interface 603 carries out data friendship for host 600 and other equipment (such as host where server) Mutually.Communication interface 603 can be below any or any combination:Network interface (such as Ethernet interface), wireless network The device with network access facility such as card.
The bus 604 may include address bus, data/address bus, controlling bus etc., and for ease of indicating, Fig. 6 is with one Thick line indicates the bus.The bus 604 can be below any or any combination:Isa bus, pci bus, EISA The device of the wired data transfers such as bus.
It includes sound that user end to server acquisition request, which may be implemented, in server and client side provided by the embodiments of the present application The mark of frequency division piece and its MPD file of corresponding audio space description information so that client can be in current visual angle range After determination, associated region of each audio in full-view video image is calculated according to audio space description information.When some sound When the corresponding associated region of frequency division piece and user's current visual angle range match, client is made to get and accurate of video image The audio file matched plays out, and to realize the simultaneously match of audio & video image, promotes the viewing experience of user.Pass through The embodiment of the present application can work as forward sight to solve client present in the scheme of existing client terminal playing panoramic video When angle changes, the problem of matching audio file can not be selected to play out, lead to poor user experience.Further , in the embodiment of the present application, by the way that the Region Matching condition of audio fragment is arranged in MPD file, the pass of audio may be implemented Join condition coupling different between region and user's current visual angle, to provide between more flexible audio & video image With effect, further, by the way that Multi-audio-frequency matching strategy is arranged in MPD file, when multiple associated regions and user are current When angular field of view matches, according to Multi-audio-frequency matching strategy, client can select best audio to carry out matching broadcasting.
Embodiment two
The embodiment of the present application provides a kind of matched method in audio & video visual angle, as shown in fig. 7, being serviced in this method Device and the interaction flow of client are as follows:
S701:User end to server sends the first request message of MPD file for obtaining panoramic video, and described the The mark of the MPD file is carried in one request message.
In S701, the mark of MPD file obtains the MPD file of the mark instruction of the MPD file for server.MPD texts The mark of part can be uniform resource identifier (Uniform Resource Idetifier, URI), be http with URI:// For example.com/mpd, the first request message includes following content:
GET http://example.com/mpd HTTP/1.1
Connection:keep-alive
It should be noted that above-mentioned first request message is merely illustrative, the first request message in the present embodiment In addition to the mark including MPD file, it can also include other parameters, no longer repeat one by one herein.
S702:Server obtains MPD file according to the mark of MPD file.
In S702, MPD file includes the mark of at least one audio fragment and its corresponding audio space description information, The audio space description information is used to describe the associated region of at least one audio fragment.
Illustratively, including the content of the MPD file of audio space description information is as follows:
Adhering to separately property is described as follows shown in table one in the middle part of the above-mentioned MPD file including spatial description information:
Table one
In above-mentioned table one, adaptationSet@mimeType indicate medium type, from AdaptationSet (mimeType=" video/mp4 "), should it is found that include the video file of a mp4 type in above-mentioned MPD file The video slicing of 3 kinds of different code checks is contained in AdaptationSet, they correspond to different video heights and width respectively, Such as:When code check is " 1024000 " bandwidth=, the width of video image is width=" 2560 ", is highly Height=" 720 ", because video is by the way of full width transmission in the present embodiment, the width of panoramic picture in panoramic video Degree and height are 2560 and 720.In addition, also including 3 audio fragments, AdaptationSet in the MPD file (mimeType=" audio/mp4 ") comprising the corresponding audio fragment of a main audio fragment and 2 specific regions, SchemeIdUri=" urn:mpeg:dash:asrd:2016 " audio space description information, the definition of key (value) value are indicated As shown in Table 2, wherein M is indicated essential, and O indicates optional.
Table two
@value Usage Description
object_x M Abscissa of the upper left corner of audio fragment corresponding region in full-view video image
object_y M Ordinate of the upper left corner of audio fragment corresponding region in full-view video image
object_width M The width or horizontal direction size of audio fragment corresponding region
object_height M The height or vertical direction size of audio fragment corresponding region
total_width O The width of full-view video image
total_height O The height of full-view video image
Therefore 1 corresponding audio space description information of audio fragment<SupplementalProperty schemeIdUri =" urn:mpeg:dash:asrd:2016 " value=" 480,390,810,300,3840,1080 "/>Indicate the audio fragment Associated region be in width be 3840, highly in 1080 full-view video image with (480,390) for the upper left corner, width is The region that 810 height are 300.Because providing the width of full-view video image in 1 corresponding spatial description information of audio fragment Degree and height, thus it is corresponding in audio fragment 2<SupplementalProperty schemeIdUri=" urn:mpeg: dash:asrd:2016 " value=" 3072,285,480,510 "/>In can no longer provide full-view video image width and Highly, it indicates that the associated region of audio fragment 2 be in width is 3840, highly in 1080 full-view video image with (3072,285) it is the upper left corner, width is the region that 480 height are 510.
In the present embodiment, the audio fragment for not providing audio relation description information is considered as main audio, also may be used With referred to as default audio, in addition to there is no the audio for providing audio relation description information that can be used as default audio, if audio point When piece includes precedence information, the audio fragment of highest priority is not limited in the application it is also assumed that be default audio The method for determining default audio fragment.
It should be noted that audio space description information in addition to can such as table two in other than the description method that provides, may be used also To be described by each apex coordinate position of the corresponding associated region of audio fragment, application scheme does not limit area of space Description method.It therefore, can also be by providing the relative scale with full-view video image other than above-mentioned absolute value description To describe.
S703:Server returns to the MPD file to client, and the MPD file includes at least one audio fragment Mark and its corresponding audio space description information, the audio space description information is for describing at least one audio The associated region of fragment.
In the present embodiment, server may be implemented by the above method and send MPD file to client, client is made to be based on The MPD file realizes the matching one by one of video slicing and audio fragment.The above method can also include the following steps, to realize Audio fragment of the server to client transmissions panoramic video:
S704:User end to server sends the second request message for obtaining video slicing, second request message It include the mark of the video slicing.
Client is according to current bandwidth situation to the video slicing of the suitable code check of server request selecting, it is assumed here that client It is bandwidth=" 1024000 " to hold the code check selected, and corresponding representation is as follows:
<Representation id=" v2 " bandwidth=" 1024000 " width=" 2560 " height=" 720">
<BaseURL>562465736.mp4</BaseURL>
</Representation>
Therefore, the URL of video slicing is http://cdn1.example.com/562465736.mp4, the second request disappear It is as follows to cease format:
GET http://cdn1.example.com/562465736.mp4HTTP/1.1
Connection:keep-alive
S705:Server sends the video slicing according to the mark of the video slicing to the client.
S706:Client describes to believe according at least one of the current visual angle range of user and MPD file audio space Breath,
Determine the first audio fragment with the current visual angle commensurate in scope.
It is highly 720 because the corresponding panoramic picture width of video slicing that client obtains in S705 is 2560, it is false If it in width is 2560 that the current visual angle range areas of user, which is, highly to be with (320,260) in 720 full-view video image The upper left corner, width are the region that 540 height are 200.Due to corresponding in the audio space description information of the MPD file in table one The width of full-view video image is 3840, is highly 1080, therefore client is needed the value in audio space description information Value converts:
Object_x '=object_x*width '/total_width
Object_y '=object_y*height '/total_height
Object_width '=object_width*width '/total_width
Object_height '=object_height*height '/total_height
Wherein, object_x, object_y, object_width, object_height, total_width, total_ Height is the original value values in the corresponding audio space description information of MPD file sound intermediate frequency fragment, and width, height are The width and height for the corresponding full-view video image of video slicing that client obtains, object_x ', object_y ', Object_width ', object_height ', width, height are that the video slicing that audio fragment is obtained in client corresponds to Full-view video image in spatial description information.After calculating, audio fragment 1 width be 2560, highly for 720 it is complete Associated region is for the upper left corner, width is the region that 540 height are 200, audio fragment with (320,260) in scape video image 2 width be 2560, highly for associated region in 720 full-view video image be with (2030,190) for the upper left corner, width The region for being 340 for 320 height, therefore client determines that with the matched audio fragment in current visual angle range areas of user be sound Frequency division piece 1, i.e. the first audio fragment are audio fragment 1.
S707:The user end to server is sent for obtaining and the matched first audio fragment of the video slicing Third request message carries the mark of the first audio fragment in the third request message.
1 corresponding AdaptationSet of audio fragment is as follows, includes the audio fragment of two different code checks, it is assumed that client End selects code check for the audio fragment of bandwidth=" 64000 " according to current bandwidth determination
Therefore, the URL of the audio fragment selected is http://cdn1.example.com/3463275477.mp4, third Request message format is as follows:
GET http://cdn1.example.com/3463275477.mp4 HTTP/1.1
Connection:keep-alive
S708:The server sends first sound according to the mark of the first audio fragment to the client Frequency division piece.
Server sends corresponding audio fragment to client, client is to this according to the third solicited message of client Audio fragment is decoded broadcasting.
It should be noted that due to audio fragment data amount and little, client can also in advance by multiple audios all under It is downloaded to local, is directly obtained locally after determining the audio fragment that the region with the current visual angle range of user matches in S706 The audio fragment is taken to be decoded broadcasting.
Further, after user converts current visual angle, client obtains the audio to match with newest current visual angle Fragment is decoded broadcasting.
Assuming that it is 2560 that the region watched of the transformed current visual angle of user, which is in width, highly for 720 aphorama With (2030,190) for the upper left corner in frequency image, width is the region that 320 height are 340, therefore client is according to step S706 It determines that the audio fragment that the current visual angle range areas with user matches is audio fragment 2, then, executes S707 and S708 and obtain It is decoded after taking the audio fragment that code check is bandwidth=" 64000 " in 2 corresponding AdaptationSet of audio fragment It plays.
Sequence is executed it should be noted that being not intended to limit in the application between S704-S705 and S706-S708.
Further include at least one of MPD file audio point in the MPD file in a kind of possible embodiment The Region Matching condition of piece and/or the matching strategy of Multi-audio-frequency fragment.For this embodiment, following example three is come to this It is described in detail.
Fig. 8 shows a kind of matched method in audio & video visual angle, is retouched by executive agent of client in Fig. 8 It states, at this point, the implementation procedure of server is identical with Fig. 7, details are not described herein.
As shown in figure 8, client determines that the method with the matched audio fragment in current video visual angle comprises the steps of:
800:User end to server sends the first request message of MPD file for obtaining panoramic video, and described the The mark of the MPD file is carried in one request message.Specific implementation process sees the S701 in Fig. 7, no longer superfluous herein It states.
801:Client receives the MPD file that server is sent, and the MPD file includes at least one audio fragment Mark and its corresponding audio space description information, the audio space description information is for describing at least one audio point The associated region of piece.
The mode of transmission panoramic video can be mainly divided into full width transmission and two class of block transmission at present, be passed when using full width When defeated panoramic video, the content of the MPD file can be as shown in embodiment two.The present embodiment three will focus on block transmission For illustrate, at this point, including audio space description information MPD file content it is as follows.
Include the corresponding audio fragment of a main audio fragment and 2 specific regions in above-mentioned MPD file, SchemeIdUri=" urn:mpeg:dash:asrd:2016 " audio space description information, the audio space description letter are indicated Breath may be used as other than the representation method of the definition of table one in embodiment two, described using a kind of audio space in the present embodiment three The relative value representation method of information, the definition of value values are as shown in Table 3:
Table three
802:Client selects video slicing, determines the width and height of the corresponding full-view video image of the video slicing Degree.
Client selects the video slicing of suitable code check according to current bandwidth, when using such as the full width transmission in embodiment two When panoramic video, the corresponding width of selected video slicing and height are the width and height of full-view video image.When Using in the present embodiment three when the video slicing of block transmission panorama, it is assumed that client according to current bandwidth select code check for The video slicing of bandwidth=" 128000 ", width=" 960 " height=" 270 " indicate the corresponding video of video slicing Picture traverse is 960, is highly 270, is illustrated with above-mentioned exemplary MPD file, video AdaptationSet (mimeType=" video/mp4's ")<SupplementalProperty schemeIdUri=" urn:mpeg:dash: srd:2014 " value=" 0,0,0,1,1,4,4 "/>Indicate that the full-view video image width and height are respectively divided into 4 parts, entirely Full-view video image is divided into 4*4=16 blocks (Tile), that is to say, that the width of each block of video slicing image and height are respectively The width of full-view video image and a quarter of height, therefore client selects code check for bandwidth='s " 128000 " The width of the corresponding full-view video image of video slicing is 960*4=3840, is highly 270*4=1080.
It should be noted that it is existing to determine that the width of the corresponding full-view video image of video slicing and height are referred to The prior art is merely given as one kind in the present embodiment three and illustrates, and is not especially limited.
803:Client calculates each audio fragment in the video slicing according to the audio space description information in MPD file Associated region in corresponding full-view video image.
It, can be according in embodiment two when audio space description information uses absolute value representation mode as shown in Table 2 Method described in S706 calculates associated region of each audio fragment in the corresponding full-view video image of the video slicing.This Indicate that calculate each audio fragment when audio space description information regards described for relative scale shown in table three in embodiment three The method of associated region is described in detail in the corresponding full-view video image of frequency division piece.
It is respectively 3840 and 1080 by the overall width and total height of the full-view video image determined in 802, according in table three The audio space description information value value attributes (relative scale representation) provided can determine:1 corresponding audio of audio fragment Spatial description information<SupplementalProperty schemeIdUri=" urn:mpeg:dash:asrd:2016"value =" 0.125,0.361,0.211,0.278 "/>Indicate that the associated region of audio fragment to be 3840 in width, is highly With (0.125*3840=480,0.361*1080=390) for the upper left corner in 1080 full-view video image, width 0.211* 3840=810 height is the region of 0.278*1080=300.2 corresponding audio space description information of audio fragment< SupplementalProperty schemeIdUri=" urn:mpeg:dash:asrd:2016 " value=" 0.8,0.264, 0.125,0.472"/>It in width is 3840 to indicate that the associated region of audio fragment is, highly for 1080 full-view video image In with (0.8*3840=3072,0.264*1080=285) be the upper left corner, width is that 0.125*3840=480 height is The region of 0.472*1080=510.
804:When matching there are the associated region of alternative audio fragment and current visual angle range, 805 are executed;Otherwise, Execute 807.
Wherein, client by least one associated region with the associated region pair that matches within the scope of the current visual angle The audio fragment answered is determined as alternative audio fragment.
Specifically, determining whether there is the associated region of audio fragment and current visual angle range matches.Can by with Under type determines:
Mode one, if the associated region of an audio fragment is associated region identical with current visual angle range, it is determined that with Current visual angle range matches.
After calculating the associated region of an audio fragment according to the above method, if the associated region and use of general audio fragment When the current visual angle range areas at family is identical, then it is assumed that the audio fragment matches with current visual angle range.Such as assume user Current visual angle range areas be in width be 3840, highly in 1080 full-view video image with (480,390) for upper left When width is the region that 810 height are 300, the associated region of audio fragment 1 can be determined according to the result of calculation in 803 for angle It is identical as the current visual angle range areas of user, i.e. audio fragment 1 and current visual angle commensurate in scope, as shown in Figure 9 A.
Mode two:If the associated region of an audio fragment is the pass for meeting Region Matching condition with the current visual angle range Join region, it is determined that match with current visual angle range.
Specifically, meet the associated region of the Region Matching condition with the current visual angle range, including:
Fall into the associated region of the current visual angle range;Or, being more than with the matching degree of the current visual angle range default The associated region of threshold value.
Specifically, in MPD file can with setting area matching condition, when audio fragment associated region and user it is current When meeting the Region Matching condition between angular field of view region, it is determined that the audio fragment matches with current visual angle range.
For example, 1) Region Matching condition is the condition of inclusion relation, when the current visual angle range areas of user includes audio When the associated region of fragment, it is believed that the audio fragment matches with current visual angle range, as shown in figs. 9 a and 9b;2) region Matching condition is the condition of smallest match ratio, and smallest match ratio is preset ratio value.When the current visual angle range of user The ratio that the lap of the associated region of region and audio fragment accounts for the associated region of audio fragment is more than smallest match ratio When, it is believed that the audio fragment matches with current visual angle range, as shown in Figure 9 C.
It should be noted that being not intended to limit the match party of the associated region and current visual angle range of audio fragment in the application Method.
805:When matching there are the associated region of at least two alternative audio fragments and current visual angle range, execute 806;Otherwise, 808 are executed.
The quantity of the associated region of the audio fragment to match with the current visual angle range determined according to the method described above More than one, specifically see shown in Figure 10 A, Figure 10 B and Figure 10 C.
806:When in MPD file including Multi-audio-frequency matching strategy, 809 are executed;Otherwise, 807 are executed.
807:Client selects default audio fragment to be decoded broadcasting as the first audio fragment.
Default audio fragment can be the audio fragment of no any associated region or be not provided with audio space description The audio fragment of information can also be the audio fragment provided with highest priority.
808:Selection is decoded broadcasting with the first audio fragment that current visual angle range matches.
809:The the first audio fragment to match with current visual angle range to be obtained according to Multi-audio-frequency matching strategy determination It is decoded broadcasting.
Multi-audio-frequency matching strategy is used to indicate when the associated region of multiple audio fragments can be with current visual angle range phase The strategy of selection and the audio fragment of current visual angle commensurate in scope when matching.For example, priority match strategy can be used as multitone A kind of embodiment of frequency matching strategy.At this time, it may be necessary to the priority of each audio fragment be preset in MPD file, according to pre- If each audio fragment priority, select the audio fragment of highest priority as first with current visual angle commensurate in scope Audio fragment;In another example the matching strategy of matching degree can be as a kind of embodiment of Multi-audio-frequency matching strategy.At this point, can To calculate the overlapping region of each associated region and current visual angle range areas, using the maximum associated region in overlapping region as With the maximum associated region of degree;Alternatively, the ratio value of overlapping region and associated region is calculated, by the maximum associated region of ratio value As the maximum associated region of matching degree, so that it is determined that the first audio corresponding with the associated region of current visual angle commensurate in scope point Piece.
It should be noted that Multi-audio-frequency matching strategy is not limited in the application specifically, it is any to can be used for working as multiple sounds The associated region of frequency division piece can be with selection when current visual angle commensurate in scope and the audio fragment of current visual angle commensurate in scope Method all can serve as Multi-audio-frequency matching strategy.
If Multi-audio-frequency matching strategy is priority match strategy, audio adapts to wrap in collection (AdaptationSet) Containing priority attribute, it is used to indicate the priority of the audio fragment.When multiple audio fragments associated region can with it is current When angular field of view matches, by comparing the priority of these audio fragments, determine that the satisfactory audio fragment of priority is The audio fragment to match with current visual angle range.
The MPD file of embodiment three increases Region Matching condition on the basis of embodiment two, can be with more flexible earth's surface Show the matching relationship between the associated region of audio fragment and current visual angle range areas, plan is further matched by Multi-audio-frequency Can slightly solve the problems, such as how to select optimal audio fragment when multiple audio fragments match with current visual angle range, so as to To bring the viewing experience of more accurately audio & video visual angle simultaneously match to user.
Embodiment three
Based on above example, the embodiment of the present invention additionally provides a kind of server, the server can be with shown in Fig. 5 The identical equipment of server, the method that server side in embodiment two executes may be used.Refering to fig. 1 shown in 1, the present invention is real Applying a kind of server 1100 that example provides includes:Receiving unit 1101, processing unit 1102.Wherein,
Receiving unit 1101, the display advertising for obtaining panoramic video for receiving client transmission describe MPD texts First request message of part carries the mark of the MPD file in first request message;
Processing unit 1102 returns to the MPD file, institute for the mark according to the MPD file to the client State the mark and its corresponding audio space description information that MPD file includes at least one audio fragment, the audio space Description information is used to describe the associated region of at least one audio fragment.
In one possible implementation, in the MPD file further include at least one of MPD file audio The Region Matching condition of fragment and/or the matching strategy of Multi-audio-frequency fragment.
In one possible implementation, the server further includes transmission unit 1103,
The receiving unit 1101 is additionally operable to receive the second request for obtaining video slicing that the client is sent Message carries the mark of the video slicing in second request message;
The transmission unit 1103 sends the video for the mark according to the video slicing to the client Fragment.
In one possible implementation, the receiving unit 1101 is additionally operable to receive the use that the client is sent In obtaining the third request message with the matched first audio fragment of the video slicing, carried in the third request message The mark of the first audio fragment;
The transmission unit 1103 is additionally operable to the mark according to the first audio fragment, and institute is sent to the client State the first audio fragment.
The method that the function of above-mentioned each unit can be found in the execution of two server side of embodiment, details are not described herein again.
It should be noted that being schematical, only a kind of logic function to the division of unit in the embodiment of the present invention It divides, formula that in actual implementation, there may be another division manner.In addition, each functional unit in each embodiment of the application can be with It is integrated in a processing unit, can also be that each unit physically exists alone, it can also two or more unit collection At in a unit.The form that hardware had both may be used in above-mentioned integrated unit is realized, SFU software functional unit can also be used Form realize.
Based on above example, the embodiment of the present invention additionally provides a kind of client, the client can be with shown in Fig. 6 The identical equipment of client, the method that client-side in embodiment two executes may be used.Refering to fig. 1 shown in 2, the present invention is real Applying a kind of client 1200 that example provides includes:Receiving unit 1201, processing unit 1202 and transmission unit 1203.Wherein,
Transmission unit 1203 describes MPD file for being sent to server for obtaining the display advertising of panoramic video First request message carries the mark of the MPD file in first request message;
Receiving unit 1201, for receiving MPD text of the server according to the identification feedback of the MPD file Part, the MPD file include the mark of at least one audio fragment and its corresponding spatial description information, the audio space Description information is used to describe the associated region of at least one of described MPD file audio fragment;
Processing unit 1202, for being believed according to the current visual angle range and at least one audio space description of user Breath determines the first audio fragment with the current visual angle commensurate in scope.
In one possible implementation, in the MPD file further include at least one of MPD file audio The Region Matching condition of fragment and/or the matching strategy of Multi-audio-frequency fragment.
In one possible implementation, the processing unit 1202 is according to the current visual angle range of user and described At least one audio space description information is specifically used for when determining the first audio fragment with the current visual angle commensurate in scope:
At least one of described MPD file audio fragment is obtained according at least one audio space description information to exist At least one of panoramic video associated region;
By sound corresponding with the associated region to match within the scope of the current visual angle at least one associated region Frequency division piece is determined as alternative audio fragment;
If only exist an alternative audio fragment, the alternative audio fragment is determined as the first audio fragment;
If there are when at least two alternative audio fragments, according to the matching strategy of the Multi-audio-frequency fragment, the first sound is determined Frequency division piece;
If there is no when alternative audio fragment, the default audio fragment of pre-configuration is set to the first audio fragment.
In one possible implementation, at least one associated region with phase within the scope of the current visual angle The associated region matched is associated region identical with the current visual angle range;Or,
Meet the associated region of the Region Matching condition with the current visual angle range.
In one possible implementation, the pass that the Region Matching condition is met with the current visual angle range Join region, including:
Fall into the associated region of the current visual angle range;Or,
With the current visual angle range associated region that match degree is greater than the preset threshold.
In one possible implementation, the processing unit 1202 is additionally operable to:
At least one audio fragment that the MPD file includes is downloaded to the client local, the client In the current visual angle range and at least one audio space description information, determination and the current visual angle range according to user After matched first audio fragment, from be downloaded in local at least one audio fragment obtain the first audio fragment into Row decoding plays.
The method that the function of above-mentioned each unit can be found in the execution of two client-side of embodiment, details are not described herein again.
It should be noted that being schematical, only a kind of logic function to the division of unit in the embodiment of the present invention It divides, formula that in actual implementation, there may be another division manner.In addition, each functional unit in each embodiment of the application can be with It is integrated in a processing unit, can also be that each unit physically exists alone, it can also two or more unit collection At in a unit.The form that hardware had both may be used in above-mentioned integrated unit is realized, SFU software functional unit can also be used Form realize.
It should be understood by those skilled in the art that, the embodiment of the present application can be provided as method, system or computer program production Product.Therefore, in terms of the embodiment of the present application can be used complete hardware embodiment, complete software embodiment or combine software and hardware Embodiment form.Moreover, it wherein includes computer available programs generation that the embodiment of the present application, which can be used in one or more, The meter implemented in the computer-usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) of code The form of calculation machine program product.
The embodiment of the present application is with reference to the method, equipment (system) and computer program product according to the embodiment of the present application Flowchart and/or the block diagram describe.It should be understood that can be realized by computer program instructions in flowchart and/or the block diagram The combination of flow and/or box in each flow and/or block and flowchart and/or the block diagram.These calculating can be provided Processing of the machine program instruction to all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices Device is to generate a machine so that the instruction executed by computer or the processor of other programmable data processing devices generates For realizing the function of being specified in one flow of flow chart or multiple flows and/or one box of block diagram or multiple boxes Device.
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that instruction generation stored in the computer readable memory includes referring to Enable the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one box of block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device so that count Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, in computer or The instruction executed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in a box or multiple boxes.
Obviously, those skilled in the art can carry out the embodiment of the present application various modification and variations without departing from this Shen Spirit and scope please.In this way, if these modifications and variations of the embodiment of the present application belong to the application claim and its wait Within the scope of technology, then the application is also intended to include these modifications and variations.

Claims (22)

1. a kind of matched method in audio & video visual angle, which is characterized in that including:
The first request that the display advertising for obtaining panoramic video that server reception client is sent describes MPD file disappears It ceases, the mark of the MPD file is carried in first request message;
The server returns to the MPD file according to the mark of the MPD file, to the client, in the MPD file Mark including at least one audio fragment and its corresponding audio space description information, the audio space description information are used for The associated region of at least one audio fragment is described.
2. the method as described in claim 1, which is characterized in that further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
3. method as claimed in claim 1 or 2, which is characterized in that the method further includes:
The server receives the second request message for obtaining video slicing that the client is sent, second request The mark of the video slicing is carried in message;
The server sends the video slicing according to the mark of the video slicing to the client.
4. method as claimed in claim 3, which is characterized in that the method further includes:
The server receive that the client sends for obtaining and the matched first audio fragment of the video slicing Third request message carries the mark of the first audio fragment in the third request message;
The server sends the first audio fragment according to the mark of the first audio fragment to the client.
5. a kind of matched method in audio & video visual angle, which is characterized in that including:
User end to server sends the first request message that MPD file is described for obtaining the display advertising of panoramic video, institute State the mark that the MPD file is carried in the first request message;
The client receives the MPD file of the server according to the identification feedback of the MPD file, the MPD texts Part includes the mark of at least one audio fragment and its corresponding spatial description information, and the audio space description information is used for The associated region of at least one of described MPD file audio fragment is described;
Current visual angle range and at least one audio space description information of the client according to user, determine with it is described First audio fragment of current visual angle commensurate in scope.
6. method as claimed in claim 5, which is characterized in that further include in the MPD file in the MPD file at least The Region Matching condition of one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
7. such as method described in claim 5 or 6, which is characterized in that the client according to the current visual angle range of user and At least one audio space description information determines the first audio fragment with the current visual angle commensurate in scope, including:
The client obtains at least one of MPD file audio according at least one audio space description information Fragment is at least one of panoramic video associated region;
The client by least one associated region with the associated region pair that matches within the scope of the current visual angle The audio fragment answered is determined as alternative audio fragment;
If only exist an alternative audio fragment, the alternative audio fragment is determined as the first audio fragment;
If there are when at least two alternative audio fragments, according to the matching strategy of the Multi-audio-frequency fragment, the first audio point is determined Piece;
If there is no when alternative audio fragment, the default audio fragment of pre-configuration is set to the first audio fragment.
8. the method for claim 7, which is characterized in that at least one associated region with the current visual angle model The associated region to match in enclosing is associated region identical with the current visual angle range;Or,
Meet the associated region of the Region Matching condition with the current visual angle range.
9. method as claimed in claim 8, which is characterized in that described to meet the Region Matching with the current visual angle range The associated region of condition, including:
Fall into the associated region of the current visual angle range;Or,
With the current visual angle range associated region that match degree is greater than the preset threshold.
10. method as claimed in claim 5, which is characterized in that the method further includes:
At least one audio fragment that the MPD file includes is downloaded to the client local by the client, described Client according to user current visual angle range and at least one audio space description information, determine and described work as forward sight After the matched first audio fragment of angular region, first audio is obtained from being downloaded in local at least one audio fragment Fragment is decoded broadcasting.
11. a kind of server, which is characterized in that including:
Receiving unit, the display advertising for obtaining panoramic video for receiving client transmission describe the first of MPD file Request message carries the mark of the MPD file in first request message;
Processing unit returns to the MPD file, the MPD texts for the mark according to the MPD file to the client Part includes the mark of at least one audio fragment and its corresponding audio space description information, the audio space description information Associated region for describing at least one audio fragment.
12. server as claimed in claim 11, which is characterized in that further include in the MPD file in the MPD file The Region Matching condition of at least one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
13. the server as described in claim 11 or 12, which is characterized in that the server further includes transmission unit,
The receiving unit is additionally operable to receive the second request message for obtaining video slicing that the client is sent, institute State the mark that the video slicing is carried in the second request message;
The transmission unit sends the video slicing for the mark according to the video slicing to the client.
14. server as claimed in claim 13, which is characterized in that the receiving unit is additionally operable to receive the client What is sent is used to obtain and the third request message of the matched first audio fragment of the video slicing, the third request message In carry the mark of the first audio fragment;
The transmission unit is additionally operable to the mark according to the first audio fragment, and first sound is sent to the client Frequency division piece.
15. a kind of client, which is characterized in that including:
Transmission unit, for sending the first request for describing MPD file for obtaining the display advertising of panoramic video to server Message carries the mark of the MPD file in first request message;
Receiving unit, the MPD file for receiving the server according to the identification feedback of the MPD file, the MPD File includes the mark of at least one audio fragment and its corresponding spatial description information, and the audio space description information is used In the associated region for describing at least one of MPD file audio fragment;
Processing unit, for according to user current visual angle range and at least one audio space description information, determine with First audio fragment of the current visual angle commensurate in scope.
16. client as claimed in claim 15, which is characterized in that further include in the MPD file in the MPD file The Region Matching condition of at least one audio fragment and/or the matching strategy of Multi-audio-frequency fragment.
17. the client as described in claim 15 or 16, which is characterized in that the processing unit is working as forward sight according to user Angular region and at least one audio space description information determine the first audio fragment with the current visual angle commensurate in scope When, it is specifically used for:
At least one of described MPD file audio fragment is obtained described according at least one audio space description information At least one of panoramic video associated region;
By audio corresponding with the associated region to match within the scope of the current visual angle at least one associated region point Piece is determined as alternative audio fragment;
If only exist an alternative audio fragment, the alternative audio fragment is determined as the first audio fragment;
If there are when at least two alternative audio fragments, according to the matching strategy of the Multi-audio-frequency fragment, the first audio point is determined Piece;
If there is no when alternative audio fragment, the default audio fragment of pre-configuration is set to the first audio fragment.
18. client as claimed in claim 17, which is characterized in that work as forward sight with described at least one associated region The associated region to match in angular region is associated region identical with the current visual angle range;Or,
Meet the associated region of the Region Matching condition with the current visual angle range.
19. client as claimed in claim 18, which is characterized in that described to meet the region with the current visual angle range The associated region of matching condition, including:
Fall into the associated region of the current visual angle range;Or,
With the current visual angle range associated region that match degree is greater than the preset threshold.
20. client as claimed in claim 15, which is characterized in that the processing unit is additionally operable to:
At least one audio fragment that the MPD file includes is downloaded to the client local, the client is in root Current visual angle range according to user and at least one audio space description information determine and the current visual angle commensurate in scope The first audio fragment after, obtain the first audio fragment from being downloaded in local at least one audio fragment and solved Code plays.
21. a kind of server, which is characterized in that including memory, processor and communication interface;Wherein,
The memory is for storing computer-readable program;
The processor is by running the program in the memory, to complete the method as described in Claims 1-4 is any;
The communication interface under the control of the processor for sending and receiving data.
22. a kind of client, which is characterized in that including memory, processor and communication interface;Wherein,
The memory is for storing computer-readable program;
The processor is by running the program in the memory, to complete the method as described in claim 5 to 10 is any;
The communication interface under the control of the processor for sending and receiving data.
CN201710289042.5A 2017-04-27 2017-04-27 Audio and video visual angle matching method, client and server Active CN108810567B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710289042.5A CN108810567B (en) 2017-04-27 2017-04-27 Audio and video visual angle matching method, client and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710289042.5A CN108810567B (en) 2017-04-27 2017-04-27 Audio and video visual angle matching method, client and server

Publications (2)

Publication Number Publication Date
CN108810567A true CN108810567A (en) 2018-11-13
CN108810567B CN108810567B (en) 2020-10-16

Family

ID=64070220

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710289042.5A Active CN108810567B (en) 2017-04-27 2017-04-27 Audio and video visual angle matching method, client and server

Country Status (1)

Country Link
CN (1) CN108810567B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840052A (en) * 2019-01-31 2019-06-04 成都超有爱科技有限公司 A kind of audio-frequency processing method, device, electronic equipment and storage medium
CN110139065A (en) * 2019-01-30 2019-08-16 北京车和家信息技术有限公司 Method for processing video frequency, video broadcasting method and relevant device
CN111107398A (en) * 2019-12-27 2020-05-05 深圳市小溪流科技有限公司 Streaming media data transmission method and receiving method, and electronic device
CN113411684A (en) * 2021-06-24 2021-09-17 广州酷狗计算机科技有限公司 Video playing method and device, storage medium and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102148851A (en) * 2010-09-30 2011-08-10 华为技术有限公司 Method and device for applying parental controls in adaptive hyper text transport protocol (HTTP) streaming transmission
US20140365759A1 (en) * 2013-06-06 2014-12-11 Futurewei Technologies, Inc. Signaling and Carriage of Protection and Usage Information for Dynamic Adaptive Streaming
CN105979470A (en) * 2016-05-30 2016-09-28 北京奇艺世纪科技有限公司 Panoramic video audio frequency processing method, panoramic video audio frequency processing device, and playing system
WO2017022467A1 (en) * 2015-08-06 2017-02-09 ソニー株式会社 Information processing device, information processing method, and program
CN106572359A (en) * 2016-10-27 2017-04-19 乐视控股(北京)有限公司 Method and device for synchronously playing panoramic video on multiple terminals

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102148851A (en) * 2010-09-30 2011-08-10 华为技术有限公司 Method and device for applying parental controls in adaptive hyper text transport protocol (HTTP) streaming transmission
US20140365759A1 (en) * 2013-06-06 2014-12-11 Futurewei Technologies, Inc. Signaling and Carriage of Protection and Usage Information for Dynamic Adaptive Streaming
WO2017022467A1 (en) * 2015-08-06 2017-02-09 ソニー株式会社 Information processing device, information processing method, and program
CN105979470A (en) * 2016-05-30 2016-09-28 北京奇艺世纪科技有限公司 Panoramic video audio frequency processing method, panoramic video audio frequency processing device, and playing system
CN106572359A (en) * 2016-10-27 2017-04-19 乐视控股(北京)有限公司 Method and device for synchronously playing panoramic video on multiple terminals

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110139065A (en) * 2019-01-30 2019-08-16 北京车和家信息技术有限公司 Method for processing video frequency, video broadcasting method and relevant device
CN109840052A (en) * 2019-01-31 2019-06-04 成都超有爱科技有限公司 A kind of audio-frequency processing method, device, electronic equipment and storage medium
CN111107398A (en) * 2019-12-27 2020-05-05 深圳市小溪流科技有限公司 Streaming media data transmission method and receiving method, and electronic device
CN113411684A (en) * 2021-06-24 2021-09-17 广州酷狗计算机科技有限公司 Video playing method and device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN108810567B (en) 2020-10-16

Similar Documents

Publication Publication Date Title
JP6436320B2 (en) Live selective adaptive bandwidth
EP3459252B1 (en) Method and apparatus for spatial enhanced adaptive bitrate live streaming for 360 degree video playback
CN105357542B (en) Live broadcasting method, apparatus and system
CN108810567A (en) A kind of matched method in audio &amp; video visual angle, client and server
WO2017193576A1 (en) Video resolution adaptation method and apparatus, and virtual reality terminal
WO2018171487A1 (en) Panoramic video playback method and client terminal
CN103974135B (en) A kind of video sharing method and system
CN107888987B (en) Panoramic video playing method and device
CN108616557B (en) Panoramic video transmission method, device, terminal, server and system
CN109155873B (en) Method, apparatus and computer program for improving streaming of virtual reality media content
CN107040794A (en) Video broadcasting method, server, virtual reality device and panoramic virtual reality play system
CN109792544A (en) Method and apparatus for spreading defeated panoramic video
CN104012106A (en) Aligning videos representing different viewpoints
CN105635675B (en) A kind of panorama playing method and device
US11095936B2 (en) Streaming media transmission method and client applied to virtual reality technology
CN108737882A (en) Display methods, device, storage medium and the electronic device of image
CN109286855A (en) Transmission method, transmitting device and the Transmission system of panoramic video
CN110149542A (en) Transfer control method
KR20190062565A (en) Spatially uneven streaming
CN108668138A (en) A kind of method for downloading video and user terminal
CN107087214A (en) Realize method, client and system that streaming medium content speed is played
CN107438203A (en) For establishing the method and the network equipment of inventory
CN108574881A (en) A kind of projection type recommends method, server and client
CN107707830B (en) Panoramic video playing and photographing system based on one-way communication
CN108810600B (en) Video scene switching method, client and server

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant