WO2022042382A1 - Media resource playback method and related apparatus - Google Patents

Media resource playback method and related apparatus

Publication number
WO2022042382A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
resource
media resource
data
server
Prior art date
Application number
PCT/CN2021/113116
Other languages
English (en)
French (fr)
Inventor
杨国忠
吴康华
王栋
吴诗生
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to EP21860220.9A (patent EP4192019A4)
Publication of WO2022042382A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1069 Session establishment or de-establishment
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1083 In-session procedures
    • H04L65/1089 In-session procedures by adding media; by removing media
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1096 Supplementary features, e.g. call forwarding or call holding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/42017 Customized ring-back tones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/42025 Calling or Called party identification service
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/239 Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests
    • H04N21/2393 Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests involving handling client requests
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M2201/00 Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/50 Telephonic communication in combination with video communication
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M7/00 Arrangements for interconnection between switching centres
    • H04M7/006 Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer

Definitions

  • the present application relates to the field of communication technologies, and in particular, to a method and related apparatus for playing media resources.
  • VOLTE voice over long term evolution
  • the media resource playback method is usually based on the CT (communication technology) domain, which may also be called the telecommunication domain. The corresponding process may be: the calling terminal initiates a call to the called terminal; after the calling terminal rings, the media server in the CT domain pulls the media resources corresponding to the user subscription information from the media resource server, and then instructs the media resource server to start playing the media resources to the calling terminal.
  • CT communication technology
  • the media server in the CT domain plays the media resources in the CT domain for the user during the call phase, and only the picture of the media resource can be displayed on the call interface; the playback form is limited, and the user experience is poor.
  • the embodiments of the present application provide a media resource playing method and related apparatus, which can realize media interaction in a color ring back tone (CRBT) scenario, enrich the experience of CRBT users, and make video CRBT services more engaging.
  • the technical solutions of the media resource playback method and related device are as follows:
  • a method for playing media resources is provided, which is applied to a calling terminal.
  • the implementation process of the method may be as follows:
  • the interaction data of the first media resource is obtained through the second media server, and further, the calling terminal can add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet comments (barrage), and animation effects, while playing the first media resource. This realizes media interaction in CRBT scenarios, enriches the experience of CRBT users, and makes video CRBT services more engaging.
  • the video CRBT service can be expanded more flexibly, such as video overlay playback, video interface interaction, and video content scrolling switching, etc., increasing the interest of the video CRBT service.
  • the first media server is a server located in the communication technology (CT) domain
  • the second media server is a server located in the Internet technology (IT) domain.
  • the interactive data in the IT domain is used to supplement the playback of the media resources in the CT domain, which improves the interest and flexibility of the video CRBT service.
  • determining the resource information of the first media resource provided by the first media server includes any one of the following: obtaining the resource information of the first media resource from a first media negotiation message sent by the first media server; or obtaining the resource information carried by the first media resource from the header of the media stream of the first media resource transmitted by the first media server.
  • multiple manners for acquiring resource information of the first media resource are provided, which improves the flexibility of the implementation manner.
  • the first media negotiation message is an Update message
  • the resource information is located in the session description protocol (SDP) information of the Update message.
  • the resource information is located in the supplemental enhancement information (SEI) in the header of the media stream.
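The first of these two options, reading the resource information out of the SDP body of the Update message, can be sketched as follows. This is only an illustration: the text does not say how the resource information is encoded in the SDP, so the attribute name `a=x-media-resource-id` and the sample SDP values are hypothetical.

```python
from typing import Optional

# Hypothetical attribute name; the source does not specify how the
# resource information is encoded inside the SDP body.
RESOURCE_ATTR = "a=x-media-resource-id:"


def extract_resource_info(sdp: str) -> Optional[str]:
    """Return the resource identifier carried in an SDP body, if any."""
    for line in sdp.splitlines():
        line = line.strip()
        if line.startswith(RESOURCE_ATTR):
            # Everything after the attribute name is treated as the identifier.
            return line[len(RESOURCE_ATTR):].strip() or None
    return None


# Made-up SDP body such as might be carried in an Update message.
EXAMPLE_SDP = "\r\n".join([
    "v=0",
    "o=- 0 0 IN IP4 192.0.2.10",
    "s=-",
    "c=IN IP4 192.0.2.10",
    "t=0 0",
    "m=video 49170 RTP/AVP 96",
    "a=rtpmap:96 H264/90000",
    "a=x-media-resource-id:crbt-0001",
])
```

The calling terminal would then pass the extracted identifier to the target application client so that it can be placed in the interaction data acquisition request.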
  • the interaction data is interaction content data
  • before sending the interaction data acquisition request to the second media server through the target application client, the method further includes:
  • the display data of the interactive control is first obtained from the second media server, and then the interactive content data is obtained from the second media server according to the resource information.
  • the display data of the interactive control can be obtained in advance, and then the interactive content data is obtained, so that when the first media resource is played subsequently, it is no longer necessary to acquire the display data of the interactive control from the second media server, which reduces the amount of data acquired and the display delay.
  • the interactive data includes interactive content data and display data of interactive controls.
  • the interactive content data and the display data of the interactive controls are acquired at one time, which ensures the synchronization of data acquisition and display.
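The first acquisition strategy above (fetch the control display data once, then request only the content data for each resource) can be sketched with an in-memory stand-in for the second media server. All class and method names here are hypothetical; the source does not define a concrete interface.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class FakeSecondMediaServer:
    """In-memory stand-in for the second media server (illustrative only)."""
    controls: List[str]
    content: Dict[str, dict]
    requests: List[str] = field(default_factory=list)

    def get_display_data(self) -> List[str]:
        self.requests.append("display")
        return list(self.controls)

    def get_content_data(self, resource_info: str) -> dict:
        self.requests.append(f"content:{resource_info}")
        return dict(self.content.get(resource_info, {}))


@dataclass
class TargetAppClient:
    """Hypothetical client that caches the display data of interactive controls."""
    server: FakeSecondMediaServer
    _display_cache: Optional[List[str]] = None

    def prefetch_display_data(self) -> None:
        # Step 1: fetch control display data once, before any media plays.
        if self._display_cache is None:
            self._display_cache = self.server.get_display_data()

    def interaction_data(self, resource_info: str) -> dict:
        # Step 2: per resource, only the content data is requested.
        self.prefetch_display_data()
        return {
            "controls": self._display_cache,
            "content": self.server.get_content_data(resource_info),
        }
```

With this arrangement the display data is requested once no matter how many resources are played; the one-shot strategy would instead return both parts in a single response per resource.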
  • the method further includes: if a triggering operation for any interaction control is detected, sending, to the second media server, an interaction request corresponding to the interaction control, where the interaction request is used to implement interaction based on the first media resource.
  • This implementation provides the interactive function in the IT domain based on the interactive controls, enriches the visual content of the media resources in the CT domain, and improves the experience of the CRBT service.
  • the method further includes: if an off-hook message of the called terminal is received, closing the display of the interaction data of the first media resource.
  • the method further includes: if an off-hook message of the called terminal is received, stopping playback of the first media resource and displaying the stop screen of the first media resource.
  • the display of the stop screen is reserved, which can provide an entry for subsequent operations.
  • the method further includes: if an operation of continuing playback on the stop screen is detected, obtaining a third media resource from the second media server and playing the third media resource, where the third media resource matches the first media resource.
  • the calling user can click the stop screen during the call or at any time after the call ends, and can obtain media resources with the same media content from the IT domain, providing a continuous and complete audio-visual experience.
  • the method further includes: if a triggering operation on the stop screen is detected, displaying the portal website interface in the opened target application client.
  • if the calling user wants to access the portal website, this can be done by clicking the stop screen, which is convenient and simple to operate.
  • the method further includes: if an operation of closing the stop screen or turning off the screen is detected, closing the target application client.
  • the method further includes: sending a session end message to the second media server, where the session end message is used to indicate session release.
  • the second media server releases the preloaded interactive content data of the second media resource to save memory space.
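The off-hook and stop-screen handling described above can be summarized as a small state update on the calling terminal. This is a sketch under assumptions: the event names are hypothetical, and combining the separate optional steps (closing the client and then sending the session end message) into one handler is an illustrative choice, not something the text prescribes.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CallUiState:
    playing: bool = True              # first media resource is playing
    overlay_visible: bool = True      # interaction data overlay is shown
    stop_screen_visible: bool = False
    client_open: bool = True          # target application client is open
    sent: List[str] = field(default_factory=list)  # messages to the second media server


def on_event(state: CallUiState, event: str) -> CallUiState:
    """Apply one (hypothetically named) event to the UI state."""
    if event == "off_hook":
        # Called terminal answered: stop playback, hide the overlay,
        # and keep a stop screen as an entry for later operations.
        state.playing = False
        state.overlay_visible = False
        state.stop_screen_visible = True
    elif event in ("close_stop_screen", "screen_off") and state.stop_screen_visible:
        # Closing the stop screen closes the client; the session end
        # message lets the server release the session (assumed ordering).
        state.stop_screen_visible = False
        state.client_open = False
        state.sent.append("session_end")
    return state
```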
  • the method further includes: acquiring a second media resource from the second media server through the target application client;
  • the playing the first media resource sent by the first media server includes: playing the first media resource in a full-screen mode;
  • the method further includes: playing the second media resource muted in a floating window on the playing screen of the first media resource.
  • the method further includes: playing the second media resource in a full-screen mode;
  • the playing of the first media resource sent by the first media server includes: playing the first media resource muted in a floating window on the playing screen of the second media resource.
  • the playing the first media resource sent by the first media server includes: playing the first media resource in the form of a floating window;
  • the method further includes: playing the second media resource in the form of a floating window.
  • the above process provides a variety of different playing modes when playing media resources in the IT domain and CT domain at the same time, provides the user with an intuitive interface display effect, and also allows the user to select the media resources. Further, one media resource is displayed in full screen and the other is displayed in a floating window, which also forms a picture-in-picture effect, which provides a better visual experience without causing visual confusion.
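The three playing modes can be written down as a small lookup. This is only a sketch: the mode names are hypothetical, treating the full-screen resource as unmuted is an assumption, and the text does not say which resource (if any) is muted when both are shown in floating windows, so that case is left as `None`.

```python
from enum import Enum
from typing import Dict


class PlayMode(Enum):
    FIRST_FULL_SCREEN = "first_full_screen"    # CT-domain resource fills the screen
    SECOND_FULL_SCREEN = "second_full_screen"  # IT-domain resource fills the screen
    BOTH_FLOATING = "both_floating"            # both shown in floating windows


def window_plan(mode: PlayMode) -> Dict[str, dict]:
    """Return window type and mute flag for the first and second media resources."""
    if mode is PlayMode.FIRST_FULL_SCREEN:
        return {
            "first": {"window": "full_screen", "muted": False},  # unmuted is assumed
            "second": {"window": "floating", "muted": True},
        }
    if mode is PlayMode.SECOND_FULL_SCREEN:
        return {
            "first": {"window": "floating", "muted": True},
            "second": {"window": "full_screen", "muted": False},  # unmuted is assumed
        }
    # Both floating: the source does not state the mute behaviour.
    return {
        "first": {"window": "floating", "muted": None},
        "second": {"window": "floating", "muted": None},
    }
```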
  • a placement slot is also provided for operations: an advertiser can push media resources to the terminal by placing them on the first media server or the second media server, thereby achieving the operational purpose.
  • the interaction data includes at least one of homepage visit data, like data, comment data, sharing data, and download data.
  • the interactive control includes at least one of a homepage access control, a like control, a comment control, a share control, and a download control.
  • a media resource playback method is provided, applied in the second media server, and the implementation process of the method can be:
  • the technical solution provided by the embodiment of the present application obtains the interaction data of the first media resource through the second media server, and further, the calling terminal can add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet comments (barrage), and animation effects, while playing the first media resource. This realizes media interaction in the CRBT scene, enriches the experience of CRBT users, and makes video CRBT services more engaging.
  • the video CRBT service can be expanded more flexibly, such as video overlay playback, video interface interaction, and video content scrolling switching, etc., making the video CRBT service more interesting.
  • determining the interaction data of the first media resource includes: determining address information corresponding to the resource information, where the address information is used to provide the interaction data;
  • the returning the interaction data to the calling terminal includes: returning the address information to the calling terminal.
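On the server side, returning address information instead of the data itself amounts to a lookup keyed by the resource information. The following is a minimal sketch; the table contents, the field names, and the URL are hypothetical placeholders.

```python
from typing import Dict, Optional

# Hypothetical lookup table: resource information -> address that
# provides the interaction data for that resource.
ADDRESS_TABLE: Dict[str, str] = {
    "crbt-0001": "https://it-media.example/interaction/crbt-0001",
}


def handle_interaction_data_request(request: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Resolve the resource information in a request to address information."""
    resource_info = request.get("resource_info")
    address = ADDRESS_TABLE.get(resource_info) if resource_info else None
    return {"resource_info": resource_info, "address": address}
```

The calling terminal would then fetch the interaction data from the returned address; an `address` of `None` stands for "no interaction data is available for this resource".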
  • the interaction data is interaction content data
  • the method before receiving the interaction data acquisition request sent by the calling terminal through the target application client, the method further includes:
  • the display data of the interactive controls is extracted and obtained through the second media server, which ensures the timely display of the interactive controls when the subsequent media resources are played, and avoids the problem of display delay.
  • the method further includes: preloading the display data of the interactive control.
  • the above preloading method reduces the delay of subsequent interactive control display, improves the efficiency of display data acquisition, and avoids long control display times when the media resources are played subsequently.
  • the display data acquisition request also carries relevant information of the call that the calling terminal participated in.
  • after receiving the display data acquisition request sent by the calling terminal through the target application client, the method further includes: based on the relevant information of the call, performing legality authentication and functional service authentication on the user.
  • at least one of the legitimacy and validity of the user is verified by authenticating the user, so as to ensure the safe display of the interactive control during subsequent media resource playback and improve the security and reliability of interactive control display in the IT domain.
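The two checks can be pictured as a pair of gates applied to the call information in the display data acquisition request. This is a sketch under assumptions: the user table, the `calling_number` field, and the exact meaning of each check are hypothetical, since the text only names "legality authentication" and "functional service authentication".

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class UserRecord:
    active: bool               # account is valid (legality check)
    interaction_enabled: bool  # subscribed to the interaction data service


# Hypothetical user table keyed by calling number.
USERS: Dict[str, UserRecord] = {
    "+10000000001": UserRecord(active=True, interaction_enabled=True),
    "+10000000002": UserRecord(active=True, interaction_enabled=False),
}


def authenticate(call_info: Dict[str, str]) -> Tuple[bool, str]:
    """Run the legality check first, then the functional service check."""
    user = USERS.get(call_info.get("calling_number", ""))
    if user is None or not user.active:
        return False, "legality check failed"
    if not user.interaction_enabled:
        return False, "service not subscribed"
    return True, "ok"
```

Only when both checks pass would the server go on to return the display data of the interactive controls.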
  • the method further includes: receiving a resource acquisition request sent by the calling terminal, where the resource acquisition request carries resource information of the first media resource ; based on the resource information, determine the third media resource corresponding to the resource information, the third media resource matches the first media resource; return the third media resource to the calling terminal.
  • the calling user can click the stop screen during the call or at any time after the call ends, and can obtain media resources with the same media content from the IT domain, providing a continuous and complete audio-visual experience.
  • the method further includes: establishing a session with the calling terminal based on the third media resource, where the session is used to record playback information of the third media resource.
  • the method further includes: if a session end message of the calling terminal is received, releasing the session with the calling terminal, where the session end message is used to indicate the end of the session. By releasing the session in time, waste of server resources can be avoided.
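The session lifecycle on the second media server (establish, record playback information, release on the session end message) can be sketched as a small in-memory store. The class names and the choice of playback position as the recorded information are assumptions for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Session:
    terminal_id: str
    resource_id: str
    position_s: float = 0.0  # last reported playback position, in seconds


@dataclass
class SessionStore:
    """Hypothetical in-memory session table on the second media server."""
    _sessions: Dict[str, Session] = field(default_factory=dict)

    def establish(self, terminal_id: str, resource_id: str) -> Session:
        session = Session(terminal_id, resource_id)
        self._sessions[terminal_id] = session
        return session

    def record_progress(self, terminal_id: str, position_s: float) -> None:
        session = self._sessions.get(terminal_id)
        if session is not None:
            session.position_s = position_s

    def on_session_end(self, terminal_id: str) -> bool:
        # Release the session so that server resources are not held needlessly.
        return self._sessions.pop(terminal_id, None) is not None

    def active_count(self) -> int:
        return len(self._sessions)
```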
  • a device for playing media resources is provided, which is used for executing the above method for playing media resources.
  • the apparatus for playing media resources includes a functional module for executing the method for playing media resources provided in the first aspect or in any optional manner of the first aspect.
  • a media server is provided for executing the above method for playing media resources.
  • the media server includes a functional module for executing the media resource playing method provided in the second aspect or any optional manner of the second aspect.
  • a method for playing media resources comprising:
  • the calling terminal sends a call request to the called terminal, and the call request passes through the first media server; the calling terminal determines the resource information of the first media resource provided by the first media server; the calling terminal sends, through the target application client, an interaction data acquisition request to the second media server, where the interaction data acquisition request carries the resource information of the first media resource;
  • the second media server receives the interaction data acquisition request sent by the calling terminal through the target application client, where the interaction data acquisition request carries the resource information of the first media resource; the second media server determines the interaction data of the first media resource based on the resource information; the second media server returns the interaction data to the calling terminal.
  • the calling terminal receives the interaction data of the first media resource returned by the second media server based on the resource information; receives and plays the first media resource sent by the first media server; and displays the interaction data of the first media resource on the playback screen of the first media resource.
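The message order of this method can be traced with a short simulation. It is only a sketch of the sequence described above: the message descriptions are paraphrases, the interaction database is a plain dictionary, and no real signaling or media transport is modeled.

```python
from typing import Dict, List, Tuple

Message = Tuple[str, str, str]  # (sender, receiver, description)


def playback_flow(resource_info: str,
                  interaction_db: Dict[str, dict]) -> Tuple[List[Message], dict]:
    """Trace the exchange between the terminal and the two media servers."""
    log: List[Message] = []
    # The call request passes through the first media server (CT domain).
    log.append(("calling terminal", "first media server",
                "call request to called terminal"))
    # The terminal learns the resource information of the first media resource.
    log.append(("first media server", "calling terminal",
                f"resource information: {resource_info}"))
    # The target application client asks the second media server (IT domain).
    log.append(("calling terminal", "second media server",
                f"interaction data acquisition request [{resource_info}]"))
    interaction = interaction_db.get(resource_info, {})
    log.append(("second media server", "calling terminal", "interaction data"))
    # The first media resource itself still arrives from the CT domain.
    log.append(("first media server", "calling terminal", "first media resource"))
    # The interaction data is overlaid on the playback screen.
    screen = {"playing": resource_info, "overlay": interaction}
    return log, screen
```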
  • in a sixth aspect, a media resource playback system is provided. The system includes a calling terminal, a first media server, and a second media server, which are used to execute the media resource playback method provided by the fifth aspect.
  • in a seventh aspect, a terminal is provided. The terminal includes a processor and a memory, the memory stores at least one piece of program code, and the program code is loaded and executed by the processor to implement the media resource playback method provided in the first aspect or any optional manner of the first aspect.
  • in an eighth aspect, a server is provided. The server includes a processor and a memory, the memory stores at least one piece of program code, and the program code is loaded and executed by the processor to implement the media resource playback method provided in the second aspect or any optional manner of the second aspect.
  • in a ninth aspect, a computer program product is provided that, when run on a computer, causes the computer to perform some or all of the steps of any method in the first aspect, the second aspect, or any optional manner of the first aspect and the second aspect.
  • a computer storage medium is provided, in which at least one piece of program code is stored, and the program code is loaded and executed by a processor to implement the media resource playback method in the first aspect, the second aspect, or any optional manner of the first aspect and the second aspect.
  • FIG. 1 is an architecture diagram of a media resource system provided by an embodiment of the present application.
  • FIG. 2 is a schematic diagram of a service provided by a video CRBT cloud platform in different domains according to an embodiment of the present application
  • FIG. 3 is an architecture diagram of a media resource system provided by an embodiment of the present application.
  • FIG. 4 is a system architecture diagram of a terminal provided by an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a server provided by an embodiment of the present application.
  • FIG. 7 is a flowchart of a method for playing media resources provided by an embodiment of the present application.
  • FIG. 8 is a schematic diagram of playback of a media resource provided by an embodiment of the present application.
  • FIG. 9 is a schematic diagram of an Update message carrying resource information provided by an embodiment of the present application.
  • FIG. 10 is a schematic diagram of resource information carried by a media stream provided by an embodiment of the present application.
  • FIG. 11 is a flowchart of a method for playing media resources provided by an embodiment of the present application.
  • FIG. 12 is a schematic diagram of playing a media resource provided by an embodiment of the present application.
  • FIG. 13 is a schematic diagram of playing a media resource provided by an embodiment of the present application.
  • FIG. 15 is a flowchart of a method for processing playback pause based on media resource playback provided by an embodiment of the present application
  • FIG. 16 is a schematic diagram of playing a media resource provided by an embodiment of the present application.
  • FIG. 17 is a schematic diagram of playing a media resource provided by an embodiment of the present application.
  • FIG. 19 is a schematic structural diagram of an apparatus for playing media resources provided by an embodiment of the present application.
  • FIG. 20 is a schematic structural diagram of a media server provided by an embodiment of the present application.
  • the embodiments of the present application may be applicable to 4th generation (4G), 5th generation (5G) mobile communication network architectures or future networks.
  • 4G 4th generation
  • 5G 5th generation
  • a 4G-based VoLTE network is used as an example to illustrate the network architecture and method flow of the solution.
  • FIG. 1 is an architecture diagram of a media resource system provided by an embodiment of the present application.
  • the system may include a first media server in the CT domain, a second media server in the IT domain, a calling terminal, and a called terminal.
  • the CT domain may also be referred to as a telecommunication domain, and the CT domain implements communication through an evolved packet core (EPC), an Internet protocol multimedia subsystem (IMS) domain core network, etc.
  • the IMS domain core network includes several application servers (application servers, AS), such as a first media server.
  • the first media server is used to provide the terminal with playback of the first media resource.
  • when providing the video CRBT service, the first media server is also called the video CRBT platform.
  • the first media server may include a media application server and a media resource subsystem (MRS).
  • MRS media resource subsystem
  • the media resource server is also called a ringback tone platform, and the media resource server is used to provide media resources such as video color ringtones, video color ringtones, video advertisements, and video customer service.
  • the media resource server produces and manages the above-mentioned media resources.
  • the media application server and the media resource server can be co-located or physically separated.
  • the media application server processes session initiation protocol (session initiation protocol, SIP) signaling messages, and the media resource server provides audio streams and/or video streams to the calling terminal and/or the called terminal.
  • SIP session initiation protocol
  • the IMS domain core network also includes: serving-call session control function (S-CSCF) equipment, interrogating-call session control function (I-CSCF) equipment, proxy-call session control function (P-CSCF) equipment, home subscriber server (HSS) equipment, session border controller (SBC) equipment, and several application servers, such as a telephony application server (TAS), a multimedia telephony application server (MMTel AS), and a service centralization and continuity application server (SCC AS).
  • S-CSCF serving-call session control function
  • I-CSCF interrogating-call session control function
  • P-CSCF proxy-call session control function
  • HSS home subscriber server
  • SBC session border controller
  • the I-CSCF equipment may be co-located with the S-CSCF equipment, which may be referred to as "I/S-CSCF” equipment for short.
  • the SBC equipment and the P-CSCF equipment can be set together, which can be referred to as "SBC/P-CSCF” equipment for short.
  • the EPC may include a packet data network gateway (PGW) device, a serving gateway (SGW) device, and a mobility management entity (MME) device.
  • the S/P-GW device is used to provide the functions of the serving gateway and the logical entity of the packet data network gateway.
  • the SGW is the anchor point of local mobility, mainly facing the wireless access network for data transmission on the service plane
  • the P-GW is the anchor point of the EPS, which is mainly oriented to other data networks and realizes access and interaction with multiple public data networks.
  • the SGW device may be used for the connection between the IMS core network and the wireless network
  • the PGW device may be used for the connection between the IMS core network and an Internet Protocol (IP) network.
  • IP Internet Protocol
  • the MME device is the core device of the EPC network and is used to provide the functions of the MME logical entity.
  • the first media server when the first media server interacts with the calling terminal, its message data flow is a CT domain signaling flow, that is, its control message is a SIP message.
  • the media data stream is the CT domain media stream, that is, the RTP media stream.
  • the terminal accesses the IMS domain core network through the S/P-GW in the EPC to access the first media server. For example, during the call process, the calling terminal exchanges SIP messages with the network devices in the IMS domain core network.
  • the first media server and the calling terminal perform media negotiation and other processes. If there are media resources in the CT domain for this call, the first media server pushes the first media resource to the calling terminal in the form of an RTP media stream.
  • the IT domain may also be referred to as the Internet domain.
  • the second media server is used to provide users with interactive data services, and can also provide services such as media resource playback, as well as a visual interface entry, which is convenient for providing users with services such as media resource setting and management.
  • the first media server further provides a setting function, and the user sets through the target application client to obtain setting information for indicating personalized display such as overlay style.
  • the second media server stores user information, or is associated with a database, and the database is used to store user information, display data of interactive controls, interactive content data of media resources, and media resources.
  • the interaction database is associated with the second media server as an example for description.
  • a target application client such as a video CRBT application
  • the calling terminal can obtain the interactive data service provided by the second media server through the target application client, that is, acquire the interaction data of the media resources provided by the second media server and display it based on the playback of the media resources during the call.
  • the second media server provides interactive data services, and the user can subscribe to the interactive data service through the target application client.
  • the target application client accesses the portal website of the second media server, and operates on the portal website to subscribe.
  • the second media server stores the user information in the interaction database, so as to provide an interaction data service based on the stored information subsequently.
  • the second media server also provides a media resource subscription service to subscribe to the media resources played during the call, so that the terminal obtains and plays the media resources through the target application client during the call.
  • the transmission protocol used is the hypertext transfer protocol (HTTP) or the hypertext transfer protocol over secure socket layer (HTTPS). In this embodiment of the present application, the second media server supports multiple transmission protocols, so signaling interaction based on media resource playback can be implemented more conveniently and flexibly than with the single signaling stream protocol of the CT domain.
  • the transmission protocol is the real time messaging protocol (RTMP) or Http Flv (hypertext transfer protocol flash video), a streaming protocol that reuses HTTP.
  • the second media server supports multiple transmission protocols, which can more conveniently and flexibly implement the playback of media resources compared to a single signaling stream protocol in the CT domain.
  • the signaling stream or media stream of the IT domain is used.
  • the calling terminal interacts with the first media server
  • the signaling stream or media stream in the CT domain is used.
  • the interactive control messages are HTTP messages, HTTPS messages, HTTP responses or HTTPS responses
  • the message data flow is the IT domain signaling flow.
  • HTTP Flow hypertext markup language flow
  • the media data stream is an IT domain media stream, for example, RTMP media stream, Http Flv media stream.
  • the gateway may include a packet data network gateway (PGW) device and a serving gateway (SGW) device.
  • the PGW device and the SGW device may be set together, which may be referred to as "S/P-GW" device for short.
  • the corresponding request is sent to the S/P-GW through the target application client, and the S/P-GW sends the corresponding request to the second media server, and the calling terminal performs the calling process in the CT domain
  • the target application client accesses the second media server through the IT network and obtains the signaling stream of the IT domain, and then displays the interactive controls and the corresponding interactive content data on the playback screen of the CT domain media resources during the ringing stage, thereby realizing an interactive function on top of the CT domain CRBT service.
  • the playback of media resources in the IT domain is carried out during the call process in the CT domain, and its acquisition is triggered by the call process in the CT domain.
  • the interaction between the calling terminal and the second media server is carried out through the IT domain; therefore, the call flow in the CT domain is not affected, and the normal operation of the call can be guaranteed.
  • the first media server provided in the embodiment of the present application is used to provide the operator with related services such as advertisement placement and corporate promotion, for example, personal video CRBT, enterprise video CRBT, media advertisement, fixed-line video CRBT, caring video CRBT, and scenario video CRBT.
  • the terminal accesses the first media server through the CT domain, and the terminal and the first media server interact based on the CT domain signaling flow. For example, SIP messages are used for signaling interaction, and the first media server sends the media resource to the terminal in the form of a CT domain media stream, for example, the first media resource is sent by using an RTP media stream.
  • the second media server can provide interactive services, such as any of content yellow pages, like services, comment services, download services, sharing services, and more content services.
  • the second media server can also provide the video CRBT service of the IT domain in the ringing stage; hereinafter, the second media resource denotes the media resource of the IT domain.
  • the terminal accesses the second media server through the IT domain, and the terminal and the second media server interact based on the signaling flow in the IT domain. For example, HTTP messages, HTTPS messages, HTTP responses or HTTPS responses are used for signaling interaction.
  • the second media server sends the media resource to the terminal in the form of an IT domain media stream, for example, it uses an RTMP media stream to send the second media resource.
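The division of protocols between the two domains described above can be summarized in a small lookup table. The following is only a minimal sketch: the protocol names come from the description, while `DomainProfile`, `DOMAINS`, and `domain_for_signaling` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainProfile:
    """Protocols a terminal uses with the media server of one domain."""
    server: str
    signaling: tuple  # control-message protocols
    media: tuple      # media-stream protocols


# Protocol values follow the description; the structure itself is illustrative.
DOMAINS = {
    "CT": DomainProfile("first media server", ("SIP",), ("RTP",)),
    "IT": DomainProfile("second media server", ("HTTP", "HTTPS"), ("RTMP", "Http Flv")),
}


def domain_for_signaling(protocol: str) -> str:
    """Return the domain whose signaling flow uses the given protocol."""
    for name, profile in DOMAINS.items():
        if protocol.upper() in profile.signaling:
            return name
    raise ValueError(f"unknown signaling protocol: {protocol}")
```

Because the two domains use disjoint signaling protocols, a request can be attributed to one domain from its protocol alone, which mirrors the point that IT domain interaction does not disturb the CT domain call flow.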
  • FIG. 3 is an architecture diagram of a media resource system provided by an embodiment of the present application, and the system includes an access network or metropolitan area network, a second media server located in the IT domain, a first media server located in the CT domain, a calling terminal, and a called terminal.
  • the access network or metropolitan area network includes a broadband remote access server (BRAS) and a router. The BRAS is a network device that handles access, authentication, billing, control, and management of broadband network users across the various broadband access methods.
  • a router is a device that connects each local area network and wide area network. It can automatically select and set routes according to channel conditions, and transmit messages through the best path. For other network devices, refer to the embodiment shown in FIG. 1 .
  • the resource acquisition request sent by the terminal through the target application client is forwarded by the BRAS and the router to the second media server, so that the terminal accesses the second media server and the media resource playback provided by the embodiments of the present application is implemented.
  • the terminals involved in the embodiments of this application are devices with wireless transceiver functions, which can be deployed on land, including indoor or outdoor, handheld or vehicle-mounted; can also be deployed on water (such as ships, etc.); can also be deployed in the air (eg airplanes, balloons, and satellites, etc.).
  • the above-mentioned terminal may be a terminal device that can be connected to a mobile network, such as a mobile phone, a tablet computer (pad), a computer with a wireless transceiver function, a virtual reality (VR) terminal, an augmented reality (AR) terminal, a wireless terminal in industrial control, a wireless terminal in self-driving, a wireless terminal in remote medical care, a wireless terminal in a smart grid, a wireless terminal in transportation safety, a wireless terminal in a smart city, a wireless terminal in a smart home, and so on.
  • the terminal may also be a terminal device that can access a fixed network, such as a wired telephone, etc.; the terminal may also be a soft terminal corresponding to an application software with a calling function.
  • the terminal may include: applications, an application framework, a hardware abstraction layer (HAL), libraries, and the Linux kernel.
  • the applications (applications) include a telephone dialing application (dialer), a video color ringtone application (video RBT), and the like.
  • the phone dialing application has the function of calling and dialing.
  • the video CRBT application is also the above target application, and the target application client is also the video CRBT application client.
  • the application framework includes a window management module (window manager), a call management module (telephone manager), a resource management module (resource manager), and so on.
  • the hardware abstraction layer is an interface layer between an operating system (linux) kernel and a hardware circuit, and is used to abstract the hardware.
  • the hardware abstraction layer is an interface layer of the operating system.
  • the library is used to store the files of Android system applications and third-party applications. Through this library, it is convenient for one application to call some functions of other applications.
  • the phone dialing application calls the video CRBT application after initiating the call to obtain the interaction data from the IT domain, and further optionally, obtains the CRBT from the IT domain.
  • the interaction data can be displayed while the media resources are being played through the cooperation between the telephone dialing application and the video CRBT application.
  • FIG. 5 is a schematic structural diagram of a terminal provided by an embodiment of the present application.
  • the terminal can be used to execute the media resource playing method on the calling terminal side in the following embodiments.
  • the terminal 500 includes:
  • the terminal 500 may include a radio frequency (RF) circuit 501, a memory 502 including one or more computer-readable storage media, an input unit 503, a display unit 504, an audio circuit 505, a wireless fidelity (WiFi) module 506, a processor 507 having one or more processing cores, a power supply 508, and other components.
  • the terminal structure shown in FIG. 5 does not constitute a limitation on the terminal, which may include more or fewer components than shown, combine some components, or arrange the components differently. Specifically:
  • the RF circuit 501 can be used to receive and send signals while information is being sent and received or during a call. In particular, after receiving downlink information from the base station, it hands the information to one or more processors 507 for processing; in addition, it sends uplink data to the base station.
  • the RF circuit 501 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM) card, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, etc.
  • the RF circuit 501 can also communicate with the network and other devices through wireless communication.
  • the wireless communication can use any communication standard or protocol, including but not limited to global system of mobile communication (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), long term evolution (LTE), email, short message service (SMS), and so on.
  • the memory 502 can be used to store software programs and modules, and the processor 507 executes various functional applications and data processing by running the software programs and modules stored in the memory 502 .
  • the memory 502 may mainly include a stored program area and a stored data area, wherein the stored program area may store an operating system, an application required for at least one function (such as a sound playback function, an image playback function, etc.), and the like; the stored data area may store data created through use of the terminal 500 (such as audio data, phone book, etc.), and the like.
  • memory 502 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
  • the memory 502 may also include a memory controller to provide access to the memory 502 by the processor 507 and the input unit 503 .
  • the memory 502 is further configured to store at least one of the following acquired by the terminal in the embodiment of the present application: the display data of the interactive controls, the interactive content data, the second media resource, and the first media resource.
  • the input unit 503 may be used to receive input numerical or character information, and generate keyboard, mouse, joystick, optical or trackball signal input related to user settings and function control.
  • the input unit 503 may include a touch-sensitive surface 5031 as well as other input devices 5032.
  • the touch-sensitive surface 5031, also known as a touch display or a trackpad, collects the user's touch operations on or near it (such as operations performed by the user on or near the touch-sensitive surface 5031 with a finger, a stylus, or any other suitable object or accessory), and drives the corresponding connection device according to a preset program.
  • the touch-sensitive surface 5031 may include two parts, a touch detection device and a touch controller.
  • the touch detection device detects the user's touch orientation, detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, and then sends the coordinates to the processor 507.
  • the touch-sensitive surface 5031 may be implemented using various types of resistive, capacitive, infrared, and surface acoustic waves.
  • the input unit 503 may also include other input devices 5032.
  • other input devices 5032 may include, but are not limited to, one or more of physical keyboards, function keys (such as volume control controls, switch controls, etc.), trackballs, mice, joysticks, and the like.
  • the above-mentioned input unit 503 is configured to receive a signal triggered by a user's operation on the input unit 503 and transmit the signal to the corresponding controller. For example, in this embodiment of the present application, a user can perform touch operations on the input unit 503 to select media resources for playback and to operate the interactive controls.
  • the display unit 504 may be used to display information input by the user or information provided to the user and various graphical user interfaces of the terminal 500, which may be composed of graphics, text, icons, videos, and any combination thereof.
  • the display unit 504 may include a display panel 5041, and optionally, the display panel 5041 may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like. Further, the touch-sensitive surface 5031 can cover the display panel 5041.
  • when the touch-sensitive surface 5031 detects a touch operation on or near it, it transmits the operation to the processor 507 to determine the type of the touch event, and the processor 507 then provides a corresponding visual output on the display panel 5041 according to the type of the touch event.
  • the above-mentioned display unit 504 can display at least one of the playback screen of the second media resource and the first media resource.
  • although the touch-sensitive surface 5031 and the display panel 5041 are implemented as two separate components to realize the input and output functions, in some embodiments, the touch-sensitive surface 5031 and the display panel 5041 may be integrated to realize the input and output functions.
  • the audio circuit 505 , the speaker 5051 , and the microphone 5052 can provide an audio interface between the user and the terminal 500 .
  • on the one hand, the audio circuit 505 can convert received audio data into an electrical signal and transmit it to the speaker 5051, which converts it into a sound signal for output; on the other hand, the microphone 5052 converts the collected sound signal into an electrical signal, which the audio circuit 505 receives and converts into audio data. The audio data is then output to the processor 507 for processing and sent, for example, to another terminal through the RF circuit 501, or output to the memory 502 for further processing.
  • the audio circuit 505 may also include an earphone jack to provide communication between peripheral headphones and the terminal 500 .
  • the audio circuit 505 and the speaker 5051 can implement the audio playback-related process on the terminal side. For example, when any media resource is played, the terminal uses the audio circuit 505 and the speaker 5051 to process and output the audio data.
  • WiFi is a short-distance wireless transmission technology
  • the terminal 500 can help users to send and receive emails, browse web pages, access streaming media, etc. through the WiFi module 506, which provides users with wireless broadband Internet access.
  • although FIG. 5 shows the WiFi module 506, it can be understood that the module is not a necessary part of the terminal 500 and can be omitted as required without changing the essence of the invention.
  • the processor 507 is the control center of the terminal 500, using various interfaces and lines to connect various parts of the entire mobile phone, by running or executing the software programs and/or modules stored in the memory 502, and calling the data stored in the memory 502, Execute various functions of the terminal 500 and process data, so as to monitor the mobile phone as a whole.
  • the processor 507 may include one or more processing cores; optionally, the processor 507 may integrate an application processor and a modem processor, wherein the application processor mainly handles the operating system, user interface, and applications, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the processor 507.
  • the terminal 500 also includes a power supply 508 (such as a battery) for supplying power to various components.
  • the power supply can be logically connected to the processor 507 through a power management system, so that functions such as managing charging, discharging, and power consumption management are implemented through the power management system.
  • the power source 508 may also include one or more DC or AC power sources, recharging systems, power failure detection circuits, power converters or inverters, power status indicators, and any other components.
  • the terminal 500 may also include a camera, a Bluetooth module, and the like, which will not be repeated here.
  • FIG. 6 is a schematic structural diagram of a server provided by an embodiment of the present application.
  • the server 600 may vary considerably depending on its configuration or performance, and may include one or more processors 601 and one or more memories 602, wherein at least one piece of program code is stored in the memory 602, and the at least one piece of program code is loaded and executed by the processor 601 to implement the media resource playback method executed by the second media server in each of the above method embodiments.
  • the server 600 may also have components such as wired or wireless network interfaces, keyboards, and input/output interfaces for input and output, and the server 600 may also include other components for implementing device functions, which will not be described here.
  • the processor may be any processor such as a central processing unit (CPU), a graphics processing unit (GPU), a tensor processing unit (TPU), a neural network processing unit (NPU), a brain processing unit (BPU), a deep learning processing unit (DPU), a holographic processing unit (HPU), a vector processing unit (VPU), or an intelligence processing unit (IPU).
  • the processor 601 may adopt a general-purpose CPU, a microprocessor, an application specific integrated circuit (ASIC), a GPU, or one or more integrated circuits, for executing relevant programs, so as to implement the above-mentioned method for playing media resources.
  • the processor 601 may also be an integrated circuit chip, which has signal processing capability. In the implementation process, each step of the media resource playing method of the present application can be completed by a hardware integrated logic circuit in the processor 601 or a program code in the form of software.
  • the above-mentioned processor 601 can also be a general-purpose processor, a digital signal processor (DSP), an ASIC, a field programmable gate array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
  • the methods, steps, and logic block diagrams disclosed in the embodiments of this application can be implemented or executed.
  • a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • the steps of the method disclosed in conjunction with the embodiments of the present application may be directly embodied as executed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor.
  • software modules can be located in random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory or electrically erasable programmable memory, registers, and other storage media that are mature in the field.
  • the storage medium is located in the memory 602, and the processor 601 reads the information in the memory 602 and, in combination with its hardware, completes the functions required to be performed by the modules included in the second media server of the embodiment of the present application, or performs the media resource playback method on the second media server side in the method embodiments of the present application.
  • FIG. 7 is a flowchart of a method for playing media resources provided by an embodiment of the present application. In this process, if media resources exist in the CT domain for the current call, the CT domain media resources are played, and the interactive data of the IT domain is displayed on the playback screen of the CT domain media resources, wherein the display data of the interactive controls and the interactive data are acquired in batches. FIG. 7 includes the following steps.
  • the calling terminal sends a call request to the called terminal, and the call request passes through the first media server.
  • the call request is a video call request or an audio call request.
  • the first media server is a server located in the communication technology CT domain.
  • a call request is sent to the called terminal through a telephone dialing application of the calling terminal. It should be understood that the call request is transparently transmitted to the called terminal through the network device in the CT domain, and the call request will pass through the first media server in the CT domain.
  • the calling terminal sends a display data acquisition request to the second media server through the target application client, where the display data acquisition request carries relevant information of the call in which the calling terminal participates.
  • the target application client is an application client with the function of playing media resources in the ringing stage, such as a video CRBT application client, and the target application client has a built-in media player.
  • the second media server is a server located in the Internet technology IT domain.
  • the display data acquisition request is used to instruct to acquire the display data of the interactive control.
  • the related information of the call includes the calling number, the called number, the location information of the calling party, the type of the call, and/or the time of the call. It should be noted that this embodiment only uses the display data acquisition request to represent the request for acquiring display data in the IT domain, and the name of the display data acquisition request is not limited in this embodiment of the present application.
  • the interactive controls include at least one of a homepage access control (also called a content yellow page control), a like control, a comment control, a share control, and a download control.
  • the display data of the interactive control is also the rendering data of the interactive control.
  • FIG. 8 is a schematic diagram of playback of a media resource provided by an embodiment of the present application.
  • the playback page of FIG. 8 displays multiple interactive controls for content yellow pages, likes, comments, sharing, downloading, and more. It should be understood that in step 702, the media resource has not yet been played, and FIG. 8 is used as an example to explain the rendering of multiple interactive controls.
  • after the calling terminal sends a call request to the called terminal, it launches the target application client (also described as opening the target application client). The calling terminal's phone dialing application sends the relevant information of the call to the target application client, and after receiving that information, the target application client sends a display data acquisition request carrying the relevant information to the second media server through the IT domain.
  • the display data acquisition request is sent by using the HTTP protocol or the HTTPS protocol.
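The request described above can be pictured as an HTTPS POST whose body carries the call information. The sketch below only builds the request object and does not send it. The endpoint path `/display-data`, the JSON field names, and the function name are assumptions made for illustration; the application does not specify a wire format.

```python
import json
import urllib.request


def build_display_data_request(base_url, calling, called, call_type,
                               location=None, call_time=None):
    """Build (but do not send) a display data acquisition request.

    Field names and the endpoint path are hypothetical.
    """
    payload = {
        "callingNumber": calling,
        "calledNumber": called,
        "callType": call_type,  # e.g. "video" or "audio"
    }
    # Optional call information is included only when it is available.
    if location is not None:
        payload["callerLocation"] = location
    if call_time is not None:
        payload["callTime"] = call_time

    return urllib.request.Request(
        url=f"{base_url}/display-data",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

A caller could pass the returned object to `urllib.request.urlopen` to actually send it; using an `https://` base URL corresponds to the HTTPS option mentioned above.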
  • the second media server receives a display data acquisition request sent by the calling terminal through the target application client.
  • after receiving the display data acquisition request sent by the calling terminal through the target application client, the second media server reads the relevant information of the call from the corresponding field of the display data acquisition request.
  • the second media server performs legality authentication and functional service authentication on the user based on the relevant information of the call. If the authentication is passed, step 705 is performed.
  • the authentication includes authentication of user function information (also referred to as validity authentication), and optionally, the authentication further includes authentication of user identity information.
  • User function information authentication refers to the verification of whether the user has enabled the function of displaying interactive controls.
  • User identity information authentication refers to the authentication of the user's identity, such as the verification of the called mobile phone number, to confirm whether the mobile phone number is a legal mobile phone number.
  • authenticating the user is authenticating at least one of the calling user and the called user.
  • the corresponding authentication process includes: after the second media server acquires the information carried in the display data acquisition request, it determines, according to the calling number in the information, whether the calling number is a legal number. If the calling number is legal, the server queries the subscription database. If the query shows that the calling number has subscribed to the interactive control display function, the user authentication passes and step 705 is performed. If the calling number is not found, the user authentication fails. The authentication process for the called user is the same as the above process.
  • the second media server also records the relevant information of the call, which can be used as a data reference for subsequent operations.
  • authenticating the user verifies at least one of the user's legitimacy and validity, which ensures that the interactive controls are displayed safely during subsequent media resource playback and improves the security and reliability of interactive control display in the IT domain.
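The two-stage check described above (legality of the number, then subscription lookup) can be sketched as follows. The number format rule, the in-memory `subscriptions` mapping standing in for the subscription database, and the feature key `"interactive_controls"` are all assumptions for illustration.

```python
import re

# Assumed legality rule: optional "+" followed by 5 to 15 digits.
LEGAL_NUMBER = re.compile(r"\+?\d{5,15}")


def authenticate(number: str, subscriptions: dict) -> bool:
    """Return True only if the number is legal and has subscribed to the
    interactive control display function."""
    # Identity check: reject numbers that are not well formed.
    if not LEGAL_NUMBER.fullmatch(number):
        return False
    # Function check: an unknown number or a missing subscription fails.
    return "interactive_controls" in subscriptions.get(number, set())
```

The same function can be applied to the called number, since the description states that the called user is authenticated in the same way.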
  • a session connection is established with the calling terminal, and the session connection is based on the HTTP protocol or the HTTPS protocol, so that in the subsequent process the session connection is used to exchange media-resource-related information and perform other interactions with the calling terminal.
  • the second media server sends the display data of the interactive control to the calling terminal.
  • the second media server or the database associated with the second media server stores the display data of the interactive controls, and the display data is rendering data to indicate the display form of the interactive controls, and the terminal can perform rendering based on the display data to display the interaction controls.
  • default display data includes default overlay styles for home page access controls, like controls, comment controls, share controls, and download controls.
  • the second media server in response to the display data acquisition request, acquires the display data of the interactive control from the server or an associated database, and sends the display data of the interactive control to the terminal.
  • the second media server can realize rapid transmission of display data in the form of data packets, which improves the efficiency of the calling terminal in acquiring the display data.
  • the target application client can also provide the user with a variety of overlay styles, and the user can select the desired overlay style through the target application client. The second media server can then determine, according to the setting information of the calling terminal, the overlay style set in advance by the user, query the server or the subscription database to obtain the display data of the interactive control corresponding to that overlay style, and send the display data of the interactive control to the terminal, so as to realize a personalized display effect.
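The choice between default and user-selected overlay styles can be sketched as a lookup with a fallback. The style names, the contents of `DISPLAY_DATA`, and the `settings` mapping (calling number to chosen style) are hypothetical; only the five control types are taken from the description.

```python
DEFAULT_STYLE = "default"

# Control types named in the description.
CONTROLS = ("homepage", "like", "comment", "share", "download")

# Hypothetical rendering data per overlay style.
DISPLAY_DATA = {
    "default": {"style": "default", "controls": CONTROLS, "opacity": 1.0},
    "minimal": {"style": "minimal", "controls": CONTROLS, "opacity": 0.6},
}


def select_display_data(settings: dict, calling_number: str) -> dict:
    """Return the display data for the style the user set in advance,
    falling back to the default style when none (or an unknown one) is set."""
    style = settings.get(calling_number, DEFAULT_STYLE)
    return DISPLAY_DATA.get(style, DISPLAY_DATA[DEFAULT_STYLE])
```

Falling back to the default keeps the controls displayable even when a user has never opened the settings page.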
  • the above steps 702 to 705 are the process of acquiring the display data of the interactive control after the calling terminal initiates a call.
  • the display data of the interactive control can be acquired in advance, so that when the first media resource is played later, the display data does not need to be acquired from the second media server again, which reduces the amount of data to acquire and reduces the display delay.
  • the description here takes as an example the case in which the second media server pushes the display data of the interactive control to the calling terminal.
  • the calling terminal and the second media server interact multiple times to obtain the display data of the interactive control.
  • the second media server sends the address information of the display data of the interactive control to the calling terminal in response to the display data acquisition request.
  • the address information of the display data refers to the uniform resource locator (URL) of the display data; a URL is also called a web address.
  • the calling terminal receives the address information of the display data of the interactive control, and obtains the display data of the interactive control from the second media server based on the received address information.
  • the second media server preloads the display data of the interactive control. Preloading reduces the delay when the interactive control is displayed later and improves the efficiency of acquiring the display data, so that the controls do not take longer to display when subsequent media resources are played.
  • the method for preloading the display data of the interactive control by the second media server includes any of the following:
  • the display data of the interactive control is stored on the hard disk of the second media server; after determining the address information of the display data of the interactive control, the second media server loads the display data from its hard disk into memory or cache.
  • the display data of the interactive control is stored on the hard disk or in the memory of a server other than the second media server; after determining the address information of the display data of the interactive control, the second media server preloads the display data from the data stored on that other server into the memory or cache of the second media server.
  • the display data of the interactive control is stored in the interactive database; after determining the address information of the display data of the interactive control, the second media server preloads the display data from the interactive database into the memory or cache of the second media server.
  • the interactive database may be a database associated with the second media server, or may be a database associated with other servers, and the embodiment of the present application does not limit the database.
  • the above three implementation manners correspond to different preloading processes for different storage locations of the display data of the interactive controls. If the second media server loaded the data only when the calling terminal requested the display data of the interactive control, a certain delay would be introduced. The preloading described above reduces the display delay of the interactive control and improves the efficiency of acquiring the display data, so that the controls do not take longer to display when subsequent media resources are played.
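  • The three preloading variants above differ only in where the data is read from. The sketch below models that with one cache and a loader per storage location. It is an assumption-laden illustration: the class `PreloadCache`, the location names, and the stand-in loaders are hypothetical, and real loaders would read from disk, another server, or a database.

```python
from typing import Callable, Dict


class PreloadCache:
    """In-memory cache standing in for the second media server's memory or cache."""

    def __init__(self, loaders: Dict[str, Callable[[str], bytes]]) -> None:
        self._loaders = loaders
        self._cache: Dict[str, bytes] = {}
        self.loads = 0  # number of reads from slow storage

    def preload(self, location: str, address: str) -> None:
        """Load the data at `address` from `location` unless it is already cached."""
        if address not in self._cache:
            self._cache[address] = self._loaders[location](address)
            self.loads += 1

    def get(self, location: str, address: str) -> bytes:
        """Serve from the cache; falls back to loading if nothing was preloaded."""
        self.preload(location, address)
        return self._cache[address]

    def release(self, address: str) -> None:
        """Drop preloaded data to free memory."""
        self._cache.pop(address, None)


# Stand-in loaders for the three storage locations (hypothetical names).
LOADERS = {
    "local_disk": lambda address: b"disk:" + address.encode(),
    "other_server": lambda address: b"remote:" + address.encode(),
    "interactive_db": lambda address: b"db:" + address.encode(),
}
```

  • With this shape, a request that arrives after `preload` is served from memory, which is the delay reduction the text describes.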
  • the calling terminal receives the display data of the interactive control.
  • the display data of the interactive control is in the form of a data packet, and after receiving the data packet, the calling terminal obtains the display data of the interactive control by parsing the data packet.
  • the calling terminal receives the first media negotiation message sent by the first media server, where the first media negotiation message is used for media negotiation.
  • the first media negotiation message is used for media negotiation between the first media server and the calling terminal, where the first media server is located in the CT domain.
  • the first media negotiation message is an Update message
  • the first media negotiation message carries media capability information of the first media server, that is, the first media negotiation message is a SIP message that carries SDP information of the first media server.
  • the first media negotiation message is used for media negotiation of early media between the first media server and the calling terminal. If the calling terminal receives the first media negotiation message, it determines that there are media resources to be played in the CT domain for the current call, and continues to perform subsequent steps.
  • the calling terminal obtains resource information of the first media resource from the first media negotiation message.
  • the first media resource is a media resource in the CT domain, such as a video CRBT, video advertisement, video customer service and other media resources.
  • the resource information is any kind of information used to indicate the first media resource, for example, a resource ID (identification number) of the first media resource.
  • the resource information is further used to indicate operator information (operator identifier) of the first media resource.
  • the resource information of the first media resource is acquired from the first media negotiation message sent by the first media server.
  • the first media negotiation message is an Update message
  • the resource information is located in the session description protocol SDP information of the Update message.
  • FIG. 9 is a schematic diagram of an Update message carrying resource information according to an embodiment of the present application.
  • the SDP information in the Update message carries resource information.
  • 01 is used to represent operator A
  • 02 is used to represent operator B
  • 03 is used to represent operator C.
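  • As a rough illustration of reading resource information out of the SDP of an Update message, the sketch below parses a custom attribute line. The attribute name `a=x-resource-info`, its `operator=…;id=…` layout, and the sample resource ID are assumptions made for this example only; the actual encoding is the one shown in FIG. 9, which is not reproduced here. Only the operator codes 01, 02, and 03 come from the text.

```python
import re
from typing import Dict, Optional

# Operator codes as listed in the text.
OPERATORS = {"01": "operator A", "02": "operator B", "03": "operator C"}

# Hypothetical attribute line, e.g. "a=x-resource-info:operator=01;id=ring12345".
_RESOURCE_INFO = re.compile(
    r"^a=x-resource-info:operator=(\d{2});id=(\w+)\s*$", re.MULTILINE
)


def parse_resource_info(sdp: str) -> Optional[Dict[str, str]]:
    """Return operator and resource ID from the SDP body, or None if absent."""
    match = _RESOURCE_INFO.search(sdp)
    if match is None:
        return None
    code, resource_id = match.groups()
    return {
        "operator_code": code,
        "operator": OPERATORS.get(code, "unknown"),
        "resource_id": resource_id,
    }


SAMPLE_SDP = (
    "v=0\r\n"
    "o=- 0 0 IN IP4 192.0.2.1\r\n"
    "s=-\r\n"
    "c=IN IP4 192.0.2.1\r\n"
    "t=0 0\r\n"
    "m=video 49170 RTP/AVP 96\r\n"
    "a=rtpmap:96 H264/90000\r\n"
    "a=x-resource-info:operator=01;id=ring12345\r\n"
)
```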
  • the calling terminal receives, through the target application client, the interactive content data of the first media resource from the second media server based on the resource information of the first media resource.
  • the interactive content data is used to display the interactive content based on the first media resource.
  • the interactive content data includes at least one of homepage visit data, like data, comment data, sharing data, and download data. FIG. 8 shows like data of 174.5w, comment data of 5.9w, and share data of 2.5w.
  • the interactive content data also includes relevant data of each interactive control.
  • for the like control, the corresponding relevant data is the like data, which includes the liking account, the avatar of the liking account, and so on.
  • for the comment control, the corresponding relevant data is the comment data, which includes the commenting account, the comment content, and so on.
  • for the download control, the corresponding relevant data is the download data, which includes the download service link.
  • the interactive content data is bullet screen data or animation special effect data, etc., so that the bullet screen or animation special effects are displayed on the playing screen of the calling terminal for the first media resource, which supplements the interactive display effect in the CT domain and improves the performance of the interactive display.
  • the interaction data also includes links to other media resources for the user to select.
  • the calling terminal sends an interaction data acquisition request to the second media server through the target application client, where the interaction data acquisition request carries resource information of the first media resource.
  • the second media server determines the interactive content data of the first media resource based on the resource information. For example, the resource information of the first media resource is read from the corresponding field of the interaction data acquisition request, and the interactive content data of the first media resource is queried based on the resource information.
  • the second media server returns the interactive content data to the calling terminal, so that the calling terminal receives the interactive content data.
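  • The request and response exchange above can be sketched from the server side as a lookup keyed by resource information. The field names, the store `INTERACTIVE_CONTENT`, and the counts are hypothetical and purely illustrative.

```python
# Hypothetical store on the second media server: resource ID -> interactive content.
INTERACTIVE_CONTENT = {
    "ring12345": {"likes": 120, "comments": 8, "shares": 3},
}


def handle_interaction_data_request(request: dict) -> dict:
    """Read the resource information from the request and look up its content."""
    resource_id = request.get("resource_id")
    data = INTERACTIVE_CONTENT.get(resource_id)
    if data is None:
        return {"status": "not_found", "resource_id": resource_id}
    # Return a copy so that callers cannot mutate the server-side store.
    return {"status": "ok", "resource_id": resource_id, "interactive_content": dict(data)}
```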
  • the process of acquiring the interactive content data through interaction between the calling terminal and the second media server is performed based on the address information of the interactive content data. For the process, refer to the above process description of the display data of the interactive control.
  • the embodiment of the present application takes the case in which the interaction data is the interactive content data as an example; steps 701 to 709 above describe first obtaining the display data of the interactive control and then obtaining the interactive content data.
  • the display data of the interactive control can be obtained in advance, and the interactive content data obtained afterwards, so that when the first media resource is played later it is no longer necessary to obtain the display data of the interactive control from the second media server, which reduces the amount of data obtained and the display delay.
  • the interaction data includes interaction content data and display data of interaction controls.
  • when step 709 is executed to obtain data from the second media server according to the resource information of the first media resource, the interactive content data of the first media resource and the display data of the interactive controls can be obtained in a single acquisition, so that interaction data including both is obtained without performing steps 702 to 706. That is, steps 702 to 709 are replaced with the following process: the calling terminal determines the resource information of the first media resource provided by the first media server (see steps 707 and 708 for the process).
  • the calling terminal receives the interactive content data of the first media resource and the display data of the interactive control from the second media server through the target application client based on the resource information of the first media resource.
  • the calling terminal sends a second media negotiation message to the first media server, where the second media negotiation message is used for media negotiation.
  • the second media negotiation message is a 200OK (UPDATE) message
  • the 200OK (UPDATE) message carries media capability information of the calling terminal, that is, the media negotiation result between the calling terminal and the first media server.
  • the execution order of step 709 and step 710 is not limited by the current sequence numbers. The two steps may be executed synchronously, or step 710 and subsequent steps may be executed first and step 709 afterwards. That is, the process of obtaining interactive content data in the IT domain and the process of media negotiation in the CT domain do not affect each other.
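  • Because the two processes are independent, a client could run them concurrently. The sketch below shows one way to do that with `asyncio.gather`; the two coroutines are placeholders that stand in for the IT-domain data fetch and the CT-domain media negotiation, and their names and return values are hypothetical.

```python
import asyncio


async def fetch_interactive_content(resource_id: str) -> dict:
    """Placeholder for step 709: fetch interactive content from the IT domain."""
    await asyncio.sleep(0)  # stands in for network I/O
    return {"resource_id": resource_id, "likes": 10}


async def negotiate_media(offer: str) -> dict:
    """Placeholder for step 710: answer the media negotiation in the CT domain."""
    await asyncio.sleep(0)  # stands in for SIP signalling
    return {"answer_for": offer}


async def run_both(resource_id: str, offer: str):
    """Run both steps concurrently; neither waits for the other to start."""
    return await asyncio.gather(
        fetch_interactive_content(resource_id),
        negotiate_media(offer),
    )
```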
  • the first media server sends the first media negotiation message to the calling terminal, where the first media negotiation message carries the media capability information of the first media server; for example, the first media negotiation message is an update or 18* message. After receiving the first media negotiation message, the calling terminal determines the media negotiation result based on its own capability and sends a second media negotiation message to the first media server. The second media negotiation message carries the media negotiation result, which may also be called the media capability information of the calling terminal; for example, the second media negotiation message is a 200OK message.
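  • Determining a negotiation result from the server's capabilities and the terminal's own capability can be pictured as keeping the offered formats that the terminal also supports. The sketch below is deliberately simplified and is not a full SDP offer/answer implementation; the function name and the codec names in the usage are illustrative assumptions.

```python
from typing import Iterable, List, Set


def negotiate(offered: Iterable[str], supported: Set[str]) -> List[str]:
    """Keep the offered formats the terminal supports, in the offerer's order."""
    return [fmt for fmt in offered if fmt in supported]
```

  • For example, `negotiate(["H265", "H264", "VP8"], {"H264", "VP8"})` keeps `H264` and `VP8` in the order they were offered; an empty result would mean that no common format exists.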
  • the first media server sends a 180 ringing message to the calling terminal.
  • the ringing message is a 180 ringing message based on the SIP protocol, and the ringing message is used to indicate that the called terminal is ringing.
  • the first media server sends the media stream of the first media resource to the calling terminal.
  • the first media server sends the media stream of the first media resource (for example, an RTP stream) to the calling terminal.
  • the calling terminal does not actively acquire the first media resource; instead, the first media server pushes the media stream.
  • the calling terminal receives the media stream of the first media resource.
  • the resource information obtained from the first media negotiation message in step 708 is used as an example for description.
  • the calling terminal obtains the resource information through the media stream; that is, the calling terminal receives the media stream of the first media resource and obtains the resource information from the header of the media stream of the first media resource transmitted by the first media server.
  • the resource information is located in the supplemental enhancement information (SEI) in the header of the media stream.
  • SEI Supplemental enhancement information
  • FIG. 10 is a schematic diagram of a media stream carrying resource information provided by an embodiment of the present application.
  • a ringtone ID is carried in a custom frame of the SEI.
  • the custom frame of the SEI also carries operator information.
  • the resource information may be carried by either the first media negotiation message or the media stream, or both the first media negotiation message and the media stream may be used to carry the resource information.
  • if the resource information carried by one of them cannot be read, the resource information carried by the other can be read instead, which ensures that the resource information is acquired.
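  • The fallback between the two carriers can be sketched as trying each source in turn and using the first one that yields a value. The function name, the source labels, and the choice of exceptions treated as a read failure are assumptions for illustration.

```python
from typing import Callable, Iterable, Optional, Tuple

Reader = Callable[[], Optional[str]]


def read_resource_info(sources: Iterable[Tuple[str, Reader]]) -> Optional[Tuple[str, str]]:
    """Return (source name, resource info) from the first source that can be read."""
    for name, read in sources:
        try:
            value = read()
        except (KeyError, ValueError):
            continue  # this carrier could not be read; try the next one
        if value:
            return name, value
    return None
```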
  • the calling terminal receives the 180 ringing message, plays the first media resource sent by the first media server based on the received media stream, and displays the interactive controls and the corresponding interactive content data on the playback screen of the first media resource based on the received display data of the interactive controls and the interactive content data.
  • the calling terminal superimposes the interactive controls and the corresponding interactive content data on the playback screen of the first media resource. For example, based on the display data of the interactive controls and the interactive content data, the target application client renders an interactive layer on the playback screen, where the interactive layer includes the interactive controls and the corresponding interactive content data, such as a like control and the number of likes.
  • the technical solutions provided in the embodiments of the present application realize the supplementation of the media resources in the CT domain.
  • the interaction data of the first media resource is obtained through the second media server in the IT domain. While playing the first media resource, the calling terminal can then add an overlay layer on the interface to display the interactive controls and the corresponding interactive content data, such as interactive buttons, bullet screens, and animation effects. This enriches the experience of CRBT users and makes the video CRBT service more interesting.
  • the video CRBT service can be expanded more flexibly, for example with video overlay playback, video interface interaction, and scrolling between video content, making the video CRBT service more interesting.
  • the embodiment shown in FIG. 7 above is described using the example of playing the media resources of the CT domain and adding an overlay layer on the playback screen to display interaction-related data. The calling terminal can also play the media resources of the IT domain and the media resources of the CT domain at the same time.
  • the following describes the media resource playback method in conjunction with the process shown in FIG. 11. Referring to FIG. 11, the process includes:
  • the calling terminal sends a call request to the called terminal, and the call request passes through the first media server.
  • for step 1101, refer to step 701.
  • the calling terminal determines resource information of the first media resource provided by the first media server.
  • for step 1102, refer to step 708 or step 713.
  • the calling terminal obtains the interactive content data of the first media resource and the display data of the interactive control from the second media server through the target application client.
  • for step 1103, refer to the acquisition process of the interactive content data of the first media resource and the display data of the interactive control in the embodiment shown in FIG. 7.
  • the calling terminal receives the second media resource from the second media server through the target application client.
  • the second media server is a server in the IT domain, and the second media resource is a media resource in the IT domain.
  • the second media resource is a pushed video advertisement, video animation, etc. This embodiment can realize the playback of the IT domain video in the ringing stage, which enriches the viewing experience of the video CRBT user.
  • the calling terminal sends a resource acquisition request through the target application client, and after receiving the resource acquisition request, the second media server sends the second media resource to the calling terminal; for example, the second media server sends the media stream of the second media resource to the calling terminal in the form of RTMP streaming.
  • the second media server actively pushes the media stream, rather than the calling terminal's acquisition based on address information.
  • the second media server can provide the user with multiple second media resources, and then the user can select the second media resource that he wants to play through the target application client.
  • the second media server sends a preset second media resource to the calling terminal to achieve operational purposes such as advertisement push.
  • in step 1104, the calling terminal sends the relevant information of the call to the second media server through the target application client, and the second media server receives and authenticates the relevant information, for example by authenticating at least one of the calling number and the called number in the relevant information.
  • the authentication includes legality authentication and functional service authentication as in step 704.
  • the second media server determines the address information of the second media resource (for example, the second media resource that matches the relevant information) and sends the address information of the second media resource to the calling terminal. After receiving the address information, the calling terminal can obtain the second media resource from the second media server based on it, or can obtain the second media resource after the ringing message is received in step 1105.
  • the second media server preloads the second media resource.
  • the second media server releases the preloaded second media resource to save memory space.
  • the calling terminal further obtains the interactive content data of the second media resource from the second media server through the target application client.
  • the acquisition process includes: after the second media server determines the second media resource, determines the interactive content data corresponding to the second media resource, and sends the address information of the interactive content data corresponding to the second media resource to the calling terminal.
  • after receiving the address information of the interactive content data corresponding to the second media resource, the calling terminal optionally obtains the interactive content data of the second media resource from the second media server based on that address information, or obtains the interactive content data of the second media resource after the ringing message is received in step 1105.
  • the second media server preloads the interactive content data of the second media resource.
  • the second media server releases the preloaded interactive content data of the second media resource to save memory space.
  • the calling terminal further obtains the display data of the interactive controls of the second media resource from the second media server through the target application client. For this process, refer to the above-mentioned process of acquiring the interactive content data of the second media resource. It should be noted that the calling terminal may also skip acquiring this display data and instead display the interactive controls based on the display data of the interactive controls that it has already acquired.
  • after receiving the ringing message of the called terminal, the calling terminal plays the first media resource sent by the first media server, displays the interactive controls and interactive content data on the playback screen of the first media resource, and plays the second media resource.
  • the interactive controls and the interactive content data of the second media resource are also displayed, and the display mode refers to the display mode of the interactive controls and the interactive content data of the first media resource.
  • the above-mentioned playing of the first media resource and playing of the second media resource are implemented by the calling terminal according to the setting information.
  • step 1105 is executed, and if the setting information indicates to play the media resources of the CT domain, the relevant acquisition steps may not be performed in the preceding steps.
  • This method brings new experience and more personalized choices to users, and improves users' experience and participation in the video CRBT service.
  • the calling terminal when playing the first media resource and the second media resource, the calling terminal may also adopt the following play mode:
  • the calling terminal plays the CT domain media resource in full screen and plays the IT domain media resource in a floating window; that is, the first media resource is played in full-screen mode, and on the playback screen of the first media resource the second media resource is played muted in a floating window.
  • FIG. 12 is a schematic diagram of playback of a media resource provided by an embodiment of the present application. FIG. 12 illustrates the solution by taking full-screen playback of the first media resource and floating window playback of the second media resource as an example.
  • the calling terminal plays the IT domain media resource in full screen and plays the CT domain media resource in a floating window; that is, the second media resource is played in full-screen mode, and on the playback screen of the second media resource the first media resource is played muted in a floating window.
  • one media resource is played in full screen and the other media resource is played in a floating window, so that the two media resources can be displayed clearly and intuitively. Playing the full-screen media resource with sound and the floating window muted allows both media resources to be displayed normally and avoids the poor user experience caused by playing two pieces of audio at the same time.
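  • The audio rule above (full screen with sound, floating window muted) can be sketched as a small planning function. The `Playback` record and the labels `"first"` and `"second"` for the CT-domain and IT-domain resources are hypothetical names chosen for this illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Playback:
    resource: str  # "first" (CT domain) or "second" (IT domain)
    mode: str      # "fullscreen" or "floating"
    muted: bool


def plan_playback(fullscreen_resource: str) -> List[Playback]:
    """Play one resource full screen with sound and the other muted in a floating window."""
    if fullscreen_resource not in ("first", "second"):
        raise ValueError("fullscreen_resource must be 'first' or 'second'")
    other = "second" if fullscreen_resource == "first" else "first"
    return [
        Playback(fullscreen_resource, "fullscreen", muted=False),
        Playback(other, "floating", muted=True),
    ]
```

  • Exactly one entry in the plan is unmuted, which is what prevents two audio tracks from playing at once.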
  • the process for the calling terminal to perform corresponding processing based on a click operation on the floating window includes any of the following:
  • the calling terminal when the calling terminal detects a click operation on the floating window, it switches the media resource played by the floating window to full-screen mode playback, and closes another media resource.
  • the calling user when the calling user is watching the media resource, he can click the floating window corresponding to the media resource he is interested in, use full-screen playback to play the media resource, and close the other media resource.
  • the media resources that the calling user wants to watch can be reserved, which can bring a better viewing experience to the calling user.
  • if the calling terminal detects a click operation on the floating window, the media resource played by the floating window is switched to full-screen playback, and the other media resource is switched to floating window playback.
  • the calling terminal plays the clicked media resource in full screen and switches the other media resource to the floating window, so that the user can still switch back to the other media resource to watch it.
  • the calling terminal can also play the first media resource in the form of a floating window, and play the second media resource in the form of a floating window.
  • FIG. 13 is a schematic diagram of playing a media resource provided by an embodiment of the present application. Referring to FIG. 13 , the first media resource and the second media resource are played in different floating windows respectively. In this implementation manner, two media resources are played in the floating window respectively, and the two media resources can also be displayed clearly and intuitively. Optionally, at least one media resource among the first media resource and the second media resource is played muted. In this process, since there is at least one media resource that is played silently, auditory confusion caused by the simultaneous playing of two pieces of audio is also avoided, and user experience is improved.
  • the playback of the second media resource is implemented by a player of the target application client (video color ringtone application), and the playback of the first media resource is implemented by the player of the system application (telephone dialing application).
  • the process that the calling terminal performs corresponding processing based on the click operation on the floating window includes any of the following:
  • when both media resources are played in floating window mode and the calling terminal detects a click operation on either floating window, the media resource played by that floating window is switched to full-screen playback and the other media resource is closed.
  • the calling user when the calling user is watching the media resource, he can click the floating window corresponding to the media resource he is interested in, use full-screen playback to play the media resource, and close the other media resource.
  • the media resources that the calling user wants to watch can be reserved, which can bring a better viewing experience to the calling user.
  • the calling terminal detects a click operation on the floating window, it switches the media resource played by the floating window to full-screen mode for playback, and maintains the floating window mode of another media resource.
  • the calling terminal plays the clicked media resource in full screen and keeps the other media resource in its floating window, so that the user retains an entry for switching back to the other media resource, which provides a richer operating experience.
  • the click operation on the floating window refers to the click operation on any position in the floating window except the close button.
  • each floating window has a close button, and the user can close a media resource that they do not want to continue playing by clicking the close button, so as to achieve personalized selection and playback. That is, if the calling terminal detects a click operation on the close button of a floating window, the floating window is closed.
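  • The click handling described for floating windows can be summarized as a small state transition. In the sketch below, the window state is a mapping from a resource name to `"fullscreen"` or `"floating"`; the function name and the policy names `"close_other"` and `"float_other"` are hypothetical labels for the two alternatives in the text (closing the other resource, or leaving or placing it in a floating window).

```python
from typing import Dict


def on_click(windows: Dict[str, str], target: str,
             on_close_button: bool, policy: str = "close_other") -> Dict[str, str]:
    """Return the window layout after a click on the floating window of `target`."""
    if policy not in ("close_other", "float_other"):
        raise ValueError("unknown policy: " + policy)
    result = dict(windows)
    if result.get(target) != "floating":
        return result  # clicks are only handled on floating windows
    if on_close_button:
        del result[target]  # the close button closes that floating window
        return result
    for name in list(result):
        if name == target:
            continue
        if policy == "close_other":
            del result[name]
        else:
            result[name] = "floating"
    result[target] = "fullscreen"
    return result
```

  • Keeping the policy as a parameter mirrors the text, where the behavior after a click is one of several alternative implementations rather than a single fixed rule.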
  • the interactive controls and interactive content data corresponding to the media resource played in full screen are superimposed on its playback screen, while the floating window of the other media resource displays only the playback screen of that media resource. In this process, the interactive controls and interaction data are superimposed only on the media resource played in full screen, which keeps the playback interface clean and intuitive and makes it easier for the user to interact with or operate the terminal.
  • with the technical solution provided by this application, through the interaction of the calling terminal with the first media server and the second media server, the media resources in the CT domain and the media resources in the IT domain can be played simultaneously when both exist, which brings new experiences and more options to users and improves user experience and participation in the video CRBT service.
  • FIG. 7 and FIG. 11 above describe the playback of media resources and the display of interactive controls during playback.
  • an interactive function based on the media resources can also be provided.
  • FIG. 14 is a flowchart of an interaction method based on media resource playback provided by an embodiment of the present application, referring to FIG. 14 :
  • if the calling terminal detects a triggering operation on any interactive control, it sends an interaction request corresponding to that interactive control to the second media server, where the interaction request is used to realize interaction based on the first media resource.
  • the interaction request carries the interaction object and the interaction content.
  • the interactive object refers to the interactive data corresponding to the interactive control, and the interactive content refers to the updated content of the interactive object.
  • the interactive object is like data, and the interactive content is the like data plus one.
  • the interaction request also carries resource information of the first media resource, so that the second media server processes the interaction content corresponding to the first media resource based on the resource information.
  • the second media server receives the interaction request sent by the calling terminal, and performs processing based on the first media resource.
  • the second media server determines the interaction object corresponding to the first media resource according to the resource identifier carried in the interaction request, and then processes the interactive content carried in the interaction request. For example, if the interaction object carried in the interaction request is the like data and the interactive content is the number of likes plus one, the second media server adds one to the stored like data of the first media resource, and the like data displayed on the calling terminal is incremented by one.
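  • The like example above can be sketched as a request handler on the second media server. The request fields `resource_id`, `object`, and `content`, the `"+1"` encoding of "plus one", and the response shape are hypothetical; only the behavior (increment the stored like data and let the terminal show the new value) follows the text.

```python
from typing import Dict


def handle_interaction_request(likes: Dict[str, int], request: dict) -> dict:
    """Apply a 'like plus one' interaction to the resource named in the request."""
    resource_id = request["resource_id"]
    if request.get("object") == "like" and request.get("content") == "+1":
        likes[resource_id] = likes.get(resource_id, 0) + 1
        # The returned value lets the calling terminal update its displayed count.
        return {"status": "ok", "object": "like", "value": likes[resource_id]}
    return {"status": "unsupported"}
```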
  • the type of the media transport stream used between the calling terminal and the second media server is HTML stream.
  • the calling terminal sends an HTTP message or an HTTPS message to the second media server, and the second media server returns an HTTP response or an HTTPS response to the calling terminal.
  • Figure 15 illustrates the solution by taking playing the first media resource as an example:
  • the off-hook message is used to indicate that the called terminal has been off-hook, and the calling user and the called user start talking.
  • the target application client controls the player to stop displaying.
  • the calling terminal stops playing the first media resource, and displays a stop screen of the first media resource.
  • the stop picture is a preset picture (for example, an entry picture of a target application client) or a media picture (such as a screenshot) corresponding to the moment when the playback is stopped.
  • the calling user can realize the function of retaining the screen display of the first media resource during the call by setting it on the calling terminal, that is, the setting information of the calling terminal determines that the display of the media resource is maintained during the call.
  • the screen display can be still retained during the call, which provides the user with an entry for further operations and improves the flexibility of media playback.
  • FIG. 16 is a schematic diagram of playing a media resource provided by an embodiment of the present application.
  • the first media resource is displayed in the form of a floating window on the call interface.
  • the calling terminal also supports the display state setting after the call ends, and the corresponding process is as follows:
  • FIG. 17 is a schematic diagram of playing a media resource provided by an embodiment of the present application. Referring to FIG. 17 , after the call ends, the first media resource is displayed on the main interface of the calling terminal in the form of a floating window.
  • if the calling terminal detects a continue-playing operation on the stop screen, it interacts with the second media server to obtain a third media resource corresponding to the resource information of the first media resource, where the third media resource matches the first media resource.
  • the matching of the third media resource with the first media resource means that the third media resource is a media resource corresponding to resource information (eg, resource ID) of the first media resource. It should be understood that the media content of the third media resource is the same as that of the first media resource.
  • the calling terminal sends a resource acquisition request to the second media server, where the resource acquisition request carries resource information of the first media resource.
  • the second media server receives the resource acquisition request sent by the calling terminal, and based on the resource information, determines a third media resource corresponding to the resource information, where the third media resource matches the first media resource.
  • the second media server returns the address information of the third media resource to the calling terminal.
  • the calling terminal obtains the third media resource from the second media server based on the address information.
  • the second media server preloads the third media resource, so as to improve data transmission efficiency.
  • for the preloading process, refer to the above-mentioned preloading process of the display data of the interactive control.
  • a session is established between the second media server and the calling terminal, so as to perform interaction based on the session.
  • the calling terminal plays the third media resource.
  • the calling terminal plays the third media resource after receiving the third media resource sent by the second media server.
  • the third media resource is played based on the playback progress when the playback is stopped.
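The acquisition and resume steps above can be sketched as follows. The catalogue, the URL, and both function names are hypothetical; the only points taken from the text are that the request carries the resource information of the first media resource, that the server answers with address information, and that playback resumes from the progress at which it stopped.

```python
from dataclasses import dataclass


@dataclass
class ResourceAddress:
    resource_id: str
    url: str  # hypothetical address information returned by the server


# Hypothetical catalogue on the second media server: resource ID -> address.
CATALOGUE = {"res-001": "https://media.example.invalid/res-001.mp4"}


def handle_resource_acquisition_request(resource_id: str) -> ResourceAddress:
    """Server side: map the first resource's ID to the matching third resource."""
    return ResourceAddress(resource_id, CATALOGUE[resource_id])


def continue_playback(resource_id: str, stopped_at_seconds: float) -> dict:
    """Terminal side: request the address, then resume from the saved progress."""
    address = handle_resource_acquisition_request(resource_id)
    return {"url": address.url, "start_at": stopped_at_seconds}


playback = continue_playback("res-001", stopped_at_seconds=12.5)
```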
  • steps 1503 to 1504 trigger the acquisition of the third media resource based on the continue-playing operation on the stop screen. If the calling user wants to continue watching the first media resource, clicking the floating window is enough to continue playback, which is convenient and simple.
  • the process of acquiring the third media resource may also start after the calling terminal receives the off-hook message. That is, during the user's call, the second media server returns the address information of the third media resource, and the third media resource is acquired after the continue-playing operation is detected.
  • in this way, the third media resource can be queried as soon as the call is connected, which prepares for its subsequent playback, improves playback efficiency, and avoids a long delay before subsequent playback starts.
  • the target application client is closed, and a session end message is sent to the second media server, and the second media server receives the session end message of the calling terminal. Afterwards, the session with the calling terminal for the third media resource is released.
  • the stop screen serves as the portal website entrance of the target application client. If a click operation on the stop screen is detected, the portal website interface is displayed in the opened target application client; that is, the target application client is launched, and the portal website interface corresponding to the target application client is displayed in it.
  • if the calling user wants to access the portal website, clicking the stop screen is enough, which is convenient and simple.
  • if the calling user wants to watch other media resources, the user can browse them on the portal website interface and click the one to watch, and the calling terminal starts to play the media resource corresponding to the click operation. If the calling user wants to configure related services on the target application client, clicking the corresponding setting button on the portal website interface achieves this as well.
  • the calling terminal plays the third media resource in different ways depending on when the click operation on the stop screen occurs. In a possible implementation, if the calling terminal detects a click operation on the stop screen during the call, it plays the third media resource muted. In another possible implementation, if the calling terminal detects a click operation on the stop screen after the call ends, it cancels the mute mode of the first media resource and plays the third media resource normally.
  • the calling user can click the stop screen during the call or at any time after the call ends, and can obtain media resources with the same media content from the IT domain, providing a continuous and complete audio-visual experience.
  • the calling terminal can play in different ways depending on whether the calling user clicks during the call or after it. Playback is muted during the call so that the calling user can clearly hear the conversation and does not miss important content; after the call ends, playback is normal, that is, non-muted.
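The timing rule above reduces to a single decision on the call state at the moment of the click. The sketch below uses a hypothetical `CallState` enum and function name; it only illustrates the rule that playback is muted during the call and normal after the call ends.

```python
from enum import Enum


class CallState(Enum):
    IN_CALL = "in_call"   # the called party has answered and the call is ongoing
    ENDED = "ended"       # the call has ended


def should_mute(call_state: CallState) -> bool:
    """Return True when the third media resource should be played muted."""
    return call_state is CallState.IN_CALL


mute_during_call = should_mute(CallState.IN_CALL)
mute_after_call = should_mute(CallState.ENDED)
```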
  • FIG. 18 is a flowchart of a method for playing media resources provided by an embodiment of the present application. Referring to FIG. 18 :
  • the calling terminal sends an INVITE message, where the INVITE message carries the SDP information (eg, SDPA1 information) of the calling terminal.
  • the calling terminal sends a display data acquisition request to the second media server through the target application client, where the display data acquisition request is used to instruct acquisition of the display data of the interactive control.
  • the second media server receives a display data acquisition request sent by the calling terminal through the target application client.
  • the second media server performs legality authentication and functional service authentication on the user based on the relevant information of the call. If the authentication is passed, step 1805 is performed.
  • the second media server when performing authentication, performs authentication based on the calling number and the like in the relevant information, and the specific authentication method is as described in the foregoing embodiment, which is not repeated here.
  • the second media server sends the display data of the interactive control to the calling terminal.
  • This step 1805 refers to step 705 .
  • the calling terminal receives the display data of the interactive control.
  • the display data of the interactive control is acquired at any time after the address information is obtained and before the calling terminal starts to play the first media resource, so that the basic rendering content is available. For example, it is acquired before the interactive content data is acquired, or immediately after the calling terminal receives the 180 ringing message, which is not limited in this embodiment of the present application.
  • the called terminal receives the INVITE message, and sends a 183 message for the call request, where the 183 message carries the SDP information (eg, SDPB1 information) of the called terminal.
  • the steps on the called side, such as the called terminal receiving the INVITE message, and the above interaction of the calling terminal in the IT domain do not depend on each other's order of occurrence.
  • the process of acquiring the display data of the interactive control can be performed immediately.
  • the calling terminal can execute the process of acquiring the display data of the interactive control at any time before playing the media resource.
  • the calling terminal receives the 183 message, and sends a PRACK message to the called terminal, where the PRACK message is used to indicate that the calling terminal has received the 183 message sent by the called terminal.
  • the called terminal receives the PRACK message, and sends a 200 OK (PRACK) to the calling terminal, where the 200 OK (PRACK) is used to indicate that the called terminal has received the PRACK message sent by the calling terminal.
  • the calling terminal receives the 200 OK (PRACK), and sends an UPDATE message to the called terminal, where the SDPA2 information carried in the UPDATE message indicates that the calling terminal has successfully reserved resources for this call.
  • the called terminal receives the UPDATE message, and sends a 200UPDATE message to the calling terminal, where the SDPB2 information carried in the 200UPDATE message indicates that the called terminal has successfully reserved resources for this call.
  • the called terminal starts ringing, and sends the 180-ringing message to the first media server, where the 180-ringing message carries relevant information of the call.
  • the first media server receives the 180 ringing message, and determines the first media resource corresponding to the relevant information according to the relevant information of the call.
  • the determination of the first media resource after receiving the 180 ringing message is used as an example for description. In other possible implementations, the first media server can determine the first media resource at any time after receiving the INVITE message of the calling terminal.
  • the first media server sends an UPDATE message to the calling terminal, where the UPDATE message carries SDP information of the first media resource, where the SDP information is used for media negotiation.
  • the UPDATE message is an example of the first media negotiation message involved in the above-mentioned embodiment shown in FIG. 7. The UPDATE message carries the resource information of the first media resource, such as the resource ID; that is, the resource information is carried in a specific field of the SDP information of the first media resource.
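The text does not name the SDP field that carries the resource information. The sketch below therefore assumes a hypothetical session attribute `a=x-resource-id:<value>` and shows how a calling terminal might read it from the SDP body of the UPDATE message. Only the general `a=<attribute>:<value>` line shape comes from SDP itself; the attribute name, the function name and the sample values are illustrative.

```python
from typing import Optional

RESOURCE_ATTRIBUTE = "x-resource-id"  # hypothetical attribute name, not from the source


def extract_resource_id(sdp: str) -> Optional[str]:
    """Return the value of the assumed resource attribute, or None if absent."""
    for line in sdp.splitlines():
        line = line.strip()
        if not line.startswith("a="):
            continue
        # Attribute lines have the form a=<name> or a=<name>:<value>.
        name, sep, value = line[2:].partition(":")
        if sep and name == RESOURCE_ATTRIBUTE:
            return value.strip()
    return None


SAMPLE_SDP = "\r\n".join([
    "v=0",
    "o=- 0 0 IN IP4 192.0.2.10",
    "s=-",
    "c=IN IP4 192.0.2.10",
    "t=0 0",
    "m=video 49170 RTP/AVP 96",
    "a=rtpmap:96 H264/90000",
    "a=x-resource-id:res-001",
])
resource_id = extract_resource_id(SAMPLE_SDP)
```

If the attribute is missing, the function returns `None`, in which case a terminal could fall back to reading the resource information from the media stream header, the other option the text describes.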
  • the calling terminal receives the UPDATE message, obtains the resource information of the first media resource from the UPDATE message, and sends a 200 OK (UPDATE) message to the first media server, where the 200 OK (UPDATE) message carries the media capability information of the calling terminal, that is, the media negotiation result (e.g., SDPA3 information) between the calling terminal and the second media server.
  • the 200 OK (UPDATE) message is an example of the second media negotiation message involved in the embodiment shown in FIG. 7 above. Refer to step 708 for the process of acquiring the resource information of the first media resource from the UPDATE message.
  • the first media server sends a 180 ringing message to the calling terminal.
  • the first media server sends the media stream of the first media resource to the calling terminal.
  • the calling terminal receives the media stream of the first media resource.
  • the calling terminal determines the resource information of the first media resource based on the first media negotiation message, that is, the UPDATE message. In another possible implementation, the calling terminal determines the resource information of the first media resource based on the media stream of the first media resource received in step 1818. It should be understood that, in an implementation, the calling terminal may use one of the two methods to determine the resource information, or may use both to make sure the resource information is obtained.
  • the calling terminal receives, through the target application client, the interactive content data of the first media resource from the second media server based on the resource information of the first media resource.
  • step 1819 may be performed at any time after the resource information of the first media resource is determined.
  • the execution order of steps 1816 to 1818 and step 1819 may be reversed, or may be performed in parallel.
  • the calling terminal receives the 180 ringing message, and based on the received media stream, plays the first media resource sent by the first media server, and on the playback screen of the first media resource, based on the received display data of the interactive controls and Interactive content data, displaying interactive controls and corresponding interactive content data.
  • the embodiment of the present application does not limit the receiving order of the 180-ring message and the media stream.
  • the calling terminal first receives the 180 ringing message and then receives the media stream, or receives both at the same time, and starts to play the received media stream. It should be noted that, for the calling terminal, starting to play the received media stream after receiving the 180 ringing message is enough to achieve the purpose of playing the resource and displaying the interactive controls.
  • the calling terminal receives the media stream of the second media resource pushed by the second media server, so that in the playback picture of the first media resource, the playback picture of the second media resource is displayed to form a picture-in-picture effect.
  • the interactive control of the second media resource and corresponding interactive content data are displayed on the play screen of the second media resource.
  • if the calling terminal detects a triggering operation on any interactive control, it sends an interaction request corresponding to the interactive control to the second media server, where the interaction request is used to realize the interaction based on the first media resource.
  • the second media server receives the interaction request sent by the calling terminal, and performs processing based on the first media resource.
  • the called terminal sends a 200 OK (INVITE) message to the calling terminal, where the 200 OK (INVITE) message is used to indicate that the called terminal has been off-hook.
  • the calling terminal stops playing the first media resource, and displays a stop screen of the first media resource.
  • the called terminal sends a Bye message to the calling terminal, where the Bye message is used to indicate that the called terminal has hung up.
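Seen from the calling terminal, the FIG. 18 flow can be read as a small state machine driven by the SIP messages it receives. The sketch below is a simplification under assumptions: the state names and the message labels used as dictionary keys are hypothetical, only the transitions mentioned in the steps above are modeled, and other paths (for example, the calling user hanging up first) are left out.

```python
from enum import Enum, auto


class TerminalState(Enum):
    CALLING = auto()      # INVITE sent, negotiation in progress
    PLAYING = auto()      # first media resource playing with interactive overlay
    STOP_SCREEN = auto()  # called party answered; stop screen displayed
    CALL_ENDED = auto()   # Bye received


# (current state, received message) -> next state
TRANSITIONS = {
    (TerminalState.CALLING, "180 Ringing"): TerminalState.PLAYING,
    (TerminalState.PLAYING, "200 OK (INVITE)"): TerminalState.STOP_SCREEN,
    (TerminalState.STOP_SCREEN, "BYE"): TerminalState.CALL_ENDED,
}


def next_state(state: TerminalState, message: str) -> TerminalState:
    """Messages without a modeled transition leave the state unchanged."""
    return TRANSITIONS.get((state, message), state)


def run(messages: list) -> TerminalState:
    state = TerminalState.CALLING
    for message in messages:
        state = next_state(state, message)
    return state


final_state = run([
    "183 Session Progress", "200 OK (PRACK)", "200 OK (UPDATE)",
    "UPDATE", "180 Ringing", "200 OK (INVITE)", "BYE",
])
```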
  • the interaction data of the first media resource is obtained through the second media server, so that the calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet screens, and animation effects. This realizes media interaction in CRBT scenarios, enriches the CRBT user experience, and makes the video CRBT service more engaging.
  • FIG. 19 is a schematic structural diagram of an apparatus for playing media resources provided by an embodiment of the present application, where the apparatus for playing media resources is configured to execute the method performed by the calling terminal in the foregoing embodiment.
  • the media resource playback device includes a call request sending module 1901, a determination module 1902, an acquisition request sending module 1903, a receiving module 1904, a playing module 1905 and a display module 1906, wherein:
  • a call request sending module 1901 configured to send a call request to the called terminal, the call request passing through the first media server;
  • a determining module 1902 configured to determine resource information of the first media resource provided by the first media server
  • an acquisition request sending module 1903 configured to send an interaction data acquisition request to the second media server through the target application client, where the interaction data acquisition request carries resource information of the first media resource;
  • a receiving module 1904 configured to receive the interaction data of the first media resource returned by the second media server based on the resource information
  • Playing module 1905 configured to receive and play the first media resource sent by the first media server
  • the display module 1906 is configured to display the interaction data of the first media resource on the playback screen of the first media resource.
  • the first media server is a server located in the communication technology CT domain
  • the second media server is a server located in the Internet technology IT domain.
  • the determining module 1902 includes any of the following:
  • a first acquisition sub-module configured to perform the process of acquiring resource information from the first media negotiation message in step 708 or step 1102;
  • the second acquiring sub-module is configured to perform the process of acquiring resource information from the header of the media stream in step 713 or step 1102 .
  • the first media negotiation message is an Update message
  • the resource information is located in the session description protocol SDP information of the Update message.
  • the resource information is located in the additional enhancement information SEI in the header of the media stream.
  • the interactive data is interactive content data
  • the apparatus further includes:
  • the acquisition request sending module 1903 is further configured to perform step 702 or step 1802;
  • the address information receiving module 1904 is configured to execute step 706 or step 1806.
  • the interactive data includes interactive content data and display data of interactive controls.
  • the device further includes:
  • the interactive request sending module is used to execute step 1401 or step 1821.
  • the apparatus further includes a shutdown module for performing step 1501 .
  • the playing module 1905 is further configured to perform step 1502 or step 1824.
  • the playing module 1905 is further configured to perform step 1504 .
  • the display module 1906 is further configured to perform the process of displaying the portal website interface in step 1826 or step 1504 .
  • the closing module is further configured to execute the process of closing the target application client in step 1504.
  • the apparatus further includes a session message sending module, configured to execute the process of sending the session end message in step 1504 .
  • the device further includes:
  • the playing module 1905 is used to execute the process of playing the first media resource in full screen mode in step 1105;
  • the playing module 1905 is further configured to execute the process of playing the second media resource in the floating window in step 1105 .
  • the playing module 1905 is further configured to perform the process of playing the second media resource in full screen and playing the first media resource in the floating window in step 1105 .
  • the playing module 1905 is further configured to perform the process of playing the first media resource on the floating window and playing the second media resource on the floating window in step 1105 .
  • the interaction data includes at least one of homepage visit data, like data, comment data, sharing data, and download data.
  • the interactive control includes at least one of a homepage access control, a like control, a comment control, a share control, and a download control.
  • when the media resource playback device provided in the above embodiment plays media resources, the division into the above functional modules is only an example. In practical applications, the above functions may be allocated to different functional modules as required; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above.
  • the media resource playback device provided by the above embodiments and the embodiments of the media resource playback method belong to the same concept; its specific implementation process is detailed in the method embodiments and is not repeated here.
  • the technical solutions provided in the embodiments of the present application supplement the media resources in the CT domain: the interaction data of the first media resource is obtained through the second media server in the IT domain, so that, while playing the first media resource, the calling terminal can add an overlay layer on the interface to display the interactive control and the corresponding interactive content data, such as interactive buttons, bullet screens, and animation effects. This enriches the CRBT user experience and makes the video CRBT service more engaging.
  • the video CRBT service can also be expanded more flexibly, for example with video overlay playback, video interface interaction, and scrolling switching of video content, which makes the video CRBT service more engaging.
  • FIG. 20 is a schematic structural diagram of a media server provided by an embodiment of the present application.
  • the media server includes a receiving module 2001, a determining module 2002 and a returning module 2003, wherein:
  • a receiving module 2001 configured to receive an interaction data acquisition request sent by a calling terminal through a target application client, where the interaction data acquisition request carries resource information of a first media resource;
  • a determining module 2002 configured to determine interaction data of the first media resource based on the resource information
  • Returning module 2003 is used to return the interaction data to the calling terminal.
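The three modules above map naturally onto three small methods. The sketch below is only one possible arrangement: the class name, method names, request shape and sample data are all hypothetical, and a real server would sit behind an HTTP or HTTPS endpoint.

```python
class SecondMediaServerSketch:
    """Hypothetical grouping of the receiving, determining and returning modules."""

    def __init__(self, interaction_data_by_resource: dict):
        self._data = interaction_data_by_resource

    def receive(self, request: dict) -> str:
        # Receiving module: accept the request and read the resource information.
        return request["resource_id"]

    def determine(self, resource_id: str) -> dict:
        # Determining module: look up the interaction data for that resource.
        return self._data.get(resource_id, {})

    def respond(self, interaction_data: dict) -> dict:
        # Returning module: wrap the data in a response for the calling terminal.
        return {"status": "ok", "interaction_data": interaction_data}

    def handle(self, request: dict) -> dict:
        return self.respond(self.determine(self.receive(request)))


server = SecondMediaServerSketch({"res-001": {"likes": 41, "comments": ["nice"]}})
response = server.handle({"resource_id": "res-001"})
```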
  • the determining module 2002 is configured to execute the process of determining the interactive content data in step 709 or step 1819;
  • the returning module 2003 is configured to execute the process of returning the interactive content data in step 709 or step 1819 .
  • the interactive data is interactive content data
  • the apparatus further includes:
  • the receiving module 2001 is further configured to perform step 703 or step 1803;
  • the determining module 2002 is further configured to perform the process of determining the display data in step 705;
  • the returning module 2003 is further configured to execute step 705 or step 1805.
  • the apparatus further includes a loading module configured to perform the process of preloading display data in step 705 .
  • the display data acquisition request also carries relevant information of the call that the calling terminal participates in, and the apparatus further includes an authentication module for performing step 704 or step 1804 .
  • the device further includes:
  • the receiving module 2001 is further configured to perform the process of receiving the resource acquisition request in step 1503;
  • the determining module 2002 is further configured to perform the process of determining the third media resource in step 1503;
  • the returning module 2003 is further configured to perform the process of returning the third media resource in step 1503 .
  • the apparatus further includes an establishment module, configured to perform the process of establishing a session in step 1503 .
  • the apparatus further includes a release module, configured to perform the process of releasing the session in step 1504 .
  • when the media server provided in the above embodiment plays media resources, the division into the above functional modules is only an example; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above.
  • the media server provided by the above embodiments and the method embodiments on the second media server side of the media resource playback method belong to the same concept; its specific implementation process is detailed in the method embodiments and is not repeated here.
  • a computer storage medium is provided, which may be a computer-readable storage medium, such as a memory including program code that can be executed by a processor in a terminal to carry out the media resource playback method on the calling terminal side in the above embodiments.
  • the computer-readable storage medium may be ROM, RAM, compact disc read-only memory (CD-ROM), magnetic tape, floppy disk, optical data storage device, and the like.
  • a computer storage medium is provided, which may be a computer-readable storage medium, such as a memory including program code that can be executed by a processor in a terminal to carry out the method on the second media server side in the above embodiments.
  • the computer-readable storage medium may be ROM, RAM, compact disc read-only memory (CD-ROM), magnetic tape, floppy disk, optical data storage device, and the like.
  • the present application also provides a system for playing media resources, the system includes a calling terminal, a first media server and a second media server.
  • the calling terminal, the first media server, and the second media server are respectively configured to execute the media resource playback methods provided by the embodiments shown in FIG. 7, FIG. 11, FIG. 14, FIG. 15, and FIG. 18.
  • the disclosed system, apparatus and method may be implemented in other manners.
  • the apparatus embodiments described above are only illustrative.
  • the division of modules in the apparatus, or of units in a module, is only a logical function division. In actual implementation, there may be other division methods; for example, multiple modules or units may be combined or integrated into another system, or some features may be omitted or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may also be electrical, mechanical or other forms of connection.
  • modules or units described as separate components may or may not be physically separated, and components shown as modules or units may or may not be physical modules or physical units; that is, they may be located in one place or may be distributed across multiple computer devices or chips. Some or all of the modules or units may be selected according to actual needs to achieve the purpose of the solutions of the embodiments of the present application.
  • each functional module or unit in each embodiment of the present application may be integrated into one target processing module, or each module or unit may exist physically alone, or two or more modules or units may be integrated into one target processing module.
  • the above-mentioned integrated modules or units may be implemented in the form of hardware, or may be implemented in the form of software functional units.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • Information Transfer Between Computers (AREA)
  • Telephonic Communication Services (AREA)

Abstract

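This application discloses a media resource playback method and a related apparatus, and belongs to the field of communication technologies. In the technical solutions provided in the embodiments of this application, the interaction data of a first media resource is obtained through a second media server, so that a calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet screens, and animation effects. This realizes media interaction in color ring back tone (CRBT) scenarios, enriches the CRBT user experience, and makes the video CRBT service more engaging.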

Description

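Media resource playback method and related apparatus
This application claims priority to Chinese Patent Application No. 202010901206.7, filed on August 31, 2020 and entitled "Media resource playback method and related apparatus" (媒体资源播放方法和相关装置), which is incorporated herein by reference in its entirety.
Technical Field
This application relates to the field of communication technologies, and in particular, to a media resource playback method and a related apparatus.
Background
With the continuous development of communication technologies, high-definition voice (voice over long term evolution, VOLTE) technology has gradually entered people's lives, and people can enjoy various types of media resource experiences. For example, a user can enjoy a video experience during a voice call, such as video CRBT or video customer service, which makes the waiting phase before the call is connected more engaging and greatly improves the user's calling experience.
Currently, media resources are usually played based on the CT (communication technology) domain, which may also be called the telecom domain. The corresponding process may be as follows: the calling terminal initiates a call to the called terminal; after the called terminal rings, the media server in the CT domain pulls, from a media resource server and based on the user subscription information, the media resource corresponding to the user subscription information; then, after receiving a playback confirmation message from the calling terminal, it instructs the media resource server to start playing the media resource for the calling terminal.
In the above solution, the media server in the CT domain plays CT-domain media resources for the user during the calling phase. Only the picture of the media resource can be displayed on the call interface, so the playback form is limited and the user experience is poor.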
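Summary
Embodiments of this application provide a media resource playback method and a related apparatus, which can realize media interaction in CRBT scenarios, enrich the CRBT user experience, and make the video CRBT service more engaging. The technical solutions of the media resource playback method and the related apparatus are as follows:
According to a first aspect, a media resource playback method is provided and is applied to a calling terminal. The method may be implemented as follows:
sending a call request to a called terminal, where the call request passes through a first media server;
determining resource information of a first media resource provided by the first media server;
sending an interaction data acquisition request to a second media server through a target application client, where the interaction data acquisition request carries the resource information of the first media resource;
receiving interaction data of the first media resource returned by the second media server based on the resource information;
receiving and playing the first media resource sent by the first media server; and
displaying the interaction data of the first media resource on the playback screen of the first media resource.
In the technical solutions provided in the embodiments of this application, the interaction data of the first media resource is obtained through the second media server, so that the calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet screens, and animation effects. This realizes media interaction in CRBT scenarios, enriches the CRBT user experience, and makes the video CRBT service more engaging. In addition, with the development and commercial use of 5G, IT network resources are no longer a bottleneck and network latency is shorter, so the video CRBT service can be expanded more flexibly, for example with video overlay playback, video interface interaction, and scrolling switching of video content, which makes the video CRBT service more engaging.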
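In a possible implementation, the first media server is a server located in the communication technology (CT) domain, and the second media server is a server located in the Internet technology (IT) domain. Supplementing the playback of CT-domain media resources with IT-domain interaction data makes the video CRBT service more engaging and more flexible.
In a possible implementation, determining the resource information of the first media resource provided by the first media server includes either of the following: obtaining the resource information of the first media resource from a first media negotiation message sent by the first media server; or obtaining the resource information carried by the first media resource from the header of the media stream of the first media resource transmitted by the first media server. This implementation provides multiple ways to obtain the resource information of the first media resource, which improves flexibility.
In a possible implementation, the first media negotiation message is an Update message, and the resource information is located in the Session Description Protocol (SDP) information of the Update message.
In a possible implementation, the resource information is located in the supplemental enhancement information (SEI) in the header of the media stream.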
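In a possible implementation, the interaction data is interactive content data, and before the sending of the interaction data acquisition request to the second media server through the target application client, the method further includes:
sending a display data acquisition request to the second media server, where the display data acquisition request is used to instruct acquisition of the display data of the interactive control; and
receiving address information returned by the second media server based on the display data acquisition request, where the address information is used to provide the display data of the interactive control, and obtaining the display data of the interactive control from the second media server based on the address information.
In this implementation, the display data of the interactive control is first obtained from the second media server, and the interactive content data is then obtained from the second media server according to the resource information. Because the display data of the interactive control is obtained in advance, it does not need to be fetched from the second media server again when the first media resource is subsequently played, which reduces the amount of data to fetch and reduces display latency.
In a possible implementation, the interaction data includes interactive content data and display data of interactive controls.
In this implementation, the interactive content data and the display data of the interactive controls are obtained in one request, which keeps data acquisition and display synchronized.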
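In a possible implementation, after the displaying of the interaction data of the first media resource on the playback screen of the first media resource, the method further includes: if a triggering operation on any interactive control is detected, sending an interaction request corresponding to the interactive control to the second media server, where the interaction request is used to realize interaction based on the first media resource. This implementation provides IT-domain interactive functions through interactive controls, enriches the visual content of CT-domain media resources, and improves the CRBT service experience.
In a possible implementation, after the displaying of the interaction data of the first media resource on the playback screen of the first media resource, the method further includes: if an off-hook message of the called terminal is received, closing the display of the interaction data of the first media resource.
In a possible implementation, after the displaying of the interaction data of the first media resource on the playback screen of the first media resource, the method further includes: if an off-hook message of the called terminal is received, stopping playing the first media resource, and displaying a stop screen of the first media resource. In this implementation, keeping the stop screen displayed provides an entry for subsequent operations.
In a possible implementation, after the stopping of playback of the first media resource and the displaying of the stop screen upon receiving the off-hook message of the called terminal, the method further includes: if a continue-playing operation on the stop screen is detected, obtaining a third media resource from the second media server and playing the third media resource, where the third media resource matches the first media resource. In the above process, the calling user can click the stop screen during the call or at any time after the call ends and obtain a media resource with the same media content from the IT domain, which provides a continuous and complete audio-visual experience.
In a possible implementation, after the stopping of playback of the first media resource and the displaying of the stop screen upon receiving the off-hook message of the called terminal, the method further includes: if a triggering operation on the stop screen is detected, displaying the portal website interface in the opened target application client. In the above embodiment, if the calling user wants to access the portal website, clicking the stop screen is enough, which is convenient and simple.
In a possible implementation, after the stopping of playback of the first media resource and the displaying of the stop screen upon receiving the off-hook message of the called terminal, the method further includes: if a closing operation on the stop screen or a screen-off is detected, closing the target application client. This implementation provides a quick and convenient way to close the client and improves operability.
In a possible implementation, after the closing operation on the stop screen or the screen-off is detected, the method further includes: sending a session end message to the second media server, where the session end message is used to instruct session release. After receiving the session end message sent by the calling terminal, the second media server releases the preloaded interactive content data of the second media resource to save memory space.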
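In a possible implementation, the method further includes: obtaining a second media resource from the second media server through the target application client;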
该播放该第一媒体服务器发送的该第一媒体资源包括:以全屏模式播放该第一媒体资源;
该方法还包括:在该第一媒体资源的播放画面上的悬浮窗内,静音播放该第二媒体资源。
在一种可能的实现方式中,该方法还包括:以全屏模式播放该第二媒体资源;
该播放该第一媒体服务器发送的第一媒体资源包括:在该第二媒体资源的播放画面上的悬浮窗内,静音播放该第一媒体资源。
在一种可能的实现方式中,该播放该第一媒体服务器发送的第一媒体资源包括:以悬浮窗形式播放该第一媒体资源;
该方法还包括:以悬浮窗形式播放该第二媒体资源。
上述过程提供了在同时播放IT域和CT域媒体资源时的多种不同播放方式,为用户提供了直观的界面显示效果,也给用户对媒体资源进行选择的余地。进一步地,一个媒体资源全屏显示,另一个悬浮窗显示,还形成了一种画中画的效果,在不会造成视觉混乱的情况下,提供了较好的视觉感受。可选地,还为运营提供了投放位,广告主可以通过在第一媒体服务器或者第二媒体服务器进行媒体资源的投放,以对终端进行推送,达到运营目的。
在一种可能的实现方式中,该交互数据包括主页访问数据、点赞数据、评论数据、分享数据和下载数据中至少一项。在一种可能的实现方式中,该交互控件包括主页访问控件、点赞控件、评论控件、分享控件和下载控件中至少一项。上述实现方式丰富了互动功能,大大提高了用户体验。
In a second aspect, a media resource playing method is provided, applied to a second media server. The method may be implemented as follows:
接收主叫终端通过目标应用客户端发送的交互数据获取请求,该交互数据获取请求携带第一媒体资源的资源信息;
基于该资源信息,确定该第一媒体资源的交互数据;
向该主叫终端返回该交互数据。
本申请实施例提供的技术方案通过第二媒体服务器获取第一媒体资源的交互数据,进而,主叫终端能够在播放第一媒体资源的同时,在界面上增加一个叠加层,用以显示该交互数据,例如互动按钮、弹幕以及动画效果等,实现了彩铃场景下的媒体交互,丰富了彩铃用户的体验,增加了视频彩铃业务的趣味性,且,随着5G的发展和商用,在IT网络资源不构成瓶颈、网络时延更短的背景下,让视频彩铃业务可以更灵活地拓展,例如视频叠层播放、视频界面互动、视频内容滚动切换等,增加视频彩铃业务的趣味性。
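In a possible implementation, determining the interaction data of the first media resource based on the resource information includes: determining address information corresponding to the resource information, where the address information is used to provide the interaction data;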
该向该主叫终端返回该交互数据包括:向该主叫终端返回该地址信息。
在一种可能的实现方式中,该交互数据为交互内容数据,该接收主叫终端通过目标应用客户端发送的交互数据获取请求之前,该方法还包括:
接收该主叫终端通过目标应用客户端发送的显示数据获取请求,该显示数据获取请求用于指示获取交互控件的显示数据;
确定该交互控件的显示数据的地址信息,该地址信息用于提供交互控件的显示数据;
向该主叫终端返回该地址信息。
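In this implementation, the display data of the interactive controls is obtained in advance through the second media server, which ensures that the interactive controls are displayed promptly when the media resource is subsequently played and avoids display delay.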
在一种可能的实现方式中,该确定该交互控件的显示数据的地址信息之后,该方法还包括:预加载该交互控件的显示数据。
通过上述预加载的方法,能够优化后续交互控件显示的时延问题,提高了显示数据的获取效率,也就不会导致后续媒体资源播放时发生控件显示时间较长的问题。
在一种可能的实现方式中,该显示数据获取请求还携带该主叫终端参与的此次呼叫的相关信息,该接收该主叫终端通过目标应用客户端发送的显示数据获取请求之后,该方法还包括:基于该呼叫的相关信息,对用户进行合法性鉴权和功能服务鉴权。上述过程,通过对用户进行鉴权,来验证用户的合法性和有效性中至少一项,以确保后续媒体资源播放时交互控件的安全显示,提高了IT域的交互控件显示的安全性和可靠性。
在一种可能的实现方式中,该向该主叫终端返回该交互数据之后,该方法还包括:接收该主叫终端发送的资源获取请求,该资源获取请求携带该第一媒体资源的资源信息;基于该资源信息,确定该资源信息对应的第三媒体资源,该第三媒体资源与该第一媒体资源匹配;向该主叫终端返回该第三媒体资源。上述过程中,主叫用户可以在通话过程中或者通话结束后的任一时刻,点击停止画面,能够通过从IT域获取具有相同媒体内容的媒体资源,提供了连续且完整的视听体验。
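In a possible implementation, after determining, based on the resource information, the third media resource corresponding to the resource information, the method further includes: establishing a session with the calling terminal based on the third media resource, where the session is used to record playback information of the third media resource.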
在一种可能的实现方式中,该向该主叫终端返回该第三媒体资源之后,该方法还包括:若接收到该主叫终端的会话结束消息,释放与该主叫终端之间的会话,该会话结束消息用于表示会话结束。通过及时对会话进行释放,能够避免对服务器资源的浪费。
第三方面,提供了一种媒体资源播放装置,用于执行上述媒体资源播放方法。具体地,该媒体资源播放装置包括用于执行上述第一方面或上述第一方面的任一种可选方式提供的媒体资源播放方法的功能模块。
第四方面,提供了一种媒体服务器,用于执行上述媒体资源播放方法。具体地,该媒体服务器包括用于执行上述第二方面或上述第二方面的任一种可选方式提供的媒体资源播放方法的功能模块。
第五方面,提供了一种媒体资源播放方法,该方法包括:
主叫终端向被叫终端发送呼叫请求,该呼叫请求经过第一媒体服务器;主叫终端确定该第一媒体服务器提供的第一媒体资源的资源信息;主叫终端通过目标应用客户端,向第二媒体服务器发送交互数据获取请求,该交互数据获取请求携带该第一媒体资源的资源信息;
第二媒体服务器接收主叫终端通过目标应用客户端发送的交互数据获取请求,该交互数据获取请求携带第一媒体资源的资源信息;第二媒体服务器基于该资源信息,确定该第一媒体资源的交互数据;第二媒体服务器向该主叫终端返回该交互数据。
主叫终端接收该第二媒体服务器基于该资源信息返回的该第一媒体资源的交互数据;接收并播放该第一媒体服务器发送的该第一媒体资源;在该第一媒体资源的播放画面上,显示该第一媒体资源的交互数据。
第六方面,提供了一种媒体资源播放系统,该系统包括主叫终端、第一媒体服务器与第二媒体服务器,该主叫终端、第一媒体服务器与第二媒体服务器用于执行上述第五方面提供的媒体资源播放方法。
第七方面,提供了一种终端,该终端包括处理器和存储器,该存储器中存储有至少一条程序代码,该程序代码由该处理器加载并执行以实现如上述第一方面或上述第一方面的任一种可选方式提供的媒体资源播放方法。
第八方面,提供了一种服务器,该服务器包括处理器和存储器,该存储器中存储有至少一条程序代码,该程序代码由该处理器加载并执行以实现如上述第二方面或上述第二方面的任一种可选方式提供的媒体资源播放方法。
第九方面,提供了一种计算机程序产品,当该计算机程序产品在计算机上运行时,使得该计算机执行第一方面或第二方面或第一方面和第二方面的任一种可选方式的任意方法的部分或全部步骤。
第十方面,提供了一种计算机存储介质,该计算机存储介质中存储有至少一条程序代码,该程序代码由处理器加载并执行以实现如第一方面或第二方面或第一方面和第二方面的任一种可选方式的媒体资源播放方法。
附图说明
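To describe the technical solutions in the embodiments of this application more clearly, the accompanying drawings needed for describing the embodiments are briefly introduced below. Evidently, the drawings described below show only some embodiments of this application, and a person of ordinary skill in the art can derive other drawings from them without creative effort.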
图1是本申请实施例提供的一种媒体资源系统架构图;
图2是本申请实施例提供的一种视频彩铃云平台在不同域所提供服务的示意图;
图3是本申请实施例提供的一种媒体资源系统架构图;
图4是本申请实施例提供的一种终端的系统架构图;
图5是本申请实施例提供的一种终端的结构示意图;
图6是本申请实施例提供的一种服务器的结构示意图;
图7是本申请实施例提供的一种媒体资源播放方法的流程图;
图8是本申请实施例提供的一种媒体资源的播放示意图;
图9是本申请实施例提供的一种Update消息携带资源信息的示意图;
图10是本申请实施例提供的一种媒体流携带资源信息的示意图;
图11是本申请实施例提供的一种媒体资源播放方法的流程图;
图12是本申请实施例提供的一种媒体资源的播放示意图;
图13是本申请实施例提供的一种媒体资源的播放示意图;
图14是本申请实施例提供的一种基于媒体资源播放的交互方法的流程图;
图15是本申请实施例提供的一种基于媒体资源播放的播放暂停处理方法的流程图;
图16是本申请实施例提供的一种媒体资源的播放示意图;
图17是本申请实施例提供的一种媒体资源的播放示意图;
图18是本申请实施例提供的一种媒体资源播放方法的流程图;
图19是本申请实施例提供的一种媒体资源播放装置的结构示意图;
图20是本申请实施例提供的一种媒体服务器的结构示意图。
具体实施方式
下面将结合附图对本申请实施方式作进一步地详细描述。
本申请实施例可以适用于第4代(4G)、第5代(5G)移动通信网络架构或未来网络。为了描述方便,下面以基于4G的VoLTE网络为例来说明该方案的网络架构和方法流程。
图1是本申请实施例提供的一种媒体资源系统架构图,参见图1,该系统可以包括位于CT域的第一媒体服务器、IT域的第二媒体服务器、主叫终端以及被叫终端。
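The CT domain may also be called the telecom domain. The CT domain implements communication through the evolved packet core (EPC), the IP multimedia subsystem (IMS) domain core network, and so on. The IMS domain core network includes several application servers (AS), such as the first media server. The first media server provides playback of the first media resource for terminals; for example, when it provides the video ring back tone service, the first media server is also called the video ring back tone platform. The first media server may include a media application server and a media resource server (media resource subsystem, MRS). The media resource server, also called the ring back tone platform, provides media resources such as video ring back tones, video ringtones, video advertisements, and video customer service; for example, the media resource server produces and manages these media resources. The media application server and the media resource server may be co-located or physically separate. The media application server processes session initiation protocol (SIP) signaling messages, and the media resource server provides audio streams and/or video streams for the calling terminal and/or the called terminal.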
In addition, the IMS domain core network further includes: a serving-call session control function (S-CSCF) device, an interrogating-call session control function (I-CSCF) device, a proxy-call session control function (P-CSCF) device, a home subscriber server (HSS) device, a session border controller (SBC) device, and several application servers, such as a telephony application server (TAS), a multimedia telephony application server (MMTel AS), and a service centralization and continuity application server (SCC AS). The I-CSCF device and the S-CSCF device may be co-located and referred to as the "I/S-CSCF" device. The SBC device and the P-CSCF device may be co-located and referred to as the "SBC/P-CSCF" device. The EPC may include a packet data network gateway (PGW) device, a serving gateway (SGW) device, and a mobility management entity (MME) device.
S/P-GW设备,用于提供服务网关和分组数据网网关逻辑实体的功能。SGW是本地移动的锚点,主要面向无线接入网,以进行业务面数据的传输,P-GW是EPS锚点,主要面向其它数据网络,实现与多个公共数据网的访问交互。SGW设备可以用于IMS核心网与无线网络的连接,PGW设备可以用于IMS核心网和网际互连协议(internet protocol,IP)网络的连接。MME设备是EPC网络的核心设备,用于提供MME逻辑实体的功能。
参见图1,在第一媒体服务器和主叫终端进行交互时,其消息数据流为CT域信令流,即其控制消息为SIP消息。而在进行媒体资源的传输时,其媒体数据流为CT域媒体流,即RTP媒体流。终端通过EPC中的S/P-GW接入IMS域核心网,来访问第一媒体服务器,例如,主叫终端在呼叫流程中,会通过与IMS域核心网中的网络设备之间进行SIP消息的交互,进而由第一媒体服务器与主叫终端进行媒体协商等流程,若本次呼叫存在CT域的媒体资源,则由第一媒体服务器将第一媒体资源以RTP媒体流的形式推送给主叫终端。
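The IT domain may also be called the Internet domain. The second media server provides interaction data services for users, can also provide services such as media resource playback, and offers a visual interface entry through which users can set up and manage media resources. Optionally, the second media server also provides a settings function: the user configures it through the target application client to produce setting information that indicates personalized display options such as the overlay layer style. Optionally, the second media server stores user information itself, or is associated with a database that stores user information, the display data of the interactive controls, the interaction content data of media resources, the media resources themselves, and so on. The embodiments of this application are described using the example in which the second media server is associated with an interaction database.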
在本申请实施例中,主叫终端上安装有目标应用客户端,如,视频彩铃应用,主叫终端能够通过该目标应用客户端来获取上述第二媒体服务器所提供的交互数据服务,也即是,获取第二媒体服务器所提供的媒体资源的交互数据,并在呼叫过程中基于媒体资源的播放进行显示。
第二媒体服务器提供交互数据服务,用户通过目标应用客户端能够订阅该交互数据服务,例如通过目标应用客户端访问第二媒体服务器的门户网站,在门户网站上进行操作来进行订阅,在订阅完成后,第二媒体服务器将用户信息存储至交互数据库,以便后续基于所存储的信息来提供交互数据服务。可选地,第二媒体服务器还提供媒体资源的订阅服务,以订阅呼叫过程中所播放的媒体资源,从而使得终端在呼叫过程中通过目标应用客户端来获取媒体资源进行播放。
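Optionally, when the second media server and the calling terminal exchange signaling, the transport protocol is the hypertext transfer protocol (HTTP) or the hypertext transfer protocol over secure socket layer (HTTPS). In the embodiments of this application, the second media server supports multiple transport protocols and, compared with the single signaling protocol of the CT domain, can implement signaling interaction around media resource playback more conveniently and flexibly.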
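Optionally, when the second media server and the calling terminal transmit media streams, the transport protocol is the real time messaging protocol (RTMP) or a streaming protocol that reuses HTTP (hypertext transfer protocol flash video, Http Flv). In the embodiments of this application, the second media server supports multiple transport protocols and, compared with the single media stream protocol of the CT domain, can implement media resource playback more conveniently and flexibly.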
需要说明的是,第二媒体服务器和主叫终端进行交互时,采用IT域的信令流或媒体流。主叫终端和第一媒体服务器进行交互时,采用CT域的信令流或媒体流。相应地,在第二媒体服务器和主叫终端进行交互时,若采用HTTP协议,则其交互的控制消息为HTTP消息、HTTPS消息、HTTP响应或HTTPS响应,其消息数据流为IT域信令流,例如,超文本标记语言流(hypertext markup language flow,HTML Flow)。而在进行媒体资源的传输时,其媒体数据流为IT域媒体流,例如,RTMP媒体流、Http Flv媒体流。
参见图1,主叫终端通过网关来访问第二媒体服务器。网关可包括分组数据网网关(packet data network gateway,PGW)设备、服务网关(serving gateway,SGW)设备。其中,PGW设备和SGW设备可以合设在一起,可以简称为“S/P-GW”设备。具体地,主叫终端在发起呼叫后,通过目标应用客户端将相应请求发送至S/P-GW,由S/P-GW发送给第二媒体服务器,在主叫终端进行CT域的呼叫过程中,该目标应用客户端通过IT网络访问第二媒体服务器,获取IT域的信令流,进而在振铃阶段在CT域媒体资源的播放画面上,显示交互控件和相应的交互内容数据,从而实现基于CT域彩铃业务的互动功能。应理解地,IT域的媒体资源播放是在CT域的呼叫过程中进行的,其获取由CT域的呼叫过程触发,但是,由于主叫终端和第二媒体服务器的交互是通过IT域进行,因此,并不会影响CT域的呼叫流程,可以保证呼叫的正常进行。
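Optionally, refer to FIG. 2, a schematic diagram of the services that the video ring back tone cloud platform provides in the different domains. The first media server provided by the embodiments of this application is offered to operators for services such as advertising and corporate promotion, for example personal video ring back tones, enterprise video ring back tones, media advertisements, fixed-line video ring back tones, care video ring back tones, and scenario video ring back tones. The terminal accesses the first media server through the CT domain. The terminal and the first media server interact over CT-domain signaling flows, for example using SIP messages, and the first media server sends media resources to the terminal as CT-domain media streams, for example sending the first media resource as an RTP media stream. The second media server can provide interaction services, such as any one of content yellow pages, like, comment, download, and share services, and services for more content. Optionally, the second media server can also provide an IT-domain video ring back tone service during the ringing phase; hereinafter, the second media resource denotes the IT-domain media resource. The terminal accesses the second media server through the IT domain. The terminal and the second media server interact over IT-domain signaling flows, for example using HTTP messages, HTTPS messages, HTTP responses, or HTTPS responses, and the second media server sends media resources to the terminal as IT-domain media streams, for example sending the second media resource as an RTMP media stream.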
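A calling terminal can access the network in many ways. FIG. 1 takes access through a gateway as an example; in another system architecture, the calling terminal can access the network through other network devices, for example through an access network or a metropolitan area network. FIG. 3 is an architecture diagram of a media resource system provided by an embodiment of this application. The system includes network devices located in the access network or metropolitan area network, a second media server in the IT domain, a first media server in the CT domain, a calling terminal, and a called terminal. The access network or metropolitan area network includes a broadband remote access server (BRAS) and a router. The BRAS is a network device that performs access, authentication, accounting, control, and management for broadband network users across various broadband access modes. The router is a device that connects local area networks and wide area networks; it can automatically select and set routes according to channel conditions and transmit messages along the best path. For the other network devices, see the embodiment shown in FIG. 1. A resource acquisition request sent by the terminal through the target application client is forwarded by the BRAS and the router to the second media server, which is how the terminal reaches the second media server and how the media resource playback provided by the embodiments of this application is implemented. For the signaling interaction and media resource transmission between the calling terminal and the first media server, see the embodiment of FIG. 1 above.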
上述图1和图3的系统架构是基于终端的不同入网方式所示出的系统架构,而随着4G网络的普及以及5G网络的建设,移动互联网和家庭宽带固网的网络带宽和时延都具备非常高的可用性,能够保障通信的安全性和可靠性,通信效率高。
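The terminal involved in the embodiments of this application is a device with wireless transceiver functions. It can be deployed on land, including indoors or outdoors, handheld or vehicle-mounted; on water (such as on a ship); or in the air (such as on an aircraft, balloon, or satellite). Specifically, the terminal may be a terminal device that can access a mobile network, a mobile phone, a tablet computer (pad), a computer with wireless transceiver functions, a virtual reality (VR) terminal, an augmented reality (AR) terminal, a wireless terminal in industrial control, a wireless terminal in self driving, a wireless terminal in remote medical care, a wireless terminal in a smart grid, a wireless terminal in transportation safety, a wireless terminal in a smart city, a wireless terminal in a smart home, and so on. The terminal may also be a terminal device that can access a fixed network, such as a wired telephone, or a soft terminal corresponding to application software with a calling function. FIG. 4 is a system architecture diagram of a terminal provided by an embodiment of this application. Referring to FIG. 4, the terminal may include: applications, an application framework, a hardware abstraction layer (HAL), libraries, and the Linux kernel. The applications include a phone dialing application (dialer), a video ring back tone application (video RBT), and so on. The dialer provides calling and dialing functions. The video ring back tone application is the target application mentioned above, and the target application client is the video ring back tone application client. The application framework includes a window management module (window manager), a call management module (telephone manager), a resource management module (resource manager), and so on. The hardware abstraction layer is an interface layer between the operating system (Linux) kernel and the hardware circuits and is used to abstract the hardware; in the embodiments of this application, the hardware abstraction layer is the interface layer of the operating system. The libraries store files of Android system applications and third-party applications and make it easy for one application to call functions of another. For example, after initiating a call, the dialer invokes the video ring back tone application to obtain interaction data from the IT domain and, optionally, a ring back tone from the IT domain. In implementing the embodiments of this application, the dialer and the video ring back tone application cooperate so that interaction data is displayed while the media resource is playing.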
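FIG. 5 is a schematic structural diagram of a terminal provided by an embodiment of this application. The terminal can be used to perform the media resource playing method on the calling terminal side in each of the following embodiments. Referring to FIG. 5, the terminal 500 may include a radio frequency (RF) circuit 501, a memory 502 including one or more computer-readable storage media, an input unit 503, a display unit 504, an audio circuit 505, a wireless fidelity (WiFi) module 506, a processor 507 including one or more processing cores, a power supply 508, and other components. A person skilled in the art will understand that the terminal structure shown in FIG. 5 does not limit the terminal, which may include more or fewer components than shown, combine some components, or arrange the components differently. Specifically: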
RF电路501可用于收发信息或通话过程中,信号的接收和发送,特别地,将基站的下行信息接收后,交由一个或者一个以上处理器507处理;另外,将涉及上行的数据发送给基站。通常,RF电路501包括但不限于天线、至少一个放大器、调谐器、一个或多个振荡器、用户身份模块(SIM)卡、收发信机、耦合器、低噪声放大器(low noise amplifier,LNA)、双工器等。此外,RF电路501还可以通过无线通信与网络和其他设备通信。该无线通信可以使用任一通信标准或协议,包括但不限于全球移动通讯系统(global system of mobile communication,GSM)、通用分组无线服务(general packet radio service,GPRS)、码分多址(code division multiple access,CDMA)、宽带码分多址(wideband code division multiple access,WCDMA)、长期演进(long term evolution,LTE)、电子邮件、短消息服务(short messaging service,SMS)等。该RF电路501用于实现本申请实施例中的呼叫建立过程以及通话等过程。
存储器502可用于存储软件程序以及模块,处理器507通过运行存储在存储器502的软件程序以及模块,从而执行各种功能应用以及数据处理。存储器502可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据终端500的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器502可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。相应地,存储器502还可以包括存储器控制器,以提供处理器507和输入单元503对存储器502的访问。该存储器502还用于存储本申请实施例中终端所获取到的交互控件的显示数据、交互内容数据、第二媒体资源和第一媒体资源中的至少一项。
输入单元503可用于接收输入的数字或字符信息,以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。具体地,输入单元503可包括触敏表面5031以及其他输入设备5032。触敏表面5031,也称为触摸显示屏或者触控板,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触敏表面5031上或在触敏表面5031附近的操作),并根据预先设定的程式驱动相应的连接装置。可选的,触敏表面5031可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸信息,并将它转换成触点坐标,再送给处理器507,并能接收处理器507发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触敏表面5031。除了触敏表面5031,输入单元503还可以包括其他输入设备5032。具体地,其他输入设备5032可以包括但不限于物理键盘、功能键(比如音量控制控件、开关控件等)、轨迹球、鼠标、操作杆等中的一种或多种。上述输入单元503用于接收用户对该输入单元503的操作所触发的信号并基于信号传送给相应控制器,例如,本申请实施例中用户能够通过对输入单元503上进行触摸操作来进行媒体资源的播放选择以及交互控件的操作。
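The display unit 504 can be used to display information entered by the user or provided to the user, as well as the various graphical user interfaces of the terminal 500, which may be composed of graphics, text, icons, video, and any combination thereof. The display unit 504 may include a display panel 5041, which may optionally be configured as a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, or the like. Further, the touch-sensitive surface 5031 may cover the display panel 5041. When the touch-sensitive surface 5031 detects a touch operation on or near it, it passes the operation to the processor 507 to determine the type of the touch event, and the processor 507 then provides the corresponding visual output on the display panel 5041 according to that type. For example, the display unit 504 can display at least one of the playback pictures of the second media resource and the first media resource. Although in FIG. 5 the touch-sensitive surface 5031 and the display panel 5041 are two separate components that implement the input and output functions, in some embodiments the touch-sensitive surface 5031 and the display panel 5041 may be integrated to implement the input and output functions.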
音频电路505、扬声器5051,传声器5052可提供用户与终端500之间的音频接口。音频电路505可将接收到的音频数据转换后的电信号,传输到扬声器5051,由扬声器5051转换为声音信号输出;另一方面,传声器5052将收集的声音信号转换为电信号,由音频电路505接收后转换为音频数据,再将音频数据输出处理器507处理后,经RF电路501以发送给比如另一终端,或者将音频数据输出至存储器502以便进一步处理。音频电路505还可能包括耳塞插孔,以提供外设耳机与终端500的通信。在本申请实施例中,音频电路505、扬声器5051能够实现终端侧的音频播放相关过程,例如,在对任一媒体资源进行播放时,终端通过音频电路505、扬声器5051来实现音频数据的处理和外放。
WiFi属于短距离无线传输技术,终端500通过WiFi模块506可以帮助用户收发电子邮件、浏览网页和访问流式媒体等,它为用户提供了无线的宽带互联网访问。虽然图5示出了WiFi模块506,但是可以理解的是,其并不属于终端500的必须构成,完全可以根据需要在不改变发明的本质的范围内而省略。
处理器507是终端500的控制中心,利用各种接口和线路连接整个手机的各个部分,通过运行或执行存储在存储器502内的软件程序和/或模块,以及调用存储在存储器502内的数据,执行终端500的各种功能和处理数据,从而对手机进行整体监控。可选的,处理器507可包括一个或多个处理核心;可选的,处理器507可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器507中。
终端500还包括给各个部件供电的电源508(比如电池),可选的,电源可以通过电源管理系统与处理器507逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。电源508还可以包括一个或一个以上的直流或交流电源、再充电系统、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。尽管未示出,终端500还可以包括摄像头、蓝牙模块等,在此不再赘述。
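FIG. 6 is a schematic structural diagram of a server provided by an embodiment of this application. The server 600 may vary considerably with configuration or performance, and may include one or more processors 601 and one or more memories 602. The memory 602 stores at least one piece of program code, which is loaded and executed by the processor 601 to implement the media resource playing method performed by the second media server in each of the foregoing method embodiments. The server 600 may also have components such as a wired or wireless network interface, a keyboard, and an input/output interface for input and output, and may include other components for implementing device functions, which are not described here. The processor may be any of a central processing unit (CPU), a graphics processing unit (GPU), a tensor processing unit (TPU), a neural network processing unit (NPU), a brain processing unit (BPU), a deep learning processing unit (DPU), a holographic processing unit (HPU), a vector processing unit (VPU), an intelligence processing unit (IPU), and the like.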
处理器601可以采用通用的CPU、微处理器、应用专用集成电路(application specific integrated circuit,ASIC),GPU或者一个或多个集成电路,用于执行相关程序,以实现上述的媒体资源播放方法。
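The processor 601 may also be an integrated circuit chip with signal processing capability. In implementation, each step of the media resource playing method of this application can be completed by an integrated logic circuit of hardware in the processor 601 or by program code in the form of software. The processor 601 may also be a general-purpose processor, a digital signal processor (DSP), an ASIC, a field programmable gate array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component, and can implement or execute the methods, steps, and logical block diagrams disclosed in the embodiments of this application. The general-purpose processor may be a microprocessor or any conventional processor. The steps of the methods disclosed in the embodiments of this application may be performed directly by a hardware decoding processor, or by a combination of hardware and software modules in a decoding processor. The software modules may be located in a storage medium that is mature in the art, such as random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory, electrically erasable programmable memory, or a register. The storage medium is located in the memory 602. The processor 601 reads the information in the memory 602 and, in combination with its hardware, completes the functions to be performed by the modules included in the second media server of the embodiments of this application, or performs the media resource playing method on the second media server side in the method embodiments of this application.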
需要说明的是,本申请实施例中涉及的各种请求的名称并不限制请求的功能,且该各种请求的命名可以基于协议等改变。
上述实施例中对系统架构以及所涉及到的硬件等结构进行了介绍,本申请实施例的方法应用在图1或图3所示的系统架构中,当然也可以应用在其他通信场景中,本申请实施例在此不作限定。下面将结合具体实施例阐述本申请的方案。图7是本申请实施例提供的一种媒体资源播放方法的流程图,该流程为若本次呼叫过程中存在CT域的媒体资源,则播放CT域的媒体资源,并在CT域的媒体资源的播放画面上显示IT域的交互数据,其中,对于交互控件的显示数据和交互数据的获取采用分批获取的方式进行,该图7包括以下步骤。
701、主叫终端向被叫终端发送呼叫请求,该呼叫请求经过第一媒体服务器。
其中,呼叫请求为视频呼叫请求或音频呼叫请求。第一媒体服务器为位于通信技术CT域的服务器。
在一种可能的实现方式中,主叫终端向被叫终端发起呼叫时,通过主叫终端的电话拨号应用向被叫终端发送呼叫请求。应理解地,该呼叫请求通过CT域的网络设备透传至被叫终端,该呼叫请求会经过CT域的第一媒体服务器。
702、主叫终端通过目标应用客户端,向第二媒体服务器发送显示数据获取请求,该显示数据获取请求携带主叫终端参与的此次呼叫的相关信息。
其中,目标应用客户端为具有在振铃阶段播放媒体资源功能的应用客户端,如视频彩铃 应用客户端,该目标应用客户端内置有媒体播放器。第二媒体服务器为位于互联网技术IT域的服务器。显示数据获取请求用于指示获取交互控件的显示数据。可选地,呼叫的相关信息包括主叫的号码、被叫的号码、主叫的位置信息、呼叫类型、和/或呼叫时间等。需要说明的是,本实施例仅采用显示数据获取请求,来代表IT域中用于获取显示数据的请求,本申请实施例对显示数据获取请求的名称不作限定。交互控件包括主页访问控件(也称为内容黄页控件)、点赞控件、评论控件、分享控件和下载控件中至少一项。交互控件的显示数据也即是交互控件的渲染数据。例如,图8是本申请实施例提供的一种媒体资源的播放示意图,图8的播放页面上显示有内容黄页、点赞、评论、分享、下载、更多内容的多个交互控件。应理解地,步骤702还未开始播放媒体资源,在此仅以图8为例来解释多个交互控件的渲染。
在一种可能的实现方式中,主叫终端向被叫终端发送呼叫请求后,则拉起目标应用客户端(也可以叫做打开目标应用客户端),由主叫终端的电话拨号应用向目标应用客户端发送此次呼叫的相关信息,目标应用客户端接收到此次呼叫的相关信息后,将携带有该相关信息的显示数据获取请求通过IT域发送至第二媒体服务器。其中,该显示数据获取请求采用HTTP协议发送或HTTPS协议发送。
703、第二媒体服务器接收到主叫终端通过目标应用客户端发送的显示数据获取请求。
在一种可能的实现方式中,第二媒体服务器接收到主叫终端通过目标应用客户端发送的显示数据获取请求后,从该显示数据获取请求的对应字段中,读取此次呼叫的相关信息。
704、第二媒体服务器基于该呼叫的相关信息,对用户进行合法性鉴权和功能服务鉴权,若鉴权通过,则执行步骤705。
本申请实施例中,鉴权包括对用户功能信息的鉴权(也称为有效性鉴权),可选地,该鉴权还包括对用户身份信息的鉴权。用户功能信息鉴权是指对用户是否开通了显示交互控件功能的验证。用户身份信息鉴权是指对用户的身份验证,例如对被叫的手机号码的验证,以确认该手机号码是否为合法手机号码。可选地,对用户进行鉴权为对主叫用户和被叫用户中至少一方进行鉴权。
在一种可能的实现方式中,以对主叫号码鉴权为例,相应的鉴权过程包括:第二媒体服务器获取该显示数据获取请求携带的信息后,根据该信息中的主叫号码,判断该主叫号码是否为合法号码,若该主叫号码为合法号码,则在订阅数据库中进行查询,若查询到该主叫号码已订阅显示交互控件功能,则该用户鉴权通过,并执行后续步骤705。若未查询到该主叫号码,则用户鉴权不通过。对被叫用户的鉴权过程与上述过程同理。
可选地,第二媒体服务器还记录此次呼叫的相关信息,可以作为后续运营的数据参考。
上述过程,通过对用户进行鉴权,来验证用户的合法性和有效性中至少一项,以确保后续媒体资源播放时交互控件的安全显示,提高了IT域的交互控件显示的安全性和可靠性。
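The two checks in step 704 can be sketched as follows. This is a minimal illustration, not the patented implementation: the number pattern, the `SubscriptionStore` class, and the function names are all assumptions introduced here, and a real deployment would query the subscription database rather than an in-memory set.

```python
import re
from dataclasses import dataclass, field
from typing import Set

# Hypothetical legality rule: optional "+" followed by 5 to 15 digits.
_NUMBER_PATTERN = re.compile(r"^\+?\d{5,15}$")


@dataclass
class SubscriptionStore:
    """In-memory stand-in for the subscription database (assumption)."""
    subscribed: Set[str] = field(default_factory=set)

    def has_overlay_feature(self, number: str) -> bool:
        # Function service check: has the number subscribed to the
        # "display interactive controls" feature?
        return number in self.subscribed


def is_legal_number(number: str) -> bool:
    """Legality check: the number must look like a valid phone number."""
    return bool(_NUMBER_PATTERN.match(number))


def authenticate(calling_number: str, store: SubscriptionStore) -> bool:
    """Pass only if the legality check and the feature check both succeed."""
    if not is_legal_number(calling_number):
        return False
    return store.has_overlay_feature(calling_number)
```

Only when `authenticate` returns `True` would the server go on to step 705; the same function could be applied to the called number.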
需要说明的是,在第二媒体服务器对用户鉴权通过后,与主叫终端之间建立会话连接,该会话连接是基于HTTP协议或HTTPS协议,以便在后续过程中,通过该会话连接与主叫终端之间进行资源的相关信息等的交互。
705、第二媒体服务器向主叫终端发送交互控件的显示数据。
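The second media server, or a database associated with it, stores the display data of the interactive controls. The display data is rendering data that indicates the display form of the interactive controls, and the terminal can render it to display the controls. For example, the default display data includes the default overlay layer styles of the homepage access control, like control, comment control, share control, and download control.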
在一种可能的实现方式中,第二媒体服务器响应于显示数据获取请求,从本服务器或者关联的数据库中,获取该交互控件的显示数据,并向终端发送该交互控件的显示数据。可选地,第二媒体服务器通过数据包的形式,能实现显示数据的快速传输,提高了主叫终端获取显示数据的效率。
在另一种可能的实现方式中,目标应用客户端还能够为用户提供多种叠加层样式,则用户通过目标应用客户端,能够选择其想要的叠加层样式,进而,第二媒体服务器根据主叫终端的设置信息,能够确定出用户提前设置的叠加层样式,进一步查询本服务器或订阅数据库,获取与叠加层样式对应的该交互控件的显示数据,并向终端发送该交互控件的显示数据,从而实现个性化的显示效果。
上述步骤702至705是在主叫终端发起呼叫后对交互控件的显示数据进行获取的过程,通过上述过程能够提前获取到交互控件的显示数据,从而使得在后续在播放第一媒体资源时,无需再从第二媒体服务器进行交互控件的显示数据的获取,降低了数据获取量,降低了显示的延时。
本申请实施例中,是以第二媒体服务器向主叫终端推送交互控件的显示数据为例进行说明。在一种可能实现方式中,主叫终端和第二媒体服务器通过多次交互以获取该交互控件的显示数据,例如,第二媒体服务器响应于显示数据获取请求,向主叫终端发送该交互控件的显示数据的地址信息。其中,地址信息是指显示数据的统一资源定位符(uniform resource locator,URL)地址,该URL地址也称为网页地址。主叫终端接收该交互控件的显示数据的地址信息,基于接收到的地址信息,从第二媒体服务器获取该交互控件的显示数据。可选地,在该过程中,第二媒体服务器在确定了交互控件的显示数据的地址信息后,预加载该交互控件的显示数据,通过预加载,能够优化后续交互控件显示的时延问题,提高了显示数据的获取效率,也就不会导致后续媒体资源播放时发生控件显示时间较长的问题。
可选地,第二媒体服务器预加载交互控件的显示数据的方法包括下述任一项:
一种可能的实现方式中,交互控件的显示数据存储于第二媒体服务器的硬盘中,则第二媒体服务器确定出交互控件的显示数据的地址信息后,从第二媒体服务器的硬盘中,预加载该交互控件的显示数据至内存或缓存中。
又一种可能的实现方式中,交互控件的显示数据存储于第二媒体服务器以外的服务器的硬盘或内存中,则第二媒体服务器确定出交互控件的显示数据的地址信息后,从其他服务器的硬盘或内存中,预加载该交互控件的显示数据至第二媒体服务器的内存或缓存中。
另一种可能的实现方式中,交互控件的显示数据存储于交互数据库中,则第二媒体服务器确定出交互控件的显示数据的地址信息后,从交互数据库中预加载该交互控件的显示数据至第二媒体服务器的内存或缓存中。
其中,该交互数据库可能是第二媒体服务器所关联的数据库,也可能是其他服务器所关联的数据库,本申请实施例对数据库不作限定。
应理解地,上述三种实现方式根据交互控件的显示数据的存储位置不同,对应不同的预加载过程。如果第二媒体服务器在主叫终端获取该交互控件的显示数据时再进行数据的加载,会造成一定时延,而通过上述预加载的方法,能够优化后续交互控件显示的时延问题,提高了显示数据的获取效率,也就不会导致后续媒体资源播放时发生控件显示时间较长的问题。
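The preloading idea above can be sketched as a small cache that is filled as soon as the address information is determined. This is a sketch under assumptions: the `PreloadCache` class, the storage kind names, and the loader callables are hypothetical and stand in for the disk, other-server, and interaction-database cases.

```python
from typing import Callable, Dict


class PreloadCache:
    """Keeps display data in memory, keyed by its address (URL)."""

    def __init__(self, loaders: Dict[str, Callable[[str], bytes]]) -> None:
        # Maps a storage kind ("disk", "remote", "database") to a
        # function that reads the data stored at a given address.
        self._loaders = loaders
        self._cache: Dict[str, bytes] = {}

    def preload(self, address: str, storage_kind: str) -> None:
        """Load the data into memory as soon as its address is known."""
        if address not in self._cache:
            self._cache[address] = self._loaders[storage_kind](address)

    def get(self, address: str, storage_kind: str) -> bytes:
        """Serve from memory if preloaded, otherwise load on demand."""
        if address not in self._cache:
            self.preload(address, storage_kind)
        return self._cache[address]

    def release(self, address: str) -> None:
        """Drop the preloaded data, for example when the session ends."""
        self._cache.pop(address, None)
```

With this shape, the slow read happens when the address is returned to the terminal, so the later fetch by the terminal is served from memory.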
706、主叫终端接收该交互控件的显示数据。
可选地,该交互控件的显示数据为数据包的形式,则主叫终端在接收到该数据包后,通过解析数据包,来获取交互控件的显示数据。
707、主叫终端接收第一媒体服务器发送的第一媒体协商消息,该第一媒体协商消息用于进行媒体协商。
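The first media negotiation message is used for media negotiation between the first media server and the calling terminal, and the first media server is located in the CT domain. Optionally, the first media negotiation message is an Update message that carries the media capability information of the first media server; that is, the first media negotiation message is a SIP message carrying the SDP information of the first media server. Specifically, the header field of the first media negotiation message carries early media information, for example P-Early-Media:SDP, and/or the SDP information of the first media negotiation message carries the media attribute a=content:g.3gpp.cat, to indicate that the message is used for early-media negotiation between the first media server and the calling terminal. If the calling terminal receives the first media negotiation message, it determines that a CT-domain media resource is to be played for this call and continues with the subsequent steps.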
708、主叫终端从第一媒体协商消息中,获取第一媒体资源的资源信息。
其中,第一媒体资源为CT域的媒体资源,例如视频彩铃、视频广告、视频客服等媒体资源。资源信息为用于指示第一媒体资源的任一种信息,例如,第一媒体资源的资源ID(identification,身份标识号码)。可选地,资源信息还用于指示第一媒体资源的运营商信息(运营商标识)。
一种可能的实现方式中,从该第一媒体服务器发送的第一媒体协商消息中,获取该第一媒体资源的资源信息。可选地,该第一媒体协商消息为Update消息,该资源信息位于该Update消息的会话描述协议SDP信息中。例如,图9是本申请实施例提供的一种Update消息携带资源信息的示意图,如图9所示,在Update消息中SDP信息中携带了资源信息。可选地,资源信息包括媒体资源的视频信息和音频信息,参见图9,在SDP信息的m=audio内容中,携带音频铃音ID,在SDP信息的m=video内容中,携带视频铃音ID。另外,在图9的m=audio内容和m=video内容中,还可以增加字段a=carrierinfo:xx,用于携带运营商ID,如采用01来表示运营商A,采用02来表示运营商B,采用03来表示运营商C。
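Reading the resource information out of the SDP body might look like the sketch below. The `a=carrierinfo:` field comes from the description above; the attribute name `a=ringid:` is purely an assumption made for this example, since the text says only that the ring ID is carried in the m=audio and m=video content without naming the field.

```python
from typing import Dict, Optional


def parse_resource_info(sdp: str) -> Dict[str, Dict[str, str]]:
    """Collect ring ID and carrier ID for each media section of an SDP body."""
    result: Dict[str, Dict[str, str]] = {}
    current: Optional[str] = None  # media section currently being read
    for raw in sdp.splitlines():
        line = raw.strip()
        if line.startswith("m="):
            # "m=audio 49170 RTP/AVP 0" -> "audio"
            current = line[2:].split(" ", 1)[0]
            result[current] = {}
        elif current and line.startswith("a=ringid:"):  # hypothetical attribute
            result[current]["ring_id"] = line[len("a=ringid:"):]
        elif current and line.startswith("a=carrierinfo:"):
            result[current]["carrier_id"] = line[len("a=carrierinfo:"):]
    return result
```

Attributes that appear before the first `m=` line are session-level and are ignored here, because the description places the resource information inside the media sections.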
709、主叫终端通过目标应用客户端,基于该第一媒体资源的资源信息,从第二媒体服务器接收该第一媒体资源的交互内容数据。
在本申请实施例中,交互内容数据用于显示基于第一媒体资源的交互内容。该交互内容数据包括主页访问数据、点赞数据、评论数据、分享数据和下载数据中至少一项。如图8所示,图8示出了点赞数据174.5w,评论数据5.9w,分享数据2.5w。应理解地,交互内容数据还包含各个交互控件的相关数据,例如,对于点赞控件,对应的相关数据为点赞数据,点赞数据包含点赞账号、点赞账号头像等,对于评论控件,对应的相关数据为评论数据,评论数据包含评论账号、评论内容等,对于下载控件,对应的相关数据为下载数据,下载数据包含下载服务链接。可选地,该交互内容数据为弹幕数据或者动画特效数据等,从而在主叫终端对第一媒体资源的播放画面上显示弹幕或者动画特效,补充了CT域的互动显示效果,提高了趣味性。该交互数据还包括其他媒体资源的链接,供用户进行选择。
在一种可能实现方式中,主叫终端通过目标应用客户端,向第二媒体服务器发送交互数据获取请求,该交互数据获取请求携带该第一媒体资源的资源信息。第二媒体服务器接收到主叫终端通过目标应用客户端发送的交互数据获取请求后,基于该资源信息,确定该第一媒 体资源的交互内容数据。例如,从该交互数据获取请求的对应字段中,读取第一媒体资源的资源信息,基于该资源信息查询到该第一媒体资源的交互内容数据。第二媒体服务器向该主叫终端返回该交互内容数据,从而主叫终端接收到交互内容数据。在另一种可能实现方式中,主叫终端与第二媒体服务器通过交互来获取交互内容数据的过程基于交互内容数据的地址信息进行,其过程参见上述对交互控件的显示数据的过程描述。
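The request and lookup described above can be sketched as a pair of functions. The JSON field names (`resourceId`, `carrierId`), the status codes, and the in-memory `INTERACTION_DB` are assumptions for illustration; the counter values reuse the figures shown in FIG. 8 (174.5w likes, 5.9w comments, 2.5w shares).

```python
import json
from typing import Dict, Optional

# Hypothetical interaction database keyed by resource ID.
INTERACTION_DB: Dict[str, Dict[str, int]] = {
    "V2002": {"likes": 1745000, "comments": 59000, "shares": 25000},
}


def build_request(resource_id: str, carrier_id: Optional[str] = None) -> str:
    """Client side: serialise the interaction data acquisition request."""
    body = {"resourceId": resource_id}
    if carrier_id is not None:
        body["carrierId"] = carrier_id
    return json.dumps(body, sort_keys=True)


def handle_request(raw_body: str) -> Dict[str, object]:
    """Server side: read the resource info and look up the content data."""
    resource_id = json.loads(raw_body)["resourceId"]
    data = INTERACTION_DB.get(resource_id)
    if data is None:
        return {"status": 404}
    return {"status": 200, "interaction": data}
```

In the address-based variant, `handle_request` would return a URL for the data instead of the data itself, and the terminal would fetch it in a second round trip.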
需要说明的是,本申请实施例以交互数据为交互内容数据为例,上述步骤701至步骤709是先得到交互控件的显示数据,再得到交互内容数据的过程,通过上述过程能够提前获取到交互控件的显示数据,再获取交互内容数据,从而使得在后续在播放第一媒体资源时,无需再从第二媒体服务器进行交互控件的显示数据的获取,降低了数据获取量,降低了显示的延时。
在另一种可能的实现方式中,交互数据包括交互内容数据和交互控件的显示数据,在步骤701后,执行步骤709,根据第一媒体资源的资源信息,从第二媒体服务器获取第一媒体资源的交互内容数据和交互控件的显示数据,能够一次性获取包括有交互内容数据和交互控件的显示数据的交互数据,无需执行步骤702至步骤706。也即是,该步骤702至709替换为下述过程:主叫终端确定该第一媒体服务器提供的第一媒体资源的资源信息(过程参见步骤707和708)。主叫终端通过目标应用客户端,基于该第一媒体资源的资源信息,从第二媒体服务器接收该第一媒体资源的交互内容数据和交互控件的显示数据。
710、主叫终端向第一媒体服务器发送第二媒体协商消息,该第二媒体协商消息用于进行媒体协商。
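Optionally, the second media negotiation message is a 200OK(UPDATE) message, which carries the media capability information of the calling terminal, that is, the result of the media negotiation between the calling terminal and the first media server.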
需要说明的是,该步骤709和步骤710的执行顺序不受当前序号限制,该两个步骤的执行顺序可以是同步执行709和710,还可以是先执行710以及后续步骤,再执行709,也即是,通过IT域来得到交互内容数据的流程和CT域进行媒体协商等流程不互相影响。
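It should be noted that steps 707 and 710 only briefly describe the media negotiation between the calling terminal and the first media server. The following exemplary process illustrates it: the first media server sends a first media negotiation message to the calling terminal, carrying the media capability information of the first media server, for example an UPDATE message or an 18x message; after receiving the first media negotiation message, the calling terminal determines the media negotiation result based on its own capabilities and sends a second media negotiation message to the first media server. The second media negotiation message carries the media negotiation result, which may also be called the media capability information of the calling terminal; for example, the second media negotiation message is a 200OK message.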
711、第一媒体服务器向主叫终端发送180振铃消息。
其中,振铃消息为基于SIP协议的180振铃消息,该振铃消息用于指示被叫终端已振铃。
712、第一媒体服务器向主叫终端发送第一媒体资源的媒体流。
第一媒体服务器基于媒体协商结果,向主叫终端发送第一媒体资源的媒体流(例如RTP流),该过程中,主叫终端不会主动获取第一媒体资源,而是由第一媒体服务器进行媒体流的推送。
713、主叫终端接收该第一媒体资源的媒体流。
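The embodiments of this application are described using the example in which the resource information is obtained from the first media negotiation message in step 708. In yet another possible implementation, the calling terminal obtains the resource information from the media stream: the calling terminal receives the media stream of the first media resource and reads the resource information carried by the first media resource from the header of the media stream transmitted by the first media server. Optionally, the resource information is located in the supplemental enhancement information (SEI) in the header of the media stream. For example, FIG. 10 is a schematic diagram of a media stream carrying resource information provided by an embodiment of this application. As shown in FIG. 10, the ring ID is carried in a custom SEI frame. Optionally, the custom SEI frame also carries operator information.
The above process provides two ways of carrying resource information. Carrying it in the first media negotiation message or in the media stream makes it easy for the calling terminal to obtain the resource information and ensures that the subsequent display of interaction data proceeds normally. In a given implementation, the resource information may be carried in either the first media negotiation message or the media stream, or in both. In the latter case, if reading the resource information from one of them fails, the calling terminal reads it from the other and still obtains the resource information, which makes acquisition more reliable.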
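The fallback between the two carriers can be sketched as trying each reader in turn. The function name and the convention that a failed read raises `ValueError` are assumptions made for this example; the readers themselves (one for the negotiation message, one for the media stream header) are left abstract.

```python
from typing import Callable, Iterable, Optional


def first_available(
    readers: Iterable[Callable[[], Optional[str]]]
) -> Optional[str]:
    """Try each reader in order and return the first resource ID obtained."""
    for read in readers:
        try:
            value = read()
        except ValueError:
            # Assumed convention: a reader raises ValueError on a failed read.
            continue
        if value:
            return value
    return None
```

A caller would pass the negotiation-message reader first and the media-stream reader second, so the stream is consulted only when the message did not yield an ID.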
714、主叫终端接收到180振铃消息,基于接收到的媒体流,播放第一媒体服务器发送的第一媒体资源,在第一媒体资源的播放画面上,基于接收到的交互控件的显示数据和交互内容数据,显示交互控件和对应的交互内容数据。
在一种可能的实现方式中,主叫终端在该第一媒体资源的播放画面上,叠加显示该交互控件和对应交互内容数据。例如,该目标应用客户端基于交互控件的显示数据和交互内容数据,在播放画面上,渲染交互图层,该交互图层中包括交互控件以及对应的交互内容数据,如显示点赞控件以及点赞数等信息。
本申请实施例提供的技术方案实现了对CT域媒体资源的补充,在存在CT域的第一媒体资源的情况下,通过IT域的第二媒体服务器获取第一媒体资源的交互数据,进而,主叫终端能够在播放第一媒体资源的同时,在界面上增加一个叠加层,用以显示该交互控件和相应的交互内容数据,例如互动按钮、弹幕以及动画效果等,实现了彩铃场景下的媒体交互,丰富了彩铃用户的体验,增加了视频彩铃业务的趣味性,且,随着5G的发展和商用,在IT网络资源不构成瓶颈、网络时延更短的背景下,让视频彩铃业务可以更灵活地拓展,例如视频叠层播放、视频界面互动、视频内容滚动切换等,增加视频彩铃业务的趣味性。
上述图7所示实施例是以播放CT域的媒体资源,并在播放画面上增加叠加层来显示交互相关的数据为例进行说明,而对于主叫终端来说,还能够同时对IT域的媒体资源和CT域的媒体资源进行播放,下面结合图11所示过程来对该媒体资源播放方法进行介绍,参见图11,该过程包括:
1101、主叫终端向被叫终端发送呼叫请求,该呼叫请求经过第一媒体服务器。
该步骤1101参考步骤701。
1102、主叫终端确定该第一媒体服务器提供的第一媒体资源的资源信息。
该步骤1102参见步骤708或者713。
1103、主叫终端通过目标应用客户端,从第二媒体服务器获取该第一媒体资源的交互内容数据和交互控件的显示数据。
该步骤1103的过程参见图7所示实施例中对该第一媒体资源的交互内容数据和交互控件的显示数据的获取过程。
1104、主叫终端通过目标应用客户端,从第二媒体服务器接收第二媒体资源。
其中,第二媒体服务器为IT域的服务器,第二媒体资源为IT域的媒体资源。在该实施例中,第二媒体资源为推送的视频广告、视频动画等,该实施例能够实现振铃阶段的IT域视频的播放,丰富了视频彩铃用户的观看体验。
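Optionally, the calling terminal sends a resource acquisition request through the target application client, and after receiving the request, the second media server sends the second media resource to the calling terminal, for example as an RTMP media stream. In this process, the second media server pushes the media stream itself, rather than the calling terminal fetching it based on address information. Optionally, the second media server can offer the user multiple second media resources, and the user can select the one to play through the target application client.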
可选地,第二媒体服务器向主叫终端发送预设的第二媒体资源,以实现广告推送等运营目的。
在该步骤1104中,主叫终端通过目标应用客户端向第二媒体服务器发送本次呼叫的相关信息,第二媒体服务器接收到该相关信息,对相关信息进行鉴权,例如,对相关信息中的主叫号码和被叫号码中至少一项进行鉴权,又例如,该鉴权包括如步骤704中的合法性鉴权和功能服务鉴权,在鉴权通过后,第二媒体服务器确定第二媒体资源(如与相关信息匹配的第二媒体资源)的地址信息,将该第二媒体资源的地址信息发送给主叫终端,主叫终端接收到该第二媒体资源的地址信息后,可选地,基于该第二媒体资源的地址信息从第二媒体服务器获取第二媒体资源,又或者,在步骤1105接收到振铃消息后,再获取第二媒体资源。可选地,第二媒体服务器在确定第二媒体资源的地址信息后,预加载第二媒体资源。可选地,第二媒体服务器在接收到主叫终端发送的会话结束消息后,释放该预加载的第二媒体资源,以节约内存空间。
可选地,主叫终端还通过目标应用客户端,从第二媒体服务器获取第二媒体资源的交互内容数据。例如,其获取过程包括:第二媒体服务器确定第二媒体资源后,确定第二媒体资源对应的交互内容数据,将该第二媒体资源对应的交互内容数据的地址信息发送给主叫终端,主叫终端接收到该第二媒体资源对应的交互内容数据的地址信息后,可选地,基于该第二媒体资源对应的交互内容数据的地址信息从第二媒体服务器获取第二媒体资源的交互内容数据,又或者,在步骤1105接收到振铃消息后,再获取第二媒体资源的交互内容数据。可选地,第二媒体服务器在确定第二媒体资源对应的交互内容数据的地址信息后,预加载第二媒体资源的交互内容数据。可选地,第二媒体服务器在接收到主叫终端发送的会话结束消息后,释放该预加载的第二媒体资源的交互内容数据,以节约内存空间。可选地,主叫终端还通过目标应用客户端,从第二媒体服务器获取第二媒体资源的交互控件的显示数据。该过程参考上述对第二媒体资源的交互内容数据的获取过程。需要说明的是,主叫终端可以不获取该交互控件的显示数据,而是基于已获取到的交互控件的显示数据进行显示。
1105、主叫终端在接收到被叫终端的振铃消息后,基于第一媒体服务器发送的第一媒体资源进行播放,在该第一媒体资源的播放画面上,显示交互控件和交互内容数据,以及对第二媒体资源进行播放。
该对第一媒体资源的获取和播放过程参见图7所示实施例。可选地,在播放第二媒体资源时,还显示交互控件和第二媒体资源的交互内容数据,显示方式参考第一媒体资源的交互控件和交互内容数据的显示方式。
可选地,上述播放第一媒体资源和播放第二媒体资源是主叫终端根据设置信息实现的。例如,主叫终端的设置信息指示播放IT域和CT域的媒体资源,则执行步骤1105,而若设置信息指示播放CT域的媒体资源,则可以在前述步骤中不执行相关获取步骤,通过这种方式,给用户带来了新的体验和更多的个性化选择,提升了用户对视频彩铃业务的体验和参与度。
可选地,主叫终端在播放第一媒体资源和第二媒体资源时,还可以采用下述播放模式:
一种可能的实现方式中,主叫终端全屏播放CT域媒体资源,悬浮窗播放IT域媒体资源,也即是以全屏模式播放该第一媒体资源,在该第一媒体资源的播放画面上的悬浮窗内,静音播放该第二媒体资源。例如,图12是本申请实施例提供的一种媒体资源的播放示意图,如图12所示,图12以对第一媒体资源进行全屏播放、对第二媒体资源进行悬浮窗播放为例对方案进行说明。另一种可能的实现方式中,主叫终端全屏播放IT域媒体资源,悬浮窗播放CT域媒体资源,也即是以全屏模式播放该第二媒体资源,在该第二媒体资源的播放画面上的悬浮窗内,静音播放该第一媒体资源。在该实现方式中,一个媒体资源采用全屏播放的形式,另一个媒体资源则采用悬浮窗播放的形式,能够清晰直观的显示两个媒体资源,并且通过全屏正常播放(有音)、悬浮窗静音播放的方式,避免了两段音频同时播放而导致的用户体验感差。
可选地,在两个媒体资源的播放一个采用悬浮窗模式、另一个采用全屏模式的场景下,主叫终端基于对悬浮窗的点击操作来进行相应处理的过程包括下述任一项:
一种可能的实现方式中,主叫终端检测到对悬浮窗的点击操作,则将该悬浮窗所播放的媒体资源切换为全屏模式播放,将另一媒体资源关闭。在该过程中,主叫用户在观看媒体资源的过程中,可以对其感兴趣的媒体资源对应的悬浮窗实施点击操作,采用全屏播放来对该媒体资源播放,并将另一个媒体资源关闭,可以保留主叫用户想要观看的媒体资源,能够给主叫用户带来更佳的观看体验。
又一种可能的实现方式中,若主叫终端检测到对悬浮窗的点击操作,则将该悬浮窗所播放的媒体资源切换为全屏模式播放,将另一媒体资源切换为悬浮窗模式进行播放。在该过程中,主叫用户对其感兴趣的媒体资源对应的悬浮窗实施点击操作后,主叫终端以全屏播放该媒体资源,并将另一个媒体资源切换为悬浮窗,使得用户还可以继续切换至另一个媒体资源进行观看。
另一种可能的实现方式中,主叫终端还能够以悬浮窗形式播放该第一媒体资源,且以悬浮窗形式播放该第二媒体资源。例如,图13是本申请实施例提供的一种媒体资源的播放示意图,参见图13,分别在不同悬浮窗内播放该第一媒体资源和该第二媒体资源。在这种实现方式中,分别在悬浮窗内播放两个媒体资源,同样能够清晰直观的显示两个媒体资源。可选地,该第一媒体资源和该第二媒体资源中至少一个媒体资源静音播放。在该过程中,由于存在至少一个媒体资源是静音播放的,则也避免了由于两段音频同时播放而造成的听觉混乱,提升了用户体验。可选地,第二媒体资源的播放是通过目标应用客户端(视频彩铃应用)的播放器来实现,第一媒体资源的播放是通过系统应用(电话拨号应用)的播放器来实现。
可选地,在两个媒体资源的播放均采用悬浮窗模式的场景下,主叫终端基于对悬浮窗的点击操作来进行相应处理的过程包括下述任一项:
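In one possible implementation, if both media resources are played in floating-window mode and the calling terminal detects a click on either floating window, it switches the media resource played in that floating window to full-screen playback and closes the other media resource. In this process, while watching, the calling user can click the floating window of the media resource of interest; that media resource is then played in full screen and the other is closed. This keeps the media resource the calling user wants to watch and gives the calling user a better viewing experience.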
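In yet another possible implementation, if the calling terminal detects a click on a floating window, it switches the media resource played in that floating window to full-screen playback and keeps the other media resource in floating-window mode. In this process, after the calling user clicks the floating window of the media resource of interest, the calling terminal plays that media resource in full screen and leaves the other media resource in its floating window, so that the user keeps an entry point for switching back to the other media resource, which provides a richer operating experience.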
应理解地,上述两种实现方式中,对悬浮窗的点击操作是指对悬浮窗内除关闭键外的任一位置的点击操作,如图12或图13所示,每个悬浮窗均设置有关闭键,用户通过对该关闭键的点击操作,能够关闭不想要继续播放的媒体资源,达到个性化选择播放的目的。也即是,若主叫终端检测到对悬浮窗的关闭键的点击操作,将悬浮窗关闭。
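The click handling for floating windows can be summarised as a small state model. This is a sketch only: the `Player` class, the mode names, and the `close_other` flag (which selects between the "close the other resource" and "keep the other resource in a floating window" variants) are assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import Dict

FULLSCREEN, FLOATING, CLOSED = "fullscreen", "floating", "closed"


@dataclass
class Player:
    modes: Dict[str, str]  # resource name -> current display mode

    def click_window(self, name: str, close_other: bool) -> None:
        """Click inside a floating window, anywhere except its close key."""
        if self.modes.get(name) != FLOATING:
            return
        for other, mode in self.modes.items():
            if other == name or mode == CLOSED:
                continue
            # The other resource is closed, or shown in a floating window.
            self.modes[other] = CLOSED if close_other else FLOATING
        self.modes[name] = FULLSCREEN

    def click_close(self, name: str) -> None:
        """Click the close key of a floating window."""
        if self.modes.get(name) == FLOATING:
            self.modes[name] = CLOSED

    def muted(self, name: str) -> bool:
        """A floating window is muted while another resource is full screen."""
        return self.modes[name] == FLOATING and FULLSCREEN in self.modes.values()
```

The `muted` helper covers only the full-screen plus floating-window case; when both resources are in floating windows, the description requires just that at least one of them be muted, which this sketch leaves to the caller.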
在另一种可能的实现方式中,在处于全屏播放模式的媒体资源的播放画面上,叠加显示该交互控件和资源对应的交互内容数据,在另一媒体资源对应的悬浮窗内,显示该另一媒体资源的播放画面。在该过程中,仅对全屏播放的媒体资源叠加显示对应的交互控件和交互数据,这样,能够保证播放界面的整洁性和直观性,便于用户的交互操作或终端控制。
本申请提供的技术方案,通过主叫终端与第一媒体服务器、第二媒体服务器之间的交互,能够在有CT域的媒体资源和IT域的媒体资源的情况下,同时播放CT域的媒体资源和IT域的媒体资源,给用户带来了新的体验和更多的选择性,提升了用户对视频彩铃业务的体验和参与度。
上述图7和图11对媒体资源的播放以及播放过程中对互动控件的显示进行了说明。另外,在播放媒体资源的过程中,还可以提供基于媒体资源的交互功能。图14是本申请实施例提供的一种基于媒体资源播放的交互方法的流程图,参见图14:
1401、若主叫终端检测到对任一交互控件的触发操作,向该第二媒体服务器发送该交互控件对应的交互请求,该交互请求用于实现基于该第一媒体资源的交互。
其中,交互请求携带交互对象和交互内容。交互对象是指交互控件对应的交互数据,而交互内容是指对该交互对象的更新内容,例如,交互对象为点赞数据,交互内容为对点赞数据加一。其中,该交互请求还携带第一媒体资源的资源信息,从而使得第二媒体服务器基于该资源信息对第一媒体资源对应的交互内容进行处理。
1402、第二媒体服务器接收到主叫终端发送的交互请求,基于该第一媒体资源进行处理。
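In a possible implementation, after receiving the interaction request, the second media server determines the interaction object of the first media resource according to the resource information (for example, the resource ID) carried in the request, and then processes it according to the interaction content carried in the request. For example, if the interaction object carried in the request is the like data and the interaction content is to increase the like count by one, the second media server adds one to the stored like data of the first media resource, and the like data displayed on the calling terminal increases by one.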
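Server-side handling of an interaction request could be sketched as follows. The request field names (`resourceId`, `object`, `delta`) and the `InteractionStore` class are hypothetical; the sketch shows only the "like count plus one" style of update described above.

```python
from typing import Any, Dict


class InteractionStore:
    """Hypothetical store of interaction counters per media resource."""

    def __init__(self) -> None:
        self._counters: Dict[str, Dict[str, int]] = {}

    def handle(self, request: Dict[str, Any]) -> int:
        """Apply one interaction request and return the updated value."""
        resource_id = str(request["resourceId"])  # which media resource
        target = str(request["object"])           # interaction object, e.g. "likes"
        delta = int(request["delta"])             # interaction content, e.g. +1
        counters = self._counters.setdefault(resource_id, {})
        counters[target] = counters.get(target, 0) + delta
        return counters[target]
```

The returned value is what the server would send back in the HTTP or HTTPS response so that the terminal can refresh the number shown next to the control.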
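It should be noted that, in the flow of FIG. 14, the data flow between the calling terminal and the second media server is an IT-domain signaling flow of the HTML flow type. The calling terminal sends an HTTP message or HTTPS message to the second media server, and the second media server returns an HTTP response or HTTPS response to the calling terminal.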
本申请实施例提供的技术方案,在振铃阶段播放媒体资源时,通过提供交互控件及交互数据,实现了彩铃场景下的媒体交互,增加了视频彩铃业务的趣味性,能够大大提升媒体资源播放的趣味性,交互的灵活性更高,交互的内容更加丰富。
在上述实施例中,对基于第一媒体资源的播放和交互功能进行了介绍,而在播放过程中,还会涉及到基于通话的播放暂停等过程,下面基于该播放暂停的处理进行介绍,参见图15,图15以播放第一媒体资源为例对方案进行说明:
1501、若主叫终端接收到该被叫终端的摘机消息,关闭该第一媒体资源的交互数据的显示。
其中,摘机消息用于表示被叫终端已摘机,则主叫用户和被叫用户开始通话。
在一种可能的实现方式中,若主叫终端接收到摘机消息,通过电话拨号应用通知目标应用客户端停止显示交互数据的叠加层,则目标应用客户端控制播放器停止显示。
1502、主叫终端停止播放该第一媒体资源,显示该第一媒体资源的停止画面。
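The stop picture is either a preset picture (for example, the entry picture of the target application client) or the media picture at the moment playback stopped (for example, a screenshot). In step 1502, the calling user can configure the calling terminal to keep the picture of the first media resource displayed during the call; that is, the setting information of the calling terminal determines that the display state of the media resource is kept during the call. This embodiment keeps the picture on screen during the call, which gives the user an entry point for further operations and makes media playback more flexible.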
例如,图16是本申请实施例提供的一种媒体资源的播放示意图,参见图16,在通话过程中保持第一媒体资源的显示状态时,该第一媒体资源以悬浮窗的形式显示在呼叫界面上。
可选地,主叫终端还支持通话结束后的显示状态设置,相应过程如下:
一种可能的实现方式中,若基于该主叫终端的设置信息,确定通话结束后保持第一媒体资源的显示状态,则通话结束时,在主叫终端的主界面上以悬浮窗形式显示该第一媒体资源的停止画面。图17是本申请实施例提供的一种媒体资源的播放示意图,参见图17,在通话结束后第一媒体资源以悬浮窗的形式显示在主叫终端的主界面上。
1503、若主叫终端检测到对该停止画面的继续播放操作,与第二媒体服务器进行交互,获取该第一媒体资源的资源信息对应的第三媒体资源,该第三媒体资源与该第一媒体资源匹配。
其中,第三媒体资源与第一媒体资源匹配,是指第三媒体资源是第一媒体资源的资源信息(如资源ID)对应的媒体资源。应理解地,第三媒体资源与第一媒体资源的媒体内容相同。
在一种可能实现方式中,主叫终端向该第二媒体服务器发送资源获取请求,该资源获取请求携带该第一媒体资源的资源信息。第二媒体服务器接收该主叫终端发送的资源获取请求,基于该资源信息,确定该资源信息对应的第三媒体资源,该第三媒体资源与该第一媒体资源匹配。第二媒体服务器向该主叫终端返回该第三媒体资源的地址信息,主叫终端接收到该地址信息后,基于地址信息,从第二媒体服务器获取该第三媒体资源。可选地,第二媒体服务器在确定了第三媒体资源的地址信息后,预加载第三媒体资源,以便提高数据传输效率。该预加载过程参考上述交互控件的显示数据的预加载过程。
在上述过程中,第二媒体服务器与主叫终端之间建立会话,从而基于该会话来进行交互。
1504、主叫终端播放该第三媒体资源。
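In a possible implementation, after receiving the third media resource sent by the second media server, the calling terminal plays the third media resource. Optionally, the third media resource is played starting from the playback progress at which playback stopped.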
需要说明的是,上述步骤1503至步骤1504,是基于对该停止画面的继续播放操作,而触发第三媒体资源的获取过程,若主叫用户想要继续观看该第一媒体资源,点击悬浮窗即可实现第一媒体资源的继续播放,操作方便且简单。
在另一种可选的实现方式中,第三媒体资源的获取过程可以在主叫终端接收到摘机消息后执行,也即是在用户通话的过程中,由第二媒体服务器返回第三媒体资源的地址信息,而在检测到继续播放操作后获取第三媒体资源。通过该过程,能够在用户接通电话的同时,查询第三媒体资源,为后续第三媒体资源的播放做好准备,提高了媒体资源的播放效率,也就不会导致后续媒体资源播放时发生播放时间较长的问题。进一步地,若检测到对该停止画面的关闭操作或熄屏,关闭该目标应用客户端,并向该第二媒体服务器发送会话结束消息,第二媒体服务器接收到该主叫终端的会话结束消息后,释放与主叫终端之间对于第三媒体资源的会话。
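The session handling around the third media resource can be sketched as a small table on the second media server. The `SessionManager` and `PlaybackSession` names, and keying sessions by a caller identifier, are assumptions for this example; the description says only that the session records playback information and is released on a session end message.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class PlaybackSession:
    resource_id: str
    position_seconds: float = 0.0


class SessionManager:
    """Hypothetical per-caller session table on the second media server."""

    def __init__(self) -> None:
        self._sessions: Dict[str, PlaybackSession] = {}

    def open(self, caller: str, resource_id: str) -> PlaybackSession:
        """Create the session once the third media resource is determined."""
        session = PlaybackSession(resource_id)
        self._sessions[caller] = session
        return session

    def record_position(self, caller: str, seconds: float) -> None:
        """Record playback information for an open session."""
        self._sessions[caller].position_seconds = seconds

    def end(self, caller: str) -> bool:
        """Handle a session end message; True if a session was released."""
        return self._sessions.pop(caller, None) is not None

    def get(self, caller: str) -> Optional[PlaybackSession]:
        return self._sessions.get(caller)
```

Releasing the entry in `end` is where any preloaded data tied to the session would also be dropped, which is the resource saving the description refers to.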
在另一种可能的实现方式中,该停止画面作为目标应用客户端的门户网站入口,若检测到对停止画面的点击操作,在打开的目标应用客户端中显示门户网站界面,也即是,拉起目标应用客户端,在所述目标应用客户端中显示目标应用客户端对应的门户网站界面。在上述实施例中,若主叫用户想要访问门户网站,则可以通过点击停止画面即可实现,操作方便且简单。可选地,若主叫用户想要观看其他媒体资源,通过在该门户网站界面上浏览其他媒体资源,并对想要观看的媒体资源实施点击操作,则主叫终端开始播放该点击操作对应的媒体资源。若主叫用户想要在目标应用客户端上进行相关业务设置,在该门户网站界面上点击相应的设置按键,同样能够实现相应功能。
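Optionally, the calling terminal plays the third media resource differently depending on when the stop picture is clicked. In one possible implementation, if the calling terminal detects a click on the stop picture during the call, it plays the third media resource muted. In another possible implementation, if the calling terminal detects a click on the stop picture after the call ends, it cancels the mute mode and plays the third media resource normally.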
上述过程中,主叫用户可以在通话过程中或者通话结束后的任一时刻,点击停止画面,能够通过从IT域获取具有相同媒体内容的媒体资源,提供了连续且完整的视听体验。进一步地,主叫终端能够根据主叫用户点击对应的时刻是通话过程中还是通话结束后,来进行不同方式的播放,在通话过程中静音播放,使得主叫用户能够清晰的听到通话内容,避免漏掉通话过程中重要的通话内容,在通话结束后,则可以正常播放,也即是非静音播放。
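The choice between muted and normal playback can be expressed as one small decision function. The `CallPhase` and `ResumePlan` types are assumptions for this sketch; resuming from the stop position reflects the optional behaviour described for step 1504.

```python
from dataclasses import dataclass
from enum import Enum


class CallPhase(Enum):
    IN_CALL = "in_call"
    ENDED = "ended"


@dataclass(frozen=True)
class ResumePlan:
    start_at_seconds: float  # where playback of the third resource starts
    muted: bool              # whether audio is suppressed


def plan_resume(phase: CallPhase, stopped_at_seconds: float) -> ResumePlan:
    """Resume from the stop position; mute only while the call is ongoing."""
    return ResumePlan(
        start_at_seconds=max(0.0, stopped_at_seconds),
        muted=(phase is CallPhase.IN_CALL),
    )
```

Keeping the decision in one place means the player does not need to know about call signaling; it only receives a start position and a mute flag.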
下面结合VoLTE网络中的信令交互对本次呼叫中播放CT域的媒体资源来实现振铃阶段的播放,并基于IT域的交互数据来进行交互界面显示的过程进行说明。图18是本申请实施例提供的一种媒体资源播放方法的流程图,参见图18:
1801、主叫终端发送INVITE消息,该INVITE消息携带主叫终端的SDP信息(如SDPA1信息)。
1802、主叫终端通过目标应用客户端,向第二媒体服务器发送显示数据获取请求,该显示数据获取请求用于指示获取交互控件的显示数据。
1803、第二媒体服务器接收到主叫终端通过目标应用客户端发送的显示数据获取请求。
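1804. The second media server performs legality authentication and function service authentication on the user based on the related information of the call; if the authentication passes, step 1805 is performed.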
需要说明的是,在进行鉴权时,第二媒体服务器基于该相关信息中的主叫号码等进行鉴权,其具体鉴权方法如上述实施例所述,在此不做赘述。
1805、第二媒体服务器向该主叫终端发送交互控件的显示数据。
该步骤1805参考步骤705。
1806、主叫终端接收交互控件的显示数据。
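It should be noted that the display data of the interactive controls can be obtained at any time after its address information is available and before the calling terminal starts playing the first media resource, so that the basic rendering content is in place. For example, it can be obtained before the interaction content data is obtained, or immediately after the calling terminal later receives the 180 Ringing message; the embodiments of this application do not limit this.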
1807、被叫终端接收该INVITE消息,发送对呼叫请求的183消息,该183消息携带被叫终端的SDP信息(如SDPB1信息)。
需要说明的是,该被叫终端接收INVITE消息等步骤与上述主叫终端在IT域的交互之间的发生时序不互相影响,也即是,对于主叫终端来说,在向被叫终端发送INVITE消息后,可以立刻进行交互控件的显示数据的获取流程,在其他可能实现方式中,该主叫终端在进行媒体资源播放之前的任一时机,均可执行上述交互控件的显示数据的获取流程。
1808、主叫终端接收该183消息,向被叫终端发送PRACK消息,该PRACK消息用于指示主叫终端已接收被叫终端发送的183消息。
1809、被叫终端接收PRACK消息,向主叫终端发送200OK(PRACK),该200OK(PRACK)用于指示被叫终端已接收主叫终端发送的PRACK消息。
1810、主叫终端接收该200OK(PRACK),向被叫终端发送UPDATE消息,该UPDATE消息携带的SDPA2信息指示主叫终端对于本次呼叫的资源预留成功。
1811、被叫终端接收UPDATE消息,向主叫终端发送200UPDATE消息,该200UPDATE消息携带的SDPB2信息指示被叫终端对于本次呼叫的资源预留成功。
By step 1811, the calling terminal and the called terminal have both successfully reserved resources for this call.
1812、被叫终端开始振铃,向第一媒体服务器发送该180振铃消息,该180振铃消息携带本次呼叫的相关信息。
1813、第一媒体服务器接收该180振铃消息,根据本次呼叫的相关信息,确定该相关信息对应的第一媒体资源。
需要说明的是,本实施例是以在接收到180振铃消息后进行第一媒体资源的确定为例进行说明,而在其他可能实现方式中,第一媒体服务器在接收到主叫终端的INVITE消息之后的任一时机均能够进行第一媒体资源的确定。
1814、第一媒体服务器向主叫终端发送UPDATE消息,该UPDATE消息携带第一媒体资源的SDP信息,该SDP信息用于进行媒体协商。
需要说明的是,该UPDATE消息为上述图7所示实施例中所涉及的第一媒体协商消息的一种示例,该UPDATE消息携带有第一媒体资源的资源信息,如资源ID,也即是该第一媒体资源的SDP信息的特定字段中携带资源信息。
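1815. The calling terminal receives the UPDATE message, obtains the resource information of the first media resource from it, and sends a 200OK(UPDATE) message to the first media server. The 200OK(UPDATE) message carries the media capability information of the calling terminal, that is, the result of the media negotiation between the calling terminal and the first media server (such as SDPA3 information).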
其中,该200OK(UPDATE)消息为上述图7所示实施例中所涉及的第二媒体协商消息的一种示例。从UPDATE消息中获取第一媒体资源的资源信息的过程参见步骤708。
1816、第一媒体服务器向主叫终端发送180振铃消息。
1817、第一媒体服务器向主叫终端发送第一媒体资源的媒体流。
1818、主叫终端接收该第一媒体资源的媒体流。
需要说明的是,上述步骤1814和1815中,是主叫终端基于第一媒体协商消息,也即是UPDATE消息来确定第一媒体资源的资源信息的示例,而在另一种可能实现方式中,主叫终端基于步骤1818所接收到的第一媒体资源的媒体流来确定第一媒体资源的资源信息。应理解地,在一次实施过程中,主叫终端可以选择其中一种方式来确定资源信息,也可以分别确定资源信息,以确保资源信息的获取。
1819、主叫终端通过目标应用客户端,基于该第一媒体资源的资源信息,从第二媒体服务器接收该第一媒体资源的交互内容数据。
应理解的是,主叫终端在CT域的信令交互或者媒体流传输以及IT域的交互不互相影响,在上述步骤中,仅以主叫终端接收到媒体流后,执行步骤1819的交互内容数据过程为例进行说明。而在一些可能实现方式中,该步骤1819可以在确定了第一媒体资源的资源信息后的任一时机进行,例如,该步骤1816至1818和步骤1819的执行顺序可以颠倒,还可以并行执行。
1820、主叫终端接收180振铃消息,基于接收到的媒体流,播放第一媒体服务器发送的第一媒体资源,在第一媒体资源的播放画面上,基于接收到的交互控件的显示数据以及交互内容数据,显示交互控件和对应的交互内容数据。
应理解的是,本申请实施例对180振铃消息和媒体流的接收顺序不做限定,在本实施例中,是以先接收到媒体流,再接收到180振铃消息为例进行说明,在一些可能实现方式中,主叫终端会先接收到180振铃消息,再接收到媒体流,或者同步接收到媒体流,而开始播放接收到的媒体流,需要说明的是,对于主叫终端来说,在接收到180振铃消息后开始播放接收到的媒体流,即能够达到资源播放和交互控件等显示的目的。
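Because the 180 Ringing message and the media stream may arrive in either order, one way to handle this is to buffer media until ringing has been seen. The `RingbackGate` class below is a hypothetical sketch of that idea, not a component named in this application.

```python
from typing import List


class RingbackGate:
    """Buffers media packets until the 180 Ringing message has been seen."""

    def __init__(self) -> None:
        self.ringing = False
        self._buffer: List[bytes] = []

    def on_media(self, packet: bytes) -> List[bytes]:
        """Return the packets that may be played now."""
        if self.ringing:
            return [packet]
        self._buffer.append(packet)
        return []

    def on_ringing(self) -> List[bytes]:
        """Mark ringing and release anything buffered so far."""
        self.ringing = True
        released, self._buffer = self._buffer, []
        return released
```

Whichever event arrives first, playback starts only once ringing is known, and no packet received earlier is lost.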
可选地,主叫终端接收第二媒体服务器推送第二媒体资源的媒体流,从而在显示第一媒体资源的播放画面中,显示第二媒体资源的播放画面,形成画中画的效果。可选地,第二媒体资源的播放画面中显示该第二媒体资源的交互控件和对应的交互内容数据。
1821、若主叫终端检测到对任一交互控件的触发操作,向该第二媒体服务器发送该交互控件对应的交互请求,该交互请求用于实现基于该第一媒体资源的交互。
1822、第二媒体服务器接收到主叫终端发送的交互请求,基于该第一媒体资源进行处理。
1823、被叫终端向主叫终端发送200OK(INVITE)消息,该200OK(INVITE)消息用于指示被叫终端已摘机。
1824、主叫终端停止播放该第一媒体资源,显示该第一媒体资源的停止画面。
1825、被叫终端向主叫终端发送Bye消息,该Bye消息用于指示被叫终端已挂机。
1826、主叫终端若检测到对停止画面的点击操作,在打开的目标应用客户端中显示门户网站界面。
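In the technical solution provided by the embodiments of this application, the interaction data of the first media resource is obtained through the second media server, so that the calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interaction data, such as interactive buttons, bullet comments, and animation effects. This enables media interaction in the ring back tone scenario, enriches the experience of ring back tone users, and makes the video ring back tone service more entertaining.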
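FIG. 19 is a schematic structural diagram of a media resource playback apparatus provided by an embodiment of this application. The apparatus is configured to perform the method performed by the calling terminal in the foregoing embodiments. Referring to FIG. 19, the media resource playback apparatus includes a call request sending module 1901, a determining module 1902, an acquisition request sending module 1903, a receiving module 1904, a playing module 1905, and a display module 1906, where:
the call request sending module 1901 is configured to send a call request to a called terminal, the call request passing through a first media server;
the determining module 1902 is configured to determine the resource information of a first media resource provided by the first media server;
the acquisition request sending module 1903 is configured to send, through a target application client, an interaction data acquisition request to a second media server, the interaction data acquisition request carrying the resource information of the first media resource;
the receiving module 1904 is configured to receive the interaction data of the first media resource returned by the second media server based on the resource information;
the playing module 1905 is configured to receive and play the first media resource sent by the first media server;
the display module 1906 is configured to display the interaction data of the first media resource on the playback screen of the first media resource.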
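In a possible implementation, the first media server is a server located in the communication technology (CT) domain, and the second media server is a server located in the internet technology (IT) domain.
In a possible implementation, the determining module 1902 includes any one of the following:
a first obtaining submodule, configured to perform the process in step 708 or step 1102 of obtaining the resource information from the first media negotiation message;
a second obtaining submodule, configured to perform the process in step 713 or step 1102 of obtaining the resource information from the header of the media stream.
In a possible implementation, the first media negotiation message is an Update message, and the resource information is located in the Session Description Protocol (SDP) information of the Update message.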
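As a rough illustration of reading resource information out of SDP, the sketch below scans an SDP body for an attribute line. The text does not name the SDP field that carries the resource information, so the attribute name `x-resource-id` is purely an assumption, as is the function name.

```python
from typing import Optional


def extract_resource_id(sdp: str, attribute: str = "x-resource-id") -> Optional[str]:
    """Return the value of a hypothetical 'a=<attribute>:<value>' SDP line.

    The attribute name is an assumption made for this sketch; the source only
    says the resource information sits in a specific field of the SDP.
    """
    prefix = f"a={attribute}:"
    for raw_line in sdp.splitlines():
        line = raw_line.strip()
        if line.startswith(prefix):
            value = line[len(prefix):].strip()
            return value or None
    return None


# Example SDP body as it might appear in an UPDATE message (illustrative only).
example_sdp = "\r\n".join([
    "v=0",
    "o=- 0 0 IN IP4 192.0.2.1",
    "s=-",
    "c=IN IP4 192.0.2.1",
    "t=0 0",
    "m=video 49170 RTP/AVP 96",
    "a=rtpmap:96 H264/90000",
    "a=x-resource-id:res-42",
])
resource_id = extract_resource_id(example_sdp)
```

A terminal that finds no such attribute could fall back to the media stream header, which is the other option the determining module describes.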
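In a possible implementation, the resource information is located in the supplemental enhancement information (SEI) in the header of the media stream.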
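The source does not say which codec or which SEI payload type carries the resource information. As one plausible reading, the sketch below assumes an H.264 stream and the `user_data_unregistered` SEI payload (payload type 5, a 16-byte UUID followed by free-form bytes), and parses a single SEI NAL unit that has already been separated from its start code. The function name and the idea of storing the resource ID as UTF-8 text after the UUID are assumptions.

```python
from typing import List, Tuple


def parse_sei_user_data(nal: bytes) -> List[Tuple[bytes, bytes]]:
    """Return (uuid, data) pairs from one H.264 SEI NAL unit (no start code)."""
    if not nal or (nal[0] & 0x1F) != 6:  # nal_unit_type 6 is SEI
        return []

    # Strip emulation prevention bytes: 00 00 03 becomes 00 00.
    rbsp = bytearray()
    zeros = 0
    for byte in nal[1:]:
        if zeros >= 2 and byte == 3:
            zeros = 0
            continue
        rbsp.append(byte)
        zeros = zeros + 1 if byte == 0 else 0

    results = []
    i = 0
    # The last byte is expected to hold the RBSP trailing bits.
    while i < len(rbsp) - 1:
        payload_type = 0
        while i < len(rbsp) and rbsp[i] == 0xFF:
            payload_type += 255
            i += 1
        if i >= len(rbsp):
            break
        payload_type += rbsp[i]
        i += 1

        payload_size = 0
        while i < len(rbsp) and rbsp[i] == 0xFF:
            payload_size += 255
            i += 1
        if i >= len(rbsp):
            break
        payload_size += rbsp[i]
        i += 1

        payload = bytes(rbsp[i:i + payload_size])
        i += payload_size
        if payload_type == 5 and len(payload) >= 16:
            results.append((payload[:16], payload[16:]))
    return results


# Build a tiny SEI NAL unit by hand: header, type 5, size, UUID, data, trailer.
demo_uuid = bytes(range(1, 17))
demo_data = b"res-42"
demo_nal = bytes([0x06, 5, 16 + len(demo_data)]) + demo_uuid + demo_data + b"\x80"
parsed = parse_sei_user_data(demo_nal)
```

A real client would normally let its decoder or demuxer expose SEI messages; hand parsing is shown here only to make the layout concrete.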
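In a possible implementation, the interaction data is interaction content data, and the apparatus further includes:
the acquisition request sending module 1903, further configured to perform step 702 or step 1802;
an address information receiving module 1904, configured to perform step 706 or step 1806.
In a possible implementation, the interaction data includes interaction content data and display data of interactive controls.
In a possible implementation, the apparatus further includes:
an interaction request sending module, configured to perform step 1401 or step 1821.
In a possible implementation, the apparatus further includes a closing module, configured to perform step 1501.
In a possible implementation, the playing module 1905 is further configured to perform step 1502 or step 1824.
In a possible implementation, the playing module 1905 is further configured to perform step 1504.
In a possible implementation, the display module 1906 is further configured to perform step 1826, or the process in step 1504 of displaying the portal website interface.
In a possible implementation, the closing module is further configured to perform the process in step 1504 of closing the target application client.
In a possible implementation, the apparatus further includes a session message sending module, configured to perform the process in step 1504 of sending a session end message.
In a possible implementation, the apparatus further includes:
an obtaining module, configured to perform step 1104;
the playing module 1905, configured to perform the process in step 1105 of playing the first media resource in full-screen mode;
the playing module 1905, further configured to perform the process in step 1105 of playing the second media resource in a floating window.
In a possible implementation, the playing module 1905 is further configured to perform the process in step 1105 of playing the second media resource in full screen and the first media resource in a floating window.
In a possible implementation, the playing module 1905 is further configured to perform the process in step 1105 of playing the first media resource in a floating window and the second media resource in a floating window.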
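The three playback arrangements listed for step 1105 can be summarised as a lookup table. The sketch below is a hypothetical illustration: the mode keys and type names are invented, and the `muted` flag is set only where the claims state it (the second media resource is played muted in the floating window when the first plays full screen); elsewhere it is left as `None` because the text does not say.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Optional


class Window(Enum):
    FULLSCREEN = "fullscreen"
    FLOATING = "floating"


@dataclass(frozen=True)
class Placement:
    window: Window
    muted: Optional[bool] = None  # None means the source does not specify


# Hypothetical mode names mapped to the placement of each media resource.
LAYOUTS: Dict[str, Dict[str, Placement]] = {
    "first_fullscreen": {
        "first": Placement(Window.FULLSCREEN),
        "second": Placement(Window.FLOATING, muted=True),
    },
    "second_fullscreen": {
        "first": Placement(Window.FLOATING),
        "second": Placement(Window.FULLSCREEN),
    },
    "both_floating": {
        "first": Placement(Window.FLOATING),
        "second": Placement(Window.FLOATING),
    },
}


def plan_layout(mode: str) -> Dict[str, Placement]:
    """Return where each resource is shown for a given (hypothetical) mode."""
    try:
        return LAYOUTS[mode]
    except KeyError:
        raise ValueError(f"unknown layout mode: {mode}") from None
```

For example, `plan_layout("first_fullscreen")["second"].window` evaluates to `Window.FLOATING`.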
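In a possible implementation, the interaction data includes at least one of home page access data, like data, comment data, share data, and download data.
In a possible implementation, the interactive controls include at least one of a home page access control, a like control, a comment control, a share control, and a download control.
It should be noted that, when the media resource playback apparatus provided in the above embodiment plays a media resource, the division into the functional modules described above is used only as an example. In practice, the functions may be assigned to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or some of the functions described above. In addition, the media resource playback apparatus provided in the above embodiment and the media resource playback method embodiments belong to the same concept; for the specific implementation process, see the method embodiments. Details are not repeated here.
The technical solution provided by the embodiments of this application supplements the media resources of the CT domain. Where a first media resource exists in the CT domain, the interaction data of the first media resource is obtained through the second media server in the IT domain, so that the calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interactive controls and the corresponding interaction content data, such as interactive buttons, bullet comments, and animation effects. This enables media interaction in the ring back tone scenario, enriches the experience of ring back tone users, and makes the video ring back tone service more engaging. Moreover, with the development and commercial deployment of 5G, in a context where IT network resources are not a bottleneck and network latency is shorter, the video ring back tone service can be extended more flexibly, for example with layered video playback, interaction on the video interface, and scrolling between video contents.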
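FIG. 20 is a schematic structural diagram of a media server provided by an embodiment of this application. Referring to FIG. 20, the media server includes a receiving module 2001, a determining module 2002, and a returning module 2003, where:
the receiving module 2001 is configured to receive an interaction data acquisition request sent by a calling terminal through a target application client, the interaction data acquisition request carrying the resource information of a first media resource;
the determining module 2002 is configured to determine the interaction data of the first media resource based on the resource information;
the returning module 2003 is configured to return the interaction data to the calling terminal.
In a possible implementation, the determining module 2002 is configured to perform the process in step 709 or step 1819 of determining the interaction content data, and the returning module 2003 is configured to perform the process in step 709 or step 1819 of returning the interaction content data.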
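The receive, determine, and return responsibilities of modules 2001 to 2003 can be sketched as a single request handler. This is a minimal in-memory illustration, not the server described in the application: the class names, the `resource_id` request key, the response shape, and the stored fields (likes and comments) are all assumptions.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class InteractionContent:
    # Hypothetical subset of the interaction content data for one resource.
    likes: int = 0
    comments: List[str] = field(default_factory=list)


class SecondMediaServerSketch:
    """In-memory stand-in for the IT-domain media server."""

    def __init__(self) -> None:
        self._by_resource: Dict[str, InteractionContent] = {}

    def register(self, resource_id: str, content: InteractionContent) -> None:
        self._by_resource[resource_id] = content

    def handle_request(self, request: dict) -> dict:
        # Receiving module 2001: read the resource information from the request.
        resource_id = request.get("resource_id")
        if not resource_id:
            return {"ok": False, "error": "missing resource_id"}

        # Determining module 2002: look up the interaction data by resource.
        content = self._by_resource.get(resource_id)
        if content is None:
            return {"ok": False, "error": "unknown resource"}

        # Returning module 2003: send the interaction data back to the caller.
        return {
            "ok": True,
            "resource_id": resource_id,
            "likes": content.likes,
            "comments": list(content.comments),
        }


server = SecondMediaServerSketch()
server.register("res-42", InteractionContent(likes=3, comments=["nice tune"]))
reply = server.handle_request({"resource_id": "res-42"})
```

Keying the store by resource ID mirrors the point that the resource information is what links the CT-domain media stream to the IT-domain interaction data.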
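In a possible implementation, the interaction data is interaction content data, and the apparatus further includes:
the receiving module 2001, further configured to perform step 703 or step 1803;
the determining module 2002, further configured to perform the process in step 705 of determining the display data;
the returning module 2003, further configured to perform step 705 or step 1805.
In a possible implementation, the apparatus further includes a loading module, configured to perform the process in step 705 of preloading the display data.
In a possible implementation, the display data acquisition request further carries information about the current call in which the calling terminal is participating, and the apparatus further includes an authentication module, configured to perform step 704 or step 1804.
In a possible implementation, the apparatus further includes:
the receiving module 2001, further configured to perform the process in step 1503 of receiving a resource acquisition request;
the determining module 2002, further configured to perform the process in step 1503 of determining a third media resource;
the returning module 2003, further configured to perform the process in step 1503 of returning the third media resource.
In a possible implementation, the apparatus further includes an establishing module, configured to perform the process in step 1503 of establishing a session.
In a possible implementation, the apparatus further includes a releasing module, configured to perform the process in step 1504 of releasing the session.
It should be noted that, when the media server provided in the above embodiment takes part in media resource playback, the division into the functional modules described above is used only as an example. In practice, the functions may be assigned to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or some of the functions described above. In addition, the media server provided in the above embodiment and the method embodiments on the second media server side of the media resource playback method belong to the same concept; for the specific implementation process, see the method embodiments. Details are not repeated here.
The technical solution provided by the embodiments of this application supplements the media resources of the CT domain. Where a first media resource exists in the CT domain, the interaction data of the first media resource is obtained through the second media server in the IT domain, so that the calling terminal can, while playing the first media resource, add an overlay layer on the interface to display the interactive controls and the corresponding interaction content data, such as interactive buttons, bullet comments, and animation effects. This enables media interaction in the ring back tone scenario, enriches the experience of ring back tone users, and makes the video ring back tone service more engaging. Moreover, with the development and commercial deployment of 5G, in a context where IT network resources are not a bottleneck and network latency is shorter, the video ring back tone service can be extended more flexibly, for example with layered video playback, interaction on the video interface, and scrolling between video contents.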
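In an exemplary embodiment, a computer storage medium is further provided. The computer storage medium may be a computer-readable storage medium, for example a memory including program code, and the program code can be executed by a processor in a terminal to perform the media resource playback method on the calling terminal side in the above embodiments. For example, the computer-readable storage medium may be a ROM, a RAM, a compact disc read-only memory (CD-ROM), a magnetic tape, a floppy disk, an optical data storage device, or the like.
In an exemplary embodiment, a computer storage medium is further provided. The computer storage medium may be a computer-readable storage medium, for example a memory including program code, and the program code can be executed by a processor in a media server to perform the method on the second media server side in the above embodiments. For example, the computer-readable storage medium may be a ROM, a RAM, a compact disc read-only memory (CD-ROM), a magnetic tape, a floppy disk, an optical data storage device, or the like.
This application further provides a media resource playback system. The system includes a calling terminal, a first media server, and a second media server. In a possible implementation, the calling terminal, the first media server, and the second media server are respectively configured to perform the methods on the calling terminal side, the first media server side, and the second media server side of the media resource playback method provided in the embodiments shown in FIG. 7, FIG. 11, FIG. 14, FIG. 15, and FIG. 18 above.
A person of ordinary skill in the art can understand that all or some of the steps of the above embodiments may be implemented by hardware, or by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, and the storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.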
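It should be noted that a person of ordinary skill in the art will appreciate that the units, modules, chips, and method steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of each example have been described above in general terms according to function. Whether these functions are performed in hardware or software depends on the particular application and the design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each particular application, but such an implementation should not be considered beyond the scope of this application.
A person skilled in the art can clearly understand that, for convenience and brevity of description, for the specific working process of the system, apparatus, module, unit, or chip described above, reference may be made to the corresponding process in the media resource playback method embodiments. Details are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus, and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into modules within the apparatus, or into units within a module, is only a division by logical function; in an actual implementation there may be other ways of dividing them. For example, multiple modules or units may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through certain interfaces, apparatuses, or units, and may also be electrical, mechanical, or other forms of connection.
The modules or units described as separate components may or may not be physically separate, and the components shown as modules or units may or may not be physical modules or physical units; that is, they may be located in one place or distributed across multiple computer devices or chips. Some or all of the modules or units may be selected according to actual needs to achieve the purpose of the solutions of the embodiments of this application.
In addition, the functional modules or units in the embodiments of this application may be integrated into one target processing module, may each exist physically on their own, or two or more modules or units may be integrated into one target processing module. The integrated module or unit may be implemented in the form of hardware or in the form of a software functional unit.
The above are merely optional embodiments of this application and are not intended to limit this application. Any modification, equivalent replacement, improvement, or the like made within the spirit and principles of this application shall fall within the protection scope of this application.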

Claims (28)

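  1. A media resource playback method, characterized in that the method is applied to a calling terminal and comprises:
    sending a call request to a called terminal, the call request passing through a first media server;
    determining resource information of a first media resource provided by the first media server;
    sending, through a target application client, an interaction data acquisition request to a second media server, the interaction data acquisition request carrying the resource information of the first media resource;
    receiving interaction data of the first media resource returned by the second media server based on the resource information;
    receiving and playing the first media resource sent by the first media server;
    displaying the interaction data of the first media resource on a playback screen of the first media resource.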
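  2. The method according to claim 1, characterized in that the first media server is a server located in the communication technology (CT) domain, and the second media server is a server located in the internet technology (IT) domain.
  3. The method according to claim 1 or 2, characterized in that the determining of the resource information of the first media resource provided by the first media server comprises any one of the following:
    obtaining the resource information of the first media resource from a first media negotiation message sent by the first media server;
    obtaining the resource information carried by the first media resource from a header of a media stream of the first media resource transmitted by the first media server.
  4. The method according to claim 3, characterized in that the first media negotiation message is an Update message, and the resource information is located in Session Description Protocol (SDP) information of the Update message.
  5. The method according to claim 3, characterized in that the resource information is located in supplemental enhancement information (SEI) in the header of the media stream.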
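  6. The method according to any one of claims 1 to 5, characterized in that the interaction data is interaction content data, and before the sending, through the target application client, of the interaction data acquisition request to the second media server, the method further comprises:
    sending a display data acquisition request to the second media server, the display data acquisition request being used to indicate acquisition of display data of interactive controls;
    receiving address information returned by the second media server based on the display data acquisition request, the address information being used to provide the display data of the interactive controls, and obtaining the display data of the interactive controls from the second media server based on the address information.
  7. The method according to claim 6, characterized in that after the displaying of the interaction data of the first media resource on the playback screen of the first media resource, the method further comprises:
    if a trigger operation on any interactive control is detected, sending an interaction request corresponding to the interactive control to the second media server, the interaction request being used to implement interaction based on the first media resource.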
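  8. The method according to any one of claims 1 to 7, characterized in that after the displaying of the interaction data of the first media resource on the playback screen of the first media resource, the method further comprises:
    if an off-hook message of the called terminal is received, stopping playing the first media resource and displaying a stop screen of the first media resource.
  9. The method according to claim 8, characterized in that after the stopping of playing the first media resource and the displaying of the stop screen of the first media resource upon receiving the off-hook message of the called terminal, the method further comprises:
    if a continue-playing operation on the stop screen is detected, obtaining a third media resource from the second media server and playing the third media resource, the third media resource matching the first media resource.
  10. The method according to claim 8, characterized in that after the stopping of playing the first media resource and the displaying of the stop screen of the first media resource upon receiving the off-hook message of the called terminal, the method further comprises:
    if a trigger operation on the stop screen is detected, displaying a portal website interface in the opened target application client.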
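  11. The method according to any one of claims 1 to 10, characterized in that the method further comprises:
    obtaining, through the target application client, a second media resource from the second media server;
    the playing of the first media resource sent by the first media server comprises:
    playing the first media resource in full-screen mode;
    the method further comprises: playing the second media resource, muted, in a floating window on the playback screen of the first media resource.
  12. The method according to any one of claims 1 to 11, characterized in that the interaction data comprises at least one of home page access data, like data, comment data, share data, and download data.
  13. The method according to any one of claims 6 to 7, characterized in that the interactive controls comprise at least one of a home page access control, a like control, a comment control, a share control, and a download control.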
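  14. A media resource playback apparatus, characterized in that the apparatus is applied to a calling terminal and comprises:
    a call request sending module, configured to send a call request to a called terminal, the call request passing through a first media server;
    a determining module, configured to determine resource information of a first media resource provided by the first media server;
    an acquisition request sending module, configured to send, through a target application client, an interaction data acquisition request to a second media server, the interaction data acquisition request carrying the resource information of the first media resource;
    a receiving module, configured to receive interaction data of the first media resource returned by the second media server based on the resource information;
    a playing module, configured to receive and play the first media resource sent by the first media server;
    a display module, configured to display the interaction data of the first media resource on a playback screen of the first media resource.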
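  15. The apparatus according to claim 14, characterized in that the first media server is a server located in the communication technology (CT) domain, and the second media server is a server located in the internet technology (IT) domain.
  16. The apparatus according to claim 14 or 15, characterized in that the determining module comprises any one of the following:
    a first obtaining submodule, configured to obtain the resource information of the first media resource from a first media negotiation message sent by the first media server;
    a second obtaining submodule, configured to obtain the resource information carried by the first media resource from a header of a media stream of the first media resource transmitted by the first media server.
  17. The apparatus according to claim 16, characterized in that the first media negotiation message is an Update message, and the resource information is located in Session Description Protocol (SDP) information of the Update message.
  18. The apparatus according to claim 16, characterized in that the resource information is located in supplemental enhancement information (SEI) in the header of the media stream.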
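  19. The apparatus according to any one of claims 14 to 18, characterized in that the interaction data is interaction content data, and the apparatus further comprises:
    the acquisition request sending module, further configured to send a display data acquisition request to the second media server, the display data acquisition request being used to indicate acquisition of display data of interactive controls;
    an address information receiving module, configured to receive address information returned by the second media server based on the display data acquisition request, the address information being used to provide the display data of the interactive controls, and to obtain the display data of the interactive controls from the second media server based on the address information.
  20. The apparatus according to claim 19, characterized in that the apparatus further comprises:
    an interaction request sending module, configured to, if a trigger operation on any interactive control is detected, send an interaction request corresponding to the interactive control to the second media server, the interaction request being used to implement interaction based on the first media resource.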
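  21. The apparatus according to any one of claims 14 to 20, characterized in that the playing module is further configured to:
    if an off-hook message of the called terminal is received, stop playing the first media resource and display a stop screen of the first media resource.
  22. The apparatus according to claim 21, characterized in that the playing module is further configured to:
    if a continue-playing operation on the stop screen is detected, obtain a third media resource from the second media server and play the third media resource, the third media resource matching the first media resource.
  23. The apparatus according to claim 21, characterized in that the display module is further configured to:
    if a trigger operation on the stop screen is detected, display a portal website interface in the opened target application client.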
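  24. The apparatus according to any one of claims 14 to 23, characterized in that the apparatus further comprises:
    an obtaining module, configured to obtain, through the target application client, a second media resource from the second media server;
    the playing module, configured to play the first media resource in full-screen mode;
    the playing module, further configured to play the second media resource, muted, in a floating window on the playback screen of the first media resource.
  25. The apparatus according to any one of claims 14 to 24, characterized in that the interaction data comprises at least one of home page access data, like data, comment data, share data, and download data.
  26. The apparatus according to any one of claims 19 to 20, characterized in that the interactive controls comprise at least one of a home page access control, a like control, a comment control, a share control, and a download control.
  27. A terminal, characterized in that the terminal comprises a processor and a memory, the memory stores at least one piece of program code, and the program code is loaded and executed by the processor to implement the media resource playback method according to any one of claims 1 to 13.
  28. A computer storage medium, characterized in that the storage medium stores at least one piece of program code, and the program code is loaded and executed by a processor to implement the media resource playback method according to any one of claims 1 to 13.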
PCT/CN2021/113116 2020-08-31 2021-08-17 Media resource playback method and related apparatus WO2022042382A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP21860220.9A EP4192019A4 (en) 2020-08-31 2021-08-17 MEDIA RESOURCE PLAYBACK METHOD AND ASSOCIATED APPARATUS

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010901206.7A CN114125510A (zh) 2020-08-31 2020-08-31 Media resource playback method and related apparatus
CN202010901206.7 2020-08-31

Publications (1)

Publication Number Publication Date
WO2022042382A1 true WO2022042382A1 (zh) 2022-03-03

Family

ID=80354614

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/113116 WO2022042382A1 (zh) 2020-08-31 2021-08-17 Media resource playback method and related apparatus

Country Status (3)

Country Link
EP (1) EP4192019A4 (zh)
CN (1) CN114125510A (zh)
WO (1) WO2022042382A1 (zh)

Also Published As

Publication number Publication date
EP4192019A4 (en) 2023-12-06
CN114125510A (zh) 2022-03-01
EP4192019A1 (en) 2023-06-07

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21860220

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021860220

Country of ref document: EP

Effective date: 20230302

NENP Non-entry into the national phase

Ref country code: DE