CN112134999A - Method and device for processing video color ring and computer readable storage medium - Google Patents

Method and device for processing video color ring and computer readable storage medium Download PDF

Info

Publication number
CN112134999A
CN112134999A CN202010984425.6A CN202010984425A CN112134999A CN 112134999 A CN112134999 A CN 112134999A CN 202010984425 A CN202010984425 A CN 202010984425A CN 112134999 A CN112134999 A CN 112134999A
Authority
CN
China
Prior art keywords
video
color ring
file
calling terminal
playing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010984425.6A
Other languages
Chinese (zh)
Other versions
CN112134999B (en
Inventor
李琳
周效军
苏毅
吴耀华
李鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
MIGU Culture Technology Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
MIGU Culture Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, MIGU Culture Technology Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202010984425.6A priority Critical patent/CN112134999B/en
Publication of CN112134999A publication Critical patent/CN112134999A/en
Application granted granted Critical
Publication of CN112134999B publication Critical patent/CN112134999B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42017Customized ring-back tones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/12Messaging; Mailboxes; Announcements

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The invention discloses a processing method and equipment of a video color ring back tone and a computer readable storage medium, relates to the technical field of color ring back tones and aims to solve the problem that the playing effect of the video color ring back tone in a calling terminal cannot be ensured in the prior art. The method is applied to a color ring platform and comprises the steps of obtaining color ring files in a plurality of playing forms corresponding to video contents under the condition that the video contents selected or uploaded by a user are received; under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information; and according to the information of the calling terminal, determining a target color ring file matched with the information of the calling terminal from the color ring files in the plurality of playing forms, and playing the target color ring file to the calling terminal. The embodiment of the invention can ensure the playing effect of the video color ring on the calling terminal.

Description

Method and device for processing video color ring and computer readable storage medium
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a method and an apparatus for processing a video color ring, and a computer-readable storage medium.
Background
The video color ring back tone service is a service which is customized by a called user and provides a piece of pleasant, colorful or interesting video media for a calling user to replace a common network ring back tone. The user is called a Voice over Long-Term Evolution (VoLTE) user after the user opens a VoLTE function, the VoLTE user can set personalized video color ring by himself after applying for opening the video color ring function, and the system plays the video color ring media set by the personalization for the calling user in the ringing stage when the user is called. When the called user is in the non-idle state such as busy, out of service area, power off, etc., the notification provided by the original network system is still played.
At present, video color ring back tones set by a user are set around the own mobile phone experience, namely, the user is the video color ring back tone tested from the own mobile phone, and when a call is made, a calling terminal is unpredictable, so that the problem that the playing effect of the video color ring back tone in the calling terminal cannot be ensured exists.
Disclosure of Invention
The embodiment of the invention provides a method and equipment for processing a video color ring back tone and a computer readable storage medium, which are used for solving the problem that the playing effect of the video color ring back tone on a calling terminal cannot be ensured in the prior art.
In a first aspect, an embodiment of the present invention provides a method for processing a video color ring, which is applied to a color ring platform, and includes:
under the condition of receiving video content selected or uploaded by a user, obtaining polyphonic ringtone files of a plurality of playing forms corresponding to the video content;
under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information;
according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms;
and playing the target color ring back tone file to the calling terminal.
Optionally, the color ring back tone files in multiple playing forms include at least one of the following:
a horizontal screen format video file corresponding to the video content;
a vertical screen format video file corresponding to the video content;
a high-definition video file corresponding to the horizontal screen format video file;
a high-definition video file corresponding to the vertical screen format video file;
a high-definition video file corresponding to the video content;
a non-high-definition video file corresponding to the video content;
and the audio file corresponds to the video content.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through a central node, and an audio file and a video file corresponding to the video content are obtained;
storing the audio file at the distributed node.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through the central node, and an audio file and a video file corresponding to the video content are obtained;
in the distributed nodes, preprocessing the video file to obtain the constituent elements of the video content;
performing high-definition processing on the constituent elements to obtain high-definition elements of the video content;
and the central node generates a high-definition video file corresponding to the video content according to the high-definition elements of the video content.
Optionally, the high-cleaning treatment includes: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
Optionally, the color ring back tone files in the multiple playing forms include vertical screen video files corresponding to the video content;
under the condition that the video content is in a horizontal screen format, the step of obtaining the polyphonic ringtone files in the multiple playing forms corresponding to the video content comprises the following steps:
splitting audio and video of the video content to obtain a video file corresponding to the video content;
acquiring vertical screen center coordinates of each continuous frame of video image in the video file and a time interval of a target video clip of the video file;
and generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
Optionally, the step of obtaining a time interval of a target video segment of the video file includes:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file;
obtaining the time interval of a target video clip of the video file through a preset algorithm according to the time interval of the complete shot;
the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
Optionally, the step of obtaining the vertical screen center coordinates of each frame of continuous video images in the video file includes:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and
acquiring a user preference parameter;
acquiring tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
Optionally, the step of obtaining the tracking track coordinate corresponding to the video file according to the time interval of the complete shot and the user preference parameter includes:
and detecting a target person and/or a target object related to the user preference parameter in the video file to obtain the tracking track coordinates of the target person and/or the target object in the video file.
Optionally, the step of obtaining the tracking track coordinate corresponding to the video file according to the time interval of the complete shot and the user preference parameter includes:
and under the condition that the video file does not comprise the target person and/or the target object related to the user preference parameter, carrying out significance detection on the video file to obtain the tracking track coordinates of the image dynamic significant region in the video file.
Optionally, the step of determining, from the polyphonic ringtone files in the multiple play forms according to the information of the calling terminal, a target polyphonic ringtone file adapted to the information of the calling terminal includes:
determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not;
the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
Optionally, the step of determining, from the polyphonic ringtone files in the multiple play forms according to the information of the calling terminal, a target polyphonic ringtone file adapted to the information of the calling terminal includes any one of:
under the condition that the calling terminal supports vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the calling terminal does not support vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support video color ring back tone playing according to the equipment type information;
optionally, the step of determining, from the polyphonic ringtone files in the multiple play forms according to the information of the calling terminal, a target polyphonic ringtone file adapted to the information of the calling terminal includes any one of:
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the vertical screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the horizontal screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support the video color ring back tone playing condition according to the equipment type information;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition in the network signal information, determining the target color ring file as an audio file corresponding to the video content.
Optionally, the step of determining, from the polyphonic ringtone files in the multiple play forms according to the information of the calling terminal, a target polyphonic ringtone file adapted to the information of the calling terminal includes any one of:
under the condition that the current network signal of the calling terminal is determined to meet a high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a high-definition video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the current network signal of the calling terminal is determined not to meet the high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a non-high-definition video file corresponding to the video content and an audio file corresponding to the video content;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition according to the network signal information, determining the target color ring file as an audio file corresponding to the video content.
In a second aspect, an embodiment of the present invention further provides a device for processing a video color ring, where the device includes: a transceiver, a memory, a processor, and a computer program stored on the memory and executable on the processor; the processor is configured to read a program in the memory to implement the steps in the method for processing a video color ring.
In a third aspect, an embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps in the method for processing a video color ring are implemented.
In the embodiment of the invention, under the condition of receiving the video content selected or uploaded by a user, the polyphonic ringtone files of a plurality of playing forms corresponding to the video content are obtained; under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information; according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms; and playing the target color ring back tone file to the calling terminal. Therefore, according to the scheme of the embodiment of the invention, the target color ring back tone file matched with the calling terminal can be selected from the color ring back tone files in a plurality of playing forms according to the information of the calling terminal, so that the playing effect of the video color ring back tone set by the user in the calling terminal is ensured, and the user experience is favorably improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive exercise.
Fig. 1 is a schematic view of a cross-screen video color ring playing real scene provided by an embodiment of the present invention;
fig. 2 is a schematic view of a vertical-screen video color ring playing real scene provided by the embodiment of the present invention;
fig. 3 is a flowchart of a method for processing a video color ring according to an embodiment of the present invention;
fig. 4 is a second flowchart of a method for processing a video color ring according to an embodiment of the present invention;
fig. 5 is one of the flowcharts for generating a high-definition vertical-screen video file according to the embodiment of the present invention;
fig. 6 is a second flowchart of generating a high-definition vertical-screen video file according to an embodiment of the present invention;
fig. 7 is a flowchart of a process of playing a video color ring according to an embodiment of the present invention;
fig. 8 is a flowchart of processing and preparing a color ring to be played by a video color ring platform according to an embodiment of the present invention;
fig. 9 is a structural diagram of a processing apparatus for video color ring according to an embodiment of the present invention;
fig. 10 is a structural diagram of a processing device for video color ring according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to facilitate understanding of the embodiments of the present invention, a video color ring and a color ring are first described.
1) From the aspect of function scope, the function scope of the video color ring and the color ring is the inclusion relation. Specifically, the video color ring is the upgrade of the color ring, and can support the playing of videos and the like besides meeting all functions of the color ring.
2) From the perspective of technical support, the technical support of video color ring and color ring is different. In the process of paging a called end by a calling end, compared with a signaling flow of the color ring, at least two processes of media resource capability negotiation and resource reservation are added in the signaling flow of the video color ring.
3) From the aspect of terminal support, the playing of the video polyphonic ringtone requires the capability of the corresponding terminal for supporting the video playing, and the playing of the polyphonic ringtone only requires the capability of the corresponding terminal for supporting the audio playing.
4) From the perspective of core network bearing, the video color ring can allocate a video bearing with the bandwidth of 974Kbps and an audio bearing with the bandwidth of 146 Kbps; and the polyphonic ringtone is only allocated with an audio bearer with the bandwidth of 146 Kbps.
In summary, the video color ring can satisfy all the requirements of the color ring, but the color ring cannot satisfy all the requirements of the video color ring.
It should be noted that, after the user subscribes to the VoLTE video color ring service, the playing conditions of the audio and video color ring under different scenes may be specifically as shown in table 1 below:
Figure BDA0002688731490000081
TABLE 1 video CRBT Play scene
When a user dials a call through a mobile phone terminal, the video color ring is played, and the display mode can be divided into a horizontal screen and a vertical screen according to the terminal equipment, wherein the display mode is based on the equipment model of the mobile phone of the calling user.
As shown in fig. 1, it shows a cross-screen video color ring playing scene diagram. The transverse screen video color ring playing is to play a section of video in the middle of a screen of a mobile phone terminal, wherein the video format is H.264 format, and the resolution can be 720P or 480P. The outer blank of the video above the screen of the mobile phone terminal is a dialed mobile phone number, and the outer blank of the video below the screen of the mobile phone terminal is a dialed keyboard such as a contact, a mute, a dial and the like.
As shown in fig. 2, it shows a vertical screen video color ring playing real-scene schematic diagram, where the vertical screen video color ring playing is to play a segment of video on the full screen of the mobile phone terminal screen, the video format is h.264 format, and the resolution may be 720P or 480P. The dialed mobile phone number is above the video of the mobile phone terminal screen, and the keyboard such as a contact person, a mute, a dial and the like is dialed below the video of the mobile phone terminal screen.
At present, when a general user sets a video polyphonic ringtone, the general user sets the video polyphonic ringtone around the experience of the user on the mobile phone, namely, the user tests the video polyphonic ringtone from the mobile phone, so that the problem exists.
Based on the above problems, embodiments of the present invention provide a method, a device, and a computer-readable storage medium for processing a video color ring, so as to solve the problem in the prior art that the playing effect of the video color ring on a calling terminal cannot be guaranteed.
Specifically, referring to fig. 3, it shows a flowchart of a method for processing a video polyphonic ringtone according to an embodiment of the present invention. As in fig. 3, the method comprises the steps of:
step 11, under the condition of receiving video content selected or uploaded by a user, obtaining polyphonic ringtone files of a plurality of playing forms corresponding to the video content;
for example, the video content selected or uploaded by the user may be local video content uploaded by the user through an APP (Application program) in the mobile phone terminal, or a certain piece of video content in the APP selected by the user.
Wherein, the playing form includes: a horizontal screen format playing form, a vertical screen format playing form, a high definition format playing form, an audio format playing form and the like.
In the embodiment of the present invention, the polyphonic ringtone files in the multiple playing forms include at least one of the following:
a horizontal screen format video file corresponding to the video content;
a vertical screen format video file corresponding to the video content;
a high-definition video file corresponding to the horizontal screen format video file;
a high-definition video file corresponding to the vertical screen format video file;
a high-definition video file corresponding to the video content;
a non-high-definition video file corresponding to the video content;
and the audio file corresponds to the video content.
It should be noted that the common source of the color ring back tone files in multiple playing forms is the video content selected or uploaded by the user. The high-definition video file corresponding to the horizontal screen format video file, the high-definition video file corresponding to the vertical screen format video file, the high-definition video file corresponding to the video content and the non-high-definition video file corresponding to the video content are all video files which do not include an audio part (the audio part is split). That is, the audio file corresponding to the video content obtained in the present application has only one copy, and the video files in different playing forms include at least one of the above types.
The non-high-definition video file refers to a video file with a resolution lower than a certain threshold, such as an ordinary video file with a physical resolution below 720 p.
Therefore, the problems of storage resource waste, low efficiency, low utilization rate and high cost caused by storing a plurality of sets of homologous video color ring with different resolutions for adapting to various opposite terminals (called terminals) in the prior art are avoided, the problem of storing redundant audio files can be solved, and a large amount of storage hardware resources are saved.
Step 12, under the condition of receiving a call request of a calling terminal, obtaining information of the calling terminal carried in the call request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information;
in the embodiment of the invention, the equipment type information can reflect the information such as equipment type, screen parameters and the like; the network signal information is used for reflecting the signal quality of the network where the calling terminal is currently located.
Step 13, according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms;
and step 14, playing the target color ring back tone file to the calling terminal.
In the embodiment of the invention, under the condition of receiving the video content selected or uploaded by a user, the polyphonic ringtone files of a plurality of playing forms corresponding to the video content are obtained; further, under the condition of receiving a call request of a calling terminal, acquiring information of the calling terminal carried in the call request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information; the method can determine a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the information of the calling terminal; and playing the target color ring back tone file to the calling terminal. Therefore, the calling terminal can play the color ring back tone file matched with the information of the calling terminal, the playing effect of the video color ring back tone set by the user in the calling terminal is ensured, the situations of image playing distortion, playing pause and other playing quality reduction caused by the fact that the color ring back tone file is not matched with the calling terminal are avoided, and the user experience is favorably improved. Meanwhile, the problems of storage resource waste, low efficiency, low utilization rate and high cost caused by the fact that the existing video color ring is matched with various opposite terminal terminals to store a plurality of sets of homologous video color ring with different resolutions can be solved.
Step 11 described above will be described below.
Method for acquiring audio file corresponding to video content
In an embodiment of the invention, the color ring platform comprises a central node and distributed nodes; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents;
the step 11 includes:
audio and video splitting is carried out on the video content through a central node, and an audio file and a video file corresponding to the video content are obtained; storing the audio file at the distributed node.
In the embodiment of the invention, the distributed nodes are nodes arranged around the location of the user, and the central node is a color ring central platform. In the embodiment, the audio file and the video file corresponding to the video content are separated, and the audio file is stored in the distributed node and is called in the dialing process. Therefore, under the condition that the network signal of the calling terminal is poor or the calling terminal does not support video polyphonic ringtone playing, the audio polyphonic ringtone which is the same as the video content can be played through the distributed nodes, so that the time consumption of switching from the polyphonic ringtone system to the traditional standby polyphonic ringtone system is avoided on one hand, and the problem of low user experience caused by playing the traditional polyphonic ringtone which cannot be customized by a user is also avoided on the other hand; still on the one hand, can satisfy the real-time beautiful real my demand of showing of user, compare at present, can fall down to the mode of ordinary ring tone sometimes, the user experience that this application brought promotes highly.
Second, obtaining mode of non-high-definition video file corresponding to video content
In one embodiment, audio and video splitting is performed on the video content to obtain an audio file and a video file corresponding to the video content;
for example, the video content may be audio-video split by the central node.
Further, a non-high-definition video file corresponding to the video content may be generated according to the video file.
Third, obtaining mode of vertical screen video file corresponding to video content
In an embodiment of the invention, the color ring platform comprises a central node and distributed nodes; the polyphonic ringtone files in the plurality of playing forms comprise vertical screen video files corresponding to the video contents;
as shown in fig. 4, in the step 11, obtaining a vertical screen video file corresponding to the video content under the condition that the video content is in a horizontal screen format includes:
step 111, splitting the audio and video of the video content to obtain a video file corresponding to the video content;
step 112, acquiring the vertical screen center coordinates of each continuous frame of video image in the video file and the time interval of the target video clip of the video file;
and 113, generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
In this embodiment, processing such as frame extraction, clipping, synthesis, intelligent anti-shake and the like is performed according to the output vertical screen center coordinate and the time interval of the target video clip, so as to output a vertical screen video file corresponding to the final video content. Wherein, the target video clip may include: highlight video clips, or video clips that satisfy certain conditions.
In step 112, the step of obtaining the time interval of the target video segment of the video file includes:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; obtaining the time interval of a target video clip of the video file through a preset algorithm according to the time interval of the complete shot; the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
In this embodiment, a complete shot refers to a complete scene or a video clip of a complete episode. The target video file obtained based on the time interval of the complete shot can ensure that the content and plot of the target video file do not jump, and is beneficial to improving the user experience.
In step 112, the step of obtaining the vertical screen center coordinates of each frame of continuous video images in the video file includes:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and obtaining user preference parameters;
acquiring tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
In this embodiment, the user preference parameter is used to reflect the type of video content preferred by the user, such as game type content, cartoon reality type content, and the like. A complete shot refers to a complete scene or video clip of a complete episode. In the embodiment, by combining the time interval of the complete shot and the user preference parameters, the vertical screen center coordinates of each continuous frame of video image which is preferred by the user and in the complete video content interval can be obtained from the video content, the preference of the user is ensured, the video plot is ensured not to jump, and the user experience is improved.
Specifically, for implementation, the user preference parameter may be converted into identification parameter data for application.
Further, the step of obtaining the tracking track coordinate corresponding to the video file according to the time interval of the complete shot and the user preference parameter includes the following two ways:
the first method is as follows:
and detecting a target person and/or a target object related to the user preference parameter in the video file to obtain the tracking track coordinates of the target person and/or the target object in the video file.
The second method comprises the following steps:
and under the condition that the video file does not comprise the target person and/or the target object related to the user preference parameter, carrying out significance detection on the video file to obtain the tracking track coordinates of the image dynamic significant region in the video file.
Therefore, through the two modes in the embodiment, the tracking track coordinate corresponding to the video file can be obtained by combining the user preference parameters, a basis is provided for obtaining the vertical screen video which is favored and expected by the user, and the improvement of user experience is facilitated.
Further, after generating the vertical screen video file, the method further comprises:
and carrying out picture special effect processing on the generated vertical screen video file so as to improve the use experience of a user.
Specifically, the screen special effect processing includes: jitter, filters, illusion, impression pie, reverse play, slow play, etc.; and realizing the beautifying effect on specific objects and local pictures through target segmentation. Providing an image restoration technology to remove unnecessary objects or characters in the video; providing OCR technology to provide various illusive effects on the text in the video, such as subtitle auto-fill color, highlighting, etc.; and the generation network is utilized to realize special effects such as replacing human faces.
Based on the above embodiment, for the audio file portion, the method further comprises:
and carrying out special effect processing on the generated audio file so as to improve the use experience of a user.
Specifically, the audio special effect processing includes: and through an audio emotion recognition technology, the effect of playing the audios such as the bass and the treble is enhanced, and some DJ special effect sound effects are automatically added according to the recognition result. And simultaneously, background music with similar styles is provided for the user to replace and replace through music style identification.
Fourth, for the obtaining mode of the horizontal screen video file corresponding to the video content, it should be noted that the current video content is in a horizontal screen format in general by default, and therefore, the generation process of the horizontal screen video file in the present application is not specifically described, and reference may be made to the prior art specifically.
Fifthly, obtaining modes of high-definition video files corresponding to video contents and high-definition video files corresponding to horizontal screen video files and vertical screen video files
In one embodiment, the color ring platform comprises a central node and distributed nodes; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through the central node, and an audio file and a video file corresponding to the video content are obtained;
in the distributed nodes, preprocessing the video file to obtain the constituent elements of the video content;
performing high-definition processing on the constituent elements to obtain high-definition elements of the video content;
and the central node generates a high-definition video file corresponding to the video content according to the high-definition elements of the video content.
Specifically, the high-definition processing includes: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
In this embodiment, the central node splits the audio and video of the video content to obtain an audio file and a video file corresponding to the video content; further, preprocessing and high-definition processing are carried out on the video file at a distributed node to generate high-definition elements; and generating a high-definition video file corresponding to the video content by using the high-definition element at the central node. Therefore, the distributed nodes are used for asynchronously preprocessing the video file, high-definition video elements are intelligently and asynchronously generated, the processing efficiency of the central node for high-definition video can be improved, the problems of large transmission quantity, long consumed time and long waiting time of opposite ends of video color ring can be solved, and the user experience is greatly improved while the video quality is optimized.
In an embodiment, high-definition enhancement processing can be performed on non-high-definition video files, horizontal screen video files and vertical screen video files corresponding to video contents by using high-definition video elements (high-definition elements) generated intelligently and asynchronously by using distributed nodes, so that high-definition video files corresponding to the video contents and high-definition video files corresponding to the horizontal screen video files and the vertical screen video files are obtained. Therefore, the processing efficiency of the central node for performing high-definition on the video can be improved, the problems of large transmission quantity, long consumed time and long waiting time of the opposite end of the video color ring can be avoided, the video quality is optimized, and the user experience is greatly improved.
Further, the step of performing high-definition enhancement on the non-high-definition video file, the horizontal screen video file and the vertical screen video file corresponding to the video content further comprises:
and generating a corresponding high-definition video file by adopting a super-resolution technology and a mode of increasing the size of each frame of image.
For example, for a vertical screen video file, by using a super-resolution technology, resolution of the vertical screen video file corresponding to video content is improved, sizes of each frame of image in the vertical screen video file corresponding to the video content are respectively increased, meanwhile, the picture quality is guaranteed not to be reduced basically, and finally, a high-definition video file corresponding to the vertical screen video file is generated.
Step 13 is described below, wherein step 13 includes:
determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not; the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
In this embodiment, the playing form of the video color ring can be determined according to at least one of the device type and the network signal of the calling terminal, and when the calling terminal and the network signal have corresponding conditions, the target color ring file with better experience is preferentially played, for example, when the calling terminal supports vertical screen video playing and the current network signal of the calling terminal meets the high-definition playing condition, the high-definition vertical screen video file can be preferentially played. Therefore, the method has good adaptability and is beneficial to greatly improving the user experience.
Specifically, the step 13 may include the following three ways:
mode A: determining a target color ring back tone file matched with the calling terminal according to the device type information, wherein the method specifically includes any one of the following 3 conditions:
case 1:
under the condition that the calling terminal supports vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content;
case 2:
under the condition that the calling terminal does not support vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content;
case 3:
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support video color ring back tone playing according to the equipment type information;
for example, when the calling mobile phone terminal UEa requests for applying for playing a video color ring to a video color ring platform of a called domain where the called mobile phone terminal UEb is located, the device type information of the calling mobile phone terminal carried in the transmitted call request is analyzed by the nearest video color ring distributed node. And the video color ring distributed node processes and prepares an audio file to be played. Meanwhile, for a terminal supporting a vertical screen playing format, automatically playing a video file in the vertical screen format corresponding to the video content and an audio file corresponding to the video content; automatically playing a transverse screen format video file corresponding to the video content and an audio file corresponding to the video content for a terminal which does not support a vertical screen playing format; and automatically playing the audio file corresponding to the video content for the terminal which does not support the video color ring.
Determining whether the calling terminal supports vertical screen video playing according to the device type information may include: and determining whether the equipment type of the current calling terminal supports vertical screen video playing or not according to the preset corresponding relation between different equipment types and the horizontal screen format and the vertical screen format. It will be appreciated that different device types, corresponding to different screen parameters (e.g., screen resolution), may be matched to different playback formats (landscape/portrait).
In the embodiment, the target color ring back tone file adaptive to the playing form can be selected according to the equipment type of the calling terminal, when the calling terminal has corresponding conditions, the target color ring back tone file with better experience is preferentially played, and if the conditions are met, the vertical screen video file can be preferentially played. Therefore, the method has good adaptability and is beneficial to greatly improving the user experience.
Mode B: determining a target color ring back tone file matched with the calling terminal according to the device type information and the network signal information, wherein the method specifically includes any one of the following conditions in 6:
case 1:
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the vertical screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
case 2:
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
case 3:
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the horizontal screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
case 4:
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
case 5:
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support the video color ring back tone playing condition according to the equipment type information;
case 6:
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition in the network signal information, determining the target color ring file as an audio file corresponding to the video content.
For example, when the calling mobile phone terminal UEa requests the video color ring platform of the called domain where the called mobile phone terminal UEb is located to apply for video color ring playing, the video color ring distributed node processes and prepares the audio to be played. Meanwhile, the network signal condition of the calling terminal is judged according to the network signal information through the nearest video color ring distributed node, and the equipment type information of the calling mobile phone terminal in the transmitted call request is analyzed. And selecting a proper target color ring back tone file according to the equipment type and the network signal condition of the calling terminal.
Wherein, judge that the concrete judgement basis that present network signal satisfies high definition broadcast includes: the current network signal of the calling terminal is a 5G signal, or the current network signal of the calling terminal is a 4G signal, and the signal strength is between-75 dBm and-85 dBm.
The specific judgment basis for judging that the current network signal does not meet the video color ring playing condition comprises the following steps: the current network signal of the calling terminal is a 4G signal and is between-100 dBm and-110 dBm; or the current network signal of the calling terminal is an early signal such as 3G, 2G and the like.
Determining whether the calling terminal supports vertical screen video playing according to the device type information may include: and determining whether the equipment type of the current calling terminal supports vertical screen video playing or not according to the preset corresponding relation between different equipment types and the horizontal screen format and the vertical screen format. It will be appreciated that different device types, corresponding to different screen parameters (e.g., screen resolution), may be matched to different playback formats (landscape/portrait).
In the embodiment, the video color ring playing mode can be selected according to the equipment type and the network signal of the calling terminal, when the calling terminal has corresponding conditions, the target color ring file with better experience is preferentially played, and if the conditions are met, the high-definition vertical screen video file can be preferentially played. Therefore, the method has good adaptability and is beneficial to greatly improving the user experience.
Mode C: determining a target color ring back tone file matched with the calling terminal according to the network signal information, wherein the method specifically includes any one of the following conditions in 3:
case 1:
under the condition that the current network signal of the calling terminal is determined to meet a high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a high-definition video file corresponding to the video content and an audio file corresponding to the video content;
case 2:
under the condition that the current network signal of the calling terminal is determined not to meet the high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a non-high-definition video file corresponding to the video content and an audio file corresponding to the video content;
case 3:
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition according to the network signal information, determining the target color ring file as an audio file corresponding to the video content.
The specific judgment basis for judging that the current network signal does not meet the video color ring playing condition comprises the following steps: the current network signal of the calling terminal is a 4G signal and is between-100 dBm and-110 dBm; or the current network signal of the calling terminal is an early signal such as 3G, 2G and the like.
For example, when the calling mobile phone terminal UEa requests the video color ring platform of the called domain where the called mobile phone terminal UEb is located to apply for video color ring playing, the distributed node processes and prepares the audio to be played. And simultaneously, selecting a proper target color ring back tone file according to the current network signal condition of the terminal. Specifically, for a terminal supporting a high-definition playing condition, a high-definition video file corresponding to video content and an audio file corresponding to the video content are automatically played; automatically playing a non-high-definition video file corresponding to video content and an audio file corresponding to the video content for a terminal which does not support a high-definition playing format; and automatically playing the audio file corresponding to the video content for the terminal which does not meet the video color ring playing condition.
In the embodiment, the video color ring playing mode can be selected according to the current network signal of the calling terminal, when the calling terminal has corresponding conditions, the target color ring file with better experience is preferentially played, and if the conditions are met, the high-definition video file can be preferentially played. Therefore, the method has good adaptability and is beneficial to greatly improving the user experience.
The following describes a process of generating a high-definition vertical screen video file in conjunction with a specific embodiment.
As shown in fig. 5, the process of generating a high-definition vertical-screen video file mainly includes the following steps:
and in the process 1, a user uploads local video content through an APP in a mobile phone terminal, or selects a certain section of video content in the APP, and transmits user preferences.
And 2, the color ring back tone central platform splits the audio and video files uploaded by the user and transfers the audio files to the video color ring back tone node system of the user attribution for storage.
A flow 3, the color ring central platform transmits the video file to the distributed nodes to request the analysis and high-definition preprocessing of the video content;
and 4, preprocessing the video color ring file by the video color ring element main system in the color ring distributed node, and analyzing main constituent elements of the video.
The color ring distributed node is composed of a plurality of specialized functional units, namely a character high-definition processing unit, an action high-definition processing unit, a scene high-definition processing unit, emotion component high-definition processing and other units, and can be used for pixel superposition and fusion processing based on time dimension, space dimension and characteristic element edge.
And 5, marking the vertical screen content file after the editing by the video color ring central platform, and storing in a warehouse.
And 6, displaying the processed video color ring content effect to the user by the APP in the mobile phone terminal.
In the embodiment, the video and audio splitting is carried out on the video content selected or uploaded by the user by the common video color ring central platform, the audio file is stored on the nearby video color ring system node, and the dialing process is called; through the flow of high-definition element identification and processing performed by each distributed node of the video color ring, the intelligent asynchronous generation of high-definition vertical screen video elements is realized, the materials of the vertical screen video color ring can be enriched, the labor cost is saved, the user operation is simplified, the video quality is optimized, the processing efficiency of video content can be improved, the problems of large transmission quantity, long consumed time and long waiting time of an opposite terminal of the video color ring are solved, and the user experience is greatly improved.
Specifically, as shown in fig. 6, the detailed process of generating the high-definition vertical-screen video file may include the following steps:
step 1, the color ring central platform receives the video content uploaded by the user or a certain section of video content selected on the network and receives the favorite data of the user.
Step 2, preprocessing a video color ring file, verifying a content file, and converting user preference data into identification parameter data;
step 3, requesting video content processing and transmitting identification parameter data corresponding to the user preference data;
furthermore, the video color ring central platform calls algorithm atomic capability, intelligently generates a vertical screen video and provides video enhancement and special effect functions. Specifically, the method comprises the following steps:
and 4.1, judging whether the video is in a vertical screen standard format, and if so, directly jumping to the step 6 by taking the set content as the video color ring version of the current user normally.
And 4.2, carrying out shot switching detection on the video in the non-vertical screen standard format, and outputting a time interval of a complete shot.
Step 4.3, according to the lens time interval and the user preference parameters obtained in the step 4.2, identifying and detecting the characters, the objects and the like to determine target characters, objects and the like to be tracked, and respectively outputting the changed track coordinates of the target characters, the objects, the human faces/the human heads and the moving objects; and adopting saliency detection for the video without salient persons and objects, and outputting the track coordinates of the dynamic salient region of the image. And realizing preference tracking.
And 4.4, fitting the coordinates through a filtering algorithm, and finally outputting the center coordinates of each continuous frame of picture.
And 4.5, comprehensively analyzing and predicting the time interval of the highlight in the video according to shot information and output results of basic algorithm capabilities such as character recognition, action recognition, emotion recognition, scene recognition, video significance detection and the like.
And 4.6, performing frame extraction, cutting, synthesis, intelligent anti-shaking and other processing according to the output vertical screen center coordinates and the time axis of the wonderful segment, and outputting the final wonderful vertical screen short video.
And 4.7, performing video enhancement and special effect operation according to the requirement of the user, and jumping to the step 5 if the user does not need the operation. Step 4.7 specifically includes:
and 4.7.1, performing ultra-high definition enhancement on the video needing video enhancement, and improving the resolution of the video obtained in the step 4.6 by adopting a super-resolution technology, so that the size of each frame of image is increased, and the picture quality is ensured not to be reduced basically. And after the processing is finished, the vertical screen short video is synthesized again.
And 4.7.2, performing picture special effect processing on the video needing to be added with the special effect, and providing a specific playing effect for the whole video picture obtained in the step 5.6, such as: jitter, filters, illusion, impression, flip, slow play, etc. And the beautification effect is realized on specific objects and local pictures through target segmentation. An image restoration technique is provided to remove unwanted objects or text in the video. OCR techniques are provided to provide various illusive effects on text in a video, such as subtitle auto-fill color, highlighting, etc. And the generation network is utilized to realize special effects such as replacing human faces. Or may proceed directly to step 4.7.3 without action.
And 4.7.3, performing audio special effect processing on the audio needing to be added with special effects, enhancing the audio playing effects of bass, treble and the like through an audio emotion recognition technology, and automatically adding some DJ special effect sound effects according to the recognition result. And simultaneously, background music with similar styles is provided for the user to replace and replace through music style identification. Or the vertical screen short video can be synthesized again after the processing is finished without operation.
And 5, verifying the conformity of the file content, the scene and the favorite parameters and the file format.
And 6, marking the content file and warehousing.
And 7, displaying the content effect of the preprocessed video color ring back tone for the user.
And 8, confirming by the user. And the user confirms whether the high-definition vertical screen video color ring content synthesized by the machine meets the expectation. If so, the process is complete. If not, the user can click to continue the optimization, and the process flow starts the optimization iteration from step 1.
In the above embodiment, the video polyphonic ringtone central platform (central node) performs audio and video splitting on the video content selected or uploaded by the user, and the distributed node of the video polyphonic ringtone performs a process of identifying and processing high-definition elements of the split video file. The video color ring back tone file is preprocessed through a video color ring back tone element main system in the color ring back tone distributed node, main constituent elements of a video are analyzed, and high-definition vertical screen color ring generation can be automatically completed according to user preference and track tracking.
The following introduces the processing procedure of the color ring platform when the calling terminal dials the called terminal, with reference to the specific embodiment.
It should be noted that the process of successful application and ready response of the video color ring platform of the called domain where the called terminal UEb is located to the calling terminal UEa synchronous video color ring file is consistent with the existing process.
As shown in fig. 7, the process flow of playing video polyphonic ringtone in the process of making a call by a user mainly includes the following flows:
the method comprises the following steps that 1, a calling terminal UEa establishes a dialing request to a called terminal UEb, and network parameters of the calling terminal UEa are transmitted;
in the process 2, the called terminal UEb performs resource reservation response to the calling terminal UEa;
the process 3, the calling terminal UEa applies for playing the video color ring to the color ring system of the called domain, and transmits the model and screen parameters of the calling terminal UEa;
the process 4, the color ring back tone system prepares and processes the video color ring back tone to be played;
the process 5, the color ring system carries out the negotiation response of the video color ring file to the calling terminal UEa;
the process 6, the color ring system sends a video color ring playing instruction to the calling terminal UEa;
step 7, when the called party is off-hook, the called terminal UEb sends an instruction of stopping playing the video color ring to the calling terminal UEa;
and 8, the calling terminal UEa responds to the color ring back tone system of the called domain, and the color ring back tone system stops playing the color ring back tone.
Specifically, as shown in fig. 8, the process of processing and preparing the color ring back tone file to be played by the video color ring back tone central platform may include:
step a, the video color ring distributed node receives the equipment type (model, screen parameter, etc.) and network signal information of the calling terminal UEa.
And b, the color ring back tone central platform checks whether the current called number is set to be effective and the color ring back tone of the video to be played is a high-definition vertical screen video.
If the video color ring back tone file to be played is not a high-definition vertical screen format video and the opposite end has the condition of playing the high-definition video, performing step d 1;
and c, if the video color ring file to be played set by the user is a vertical screen format video, the color ring central platform checks whether the equipment type of the current calling terminal supports the vertical screen format video.
If the current calling terminal model does not support the vertical screen video, performing step d 2; if the current calling terminal model supports the vertical screen video, performing step d 3; if the current calling terminal model does not support the video color ring or whether the current network signal does not meet the video color ring playing is judged, the step d4 is carried out;
and d1, performing high-definition gain processing on the video by using the analysis data generated by the early-stage distributed nodes, and generating a high-definition video file corresponding to the video file in the vertical screen format. It should be noted that the high definition video file may also be generated and stored in advance when the user sets the video color ring.
Step d2, playing the horizontal screen version video of the current video content;
step d3, playing the vertical screen version video of the current video content;
and d4, sending an instruction to the video color ring distributed node, and enabling the home color ring distributed node to play the audio color ring of the current video content.
And e, transmitting the video file to the calling terminal for playing.
The embodiment of the invention also provides a processing device of the video color ring. Referring to fig. 9, fig. 9 is a structural diagram of a processing apparatus for video color ring according to an embodiment of the present invention. Because the principle of the video color ring processing device for solving the problems is similar to the method for processing the video color ring in the embodiment of the invention, the implementation of the video color ring processing device can refer to the implementation of the method, and repeated parts are not described again.
As shown in fig. 9, the apparatus 900 for processing video polyphonic ringtone includes:
a first processing module 901, configured to obtain multiple color ring back tone files in a playing form corresponding to video content when the video content selected or uploaded by a user is received;
an obtaining module 902, configured to obtain information of a calling terminal carried in a call request when the call request of the calling terminal is received, where the information of the calling terminal includes at least one of device type information and network signal information;
a determining module 903, configured to determine, according to the information of the calling terminal, a target polyphonic ringtone file that is adapted to the information of the calling terminal from the polyphonic ringtone files in the multiple playing forms;
a second processing module 904, configured to play the target color ring back tone file to the calling terminal.
Optionally, the color ring back tone files in multiple playing forms include at least one of the following:
a horizontal screen format video file corresponding to the video content;
a vertical screen format video file corresponding to the video content;
a high-definition video file corresponding to the horizontal screen format video file;
a high-definition video file corresponding to the vertical screen format video file;
a high-definition video file corresponding to the video content;
a non-high-definition video file corresponding to the video content;
and the audio file corresponds to the video content.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents;
the first processing module 901 includes:
the first processing submodule is used for carrying out audio and video splitting on the video content through a central node to obtain an audio file and a video file corresponding to the video content;
and the second processing submodule is used for storing the audio file in the distributed nodes.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents;
the first processing module 901 includes:
the third processing submodule is used for carrying out audio and video splitting on the video content through the central node to obtain an audio file and a video file corresponding to the video content;
the fourth processing submodule is used for preprocessing the video file in the distributed node to obtain the constituent elements of the video content;
the fifth processing submodule is used for carrying out high-definition processing on the composition elements to obtain high-definition elements of the video content;
and the sixth processing submodule is used for generating a high-definition video file corresponding to the video content by the central node according to the high-definition elements of the video content.
Optionally, the high-cleaning treatment includes: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
Optionally, the color ring back tone files in the multiple playing forms include vertical screen video files corresponding to the video content; in the case where the video content is in a landscape format, the first processing module 801 includes:
the seventh processing submodule is used for acquiring the vertical screen center coordinates of each continuous frame of video image in the video file and the time interval of a target video clip of the video file; and the eighth processing submodule is used for generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
Optionally, the seventh processing sub-module includes:
the first processing unit is used for carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file;
the second processing unit is used for obtaining the time interval of the target video clip of the video file through a preset algorithm according to the time interval of the complete shot; the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
Optionally, the seventh processing sub-module further includes:
the third processing unit is used for carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and
the fourth processing unit is used for acquiring the user preference parameters;
the fifth processing unit is used for obtaining tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and the sixth processing unit is used for fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
Optionally, the fifth processing unit is specifically configured to: and detecting a target person and/or a target object related to the user preference parameter in the video file to obtain the tracking track coordinates of the target person and/or the target object in the video file.
Optionally, the fifth processing unit is further specifically configured to: and under the condition that the video file does not comprise the target person and/or the target object related to the user preference parameter, carrying out significance detection on the video file to obtain the tracking track coordinates of the image dynamic significant region in the video file.
Optionally, the determining module 903 includes:
the first determining submodule is used for determining a target color ring file matched with the information of the calling terminal from the color ring files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not; the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
Optionally, the first determining sub-module includes:
a first determining unit, configured to determine that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content, when it is determined that the calling terminal supports vertical screen video playing according to the device type information;
a second determining unit, configured to determine that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that it is determined that the calling terminal does not support vertical screen video playing according to the device type information;
a third determining unit, configured to determine, when it is determined that the calling terminal does not support video polyphonic ringtone playing according to the device type information, that the target polyphonic ringtone file is an audio file corresponding to the video content;
optionally, the first determining sub-module includes:
a fourth determining unit, configured to determine that the calling terminal supports vertical screen video playing according to the device type information, and determine that the target color ring back tone file is a high definition video file corresponding to the vertical screen format video file and an audio file corresponding to the video content when determining that a current network signal of the calling terminal meets a high definition playing condition according to the network signal information;
a fifth determining unit, configured to determine that the calling terminal supports vertical screen video playing according to the device type information, and determine that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content when it is determined that the current network signal of the calling terminal does not satisfy a high-definition playing condition according to the network signal information;
a sixth determining unit, configured to determine that the target color ring back tone file is a high-definition video file corresponding to the horizontal-screen format video file and an audio file corresponding to the video content, when it is determined that the calling terminal does not support vertical-screen video playing according to the device type information, and it is determined that a current network signal of the calling terminal satisfies a high-definition playing condition according to the network signal information;
a seventh determining unit, configured to determine that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content when it is determined that the calling terminal does not support vertical screen video playing according to the device type information and it is determined that a current network signal of the calling terminal does not satisfy a high-definition playing condition according to the network signal information;
an eighth determining unit block, configured to determine, when it is determined according to the device type information that the calling terminal does not support a video polyphonic ringtone playing condition, that the target polyphonic ringtone file is an audio file corresponding to the video content;
a ninth determining unit, configured to determine, in the network signal information, that the current network signal of the calling terminal does not satisfy the video polyphonic ringtone playing condition, that the target polyphonic ringtone file is an audio file corresponding to the video content.
Optionally, the first determining sub-module includes:
a tenth determining unit, configured to determine, when it is determined that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information, that the target color ring back tone file is a high-definition video file corresponding to the video content and an audio file corresponding to the video content;
an eleventh determining unit, configured to determine, when it is determined according to the network signal information that the current network signal of the calling terminal does not meet a high-definition playing condition, that the target polyphonic ringtone file is a non-high-definition video file corresponding to the video content and an audio file corresponding to the video content;
and a twelfth determining unit, configured to determine, when it is determined that the current network signal of the calling terminal does not satisfy the video polyphonic ringtone playing condition according to the network signal information, that the target polyphonic ringtone file is an audio file corresponding to the video content.
The apparatus provided in the embodiment of the present invention may implement the method embodiments, and the implementation principle and the technical effect are similar, which are not described herein again.
As shown in fig. 10, the processing device for video polyphonic ringtone according to the embodiment of the present invention includes: a transceiver 1010, a processor 1000, a memory 1020 and a computer program stored on the memory 1020 and executable on the processor 1000, the processor 1000 implementing the steps of the video processing method described above when executing the computer program. Specifically, the processor 1000 is configured to read a program in the memory 1020 and execute the following processes:
under the condition of receiving video content selected or uploaded by a user, obtaining polyphonic ringtone files of a plurality of playing forms corresponding to the video content;
under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information;
according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms;
and playing the target color ring back tone file to the calling terminal.
A transceiver 1010 for receiving and transmitting data under the control of the processor 1000.
Wherein in fig. 7, the bus architecture may include any number of interconnected buses and bridges, with one or more processors, represented by processor 1000, and various circuits, represented by memory 1020, being linked together. The bus architecture may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. The bus interface provides an interface. The transceiver 1010 may be a number of elements including a transmitter and a transceiver providing a means for communicating with various other apparatus over a transmission medium. The processor 1000 is responsible for managing the bus architecture and general processing, and the memory 1020 may store data used by the processor 1000 in performing operations.
The processor 1000 is responsible for managing the bus architecture and general processing, and the memory 1020 may store data used by the processor 1000 in performing operations.
Optionally, the color ring back tone files in multiple playing forms include at least one of the following:
a horizontal screen format video file corresponding to the video content;
a vertical screen format video file corresponding to the video content;
a high-definition video file corresponding to the horizontal screen format video file;
a high-definition video file corresponding to the vertical screen format video file;
a high-definition video file corresponding to the video content;
a non-high-definition video file corresponding to the video content;
and the audio file corresponds to the video content.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents; the processor 1000 is further configured to read the computer program and execute the following steps:
audio and video splitting is carried out on the video content through a central node, and an audio file and a video file corresponding to the video content are obtained;
storing the audio file at the distributed node.
Optionally, the color ring platform includes a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents; the processor 1000 is further configured to read the computer program and execute the following steps:
audio and video splitting is carried out on the video content through the central node, and an audio file and a video file corresponding to the video content are obtained;
in the distributed nodes, preprocessing the video file to obtain the constituent elements of the video content;
performing high-definition processing on the constituent elements to obtain high-definition elements of the video content;
and the central node generates a high-definition video file corresponding to the video content according to the high-definition elements of the video content.
Optionally, the high-cleaning treatment includes: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
Optionally, the color ring back tone files in the multiple playing forms include vertical screen video files corresponding to the video content; in case the video content is in a landscape format, the processor 1000 is further configured to read the computer program, and perform the following steps:
splitting audio and video of the video content to obtain a video file corresponding to the video content;
acquiring vertical screen center coordinates of each continuous frame of video image in the video file and a time interval of a target video clip of the video file;
and generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
The processor 1000 is further configured to read the computer program and execute the following steps:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file;
obtaining the time interval of a target video clip of the video file through a preset algorithm according to the time interval of the complete shot;
the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
The processor 1000 is further configured to read the computer program and execute the following steps:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and
acquiring a user preference parameter;
acquiring tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
The processor 1000 is further configured to read the computer program and execute the following steps:
and detecting a target person and/or a target object related to the user preference parameter in the video file to obtain the tracking track coordinates of the target person and/or the target object in the video file.
The processor 1000 is further configured to read the computer program and execute the following steps:
and under the condition that the video file does not comprise the target person and/or the target object related to the user preference parameter, carrying out significance detection on the video file to obtain the tracking track coordinates of the image dynamic significant region in the video file.
The processor 1000 is further configured to read the computer program and execute the following steps:
determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not;
the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
The processor 1000 is further configured to read the computer program and execute the following steps:
under the condition that the calling terminal supports vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the calling terminal does not support vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support video color ring back tone playing according to the equipment type information;
the processor 1000 is further configured to read the computer program and execute the following steps:
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the vertical screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the horizontal screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support the video color ring back tone playing condition according to the equipment type information;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition in the network signal information, determining the target color ring file as an audio file corresponding to the video content.
The processor 1000 is further configured to read the computer program and execute the following steps:
under the condition that the current network signal of the calling terminal is determined to meet a high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a high-definition video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the current network signal of the calling terminal is determined not to meet the high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a non-high-definition video file corresponding to the video content and an audio file corresponding to the video content;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition according to the network signal information, determining the target color ring file as an audio file corresponding to the video content.
The device provided by the embodiment of the present invention may implement the above method embodiment, and the implementation principle and technical effect are similar, which are not described herein again.
Furthermore, a computer-readable storage medium of an embodiment of the present invention stores a computer program executable by a processor to implement:
under the condition of receiving video content selected or uploaded by a user, obtaining polyphonic ringtone files of a plurality of playing forms corresponding to the video content;
under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information;
according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms;
and playing the target color ring back tone file to the calling terminal.
Wherein the polyphonic ringtone files in the plurality of playing forms comprise at least one of the following items:
a horizontal screen format video file corresponding to the video content;
a vertical screen format video file corresponding to the video content;
a high-definition video file corresponding to the horizontal screen format video file;
a high-definition video file corresponding to the vertical screen format video file;
a high-definition video file corresponding to the video content;
a non-high-definition video file corresponding to the video content;
and the audio file corresponds to the video content.
The color ring platform comprises a central node and distributed nodes; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents; the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through a central node, and an audio file and a video file corresponding to the video content are obtained;
storing the audio file at the distributed node.
The color ring platform comprises a central node and distributed nodes; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through the central node, and an audio file and a video file corresponding to the video content are obtained;
in the distributed nodes, preprocessing the video file to obtain the constituent elements of the video content;
performing high-definition processing on the constituent elements to obtain high-definition elements of the video content;
and the central node generates a high-definition video file corresponding to the video content according to the high-definition elements of the video content.
Wherein the high-cleaning treatment comprises: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
The color ring back tone files in the plurality of playing forms comprise vertical screen video files corresponding to the video contents; under the condition that the video content is in a horizontal screen format, the step of obtaining the polyphonic ringtone files in the multiple playing forms corresponding to the video content comprises the following steps:
splitting audio and video of the video content to obtain a video file corresponding to the video content;
acquiring vertical screen center coordinates of each continuous frame of video image in the video file and a time interval of a target video clip of the video file;
and generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
Wherein the step of obtaining the time interval of the target video clip of the video file comprises:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file;
obtaining the time interval of a target video clip of the video file through a preset algorithm according to the time interval of the complete shot; the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
The step of obtaining the vertical screen center coordinates of each continuous frame of video image in the video file comprises the following steps:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and
acquiring a user preference parameter;
acquiring tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
The step of obtaining the track coordinate corresponding to the video file according to the time interval of the complete shot and the user preference parameter comprises the following steps:
and detecting a target person and/or a target object related to the user preference parameter in the video file to obtain the tracking track coordinates of the target person and/or the target object in the video file.
The step of obtaining the tracking track coordinate corresponding to the video file according to the time interval of the complete shot and the user preference parameter comprises the following steps:
and under the condition that the video file does not comprise the target person and/or the target object related to the user preference parameter, carrying out significance detection on the video file to obtain the tracking track coordinates of the image dynamic significant region in the video file.
The step of determining a target color ring back tone file adapted to the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the information of the calling terminal includes:
determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not;
the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
The step of determining a target polyphonic ringtone file adapted to the information of the calling terminal from the polyphonic ringtone files in the plurality of playing forms according to the information of the calling terminal includes any one of the following steps:
under the condition that the calling terminal supports vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the calling terminal does not support vertical screen video playing according to the equipment type information, determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support video color ring back tone playing according to the equipment type information;
the step of determining a target polyphonic ringtone file adapted to the information of the calling terminal from the polyphonic ringtone files in the plurality of playing forms according to the information of the calling terminal includes any one of the following steps:
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the vertical screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal supports vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a vertical screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a high-definition video file corresponding to the horizontal screen format video file and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal meets a high-definition playing condition according to the network signal information;
determining that the calling terminal does not support vertical screen video playing according to the equipment type information, and determining that the target color ring back tone file is a horizontal screen format video file corresponding to the video content and an audio file corresponding to the video content under the condition that the current network signal of the calling terminal does not meet a high-definition playing condition according to the network signal information;
determining the target color ring back tone file as an audio file corresponding to the video content under the condition that the calling terminal does not support the video color ring back tone playing condition according to the equipment type information;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition in the network signal information, determining the target color ring file as an audio file corresponding to the video content.
The step of determining a target polyphonic ringtone file adapted to the information of the calling terminal from the polyphonic ringtone files in the plurality of playing forms according to the information of the calling terminal includes any one of the following steps:
under the condition that the current network signal of the calling terminal is determined to meet a high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a high-definition video file corresponding to the video content and an audio file corresponding to the video content;
under the condition that the current network signal of the calling terminal is determined not to meet the high-definition playing condition according to the network signal information, determining that the target color ring back tone file is a non-high-definition video file corresponding to the video content and an audio file corresponding to the video content;
and under the condition that the current network signal of the calling terminal is determined not to meet the video color ring playing condition according to the network signal information, determining the target color ring file as an audio file corresponding to the video content.
Compared with the prior art, the scheme at least has the following advantages:
1. the problem of fuzziness caused by low pixel playing of full-screen video of most video color ring in the prior art is solved.
2. The problem that in the prior art, a plurality of sets of homologous video color ring files with different resolutions are stored for adapting to various opposite terminal terminals by the video color ring and the problem of storing redundant audio files are solved, and a large amount of storage hardware resources can be saved.
3. The problems of large transmission quantity, long consumed time and long waiting time of the opposite end of the video color ring in the prior art are solved, and the user experience is greatly improved.
4. The problem that in the prior art, video polyphonic ringtone is not well transmitted in a network or an opposite terminal does not support the video polyphonic ringtone and plays the traditional polyphonic ringtone for the video polyphonic ringtone is solved, the user requirement of showing real people in real time is met, and the user experience is greatly improved.
5. The problem that a large amount of video color ring files are stored for terminal adaptation in the prior art is solved, so that the process of providing offline processing for all opposite terminals can be avoided, the process is time-consuming and resource-consuming, the utilization rate of the video color ring files with the multiple adaptations is low, and the cost investment is huge.
6. In the process of dialing a called party by a calling party, the video color ring central platform can select the playing form of the video color ring according to the type of a calling terminal, so that when the calling terminal has corresponding conditions, the high-definition vertical-screen video color ring with better experience can be preferentially played.
7. According to the scheme, in the user setting process, high-definition vertical screen video elements are generated intelligently and asynchronously by using distributed nodes of the video color ring, the materials of the vertical screen video color ring are enriched, the labor cost is saved, the user operation is simplified, and the video quality is optimized; in the process of dialing a called party by a calling party, the playing mode can be selected according to the model of the calling mobile phone, so that the method has good adaptability.
In the several embodiments provided in the present application, it should be understood that the disclosed method and apparatus may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be physically included alone, or two or more units may be integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.
The integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium and includes several instructions to enable a computer device (which may be a personal computer, a server, or a network device) to execute some steps of the transceiving method according to various embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A method for processing video color ring is applied to a color ring platform, and comprises the following steps:
under the condition of receiving video content selected or uploaded by a user, obtaining polyphonic ringtone files of a plurality of playing forms corresponding to the video content;
under the condition of receiving a calling request of a calling terminal, acquiring information of the calling terminal carried in the calling request, wherein the information of the calling terminal comprises at least one of equipment type information and network signal information;
according to the information of the calling terminal, determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms;
and playing the target color ring back tone file to the calling terminal.
2. The processing method according to claim 1, wherein the color ring platform comprises a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise audio files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through a central node, and an audio file and a video file corresponding to the video content are obtained;
storing the audio file at the distributed node.
3. The processing method according to claim 1, wherein the color ring platform comprises a central node and a distributed node; the polyphonic ringtone files in the plurality of playing forms comprise high-definition video files corresponding to the video contents;
the step of obtaining the polyphonic ringtone files of the multiple playing forms corresponding to the video content comprises the following steps:
audio and video splitting is carried out on the video content through the central node, and an audio file and a video file corresponding to the video content are obtained;
in the distributed nodes, preprocessing the video file to obtain the constituent elements of the video content;
performing high-definition processing on the constituent elements to obtain high-definition elements of the video content;
and the central node generates a high-definition video file corresponding to the video content according to the high-definition elements of the video content.
4. The process of claim 3, wherein the high-cleaning process comprises: at least one of character high-definition processing, action high-definition processing, scene high-definition processing and emotion component high-definition processing.
5. The processing method according to claim 1, wherein the polyphonic ringtone files in the plurality of playing forms include a vertical screen video file corresponding to the video content;
under the condition that the video content is in a horizontal screen format, the step of obtaining the polyphonic ringtone files in the multiple playing forms corresponding to the video content comprises the following steps:
splitting audio and video of the video content to obtain a video file corresponding to the video content;
acquiring vertical screen center coordinates of each continuous frame of video image in the video file and a time interval of a target video clip of the video file;
and generating a vertical screen format video file corresponding to the video file according to the vertical screen center coordinate and the time interval of the target video clip.
6. The processing method according to claim 5, wherein the step of obtaining the time interval of the target video segment of the video file comprises:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file;
obtaining the time interval of a target video clip of the video file through a preset algorithm according to the time interval of the complete shot;
the preset algorithm comprises at least one of character recognition, action recognition, emotion recognition, scene recognition and video significance detection.
7. The processing method according to claim 5, wherein the step of obtaining the vertical screen center coordinates of each frame of video image in succession in the video file comprises:
carrying out shot switching detection on the video file to obtain a time interval of a complete shot of the video file; and
acquiring a user preference parameter;
acquiring tracking track coordinates corresponding to the video file according to the time interval of the complete shot and the user preference parameters;
and fitting the tracking track coordinates to obtain the vertical screen center coordinates of each continuous frame of video image in the video file.
8. The processing method according to claim 1, wherein the step of determining the target ring back tone file adapted to the information of the calling terminal from the ring back tone files in the plurality of play formats according to the information of the calling terminal comprises:
determining a target color ring back tone file matched with the information of the calling terminal from the color ring back tone files in the plurality of playing forms according to the first information and/or the second information;
the first information is determined according to the device type information, and the first information is used for indicating whether the calling terminal supports vertical screen video playing or not;
the second information is determined according to the network signal information, and the second information is used for indicating whether the current network signal of the calling terminal meets a high-definition playing condition.
9. A processing device of video color ring comprises: a transceiver, a memory, a processor, and a computer program stored on the memory and executable on the processor; the processor is configured to read a program in the memory to implement the steps in the method for processing a video coloring ring back tone according to any one of claims 1 to 8.
10. A computer-readable storage medium for storing a computer program, wherein the computer program, when executed by a processor, implements the steps in the method for processing a video ring back tone according to any one of claims 1 to 8.
CN202010984425.6A 2020-09-18 2020-09-18 Method and device for processing video color ring and computer readable storage medium Active CN112134999B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010984425.6A CN112134999B (en) 2020-09-18 2020-09-18 Method and device for processing video color ring and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010984425.6A CN112134999B (en) 2020-09-18 2020-09-18 Method and device for processing video color ring and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN112134999A true CN112134999A (en) 2020-12-25
CN112134999B CN112134999B (en) 2021-12-03

Family

ID=73842347

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010984425.6A Active CN112134999B (en) 2020-09-18 2020-09-18 Method and device for processing video color ring and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN112134999B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114285944A (en) * 2021-11-29 2022-04-05 咪咕文化科技有限公司 Video color ring back tone generation method and device and electronic equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103702172A (en) * 2013-12-13 2014-04-02 乐视网信息技术(北京)股份有限公司 Method and system for carrying out dolby transcoding on AV (Audio/Video)
CN106331753A (en) * 2015-06-30 2017-01-11 意法半导体国际有限公司 Synchronized rendering of split multimedia content on network clients
CN109151222A (en) * 2017-06-19 2019-01-04 中国电信股份有限公司 For controlling the method and color ring platform of trigger of CRBT
CN109803056A (en) * 2017-11-15 2019-05-24 中兴通讯股份有限公司 A kind of method, apparatus and system that video frequency color ring plays
CN110536028A (en) * 2019-08-15 2019-12-03 咪咕文化科技有限公司 Implementation method, color ring platform and the terminal of video frequency color ring
WO2020073999A1 (en) * 2018-10-12 2020-04-16 华为技术有限公司 Method for controlling video ringing playback and related apparatus
CN111049995A (en) * 2018-10-12 2020-04-21 华为技术有限公司 Method and related device for controlling video color ring in call

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103702172A (en) * 2013-12-13 2014-04-02 乐视网信息技术(北京)股份有限公司 Method and system for carrying out dolby transcoding on AV (Audio/Video)
CN106331753A (en) * 2015-06-30 2017-01-11 意法半导体国际有限公司 Synchronized rendering of split multimedia content on network clients
CN109151222A (en) * 2017-06-19 2019-01-04 中国电信股份有限公司 For controlling the method and color ring platform of trigger of CRBT
CN109803056A (en) * 2017-11-15 2019-05-24 中兴通讯股份有限公司 A kind of method, apparatus and system that video frequency color ring plays
WO2020073999A1 (en) * 2018-10-12 2020-04-16 华为技术有限公司 Method for controlling video ringing playback and related apparatus
CN111049995A (en) * 2018-10-12 2020-04-21 华为技术有限公司 Method and related device for controlling video color ring in call
CN110536028A (en) * 2019-08-15 2019-12-03 咪咕文化科技有限公司 Implementation method, color ring platform and the terminal of video frequency color ring

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114285944A (en) * 2021-11-29 2022-04-05 咪咕文化科技有限公司 Video color ring back tone generation method and device and electronic equipment
CN114285944B (en) * 2021-11-29 2023-09-19 咪咕文化科技有限公司 Video color ring generation method and device and electronic equipment

Also Published As

Publication number Publication date
CN112134999B (en) 2021-12-03

Similar Documents

Publication Publication Date Title
US10187668B2 (en) Method, system and server for live streaming audio-video file
CN107613357B (en) Sound and picture synchronous optimization method and device and readable storage medium
RU2334371C2 (en) System and method for multiplex transmission via media information network using limited connection resources and stored knowledge/experience of called or calling party
US7508413B2 (en) Video conference data transmission device and data transmission method adapted for small display of mobile terminals
US6989868B2 (en) Method of converting format of encoded video data and apparatus therefor
US20210337248A1 (en) Method and system for synthesizing audio/video
JP4994646B2 (en) Communication terminal, communication system, and communication terminal display method
US20060259552A1 (en) Live video icons for signal selection in a videoconferencing system
CN110536156B (en) Information processing method, system, communication device, and computer-readable storage medium
US20140118471A1 (en) Video Conferencing Method and Device Thereof
CN108337465B (en) Video processing method and device
US20060242676A1 (en) Live streaming broadcast method, live streaming broadcast device, live streaming broadcast system, program, recording medium, broadcast method, and broadcast device
US20090262136A1 (en) Methods, Systems, and Products for Transforming and Rendering Media Data
JP2004128614A (en) Image display controller and image display control program
JP2009510897A (en) Media content management
CN110650255B (en) Method and device for editing color ring back tone, color ring back tone editing unit and storage medium
EP1465423A1 (en) Videophone device and data transmitting/receiving method applied thereto
EP3174052A1 (en) Method and device for realizing voice message visualization service
WO2001059957A1 (en) Server device, communication terminal, relay server, conversion rule management server, and recording medium storing program
CN112134999B (en) Method and device for processing video color ring and computer readable storage medium
CN111464704B (en) Method, server, terminal and storage medium for implementing color ring back tone service
CN112492110B (en) Video color ring interaction method, system, electronic equipment and storage medium
CN101568007B (en) Video information processing method and system based on 3G video calling center
JPH10294896A (en) Method and device for processing digital information with superimposed characters
CN114938461A (en) Video processing method, device and equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant