Background technology
As networks have grown, network-based multimedia transmission has become increasingly common, and people's expectations of multimedia communication content rise day by day. People are no longer satisfied with single, passive, traditional forms of media entertainment and instead want more personalized and interactive modes of multimedia communication. In particular, with the further development of mobile network communication technology and the spread of mobile multimedia services, mobile multimedia has shown increasingly broad prospects and is gaining ever more mature user awareness and acceptance.
At present, a terminal sends a text message to a designated short code (a short code is a service number that the operator assigns to value-added service providers and partners). After the server processes the message, the receiving terminal receives a multimedia message or video corresponding to the text message. This is a form of terminal entertainment that is currently becoming popular. However, the multimedia message or video received, whether picture or sound, is generated from material that already exists on the server and lacks content personalized by the user.
For example, a first existing implementation is as follows:
Following a prompt (for example, a prompt on a web page or an SMS advertisement), the sending terminal edits a fixed message content, for example "18", and sends it to a certain short code. The short message server passes the text "18" to the processing server. The processing server finds a prepared animated picture or video that matches "18" and sends it to the receiving mobile phone.
A second existing implementation is as follows:
The sending terminal edits the short message "I Love You" and sends it to a certain short code. The short message server passes the text "I Love You" to the processing server. Using text-to-speech (TTS) conversion software, the processing server converts "I Love You" into an audio file that pronounces "I Love You". "I Love You" is then converted into an animated picture or video by one of the following methods:
(1) directly use a fixed animated picture or video;
(2) treat "I Love You" as one phrase and match it to a particular animated picture or video;
(3) split "I Love You" into three words, match each word to a picture or video, and merge the three media items in chronological order into a single medium, for example an animated picture or a continuous video.
The animated picture or video is then combined with the audio file on the server to produce an animation or video that contains the speech (for example "I Love You"), which is delivered to the receiving mobile phone.
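Method (3) of the second existing implementation can be illustrated with a minimal sketch. The material table, the clip file names, and the fallback clip are hypothetical and are not taken from the source; only the idea of splitting the text into words and matching one clip per word in order comes from the description.

```python
from typing import Dict, List

# Hypothetical material table held by the server: word -> prepared clip.
MATERIAL: Dict[str, str] = {
    "i": "clip_i.gif",
    "love": "clip_love.gif",
    "you": "clip_you.gif",
}
DEFAULT_CLIP = "clip_default.gif"  # assumed fallback for unmatched words


def match_clips(text: str) -> List[str]:
    """Split the text into words and return one clip per word, in order."""
    return [MATERIAL.get(word.lower(), DEFAULT_CLIP) for word in text.split()]
```

The returned list preserves word order, so merging the clips in list order gives the chronological sequence described in method (3).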
In the course of realizing the invention, the inventor found that the first approach above gives a higher degree of personalized experience than the second. In both approaches, however, users still cannot freely use the pictures or sounds they want. The existing implementations therefore struggle to meet users' personalized needs and their growing demand for fun and entertainment, and cannot deliver a satisfactory user experience.
Embodiment
The specific embodiments of the present invention are described in further detail below with reference to the drawings and embodiments.
Embodiment one:
The technical solution disclosed in this embodiment lets users freely use the pictures or sounds they want, meeting users' personalized needs and their growing demand for fun and entertainment as far as possible, and improving the user experience.
The user sends a text message, a personalized sound file, and a picture to the server through the sending terminal. The text message must be sent; the personalized sound file and the picture are each optional. The server receives and processes the information sent from the user terminal, generates multimedia, and then sends the multimedia to the receiving terminal.
The sending terminal and receiving terminal in this embodiment may be wired terminals, such as a PC, or wireless terminals, such as a mobile phone. A mobile phone may send through application software installed on the phone itself, or through the Wireless Application Protocol (WAP) by opening a WAP input interface, editing the information, and sending it. A wired terminal, such as a PC connected to the Internet, may visit a website through an Internet browser such as IE, open an editing page, and edit and send the information.
The text message sent by the terminal in this embodiment may originate from text that the user types on a keyboard, or from text obtained by converting speech input through the terminal's microphone with speech recognition software.
The text message sent by the user side in this embodiment may include:
1. Receiving terminal information
Specifically, this may be, for example, the other party's mobile phone number. Alternatively, the receiving terminal information may be omitted, or it may designate the sending terminal itself.
2. Statement information
The statement information is a text such as "I Love You". In this embodiment, exactly one of the statement information and the sound file must be present. Denoting the statement information by A and the sound file by B: if A is present, B cannot be; if B is present, A cannot be; and one of A and B must appear.
3. Media conversion indication information, for example indication information that determines whether a video or a cartoon is generated.
As mentioned above, the picture that the user sends through the terminal in this embodiment is optional: the user may choose to upload it or not.
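The content of a request can be sketched as a small data structure. This is an illustrative sketch only: the class name `CustomMediaRequest` and its field names are assumptions, not part of the source. The validation encodes the rule stated above that exactly one of the statement information and the sound file is present, while the receiving terminal information and the picture are optional.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CustomMediaRequest:
    """Hypothetical container for what the sending terminal submits."""

    conversion_indicator: str          # media conversion indication information
    receiver: Optional[str] = None     # receiving terminal information (optional)
    statement: Optional[str] = None    # statement information (A)
    audio: Optional[bytes] = None      # personalized sound file (B)
    picture: Optional[bytes] = None    # picture (optional)

    def __post_init__(self) -> None:
        # Exactly one of A (statement) and B (audio) must be present.
        if (self.statement is None) == (self.audio is None):
            raise ValueError("exactly one of statement and audio is required")
```

Rejecting an invalid combination at construction time keeps the later processing steps free of repeated checks.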
After receiving the text message from the sending terminal, the server parses it. The text message contains: receiving terminal information (if the sending terminal sent it); media conversion indication information, such as indication information that determines whether a video or a cartoon is generated; and statement information (if the sending terminal sent it). The server converts the picture into a first media file according to the media conversion indication information in the text message. For example, when the indication information indicates conversion to video, the first media file is a video; when it indicates conversion to a cartoon, the first media file is a cartoon. In other words, when the indication information indicates a different medium, the picture is converted into the corresponding first media file.
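The dispatch on the media conversion indication information can be sketched as a lookup table. The codes "0" (video) and "1" (cartoon) are the values used in embodiments two and three; the enum and function names are hypothetical, and rejecting unknown codes with an error is an assumption of this sketch rather than behavior stated in the source.

```python
from enum import Enum


class MediaKind(Enum):
    VIDEO = "video"
    CARTOON = "cartoon"


# Indicator codes as used in embodiments two and three.
INDICATOR_TO_KIND = {"0": MediaKind.VIDEO, "1": MediaKind.CARTOON}


def first_media_kind(indicator: str) -> MediaKind:
    """Return the kind of first media file that the picture is converted to."""
    try:
        return INDICATOR_TO_KIND[indicator]
    except KeyError:
        # Assumed behavior: reject indicator codes the server does not know.
        raise ValueError(f"unknown media conversion indicator: {indicator!r}")
```

Supporting further media types would only require extending the table, for example with two-digit codes.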
Before the server converts the picture into the first media file according to the media conversion indication information, it determines which picture to use. If the server has received a picture from the sending terminal, that picture is used. If not, the server may use the sending terminal information sent by the sending terminal to look up whether that terminal sent a picture previously; if such a picture is found, it is used; otherwise the server's default picture is used.
Alternatively, if the sending terminal did not send sending terminal information, the system by default selects a suitable picture directly from the preset system picture resources.
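The picture selection order described above can be sketched as follows. The function name, the in-memory dictionary of previously sent pictures, and the use of file names as stand-ins for picture data are all assumptions made for illustration.

```python
from typing import Dict, Optional


def choose_picture(
    uploaded: Optional[str],
    sender_id: Optional[str],
    stored_pictures: Dict[str, str],
    default_picture: str,
) -> str:
    """Pick the picture to convert, following the fallback order in the text."""
    if uploaded is not None:
        return uploaded                    # 1. picture sent with this request
    if sender_id is not None and sender_id in stored_pictures:
        return stored_pictures[sender_id]  # 2. picture this sender sent before
    return default_picture                 # 3. preset picture on the server
```

When no sending terminal information is available, the second step is skipped and the preset picture is used directly.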
If the user uploaded an audio file from the terminal, the voice features of the audio file are extracted. Further, according to the user side information, the voice features may be saved into the voice feature library corresponding to that user side, or may automatically overwrite the corresponding part of that library. If the user side did not upload an audio file, then in this embodiment the user must send statement information; the server matches a voice feature library according to the received sending terminal information and converts the statement information into an audio file that conforms to that library. Specifically, the server determines from the sending terminal information whether the sending terminal has a corresponding voice feature library. If it does, the conversion uses the library of that sending user; if it does not, the conversion uses the system's preset voice feature library.
That is, when statement information is converted into an audio file, the first reference is the voice feature library previously saved for this user (if any), so that the result sounds very similar to the user.
For example, if the user's statement information is "I Love You", the text "I Love You" becomes an audio file that pronounces "I Love You". Because the audio file is generated with reference to the user's voice feature library, it sounds very similar to the user's own pronunciation and is more lifelike.
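The handling of voice feature libraries can be sketched as two small functions. Feature extraction and speech synthesis themselves are not shown; the strings below are placeholders for whatever feature data a real system would store, and all names are hypothetical.

```python
from typing import Dict, Optional

SYSTEM_LIBRARY = "system-preset-voice"  # placeholder for the preset library


def save_voice_features(
    libraries: Dict[str, str], sender_id: str, features: str
) -> None:
    """Store features extracted from an uploaded audio file.

    Any existing entry for the same sender is overwritten.
    """
    libraries[sender_id] = features


def select_voice_library(
    libraries: Dict[str, str], sender_id: Optional[str]
) -> str:
    """Return the sender's own library if one exists, else the system preset."""
    if sender_id is not None and sender_id in libraries:
        return libraries[sender_id]
    return SYSTEM_LIBRARY
```

A sender who has uploaded audio before is therefore synthesized with their own saved features, and every other sender falls back to the preset library.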
Further, the server merges the audio file with the first media file, such as the video or cartoon, and converts them into a second media file, for example a video file carrying the audio, or a cartoon with sound.
In this embodiment, the first media file is converted to form the second media file. The first media file may be an intermediate-stage file and the second media file the final file; in short, they may stand in the relation of process file to final file. In this case the two may share the same format. For example, when the first media file is a video, combining it with the audio forms the second media file, which is also a media file in a video format.
Of course, the first media file and the second media file formed from it may also be media files of different formats.
The statement information may be embedded as captions, in various forms of presentation, in the generated video or cartoon with sound and played in real time; one such form is scrolling captions at the bottom of the video.
Further, the finally generated video or cartoon with sound may be sent to the receiving terminal and displayed or played there. For example, it may be received, displayed, or played on a mobile phone, or on a PC connected to the Internet.
The receiving terminal may be a mobile phone or PC different from the user's sending terminal, or it may be the sending terminal itself; this can be achieved by entering the sending terminal's number, that is, the user's own mobile phone number, as the receiving terminal information.
The media referred to in this embodiment include, but are not limited to: MPEG, AVI, RMVB, WMV, SWF, VIV, ASF, RM, RA, RP, RT, MOV, QT, 3GPP, MP4, 3D, JPEG, PNG, GIF, BMP, AMR, MMF, 3GPP, MP4, RM, AVI, WAV, APE, MP3/MP2/MP1/MPGA, WMA/ASF, MIDI/MID, VQF, AIF/AIFF, AU, VOC, AAC, VOX, and so on.
By implementing the technical solution disclosed in this embodiment, users can choose the pictures or sounds they want in a personalized way, meeting their personalized needs as far as possible.
Embodiment two:
This embodiment discloses a method of personalized media customization, which may be based on embodiment one. The method includes: at the sending terminal, the user uses client software to send to the server the statement information "Guessing Who I Am for you?", the recipient's mobile phone number "13891027634", and the indication information "0", which indicates that a video is to be generated. Optionally, the user also uploads a portrait photo of himself or herself to the server.
The client application includes, but is not limited to, Kjava, Symbian, SmartPhone, Mophun, Brew, or PDA applications, and other programs developed on the basis of these.
After receiving the above information, the server acts on the flag information "0", which indicates that a video is to be generated (a single digit is used here; 00, 01, 10, 11 and so on could of course be used to represent multiple media types to be generated). If the user uploaded a portrait photo, the server converts the photo into a video. It then converts the statement information "Guessing Who I Am for you?" into audio that pronounces the text, with voice features that conform to this user's voice feature library; if the user has no voice feature library, the system voice feature library is used. The statement information may be embedded as captions in the generated video in some form of presentation, for example scrolling captions at the bottom of the video.
Further, the generated video may be sent to the receiving terminal, such as the other party's mobile phone, for example the number 13891027634.
By implementing the technical solution disclosed in this embodiment, users can freely choose the pictures or sounds they want, meeting their personalized needs as far as possible.
Embodiment three:
This embodiment discloses a method of personalized media customization, which may be based on embodiment one. The method includes: through a PC, the user uploads his or her own audio file and portrait picture file on a website on the Internet, and also sends to the server the recipient's mobile phone number "13891027634" and the indication information "1", which indicates that a cartoon is to be generated.
After receiving the above information, the server, according to the flag information "1" indicating that a cartoon is to be generated, converts the portrait picture into a picture with a cartoon effect. At the same time it analyzes the audio file uploaded by the user, extracts the voice features, and saves them for later use.
The cartoon-effect picture and the audio file are then combined into a cartoon with sound, which may further be sent to the recipient's mobile phone 13891027634.
By implementing the technical solution disclosed in this embodiment, users can freely choose the pictures or sounds they want, meeting their personalized needs as far as possible.
Embodiment four:
Referring to Figure 1, this embodiment provides a server for personalized media customization, comprising:
a receiving unit, configured to receive a text message that includes media conversion indication information;
a processing unit, configured to convert a picture into a first media file according to the media conversion indication information received by the receiving unit, and to convert an audio file and the first media file into a second media file;
a sending unit, configured to send the second media file produced by the processing unit.
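The cooperation of the three units can be sketched as three small classes wired together. This is a schematic sketch under stated assumptions: the class and method names are hypothetical, the raw message is modeled as a dictionary, and media files are represented by descriptive strings instead of real conversion and merging.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class TextMessage:
    conversion_indicator: str
    receiver: str


class ReceivingUnit:
    def receive(self, raw: Dict[str, str]) -> TextMessage:
        """Parse a raw message that carries the conversion indicator."""
        return TextMessage(raw["indicator"], raw["receiver"])


class ProcessingUnit:
    KINDS = {"0": "video", "1": "cartoon"}  # codes from embodiments two and three

    def process(self, message: TextMessage, picture: str, audio: str) -> str:
        # Picture -> first media file, chosen by the indicator (placeholder).
        first_media = f"{self.KINDS[message.conversion_indicator]}({picture})"
        # First media file + audio -> second media file (placeholder merge).
        return f"{first_media}+{audio}"


class SendingUnit:
    def __init__(self) -> None:
        self.outbox: List[Tuple[str, str]] = []

    def send(self, receiver: str, media: str) -> None:
        self.outbox.append((receiver, media))  # stand-in for real delivery


class Server:
    def __init__(self) -> None:
        self.receiving = ReceivingUnit()
        self.processing = ProcessingUnit()
        self.sending = SendingUnit()

    def handle(self, raw: Dict[str, str], picture: str, audio: str) -> str:
        message = self.receiving.receive(raw)
        second_media = self.processing.process(message, picture, audio)
        self.sending.send(message.receiver, second_media)
        return second_media
```

Keeping the units as separate objects mirrors the point that the units need not be physically co-located; each could equally run on a different network element.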
By implementing the technical solution disclosed in this embodiment, users can freely choose the pictures or sounds they want, meeting their personalized needs as far as possible.
The device embodiment described above is only schematic. The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network elements. Those of ordinary skill in the art can understand and implement it without creative effort.
Embodiment five:
This embodiment provides a system for personalized media customization, comprising:
a sending terminal, configured to send a text message that includes media conversion indication information to the server;
a server, configured to convert a picture into a first media file according to the media conversion indication in the text message sent by the sending terminal, to convert an audio file and the first media file into a second media file, and to send the second media file to the receiving terminal;
a receiving terminal, configured to receive the second media file from the server and to display or play it.
By implementing the technical solution disclosed in this embodiment, users can freely choose the pictures or sounds they want, meeting their personalized needs as far as possible.
From the above description of the embodiments, those skilled in the art can clearly understand that the present invention may be implemented by software plus a necessary general-purpose hardware platform, or of course by hardware, although in many cases the former is the better implementation. On this understanding, the essence of the technical solution of the present invention, or the part that contributes over the background technology, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device to perform the methods described in the embodiments of the present invention.
Of course, the above are only several specific exemplary applications of the present invention. It should be noted that those of ordinary skill in the art may make improvements and modifications without departing from the principle of the invention, and such improvements and modifications shall also be regarded as falling within the scope of protection of the present invention.