CN109218630A

CN109218630A - A kind of method for processing multimedia information and device, terminal, storage medium

Info

Publication number: CN109218630A
Application number: CN201710546333.8A
Authority: CN
Inventors: 赵亮; 冯驰伟; 张中宝; 王文涛
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2017-07-06
Filing date: 2017-07-06
Publication date: 2019-01-15
Anticipated expiration: 2037-07-06
Also published as: CN109218630B

Abstract

The embodiment of the invention discloses a kind of method for processing multimedia information and device, terminal, storage mediums, wherein the described method includes: client obtains at least two multimedia messages that at least two cameras in more than two cameras of terminal are respectively shot；Synthetic parameters when determining for being synthesized at least two multimedia messages；At least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia messages after being synthesized；The first operation is received, first operation is for being sent to the corresponding server of the client for the multimedia after the synthesis；First operation is responded, the multimedia after the synthesis is sent to the corresponding server of the client.

Description

A kind of method for processing multimedia information and device, terminal, storage medium

Technical field

The present invention relates to Internet technology more particularly to a kind of method for processing multimedia information and device, terminal, storage Jie Matter.

Background technique

Present more and more clients, such as most of social categories apply (APP, Application) or instant messaging Class application, all support recorded video, which perhaps shoot picture or directly choose locally existing video or picture, simply to be located It is sent after reason, such as QQ, wechat.Prior art is all that the video recording and take pictures that the single camera of mobile phone provides is utilized Function can support the shooting for completing video record and picture in oneself APP, while also support to do after the completion of shooting Simple processing.For example, a camera is once used only and carries out shooting video/figure when such client publication video/picture Piece, for example the video of self-timer is shot with front camera, or record some events or information etc., shooting with rear camera Just video/picture is sent by simple process after the completion.Also some clients support is compiled from local selecting video Volume, such as video cutting etc., then the video/picture after cutting is distributed away.

However the video/picture acquisition scheme of these clients is in the form of a single, the content of expression only has single camera and adopts The video content collected, content itself are not abundant enough, three-dimensional.

Summary of the invention

In view of this, the embodiment of the present invention be solve the problems, such as it is existing in the prior art at least one and a kind of more matchmakers are provided Body information processing method and device, terminal, storage medium.

The technical solution of the embodiment of the present invention is achieved in that

The embodiment of the present invention provides a kind of method for processing multimedia information, which comprises

Client obtain terminal more than two cameras at least two cameras respectively shoot more than at least two Media information；

Synthetic parameters when determining for being synthesized at least two multimedia messages；

At least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia after being synthesized Information；

The first operation is received, first operation is corresponding for the multimedia after the synthesis to be sent to the client Server；

First operation is responded, the multimedia after the synthesis is sent to the corresponding server of the client.

The embodiment of the present invention provides a kind of device for processing muti-medium information, described device include acquiring unit, determination unit, Synthesis unit, receiving unit and transmission unit, in which:

The acquiring unit, what at least two cameras in more than two cameras for obtaining terminal were respectively shot At least two multimedia messages；

The determination unit, synthesis ginseng when for determining for being synthesized at least two multimedia messages Number；

The synthesis unit, for being synthesized according to the synthetic parameters at least two multimedia messages, Multimedia messages after being synthesized；

The receiving unit, for receiving the first operation, first operation is for sending out the multimedia after the synthesis Give the client corresponding server；

Multimedia after the synthesis is sent to the client for responding first operation by the transmission unit Hold corresponding server.

The embodiment of the present invention provides a kind of terminal, including memory, processor and storage on a memory and can handled The computer program run on device, the processor are configured to realize above-mentioned multimedia signal processing side when executing described program Method.

The embodiment of the present invention provides a kind of computer readable storage medium, is stored thereon with computer program, the computer Above-mentioned method for processing multimedia information is realized when program is executed by processor.

In the embodiment of the present invention, wherein client obtains at least two cameras in more than two cameras of terminal At least two multimedia messages respectively shot；Synthesis when determining for being synthesized at least two multimedia messages Parameter；At least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia letter after being synthesized Breath；The first operation is received, first operation is for being sent to the corresponding clothes of the client for the multimedia after the synthesis Business device；First operation is responded, the multimedia after the synthesis is sent to the corresponding server of the client；In this way, Client can synthesize the image that multiple cameras acquire, and the video for ultimately generating an independent completion is sent out Cloth；Thus, it is possible to express Same Scene from multiple latitudes, or once express multiple and different scenes, thus when watch video person " that scape of that feelings " can be understood from multiple latitudes, keep content richer, and expression is also more three-dimensional, considerably increases presence and entertaining Property, the user experience is improved.

Detailed description of the invention

Figure 1A is a kind of system architecture schematic diagram；

Figure 1B is the implementation process schematic diagram of method for processing multimedia information of the embodiment of the present invention；

Fig. 2A is the example schematic of the embodiment of the present invention；

Fig. 2 B is the example schematic that the embodiment of the present invention determines synthesising position region based on the operation of user；

Fig. 2 C is the example schematic that the embodiment of the present invention determines empty window region；

Fig. 2 D is the example schematic of Boundary Extraction of the embodiment of the present invention；

Fig. 3 A is the implementation process schematic diagram of image synthesizing method of the embodiment of the present invention；

Fig. 3 B is the schematic diagram of a scenario one of the embodiment of the present invention；

Fig. 3 C is the schematic diagram of a scenario two of the embodiment of the present invention；

Fig. 3 D is example schematic three when the embodiment of the present invention synthesizes；

Fig. 3 E is example schematic four when the embodiment of the present invention synthesizes；

Fig. 4 is the implementation process schematic diagram of image synthesizing method of the embodiment of the present invention；

Fig. 5 is the implementation process schematic diagram of image synthesizing method of the embodiment of the present invention；

Fig. 6 is the composed structure schematic diagram of device for processing muti-medium information of the embodiment of the present invention；

Fig. 7 is a kind of hardware entities schematic diagram of terminal in the embodiment of the present invention.

Specific embodiment

Each embodiment for a better understanding of the present invention now explains that relevant term is as follows:

Front camera: the camera being arranged with the main screen of terminal in the same face；If terminal only includes one aobvious Display screen, then the screen is just main screen, then the camera that the display screen one side is arranged in is front camera；Some ends End setting more than one display screen, including more than two display screens, then showing that the display screen at the interface of application is main screen Curtain, then front camera be and main screen the same face camera.It taking the mobile phone as an example, mobile phone only includes a display screen, Camera so above the display screen of mobile phone is front camera, and in general, front camera is used for self-timer.

Rear camera: the camera not being arranged in the same face with the main screen of terminal；If terminal only includes one Display screen, then the screen is just main screen, then the camera that the display screen back side is arranged in is rear camera；Some More than one display screen, including more than two display screens is arranged in terminal, then based on showing the display screen at the interface of application i.e. Screen does not show that the display screen of application interface is secondary screen, then rear camera is with secondary screen in the same face Camera.It takes the mobile phone as an example, mobile phone only includes a display screen, then the camera above the display screen of mobile phone is preposition takes the photograph As head, the camera at the mobile phone display screen back side is rear camera, positioned at the one side opposite with the display screen of mobile phone.

Dual camera: the general name of front camera and rear camera.

Advanced video: the video content of front camera acquisition.

Postposition video: the video content of rear camera acquisition.

The technical solution of the present invention is further elaborated in the following with reference to the drawings and specific embodiments.

Figure 1A is a kind of system architecture schematic diagram, and Figure 1A shows a communication system 1, which includes 11 kimonos of terminal Business device 12, wherein installs in terminal 11 and has run various clients (such as application program (APP, Application)), client End includes social software, instant message applications, to share clients, the server such as software from media software, information be client institute Corresponding server.In this example, terminal 11 and server 12 can be one or more, and therefore, above system 1 includes one Or multiple terminals 11 for being equipped with client and one or more server 12, these terminals 11 and server 12 pass through network 13 connections.In embodiments of the present invention, network side server 12 can be interacted with terminal 11 by client, and terminal 11 will Material to be released is to server 12, and then server releases the material to be released received.Wherein, element to be released Material includes text and multimedia messages, and wherein multimedia messages include at least photos and videos alternative one, is also possible to text With the combination of photo.

It should be noted that the system architecture that some embodiments of the present invention can be proposed based on Figure 1A.

The embodiment of the present invention provides a kind of method for processing multimedia information, is applied to terminal, the function that this method is realized It can be realized by the processor caller code in terminal, certain program code can be stored in computer storage medium In, during realization, which can be used as client, so as to install at the terminal, and by the terminal It is run.From the above, it can be seen that the terminal includes at least pocessor and storage media.Figure 1B is multimedia of the embodiment of the present invention The implementation process schematic diagram of information processing method, as shown in Figure 1B, this method comprises:

Step S101, client obtain terminal more than two cameras at least two cameras respectively shoot to Few two multimedia messages；

It is described two above including two, three, four etc. in the present embodiment.

In the present embodiment, camera includes various being able to carry out image acquisition component.

In the present embodiment, client includes the various clients that can be uploaded and issue multimedia messages, such as social software, Instant message applications shares the clients such as software, information upload software from media software, information, and user passes through social software (such as Facebook, QQ space, microblogging etc.) distribution of multimedia information of shooting is gone out, user can also will be clapped by instant message applications The multimedia messages taken the photograph are sent to oneself good friend, and user can certainly be by uploading the multimedia oneself shot from media software Information, user can also be shared software (such as video software YouTube, serge mile serge mile, bat visitor etc.) by information and share oneself The multimedia messages of shooting.

In this example, multimedia messages include at least photos and videos alternative one, for example, the photo or video of user's shooting It can be used as multimedia messages；User carries out editing and processing, such as rotation, cutting, filter, beauty to the photo or video of shooting Face increases text, enhancing saturation degree, highlights, enhances contrast etc., and photo or video after editing and processing can also be made For multimedia messages.

Terminal includes more than two cameras, such as two cameras, three cameras, four camera shootings in the present embodiment Head, wherein the mobile terminals such as general smart phone, tablet computer generally comprise two cameras, respectively the first camera and Second camera；And with feature functionality, such as with three-dimensional imaging (3D, 3Dimensions) function, augmented reality (AR, Augmented Reality) or virtual reality (VR, Virtual Reality) function mobile terminal may include three or Three or more cameras.It is illustrated by taking two cameras as an example below, the terminal includes that the first camera and second is taken the photograph As head, step S101, comprising: what the first multimedia messages and second camera of client acquisition the first camera shooting were shot Second multimedia messages.In general, when terminal includes two cameras, the first camera and second camera can be respectively Front camera and rear camera, then step S101, comprising: client obtains the first multimedia of front camera shooting Second multimedia messages of information and rear camera shooting.

Step S102, synthetic parameters when determining for being synthesized at least two multimedia messages；

In this example, synthetic parameters include various timess that can carry out at least two multimedia messages to synthesize an entirety One or more of anticipating parameters, such as synthesis pattern or synthesis template, synthesising position, prospect label, background label, composition rule Deng.

Synthesis pattern or synthesis template can be the parameter that typesetting is carried out at least two multimedia messages to acquisition, Synthesis pattern synthesizes template and can be and sets in default in advance, and is updated by server timing, is also possible to user oneself The some formwork styles or template being arranged.

Step S103 is synthesizing at least two multimedia messages according to the synthetic parameters, is being synthesized Multimedia messages afterwards；

In other embodiments, the terminal includes front camera and rear camera, and the synthetic parameters include closing Synthetic parameters at the band of position, when the determination is for being synthesized at least two multimedia messages, comprising: determine The second multimedia messages that the first multimedia messages synthesis of front camera shooting is shot in the rear camera Synthesising position region；Accordingly, described that at least two multimedia messages are being synthesized according to the synthetic parameters, it obtains Multimedia messages after to synthesis, comprising: the first multimedia messages for shooting the front camera are added in the synthesis Multimedia messages on the band of position, after being synthesized.

Step S104, receives the first operation, and first operation is described for the multimedia after the synthesis to be sent to The corresponding server of client；

Step S105 responds first operation, it is corresponding that the multimedia after the synthesis is sent to the client Server.

In the embodiment of the present invention, user is allowed to open multiple cameras of terminal simultaneously when shooting, and supporting will be multiple The image of camera acquisition merges preview and shows, and supports to increase a variety of combined rendering effects, and it is complete to ultimately generate an independence Whole video is distributed.Thus, it is possible to which viewing video person can be from multiple latitudes from the meaning of multiple latitudes expression video itself Degree understands " that scape of that feelings ", keeps content richer, and expression is also more three-dimensional, considerably increases presence and interest, improves use Family experience.

Wherein first operation and following embodiment in second operation, third operation, the 4th operation etc. other all may be used Think the operation of user, the type of operation and the input equipment of terminal are related, if input equipment is mouse, operation is click Operation, if input equipment is touch screen, operation is touch operation.The input equipment of terminal can also be other kinds of, Terminal receives the operation of user by touch screen or mouse, then generates instruction depending on the user's operation, then executes the instruction, Such as first act on the multimedia after the synthesis being sent to the corresponding server of the client, then terminal root The instruction of " for the multimedia after the synthesis to be sent to the corresponding server of the client " is generated according to the first operation, so After execute instruction, the multimedia after the synthesis is sent to the corresponding server of the client.Second operation below, the Other are similar with the first operation for three operations, the 4th operation etc., can understand refering to the first operation.

It is illustrated by taking the first and second multimedia messages of two cameras acquisition as an example below, such as shown in Fig. 2A A1 figure is rear camera the second image collected (the second multimedia messages), and a1 figure is front camera collected first Image (the first multimedia messages)；B1 figure to b3 figure is respectively synthesis pattern or synthesis template, i.e., according to these patterns or template First image and the second image are synthesized.Wherein, scheming template shown in b1 includes the first image of addition and the second image Region b11 and b12 can exist the first image or the addition of the second image when being synthesized the first image and the second image Region b11 can also scheme c1 and show the addition of the first image in area by the second image or the addition of the first image in region b12 Second image is added the interface schematic diagram in region b11 by domain b12；Scheming template shown in b2 includes the first image of addition and the The region b13 and b14 of two images can be by the first image or the second figures when being synthesized the first image and the second image As adding in region b13, it can also scheme c2 by the second image or the addition of the first image at region b14 and show by the first image It adds and the second image is added to the interface schematic diagram in region b14 in region b13；Scheming template shown in b3 includes addition first The region b15 and b16 of image and the second image can be by the first images when being synthesized the first image and the second image Or second image addition in region b15, can also by the second image or the addition of the first image in region b16, scheme c3 show by Second image is added the interface schematic diagram in region b16 in region b15 by the addition of the first image.In addition, mould shown in figure b3 In plate region b15 can be it is fixed may not be fixed, the position of b15 can be set in user, is also possible to terminal oneself Determining.

It can be seen that from Fig. 2A shown figures in the multimedia messages after utilizing synthesis pattern or synthesis templated synthesis, energy It is enough to realize at least two multimedia messages that show acquisition simultaneously in an interface, wherein template shown in b1 and b2 Respectively multimedia messages (the first image and the second image) can be mutually indepedent in multimedia messages Central Plains after synthesis, shown in b3 Respectively multimedia messages (the first image and the second image) can also be mutually nested in multimedia messages Central Plains after synthesizing in template, Scheme the difference for having prospect and background in template shown in b3.

During realization, the mutually nested (figure of such as Fig. 2A of the respective multimedia messages of original in pattern or template is synthesized Shown in b3) when, synthetic parameters include synthesising position region, and accordingly, the determination is used to believe at least two multimedia Synthetic parameters when breath is synthesized, comprising: determine the process in synthesising position region, the wherein determination in synthesising position region includes Following two mode:

Mode one receives third operation, is determined and is made from least two multimedia messages based on third operation For the multimedia messages of background, i.e., the described third operation is for being determined as background from least two multimedia messages Multimedia messages, then terminal shows the multimedia messages as background on the display interface of terminal；Receive the 4th behaviour Make, wherein the 4th operation is for determining synthesis of other multimedia messages on the multimedia messages as background The band of position；The synthesis for determining other multimedia messages on the multimedia messages as background is operated based on the described 4th The band of position adds other described multimedia messages in the multimedia messages as background according to the synthesising position region On.Wherein other described multimedias include at least two multimedia messages except the multimedia messages as background it Outer all or part of multimedia messages.

In other embodiments, the terminal includes front camera and rear camera, the determination in aforesaid way one The correlation step of multimedia messages as background can be omitted, i.e., mode one includes following two step: step SA11, receive For the second operation of second multimedia messages；Step SA12 determines synthesis based on the corresponding position of second operation The band of position.In this example, the second multimedia messages are rear camera shooting, therefore the second multimedia is suitable as background, And the first multimedia messages are front camera shooting, therefore the first multimedia messages are suitable as prospect, therefore can be Then the second multimedia messages of display screen display of terminal receive user for the second behaviour of second multimedia messages Make, second operation is for determining synthesis position of first multimedia messages on second multimedia messages as background Set region.If certain user wants change background, replacement operation can also be provided in the process of implementation, and user uses replacement Background is changed to the first multimedia messages from the second multimedia messages by operation.

In the embodiment shown in SA11 and SA12, terminal needs the more matchmakers shot from front camera and rear camera Determine which is rear camera shooting in body information, so as to the conduct background for shooting rear camera, therefore, in other realities It applies in example, the method also includes: client determines that rear camera is shot according to the attribute of the first and second multimedia messages Multimedia messages, the attribute of multimedia messages includes the size of file, format, source identification, shooting time, if multimedia Information is photo, then shooting time is the moment, if multimedia messages are videos, shooting time be include shooting start time And duration, wherein source identification indicates that multimedia messages are front camera shooting or rear camera shooting. Then the multimedia messages (the second multimedia messages) by rear camera shooting are used as background.

It is illustrated by taking Fig. 2 B as an example below, after client gets at least two multimedia messages, obtains at least two The multimedia messages that source identification in attribute is rear camera are determined background, it is assumed that client by the attribute of multimedia messages Hold the second multimedia messages using a1 is schemed as background, then user carries out the second operation on the second multimedia messages, such as A circle 21 is drawn on the second multimedia messages and is used as synthesising position region, and client is according to the corresponding position of the second operation, example A circle 20 is such as drawn, determines synthesising position region.Then client adds the first multimedia messages according to synthesising position region To get the multimedia messages to after synthesizing on the second multimedia messages.

Mode one provides the mode that a kind of base user's operation determines synthesising position region, below mode two one kind is provided Client automatically determines the mode in synthesising position region, wherein the synthetic parameters include prospect label, background label, synthesis position Region is set, mode two includes:

In SB11, prospect label or background mark are determined for each multimedia messages at least two multimedia messages Label；

In SB12, image recognition is carried out to the multimedia messages with background label, obtains sky window region；

In other embodiments, image recognition may include color identification and image texture identification, in general, color one Effective information brought by the relatively good region of cause property is with regard to fewer；Similarly, brought by the fewer region of image texture Effective information is also fewer；The fewer region in the relatively good region of colour consistency and image texture can be so determined as Empty window region.During realization, the region that textural characteristics can be met to preset condition is determined as sky window region.Work as image When identification is using color identification, the present embodiment further includes the steps that the color value in the determining empty window region.

In SB13, the empty window region is determined as synthesising position region.

Accordingly, step S104, it is described that at least two multimedia messages are being closed according to the synthetic parameters At multimedia messages after being synthesized, comprising: will add with the multimedia messages of prospect label in the synthesising position area Domain, the multimedia messages after being synthesized.

In the present embodiment, region is colour consistency region and does not include text the sky window, and colour consistency region can It is realized with being identified using color, such as pixel region of the color difference in threshold range can be determined as colour consistency region. The described pair of multimedia messages with background label carry out color identification, obtain sky window region, comprising: have background mark to described The multimedia messages of label carry out color identification, obtain the colour consistency area on the multimedia messages with background label Domain；C referring to fig. 2, it is assumed that the multimedia messages with background label are A figure or B figure in 2C, analyze A figure, obtain face Color Uniform Domains 2c1 and 2c2；B figure is analyzed, colour consistency region 2c3 and 2c4 is obtained；Then by the color Region on Uniform Domains including text is rejected, and sky window region is obtained.In other embodiments, this method further include: judgement Whether the sky window region is greater than preset pixel region, such as in general, two-dimension code area at least wants 100 pixel × 100 Pixel, if too small, adding synthesising position region can be too small, so region is unsatisfactory for the threshold value of the setting of pixel region, institute With cannot be as empty window region.If meeting the empty window region is greater than preset pixel region, and the empty window region is true It is set to described image region.

In the present embodiment, because multimedia messages may be video, the synthesis of multimedia messages may be the synthesis of video, Due to video be exactly one by one, then the video of the video of multiple cameras such as front camera and rear camera it Between synthesis, it is necessary to synthesize one by one, it is assumed that front camera has three frames [a1, a2, a3], and rear camera has three frames [b1, b2, b3], the video after synthesis also have three frames [c1, c2, c3]；Wherein, c1 may be a1 and b1 synthesis as a result, with public affairs Formula is expressed as c1=a1+b1；C1 may be a1 and b3 synthesis as a result, be formulated as c1=a1+b3, c1 may be a2 and B3 synthesis as a result, being formulated as c1=a2+b3；Relationship represented by formula can be embodied with related information above. The multimedia messages described in this way are video, described to be closed according to the synthetic parameters at least two multimedia messages At multimedia messages after being synthesized, comprising: before being determined for multimedia messages each at least two multimedia messages Scape label or background label；Image recognition is carried out to the multimedia messages with background label, obtains sky window region；By the sky Window region is determined as synthesising position region；Determine the frame sequential of each multimedia messages at least two multimedia messages； The frame sequential of multimedia messages with prospect label is associated with the foundation of the frame sequential of the multimedia messages with background label Information.Accordingly, described to be added in the synthesising position region, after being synthesized with the multimedia messages of prospect label Multimedia messages, comprising: will be added with the multimedia messages of prospect label in the synthesising position according to the related information Region, the multimedia messages after being synthesized.In other embodiments, judgement has the frame number of the multimedia messages of background label Whether amount is consistent with the number of frames of the multimedia messages with prospect label, is corresponded to, that is, had using null frame if inconsistent Have powerful connections label multimedia messages number of frames be less than with prospect label multimedia messages number of frames, broadcasting prospect It is sometimes multimedia messages not with background label during the number of frames of the multimedia messages of label.At other It is that the two can also be aligned according to the smallest quantity in embodiment.

In the present embodiment, multiple multimedia messages are synthesized, then can have which multimedia messages is superimposed upon The problem of on another multimedia messages, the video or image that the present embodiment acquires each give a label, i.e., this A video is the prospect or background of the video or image after synthesis；Accordingly, the multimedia messages are video, described according to institute It states synthetic parameters to synthesize at least two multimedia messages, the multimedia messages after being synthesized, comprising: for institute It states each multimedia messages at least two multimedia messages and determines prospect label or background label；To with the more of background label Media information carries out image recognition, obtains sky window region；The empty window region is determined as synthesising position region.It is described to have The multimedia messages of prospect label add the multimedia messages in the synthesising position region, after being synthesized, comprising: step S31A carries out border detection to the multimedia messages with prospect label, obtains boundary exterior domain；Step S32A, according to described The color value in synthesising position region carries out color filling to the boundary exterior domain, obtains filled more with prospect label Media information；Step S33A adds the multimedia messages with prospect label after color filling in the synthesising position area Domain, the multimedia messages after being synthesized.As shown in Figure 2 D, it is assumed that a that the multimedia messages with prospect label are Fig. 2 D schemes It is shown, to the result after a figure progress Boundary Extraction (also known as edge detection) of Fig. 2 D as shown in the b figure of Fig. 2 D, and in boundary Region is as shown in the c figure of Fig. 2 D, and boundary exterior domain is as shown in the d figure of Fig. 2 D, then to the d figure of Fig. 2 D according to synthesising position region Color value carry out color filling, it is assumed that the color value in synthesising position region be green, then by d figure oblique line indicate shade Area filling is green.

In the present embodiment, the multimedia messages be video, it is described according to the synthetic parameters to more than described at least two Media information is being synthesized, the multimedia messages after being synthesized, comprising: is each at least two multimedia messages Multimedia messages determine prospect label or background label；Image recognition is carried out to the multimedia messages with background label, is obtained Empty window region；The empty window region is determined as synthesising position region；It is described to be added with the multimedia messages of prospect label Multimedia messages in the synthesising position region, after being synthesized, comprising: step S31B, to more matchmakers with prospect label Body information carries out Boundary Extraction, obtains boundary inner region；Step S32B adds the boundary inner region in the synthesising position Region, the multimedia messages after being synthesized.As shown in Figure 2 D, it is assumed that the multimedia messages with prospect label are a of Fig. 2 D Shown in figure, the result after Boundary Extraction (also known as edge detection) is carried out to a figure of Fig. 2 D is as shown in the b figure of Fig. 2 D, and boundary Multimedia messages of the inner region as shown in the c figure of Fig. 2 D, by the c figure addition of Fig. 2 D in synthesis region, after being synthesized.

In other embodiments of the invention, for the difference of terminal kinds, for example some terminals are supported while being opened more A camera, and some terminals are only supported once to open a camera.For the difference on this terminal capability, the present embodiment Different solutions is also provided.Accordingly, step S101, it is described obtain terminal more than two cameras at least two At least two multimedia messages that camera is respectively shot, comprising: step S31C, judge the terminal whether support it is described at least Two cameras are shot simultaneously, step S32C, if it is determined that the terminal support at least two camera simultaneously into When row shooting, while at least two camera being called to be shot；Obtain the described of at least two cameras shooting At least two multimedia messages.Step S33C, if it is determined that the terminal is not supported at least two camera while being carried out When shooting, the camera defaulted at least two camera is called to be shot；One for obtaining the default takes the photograph After the multimedia messages as captured by head, it is called at least two camera in addition to a camera of the default His camera is successively shot；Obtain the multimedia messages that other cameras are successively shot.

The embodiment of the present invention can assist user while utilize the recording function of multiple cameras before and after mobile phone, carry out real-time The acquisition of video concentrates displaying, working process and finally synthesizes a solution that a video is externally distributed, while this hair Bright embodiment is also that user can be assisted to realize the working process for existing local video and carry out the one of more Video Compositions Kind solution.

In the embodiment of the present invention, user can be adopted using the multiple cameras progress videos of the front and rear of terminal simultaneously or successively Collection.The processing on real-time to picture material, such as various filtering effects, pendant, mosaic, barrage text are supported in collection process Deng.Before formally generating video file, support the content after the collected processing of multiple cameras merging displaying, and can Respective positions, size, shape, rotation when user's real-time selection being allowed to merge the scheme and effect shown, such as more video/pictures superposition Gyration etc., also the same various rendering effects for supporting to be supported in collection process.After user confirms final synthetic effect, Just the unique video file externally distributed comprising all the elements is generated.

For the difference of the type of terminal, for example some terminals are supported to open multiple cameras simultaneously, and some equipment are only It supports once to open a camera.For the difference in this capacity of equipment, this technology provides different solutions.Fig. 3 A For the implementation process schematic diagram of image synthesizing method of the embodiment of the present invention, the step S303 to step S311 on the left of Fig. 3 A is only to prop up The implementation process schematic diagram for once opening the equipment of a camera is held, the step S313 to step S318 on the right side of Fig. 3 A is simultaneously Support the implementation process schematic diagram of the equipment of the multiple cameras of unlatching, as shown in Figure 3A, this method comprises:

Step S301, entered function entrance；

Here, user operates the APP in terminal, terminal entered function entrance depending on the user's operation；For example, with APP is opened at family, and clicking " camera " button can start to record into the video record page.The each step recorded has corresponding Prompt and guide, to guide user's operation.Such as instant messaging APP, as shown in Figure 3B, Yong Hu are installed on user mobile phone The selection of chat interface 30 and the friend of oneself chat, such as user wants the view for sharing oneself to friend " Maffylee " 31 Frequently, then user's click function entrance 32, using camera icon 32 as functional entrance in this example.Into after recording interface, mobile phone Camera just starts work, it is assumed that front camera is started to work, and the head portrait 33 (referring to Fig. 3 C) of user, mobile phone screen are collected On can show the picture of camera live preview.Have on screen button support the switching of preposition and rear camera button 34, And button 35 for whether opening flash lamp etc..If user records and finishes, user can click stop button 36.In the present invention Other embodiments in, recording interface can permit user and select corresponding effect handled in real time acquisition data, Middle treatment effect 37 includes filter, pendant etc., and then user, which clicks, records button 36, just starts recorded video, stop button and Recording button is same button.

After the completion of multistage video capture, it may be selected to edit video or picture, such as effect of adjustment Video Composition, Size, position, direction of video/picture of superposition etc. is adjusted, is increased and is delivered text, watermark, expression, video or picture beaten Mosaic, modification background music, increase filtering effects etc..Increased effect supports live preview, can send out after user's confirmation It send.

Step S302 judges to support front and rear multi-cam while record, if so, supporting front and rear multi-cam same When record, enter step S313；If not, do not support front and rear multi-cam while recording, S303 is entered step；

Step S303 starts any camera and selects scape preview；

Here, since terminal only supports one camera of starting to record a video, then terminal will start the camera of default, such as The default camera head of fruit starting is not that user thinks camera to be started, then user can switch on interface；Such as The camera of terminal default starting is rear camera, and user is not desired to the rear camera of starting and wants the preposition camera shooting of starting Head, then user can operate on interface, the camera then started is switched over, i.e., is switched to rear camera Front camera.

Here, into after recording interface, the camera of mobile phone just starts work, it is assumed that front camera is started to work, and is adopted Collect the head portrait 33 (referring to Fig. 3 C) of user, the picture of camera live preview can be shown (referring also to figure on mobile phone screen The a figure of 3D, a of Fig. 3 E scheme).There is button to support the button 34 of preposition and rear camera switching and whether open on screen Open the button 35 etc. of flash lamp.If user records and finishes, user can click stop button 36.In other implementations of the invention In example, recording interface can permit user and select corresponding effect handled in real time acquisition data, wherein treatment effect 37 include filter, pendant etc., and then user, which clicks, records button 36, just starts recorded video, and stop button and recording button are Same button.

Step S304 acquires data prediction；

Here, terminal acquires original video data, then pre-processes to collected original video data；It is wherein pre- Processing such as includes increasing filtering effects, the cutting of video etc..Referring to the b figure of Fig. 3 D or the b figure of Fig. 3 E, icon 43 and 44 is The template of editor will give corresponding effect in the collected original image a increase of camera when user selects a certain template, Assuming that user has selected template 43, then correspondence increases 41 and 42 corresponding effects on image a.

Step S305 generates video/picture；

Here, pretreated video data is generated video/picture by terminal；Scheme referring to the c figure of Fig. 3 D or the c of Fig. 3 E, After the editor of b figure, the video/picture as shown in figure c is generated.

Step S306, video/picture playback confirmation；

Here, user can carry out playback confirmation to generated video, if user wants playback confirmation, user Playback button is clicked on interface, then terminal carries out playback operation according to the playback button that user clicks；If user is to rigid The video just generated is dissatisfied, then may repeat step S303 to step S305, until video/picture of the user to generation is full It means only.

Step S307 starts another camera and selects scape preview, and will acquire and show on video before content is added to Show；

Here, continue to accept aforementioned step S303 to step S305, step S303 to step S305 is a camera The process of video/picture is generated, and step S307 to step S309 is the process that another camera generates video/picture, with S303 above-mentioned is similar to step S305.

Here, after user starts front camera shooting video/picture, it is desirable to start rear camera shooting video/figure Piece, user select switching push button on interface, front camera are switched to rear camera, this starts another with terminal Camera (rear camera) selects scape preview, starts to record a video.

Step S308 acquires data prediction；

Here, terminal acquires original video data, then pre-processes to collected original video data；

Step S309 generates another video/picture；

Here, the pretreated video data of step S308 is generated video/picture by terminal；

Step S310, more video/picture superposition previews and effect adjustment；

Here, terminal synthesizes the video/picture that step S309 is generated with the step S306 video/picture generated. Referring to the d figure of Fig. 3 D or the d figure of Fig. 3 E, it is assumed that be superimposed upon c figure on the image of one desk, determine one in the image of the desk Then a synthesising position region can cut c figure, the image after cutting is placed on position shown in icon 47, then To after synthesis video or picture be adjusted, such as increase text 46, increase smiling face 45.

Here, user can also the video generated to step S309 carry out playback confirmation, if to want playback true by user Recognize, then user clicks playback button on interface, then terminal carries out playback operation according to the playback button that user clicks；Such as The video that fruit user generates step S309 is dissatisfied, then step S307 to step S309 may be repeated, until user is to life At video/picture it is satisfied until.

In other implementations of the invention, step S307 acquisition original video content can be added to step S306 generation Video/picture on the basis of, the original video content of certain step S307 acquisition can not also be added to step S306 generation Video/picture on the basis of.Video/figure that step S306 is generated before if the original video content of S307 acquisition is added to On the basis of piece, superimposed video can be obtained then handled to step S309.

Step S311, synthetic effect confirmation；

Here it is possible to show preview option on the interface of terminal, after user selects preview, the dialog box of confirmation is popped up, If user is satisfied to synthetic effect, user will do it confirmation operation, if user is dissatisfied to synthetic effect, use Family would not carry out confirmation operation, to will do it cancellation operation, after terminal receives confirmation operation, can jump to publication interface, If terminal receives the cancellation operation of user, synthesis interface can be re-started, so that terminal can re-start synthesis.

Step S312, publication；

Continue to accept aforementioned step S311, if after user carries out confirmation operation, publication interface, Yong Hu can be jumped to Publication interface can issue operation, and after terminal receives publication operation, terminal will do it publication, i.e. terminal (wraps the image after synthesis Include photos and videos) it is uploaded to server, server is issued after receiving the image of upload, if the client that user uses Wechat, after publication, then the friend of user then circle of friends can see user upload synthesis after image.If user makes Client is video sharing software, then other visitors then can see the image after the synthesis of user's upload.

Step S313, starting multi-cam select scape preview；

Into after recording interface, the camera of mobile phone just starts work, it is assumed that front camera is started to work, and use is collected The head portrait 33 (referring to Fig. 3 C) at family can show the picture of camera live preview on mobile phone screen.Before having button support on screen Set with the button 34 of the switching of rear camera and the button 35 for whether opening flash lamp etc..If user records and finishes, use Family can click stop button 36.In other embodiments of the invention, recording interface can permit the corresponding effect of user's selection Fruit handles acquisition data in real time, and wherein treatment effect 37 includes filter, pendant etc., then user clicks recording button 36, just start recorded video, stop button and recording button are same buttons.

Step S314, more video datas are handled in real time；

In this example, terminal can be handled multiple video datas in real time, in other examples, may not be Processing in real time, i.e. previous video are processed before can be, and handle the step of being only limitted to synthesis in real time.

Step S315 generates video/picture；

Video/picture in this example, according to step S314 treated image, after generating synthesis.

Step S316, video/picture playback confirmation；

Step S317, more video/picture superposition previews and effect adjustment；

In this example, step S316 may refer to aforementioned step S310.

Step S318, synthetic effect confirmation.

In this example, step S316 may refer to aforementioned step S311.

For the Video Composition logic that the video of real-time recording can be shown according to such as following figure, by multistage video according to user Edit effect synthesize a video.Fig. 4 is the implementation process schematic diagram of image synthesizing method of the embodiment of the present invention, such as Fig. 4 It is shown, this method comprises:

Step S401 starts camera；

Here, user operates the APP in terminal, terminal entered function entrance depending on the user's operation；For example, with APP is opened at family, and clicking " camera " button can start to record into the video record page, and such terminal starts camera. The each step recorded has corresponding prompt and guides, to guide user's operation.

Step S402, camera acquire data prediction；

Camera acquisition data prediction may include: that scenery by the optical imagery that camera lens (Lens) generates projects figure As then turning to electric signal on sensor surface, become number after analog-to-digital conversion (A/D, Analog/Digital) conversion Picture signal is then sent through working process in digital signal processing chip (DSP, Digital Signal Processing), then leads to It crosses data-interface such as universal serial bus (USB, Universal Serial Bus) interface and is transmitted to the processor of terminal such as Central processing unit (CPU, Central Processing Unit).

Step S403, multi-cam parallel data；

Step S404, data prediction: transcoding plus rendering effect；

Step S405 saves data to caching or local file；

Step S406 plays back preview, effect confirmation；

Step S401 is to step S406, and after terminal enters camera page, a camera of mobile phone just starts work, Mobile phone screen The picture of camera live preview can be shown on curtain.There is button to support preposition and rear camera switching on screen, and whether Open flash lamp etc..There is entrance to can permit user simultaneously and selects corresponding effect handled in real time acquisition data, such as Filter, pendant etc..It clicks and records button, just start recorded video.

Step S407, terminal judge to acquire whether data are completed；

Here, terminal judges to acquire whether data are completed, if it is not complete, then continue to acquire, if completed, Enter step S408.During realization, it can operate according to the user's choice to determine whether completing data acquisition.

Step S408, Video Composition；

Here, the Video Composition step in step S408 includes the determination of synthetic parameters, carries out video according to synthetic parameters Synthesis, synthetic parameters include prospect label, background label, synthesising position region etc..

Step S409, data render；

Here, it after the completion of data render includes: multistage video capture, may be selected to edit video or picture, such as The effect of Video Composition, size, position, the direction of video/picture of adjustment superposition etc. are adjusted, increases and delivers text, watermark, table Feelings break video or picture mosaic, modification background music, increase filtering effects etc..

Step S410, real-time exhibition, synthesis；

Step S411 plays back preview, effect confirmation；

Step S412, publication.

This example may refer to aforementioned step S312.

In the present embodiment, the data frame of imaging is captured in real time by mobile phone camera, when preview, data frame is carried out real-time The presentation of various video effects is completed in processing, is supplied to user's preview.User confirms that effect back can start the recording of video. After video generates, the rendering effect selected before user can be also generated in video file together.

In the present embodiment, the mobile phone recorded for supporting while starting multiple cameras, recording and synthesis can be same Shi Jinhang；For the mobile phone not supporting multiple cameras while recording, a video file is once recorded, multistages video is waited all to divide Not Lu Zhi after the completion of, summarize together superposition show, and be supplied to user carry out the later period editor and production entrance.Then starting is closed At logic, multistage video is synthesized on a video according to the edited effect of user.

For the video of non real-time recording, for example for local already existing video, can also be shown according to such as following figure Video Composition logic, multistage video is synthesized into a video according to the edit effect of user.Fig. 5 is view of the embodiment of the present invention The implementation process schematic diagram of frequency synthesis method, as shown in figure 5, this method comprises:

Step S501, Video Composition start；

Here, user operates the APP in terminal, terminal entered function entrance depending on the user's operation；For example, with APP is opened at family, is clicked " camera " button and is started the Video Composition stage, according to the user's choice from local or reception other equipment Video cache/file data of transmission.

Step S502 parses video cache/file data；

Here, terminal parses video cache/file data, and the format of APP defined is generated after being parsed File.

Step S503 judges whether it is single video data, if so, S508 is entered step, conversely, entering step S504.

If not single video data, then it is the corresponding video data of multiple cameras, then no longer needs to be synthesized.User It can choose a local image file, then acquire an image file in real time；Can also two files be all local.

Step S504, video frame synthesize frame by frame；

Step S504 synthesizes the haplopia frequency file of step S503.

Step S505, synthetic frame rendering；

In this example, after the completion of synthetic frame rendering includes: multistage video capture, it may be selected to compile video or picture Volume, such as adjust the effect of Video Composition, the size of the video/picture of adjustment superposition, position, direction, increase and deliver text, water Print, expression break video or picture mosaic, modification background music, increase filtering effects etc..Increased effect is supported real When preview, user confirmation after can send.

Step S506 generates video file；

Step S507, is incorporated into audio frequency effect；

Step S508 issues pre-treatment；

Step S509, Video Composition terminate.

Advantage of the invention is that the multi-cam camera function of mobile phone can be utilized, can sufficiently be sent out when shooting video The effect for waving each camera, expression main view frequency itself effective content simultaneously, also carry out auxiliary record using other cameras System, and auxiliary content is added on main view frequency, so that original video is carried richer information.The present invention can be supported preferably User expresses impression, experience, experience or the mood of itself from multiple latitudes.

The embodiment of the present invention provides a kind of device for processing muti-medium information again, each unit included by the device, each unit Each submodule included by included each module, each module can be realized by the processor in terminal, certainly may be used It is realized by logic circuit；In the process of implementation, processor can be central processing unit (CPU), microprocessor (MPU), number Word signal processor (DSP) or field programmable gate array (FPGA) etc..Wherein, terminal realization when can using calculate set It is standby to realize, wherein the calculating equipment can be various types of electricity with information processing capability in the process of implementation Sub- equipment, such as the electronic equipment may include mobile phone, tablet computer, desktop computer, personal digital assistant etc..

Fig. 6 is the composed structure schematic diagram of device for processing muti-medium information of the embodiment of the present invention, as shown in fig. 6, the device 600 include acquiring unit 601, determination unit 602, synthesis unit 603, receiving unit 604 and transmission unit 605, in which:

The acquiring unit 601, each self-timer of at least two cameras in more than two cameras for obtaining terminal At least two multimedia messages taken the photograph；

The determination unit 602, synthesis when for determining for being synthesized at least two multimedia messages Parameter；

The synthesis unit 603, for being closed according to the synthetic parameters at least two multimedia messages At multimedia messages after being synthesized；

The receiving unit 604, for receiving the first operation, first operation is for by the multimedia after the synthesis It is sent to the corresponding server of the client；

Multimedia after the synthesis is sent to the visitor for responding first operation by the transmission unit 605 The corresponding server in family end.

In other embodiments of the invention, the terminal includes front camera and rear camera, the synthesis ginseng Number includes synthesising position region, the determination unit, for determining that the first multimedia messages of the front camera shooting close At the synthesising position region of the second multimedia messages shot in the rear camera；The synthesis unit, being used for will be described First multimedia messages of front camera shooting add on the synthesising position region, the multimedia letter after being synthesized Breath.

In other embodiments of the invention, the determination unit includes receiving module and the first determining module, in which: institute Receiving module is stated, for receiving the second operation for being directed to second multimedia messages；First determining module, for being based on Described second, which operates corresponding position, determines synthesising position region.

In other embodiments of the invention, the synthetic parameters include prospect label, background label, synthesising position area Domain, the determination unit include the second determining module, identification module and third determining module, in which: second determining module, For determining prospect label or background label for each multimedia messages at least two multimedia messages；The identification mould Block obtains sky window region for carrying out image recognition to the multimedia messages with background label；The third determining module, For the empty window region to be determined as synthesising position region；The synthesis unit includes adding module, for that will have prospect The multimedia messages of label add the multimedia messages in the synthesising position region, after being synthesized.

In other embodiments of the invention, the synthesis unit further includes the 4th determining module and establishes module, in which: 4th determining module, for determining the frame sequential of each multimedia messages at least two multimedia messages；It is described Module is established, for that there will be the frame of the frame sequential of the multimedia messages of prospect label with the multimedia messages with background label Sequence establishes related information；The adding module, for according to the related information by the multimedia messages with prospect label Add the multimedia messages in the synthesising position region, after being synthesized.

In other embodiments of the invention, the adding module includes detection sub-module, filling submodule and addition Module, in which: the detection sub-module obtains boundary for carrying out border detection to the multimedia messages with prospect label Exterior domain；The filling submodule, for carrying out face to the boundary exterior domain according to the color value in the synthesising position region Color filling, obtains the filled multimedia messages with prospect label；The addition submodule, for will be after color filling Multimedia messages with prospect label add the multimedia messages in the synthesising position region, after being synthesized.

In other embodiments of the invention, the adding module includes extracting sub-module and addition submodule, in which: institute Extracting sub-module is stated, for carrying out Boundary Extraction to the multimedia messages with prospect label, obtains boundary inner region；It is described to add Add submodule, for the boundary inner region to be added to the multimedia messages in the synthesising position region, after being synthesized.

In other embodiments of the invention, the acquiring unit includes that the 5th determining module and first obtain module, In: the 5th determining module, when for determining that the terminal is supported at least two camera while being shot, simultaneously At least two camera is called to be shot；Described first obtains module, claps for obtaining at least two camera At least two multimedia messages taken the photograph.

In other embodiments of the invention, the acquiring unit includes the 6th determining module, second obtains module and the Three obtain module, in which: the 6th determining module, for determining that the terminal does not support at least two camera simultaneously When being shot, the camera defaulted at least two camera is called to be shot；Described second obtains module, After obtaining multimedia messages captured by a camera of the default, calls and remove institute at least two camera Other cameras except a camera of default are stated successively to be shot；The third obtains module, described for obtaining The multimedia messages that other cameras are successively shot.

The description of apparatus above embodiment, be with the description of above method embodiment it is similar, have same embodiment of the method Similar beneficial effect, therefore do not repeat them here.For undisclosed technical detail in apparatus of the present invention embodiment, this hair is please referred to The description of bright embodiment of the method and understand.

In the embodiment of the present invention, if realizing above-mentioned method for processing multimedia information in the form of software function module, And when sold or used as an independent product, it also can store in a computer readable storage medium.Based in this way Understanding, substantially the part that contributes to existing technology can be produced the technical solution of the embodiment of the present invention in other words with software The form of product embodies, which is stored in a storage medium, including some instructions are used so that one Platform computer equipment (can be personal computer, server or network equipment etc.) executes described in each embodiment of the present invention The all or part of method.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read Only Memory), the various media that can store program code such as magnetic or disk.In this way, the embodiment of the present invention is not limited to appoint What specific hardware and software combines.

The embodiment of the present invention provides a kind of computer storage medium, and being stored with computer in the computer storage medium can It executes instruction, the computer executable instructions are for executing method for processing multimedia information provided in an embodiment of the present invention.

The embodiment of the present invention provides a kind of multimedia signal processing equipment, comprising:

Storage medium is configured to storage executable instruction；

Processor is configured to execute the executable instruction of storage, and the executable instruction is for executing above-mentioned multimedia Information processing method.

The description of apparatus above embodiment, the description with above-mentioned storage medium and apparatus embodiments be it is similar, have same The similar beneficial effect of embodiment of the method, therefore do not repeat them here.For not disclosed in storage medium of the present invention and apparatus embodiments Technical detail, please refer to the description of embodiment of the present invention method and understand.

Fig. 7 is a kind of hardware entities schematic diagram of terminal in the embodiment of the present invention, as shown in fig. 7, the hardware of the terminal 700 Entity includes: processor 701, communication interface 702, input module 703, display module 704 and memory 705, wherein

The usually control of processor 701 calculates the overall operation of equipment 700.For example, input module 703 may be embodied as touching Screen exports the operating characteristics of characterization touch screen to the processor 701 (including contact position, number of contacts, triggering pressure) User's operation data, processor 701 can parse user's operation data and determine the function that user triggers in display interface, generate The display data of the function of corresponding triggering, so that display module 704 loads the page of the function of corresponding triggering.

Communication interface 702 can make calculating equipment pass through network and other terminals or server communication.

Input module 703 can be configured to receive the character information of input, and generate and user setting and function control There is OFF signal input.Wherein, input module may include touch-control surface, which collects the touching of user on it or nearby Touching operation, (for example user is attached in touch-control surface or in touch-control surface using any suitable object or attachment such as finger, stylus Close operation), touch operation bring signal is obtained, contact coordinate is converted the signal into, then gives the processing of processor 701, and The order that processor 701 is sent can be received and executed.

Display module 704 is configurable to the function and relevant information of the realization of video-stream processor 701.

Memory 705 is configured to store the instruction and application that can be performed by processor 701, can also cache device to be processed 701 and calculate equipment 700 in each module it is to be processed or processed data (for example, image data, audio data, voice Communication data and video communication data), flash memory (FLASH) or random access storage device 705 (RAM, Random can be passed through Access Memory) it realizes.

It should be understood that " one embodiment " or " embodiment " that specification is mentioned in the whole text mean it is related with embodiment A particular feature, structure, or characteristic is included at least one embodiment of the present invention.Therefore, occur everywhere in the whole instruction " in one embodiment " or " in one embodiment " not necessarily refer to identical embodiment.In addition, these specific features, knot Structure or characteristic can combine in any suitable manner in one or more embodiments.It should be understood that in various implementations of the invention In example, magnitude of the sequence numbers of the above procedures are not meant that the order of the execution order, and the execution sequence of each process should be with its function It can determine that the implementation process of the embodiments of the invention shall not be constituted with any limitation with internal logic.The embodiments of the present invention Serial number is for illustration only, does not represent the advantages or disadvantages of the embodiments.

It should be noted that, in this document, the terms "include", "comprise" or its any other variant are intended to non-row His property includes, so that the process, method, article or the device that include a series of elements not only include those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or device institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including being somebody's turn to do There is also other identical elements in the process, method of element, article or device.

In several embodiments provided herein, it should be understood that disclosed device and method can pass through it Its mode is realized.Apparatus embodiments described above are merely indicative, for example, the division of the unit, only A kind of logical function partition, there may be another division manner in actual implementation, such as: multiple units or components can combine, or It is desirably integrated into another system, or some features can be ignored or not executed.In addition, shown or discussed each composition portion Mutual coupling or direct-coupling or communication connection is divided to can be through some interfaces, the INDIRECT COUPLING of equipment or unit Or communication connection, it can be electrical, mechanical or other forms.

Above-mentioned unit as illustrated by the separation member, which can be or may not be, to be physically separated, aobvious as unit The component shown can be or may not be physical unit；Both it can be located in one place, and may be distributed over multiple network lists In member；Some or all of units can be selected to achieve the purpose of the solution of this embodiment according to the actual needs.

In addition, each functional unit in various embodiments of the present invention can be fully integrated in one processing unit, it can also To be each unit individually as a unit, can also be integrated in one unit with two or more units；It is above-mentioned Integrated unit both can take the form of hardware realization, can also realize in the form of hardware adds SFU software functional unit.

Those of ordinary skill in the art will appreciate that: realize that all or part of the steps of above method embodiment can pass through The relevant hardware of program instruction is completed, and program above-mentioned can store in computer-readable storage medium, which exists When execution, step including the steps of the foregoing method embodiments is executed；And storage medium above-mentioned includes: movable storage device, read-only deposits The various media that can store program code such as reservoir (Read Only Memory, ROM), magnetic or disk.

If alternatively, the above-mentioned integrated unit of the present invention is realized in the form of software function module and as independent product When selling or using, it also can store in a computer readable storage medium.Based on this understanding, the present invention is implemented Substantially the part that contributes to existing technology can be embodied in the form of software products the technical solution of example in other words, The computer software product is stored in a storage medium, including some instructions are used so that computer equipment (can be with It is personal computer, server or network equipment etc.) execute all or part of each embodiment the method for the present invention. And storage medium above-mentioned includes: various Jie that can store program code such as movable storage device, ROM, magnetic or disk Matter.

The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain Lid is within protection scope of the present invention.Therefore, protection scope of the present invention should be based on the protection scope of the described claims.

Claims

1. a kind of method for processing multimedia information, which is characterized in that the described method includes:

Client obtains at least two multimedias that at least two cameras in more than two cameras of terminal are respectively shot Information；

At least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia letter after being synthesized Breath；

The first operation is received, first operation is for being sent to the corresponding clothes of the client for the multimedia after the synthesis Business device；

2. the method according to claim 1, wherein the synthetic parameters include prospect label, background label, conjunction Synthetic parameters at the band of position, when the determination is for being synthesized at least two multimedia messages, comprising: for institute It states each multimedia messages at least two multimedia messages and determines prospect label or background label；To with the more of background label Media information carries out image recognition, obtains sky window region；The empty window region is determined as synthesising position region；

It is described that at least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia after being synthesized Information, comprising: the multimedia in the synthesising position region, after being synthesized will be added with the multimedia messages of prospect label Information.

3. according to the method described in claim 2, it is characterized in that, the terminal include the first camera and second camera, The synthetic parameters include synthesising position region, when the determination is for being synthesized at least two multimedia messages Synthetic parameters, comprising: determine that the first multimedia messages synthesis of the first camera shooting is shot in the second camera The second multimedia messages synthesising position region；

It is described that at least two multimedia messages are being synthesized according to the synthetic parameters, the multimedia after being synthesized Information, comprising: the first multimedia messages of first camera shooting are added on the synthesising position region, are closed Multimedia messages after.

4. according to the method described in claim 3, it is characterized in that, more than first matchmaker of the determination the first camera shooting The synthesising position region for the second multimedia messages that the synthesis of body information is shot in the second camera, comprising:

Receive the second operation for second multimedia messages；

Synthesising position region is determined based on the corresponding position of second operation.

5. according to the method described in claim 2, it is characterized in that, it is described according to the synthetic parameters to more than described at least two Media information is being synthesized, the multimedia messages after being synthesized, further includes: is determined at least two multimedia messages The frame sequential of each multimedia messages；By the frame sequential of the multimedia messages with prospect label and with more matchmakers of background label The frame sequential of body information establishes related information；

The multimedia letter by with the addition of the multimedia messages of prospect label in the synthesising position region, after being synthesized Breath, comprising: will be added with the multimedia messages of prospect label in the synthesising position region, obtained according to the related information Multimedia messages after synthesis.

6. according to the method described in claim 2, it is characterized in that, described will exist with the addition of the multimedia messages of prospect label The synthesising position region, the multimedia messages after being synthesized, comprising:

Border detection is carried out to the multimedia messages with prospect label, obtains boundary exterior domain；

Color filling is carried out to the boundary exterior domain according to the color value in the synthesising position region, obtains filled having The multimedia messages of prospect label；

The multimedia messages with prospect label after color filling are added in the synthesising position region, after being synthesized Multimedia messages.

7. according to the method described in claim 2, it is characterized in that, described will exist with the addition of the multimedia messages of prospect label The synthesising position region, the multimedia messages after being synthesized, comprising:

Boundary Extraction is carried out to the multimedia messages with prospect label, obtains boundary inner region；

The boundary inner region is added into the multimedia messages in the synthesising position region, after being synthesized.

8. method according to any one of claims 1 to 7, which is characterized in that the more than two camera shootings for obtaining terminal At least two multimedia messages that at least two cameras in head are respectively shot, comprising:

When determining that the terminal is supported at least two camera while being shot, while calling at least two camera shooting Head is shot；

Obtain at least two multimedia messages of at least two cameras shooting.

9. method according to any one of claims 1 to 7, which is characterized in that the more than two camera shootings for obtaining terminal At least two multimedia messages that at least two cameras in head are respectively shot, comprising:

When determining that the terminal is not supported at least two camera while being shot, at least two camera is called One camera of middle default is shot；

After obtaining multimedia messages captured by a camera of the default, calls and remove institute at least two camera Other cameras except a camera of default are stated successively to be shot；

Obtain the multimedia messages that other cameras are successively shot.

10. a kind of device for processing muti-medium information, which is characterized in that described device includes acquiring unit, determination unit, synthesis list Member, receiving unit and transmission unit, in which:

The acquiring unit, at least two cameras in more than two cameras for obtaining terminal are respectively shot at least Two multimedia messages；

The determination unit, synthetic parameters when for determining for being synthesized at least two multimedia messages；

The synthesis unit is obtained for being synthesized according to the synthetic parameters at least two multimedia messages Multimedia messages after synthesis；

The receiving unit, for receiving the first operation, first operation is for the multimedia after the synthesis to be sent to The corresponding server of the client；

Multimedia after the synthesis is sent to the client pair for responding first operation by the transmission unit The server answered.

11. device according to claim 10, which is characterized in that the synthetic parameters include prospect label, background label, Synthesising position region, the determination unit include the second determining module, identification module and third determining module, in which:

Second determining module, for determining prospect label for each multimedia messages at least two multimedia messages Or background label；

The identification module obtains sky window region for carrying out image recognition to the multimedia messages with background label；

The third determining module, for the empty window region to be determined as synthesising position region；

The synthesis unit includes adding module, for that will have the addition of the multimedia messages of prospect label in the synthesising position Region, the multimedia messages after being synthesized.

12. a kind of terminal including memory, processor and stores the computer journey that can be run on a memory and on a processor Sequence, which is characterized in that the processor realizes the described in any item more matchmakers of claim 1 to 9 when being configured to execute described program Body information processing method.

13. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program quilt Claim 1 to 9 described in any item method for processing multimedia information are realized when processor executes.