WO2023045710A1 - Multimedia display and matching methods and apparatuses, device and medium


Publication number
WO2023045710A1
Authority
WO
WIPO (PCT)
Prior art keywords
multimedia data
multimedia
target
matched
data
Application number
PCT/CN2022/115521
Other languages
French (fr)
Chinese (zh)
Inventor
黄造军
徐之俊
冯宇飞
邓子建
吴铭泽
Original Assignee
北京字节跳动网络技术有限公司
Application filed by 北京字节跳动网络技术有限公司
Publication of WO2023045710A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/77 Retouching; Inpainting; Scratch removal
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/44 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/483 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Definitions

  • the present disclosure relates to the technical field of multimedia processing, in particular to a multimedia display and matching method, device, equipment and medium.
  • the present disclosure provides a multimedia display and matching method, device, equipment and medium.
  • the present disclosure provides a multimedia display method, including:
  • the present disclosure provides a multimedia matching method, including:
  • the multimedia data to be matched is obtained by performing special effect processing on the original multimedia data
  • the target multimedia data matching the first multimedia feature is queried, and the target multimedia data is used to generate merged multimedia data with the multimedia data to be matched.
  • the present disclosure provides a multimedia display device, including:
  • a data receiving unit configured to receive original multimedia data
  • the special effect editing unit is configured to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched;
  • the data synthesis unit is configured to generate synthesized multimedia data based on the multimedia data to be matched and the target multimedia data, and the target multimedia data is obtained by matching the first multimedia feature of the multimedia data to be matched;
  • the data display unit is configured to display synthesized multimedia data.
  • the present disclosure provides a multimedia matching device, including:
  • the data receiving unit is configured to receive the multimedia data to be matched, and the multimedia data to be matched is obtained after performing special effect processing on the original multimedia data;
  • a feature extraction unit configured to extract a first multimedia feature from the multimedia data to be matched
  • a data acquisition unit configured to acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched
  • the data query unit is configured to query target multimedia data that matches the first multimedia feature among multiple candidate multimedia data, and the target multimedia data is used to generate merged multimedia data with the multimedia data to be matched.
  • the present disclosure provides a computing device, including:
  • the processor is used to read executable instructions from the memory, and execute the executable instructions to implement the multimedia display method of the first aspect, or to implement the multimedia matching method of the second aspect.
  • the present disclosure provides a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to implement the multimedia display method of the first aspect or the multimedia matching method of the second aspect.
  • the multimedia display and matching methods, apparatuses, device and medium of the disclosed embodiments perform special effect editing on the received original multimedia data, and then generate and display synthesized multimedia data based on the edited multimedia data to be matched and the target multimedia data. Since the target multimedia data is obtained by matching based on the first multimedia feature of the multimedia data to be matched, the synthesized multimedia data obtained from the original multimedia data may include, in addition to the special effects in the multimedia data to be matched, the content of the target multimedia data that matches the multimedia data to be matched. The multimedia picture therefore contains multiple elements, which enriches the beautification effect of the multimedia data, makes the multimedia display more interesting, and enables users to interact through multimedia data, thereby diversifying the interactions between users and improving the user experience.
  • FIG. 1 shows an architecture diagram of a multimedia display system provided by an embodiment of the present disclosure
  • FIG. 2 shows an architecture diagram of another multimedia display system provided by an embodiment of the present disclosure
  • FIG. 3 shows a schematic flowchart of a multimedia display method provided by an embodiment of the present disclosure
  • FIG. 4 shows a schematic diagram of a shooting preview interface provided by an embodiment of the present disclosure
  • Fig. 5 shows a schematic diagram of a special effect editing interface provided by an embodiment of the present disclosure
  • FIG. 6 shows a schematic diagram of a display interface of multimedia data to be matched provided by an embodiment of the present disclosure
  • FIG. 7 shows a schematic diagram of matching logic of multimedia data provided by an embodiment of the present disclosure
  • FIG. 8 shows a schematic diagram of a display interface for synthesizing multimedia data provided by an embodiment of the present disclosure
  • FIG. 9 shows a schematic flowchart of another multimedia display method provided by an embodiment of the present disclosure.
  • FIG. 10 shows a schematic flowchart of another multimedia display method provided by an embodiment of the present disclosure.
  • FIG. 11 shows a schematic flowchart of a multimedia matching method provided by an embodiment of the present disclosure
  • FIG. 12 shows a schematic structural diagram of a multimedia display device provided by an embodiment of the present disclosure
  • FIG. 13 shows a schematic structural diagram of a multimedia matching device provided by an embodiment of the present disclosure
  • Fig. 14 shows a schematic structural diagram of a computing device provided by an embodiment of the present disclosure.
  • the term “comprise” and its variants are open-ended, i.e. “including but not limited to”.
  • the term “based on” is “based at least in part on”.
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one further embodiment”; the term “some embodiments” means “at least some embodiments.” Relevant definitions of other terms will be given in the description below.
  • preset special effect templates to beautify multimedia data. For example, preset stickers or preset special effects can be added to the captured pictures.
  • the user can only select a special effect template from preset special effect tools, and the beautification effect is relatively simple, lacks interest, and reduces user experience.
  • embodiments of the present disclosure provide multimedia display and matching methods, apparatuses, a device and a medium capable of displaying synthesized multimedia data generated from the multimedia data to be matched and the matching target multimedia data.
  • the multimedia display method provided by the present disclosure can be applied to the architecture shown in FIG. 1 and FIG. 2 , and will be described in detail with reference to FIG. 1 and FIG. 2 .
  • Fig. 1 shows a structure diagram of a multimedia display system provided by an embodiment of the present disclosure.
  • the multimedia display system may include at least one electronic device 101 at the client end and at least one server 102 at the server end.
  • the electronic device 101 can establish a connection with the server 102 and exchange information through a network protocol such as Hyper Text Transfer Protocol over Secure Socket Layer (HTTPS).
  • the electronic device 101 may be a mobile phone, a tablet computer, a desktop computer, a notebook computer, a vehicle terminal, a wearable device, an all-in-one machine, a smart home device, and other devices with communication functions, or it may be a device simulated by a virtual machine or a simulator.
  • the server 102 may be a device with storage and computing functions such as a cloud server or a server cluster.
  • the user can edit original multimedia data with special effects on the electronic device 101 through a specific service platform, and generate and display synthesized multimedia data.
  • the specific service platform may be a specific application program or a specific website, such as a social platform or a video playing platform with social functions.
  • the electronic device 101 can obtain original multimedia data such as images or videos, and perform special effect editing on the original multimedia data to obtain the multimedia data to be matched. After obtaining multiple pieces of candidate multimedia data including the target multimedia data P11 from the server 102, the electronic device 101 may query the target multimedia data P11 from the candidate multimedia data based on the first multimedia feature of the multimedia data to be matched. Then, the electronic device 101 may generate synthesized multimedia data P12 based on the multimedia data to be matched and the target multimedia data P11 obtained by matching the first multimedia feature of the multimedia data to be matched. Optionally, continuing to refer to FIG. 1, the electronic device 101 may upload the generated synthesized multimedia data P12 to the server 102.
  • alternatively, the electronic device 101 may upload the multimedia data to be matched to the server 102. After receiving the multimedia data to be matched, the server 102 can match the target multimedia data P11 from multiple pieces of candidate multimedia data and send the target multimedia data P11 to the electronic device 101. Then, the electronic device 101 may generate synthesized multimedia data P12 based on the multimedia data to be matched and the target multimedia data P11 obtained by matching the first multimedia feature of the multimedia data to be matched.
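  • The two deployments described for FIG. 1 differ only in where the matching runs. The following is a minimal Python sketch under stated assumptions: the `Media` record, its set-valued `features` field, the `synthesize` placeholder, and the "most shared features wins" rule in `pick_target` are all hypothetical stand-ins for illustration, not the matching algorithm of the disclosure.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Optional, Sequence


@dataclass(frozen=True)
class Media:
    """Hypothetical record for one piece of multimedia data."""
    media_id: str
    features: FrozenSet[str]  # stand-in for the extracted multimedia features


def pick_target(to_match: Media, candidates: Sequence[Media]) -> Optional[Media]:
    """Assumed rule: the candidate sharing the most features is the target."""
    best = max(
        candidates,
        key=lambda c: len(c.features & to_match.features),
        default=None,
    )
    if best is None or not (best.features & to_match.features):
        return None  # no candidate shares any feature
    return best


def match_on_device(
    to_match: Media, fetch_candidates: Callable[[], Sequence[Media]]
) -> Optional[Media]:
    """Variant 1: the device downloads the candidates and matches locally."""
    return pick_target(to_match, fetch_candidates())


def match_on_server(
    to_match: Media, server_candidates: Sequence[Media]
) -> Optional[Media]:
    """Variant 2: the device uploads its data and the server does the matching."""
    return pick_target(to_match, server_candidates)


def synthesize(to_match: Media, target: Media) -> str:
    """Placeholder for generating the synthesized multimedia data."""
    return f"{to_match.media_id}+{target.media_id}"
```

  • In both variants the device ends up holding the target multimedia data and performs the synthesis itself; only the location of `pick_target` changes.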
  • the multimedia display method provided by the present disclosure can also be applied to a specific scenario where users of multiple electronic devices interact through multimedia data; the architecture shown in FIG. 2 is described below.
  • FIG. 2 shows a structural diagram of another multimedia display system provided by an embodiment of the present disclosure.
  • the multimedia display system may include at least one first electronic device 201 and at least one second electronic device 202 on the client side, and at least one server 203 on the server side.
  • the first electronic device 201, the second electronic device 202, and the server 203 can respectively establish connections and perform information exchange through network protocols such as HTTPS.
  • the first electronic device 201 and the second electronic device 202 may be devices with communication functions such as mobile phones, tablet computers, desktop computers, notebook computers, vehicle-mounted terminals, wearable devices, all-in-one machines, and smart home devices, or may be devices simulated by a virtual machine or an emulator.
  • the server 203 may be a device with storage and computing functions such as a cloud server or a server cluster.
  • the first user can log in to a specific service platform on the first electronic device 201
  • the second user can log in to the same specific service platform on the second electronic device 202 .
  • the second user can use the second electronic device 202 to send a request to the first user through the server 203 of the specific social platform.
  • the specific social platform may be a specific application or a specific website with social functions.
  • the server 203 may send the candidate multimedia data including the target multimedia data P22 to the first electronic device 201. If the first electronic device 201 determines that the target multimedia data P22 matches the to-be-matched multimedia data P21 obtained after special effect processing, it can generate synthesized multimedia data P23 and send the synthesized multimedia data P23 to the second electronic device 202 through the server 203.
  • the server 203 receives the special-effect-edited multimedia data P21 sent by the first user through the first electronic device 201 and the special-effect-edited multimedia data P22 sent by the second user through the second electronic device 202.
  • if the multimedia data P22 matches the to-be-matched multimedia data P21, the multimedia data P22 is sent to the first electronic device 201 as the target multimedia data.
  • after the first electronic device 201 generates the synthesized multimedia data P23, it sends the synthesized multimedia data P23 to the second electronic device 202 through the server 203.
  • Fig. 3 shows a schematic flowchart of a multimedia display method provided by an embodiment of the present disclosure.
  • the multimedia display method may be executed by an electronic device.
  • the electronic device may include, but is not limited to, mobile terminals such as a mobile phone, notebook computer, digital broadcast receiver, PDA (personal digital assistant), PAD (tablet computer), PMP (portable multimedia player), vehicle terminal (such as a vehicle navigation terminal), and wearable device, and fixed terminals such as a digital TV, desktop computer, and smart home device.
  • the multimedia display method may include the following steps.
  • the user may trigger a related operation in the target application program when the user wants to perform special effect editing on an image, and the electronic device may receive the original multimedia data in response to the triggered operation.
  • the target application program may be a social platform or a video publishing platform.
  • the original multimedia data may be multimedia data including visual information, such as video data or image data.
  • the original multimedia data may be collected by the user in real time.
  • the above-mentioned related operation may be an operation of opening the shooting page by the user.
  • the above related operation may be a shooting operation performed by the user on the shooting page.
  • the above related operations may be triggered by the user on the multimedia synthesis function on the live broadcast page or the shooting page.
  • the original multimedia data may be locally stored by the electronic device.
  • the relevant operation may be the user's selection operation on an image or video in the electronic album.
  • the original multimedia data may be downloaded by the user.
  • the related operation may be a user's download operation for images or videos in a download page of a browser, a target application program, or a third-party application program.
  • the original multimedia data may be sent to the electronic device by other devices.
  • after the electronic device receives multimedia data sent by another device, it can use that data as the original multimedia data.
  • the original multimedia data may be multimedia data including target objects such as people, animals, plants or objects.
  • the original multimedia data may be a user's selfie, a user's selfie video, and the like.
  • the original multimedia data may include a partial image or an overall image of the target object.
  • the original multimedia data may only include a person's face image, or include images of a face and other body parts.
  • the electronic device may perform special effect editing on the original multimedia data in response to the user's trigger operation of the special effect editing function or the multimedia co-shooting function to obtain the multimedia data to be matched.
  • special effect editing can change the features of the target object itself in the original multimedia data by adding features or replacing original features, or can be used to change the features of the accessory parts of the target object.
  • special effect editing may be to perform special effect editing on the original multimedia image by using at least one of special effect editing tools such as beautification, image modification, special effect props, filters, image style transfer tools, and stickers.
  • the special effect editing tool may be provided by a target application program, a third-party application program, a web page, and the like. It should be noted that, in order to facilitate the description of the subsequent part, in the following part of the embodiments of the present disclosure, the target object after the special effect processing in the multimedia data to be matched is referred to as the first special effect object.
  • the electronic device can change the face features of the target object by means of beautification, image modification, special effects and props. For example, you can adjust the features of the target object's facial contours, eyes, skin, nose, mouth, etc.
  • features such as height, overall or partial fatness and thinness of the target object can be changed through functions such as beautification, image modification, and special effects props.
  • the features of accessory components such as clothing, headgear, glasses, makeup, masks, and facial special effects that do not change the original features of the face can be added or changed through texture maps, special effect props, etc.
  • the facial special effects that do not change the features of the original parts of the face may include animal beards and the like.
  • the image style of the entire original multimedia data, or the overall or partial image style of the target object can be style transferred by using an image style transfer tool or filter.
  • the image style of the original multimedia data can be converted into an animation style, and correspondingly, the target object in the original multimedia data becomes a cartoon character.
  • a target special effect template may be selected from multiple selectable special effect templates of the special effect editing tool to perform special effect editing on the original multimedia data.
  • the target special effect template can be used to perform static or dynamic special effect editing on the partial image or the overall image of the original image to generate the multimedia data to be matched in image or video format.
  • if the original multimedia data is a video, one or more key video frames can be extracted from the original video, and the target special effect template can be used to perform static or dynamic special effect editing on the partial image or the overall image of the key video frames to generate the multimedia data to be matched in image or video format. The key video frame may be a video frame containing the target object in the original video.
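  • The video case above (select key frames that contain the target object, then apply the target special effect template to them) can be sketched as follows. This is an illustrative Python sketch only: the dictionary-based frame record, the `has_target` flag, and the idea of recording a template name in an `effects` list are assumptions made for the example; real detection and rendering are outside its scope.

```python
from typing import Callable, Dict, List

Frame = Dict[str, object]  # hypothetical frame record, e.g. {"index": 3, "has_target": True}


def extract_key_frames(
    frames: List[Frame], contains_target: Callable[[Frame], bool]
) -> List[Frame]:
    """Keep only frames in which the target object appears."""
    return [frame for frame in frames if contains_target(frame)]


def apply_template(frame: Frame, template_name: str) -> Frame:
    """Return an edited copy of the frame; the original frame is left untouched."""
    edited = dict(frame)
    edited["effects"] = list(frame.get("effects", [])) + [template_name]
    return edited


def edit_video(
    frames: List[Frame],
    template_name: str,
    contains_target: Callable[[Frame], bool],
) -> List[Frame]:
    """Apply the target special effect template to every key frame."""
    key_frames = extract_key_frames(frames, contains_target)
    return [apply_template(frame, template_name) for frame in key_frames]
```

  • The returned list can stand for either an image result (a single edited key frame) or a video result (several edited key frames), matching the two output formats mentioned in the text.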
  • S120 may include at least the following two implementation manners.
  • S120 may specifically include: in response to the template selection operation on the target special effect template, performing special effect editing on the original multimedia data based on the target special effect template to obtain the multimedia data to be matched.
  • the electronic device may, in response to the user's trigger operation of selecting a target special effect template from multiple optional special effect templates in the special effect editing tool, use the target special effect template selected by the user to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched.
  • S120 may specifically include: performing special effect editing on the original multimedia data based on the target special effect template corresponding to the original multimedia data, to obtain the multimedia data to be matched.
  • the target special effect template can be directly used to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched.
  • an appropriate special effect template can be matched as the target special effect template.
  • multimedia data can be photographed in the template, and correspondingly, the multimedia data to be matched is directly displayed on the photographing interface.
  • the electronic device may add the target image part of the multimedia data to be matched and the target image part of the target multimedia data, directly or after a certain conversion, to the target image area in the target multimedia template by splicing or image fusion to obtain composite multimedia data, so that the composite multimedia data can simultaneously have at least some of the features of the first special effect role in the multimedia data to be matched, the second special effect role in the target multimedia data, and the target multimedia template.
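  • As a rough illustration of adding two image parts to target areas of a template by image fusion, the Python sketch below alpha-blends small grayscale patches into a copy of a template. The nested-list image representation, the `(top, left)` placement, and the single `alpha` weight are assumptions chosen to keep the example dependency-free; the disclosure does not specify a particular fusion method.

```python
from typing import List, Tuple

Image = List[List[float]]  # hypothetical grayscale image: rows of pixel values


def fuse_into_template(
    template: Image, patch: Image, top: int, left: int, alpha: float = 1.0
) -> Image:
    """Blend `patch` into a copy of `template` with its corner at (top, left).

    alpha=1.0 pastes the patch (plain splicing); smaller values mix it
    with the template pixels underneath.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    out = [row[:] for row in template]
    for r, patch_row in enumerate(patch):
        for c, value in enumerate(patch_row):
            y, x = top + r, left + c
            # Pixels that fall outside the template are simply skipped.
            if 0 <= y < len(out) and 0 <= x < len(out[y]):
                out[y][x] = alpha * value + (1.0 - alpha) * out[y][x]
    return out


def compose(
    template: Image,
    first_patch: Image,
    second_patch: Image,
    first_pos: Tuple[int, int],
    second_pos: Tuple[int, int],
    alpha: float = 1.0,
) -> Image:
    """Place both special effect image parts into one scene template."""
    out = fuse_into_template(template, first_patch, *first_pos, alpha=alpha)
    return fuse_into_template(out, second_patch, *second_pos, alpha=alpha)
```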
  • before introducing the synthesized multimedia data, the following part of the embodiments of the present disclosure will specifically describe the first multimedia feature of the multimedia data to be matched and the target multimedia data.
  • the first multimedia feature of the multimedia data to be matched may be a feature of the first special effect character itself or a feature of an accessory.
  • the characteristics of the first special effects character may include facial features or body characteristics such as height, fatness and thinness of the first special effects character.
  • the facial features may include face-shape features such as head aspect ratio, face shape, ratio of chin to head width, and ratio of forehead length to head length.
  • eye features such as eye size, distance between the eyes, pupil color, pupil size, and eye shape may be included.
  • nose features such as nose length, nose wing width, nose bridge height, and nose bridge width may be included.
  • hair features such as hair length, hair color, and hair shape (curly or straight) may be included.
  • skin information such as skin color and skin roughness may be included.
  • the features of the accessory components may include whether the object wears glasses, a mask, accessories, headgear, or makeup, whether it has facial special effects that do not change the features of the original parts of the face, and the like.
  • if the first special effect object wears an accessory, specific features of the accessory may be included. For example, if a mask is worn, the first multimedia feature may also include the model and name of the mask.
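  • One way to picture the first multimedia feature described above is as a record grouped by the listed dimensions (face, eyes, nose, hair, skin, body, accessories). The Python sketch below is only an illustration: the class names, field names, and the free-form dictionaries are hypothetical and are not taken from the disclosure.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Union

Value = Union[float, str]


@dataclass
class Accessory:
    """Hypothetical accessory record, e.g. a mask with its model and name."""
    kind: str
    model: Optional[str] = None
    name: Optional[str] = None


@dataclass
class FirstMultimediaFeature:
    """Hypothetical container for features of the first special effect object."""
    face: Dict[str, Value] = field(default_factory=dict)   # e.g. head aspect ratio
    eyes: Dict[str, Value] = field(default_factory=dict)   # e.g. pupil color
    nose: Dict[str, Value] = field(default_factory=dict)   # e.g. nose bridge height
    hair: Dict[str, Value] = field(default_factory=dict)   # e.g. hair color
    skin: Dict[str, Value] = field(default_factory=dict)   # e.g. skin color
    body: Dict[str, Value] = field(default_factory=dict)   # e.g. height
    accessories: List[Accessory] = field(default_factory=list)

    def wears(self, kind: str) -> bool:
        """True if an accessory of the given kind is present."""
        return any(item.kind == kind for item in self.accessories)
```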
  • target multimedia data having the same or matching features with the multimedia data to be matched can be obtained through matching.
  • the target multimedia data may include an object edited with special effects.
  • the object in the target multimedia data and the target object in the original multimedia data may be different objects.
  • the object in the target multimedia data may be the image of the second user edited with special effects
  • the target object in the multimedia data to be matched may be the image of the first user edited with special effects.
  • the edited object in the target multimedia data may be referred to as a second special effect object.
  • the target multimedia data may be pre-stored data in a target application program, a third-party application program, or a multimedia database of a web page.
  • the target multimedia data may be multimedia data uploaded by other users and edited with special effects.
  • the multimedia data to be matched and the target multimedia data may be the same type of multimedia data.
  • both are images, or both are videos.
  • the multimedia data to be matched and the target multimedia data may be different types of multimedia data.
  • One of the two is an image, and the other of the two is a video.
  • next, the embodiment of the present disclosure will describe the synthesized multimedia data.
  • the synthesized multimedia data can be used to present the interactive behavior or interaction between the first special effect role in the multimedia data to be matched and the second special effect role in the target multimedia data in the scene corresponding to the target multimedia template.
  • the target multimedia template may be an image scene template or a video scene template, and its specific type is not limited.
  • the electronic device may generate synthesized multimedia data based on the user's operation of selecting a target multimedia template from multiple optional scene templates.
  • the optional scene template may be obtained from a scene template library of a target application program, a third-party application program, or a web page.
  • the electronic device may determine a matching target scene template according to features of the first special effect object in the multimedia data to be matched and features of the second special effect object in the target multimedia data.
  • the feature of the first special effect object and the feature of the second special effect object may be their actions.
  • if the action of the first special effect object is a toast and the action of the second special effect object is also a toast, the partial or overall images of the first special effect object and the second special effect object can be added to scene templates such as parties and bars to generate synthesized multimedia data such as a toast (cheers) scene.
  • if the action of the first special effect object is to be carried in a princess hug and the action of the second special effect object is to hug a person, the partial or overall images of the first special effect object and the second special effect object can be added to romantic scene templates such as weddings and beautiful skies to generate synthesized multimedia data such as a spinning princess hug.
  • the partial or overall features of the first special effect object and the second special effect object can be added to scene templates such as sports fields to generate synthesized multimedia data such as football matches.
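  • The action-based choice of scene template in the toast and princess-hug examples can be sketched as a lookup from an action pair to candidate templates. In the Python sketch below, the action and template identifiers, the `SCENE_RULES` table, and the decision to also try the reversed pair are assumptions for illustration; the sports-field example is left out because the text does not state which actions trigger it.

```python
from typing import Dict, List, Tuple

# Hypothetical rule table: (first action, second action) -> scene templates.
SCENE_RULES: Dict[Tuple[str, str], List[str]] = {
    ("toast", "toast"): ["party", "bar"],
    ("princess_hug_carried", "hug"): ["wedding", "beautiful_sky"],
}


def choose_scene_templates(first_action: str, second_action: str) -> List[str]:
    """Return scene templates for the two actions, or an empty list if none apply.

    Assumption: which user performs which action does not matter, so the
    reversed pair is tried as well.
    """
    for key in ((first_action, second_action), (second_action, first_action)):
        if key in SCENE_RULES:
            return list(SCENE_RULES[key])
    return []
```

  • An empty result would correspond to falling back on a template selected or uploaded by the user, as described elsewhere in the text.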
  • the electronic device may generate synthesized multimedia data based on a target multimedia template uploaded by a user.
  • text, music, special effects, etc. may also be added to the synthesized multimedia data in order to improve interest.
  • the electronic device may display the synthesized multimedia data in response to a user's synthesis operation on the multimedia data or a user's trigger operation for displaying the synthesized multimedia data.
  • the electronic device does not need to respond to the trigger operation, and directly displays the synthesized multimedia data on a relevant interface after generating the synthesized multimedia data.
  • the multimedia display method, apparatus, device and medium of the embodiments of the present disclosure perform special effect editing on the received original multimedia data, and then generate and display synthesized multimedia data based on the edited multimedia data to be matched and the target multimedia data. Since the target multimedia data is obtained by matching based on the first multimedia feature of the multimedia data to be matched, the synthesized multimedia data obtained from the original multimedia data may include, in addition to the special effects in the multimedia data to be matched, the content of the target multimedia data that matches the multimedia data to be matched. The multimedia picture therefore contains multiple elements, which enriches the beautification effect of the multimedia data, makes the multimedia display more fun, and enables users to interact through multimedia data, thereby diversifying the interactions between users and improving the user experience.
  • Fig. 4 shows a schematic diagram of a shooting preview interface provided by an embodiment of the present disclosure.
  • the electronic device can display a target object 41 in the shooting preview interface 40, and various special effect editing tools such as a filter tool 401, a beauty tool 402, and a special effect tool 403, and can also display a multimedia synthesis tool 404.
  • the filter tool 401, the beauty tool 402, and the special effect tool 403 may each include one or more special effect templates.
  • FIG. 5 shows a schematic diagram of a special effect editing interface provided by an embodiment of the present disclosure.
  • FIG. 6 shows a schematic diagram of a display interface of multimedia data to be matched provided by an embodiment of the present disclosure.
  • the display interface 60 of the multimedia data to be matched may include a first special effect character 61 after special effect processing and a multimedia synthesis tool 404 .
  • the matching step of the multimedia data can be performed by the electronic device or the server.
  • Fig. 7 shows a schematic diagram of matching logic of multimedia data provided by an embodiment of the present disclosure.
  • the generated composite multimedia data is shown in FIG. 8 .
  • FIG. 8 shows a schematic diagram of a display interface for synthesizing multimedia data provided by an embodiment of the present disclosure.
  • the synthesized multimedia data P81 may present a scene in which the first special effect character 61 and the second special effect character 73 toast in the masquerade scene in the form of images or videos.
  • FIG. 9 shows a schematic flowchart of another multimedia display method provided by the embodiments of the present disclosure.
  • the multimedia display method may be executed by an electronic device.
  • the electronic device may include, but is not limited to, mobile terminals such as a mobile phone, notebook computer, digital broadcast receiver, PDA (personal digital assistant), PAD (tablet computer), PMP (portable multimedia player), vehicle terminal (such as a vehicle navigation terminal), and wearable device, and fixed terminals such as a digital TV, desktop computer, and smart home device.
  • the multimedia display method may include the following steps.
  • S910 Receive original multimedia data.
  • the specific content of S910 is similar to the specific content of S310, which will not be repeated here.
  • S920 performing special effect editing on the original multimedia data to obtain the multimedia data to be matched.
  • the specific content of S920 is similar to the specific content of S320, which will not be repeated here.
  • image feature extraction technology or video frame feature extraction technology may be used to extract the first multimedia feature from the multimedia data to be matched.
  • candidate multimedia data may be obtained from a multimedia database of a target application, a third-party application, or a webpage.
  • the candidate multimedia data may be pre-stored data in a multimedia database.
  • the candidate multimedia data may be multimedia data uploaded by other users and edited with special effects.
  • target multimedia data can be determined from multiple candidate multimedia data by means of feature matching.
  • S950 may specifically include the following steps.
  • Step A1 determining at least one feature tag corresponding to the first multimedia feature.
  • the feature label may be a label obtained by classifying the first special effect object from one or more dimensions based on one or a type of first multimedia feature.
  • the feature tag can classify the first special effect object from the dimension of the first special effect object itself or the accessory component.
  • the feature tags of the first special effect object may include tags for characterizing the nose, eyes, gender, action, skin state, etc.
  • the first special effect object can be classified from the characteristics of the character itself.
  • the attached tags of the first special effect object may include whether to wear glasses, whether to wear a mask, whether to wear makeup, and so on.
  • Step A2 for each candidate multimedia data, determine a common label that is the same as at least one feature label.
  • the same label may be used as a common label of the multimedia data to be matched and the candidate multimedia data.
  • user A's tags include wearing glasses, tall nose, yellow skin, tall, and female; user B's tags include no glasses, small mouth, thin, and male.
  • the common tags of the two may include glasses tags (wearing glasses or not wearing glasses), gender tags (male or female).
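  • The common-tag determination in Step A2 can be sketched as follows. This is a minimal illustration, assuming each item's feature tags are stored as a mapping from a tag category (for example "glasses") to a value; the category names and the function name `common_tag_categories` are hypothetical and do not come from the disclosure.

```python
def common_tag_categories(tags_a: dict[str, str], tags_b: dict[str, str]) -> set[str]:
    """Return the tag categories that appear in both tag sets."""
    return set(tags_a) & set(tags_b)


# Tags of user A and user B from the example above (category names are assumed).
user_a = {"glasses": "wearing", "nose": "tall", "skin": "yellow",
          "height": "tall", "gender": "female"}
user_b = {"glasses": "none", "mouth": "small", "build": "thin", "gender": "male"}

common = common_tag_categories(user_a, user_b)
print(sorted(common))
```

  • Under this representation the values may differ (wearing glasses versus no glasses); only the shared category makes a tag common, which matches the glasses and gender example.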
  • Step A3 calculating the tag matching score between the multimedia data to be matched and each candidate multimedia data according to the weight values corresponding to the common tags.
  • the weight value of the common tag may be preset.
  • the weight value of the common tag may be set according to the user's selection. For tags that the user does not pay attention to, a low weight value a can be set. For tags that the user values or pays attention to (for example, if the user cares whether other characters wear glasses or have ponytails), a high weight value b can be set for the glasses tag and the hairstyle tag. The weight value b is greater than the weight value a.
  • a high weight value c can be set for tags that the user is interested in, and a low weight value d can be set for tags that the user dislikes. Wherein, the weight value c is greater than the weight value a, and the weight value a is greater than the weight value d.
  • the tag matching score is used to reflect the degree of matching between each candidate multimedia data and the multimedia data to be matched in terms of a feature or a type of feature corresponding to the tag.
  • a tag score of the multimedia data corresponding to the tag may be generated according to the feature. For example, for the glasses tag, if the first special effect object in the multimedia data to be matched wears glasses, the tag score of the glasses tag can be 100; if the first special effect object does not wear glasses, the tag score of the glasses tag can be 0.
  • the calculation method of the tag score of the candidate multimedia data is the same as that of the multimedia data to be matched, and will not be repeated here.
  • the similarity score of the two can be calculated according to the tag scores of the common tags of the two. Then calculate the label matching score of the two according to the similarity score and the weight value between the two.
  • the closeness of the tag scores of each candidate multimedia data and the multimedia data to be matched is positively correlated with the similarity score between the two. That is to say, the closer the tag scores of each candidate multimedia data and the multimedia data to be matched are, the higher the similarity score between them will be. For example, if both are wearing glasses, their similarity score is high.
  • the similarity score corresponding to this type of feature label may be equal to the preset value minus the target label score difference.
  • the target label score difference is the difference between the label score of each candidate multimedia data in this category of labels and the label score of the multimedia data to be matched in this category of labels.
  • the closeness of the label scores of each candidate multimedia data to the multimedia data to be matched is negatively correlated with the similarity score between the two. That is to say, the greater the tag score difference between each candidate multimedia data and the multimedia data to be matched, the lower the similarity score between the two. For example, if both genders are the same, their similarity score is low. If the two genders are opposite, their similarity score is high. Exemplarily, the similarity score corresponding to this type of feature label may be equal to the target label score difference.
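  • The two similarity rules described above can be sketched as follows. This is an illustration only: it assumes the preset value equals the maximum tag score of 100, and it takes the target label score difference as an absolute difference, neither of which is stated explicitly in the disclosure. The function names are hypothetical.

```python
PRESET_VALUE = 100  # assumption: the preset value equals the maximum tag score


def glasses_tag_score(wears_glasses: bool) -> int:
    """Tag score from the glasses example: 100 if glasses are worn, else 0."""
    return 100 if wears_glasses else 0


def similarity_score(score_to_match: float, score_candidate: float,
                     positively_correlated: bool = True) -> float:
    """Similarity score for one common tag category.

    positively_correlated=True : preset value minus the tag score difference.
    positively_correlated=False: the tag score difference itself.
    """
    difference = abs(score_candidate - score_to_match)  # assumed absolute difference
    if positively_correlated:
        return PRESET_VALUE - difference
    return difference


both_wear_glasses = similarity_score(glasses_tag_score(True), glasses_tag_score(True))
same_gender = similarity_score(100, 100, positively_correlated=False)
opposite_gender = similarity_score(100, 0, positively_correlated=False)
```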
  • tag scores for multiple candidate multimedia data can be recorded in a matching table.
  • after the electronic device acquires the feature tags of the multimedia data to be matched and the tag score of each feature tag, it calculates the tag matching scores between the multimedia data to be matched and each candidate multimedia data based on the above calculation method, so as to find the target multimedia data from the matching table.
  • the tag matching score of each tag may be obtained according to the weight value of the tag and the feature matching score between each candidate multimedia data and the multimedia data to be matched.
  • the tag matching score of each tag may be equal to the product of the weight value of the tag and the feature matching score between each candidate multimedia data and the multimedia data to be matched.
  • the similarity between each candidate multimedia data and the multimedia data to be matched is positively correlated with the feature matching score between the two. That is to say, the higher the similarity between each candidate multimedia data and the multimedia data to be matched, the higher the feature matching score between the two. For example, if both wear glasses, the feature matching score is high.
  • the similarity between each candidate multimedia data and the multimedia data to be matched is negatively correlated with the feature matching score between the two. That is to say, the lower the similarity between each candidate multimedia data and the multimedia data to be matched, the higher the feature matching score between the two. For example, if both genders are the same, their feature matching score will be low. If the two genders are opposite, the feature matching score is high.
  • correlation between the similarity of each feature tag and the feature matching score can be set according to actual scenarios and specific requirements, and there is no specific limitation on this.
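  • The weighted calculation described above (tag matching score equal to the weight value multiplied by the feature matching score) can be sketched as follows. Summing the per-tag scores into a single score per candidate is an assumption made for this illustration; the disclosure does not fix how per-tag scores are combined. The weights 3 and 1 stand in for the high weight value b and the low weight value a.

```python
def tag_matching_scores(weights: dict[str, float],
                        feature_scores: dict[str, float]) -> dict[str, float]:
    """Per-tag matching score: weight value times feature matching score."""
    return {tag: weights[tag] * score for tag, score in feature_scores.items()}


def total_matching_score(weights: dict[str, float],
                         feature_scores: dict[str, float]) -> float:
    """Assumed aggregation: sum of the per-tag matching scores."""
    return sum(tag_matching_scores(weights, feature_scores).values())


# Hypothetical values: the user cares about glasses (b = 3) more than gender (a = 1).
weights = {"glasses": 3, "gender": 1}
feature_scores = {"glasses": 100, "gender": 0}

per_tag = tag_matching_scores(weights, feature_scores)
total = total_matching_score(weights, feature_scores)
```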
  • Step A4 sorting the tag matching scores of the candidate multimedia data to determine the target multimedia data.
  • the candidate multimedia data may be sorted by tag matching score from high to low, and the candidate multimedia data with the highest score is used as the target multimedia data.
  • the tag matching scores between the multimedia data to be matched and multiple candidate multimedia data can be recorded in the matching table in ascending order or in descending order.
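  • Step A4 can be sketched as a sort over the tag matching scores. The candidate identifiers and score values below are made up for illustration, and the matching table is modelled simply as a list of (candidate, score) pairs.

```python
def rank_candidates(scores: dict[str, float]) -> list[tuple[str, float]]:
    """Return (candidate, score) pairs sorted from the highest score to the lowest."""
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical tag matching scores for three candidates.
matching_table = rank_candidates({"cand_1": 120, "cand_2": 300, "cand_3": 45})
target = matching_table[0][0]  # the candidate with the highest score
```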
  • the target multimedia data may also satisfy one or more of the following conditions.
  • the special effect editing method of the target multimedia data is the same as the special effect editing method of the to-be-matched multimedia data.
  • the special effect editing methods of the two are the same.
  • the target multimedia data and the multimedia data to be matched are edited with special effects of adding masks
  • the special effect editing methods of the two are the same.
  • the target multimedia data adopts the first special effect
  • the to-be-matched multimedia data adopts the second special effect
  • the two special effect editing methods are the same.
  • the user to which the target multimedia data belongs is an online user. That is to say, if the user opens the interface of the target application program through the electronic device, or the target application program is running in the background on the electronic device, the user is considered to be an online user.
  • the location distance between the distribution location of the target multimedia data and the distribution location of the original multimedia data is less than or equal to a preset distance threshold.
  • the distance threshold may be a default value set by the system, or may be a target distance threshold selected by the user from multiple optional distance thresholds. Alternatively, if the user to which the target multimedia data belongs is in the same region as the user to which the original multimedia data belongs, such as the same district, city, or province, the distance between the two locations is considered to be less than or equal to a preset distance threshold. Wherein, the distance threshold may be set according to an actual situation or a specific scene, which is not limited.
  • Condition C4: the historical matching times of the target multimedia data satisfy the preset times filtering condition.
  • the frequency filtering condition may be that the historical matching frequency is within a preset value range.
  • the value range of the preset number of times may be the value set by the system by default, or may be the value range of the target number of times selected by the user from multiple optional value ranges of the number of times.
  • the target multimedia data can be selected from the candidate multimedia data through at least one of the above conditions C1 to C4.
  • the multiple target multimedia data can be further screened through at least one of the above conditions C1 to C4 , to obtain the target multimedia data.
  • At least one of the above conditions C1 to C4 may be directly used to screen out target multimedia data from multiple candidate multimedia data.
  • the candidate multimedia data may be obtained after screening by at least one of the above conditions C1 to C4.
  • the candidate multimedia data can be screened condition by condition according to a preset condition usage order, and the target multimedia data is obtained after screening with the last condition.
  • the target multimedia data can be obtained.
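  • Screening candidates with the conditions in a preset order can be sketched as follows. The `Candidate` fields, the 50 km threshold, the 0 to 10 range of historical matching times, and the "mask" effect name are all assumptions chosen for illustration; the numbering C1 to C3 follows the order in which the conditions are listed above.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Candidate:
    name: str
    effect: str              # special effect editing method
    owner_online: bool       # whether the owning user is online
    distance_km: float       # distance to the original data's distribution location
    historical_matches: int  # historical matching times


def screen(candidates: list[Candidate],
           conditions: list[Callable[[Candidate], bool]]) -> list[Candidate]:
    """Apply each condition in order, keeping only candidates that pass."""
    remaining = list(candidates)
    for condition in conditions:
        remaining = [c for c in remaining if condition(c)]
    return remaining


conditions = [
    lambda c: c.effect == "mask",               # C1: same special effect editing method
    lambda c: c.owner_online,                   # C2: owning user is online
    lambda c: c.distance_km <= 50.0,            # C3: within the distance threshold
    lambda c: 0 <= c.historical_matches <= 10,  # C4: matching times within range
]

candidates = [
    Candidate("cand_1", "mask", True, 12.0, 3),
    Candidate("cand_2", "mask", False, 5.0, 1),
    Candidate("cand_3", "filter", True, 2.0, 0),
    Candidate("cand_4", "mask", True, 80.0, 2),
]
targets = screen(candidates, conditions)
```

  • Passing only a subset of the conditions to `screen` corresponds to using at least one of C1 to C4 rather than all of them.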
  • the target multimedia data is also obtained by matching the second multimedia feature of the original multimedia data.
  • the target multimedia data can be matched from the candidate multimedia data by using the second multimedia feature of the original multimedia data.
  • the multiple target multimedia data can be further screened through the second multimedia feature of the original multimedia data to obtain the target multimedia data.
  • the second multimedia feature may be a multimedia feature of the target object in the original multimedia data.
  • the second multimedia feature is similar to the above-mentioned first multimedia feature, and the method of using the second multimedia feature to query the target multimedia data is similar to the method of using the first multimedia feature to query the target multimedia data, which will not be repeated here.
  • S960 Generate composite multimedia data based on the multimedia data to be matched and the target multimedia data.
  • the target multimedia data is obtained by matching according to the first multimedia feature of the multimedia data to be matched.
  • the specific content of the S960 is similar to that of the S330, which will not be repeated here.
  • S970 displaying the synthesized multimedia data.
  • the specific content of the S970 is similar to that of the S340, and will not be repeated here.
  • the multimedia display method of the embodiment of the present disclosure can use the first multimedia feature of the multimedia data to be matched to accurately match the target multimedia data with the same feature from multiple candidate multimedia data, so that the generated composite multimedia data includes a first special effect object and a second special effect object with a high feature matching degree, which improves the interest of the multimedia display method.
  • FIG. 10 shows a schematic flowchart of another multimedia display method provided by the embodiments of the present disclosure.
  • the multimedia display method may be executed by an electronic device.
  • the electronic device may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle terminals (such as vehicle navigation terminals), and wearable devices, as well as fixed terminals such as digital TVs, desktop computers, and smart home devices.
  • the multimedia display method may include the following steps.
  • S1010 Receive original multimedia data.
  • the specific content of S1010 is similar to the specific content of S310, which will not be repeated here.
  • S1020 Perform special effect editing on the original multimedia data to obtain multimedia data to be matched.
  • the specific content of S1020 is similar to the specific content of S320, which will not be repeated here.
  • S1030 Generate composite multimedia data based on the multimedia data to be matched and the target multimedia data, where the target multimedia data is obtained by matching according to the first multimedia feature of the multimedia data to be matched.
  • the specific content of S1030 is similar to the specific content of S330, which will not be repeated here.
  • S1040 displaying the synthesized multimedia data.
  • the specific content of S1040 is similar to the specific content of S340, which will not be repeated here.
  • when the user wants to interact with the user to whom the target multimedia data belongs, the user may perform a trigger operation on the synthesized multimedia data.
  • the triggering operation may be triggered when the synthesized multimedia data is generated, or after the synthesized multimedia data is previewed, and its trigger timing is not limited.
  • the electronic device may distribute the synthesized multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs through the server.
  • the electronic device can display the original multimedia data and city multimedia data in the image/video favorites or display column of the target application program of the user to which the original multimedia data belongs and the user to which the target multimedia data belongs. A mark is added to the corresponding icon to prompt the user to view the synthesized multimedia data.
  • S1050 may include the following steps.
  • Step D1 sending first prompting information to the user to which the original multimedia data belongs, the first prompting information is used to trigger display of the synthesized multimedia data and display the social home page of the user to which the target multimedia data belongs.
  • the first prompt information can be released in the form of text, picture, voice, etc. through a chat box, a display window on the interface, or a broadcast column on the interface.
  • the specific form of the first prompt message may be "You just participated in a masquerade party with XXX (the scene corresponding to the synthesized multimedia video), go to TA's homepage to see/chat with TA"
  • the first prompt information may include links such as text/QR code of the synthesized multimedia data display interface, or the user can jump to the synthesized multimedia display interface by triggering the information bar of the first prompt information.
  • the first prompt information may also include links such as text/QR code of the user to which the target multimedia data belongs.
  • the synthesized multimedia data display interface may include a control for accessing the user's social homepage to which the target multimedia data belongs, or the synthesized multimedia data display interface may include a control for adding friends of the user to whom the target multimedia data belongs, or, the synthesized multimedia data display interface may A control for establishing a chat with the user to whom the target multimedia data belongs may be included.
  • Step D2 sending second prompting information to the user to whom the target multimedia data belongs, the second prompting information being used to trigger playing the composite multimedia data and displaying the social homepage of the user to which the original multimedia data belongs.
  • the second prompt information is similar to the first prompt information, which will not be repeated here.
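  • Steps D1 and D2 can be sketched as building two symmetric prompts, one per user. The `PromptInfo` structure and the `app://` link scheme are hypothetical placeholders for whatever link format the target application uses; the prompt text follows the example given above.

```python
from dataclasses import dataclass


@dataclass
class PromptInfo:
    recipient: str
    text: str
    composite_link: str  # opens the synthesized multimedia data display interface
    homepage_link: str   # opens the other user's social home page


def build_prompts(original_user: str, target_user: str,
                  scene: str, composite_id: str) -> tuple[PromptInfo, PromptInfo]:
    """Return (first prompt for the original user, second prompt for the target user)."""
    composite_link = f"app://composite/{composite_id}"  # hypothetical link scheme

    def prompt(recipient: str, other: str) -> PromptInfo:
        return PromptInfo(
            recipient=recipient,
            text=(f"You just participated in a {scene} with {other}, "
                  "go to TA's homepage to see/chat with TA"),
            composite_link=composite_link,
            homepage_link=f"app://user/{other}",
        )

    return prompt(original_user, target_user), prompt(target_user, original_user)


first, second = build_prompts("user_a", "user_b", "masquerade party", "P81")
```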
  • the synthesized multimedia data can be released to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs, so that interaction between the two users can be realized through the synthesized multimedia data, which improves the interest of multimedia data display and the user experience.
  • Fig. 11 shows a schematic flowchart of a multimedia matching method provided by an embodiment of the present disclosure.
  • the multimedia matching method may be executed by a server.
  • the server may be a device with storage and computing functions such as a cloud server or a server cluster.
  • the multimedia matching method may include the following steps.
  • S1140 may include:
  • For each candidate multimedia data determine a common label identical to at least one feature label
  • the tag matching scores of the candidate multimedia data are sorted to determine the target multimedia data.
  • multimedia matching method shown in S1110 to S1140 is similar to the multimedia display method shown in conjunction with S910 to S970 above, and will not be repeated here.
  • the multimedia matching method may further include: receiving a publishing instruction of synthesized multimedia data, and distributing the synthesized multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs.
  • the publishing instruction is generated by the electronic device after detecting a trigger operation on the synthesized multimedia data. Wherein, this step is similar to S1050 and will not be repeated here.
  • the multimedia matching method may further include:
  • the first prompt information is sent to the user to which the original multimedia data belongs, and the first prompt information is used to trigger the display of the composite multimedia data and display the social home page of the user to which the target multimedia data belongs.
  • this step is similar to the above step D1 and will not be repeated here.
  • the second prompt information is sent to the user to which the target multimedia data belongs, and the second prompt information is used to trigger the playing of the composite multimedia data and display the social home page of the user to which the original multimedia data belongs.
  • this step is similar to the above step D2 and will not be repeated here.
  • the synthesized multimedia data obtained based on the original multimedia data may also include the content of the target multimedia data that matches the multimedia data to be matched, so that the multimedia data image has multiple elements, which enriches the beautification effect of the multimedia data, improves the fun of multimedia data display, enables users to interact through multimedia data, realizes diversified interaction between users, and improves the user experience.
  • An embodiment of the present disclosure also provides a multimedia display device for implementing the above multimedia display method, which will be described below with reference to FIG. 12 .
  • the multimedia display device may be an electronic device, for example, the multimedia display device may be the first electronic device 101 in the client shown in FIG. 1 .
  • electronic devices may include devices with communication functions such as mobile phones, tablet computers, desktop computers, notebook computers, vehicle terminals, wearable electronic devices, all-in-one computers, and smart home devices, and may also be devices simulated by virtual machines or simulators. .
  • FIG. 12 shows a schematic structural diagram of a multimedia display device provided by an embodiment of the present disclosure.
  • the multimedia display device 1200 may include a data receiving unit 1210 , a special effect editing unit 1220 , a data synthesis unit 1230 and a data display unit 1240 .
  • a data receiving unit 1210 configured to receive original multimedia data
  • the special effect editing unit 1220 is configured to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched;
  • the data synthesis unit 1230 is configured to generate synthesized multimedia data based on the multimedia data to be matched and the target multimedia data, and the target multimedia data is obtained by matching the first multimedia feature of the multimedia data to be matched;
  • the data display unit 1240 is configured to display synthesized multimedia data.
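  • The cooperation of the four units can be sketched as a small pipeline. This is only an illustration of the data flow: each unit is modelled as a plain callable, and the string-based stand-ins for multimedia data are assumptions, not the structure of the device 1200.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class MultimediaDisplayDevice:
    receive: Callable[[], str]        # data receiving unit 1210
    edit: Callable[[str], str]        # special effect editing unit 1220
    synthesize: Callable[[str], str]  # data synthesis unit 1230
    display: Callable[[str], None]    # data display unit 1240

    def run(self) -> str:
        """Receive, edit, synthesize, and display, in that order."""
        original = self.receive()
        to_match = self.edit(original)
        composite = self.synthesize(to_match)
        self.display(composite)
        return composite


shown: list[str] = []  # records what the display unit was asked to show
device = MultimediaDisplayDevice(
    receive=lambda: "photo",
    edit=lambda data: f"{data}+mask",
    synthesize=lambda data: f"{data}|target",
    display=shown.append,
)
result = device.run()
```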
  • the multimedia display device of the disclosed embodiment generates and displays synthesized multimedia data based on the edited multimedia data to be matched and the target multimedia data after performing special effect editing on the received original multimedia data. Since the target multimedia data is obtained by matching based on the first multimedia feature of the multimedia data to be matched, in addition to the special effects in the multimedia data to be matched, the synthesized multimedia data obtained based on the original multimedia data may also include the content of the target multimedia data that matches the multimedia data to be matched, so that the multimedia data image has multiple elements, which enriches the beautification effect of the multimedia data, improves the fun of multimedia data display, enables users to interact through multimedia data, realizes diversified interaction between users, and improves the user experience.
  • the special effect editing unit 1220 may be further configured to: in response to the template selection operation on the target special effect template, perform special effect editing on the original multimedia data based on the target special effect template to obtain the multimedia data to be matched;
  • the special effect editing unit 1220 may be further configured to: based on the target special effect template corresponding to the original multimedia data, perform special effect editing on the original multimedia data to obtain the multimedia data to be matched.
  • the multimedia display device 1200 may further include a feature extraction unit, a data acquisition unit, and a data query unit.
  • a feature extraction unit configured to extract a first multimedia feature from the multimedia data to be matched
  • a data acquisition unit configured to acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched
  • the data query unit is configured to query the target multimedia data matching the first multimedia feature among the plurality of candidate multimedia data.
  • the data query unit may be further configured as:
  • For each candidate multimedia data determine a common label identical to at least one feature label
  • the tag matching scores of the candidate multimedia data are sorted to determine the target multimedia data.
  • the target multimedia data satisfies at least one of the following:
  • the special effect editing method of the target multimedia data is the same as the special effect editing method of the multimedia data to be matched;
  • the user to which the target multimedia data belongs is an online user
  • the location distance between the distribution location of the target multimedia data and the distribution location of the original multimedia data is less than or equal to a preset distance threshold
  • the historical matching times of the target multimedia data meet the preset times filtering condition.
  • the target multimedia data is also obtained by matching the second multimedia feature of the original multimedia data.
  • the multimedia display device 1200 may further include a data publishing unit.
  • the data distribution unit is configured to distribute the composite multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs when a trigger operation on the composite multimedia data is detected.
  • the data publishing unit may be further configured as:
  • the first prompt information is used to trigger display of the composite multimedia data and display the social home page of the user to which the target multimedia data belongs;
  • the second prompting information is used to trigger playing the synthesized multimedia data and displaying the social home page of the user to which the original multimedia data belongs.
  • the multimedia display device 1200 shown in FIG. 12 can execute each step in the method embodiments shown in FIG. 3 to FIG. 10, and realize each process and effect in those method embodiments, which will not be repeated here.
  • the multimedia matching device may be a server, for example, the multimedia matching device may be the server 102 shown in FIG. 1 .
  • the server may be a device with storage and computing functions such as a cloud server or a server cluster.
  • Fig. 13 shows a schematic structural diagram of a multimedia matching device provided by an embodiment of the present disclosure.
  • the multimedia matching device 1300 may include a data receiving unit 1310 , a feature extraction unit 1320 , a data obtaining unit 1330 and a data query unit 1340 .
  • the data receiving unit 1310 is configured to receive the multimedia data to be matched, and the multimedia data to be matched is obtained based on special effects processing on the original multimedia data;
  • the feature extraction unit 1320 is configured to extract a first multimedia feature from the multimedia data to be matched
  • the data obtaining unit 1330 is configured to obtain a plurality of candidate multimedia data corresponding to the multimedia data to be matched;
  • the data query unit 1340 is configured to query the target multimedia data matching the first multimedia feature among the plurality of candidate multimedia data, and the target multimedia data is used to generate combined multimedia data with the multimedia data to be matched.
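  • The server-side flow through the four units can be sketched as follows. The matching rule used here (the candidate sharing the most feature tags wins) is a stand-in chosen for brevity, not the scoring method of the disclosure, and the in-memory tag store is hypothetical.

```python
from typing import Callable, Iterable

Feature = frozenset[str]


class MultimediaMatchingDevice:
    def __init__(self, extract: Callable[[str], Feature],
                 obtain_candidates: Callable[[str], Iterable[str]]) -> None:
        self.extract = extract                      # feature extraction unit 1320
        self.obtain_candidates = obtain_candidates  # data obtaining unit 1330

    def match(self, to_match: str) -> str:
        """Receive the data to be matched (unit 1310) and query the target (unit 1340)."""
        feature = self.extract(to_match)
        candidates = self.obtain_candidates(to_match)
        # Stand-in rule: pick the candidate sharing the most feature tags.
        return max(candidates, key=lambda c: len(feature & self.extract(c)))


# Hypothetical tag store standing in for real feature extraction.
tags = {
    "me": frozenset({"glasses", "mask"}),
    "cand_1": frozenset({"mask"}),
    "cand_2": frozenset({"glasses", "mask"}),
}
device = MultimediaMatchingDevice(
    extract=tags.__getitem__,
    obtain_candidates=lambda _data: ["cand_1", "cand_2"],
)
target = device.match("me")
```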
  • the multimedia matching device of the disclosed embodiment generates and displays synthesized multimedia data based on the edited multimedia data to be matched and the target multimedia data after performing special effect editing on the received original multimedia data. Since the target multimedia data is obtained by matching based on the first multimedia feature of the multimedia data to be matched, in addition to the special effects in the multimedia data to be matched, the synthesized multimedia data obtained based on the original multimedia data may also include the content of the target multimedia data that matches the multimedia data to be matched, so that the multimedia data image has multiple elements, which enriches the beautification effect of the multimedia data, improves the fun of multimedia data display, enables users to interact through multimedia data, realizes diversified interaction between users, and improves the user experience.
  • the data query unit 1340 may be further configured to:
  • For each candidate multimedia data determine a common label identical to at least one feature label
  • the tag matching scores of the candidate multimedia data are sorted to determine the target multimedia data.
  • the target multimedia data satisfies at least one of the following:
  • the special effect editing method of the target multimedia data is the same as the special effect editing method of the multimedia data to be matched;
  • the user to which the target multimedia data belongs is an online user
  • the location distance between the distribution location of the target multimedia data and the distribution location of the original multimedia data is less than or equal to a preset distance threshold
  • the historical matching times of the target multimedia data meet the preset times filtering condition.
  • the target multimedia data is also matched according to the second multimedia feature of the original multimedia data.
  • the target multimedia data is also obtained by matching the second multimedia feature of the original multimedia data.
  • the multimedia matching apparatus 1300 may further include a data publishing unit.
  • the data distribution unit is configured to distribute the composite multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs in response to the composite multimedia data distribution instruction.
  • the publishing instruction is generated by the electronic device after detecting a trigger operation on the synthesized multimedia data.
  • the data publishing unit may be further configured as:
  • the first prompt information is used to trigger display of the composite multimedia data and display the social home page of the user to which the target multimedia data belongs;
  • the second prompt information is sent to the user to which the target multimedia data belongs, and the second prompt information is used to trigger the playing of the composite multimedia data and display the social home page of the user to which the original multimedia data belongs.
  • the multimedia matching device 1300 shown in FIG. 13 can execute each step in the method embodiment shown in FIG. 11 , and realize each process and effect in the method embodiment shown in FIG. 11 , which will not be repeated here.
  • An embodiment of the present disclosure also provides a computing device, which may include a processor and a memory, and the memory may be used to store executable instructions.
  • the processor may be configured to read executable instructions from the memory, and execute the executable instructions to implement the multimedia display method and/or the multimedia matching method in the above embodiments.
  • Fig. 14 shows a schematic structural diagram of a computing device provided by an embodiment of the present disclosure. Referring to FIG. 14 in detail below, it shows a schematic structural diagram for implementing a computing device 1400 in an embodiment of the present disclosure.
  • the computing device 1400 in the embodiments of the present disclosure may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (Personal Digital Assistants), PADs (Tablet Computers), PMPs (Portable Multimedia Players), vehicle-mounted terminals (such as vehicle-mounted navigation terminals), and wearable devices, as well as fixed terminals such as digital TVs, desktop computers, and smart home devices.
  • the computing device 1400 in the embodiments of the present disclosure may also include, but is not limited to, devices with storage and computing functions such as cloud servers or server clusters.
  • computing device 1400 shown in FIG. 14 is only an example, and should not limit the functions and scope of use of this embodiment of the present disclosure.
  • the computing device 1400 may include a processing device (such as a central processing unit, a graphics processing unit, etc.) 1401, which may perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 1402 or a program loaded into a random access memory (RAM) 1403. The RAM 1403 also stores various programs and data necessary for the operation of the computing device 1400.
  • the processing device 1401, ROM 1402, and RAM 1403 are connected to each other through a bus 1404.
  • An input/output (I/O) interface 1405 is also connected to the bus 1404 .
  • the following devices can be connected to the I/O interface 1405: input devices 1406 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 1407 including, for example, a liquid crystal display (LCD), speaker, vibrator, etc.; storage devices 1408 including, for example, a magnetic tape, hard disk, etc.; and a communication device 1409.
  • the communication device 1409 may allow the computing device 1400 to communicate with other devices wirelessly or by wire to exchange data. While FIG. 14 shows the computing device 1400 having various devices, it is to be understood that implementing or possessing all of the illustrated devices is not a requirement. More or fewer devices may alternatively be implemented or provided.
  • An embodiment of the present disclosure also provides a computer-readable storage medium, the storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the multimedia display method or the multimedia matching method in the foregoing embodiments.
  • An embodiment of the present disclosure also provides a computer program product, the computer program product may include a computer program, and when the computer program is executed by a processor, the processor is made to implement the video editing method or the video playing method in the foregoing embodiments.
  • embodiments of the present disclosure include a computer program product, which includes a computer program carried on a non-transitory computer readable medium, where the computer program includes program code for executing the method shown in the flowchart.
  • the computer program may be downloaded and installed from a network via the communication device 1409, or installed from the storage device 1408 or from the ROM 1402.
  • When the computer program is executed by the processing device 1401, the above-mentioned functions defined in the multimedia display method or the multimedia matching method of the embodiment of the present disclosure are executed.
  • the above-mentioned computer-readable medium in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium or any combination of the above two.
  • a computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer diskettes, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can send, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • Program code embodied on a computer readable medium may be transmitted by any appropriate medium, including but not limited to wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
  • clients and servers can communicate using any currently known or future developed network protocol, such as HTTP, and can be interconnected with any form or medium of digital data communication (e.g., a communication network).
  • examples of communication networks include local area networks (“LANs”), wide area networks (“WANs”), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed network.
  • the above-mentioned computer-readable medium may be included in the above-mentioned computing device, or may exist independently without being assembled into the computing device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the one or more programs are executed by the computing device, the computing device is caused to perform:
  • receiving multimedia data to be matched, the multimedia data to be matched being obtained by performing special effect processing on original multimedia data; extracting a first multimedia feature from the multimedia data to be matched; obtaining a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and querying, among the plurality of candidate multimedia data, target multimedia data matching the first multimedia feature, the target multimedia data being used to generate merged multimedia data with the multimedia data to be matched.
  • computer program codes for performing the operations of the present disclosure may be written in one or more programming languages or combinations thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk, C++, and also conventional procedural programming languages - such as "C" or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, through the Internet using an Internet service provider).
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or may be implemented by a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of a unit does not constitute a limitation of the unit itself under certain circumstances.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSPs Application Specific Standard Products
  • SOCs System on Chips
  • CPLD Complex Programmable Logical device
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • a machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing.
  • more specific examples of machine-readable storage media include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, compact disk read-only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The present disclosure relates to multimedia display and matching methods and apparatuses, a device and a medium. The multimedia display method comprises: receiving original multimedia data; performing special effect editing on the original multimedia data to obtain multimedia data to be matched; generating synthetic multimedia data on the basis of the multimedia data to be matched and target multimedia data, the target multimedia data being obtained by matching according to a first multimedia feature of the multimedia data to be matched; and displaying the synthetic multimedia data. According to the embodiments of the present disclosure, the beautification effect of multimedia data is enriched, and the display appeal of the multimedia data is improved. Moreover, users may interact by means of the multimedia data, diversified interaction between the users is implemented, and the use experience of the users is improved.

Description

Multimedia Display and Matching Method, Device, Equipment and Medium
Cross-Reference to Related Applications
This application is based on, and claims priority to, Chinese patent application No. 202111136435.5, filed on September 27, 2021 and titled "Multimedia Display and Matching Method, Device, Equipment and Medium", the entire content of which is incorporated herein by reference.
Technical Field
The present disclosure relates to the technical field of multimedia processing, and in particular to a multimedia display and matching method, device, equipment and medium.
Background
With the rapid development of computer technology and mobile communication technology, various network platforms based on electronic devices have come into widespread use, greatly enriching people's daily lives. More and more users enjoy beautifying multimedia data such as images or videos on network platforms to obtain photos or videos with satisfactory effects.
At present, although users can beautify multimedia data with preset special effect templates, the ways in which users can interact with each other are limited and lack interest, which degrades the user experience.
Summary
In order to solve the above technical problems, or at least partly solve them, the present disclosure provides a multimedia display and matching method, device, equipment and medium.
In a first aspect, the present disclosure provides a multimedia display method, including:
receiving original multimedia data;
performing special effect editing on the original multimedia data to obtain multimedia data to be matched;
generating synthesized multimedia data based on the multimedia data to be matched and target multimedia data, the target multimedia data being obtained by matching according to a first multimedia feature of the multimedia data to be matched; and
displaying the synthesized multimedia data.
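The four steps of the first aspect can be sketched as a small pipeline. This is an illustrative sketch only, not the disclosed implementation: the `Media` type, the helper names, the use of the last applied effect as the "first multimedia feature", and the fallback when nothing matches are all hypothetical assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Media:
    """Hypothetical stand-in for a piece of multimedia data."""
    owner: str
    content: str
    effects: Tuple[str, ...] = ()


def apply_effect(original: Media, effect: str) -> Media:
    # Special effect editing yields the multimedia data to be matched.
    return Media(original.owner, original.content, original.effects + (effect,))


def first_feature(media: Media) -> Optional[str]:
    # Assumption: the "first multimedia feature" is the last applied effect.
    return media.effects[-1] if media.effects else None


def match_target(to_match: Media, candidates: List[Media]) -> Optional[Media]:
    # Assumed rule: the target is the first candidate with the same feature.
    feature = first_feature(to_match)
    for candidate in candidates:
        if first_feature(candidate) == feature:
            return candidate
    return None


def synthesize(to_match: Media, target: Media) -> Media:
    # Combine both pieces into synthesized multimedia data.
    return Media(
        owner=to_match.owner,
        content=f"{to_match.content}+{target.content}",
        effects=to_match.effects + target.effects,
    )


def display_pipeline(original: Media, effect: str, candidates: List[Media]) -> Media:
    to_match = apply_effect(original, effect)
    target = match_target(to_match, candidates)
    if target is None:
        # Assumption: with no match, the edited data is shown on its own.
        return to_match
    return synthesize(to_match, target)
```

The value returned by `display_pipeline` is what a real client would hand to its rendering layer; rendering itself is outside the scope of this sketch.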
In a second aspect, the present disclosure provides a multimedia matching method, including:
receiving multimedia data to be matched, the multimedia data to be matched being obtained by performing special effect processing on original multimedia data;
extracting a first multimedia feature from the multimedia data to be matched;
obtaining a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and
querying, among the plurality of candidate multimedia data, target multimedia data matching the first multimedia feature, the target multimedia data being used to generate merged multimedia data with the multimedia data to be matched.
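The query step above can be illustrated with a minimal sketch. The text here does not say how features are represented or compared, so everything concrete in this example is an assumption: features are treated as plain numeric vectors, matching uses cosine similarity, and `threshold` is a hypothetical cut-off below which no candidate counts as a match. Feature extraction itself is not shown.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def query_target(
    first_feature: Sequence[float],
    candidates: Dict[str, Sequence[float]],
    threshold: float = 0.8,  # hypothetical cut-off, not from the source
) -> Optional[str]:
    """Return the id of the best-scoring candidate at or above the threshold.

    `candidates` maps a candidate id to its (assumed) feature vector.
    Returns None when no candidate reaches the threshold.
    """
    best_id, best_score = None, threshold
    for candidate_id, feature in candidates.items():
        score = cosine_similarity(first_feature, feature)
        if score >= best_score:
            best_id, best_score = candidate_id, score
    return best_id
```

Returning `None` rather than the nearest candidate keeps a poor match from being merged; whether the disclosed method behaves this way is not stated in this passage.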
In a third aspect, the present disclosure provides a multimedia display device, including:
a data receiving unit configured to receive original multimedia data;
a special effect editing unit configured to perform special effect editing on the original multimedia data to obtain multimedia data to be matched;
a data synthesis unit configured to generate synthesized multimedia data based on the multimedia data to be matched and target multimedia data, the target multimedia data being obtained by matching according to a first multimedia feature of the multimedia data to be matched; and
a data display unit configured to display the synthesized multimedia data.
In a fourth aspect, the present disclosure provides a multimedia matching device, including:
a data receiving unit configured to receive multimedia data to be matched, the multimedia data to be matched being obtained by performing special effect processing on original multimedia data;
a feature extraction unit configured to extract a first multimedia feature from the multimedia data to be matched;
a data acquisition unit configured to acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and
a data query unit configured to query, among the plurality of candidate multimedia data, target multimedia data matching the first multimedia feature, the target multimedia data being used to generate merged multimedia data with the multimedia data to be matched.
In a fifth aspect, the present disclosure provides a computing device, including:
a processor; and
a memory for storing executable instructions;
wherein the processor is configured to read the executable instructions from the memory and execute them to implement the multimedia display method of the first aspect or the multimedia matching method of the second aspect.
In a sixth aspect, the present disclosure provides a computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to implement the multimedia display method of the first aspect or the multimedia matching method of the second aspect.
Compared with the prior art, the technical solutions provided by the embodiments of the present disclosure have the following advantages:
With the multimedia display and matching method, device, equipment and medium of the embodiments of the present disclosure, after special effect editing is performed on the received original multimedia data, synthesized multimedia data is generated and displayed based on the multimedia data to be matched obtained by the editing and the target multimedia data. Because the target multimedia data is obtained by matching on the first multimedia feature of the multimedia data to be matched, the synthesized multimedia data obtained from the original multimedia data can include, in addition to the special effects in the multimedia data to be matched, the content of the target multimedia data that matches the multimedia data to be matched. The multimedia data image therefore has multiple elements, which enriches the beautification effect of the multimedia data and makes its display more interesting. Users can also interact through the multimedia data, which enables diverse interaction between users and improves the user experience.
Brief Description of the Drawings
The above and other features, advantages and aspects of the embodiments of the present disclosure will become more apparent with reference to the following detailed description taken in conjunction with the accompanying drawings. Throughout the drawings, the same or similar reference numerals denote the same or similar elements. It should be understood that the drawings are schematic and that components and elements are not necessarily drawn to scale.
FIG. 1 shows an architecture diagram of a multimedia display system provided by an embodiment of the present disclosure;
FIG. 2 shows an architecture diagram of another multimedia display system provided by an embodiment of the present disclosure;
FIG. 3 shows a schematic flowchart of a multimedia display method provided by an embodiment of the present disclosure;
FIG. 4 shows a schematic diagram of a shooting preview interface provided by an embodiment of the present disclosure;
FIG. 5 shows a schematic diagram of a special effect editing interface provided by an embodiment of the present disclosure;
FIG. 6 shows a schematic diagram of a display interface of multimedia data to be matched provided by an embodiment of the present disclosure;
FIG. 7 shows a schematic diagram of the matching logic of multimedia data provided by an embodiment of the present disclosure;
FIG. 8 shows a schematic diagram of a display interface of synthesized multimedia data provided by an embodiment of the present disclosure;
FIG. 9 shows a schematic flowchart of another multimedia display method provided by an embodiment of the present disclosure;
FIG. 10 shows a schematic flowchart of yet another multimedia display method provided by an embodiment of the present disclosure;
FIG. 11 shows a schematic flowchart of a multimedia matching method provided by an embodiment of the present disclosure;
FIG. 12 shows a schematic structural diagram of a multimedia display device provided by an embodiment of the present disclosure;
FIG. 13 shows a schematic structural diagram of a multimedia matching device provided by an embodiment of the present disclosure;
FIG. 14 shows a schematic structural diagram of a computing device provided by an embodiment of the present disclosure.
Detailed Description
Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.
It should be understood that the steps described in the method embodiments of the present disclosure may be executed in a different order and/or in parallel. In addition, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this respect.
As used herein, the term "include" and its variants are open-ended, i.e., "including but not limited to". The term "based on" means "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one further embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below.
It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order of, or the interdependence between, the functions performed by these devices, modules or units.
It should be noted that the modifiers "one" and "a plurality of" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that, unless the context clearly indicates otherwise, they should be read as "one or more".
The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are used for illustrative purposes only, and are not used to limit the scope of these messages or information.
With the rapid development of computer technology and mobile communication technology, various network platforms based on electronic devices have come into widespread use, greatly enriching people's daily lives. More and more users enjoy shooting and beautifying multimedia data such as images or videos on network platforms to obtain photos or videos with satisfactory effects.
At present, users can beautify multimedia data with preset special effect templates. For example, preset stickers or preset special effects can be added to a captured picture.
However, with this approach the user can only select a special effect template from the preset special effect tools, so the beautification effect is limited and lacks interest, which degrades the user experience.
In order to solve the above problems, the embodiments of the present disclosure provide a multimedia display and matching method, device, equipment and medium capable of displaying synthesized multimedia data generated from multimedia data to be matched and target multimedia data.
The multimedia display method provided by the present disclosure can be applied to the architectures shown in FIG. 1 and FIG. 2, which are described in detail below.
FIG. 1 shows an architecture diagram of a multimedia display system provided by an embodiment of the present disclosure.
As shown in FIG. 1, the multimedia display system may include at least one electronic device 101 on the client side and at least one server 102 on the server side. The electronic device 101 can establish a connection with the server 102 and exchange information through a network protocol such as Hyper Text Transfer Protocol over Secure Socket Layer (HTTPS). The electronic device 101 may be a device with communication functions such as a mobile phone, tablet computer, desktop computer, notebook computer, vehicle-mounted terminal, wearable device, all-in-one machine, or smart home device, or it may be a device simulated by a virtual machine or an emulator. The server 102 may be a device with storage and computing functions such as a cloud server or a server cluster.
Based on the above architecture, the user can perform special effect editing on original multimedia data on the electronic device 101 through a specific service platform, and generate and display synthesized multimedia data. The specific service platform may be a specific application or a specific website, for example a social platform or a video playing platform with social functions.
In some embodiments, after the user logs in to the specific service platform through the electronic device 101, the electronic device 101 can obtain original multimedia data such as an image or a video, and perform special effect editing on the original multimedia data to obtain the multimedia data to be matched. After obtaining multiple pieces of candidate multimedia data, including the target multimedia data P11, from the server 102, the electronic device 101 may query the target multimedia data P11 from the candidate multimedia data based on the first multimedia feature of the multimedia data to be matched. Then, the electronic device 101 may generate synthesized multimedia data P12 from the multimedia data to be matched and the target multimedia data P11 obtained by matching on the first multimedia feature. Optionally, continuing to refer to FIG. 1, the electronic device 101 may upload the generated synthesized multimedia data P12 to the server 102.
In some other embodiments, the electronic device 101 may upload the multimedia data to be matched to the server 102. After receiving the multimedia data to be matched, the server 102 can match the target multimedia data P11 from multiple pieces of candidate multimedia data, and send the target multimedia data P11 to the electronic device 101. Then, the electronic device 101 may generate synthesized multimedia data P12 from the multimedia data to be matched and the target multimedia data P11 obtained by matching on the first multimedia feature of the multimedia data to be matched.
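The two embodiments above differ only in where the matching runs: on the electronic device 101, using candidates fetched from the server 102, or on the server 102 itself. The sketch below contrasts the two. It is a simplified illustration under stated assumptions: the `Server` class, its method names, and the rule that a feature matches by exact equality of a label are all hypothetical, and network transport is replaced by direct method calls.

```python
from typing import Dict, Optional


def find_match(feature: str, candidates: Dict[str, str]) -> Optional[str]:
    # Assumed matching rule: a candidate matches when its feature label
    # is equal to the first multimedia feature of the data to be matched.
    for candidate_id, candidate_feature in candidates.items():
        if candidate_feature == feature:
            return candidate_id
    return None


class Server:
    """Hypothetical stand-in for server 102, holding candidate features by id."""

    def __init__(self, candidates: Dict[str, str]) -> None:
        self._candidates = dict(candidates)

    def get_candidates(self) -> Dict[str, str]:
        # Device-side mode: hand all candidates to the device.
        return dict(self._candidates)

    def match(self, feature: str) -> Optional[str]:
        # Server-side mode: match here and return only the target id.
        return find_match(feature, self._candidates)


def device_side_matching(server: Server, feature: str) -> Optional[str]:
    # The device downloads the candidates and queries them locally.
    return find_match(feature, server.get_candidates())


def server_side_matching(server: Server, feature: str) -> Optional[str]:
    # The device uploads its feature (standing in for the uploaded data)
    # and receives the matched target back.
    return server.match(feature)
```

Both paths return the same target for the same inputs. The practical difference is what crosses the network: the whole candidate set in the first mode, versus the data to be matched and a single target in the second.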
另外,本公开所提供的多媒体显示方法可以应用于多个电子设备 的用户通过多媒体数据进行互动的具体场景中,下面以图2所示的架构进行说明。In addition, the multimedia display method provided by the present disclosure can be applied to a specific scenario where users of multiple electronic devices interact through multimedia data, and the architecture shown in FIG. 2 will be described below.
图2示出了本公开实施例提供的另一种多媒体显示系统的架构图。FIG. 2 shows a structural diagram of another multimedia display system provided by an embodiment of the present disclosure.
如图2所示,该多媒体显示系统中可以包括客户端的至少一个第一电子设备201和至少一个第二电子设备202、以及服务端的至少一个服务器203。第一电子设备201、第二电子设备202和服务器203之间可以分别通过网络协议如HTTPS建立连接并进行信息交互。其中,第一电子设备201和第二电子设备202可以分别是移动电话、平板电脑、台式计算机、笔记本电脑、车载终端、可穿戴设备、一体机、智能家居设备等具有通信功能的设备,也可以是虚拟机或者模拟器模拟的设备。服务器203可以是云服务器或者服务器集群等具有存储及计算功能的设备。As shown in FIG. 2 , the multimedia display system may include at least one first electronic device 201 and at least one second electronic device 202 on the client side, and at least one server 203 on the server side. The first electronic device 201, the second electronic device 202, and the server 203 can respectively establish connections and perform information exchange through network protocols such as HTTPS. Wherein, the first electronic device 201 and the second electronic device 202 may be devices with communication functions such as mobile phones, tablet computers, desktop computers, notebook computers, vehicle-mounted terminals, wearable devices, all-in-one machines, and smart home devices, or It is a device simulated by a virtual machine or an emulator. The server 203 may be a device with storage and computing functions such as a cloud server or a server cluster.
基于上述架构,第一用户可以在第一电子设备201上登录特定业务平台,第二用户可以在第二电子设备202上登录相同的特定业务平台。在第一用户通过特定业务平台与第一用户进行互动的过程中,第二用户可以使用第二电子设备202在特定社交平台内通过特定社交平台的服务器203向第一用户发送需要由第一用户进行合成处理的目标多媒体数据P22。其中,特定社交平台可以为具有社交功能的特定应用程序或者特定网站。Based on the above architecture, the first user can log in to a specific service platform on the first electronic device 201 , and the second user can log in to the same specific service platform on the second electronic device 202 . During the interaction between the first user and the first user through the specific service platform, the second user can use the second electronic device 202 to send the request to the first user through the server 203 of the specific social platform in the specific social platform. The target multimedia data P22 subjected to synthesis processing. Wherein, the specific social platform may be a specific application or a specific website with social functions.
In one embodiment, after the second user sends the special-effect-edited target multimedia data P22 to the server 203 through the second electronic device 202, the server 203 may send candidate multimedia data, including the target multimedia data P22, to the first electronic device 201. If the first electronic device 201 determines that the target multimedia data P22 matches the special-effect-processed to-be-matched multimedia data P21, it may generate synthesized multimedia data P23 and send the synthesized multimedia data P23 to the second electronic device 202 via the server 203.
In another embodiment, after the server 203 receives the special-effect-edited to-be-matched multimedia data P21 sent by the first user through the first electronic device 201 and the special-effect-edited target multimedia data P22 sent by the second user through the second electronic device 202, the server 203 sends the target multimedia data P22 to the first electronic device 201 if the target multimedia data P22 matches the to-be-matched multimedia data P21. After generating synthesized multimedia data P23, the first electronic device 201 sends the synthesized multimedia data P23 to the second electronic device 202 via the server 203.
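The server-side matching variant can be illustrated with a small in-memory simulation. This is only a sketch under stated assumptions: the `Media` and `Device` classes, the `is_match` criterion (at least one shared tag), and the way P23 is built are all hypothetical stand-ins, since the text does not fix any of them here.

```python
from dataclasses import dataclass, field


@dataclass
class Media:
    label: str        # e.g. "P21", "P22", "P23"
    tags: frozenset   # hypothetical feature tags used for matching


@dataclass
class Device:
    name: str
    inbox: list = field(default_factory=list)


def is_match(a: Media, b: Media) -> bool:
    # Placeholder criterion (assumption): the two items share at least one tag.
    return bool(a.tags & b.tags)


def server_relay(p21: Media, p22: Media, first: Device, second: Device):
    """Forward P22 to the first device only when it matches P21,
    then relay the synthesized P23 to the second device."""
    if not is_match(p21, p22):
        return None
    first.inbox.append(p22)                    # server -> first device
    p23 = Media("P23", p21.tags | p22.tags)    # first device synthesizes P23
    second.inbox.append(p23)                   # first device -> server -> second device
    return p23
```

In this sketch the server acts purely as a gatekeeper and relay; the synthesis itself is attributed to the first device, as in the embodiment above.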
在通过图1和图2介绍了本公开实施例的多媒体显示系统的架构之后,下面首先结合图3至图8对本公开实施例提供的多媒体显示方法进行说明。After introducing the architecture of the multimedia display system according to the embodiment of the present disclosure through FIG. 1 and FIG. 2 , the multimedia display method provided by the embodiment of the present disclosure will be described below with reference to FIG. 3 to FIG. 8 .
图3示出了本公开实施例提供的一种多媒体显示方法的流程示意图。Fig. 3 shows a schematic flowchart of a multimedia display method provided by an embodiment of the present disclosure.
在本公开实施例中,该多媒体显示方法可以由电子设备执行。其中,电子设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)、可穿戴设备等等的移动终端以及诸如数字TV、台式计算机、智能家居设备等等的固定终端。In the embodiment of the present disclosure, the multimedia display method may be executed by an electronic device. Among them, the electronic equipment may include but not limited to such as mobile phone, notebook computer, digital broadcast receiver, PDA (personal digital assistant), PAD (tablet computer), PMP (portable multimedia player), vehicle terminal (such as vehicle navigation terminal) , wearable devices, etc., and fixed terminals such as digital TVs, desktop computers, smart home devices, etc.
如图3所示,该多媒体显示方法可以包括如下步骤。As shown in Fig. 3, the multimedia display method may include the following steps.
S310、接收原始多媒体数据。S310. Receive original multimedia data.
In the embodiment of the present disclosure, a user who wants to perform special effect editing on an image may trigger a related operation in a target application, and the electronic device may receive the original multimedia data in response to that operation. The target application may be a social platform or a video publishing platform. Specifically, the original multimedia data may be multimedia data containing visual information, such as video data or image data.
In some embodiments, the original multimedia data may be captured by the user in real time. Accordingly, the related operation may be the user opening a shooting page, a shooting operation performed by the user on the shooting page, or the user triggering the multimedia synthesis function on a live broadcast page or the shooting page.
在另一些实施例中,原始多媒体数据可以是电子设备在本地存储的。相应地,相关操作可以是用户在电子相册对图像或者视频的选择操作。In other embodiments, the original multimedia data may be locally stored by the electronic device. Correspondingly, the relevant operation may be the user's selection operation on an image or video in the electronic album.
在又一些实施例中,原始多媒体数据可以是用户下载的。相应地,相关操作可以是用户在浏览器、目标应用程序或者第三方应用程序的下载页面内针对图像或者视频的下载操作。In yet other embodiments, the original multimedia data may be downloaded by the user. Correspondingly, the related operation may be a user's download operation for images or videos in a download page of a browser, a target application program, or a third-party application program.
在再一些实施例中,原始多媒体数据可以是其他设备发送至电子设备的。相应地,电子设备在接收到其他设备发送的多媒体数据之后,可以将其作为原始多媒体数据。In still other embodiments, the original multimedia data may be sent to the electronic device by other devices. Correspondingly, after the electronic device receives the multimedia data sent by other devices, it can use it as the original multimedia data.
在一些实施例中,原始多媒体数据中可以是包含人物、动物、植物或者物体等目标对象的多媒体数据。比如,原始多媒体数据可以是用户自拍照、用户的自拍视频等。可选地,原始多媒体数据中可以包括目标对象的局部影像或者整体影像,比如原始多媒体数据可以仅包括人物脸部影像,或者包括脸部和其他身体部位的影像。In some embodiments, the original multimedia data may be multimedia data including target objects such as people, animals, plants or objects. For example, the original multimedia data may be a user's selfie, a user's selfie video, and the like. Optionally, the original multimedia data may include a partial image or an overall image of the target object. For example, the original multimedia data may only include a person's face image, or include images of a face and other body parts.
S320,对原始多媒体数据进行特效编辑,得到待匹配多媒体数据。S320. Perform special effect editing on the original multimedia data to obtain multimedia data to be matched.
在本公开实施例中,电子设备在接收到原始多媒体数据后,可以响应于用户的特效编辑功能或者多媒体合拍功能的触发操作,对原始多媒体数据进行特效编辑,得到待匹配多媒体数据。In the embodiment of the present disclosure, after receiving the original multimedia data, the electronic device may perform special effect editing on the original multimedia data in response to the user's trigger operation of the special effect editing function or the multimedia co-shooting function to obtain the multimedia data to be matched.
In some embodiments, special effect editing may change features of the target object itself in the original multimedia data, or features of the target object's accessory parts, by adding features or replacing original features. Specifically, the original multimedia data may be edited with at least one special effect editing tool such as beautification, image modification, special effect props, filters, image style transfer tools, or stickers. The special effect editing tool may be provided by the target application, a third-party application, a web page, or the like. For ease of description, the target object after special effect processing in the multimedia data to be matched is referred to below as the first special effect object.
在一个示例中,电子设备可以通过美颜、图像修改、特效道具等方式改变目标对象的脸部特征。比如,可以调整目标对象脸部轮廓、眼部、皮肤、鼻子、嘴部等的特征。In one example, the electronic device can change the face features of the target object by means of beautification, image modification, special effects and props. For example, you can adjust the features of the target object's facial contours, eyes, skin, nose, mouth, etc.
在另一个示例中,可以通过美颜、图像修改、特效道具等功能来改变目标对象诸如身高、整体或者局部胖瘦的特征。In another example, features such as height, overall or partial fatness and thinness of the target object can be changed through functions such as beautification, image modification, and special effects props.
In still other embodiments, features of accessory parts of the target object, such as clothing, headwear, glasses, makeup, masks, and facial effects that do not alter the original facial parts, may be added or changed through stickers, special effect props, and the like. Facial effects that do not alter the original facial parts may include, for example, animal whiskers.
在再一些实施例中,可以通过图像风格迁移工具或者滤镜,对整个原始多媒体数据图像风格、或者目标对象的整体或者局部图像风格进行风格迁移。比如,可以将原始多媒体数据的图像风格转换为动画风格,相应地,原始多媒体数据中的目标对象变为卡通人物。In some other embodiments, the image style of the entire original multimedia data, or the overall or partial image style of the target object can be style transferred by using an image style transfer tool or filter. For example, the image style of the original multimedia data can be converted into an animation style, and correspondingly, the target object in the original multimedia data becomes a cartoon character.
在一些实施例中,可以从特效编辑工具的多个可选特效模板中选择目标特效模板,对原始多媒体数据进行特效编辑。具体的,若原始多媒体数据是图像,则可以利用目标特效模板对原始图像的局部影像或者整体影像进行静态或者动态的特效编辑,生成图像或者视频格式的待匹配多媒体数据。又或者,若原始多媒体数据是视频,则可以从原始视频中提取一幅或者多幅关键视频帧,利用目标特效模板对关键视频帧的局部影像或者整体影像进行静态或者动态的特效编辑,生成图像或者视频格式的待匹配多媒体数据。可选地,关键视频帧可以是原始视频中包含目标对象的视频帧。In some embodiments, a target special effect template may be selected from multiple selectable special effect templates of the special effect editing tool to perform special effect editing on the original multimedia data. Specifically, if the original multimedia data is an image, the target special effect template can be used to perform static or dynamic special effect editing on the partial image or the overall image of the original image to generate the multimedia data to be matched in image or video format. Or, if the original multimedia data is a video, one or more key video frames can be extracted from the original video, and the target special effect template can be used to perform static or dynamic special effects editing on the partial image or the overall image of the key video frame to generate an image Or the multimedia data to be matched in video format. Optionally, the key video frame may be a video frame containing the target object in the original video.
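The video case described above (extract key frames that contain the target object, then edit them with the target template) can be sketched as follows. The frame records, the `has_target` flag, the `limit` of three frames, and the `apply_template` stand-in are hypothetical; a real implementation would run object detection and actual image processing.

```python
from typing import Callable, List, Sequence

Frame = dict  # hypothetical frame record, e.g. {"index": 0, "has_target": True}


def select_key_frames(
    frames: Sequence[Frame],
    contains_target: Callable[[Frame], bool],
    limit: int = 3,
) -> List[Frame]:
    """Keep at most `limit` frames in which the target object appears."""
    return [f for f in frames if contains_target(f)][:limit]


def apply_template(frame: Frame, template: str) -> Frame:
    """Stand-in for special effect editing: record the template on a copy."""
    edited = dict(frame)
    edited["effect"] = template
    return edited


def edit_video(frames, template, contains_target=lambda f: f.get("has_target", False)):
    """Edit only the key frames and return them as the data to be matched."""
    return [apply_template(f, template) for f in select_key_frames(frames, contains_target)]
```

The original frames are left untouched, so the same source video can be edited again with a different template.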
Further, depending on how the special effect editing is performed, S320 may include at least the following two implementations.
In some embodiments, S320 may specifically include: in response to a template selection operation on a target special effect template, performing special effect editing on the original multimedia data based on the target special effect template to obtain the multimedia data to be matched.
Specifically, after the original multimedia data is received, if the user wants to apply special effect editing to it, the electronic device may, in response to the user's trigger operation of selecting a target special effect template from the multiple selectable special effect templates of the special effect editing tool, edit the original multimedia data with the selected target special effect template to obtain the multimedia data to be matched.
In other embodiments, S320 may specifically include: performing special effect editing on the original multimedia data based on the target special effect template corresponding to the original multimedia data, to obtain the multimedia data to be matched.
具体地,若用户预先选择了目标特效模板,则可以直接用目标特效模板对原始多媒体数据进行特效编辑,得到待匹配多媒体数据。又或者,可以基于原始多媒体数据的多媒体特征,为其匹配合适的特效模板作为目标特效模板。再或者,若用户先选择了目标特效模板,则可以在该模板内拍摄多媒体数据,相应地,在拍摄界面直接显示待匹配多媒体数据。Specifically, if the user pre-selects the target special effect template, the target special effect template can be directly used to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched. Alternatively, based on the multimedia characteristics of the original multimedia data, an appropriate special effect template can be matched as the target special effect template. Alternatively, if the user first selects a target special effect template, multimedia data can be photographed in the template, and correspondingly, the multimedia data to be matched is directly displayed on the photographing interface.
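The ways of arriving at a target special effect template (an explicit or earlier user selection, or automatic matching on the multimedia features) can be sketched as a single resolver. The template catalogue, the feature names, and the "largest feature overlap" heuristic are assumptions made for illustration only.

```python
from typing import Dict, Optional, Set


def resolve_target_template(
    user_choice: Optional[str],
    media_features: Set[str],
    templates: Dict[str, Set[str]],
) -> Optional[str]:
    """Return the user's selected template, else the best feature match."""
    if user_choice is not None:
        if user_choice not in templates:
            raise KeyError(f"unknown template: {user_choice}")
        return user_choice
    # Assumed heuristic: pick the template sharing the most features.
    best, best_overlap = None, 0
    for name in sorted(templates):  # sorted so ties resolve deterministically
        overlap = len(templates[name] & media_features)
        if overlap > best_overlap:
            best, best_overlap = name, overlap
    return best
```

Returning `None` when nothing overlaps leaves it to the caller to decide whether to fall back to a default template or to ask the user.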
S330,基于待匹配多媒体数据和目标多媒体数据,生成合成多媒体数据。S330. Generate composite multimedia data based on the multimedia data to be matched and the target multimedia data.
In the embodiment of the present disclosure, after receiving the multimedia data to be matched and the target multimedia data, the electronic device may add a target image portion of the multimedia data to be matched and a target image portion of the target multimedia data, either directly or after some conversion, to a target image area in a target multimedia template by image stitching or image fusion, to obtain the synthesized multimedia data. The synthesized multimedia data can thus simultaneously carry at least some features, within the target multimedia template, of the first special effect character in the multimedia data to be matched and of the second special effect character in the target multimedia data.
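Stitching and fusion into a template's target area can be illustrated on tiny grayscale images stored as nested lists. This is a minimal sketch, not the disclosed implementation: the side-by-side layout in `compose`, the fixed `alpha` values, and the pixel format are all assumptions.

```python
from typing import List

Image = List[List[float]]  # grayscale pixels in [0, 1], row-major


def paste(canvas: Image, patch: Image, top: int, left: int, alpha: float = 1.0) -> Image:
    """Blend `patch` into a copy of `canvas` at (top, left).

    alpha=1.0 replaces the pixels (stitching);
    0 < alpha < 1 mixes them with the canvas (fusion).
    """
    out = [row[:] for row in canvas]
    for r, patch_row in enumerate(patch):
        for c, value in enumerate(patch_row):
            y, x = top + r, left + c
            if 0 <= y < len(out) and 0 <= x < len(out[y]):  # clip to canvas
                out[y][x] = alpha * value + (1.0 - alpha) * out[y][x]
    return out


def compose(template: Image, first: Image, second: Image) -> Image:
    """Place the first subject on the left (stitched) and the second on
    the right half (fused at 50%) of the template."""
    half = len(template[0]) // 2
    step = paste(template, first, 0, 0)
    return paste(step, second, 0, half, alpha=0.5)
```

For example, composing a 2x4 black template with two all-white patches yields fully white pixels where the first patch was stitched and mid-gray pixels where the second was fused.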
为了便于解释合成多媒体数据,在介绍合成多媒体数据之前,本公开实施例的下述部分将对待匹配多媒体数据的第一多媒体特征和目标多媒体数据展开具体说明。To facilitate the explanation of the synthesized multimedia data, before introducing the synthesized multimedia data, the following part of the embodiments of the present disclosure will specifically describe the first multimedia feature of the multimedia data to be matched and the target multimedia data.
待匹配多媒体数据的第一多媒体特征可以是第一特效角色自身特征或者附属配件的特征。The first multimedia feature of the multimedia data to be matched may be a feature of the first special effect character itself or a feature of an accessory.
在一些实施例中,第一特效角色自身特征可以包括第一特效角色的脸部特征或者诸如身高、胖瘦等身体特征。In some embodiments, the characteristics of the first special effects character may include facial features or body characteristics such as height, fatness and thinness of the first special effects character.
可选地,脸部特征可以包括诸如头部长宽比、脸型、下巴与头部宽度之比、额头长度与头部长度之比等脸型特征。或者,可以包括诸如眼睛大小、眼间距、瞳孔颜色、瞳孔大小、眼睛形状等眼部特征。又或者,可以包括诸如鼻子长度、鼻翼宽度、鼻梁高度、鼻梁宽度等鼻部特征。 再或者,可以包括诸如头发长度、头发颜色、头发形状(卷发、直发)等毛发特征。再或者,可以包括诸如皮肤颜色、皮肤粗糙度等皮肤信息。Optionally, the facial features may include facial features such as head aspect ratio, face shape, ratio of chin to head width, ratio of forehead length to head length, and the like. Alternatively, eye characteristics such as eye size, distance between eyes, pupil color, pupil size, eye shape, etc. may be included. Alternatively, nasal features such as nose length, nose wing width, nose bridge height, nose bridge width, etc. may be included. Still alternatively, hair characteristics such as hair length, hair color, hair shape (curly, straight), etc. may be included. Alternatively, skin information such as skin color, skin roughness, etc. may be included.
在一些实施例中,附属配件的特征可以包括是否佩戴眼镜、是否佩戴面具、是否佩戴饰品、是否佩戴头饰、是否化妆、是否有非改变脸部原有部件特征的脸部特效等特征。示例性地,若第一特效对象佩戴有附属配件,则可以包括附属配件的具体特征。比如,若第一特效对象佩戴有面具,则第一多媒体特征还可以包括面具的型号、名称等。In some embodiments, the characteristics of the accessory accessories may include whether to wear glasses, whether to wear a mask, whether to wear accessories, whether to wear headgear, whether to wear makeup, whether to have facial effects that do not change the characteristics of the original parts of the face, and the like. Exemplarily, if the first special effect object wears an accessory, specific features of the accessory may be included. For example, if the first special effect object wears a mask, the first multimedia feature may also include the model and name of the mask.
通过上述示出的第一多媒体特征,可以匹配得到与待匹配多媒体数据具有相同或者相配合特征的目标多媒体数据。Through the first multimedia feature shown above, target multimedia data having the same or matching features with the multimedia data to be matched can be obtained through matching.
对于目标多媒体数据,可选地,目标多媒体数据中可以包括经特效编辑后的对象。示例性地,目标多媒体数据中的对象与原始多媒体数据中的目标对象可以是不同的对象。比如,目标多媒体数据中的对象可以是第二用户的、经特效编辑后的影像,待匹配多媒体数据中的目标对象可以是第一用户的、经特效编辑后的影像。为了便于说明,目标多媒体数据中经编辑后的对象可以称为第二特效对象。For the target multimedia data, optionally, the target multimedia data may include an object edited with special effects. Exemplarily, the object in the target multimedia data and the target object in the original multimedia data may be different objects. For example, the object in the target multimedia data may be the image of the second user edited with special effects, and the target object in the multimedia data to be matched may be the image of the first user edited with special effects. For ease of description, the edited object in the target multimedia data may be referred to as a second special effect object.
在一些实施例中,目标多媒体数据可以是目标应用程序、第三方应用程序或者网页的多媒体数据库中预存的数据。In some embodiments, the target multimedia data may be pre-stored data in a target application program, a third-party application program, or a multimedia database of a web page.
在另一些实施例中,目标多媒体数据可以是其他用户上传的、经特效编辑后的多媒体数据。In other embodiments, the target multimedia data may be multimedia data uploaded by other users and edited with special effects.
此外,其他用户生成目标多媒体数据的方式与生成待匹配多媒体数据的方式类似,在此不再赘述。In addition, the manner in which other users generate target multimedia data is similar to the manner in which to-be-matched multimedia data is generated, which will not be repeated here.
在一些实施例中,待匹配多媒体数据和目标多媒体数据可以是同一类多媒体数据。比如,二者均为图像,或者二者均为视频。又或者,待匹配多媒体数据和目标多媒体数据可以是不同类的多媒体数据。二者中的一者为图像,二者中的另一者为视频。In some embodiments, the multimedia data to be matched and the target multimedia data may be the same type of multimedia data. For example, both are images, or both are videos. Alternatively, the multimedia data to be matched and the target multimedia data may be different types of multimedia data. One of the two is an image, and the other of the two is a video.
在详细介绍了待匹配多媒体数据和目标多媒体数据之后,接下来本公开实施例将对合成多媒体数据进行说明。After the multimedia data to be matched and the target multimedia data are introduced in detail, the embodiment of the present disclosure will describe synthesizing multimedia data.
In some embodiments, if the target multimedia template is a scene template, the synthesized multimedia data may present interactive behavior or interactive actions between the first special effect character in the multimedia data to be matched and the second special effect character in the target multimedia data, within the scene corresponding to the target multimedia template. The target multimedia template may be an image scene template or a video scene template; its specific type is not limited.
在一个示例中,电子设备可以基于用户从多个可选场景模板中选择目标多媒体模板的操作,生成合成多媒体数据。其中,可选场景模板可以是从目标应用程序、第三方应用程序或者网页的场景模板库中获取的。In an example, the electronic device may generate synthesized multimedia data based on the user's operation of selecting a target multimedia template from multiple optional scene templates. Wherein, the optional scene template may be obtained from a scene template library of a target application program, a third-party application program, or a web page.
In another example, the electronic device may determine a matching target scene template according to features of the first special effect object in the multimedia data to be matched and features of the second special effect object in the target multimedia data. The features of the first and second special effect objects may be their actions.
For example, if the first special effect object is raising a glass and the second special effect object is also raising a glass, partial or whole images of the two objects may be added to a scene template such as a party or a bar to generate synthesized multimedia data showing, for instance, a toast.
As another example, if the first special effect object is being carried princess-style (a bridal carry) and the second special effect object is carrying someone, partial or whole images of the two objects may be added to a romantic scene template, such as a wedding or a beautiful sky, to generate synthesized multimedia data showing, for instance, a spinning bridal carry.
再比如,若第一特效对象的动作是踢球射门,第二特效对象的动作是防守,则可以将第一特效对象和第二特效对象的局部或者整体特征添加到体育赛场等场景模板中,生成诸如足球比赛等合成多媒体数据。For another example, if the action of the first special effect object is kicking and shooting, and the action of the second special effect object is defense, then the partial or overall features of the first special effect object and the second special effect object can be added to scene templates such as sports fields, Generate synthetic multimedia data such as football matches.
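The three action-based examples above amount to a lookup from a pair of actions to a scene template. A minimal sketch follows; the action names, template names, and the choice to try both orderings of the pair are hypothetical.

```python
from typing import Optional

# Hypothetical table: (first action, second action) -> scene template name.
SCENE_BY_ACTIONS = {
    ("raise_glass", "raise_glass"): "party",
    ("being_carried", "carrying"): "wedding",
    ("shooting", "defending"): "sports_field",
}


def pick_scene(first_action: str, second_action: str) -> Optional[str]:
    """Look up a scene template for two actions, trying both orders."""
    return (
        SCENE_BY_ACTIONS.get((first_action, second_action))
        or SCENE_BY_ACTIONS.get((second_action, first_action))
    )
```

A table keeps the pairing rules declarative; a production system might instead learn or score pairings, which the text does not specify.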
在又一个示例中,电子设备可以基于用户上传的目标多媒体模板,生成合成多媒体数据。In yet another example, the electronic device may generate synthesized multimedia data based on a target multimedia template uploaded by a user.
在一些实施例中,为了提高趣味性,还可以在合成多媒体数据中添加文字、音乐、特效等。In some embodiments, text, music, special effects, etc. may also be added to the synthesized multimedia data in order to improve interest.
S340,显示合成多媒体数据。S340, displaying the synthesized multimedia data.
In the embodiment of the present disclosure, the electronic device may display the synthesized multimedia data in response to the user's synthesis operation on the multimedia data or the user's trigger operation for displaying the synthesized multimedia data. Alternatively, without waiting for such a trigger operation, the electronic device may display the synthesized multimedia data directly on the relevant interface once it has been generated.
With the multimedia display method, apparatus, device, and medium of the embodiments of the present disclosure, after special effect editing is performed on the received original multimedia data, synthesized multimedia data is generated and displayed based on the edited multimedia data to be matched and the target multimedia data. Because the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched, the synthesized multimedia data derived from the original multimedia data contains not only the special effects of the multimedia data to be matched but also content of the matching target multimedia data. The resulting image therefore combines multiple elements, which enriches the beautification effect and makes the display more engaging. It also lets users interact with one another through multimedia data, enabling more varied interaction and improving the user experience.
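Steps S310 to S340 can be read as a short pipeline. The sketch below wires them together with injected callables so each step can be swapped out; the function name `run_display_method`, the string stand-ins for multimedia data, and the behavior when no target is found are assumptions, not part of the disclosure.

```python
from typing import Callable, Optional


def run_display_method(
    receive: Callable[[], str],
    edit: Callable[[str], str],
    find_target: Callable[[str], Optional[str]],
    synthesize: Callable[[str, str], str],
    display: Callable[[str], None],
) -> Optional[str]:
    original = receive()                     # S310: receive original multimedia data
    to_match = edit(original)                # S320: special effect editing
    target = find_target(to_match)           # obtain matching target multimedia data
    if target is None:
        return None                          # no match: behavior assumed, not specified
    combined = synthesize(to_match, target)  # S330: generate synthesized data
    display(combined)                        # S340: display it
    return combined
```

Usage with trivial stand-ins: passing `lambda: "selfie"`, `lambda m: m + "+mask"`, a finder that returns `"friend+mask"`, and a synthesizer that joins both strings displays and returns one combined value.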
为了便于理解,接下来本公开实施例通过图4-图8对本公开实施例提供的多媒体显示方法展开具体说明。For ease of understanding, the following embodiments of the present disclosure will specifically describe the multimedia display method provided by the embodiments of the present disclosure through FIGS. 4-8 .
图4示出了本公开实施例提供的一种拍摄预览界面的示意图。Fig. 4 shows a schematic diagram of a shooting preview interface provided by an embodiment of the present disclosure.
如图4所示,电子设备可以在拍摄预览界面40中显示目标对象41,以及诸如滤镜工具401、美颜工具402、特效工具403等多种特效编辑工具,以及还可以显示多媒体合成工具404。其中,滤镜工具401、美颜工具402、特效工具403均可以包括一个或者多个特效模板。As shown in FIG. 4 , the electronic device can display a target object 41 in the shooting preview interface 40, and various special effect editing tools such as a filter tool 401, a beauty tool 402, and a special effect tool 403, and can also display a multimedia synthesis tool 404. . Wherein, the filter tool 401, the beauty tool 402, and the special effect tool 403 may each include one or more special effect templates.
当用户点击特效工具403时,显示的界面可以如图5所示的界面。图5示出了本公开实施例提供的一种特效编辑界面的示意图。When the user clicks the special effect tool 403, the displayed interface may be as shown in FIG. 5 . Fig. 5 shows a schematic diagram of a special effect editing interface provided by an embodiment of the present disclosure.
如图5所示,特效编辑界面50上可以显示特效工具403的多个特效模板4031至4034。当用户从中选择面具特效模板4033之后,生成的待匹配多媒体数据如图6所示。图6示出了本公开实施例提供的一种待匹配多媒体数据的显示界面的示意图。As shown in FIG. 5 , multiple special effect templates 4031 to 4034 of the special effect tool 403 may be displayed on the special effect editing interface 50 . After the user selects the mask special effect template 4033, the generated multimedia data to be matched is shown in FIG. 6 . Fig. 6 shows a schematic diagram of a display interface of multimedia data to be matched provided by an embodiment of the present disclosure.
如图6所示,待匹配多媒体数据的显示界面60上可以包括经过特效处理后的第一特效角色61以及多媒体合成工具404。当用户点击多 媒体合成工具404之后,即可由电子设备或者服务器进行多媒体数据的匹配步骤。图7示出了本公开实施例提供的多媒体数据的匹配逻辑的示意图。As shown in FIG. 6 , the display interface 60 of the multimedia data to be matched may include a first special effect character 61 after special effect processing and a multimedia synthesis tool 404 . After the user clicks on the multimedia synthesis tool 404, the matching step of the multimedia data can be performed by the electronic device or the server. Fig. 7 shows a schematic diagram of matching logic of multimedia data provided by an embodiment of the present disclosure.
As shown in FIG. 7, after target multimedia data P72 containing a second special effect character 73 is obtained by matching against the multimedia data to be matched P71, which contains the first special effect character 61, the generated synthesized multimedia data is as shown in FIG. 8.
FIG. 8 shows a schematic diagram of a display interface for synthesized multimedia data provided by an embodiment of the present disclosure. As shown in FIG. 8, the synthesized multimedia data P81 may present, as an image or a video, the first special effect character 61 and the second special effect character 73 toasting in a masquerade ball scene.
FIG. 9 shows a schematic flowchart of another multimedia display method provided by some embodiments of the present disclosure.
在本公开实施例中,该多媒体显示方法可以由电子设备执行。其中,电子设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)、可穿戴设备等等的移动终端以及诸如数字TV、台式计算机、智能家居设备等等的固定终端。In the embodiment of the present disclosure, the multimedia display method may be executed by an electronic device. Among them, the electronic equipment may include but not limited to such as mobile phone, notebook computer, digital broadcast receiver, PDA (personal digital assistant), PAD (tablet computer), PMP (portable multimedia player), vehicle terminal (such as vehicle navigation terminal) , wearable devices, etc., and fixed terminals such as digital TVs, desktop computers, smart home devices, etc.
如图9所示,该多媒体显示方法可以包括如下步骤。As shown in FIG. 9, the multimedia display method may include the following steps.
S910,接收原始多媒体数据。其中,S910的具体内容与S310的具体内容类似,对此不再赘述。S910. Receive original multimedia data. Wherein, the specific content of S910 is similar to the specific content of S310, which will not be repeated here.
S920,对原始多媒体数据进行特效编辑,得到待匹配多媒体数据。其中,S920的具体内容与S320的具体内容类似,对此不再赘述。S920, performing special effect editing on the original multimedia data to obtain the multimedia data to be matched. Wherein, the specific content of S920 is similar to the specific content of S320, which will not be repeated here.
S930,从待匹配多媒体数据中提取第一多媒体特征。S930. Extract a first multimedia feature from the multimedia data to be matched.
在一些实施例中,可以利用图像特征提取技术或者视频帧特征提取技术从待匹配多媒体数据中提取第一多媒体特征。In some embodiments, image feature extraction technology or video frame feature extraction technology may be used to extract the first multimedia feature from the multimedia data to be matched.
For the specific content of the first multimedia feature, refer to the description of S330 in the preceding part of the embodiments of the present disclosure; details are not repeated here.
S940,获取待匹配多媒体数据对应的多个候选多媒体数据。S940. Acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched.
在一些实施例中,可以从目标应用程序、第三方应用程序或者网页的多媒体数据库获取候选多媒体数据。In some embodiments, candidate multimedia data may be obtained from a multimedia database of a target application, a third-party application, or a webpage.
在一个示例中,候选多媒体数据可以是多媒体数据库中预存的数据。In one example, the candidate multimedia data may be pre-stored data in a multimedia database.
在另一个示例中,候选多媒体数据可以是其他用户上传的、经特效编辑后的多媒体数据。In another example, the candidate multimedia data may be multimedia data uploaded by other users and edited with special effects.
S950,在多个候选多媒体数据中,查询与第一多媒体特征相匹配的目标多媒体数据。S950. Query target multimedia data matching the first multimedia feature among multiple candidate multimedia data.
在一些实施例中,可以通过特征匹配的方式,从多个候选多媒体数据中确定目标多媒体数据。In some embodiments, target multimedia data can be determined from multiple candidate multimedia data by means of feature matching.
在一个实施例中,S950可以具体包括下述步骤。In an embodiment, S950 may specifically include the following steps.
步骤A1,确定第一多媒体特征对应的至少一个特征标签。Step A1, determining at least one feature tag corresponding to the first multimedia feature.
可选地,特征标签可以是基于一个或者一类第一多媒体特征对第一特效对象从一个或者多个维度进行分类后得到的标签。示例性地,特征标签可以从第一特效对象自身或者附属部件的维度对第一特效对象进行分类。Optionally, the feature label may be a label obtained by classifying the first special effect object from one or more dimensions based on one or a type of first multimedia feature. Exemplarily, the feature tag can classify the first special effect object from the dimension of the first special effect object itself or the accessory component.
For example, the feature tags of the first special effect object itself may include a nose tag, an eye tag, a gender tag, an action tag, a skin condition tag, and other tags that classify the first special effect object by characteristics of the person.
又比如,第一特效对象的附属标签可以包括是否佩戴眼镜的标签、是否佩戴面具的标签、是否化妆标签等。For another example, the attached tags of the first special effect object may include whether to wear glasses, whether to wear a mask, whether to wear makeup, and so on.
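Step A1 can be pictured as a mapping from extracted feature values to one tag per dimension. The feature keys, tag values, and the 0.5 threshold below are hypothetical; the text only says that tags classify the first special effect object along one or more dimensions.

```python
def derive_tags(features: dict) -> dict:
    """Map raw first-multimedia-feature values to {dimension: tag value}."""
    tags = {}
    # Accessory dimensions: presence or absence of an accessory.
    for key in ("glasses", "mask", "makeup"):
        if key in features:
            tags[key] = "yes" if features[key] else "no"
    # Dimensions describing the object itself.
    if "gender" in features:
        tags["gender"] = features["gender"]
    if "action" in features:
        tags["action"] = features["action"]
    if "nose_bridge_height" in features:
        # The 0.5 cut-off is an arbitrary illustrative threshold.
        tags["nose"] = "high" if features["nose_bridge_height"] >= 0.5 else "low"
    return tags
```

Features with no corresponding dimension are simply ignored, so the tag set stays small and comparable across items.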
步骤A2,针对每个候选多媒体数据,确定与至少一个特征标签相同的共同标签。Step A2, for each candidate multimedia data, determine a common label that is the same as at least one feature label.
也就是说,在待匹配多媒体数据的标签和候选多媒体数据的标签存在相同的标签,则可以将相同的标签作为待匹配多媒体数据和候选多媒体数据的共同标签。That is to say, if the same label exists between the label of the multimedia data to be matched and the label of the candidate multimedia data, the same label may be used as a common label of the multimedia data to be matched and the candidate multimedia data.
示例性得、若用户A的标签包括戴眼镜、高鼻子、黄皮肤、高个子、女性;用户B的标签包括不戴眼镜、小嘴巴、瘦、男性。则二者 共同标签可以包括眼镜标签(戴眼镜或者不戴眼镜)、性别标签(男或者女)。Exemplarily, if user A's tags include wearing glasses, tall nose, yellow skin, tall, and female; user B's tags include no glasses, small mouth, thin, and male. Then the common tags of the two may include glasses tags (wearing glasses or not wearing glasses), gender tags (male or female).
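Step A2 can be pictured as an intersection of tag categories. The following is only a minimal sketch, assuming each item's tags are stored as a mapping from tag category to value; the function name `common_tags` and the category names are hypothetical and do not come from the disclosure.

```python
def common_tags(query_tags: dict[str, str], candidate_tags: dict[str, str]) -> set[str]:
    """Return the tag categories that both items carry (assumed representation)."""
    return set(query_tags) & set(candidate_tags)


# The user A / user B example, with hypothetical category names.
user_a = {"glasses": "wearing", "nose": "high", "skin": "yellow",
          "height": "tall", "gender": "female"}
user_b = {"glasses": "not wearing", "mouth": "small", "build": "thin",
          "gender": "male"}

shared = common_tags(user_a, user_b)  # the glasses and gender categories
```

Note that a category counts as common even when the two values differ, which mirrors the glasses and gender example in the text.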
Step A3: calculate the tag matching score between the multimedia data to be matched and each candidate multimedia data according to the weight values corresponding to the common tags.
In one example, the weight values of the common tags may be preset.
In another example, the weight values of the common tags may be set according to the user's selection. Tags that the user does not care about are given a low weight value a. For tags that the user values or pays attention to (for example, the user likes or dislikes whether other people wear glasses or have a ponytail), a high weight value b may be set for the glasses tag and the hairstyle tag, where b is greater than a. Alternatively, a high weight value c may be set for tags the user is interested in and a low weight value d for tags the user dislikes, where c is greater than a and a is greater than d.
The tag matching score reflects how well each candidate multimedia data matches the multimedia data to be matched with respect to the feature, or class of features, corresponding to the tag.
In some embodiments, for each feature tag, a tag score of the multimedia data for that tag may be generated from the feature. For example, for the glasses tag, if the first special effect object in the multimedia data to be matched wears glasses, the tag score of the glasses tag may be 100; if the first special effect object does not wear glasses, the tag score of the glasses tag may be 0.
It should be noted that the tag scores of the candidate multimedia data are calculated in the same way as those of the multimedia data to be matched, so the details are not repeated here.
Accordingly, after the tag scores of the candidate multimedia data and of the multimedia data to be matched are obtained, a similarity score between the two may be calculated from the tag scores of their common tags. The tag matching score of the two is then calculated from the similarity score and the weight values.
Optionally, for some feature tags, the closeness of the tag scores of each candidate multimedia data and the multimedia data to be matched is positively correlated with the similarity score between the two. That is, the closer the tag scores of a candidate multimedia data and the multimedia data to be matched, the higher the similarity score between them. For example, if both wear glasses, the similarity score is high. For example, the similarity score for this type of feature tag may equal a preset value minus the target tag score difference, where the target tag score difference is the difference between the candidate multimedia data's tag score for this type of tag and the tag score of the multimedia data to be matched for this type of tag.
For other feature tags, the closeness of the tag scores of each candidate multimedia data and the multimedia data to be matched is negatively correlated with the similarity score between the two. That is, the larger the tag score difference between a candidate multimedia data and the multimedia data to be matched, the higher the similarity score between them. For example, if the two have the same gender, the similarity score is low; if the two have opposite genders, the similarity score is high. For example, the similarity score for this type of feature tag may equal the target tag score difference.
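The two similarity rules can be illustrated with a short sketch. It assumes tag scores on a 0 to 100 scale (as in the glasses example), a preset value of 100, and that the "difference" is taken as an absolute value; the function name `similarity_score` and the `prefer_same` flag are hypothetical.

```python
def similarity_score(candidate_score: float, query_score: float,
                     prefer_same: bool = True, preset: float = 100.0) -> float:
    """Similarity for one common tag, under the assumptions stated above."""
    # Assumption: the target tag score difference is an absolute difference.
    diff = abs(candidate_score - query_score)
    # Positively correlated tags: preset value minus the difference.
    # Negatively correlated tags: the difference itself.
    return preset - diff if prefer_same else diff


both_wear_glasses = similarity_score(100, 100)                    # glasses tag
same_gender = similarity_score(100, 100, prefer_same=False)       # gender tag
opposite_gender = similarity_score(100, 0, prefer_same=False)     # gender tag
```

Under these assumptions, two items that both wear glasses reach the preset value, while for the gender tag only items with different scores obtain a non-zero similarity.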
In one example, the tag scores of the multiple candidate multimedia data may be recorded in a matching table. Accordingly, after obtaining the feature tags of the multimedia data to be matched and the tag score of each feature tag, the electronic device calculates the tag matching scores between the multimedia data to be matched and each candidate multimedia data using the calculation method above, and thereby finds the target multimedia data in the matching table.
In other embodiments, the tag matching score of each tag may be obtained from the weight value of the tag and the feature matching score between each candidate multimedia data and the multimedia data to be matched. For example, the tag matching score of each tag may equal the product of the weight value of the tag and the feature matching score between the candidate multimedia data and the multimedia data to be matched.
Optionally, for some feature tags, the similarity between each candidate multimedia data and the multimedia data to be matched is positively correlated with the feature matching score between the two. That is, the higher the similarity between a candidate multimedia data and the multimedia data to be matched, the higher the feature matching score between them. For example, if both wear glasses, the feature matching score is high.
For other feature tags, the similarity between each candidate multimedia data and the multimedia data to be matched is negatively correlated with the feature matching score between the two. That is, the lower the similarity between a candidate multimedia data and the multimedia data to be matched, the higher the feature matching score between them. For example, if the two have the same gender, the feature matching score is low; if the two have opposite genders, the feature matching score is high.
It should be noted that the correlation between the similarity for each feature tag and the feature matching score may be set according to the actual scenario and specific requirements, and is not specifically limited here.
Step A4: sort the tag matching scores of the candidate multimedia data to determine the target multimedia data.
In one example, the candidate multimedia data may be sorted by tag matching score from high to low, and the candidate multimedia data with the highest score is taken as the target multimedia data. The tag matching scores between the multimedia data to be matched and the multiple candidate multimedia data may be recorded in the matching table in descending or ascending order.
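Steps A2 to A4 can be sketched end to end as follows. This is an illustration under stated assumptions rather than the claimed method: tag scores lie on a 0 to 100 scale, the per-tag score is the weight multiplied by the similarity, and the per-tag scores are summed into one overall score (the summation is an assumption; the text only gives the per-tag product). All names are hypothetical.

```python
def tag_matching_score(query: dict[str, float], candidate: dict[str, float],
                       weights: dict[str, float],
                       prefer_different: frozenset = frozenset(),
                       preset: float = 100.0) -> float:
    """Weighted score over the common tags of two items (assumed aggregation)."""
    total = 0.0
    for tag in query.keys() & candidate.keys():        # step A2: common tags
        diff = abs(query[tag] - candidate[tag])
        similarity = diff if tag in prefer_different else preset - diff
        total += weights.get(tag, 1.0) * similarity    # step A3: weight * similarity
    return total


def pick_target(query, candidates, weights, prefer_different=frozenset()):
    """Step A4: rank candidates from high to low and return the best name."""
    ranked = sorted(
        candidates,
        key=lambda name: tag_matching_score(query, candidates[name],
                                            weights, prefer_different),
        reverse=True,
    )
    return ranked[0] if ranked else None


query = {"glasses": 100, "gender": 0}
candidates = {
    "video_1": {"glasses": 100, "gender": 100},
    "video_2": {"glasses": 0, "gender": 0},
    "video_3": {"glasses": 100, "hair": 50},
}
weights = {"glasses": 2.0, "gender": 1.0}   # the user cares more about glasses
target = pick_target(query, candidates, weights, frozenset({"gender"}))
```

Here `video_1` ranks first because it matches on the heavily weighted glasses tag and differs on the gender tag, which is treated as a negatively correlated tag.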
In some embodiments, to improve matching accuracy, the target multimedia data may also satisfy one or more of the following conditions.
Condition C1: the special effect editing method of the target multimedia data is the same as that of the multimedia data to be matched. For example, if both the target multimedia data and the multimedia data to be matched have undergone special effect editing that adds a mask, the two have the same special effect editing method. As another example, if a special effect template corresponds to a first special effect and a second special effect, the target multimedia data uses the first special effect, and the multimedia data to be matched uses the second special effect, the two are also considered to have the same special effect editing method.
Condition C2: the user to whom the target multimedia data belongs is an online user. That is, if the user has opened the interface of the target application on the electronic device, or the target application is running in the background on the electronic device, the user is considered to be online.
Condition C3: the distance between the publishing location of the target multimedia data and the publishing location of the original multimedia data is less than or equal to a preset distance threshold.
For example, the distance threshold may be a system default value, or it may be a target distance threshold selected by the user from multiple selectable distance thresholds. Alternatively, if the user to whom the target multimedia data belongs and the user to whom the original multimedia data belongs are in the same region, such as the same district, city, or province, the distance between the two is considered to be less than or equal to the preset distance threshold. The distance threshold may be set according to the actual situation or the specific scenario, and is not limited here.
Condition C4: the historical matching count of the target multimedia data satisfies a preset count filtering condition. The count filtering condition may be that the historical matching count falls within a preset count range. The preset count range may be a system default value, or it may be a target count range selected by the user from multiple selectable count ranges.
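Conditions C1 to C4 can be expressed as simple predicates over candidate metadata. The sketch below assumes a hypothetical `Candidate` record with precomputed fields and, for brevity, requires all four conditions at once, although the text allows any one or more of them; the thresholds are placeholder values.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    effect_template: str   # template used for special effect editing (C1)
    owner_online: bool     # whether the owning user is online (C2)
    distance_km: float     # distance between publishing locations (C3)
    match_count: int       # historical matching count (C4)


def satisfies_conditions(c: Candidate, query_template: str,
                         max_distance_km: float = 10.0,
                         count_range: tuple[int, int] = (0, 5)) -> bool:
    """Check C1 to C4 together; the thresholds here are assumed defaults."""
    low, high = count_range
    return (c.effect_template == query_template      # C1: same editing method
            and c.owner_online                       # C2: owner is online
            and c.distance_km <= max_distance_km     # C3: within distance threshold
            and low <= c.match_count <= high)        # C4: count within preset range


nearby = Candidate("mask", True, 3.2, 1)
offline = Candidate("mask", False, 3.2, 1)
ok_nearby = satisfies_conditions(nearby, "mask")
ok_offline = satisfies_conditions(offline, "mask")
```

Comparing templates rather than individual effects reflects the second C1 example, in which two effects from the same template count as the same editing method.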
In one example, to improve matching flexibility, if the user cannot obtain target multimedia data through steps A1 to A4, the target multimedia data may be selected from the candidate multimedia data using at least one of conditions C1 to C4.
In another example, to improve matching accuracy, if the user obtains multiple target multimedia data through steps A1 to A4, the multiple target multimedia data may be further filtered using at least one of conditions C1 to C4 to obtain the target multimedia data.
In still other examples, after the multimedia data to be matched is obtained, at least one of conditions C1 to C4 may be used directly to filter the target multimedia data out of the multiple candidate multimedia data.
In yet another example, to speed up matching, the candidate multimedia data may themselves be obtained by filtering with at least one of conditions C1 to C4.
In some embodiments, if the user filters for the target multimedia data using at least two of conditions C1 to C4, the candidate multimedia data may be filtered in a preset order of condition use until the last condition has been applied, yielding the target multimedia data. Alternatively, filtering may stop as soon as the number of target multimedia data obtained after applying one more condition falls within a preset quantity range, and the target multimedia data is thereby obtained.
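The ordered filtering described above might look like the following sketch. It assumes conditions are supplied as predicates in their preset order and that the "preset quantity range" is an inclusive pair of bounds; the function name `filter_in_order` is hypothetical.

```python
from typing import Callable, Iterable, Sequence, TypeVar

T = TypeVar("T")


def filter_in_order(candidates: Iterable[T],
                    conditions: Sequence[Callable[[T], bool]],
                    min_count: int = 1, max_count: int = 3) -> list[T]:
    """Apply conditions in order; stop early once the count is within range."""
    remaining = list(candidates)
    for condition in conditions:
        remaining = [c for c in remaining if condition(c)]
        # Stop as soon as one more condition brings the count into the range.
        if min_count <= len(remaining) <= max_count:
            break
    return remaining


# Toy data: integers stand in for candidate multimedia data.
conditions = [lambda n: n % 2 == 0, lambda n: n > 4, lambda n: n < 9]
early_stop = filter_in_order(range(1, 11), conditions)                 # stops after two conditions
all_applied = filter_in_order(range(1, 11), conditions, max_count=1)   # uses all three
```

If the range is never reached, the result is simply whatever remains after the last condition, which corresponds to the first alternative in the text.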
In some embodiments, the target multimedia data is also obtained by matching against a second multimedia feature of the original multimedia data.
In one example, to improve matching flexibility, if the user cannot obtain target multimedia data through steps A1 to A4, the target multimedia data may be matched from the candidate multimedia data using the second multimedia feature of the original multimedia data.
In another example, to improve matching accuracy, if the user obtains multiple target multimedia data through steps A1 to A4, the multiple target multimedia data may be further filtered using the second multimedia feature of the original multimedia data to obtain the target multimedia data.
In some examples, the second multimedia feature may be a multimedia feature of the target object in the original multimedia data. The second multimedia feature is similar to the first multimedia feature described above, and the method of querying the target multimedia data with the second multimedia feature is similar to the method of querying it with the first multimedia feature, so the details are not repeated here.
S960. Generate composite multimedia data based on the multimedia data to be matched and the target multimedia data, where the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched. S960 is similar to S330, so the details are not repeated here.
S970. Display the composite multimedia data. S970 is similar to S340, so the details are not repeated here.
The multimedia display method of this embodiment of the present disclosure can use the first multimedia feature of the multimedia data to be matched to accurately match, from the multiple candidate multimedia data, target multimedia data with the same features, so that the generated composite multimedia data includes a first special effect object and a second special effect object with a high degree of feature matching, which makes the multimedia display method more entertaining.
In some embodiments of the present disclosure, FIG. 10 shows a schematic flowchart of yet another multimedia display method provided by an embodiment of the present disclosure.
In this embodiment of the present disclosure, the multimedia display method may be executed by an electronic device. The electronic device may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle-mounted terminals (for example, vehicle navigation terminals), and wearable devices, as well as fixed terminals such as digital TVs, desktop computers, and smart home devices.
As shown in FIG. 10, the multimedia display method may include the following steps.
S1010. Receive original multimedia data. S1010 is similar to S310, so the details are not repeated here.
S1020. Perform special effect editing on the original multimedia data to obtain the multimedia data to be matched. S1020 is similar to S320, so the details are not repeated here.
S1030. Generate composite multimedia data based on the multimedia data to be matched and the target multimedia data, where the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched. S1030 is similar to S330, so the details are not repeated here.
S1040. Display the composite multimedia data. S1040 is similar to S340, so the details are not repeated here.
S1050. When a trigger operation on the composite multimedia data is detected, publish the composite multimedia data to the user to whom the original multimedia data belongs and the user to whom the target multimedia data belongs.
In some embodiments, when the user wants to interact with the user to whom the target multimedia data belongs, the user performs a trigger operation on the composite multimedia data. The trigger operation may be performed when the composite multimedia data is generated or after the composite multimedia data is previewed; its timing is not limited.
In one embodiment, the electronic device may publish the composite multimedia data, through a server, to the user to whom the original multimedia data belongs and the user to whom the target multimedia data belongs.
In one embodiment, the electronic device may display the composite multimedia data in the image/video favorites or display bar of the target application of the user to whom the original multimedia data belongs and of the user to whom the target multimedia data belongs, and add a mark to the corresponding icon to prompt the user to view the composite multimedia data.
In another embodiment, S1050 may include the following steps.
Step D1: send first prompt information to the user to whom the original multimedia data belongs, where the first prompt information is used to trigger display of the composite multimedia data and display of the social homepage of the user to whom the target multimedia data belongs.
Optionally, the first prompt information may be published as text, pictures, voice, or the like through a chat box, a display window on the interface, or a broadcast bar on the interface. For example, the first prompt information may take the form "You just attended a masquerade party with XXX (the scene corresponding to the composite multimedia video). Visit their homepage or chat with them."
For example, the first prompt information may include a link, such as text or a QR code, to the composite multimedia data display interface, or the user may jump to the composite multimedia data display interface by triggering the information bar of the first prompt information. Optionally, to facilitate interaction, the first prompt information may also include a link, such as text or a QR code, to the user to whom the target multimedia data belongs. Alternatively, the composite multimedia data display interface may include a control for visiting the social homepage of the user to whom the target multimedia data belongs, a control for adding that user as a friend, or a control for starting a chat with that user.
Step D2: send second prompt information to the user to whom the target multimedia data belongs, where the second prompt information is used to trigger playback of the composite multimedia data and display of the social homepage of the user to whom the original multimedia data belongs.
The second prompt information is similar to the first prompt information, so the details are not repeated here.
With this embodiment of the present disclosure, the composite multimedia data can be published to the user to whom the original multimedia data belongs and the user to whom the target multimedia data belongs, so that the two users can interact through the composite multimedia data, which makes the multimedia display more entertaining and improves the user experience.
FIG. 11 shows a schematic flowchart of a multimedia matching method provided by an embodiment of the present disclosure.
In this embodiment of the present disclosure, the multimedia matching method may be executed by a server. The server may be a device with storage and computing capabilities, such as a cloud server or a server cluster.
As shown in FIG. 11, the multimedia matching method may include the following steps.
S1110. Receive the multimedia data to be matched, where the multimedia data to be matched is obtained by performing special effect processing on original multimedia data.
S1120. Extract a first multimedia feature from the multimedia data to be matched.
S1130. Acquire multiple candidate multimedia data corresponding to the multimedia data to be matched.
S1140. Among the multiple candidate multimedia data, query the target multimedia data that matches the first multimedia feature, where the target multimedia data is used to generate composite multimedia data together with the multimedia data to be matched.
In some embodiments of the present disclosure, S1140 may include:
determining at least one feature tag corresponding to the first multimedia feature;
for each candidate multimedia data, determining the common tags that are the same as the at least one feature tag;
calculating the tag matching score between the multimedia data to be matched and each candidate multimedia data according to the weight values corresponding to the common tags;
sorting the tag matching scores of the candidate multimedia data to determine the target multimedia data.
It should be noted that the multimedia matching method shown in S1110 to S1140 is similar to the multimedia display method described above with reference to S910 to S970, so the details are not repeated here.
In some embodiments of the present disclosure, after S1140, the multimedia matching method may further include: receiving a publishing instruction for the composite multimedia data, and publishing the composite multimedia data to the user to whom the original multimedia data belongs and the user to whom the target multimedia data belongs. The publishing instruction is generated by the electronic device after it detects a trigger operation on the composite multimedia data. This step is similar to S1050, so the details are not repeated here.
In some embodiments of the present disclosure, after S1140, the multimedia matching method may further include:
sending first prompt information to the user to whom the original multimedia data belongs, where the first prompt information is used to trigger display of the composite multimedia data and display of the social homepage of the user to whom the target multimedia data belongs. This step is similar to step D1 above, so the details are not repeated here.
sending second prompt information to the user to whom the target multimedia data belongs, where the second prompt information is used to trigger playback of the composite multimedia data and display of the social homepage of the user to whom the original multimedia data belongs. This step is similar to step D2 above, so the details are not repeated here.
In the multimedia matching method of this embodiment of the present disclosure, after special effect editing is performed on the received original multimedia data, composite multimedia data is generated and displayed based on the resulting multimedia data to be matched and the target multimedia data. Because the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched, the composite multimedia data derived from the original multimedia data can include, in addition to the special effects of the multimedia data to be matched, the content of the target multimedia data that matches the multimedia data to be matched. The multimedia image therefore contains multiple elements, which enriches the beautification effect of the multimedia data and makes its display more entertaining. Users can also interact through the multimedia data, which enables diverse interaction between users and improves the user experience.
An embodiment of the present disclosure further provides a multimedia display apparatus for implementing the multimedia display method described above, which is described below with reference to FIG. 12.
In this embodiment of the present disclosure, the multimedia display apparatus may be an electronic device; for example, the multimedia display apparatus may be the first electronic device 101 in the client shown in FIG. 1. The electronic device may include devices with communication functions such as mobile phones, tablet computers, desktop computers, notebook computers, vehicle-mounted terminals, wearable electronic devices, all-in-one computers, and smart home devices, and may also be a device simulated by a virtual machine or an emulator.
FIG. 12 shows a schematic structural diagram of a multimedia display apparatus provided by an embodiment of the present disclosure.
As shown in FIG. 12, the multimedia display apparatus 1200 may include a data receiving unit 1210, a special effect editing unit 1220, a data synthesis unit 1230, and a data display unit 1240.
The data receiving unit 1210 is configured to receive original multimedia data;
the special effect editing unit 1220 is configured to perform special effect editing on the original multimedia data to obtain the multimedia data to be matched;
the data synthesis unit 1230 is configured to generate composite multimedia data based on the multimedia data to be matched and the target multimedia data, where the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched;
the data display unit 1240 is configured to display the composite multimedia data.
The multimedia display apparatus of this embodiment of the present disclosure, after performing special effect editing on the received original multimedia data, generates and displays composite multimedia data based on the resulting multimedia data to be matched and the target multimedia data. Because the target multimedia data is obtained by matching against the first multimedia feature of the multimedia data to be matched, the composite multimedia data derived from the original multimedia data can include, in addition to the special effects of the multimedia data to be matched, the content of the target multimedia data that matches the multimedia data to be matched. The multimedia image therefore contains multiple elements, which enriches the beautification effect of the multimedia data and makes its display more entertaining. Users can also interact through the multimedia data, which enables diverse interaction between users and improves the user experience.
在本公开一些实施例中,该特效编辑单元1220可以进一步配置为:响应于对目标特效模板的模板选择操作,基于目标特效模板对原始多媒体数据进行特效编辑,得到待匹配多媒体数据;In some embodiments of the present disclosure, the special effect editing unit 1220 may be further configured to: in response to the template selection operation on the target special effect template, perform special effect editing on the original multimedia data based on the target special effect template to obtain the multimedia data to be matched;
在本公开另一些实施例中,该特效编辑单元1220可以进一步配置为:基于原始多媒体数据对应的目标特效模板,对原始多媒体数据进行特效编辑,得到待匹配多媒体数据。In other embodiments of the present disclosure, the special effect editing unit 1220 may be further configured to: based on the target special effect template corresponding to the original multimedia data, perform special effect editing on the original multimedia data to obtain the multimedia data to be matched.
在本公开一些实施例中,该多媒体显示装置1200还可以包括特征提取单元、数据获取单元以及数据查询单元。In some embodiments of the present disclosure, the multimedia display device 1200 may further include a feature extraction unit, a data acquisition unit, and a data query unit.
特征提取单元,被配置为从待匹配多媒体数据中提取第一多媒体特征;A feature extraction unit configured to extract a first multimedia feature from the multimedia data to be matched;
数据获取单元,被配置为获取待匹配多媒体数据对应的多个候选多媒体数据;a data acquisition unit configured to acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched;
数据查询单元,被配置为在多个候选多媒体数据中,查询与第一多媒体特征相匹配的目标多媒体数据。The data query unit is configured to query the target multimedia data matching the first multimedia feature among the plurality of candidate multimedia data.
在本公开一些实施例中,该数据查询单元可以进一步配置为:In some embodiments of the present disclosure, the data query unit may be further configured as:
确定第一多媒体特征对应的至少一个特征标签;determining at least one feature label corresponding to the first multimedia feature;
针对每个候选多媒体数据,确定与至少一个特征标签相同的共同标签;For each candidate multimedia data, determine a common label identical to at least one feature label;
根据共同标签对应的权重值,计算待匹配多媒体数据与每个候选多媒体数据之间的标签匹配分数;Calculate the label matching score between the multimedia data to be matched and each candidate multimedia data according to the weight value corresponding to the common label;
对候选多媒体数据的标签匹配分数进行排序,确定目标多媒体数据。The tag matching scores of the candidate multimedia data are sorted to determine the target multimedia data.
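The four steps above (feature tags, common tags, weighted score, ranking) can be sketched as follows. This is a minimal illustration under stated assumptions: the score is taken to be the sum of the weights of the common tags, tags without a configured weight contribute 0, and the top-ranked candidate is chosen as the target. The source does not fix any of these details, and the names `tag_match_score` and `pick_target` are hypothetical.

```python
from typing import Dict, List, Set, Tuple


def tag_match_score(
    query_tags: Set[str],
    candidate_tags: Set[str],
    weights: Dict[str, float],
) -> float:
    # Common tags are the tags present on both sides.
    common = query_tags & candidate_tags
    # Assumption: the score is the sum of the weights of the common tags.
    return sum(weights.get(tag, 0.0) for tag in common)


def pick_target(
    query_tags: Set[str],
    candidates: Dict[str, Set[str]],
    weights: Dict[str, float],
) -> Tuple[str, List[Tuple[str, float]]]:
    if not candidates:
        raise ValueError("no candidate multimedia data to rank")
    # Score every candidate, then sort from highest to lowest score.
    ranked = sorted(
        ((cid, tag_match_score(query_tags, tags, weights))
         for cid, tags in candidates.items()),
        key=lambda pair: pair[1],
        reverse=True,
    )
    return ranked[0][0], ranked
```

Summing weights keeps the score easy to explain; a deployment could just as well normalize by the number of tags or select randomly among the top few candidates.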
在本公开一些实施例中,目标多媒体数据满足下列中的至少一项:In some embodiments of the present disclosure, the target multimedia data satisfies at least one of the following:
目标多媒体数据的特效编辑方式与待匹配多媒体数据的特效编辑方式相同;The special effect editing method of the target multimedia data is the same as the special effect editing method of the multimedia data to be matched;
目标多媒体数据所属用户为在线用户;The user to which the target multimedia data belongs is an online user;
目标多媒体数据的发布位置与原始多媒体数据的发布位置之间的位置距离小于或等于预设的距离阈值;The location distance between the distribution location of the target multimedia data and the distribution location of the original multimedia data is less than or equal to a preset distance threshold;
目标多媒体数据的历史匹配次数满足预设的次数筛选条件。The historical matching times of the target multimedia data meet the preset times filtering condition.
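The conditions above can be expressed as independent predicates over a candidate. The sketch below is illustrative only: the `Candidate` fields are hypothetical, the haversine great-circle formula is one possible choice of location distance (the source does not name one), and the count filter is assumed to be an upper bound on historical matches.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    effect_id: str      # special effect editing method used
    owner_online: bool  # whether the owning user is online
    lat: float          # publishing location, degrees
    lon: float
    match_count: int    # historical number of matches


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    # Great-circle distance on a sphere with mean Earth radius 6371 km.
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def satisfied_conditions(
    c: Candidate, effect_id: str, lat: float, lon: float,
    max_km: float, max_matches: int,
) -> List[str]:
    met = []
    if c.effect_id == effect_id:
        met.append("same_effect")
    if c.owner_online:
        met.append("online")
    if haversine_km(lat, lon, c.lat, c.lon) <= max_km:
        met.append("nearby")
    if c.match_count <= max_matches:  # assumed form of the count filter
        met.append("match_count")
    return met


def eligible(c: Candidate, effect_id: str, lat: float, lon: float,
             max_km: float, max_matches: int) -> bool:
    # "At least one of the following": one satisfied condition suffices.
    return len(satisfied_conditions(c, effect_id, lat, lon, max_km, max_matches)) >= 1
```

Returning the list of satisfied conditions rather than a bare boolean makes it simple to tighten the rule later, for example by requiring all four conditions.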
在本公开一些实施例中,目标多媒体数据还根据原始多媒体数据的第二多媒体特征匹配得到。In some embodiments of the present disclosure, the target multimedia data is also obtained by matching the second multimedia feature of the original multimedia data.
在本公开一些实施例中,该多媒体显示装置1200还可以包括数据发布单元。In some embodiments of the present disclosure, the multimedia display device 1200 may further include a data publishing unit.
数据发布单元,被配置为当检测到对合成多媒体数据的触发操作时,向原始多媒体数据所属用户和目标多媒体数据所属用户发布合成多媒体数据。The data distribution unit is configured to distribute the composite multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs when a trigger operation on the composite multimedia data is detected.
在本公开一些实施例中,该数据发布单元可以进一步配置为:In some embodiments of the present disclosure, the data publishing unit may be further configured as:
向原始多媒体数据所属用户发送第一提示信息,第一提示信息用于触发显示合成多媒体数据以及显示目标多媒体数据所属用户的社交主页;Sending first prompt information to the user to whom the original multimedia data belongs, the first prompt information is used to trigger display of the composite multimedia data and display the social home page of the user to which the target multimedia data belongs;
向目标多媒体数据所属用户发送第二提示信息,第二提示信息用于触发播放合成多媒体数据以及显示原始多媒体数据所属用户的社交主页。Sending second prompting information to the user to which the target multimedia data belongs, the second prompting information is used to trigger playing the synthesized multimedia data and displaying the social home page of the user to which the original multimedia data belongs.
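The two prompts are symmetric: each user receives the synthesized data together with a link to the other user's social home page. A minimal sketch of building them is shown below; the `Prompt` record and its field names are assumptions, and delivery (push notification, in-app message, and so on) is left out because the source does not specify it.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Prompt:
    recipient: str   # user who receives the prompt
    media_id: str    # identifier of the synthesized multimedia data
    action: str      # "display" for the first prompt, "play" for the second
    profile_of: str  # user whose social home page is shown


def build_prompts(original_owner: str, target_owner: str, composite_id: str) -> List[Prompt]:
    # First prompt: to the owner of the original data, pointing at the
    # target owner's social home page.
    first = Prompt(original_owner, composite_id, "display", target_owner)
    # Second prompt: to the owner of the target data, pointing at the
    # original owner's social home page.
    second = Prompt(target_owner, composite_id, "play", original_owner)
    return [first, second]
```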
需要说明的是,图12所示的多媒体显示装置1200可以执行图3至图10所示的方法实施例中的各个步骤,并且实现图3至图10所示的方法实施例中的各个过程和效果,在此不做赘述。It should be noted that the multimedia display device 1200 shown in FIG. 12 can execute each step of the method embodiments shown in FIG. 3 to FIG. 10 and achieve each process and effect of those method embodiments, which are not repeated here.
本公开实施例还提供了一种用于实现上述多媒体匹配方法的多媒体匹配装置,下面结合图13进行说明。在本公开实施例中,该多媒体匹配装置可以为服务器,例如,该多媒体匹配装置可以为图1中所示的客户端中的服务器102。其中,服务器可以是云服务器或者服务器集群等具有存储及计算功能的设备。An embodiment of the present disclosure also provides a multimedia matching device for implementing the above multimedia matching method, described below with reference to FIG. 13. In the embodiments of the present disclosure, the multimedia matching device may be a server; for example, it may be the server 102 in the client shown in FIG. 1. The server may be a device with storage and computing functions, such as a cloud server or a server cluster.
图13示出了本公开实施例提供的一种多媒体匹配装置的结构示意图。Fig. 13 shows a schematic structural diagram of a multimedia matching device provided by an embodiment of the present disclosure.
如图13所示,该多媒体匹配装置1300可以包括数据接收单元1310、特征提取单元1320、数据获取单元1330和数据查询单元1340。As shown in FIG. 13 , the multimedia matching device 1300 may include a data receiving unit 1310 , a feature extraction unit 1320 , a data obtaining unit 1330 and a data query unit 1340 .
数据接收单元1310,配置为接收待匹配多媒体数据,待匹配多媒体数据基于对原始多媒体数据进行特效处理得到的;The data receiving unit 1310 is configured to receive the multimedia data to be matched, and the multimedia data to be matched is obtained based on special effects processing on the original multimedia data;
特征提取单元1320,配置为从待匹配多媒体数据中提取第一多媒体特征;The feature extraction unit 1320 is configured to extract a first multimedia feature from the multimedia data to be matched;
数据获取单元1330,配置为获取待匹配多媒体数据对应的多个候选多媒体数据;The data obtaining unit 1330 is configured to obtain a plurality of candidate multimedia data corresponding to the multimedia data to be matched;
数据查询单元1340,配置为在多个候选多媒体数据中,查询与第一多媒体特征相匹配的目标多媒体数据,目标多媒体数据用于与待匹配多媒体数据生成合并多媒体数据。The data query unit 1340 is configured to query the target multimedia data matching the first multimedia feature among the plurality of candidate multimedia data, and the target multimedia data is used to generate combined multimedia data with the multimedia data to be matched.
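The division of work among the four units can be sketched as a small service class with pluggable extraction and candidate lookup. Everything here is hypothetical: the dictionaries standing in for multimedia data, the class name `MatchingService`, and the use of set overlap as the matching criterion are assumptions chosen to keep the example short, not details taken from the disclosure.

```python
from typing import Callable, Iterable, Optional, Set

FeatureExtractor = Callable[[dict], Set[str]]
CandidateSource = Callable[[dict], Iterable[dict]]


class MatchingService:
    def __init__(self, extract: FeatureExtractor, fetch_candidates: CandidateSource):
        self._extract = extract
        self._fetch = fetch_candidates

    def handle(self, to_match: dict) -> Optional[dict]:
        # Data receiving unit 1310: `to_match` is the received data.
        feature = self._extract(to_match)          # feature extraction unit 1320
        candidates = list(self._fetch(to_match))   # data acquisition unit 1330
        # Data query unit 1340: keep the candidate whose features overlap
        # most with the first multimedia feature.
        best, best_overlap = None, 0
        for cand in candidates:
            overlap = len(feature & self._extract(cand))
            if overlap > best_overlap:
                best, best_overlap = cand, overlap
        return best  # None when no candidate shares any feature
```

Injecting the extractor and the candidate source keeps the query logic testable without a real feature model or database.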
本公开实施例的多媒体匹配装置,在对所接收的原始多媒体数据进行特效编辑之后,基于编辑得到的待匹配多媒体数据与目标多媒体数据,生成并显示合成多媒体数据。由于目标多媒体数据是基于待匹配多媒体数据的第一多媒体特征匹配得到的,基于原始多媒体数据得到的合成多媒体数据中除了待匹配多媒体数据中的特效效果之外,还可以包括与待匹配多媒体数据相匹配的目标多媒体数据的内容,使得多媒体数据图像具有多种元素,进而丰富了多媒体数据的美化效果,从而提高了多媒体数据显示的趣味性,以及还可以使得用户可以通过多媒体数据进行交互,实现了用户之间的多样性交互,提高了用户的使用体验。With the multimedia matching device of the embodiments of the present disclosure, after special effect editing is performed on the received original multimedia data, synthesized multimedia data is generated and displayed based on the edited multimedia data to be matched and the target multimedia data. Because the target multimedia data is obtained by matching the first multimedia feature of the multimedia data to be matched, the synthesized multimedia data derived from the original multimedia data can include, in addition to the special effects of the multimedia data to be matched, the content of the target multimedia data that matches it. The resulting image therefore contains multiple elements, which enriches the beautification effect and makes the display of multimedia data more engaging. Users can also interact with one another through the multimedia data, which enables diverse interaction between users and improves the user experience.
在本公开的一些实施例中,该数据查询单元1340可以进一步被配置为:In some embodiments of the present disclosure, the data query unit 1340 may be further configured to:
确定第一多媒体特征对应的至少一个特征标签;determining at least one feature label corresponding to the first multimedia feature;
针对每个候选多媒体数据,确定与至少一个特征标签相同的共同标签;For each candidate multimedia data, determine a common label identical to at least one feature label;
根据共同标签对应的权重值,计算待匹配多媒体数据与每个候选多媒体数据之间的标签匹配分数;Calculate the label matching score between the multimedia data to be matched and each candidate multimedia data according to the weight value corresponding to the common label;
对候选多媒体数据的标签匹配分数进行排序,确定目标多媒体数据。The tag matching scores of the candidate multimedia data are sorted to determine the target multimedia data.
在本公开的一些实施例中,目标多媒体数据满足下列中的至少一项:In some embodiments of the present disclosure, the target multimedia data satisfies at least one of the following:
目标多媒体数据的特效编辑方式与待匹配多媒体数据的特效编辑方式相同;The special effect editing method of the target multimedia data is the same as the special effect editing method of the multimedia data to be matched;
目标多媒体数据所属用户为在线用户;The user to which the target multimedia data belongs is an online user;
目标多媒体数据的发布位置与原始多媒体数据的发布位置之间的位置距离小于或等于预设的距离阈值;The location distance between the distribution location of the target multimedia data and the distribution location of the original multimedia data is less than or equal to a preset distance threshold;
目标多媒体数据的历史匹配次数满足预设的次数筛选条件。The historical matching times of the target multimedia data meet the preset times filtering condition.
在本公开的一些实施例中,目标多媒体数据还根据原始多媒体数据的第二多媒体特征匹配得到。In some embodiments of the present disclosure, the target multimedia data is also matched according to the second multimedia feature of the original multimedia data.
在本公开一些实施例中,该多媒体匹配装置1300还可以包括数据发布单元。In some embodiments of the present disclosure, the multimedia matching apparatus 1300 may further include a data publishing unit.
数据发布单元,被配置为响应于合成多媒体数据的发布指令,向原始多媒体数据所属用户和目标多媒体数据所属用户发布合成多媒体数据。其中,该发布指令是电子设备在检测到对合成多媒体数据的触发操作后生成的。The data distribution unit is configured to distribute the composite multimedia data to the user to which the original multimedia data belongs and the user to which the target multimedia data belongs in response to the composite multimedia data distribution instruction. Wherein, the publishing instruction is generated by the electronic device after detecting a trigger operation on the synthesized multimedia data.
在本公开一些实施例中,该数据发布单元可以进一步配置为:In some embodiments of the present disclosure, the data publishing unit may be further configured as:
向原始多媒体数据所属用户发送第一提示信息,第一提示信息用于触发显示合成多媒体数据以及显示目标多媒体数据所属用户的社交主页;Sending first prompt information to the user to whom the original multimedia data belongs, the first prompt information is used to trigger display of the composite multimedia data and display the social home page of the user to which the target multimedia data belongs;
向目标多媒体数据所属用户发送第二提示信息,第二提示信息用于触发播放合成多媒体数据以及显示原始多媒体数据所属用户的社交主页。The second prompt information is sent to the user to which the target multimedia data belongs, and the second prompt information is used to trigger the playing of the composite multimedia data and display the social home page of the user to which the original multimedia data belongs.
需要说明的是,图13所示的多媒体匹配装置1300可以执行图11所示的方法实施例中的各个步骤,并且实现图11所示的方法实施例中的各个过程和效果,在此不做赘述。It should be noted that the multimedia matching device 1300 shown in FIG. 13 can execute each step of the method embodiment shown in FIG. 11 and achieve each process and effect of that method embodiment, which are not repeated here.
本公开实施例还提供了一种计算设备,该计算设备可以包括处理器和存储器,存储器可以用于存储可执行指令。其中,处理器可以用于从存储器中读取可执行指令,并执行可执行指令以实现上述实施例中的多媒体显示方法和/或者多媒体匹配方法。An embodiment of the present disclosure also provides a computing device, which may include a processor and a memory, and the memory may be used to store executable instructions. Wherein, the processor may be configured to read executable instructions from the memory, and execute the executable instructions to implement the multimedia display method and/or the multimedia matching method in the above embodiments.
图14示出了本公开实施例提供的一种计算设备的结构示意图。下面具体参考图14,其示出了用来实现本公开实施例中的计算设备1400的结构示意图。Fig. 14 shows a schematic structural diagram of a computing device provided by an embodiment of the present disclosure. Referring to FIG. 14 in detail below, it shows a schematic structural diagram for implementing a computing device 1400 in an embodiment of the present disclosure.
在一些实施例中,在实现上述实施例中的多媒体显示方法时,本公开实施例中的计算设备1400可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)、可穿戴设备等等的移动终端以及诸如数字TV、台式计算机、智能家居设备等等的固定终端。In some embodiments, when implementing the multimedia display method of the above embodiments, the computing device 1400 may include, but is not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), vehicle-mounted terminals (for example, vehicle-mounted navigation terminals), and wearable devices, as well as fixed terminals such as digital TVs, desktop computers, and smart home devices.
在另一些实施例中,在实现上述实施例中的多媒体匹配方法时,本公开实施例中的计算设备1400可以包括但不限于云服务器或者服务器集群等具有存储及计算功能的设备。In other embodiments, when implementing the multimedia matching method in the above embodiments, the computing device 1400 in the embodiments of the present disclosure may include, but not limited to, devices with storage and computing functions such as cloud servers or server clusters.
需要说明的是,图14示出的计算设备1400仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。It should be noted that the computing device 1400 shown in FIG. 14 is only an example, and should not limit the functions and scope of use of this embodiment of the present disclosure.
如图14所示,该计算设备1400可以包括处理装置(例如中央处理器、图形处理器等)1401,其可以根据存储在只读存储器(ROM)1402中的程序或者从存储装置1408加载到随机访问存储器(RAM)1403中的程序而执行各种适当的动作和处理。在RAM 1403中,还存储有计算设备1400操作所需的各种程序和数据。处理装置1401、ROM 1402以及RAM 1403通过总线1404彼此相连。输入/输出(I/O)接口1405也连接至总线1404。As shown in FIG. 14, the computing device 1400 may include a processing device (such as a central processing unit or a graphics processing unit) 1401, which can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 1402 or a program loaded from a storage device 1408 into a random access memory (RAM) 1403. The RAM 1403 also stores various programs and data required for the operation of the computing device 1400. The processing device 1401, the ROM 1402, and the RAM 1403 are connected to each other through a bus 1404. An input/output (I/O) interface 1405 is also connected to the bus 1404.
通常,以下装置可以连接至I/O接口1405:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置1406;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置1407;包括例如磁带、硬盘等的存储装置1408;以及通信装置1409。通信装置1409可以允许计算设备1400与其他设备进行无线或有线通信以交换数据。虽然图14示出了具有各种装置的计算设备1400,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Typically, the following devices can be connected to the I/O interface 1405: an input device 1406 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, or gyroscope; an output device 1407 including, for example, a liquid crystal display (LCD), speaker, or vibrator; a storage device 1408 including, for example, a magnetic tape or hard disk; and a communication device 1409. The communication device 1409 may allow the computing device 1400 to communicate with other devices wirelessly or by wire to exchange data. Although FIG. 14 shows the computing device 1400 with various devices, it should be understood that not all of the illustrated devices need to be implemented or present. More or fewer devices may alternatively be implemented or provided.
本公开实施例还提供了一种计算机可读存储介质,该存储介质存储有计算机程序,当计算机程序被处理器执行时,使得处理器实现上述实施例中的多媒体显示方法或者多媒体匹配方法。An embodiment of the present disclosure also provides a computer-readable storage medium, the storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the multimedia display method or the multimedia matching method in the foregoing embodiments.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts can be implemented as computer software programs.
本公开实施例还提供了一种计算机程序产品,该计算机程序产品可以包括计算机程序,当计算机程序被处理器执行时,使得处理器实现上述实施例中的多媒体显示方法或者多媒体匹配方法。An embodiment of the present disclosure also provides a computer program product, which may include a computer program that, when executed by a processor, causes the processor to implement the multimedia display method or the multimedia matching method of the foregoing embodiments.
例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置1409从网络上被下载和安装,或者从存储装置1408被安装,或者从ROM 1402被安装。在该计算机程序被处理装置1401执行时,执行本公开实施例的多媒体显示方法或者多媒体匹配方法中限定的上述功能。For example, embodiments of the present disclosure include a computer program product, which includes a computer program carried on a non-transitory computer readable medium, where the computer program includes program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network via communication means 1409, or from storage means 1408, or from ROM 1402. When the computer program is executed by the processing device 1401, the above-mentioned functions defined in the multimedia display method or the multimedia matching method of the embodiment of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是(但不限于)电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that the above-mentioned computer-readable medium of the present disclosure may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. A computer-readable signal medium, by contrast, may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the foregoing. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device. Program code embodied on a computer-readable medium may be transmitted by any appropriate medium, including but not limited to wires, optical cables, RF (radio frequency), or any suitable combination of the above.
在一些实施方式中,客户端、服务器可以利用诸如HTTP之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, the client and the server can communicate using any currently known or future developed network protocol, such as HTTP, and can be interconnected with any form or medium of digital data communication (for example, a communication network). Examples of communication networks include local area networks ("LANs"), wide area networks ("WANs"), internetworks (for example, the Internet), peer-to-peer networks (for example, ad hoc peer-to-peer networks), and any currently known or future developed network.
上述计算机可读介质可以是上述计算设备中所包含的;也可以是单独存在,而未装配入该计算设备中。The above-mentioned computer-readable medium may be included in the above-mentioned computing device, or may exist independently without being assembled into the computing device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该计算设备执行时,使得该计算设备执行:The above computer-readable medium carries one or more programs which, when executed by the computing device, cause the computing device to:
接收原始多媒体数据;对原始多媒体数据进行特效编辑,得到待匹配多媒体数据;基于待匹配多媒体数据和目标多媒体数据,生成合成多媒体数据,目标多媒体数据根据待匹配多媒体数据的第一多媒体特征匹配得到;显示合成多媒体数据。Receive the original multimedia data; edit the original multimedia data with special effects to obtain the multimedia data to be matched; generate synthetic multimedia data based on the multimedia data to be matched and the target multimedia data, and the target multimedia data is matched according to the first multimedia feature of the multimedia data to be matched Get; display composite multimedia data.
或者,接收待匹配多媒体数据,待匹配多媒体数据基于对原始多媒体数据进行特效处理得到的;从待匹配多媒体数据中提取第一多媒体特征;获取待匹配多媒体数据对应的多个候选多媒体数据;在多个候选多媒体数据中,查询与第一多媒体特征相匹配的目标多媒体数据,目标多媒体数据用于与待匹配多媒体数据生成合并多媒体数据。Alternatively: receive multimedia data to be matched, which is obtained by performing special effect processing on original multimedia data; extract a first multimedia feature from the multimedia data to be matched; acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and query, among the plurality of candidate multimedia data, target multimedia data that matches the first multimedia feature, the target multimedia data being used together with the multimedia data to be matched to generate merged multimedia data.
在本公开实施例中,可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言-诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言-诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)-连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。In the embodiments of the present disclosure, computer program codes for performing the operations of the present disclosure may be written in one or more programming languages or combinations thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk, C++, and also conventional procedural programming languages - such as "C" or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In cases involving a remote computer, the remote computer can be connected to the user computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as through an Internet service provider). Internet connection).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved. It should also be noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
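As an illustration of two successive blocks being executed substantially concurrently, the sketch below runs two steps that do not depend on each other's output in parallel and obtains the same result as running them in order. The step functions are hypothetical placeholders, and the choice of a thread pool from the Python standard library is an assumption for illustration only.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Set, Tuple


def extract_feature(media: dict) -> Set[str]:
    # Placeholder for feature extraction from the data to be matched.
    return set(media["tags"])


def fetch_candidates(media: dict) -> List[str]:
    # Placeholder for looking up candidate data; depends only on the input,
    # not on the extracted feature, so it can run independently.
    return sorted(f"{media['effect']}-{i}" for i in range(3))


def run_sequential(media: dict) -> Tuple[Set[str], List[str]]:
    return extract_feature(media), fetch_candidates(media)


def run_concurrent(media: dict) -> Tuple[Set[str], List[str]]:
    # Both blocks are submitted at once; results are collected afterwards.
    with ThreadPoolExecutor(max_workers=2) as pool:
        feature_future = pool.submit(extract_feature, media)
        candidates_future = pool.submit(fetch_candidates, media)
        return feature_future.result(), candidates_future.result()
```

Reordering or parallelizing is only safe when neither block consumes the other's output, which is the dependency condition the paragraph above refers to.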
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定。The units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of a unit does not constitute a limitation of the unit itself under certain circumstances.
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。The functions described herein above may be performed at least in part by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), System on Chips (SOCs), Complex Programmable Logical device (CPLD) and so on.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device. A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only a preferred embodiment of the present disclosure and an illustration of the applied technical principle. Those skilled in the art should understand that the disclosure scope involved in this disclosure is not limited to the technical solution formed by the specific combination of the above-mentioned technical features, but also covers the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of equivalent features. For example, a technical solution formed by replacing the above-mentioned features with (but not limited to) technical features with similar functions disclosed in this disclosure.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。In addition, while operations are depicted in a particular order, this should not be understood as requiring that the operations be performed in the particular order shown or performed in sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, while the above discussion contains several specific implementation details, these should not be construed as limitations on the scope of the disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims.

Claims (14)

  1. 一种多媒体显示方法,其特征在于,包括:A multimedia display method, characterized in that, comprising:
    接收原始多媒体数据;receiving raw multimedia data;
    对所述原始多媒体数据进行特效编辑,得到待匹配多媒体数据;Editing the original multimedia data with special effects to obtain the multimedia data to be matched;
    基于所述待匹配多媒体数据和目标多媒体数据,生成合成多媒体数据,所述目标多媒体数据根据所述待匹配多媒体数据的第一多媒体特征匹配得到;Generate composite multimedia data based on the multimedia data to be matched and target multimedia data, and the target multimedia data is obtained by matching the first multimedia feature of the multimedia data to be matched;
    显示所述合成多媒体数据。The synthesized multimedia data is displayed.
  2. 根据权利要求1所述的方法,其特征在于,所述对所述原始多媒体数据进行特效编辑,得到待匹配多媒体数据,包括:The method according to claim 1, wherein said editing the original multimedia data with special effects to obtain the multimedia data to be matched comprises:
    响应于对目标特效模板的模板选择操作,基于所述目标特效模板对所述原始多媒体数据进行特效编辑,得到所述待匹配多媒体数据;In response to a template selection operation on a target special effect template, perform special effect editing on the original multimedia data based on the target special effect template to obtain the multimedia data to be matched;
    或者,基于所述原始多媒体数据对应的目标特效模板,对所述原始多媒体数据进行特效编辑,得到所述待匹配多媒体数据。Or, based on the target special effect template corresponding to the original multimedia data, perform special effect editing on the original multimedia data to obtain the multimedia data to be matched.
  3. 根据权利要求1所述的方法,其特征在于,在所述基于所述待匹配多媒体数据和目标多媒体数据,生成合成多媒体数据之前,所述方法还包括:The method according to claim 1, wherein, before generating composite multimedia data based on the multimedia data to be matched and the target multimedia data, the method further comprises:
    从所述待匹配多媒体数据中提取所述第一多媒体特征;extracting the first multimedia feature from the multimedia data to be matched;
    获取所述待匹配多媒体数据对应的多个候选多媒体数据;Acquiring a plurality of candidate multimedia data corresponding to the multimedia data to be matched;
    在所述多个候选多媒体数据中,查询与所述第一多媒体特征相匹配的所述目标多媒体数据。Among the plurality of candidate multimedia data, query the target multimedia data matching the first multimedia feature.
  4. The method according to claim 3, wherein querying, among the plurality of candidate multimedia data, for the target multimedia data that matches the first multimedia feature comprises:
    determining at least one feature tag corresponding to the first multimedia feature;
    for each of the candidate multimedia data, determining a common tag that is the same as the at least one feature tag;
    calculating, according to a weight value corresponding to the common tag, a tag matching score between the multimedia data to be matched and each of the candidate multimedia data; and
    sorting the tag matching scores of the candidate multimedia data to determine the target multimedia data.
  5. The method according to claim 1, wherein the target multimedia data satisfies at least one of the following:
    the special effect editing mode of the target multimedia data is the same as the special effect editing mode of the multimedia data to be matched;
    the user to which the target multimedia data belongs is an online user;
    the distance between the publishing location of the target multimedia data and the publishing location of the original multimedia data is less than or equal to a preset distance threshold;
    the number of historical matches of the target multimedia data satisfies a preset count filtering condition.
  6. The method according to claim 1, wherein the target multimedia data is further obtained by matching according to a second multimedia feature of the original multimedia data.
  7. The method according to claim 1, wherein, after displaying the composite multimedia data, the method further comprises:
    when a trigger operation on the composite multimedia data is detected, publishing the composite multimedia data to the user to which the original multimedia data belongs and to the user to which the target multimedia data belongs.
  8. The method according to claim 7, wherein publishing the composite multimedia data to the user to which the original multimedia data belongs and to the user to which the target multimedia data belongs comprises:
    sending first prompt information to the user to which the original multimedia data belongs, wherein the first prompt information is used to trigger display of the composite multimedia data and display of the social home page of the user to which the target multimedia data belongs; and
    sending second prompt information to the user to which the target multimedia data belongs, wherein the second prompt information is used to trigger playback of the composite multimedia data and display of the social home page of the user to which the original multimedia data belongs.
  9. A multimedia matching method, comprising:
    receiving multimedia data to be matched, wherein the multimedia data to be matched is obtained by performing special effect processing on original multimedia data;
    extracting the first multimedia feature from the multimedia data to be matched;
    acquiring a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and
    querying, among the plurality of candidate multimedia data, for the target multimedia data that matches the first multimedia feature, wherein the target multimedia data is used to generate merged multimedia data together with the multimedia data to be matched.
  10. The method according to claim 9, wherein querying, among the plurality of candidate multimedia data, for the target multimedia data that matches the first multimedia feature comprises:
    determining at least one feature tag corresponding to the first multimedia feature;
    for each of the candidate multimedia data, determining a common tag that is the same as the at least one feature tag;
    calculating, according to a weight value corresponding to the common tag, a tag matching score between the multimedia data to be matched and each of the candidate multimedia data; and
    sorting the tag matching scores of the candidate multimedia data to determine the target multimedia data.
  11. A multimedia display apparatus, comprising:
    a data receiving unit configured to receive original multimedia data;
    a special effect editing unit configured to perform special effect editing on the original multimedia data to obtain multimedia data to be matched;
    a data synthesis unit configured to generate composite multimedia data based on the multimedia data to be matched and target multimedia data, wherein the target multimedia data is obtained by matching according to a first multimedia feature of the multimedia data to be matched; and
    a data display unit configured to display the composite multimedia data.
  12. A multimedia matching apparatus, comprising:
    a data receiving unit configured to receive multimedia data to be matched, wherein the multimedia data to be matched is obtained by performing special effect processing on original multimedia data;
    a feature extraction unit configured to extract the first multimedia feature from the multimedia data to be matched;
    a data acquisition unit configured to acquire a plurality of candidate multimedia data corresponding to the multimedia data to be matched; and
    a data query unit configured to query, among the plurality of candidate multimedia data, for the target multimedia data that matches the first multimedia feature, wherein the target multimedia data is used to generate merged multimedia data together with the multimedia data to be matched.
  13. A computing device, comprising:
    a processor; and
    a memory for storing executable instructions;
    wherein the processor is configured to read the executable instructions from the memory and execute the executable instructions to implement the multimedia display method according to any one of claims 1-8 or the multimedia matching method according to any one of claims 9-10.
  14. A computer-readable storage medium, wherein the storage medium stores a computer program which, when executed by a processor, causes the processor to implement the multimedia display method according to any one of claims 1-8 or the multimedia matching method according to any one of claims 9-10.
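
The tag-based matching recited in claims 4 and 10, together with the screening conditions of claim 5, can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The `Candidate` record, its field names, the default weight of 1.0 for tags without a configured weight, the reading of the count filtering condition as an upper bound, and the choice of the top-ranked candidate as the target are all assumptions made for illustration; the claims only require that common tags be weighted, scored, and sorted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical record describing one candidate multimedia item."""
    item_id: str
    tags: frozenset        # feature tags attached to the candidate
    effect_template: str   # special effect editing mode that was used
    owner_online: bool     # whether the owning user is currently online
    distance_km: float     # distance from the original item's publishing location
    match_count: int       # number of historical matches


def is_eligible(c, query_template, max_distance_km, max_match_count):
    """Claim 5 style screening: at least one of the conditions must hold.

    Treating the count filtering condition as an upper bound is an assumption.
    """
    return any((
        c.effect_template == query_template,
        c.owner_online,
        c.distance_km <= max_distance_km,
        c.match_count <= max_match_count,
    ))


def tag_match_score(query_tags, candidate_tags, weights, default_weight=1.0):
    """Sum the weights of the tags shared by the query and the candidate."""
    common = set(query_tags) & set(candidate_tags)
    return sum(weights.get(tag, default_weight) for tag in common)


def rank_candidates(query_tags, candidates, weights):
    """Return (score, candidate) pairs sorted from highest to lowest score."""
    scored = [(tag_match_score(query_tags, c.tags, weights), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored


def select_target(query_tags, candidates, weights):
    """Pick the top-ranked candidate (an assumption), or None if there are none."""
    ranked = rank_candidates(query_tags, candidates, weights)
    return ranked[0][1] if ranked else None
```

In this sketch a caller would first keep only the candidates for which `is_eligible` returns true and then pass the remainder to `select_target`. Because the sort is stable, candidates with equal scores keep their input order.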
PCT/CN2022/115521 2021-09-27 2022-08-29 Multimedia display and matching methods and apparatuses, device and medium WO2023045710A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111136435.5 2021-09-27
CN202111136435.5A CN113870133B (en) 2021-09-27 2021-09-27 Multimedia display and matching method, device, equipment and medium

Publications (1)

Publication Number Publication Date
WO2023045710A1 (en) 2023-03-30

Family

ID=78991263

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/115521 WO2023045710A1 (en) 2021-09-27 2022-08-29 Multimedia display and matching methods and apparatuses, device and medium

Country Status (2)

Country Link
CN (1) CN113870133B (en)
WO (1) WO2023045710A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117370584A (en) * 2023-12-08 2024-01-09 中国信息通信研究院 Method and system for synthesizing multimedia data in depth

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113806306B (en) * 2021-08-04 2024-01-16 北京字跳网络技术有限公司 Media file processing method, device, equipment, readable storage medium and product
CN113870133B (en) * 2021-09-27 2024-03-12 抖音视界有限公司 Multimedia display and matching method, device, equipment and medium
CN115941841A (en) * 2022-12-06 2023-04-07 北京字跳网络技术有限公司 Associated information display method, device, equipment, storage medium and program product

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100195912A1 (en) * 2009-02-05 2010-08-05 Naohisa Nakada Imaging device, image composition and display device, and image composition method
CN105338242A (en) * 2015-10-29 2016-02-17 努比亚技术有限公司 Image synthesis method and device
CN106528588A (en) * 2016-09-14 2017-03-22 厦门幻世网络科技有限公司 Method and apparatus for matching resources for text information
CN110866086A (en) * 2018-12-29 2020-03-06 北京安妮全版权科技发展有限公司 Article matching system
WO2021115346A1 (en) * 2019-12-13 2021-06-17 北京字节跳动网络技术有限公司 Media file processing method, device, readable medium, and electronic apparatus
CN113870133A (en) * 2021-09-27 2021-12-31 北京字节跳动网络技术有限公司 Multimedia display and matching method, device, equipment and medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108647245B (en) * 2018-04-13 2023-04-18 腾讯科技(深圳)有限公司 Multimedia resource matching method and device, storage medium and electronic device
CN112351327A (en) * 2019-08-06 2021-02-09 北京字节跳动网络技术有限公司 Face image processing method and device, terminal and storage medium
CN112597320A (en) * 2020-12-09 2021-04-02 上海掌门科技有限公司 Social information generation method, device and computer readable medium
CN112528049B (en) * 2020-12-17 2023-08-08 北京达佳互联信息技术有限公司 Video synthesis method, device, electronic equipment and computer readable storage medium
CN113099129A (en) * 2021-01-27 2021-07-09 北京字跳网络技术有限公司 Video generation method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN113870133B (en) 2024-03-12
CN113870133A (en) 2021-12-31

Similar Documents

Publication Publication Date Title
WO2023045710A1 (en) Multimedia display and matching methods and apparatuses, device and medium
US11483268B2 (en) Content navigation with automated curation
US11044288B2 (en) Systems and methods for interactive broadcasting
US10762675B2 (en) Systems and methods for interactive broadcasting
US11670015B2 (en) Method and apparatus for generating video
CN110462616B (en) Method for generating a spliced data stream and server computer
CN108701207A (en) For face recognition and video analysis to identify the personal device and method in context video flowing
JP5903187B1 (en) Automatic video content generation system
US9799373B2 (en) Computerized system and method for automatically extracting GIFs from videos
US20140253743A1 (en) User-generated content in a virtual reality environment
WO2022028151A1 (en) Video production method and apparatus, and video sharing method and apparatus
US11657575B2 (en) Generating augmented reality content based on third-party content
WO2022105846A1 (en) Virtual object display method and apparatus, electronic device, and medium
WO2019114328A1 (en) Augmented reality-based video processing method and device thereof
WO2019227429A1 (en) Method, device, apparatus, terminal, server for generating multimedia content
CN110992256B (en) Image processing method, device, equipment and storage medium
CN114331820A (en) Image processing method, image processing device, electronic equipment and storage medium
CN114463470A (en) Virtual space browsing method and device, electronic equipment and readable storage medium
WO2023241377A1 (en) Video data processing method and device, equipment, system, and storage medium
WO2022262473A1 (en) Image processing method and apparatus, and device and storage medium
US20220217430A1 (en) Systems and methods for generating new content segments based on object name identification
WO2022041202A1 (en) Object-based video combining method, client end, and system
US20170188120A1 (en) Method and electronic device for producing video highlights
JP7266356B1 (en) Program, information processing device, information processing system, and information processing method
CN116229585A (en) Image living body detection method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE