WO2023087703A1 - Media file processing method and device - Google Patents

Media file processing method and device

Info

Publication number
WO2023087703A1
Authority
WO
WIPO (PCT)
Prior art keywords
media file
target
target media
files
user
Prior art date
Application number
PCT/CN2022/100236
Other languages
English (en)
French (fr)
Other versions
WO2023087703A9 (zh)
Inventor
叶子慧
舒莹
Original Assignee
北京达佳互联信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京达佳互联信息技术有限公司
Publication of WO2023087703A1
Publication of WO2023087703A9

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/16 File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/17 Details of further file system functions
    • G06F16/174 Redundancy elimination performed by the file system

Definitions

  • the present disclosure relates to the field of computer technology, and in particular to a media file processing method, device, equipment and storage medium.
  • a smart terminal can automatically generate an album or video of a certain type or theme from the pictures stored in its album, for example, it can automatically generate a character album or video from all the photos of people in the album.
  • the present disclosure provides a media file processing method, device, equipment and storage medium.
  • the disclosed technical scheme is as follows:
  • a method for processing a media file including:
  • a synthesis process is performed on the first target media file and the selected second target media file.
  • the determining at least one second target media file belonging to the same target category as the first target media file from the plurality of media files includes:
  • At least one second target media file belonging to the same target category as the first target media file is determined from the multiple media files.
  • the determining at least one second target media file belonging to the same target category as the first target media file from the plurality of media files includes:
  • At least one second target media file belonging to the same target subcategory as the first target media file is determined from the at least one parent category media file according to the respective subcategories of the at least one parent category media file.
  • the synthesizing the first target media file and the selected second target media file in response to the user's selection operation on the at least one second target media file includes:
  • before performing video synthesis processing on the first target media file and the selected second target media file according to the video template, the method further includes:
  • the method further includes:
  • a preview video file is generated and displayed.
  • the method further includes:
  • a media file processing device including:
  • a display unit configured to display multiple media files in the album interface
  • the first determining unit is configured to, in response to the user's selection operation on a first target media file among the plurality of media files, determine from the plurality of media files at least one second target media file belonging to the same target category as the first target media file;
  • the processing unit is configured to perform synthesis processing on the first target media file and the selected second target media file in response to the user's selection operation on the at least one second target media file.
  • the determining unit is further configured to perform:
  • At least one second target media file belonging to the same target category as the first target media file is determined from the multiple media files.
  • the determining unit is further configured to perform:
  • At least one second target media file belonging to the same target subcategory as the first target media file is determined from the at least one parent category media file according to the respective subcategories of the at least one parent category media file.
  • the processing unit is further configured to:
  • the device also includes:
  • the adding unit is configured to add the first target media file and the selected second target media file to the preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  • the device also includes:
  • the generating unit is configured to generate and display a preview video file in response to the user's preview operation on the synthesized video.
  • the device also includes:
  • the second determining unit is configured to determine the similarity between the at least one second target media file according to the corresponding content characteristics of the at least one second target media file;
  • the filtering unit is configured to perform repetitive filtering on the at least one second target media file according to the similarity.
  • an electronic device including:
  • the processor is configured to execute the instructions to implement the above media file processing method.
  • a computer-readable storage medium is provided, and when the instructions in the computer-readable storage medium are executed by a processor of the electronic device, the electronic device can execute the above media file processing method.
  • a computer program product including a computer program, and when the computer program is executed by a processor, the above media file processing method is implemented.
  • multiple media files are displayed in the album interface; in response to the user's selection operation on a first target media file among the plurality of media files, at least one second target media file belonging to the same target category as the first target media file is determined from the plurality of media files. Media files of the same category are thus matched automatically according to the user's selection and quickly offered to the user as material, so that media files of the same type can be generated according to the different needs of users. This improves user participation and makes it convenient for the user to subsequently batch process media files of the same type.
  • in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, the first target media file and the selected second target media files can be stored, deleted, or synthesized into a video in batches. The entire media file processing flow is performed automatically, which saves time and effort and provides a better user experience.
  • Fig. 1 is a flowchart of a media file processing method according to an exemplary embodiment
  • Fig. 2 is a flowchart of a method for processing media files according to an exemplary embodiment
  • Figs. 3A-3E are diagrams illustrating an example of an album interface according to an exemplary embodiment
  • Fig. 4 is a block diagram of a media file processing device shown according to an exemplary embodiment
  • Fig. 5 is a block diagram of an electronic device according to an exemplary embodiment.
  • a smart terminal can automatically generate an album or video of a certain type or theme from the pictures stored in its album, for example, it can automatically generate a character album or video from all the photos of people in the album.
  • this approach is not flexible enough: users cannot generate albums or videos of a given type or theme according to their own needs, so the albums or videos automatically generated by the smart terminal may include pictures that users do not need, resulting in a poor user experience.
  • an embodiment of the present disclosure provides a method for processing media files.
  • the method can display a plurality of media files in an album interface and, in response to the user's selection operation on a first target media file among the plurality of media files, determine from the plurality of media files at least one second target media file belonging to the same target category as the first target media file. That is, media files of the same category can be matched automatically according to the user's selection and quickly offered to the user as material, so that media files of the same type can be generated according to the different needs of users. This improves user participation and makes it convenient for the user to subsequently batch process media files of the same type.
  • in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, the first target media file and the selected second target media files can be stored, deleted, or synthesized into a video in batches. The entire media file processing flow is performed automatically, which saves time and effort and provides a better user experience.
  • Fig. 1 is a flow chart showing a method for processing a media file according to an exemplary embodiment. As shown in Fig. 1 , the method for processing a media file is used in a terminal, and specifically includes the following steps.
  • in step S11, a plurality of media files are displayed in the album interface.
  • the album interface refers to the interface displayed after the album is tapped on a smart terminal such as a mobile phone or tablet computer, and the multiple media files are the photo files, video files, and so on displayed in the album interface; see Figure 3A for details.
  • in step S12, in response to the user's selection operation on the first target media file among the multiple media files, at least one second target media file belonging to the same target category as the first target media file is determined from the multiple media files.
  • the media file is the source media file.
  • at least one second target media file belonging to the same target category as the first target media file can be determined from multiple media files. For example, assuming that the first target media file is a photo of a cat, the user can check that picture (as shown in the accompanying figure), and the photos of other cats are then checked automatically; see Figure 3C for details. At this point, for the convenience of the user, a "batch add" button appears in the album. Clicking the "batch add" button adds all the selected cat photos to the material display area at the bottom for editing.
  • determining at least one second target media file belonging to the same target category as the first target media file from a plurality of media files includes:
  • Content features of the multiple media files are identified to obtain respective categories corresponding to the multiple media files.
  • At least one second target media file belonging to the same target category as the first target media file is determined from the multiple media files.
  • the content features of the multiple media files may be, for example, baby, beach, building, car, cartoon, cat, dog, flower, food, mountain, lake, sea, night scene, sky, sculpture, sunset, text, tree, and so on, and can be customized as needed. From the content features, the category corresponding to each of the multiple media files is obtained, and at least one second target media file belonging to the same target category as the first target media file is then determined from the multiple media files.
  • the multiple media files can also be systematically marked, so that the second target media file can be subsequently screened.
  • the rapid identification of the target media file category can be realized, and the efficiency is high.
  • the present disclosure can also set multiple time periods, and identify pictures in different time periods, so as to smoothly and quickly complete the identification of the target media file category.
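The category matching described above can be sketched as follows. This is a minimal illustration only, assuming that content recognition has already assigned one category label to each media file; the `MediaFile` class, the `find_same_category` function, and the file names are hypothetical and do not come from the disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaFile:
    """Hypothetical album entry; `category` stands in for the recognized label."""
    name: str
    category: str


def find_same_category(selected: MediaFile, album: list[MediaFile]) -> list[MediaFile]:
    """Return every other file in the album whose category matches the selected file."""
    return [
        f for f in album
        if f is not selected and f.category == selected.category
    ]


# Assumed sample album: three cat items and one beach photo.
album = [
    MediaFile("IMG_001.jpg", "cat"),
    MediaFile("IMG_002.jpg", "beach"),
    MediaFile("IMG_003.jpg", "cat"),
    MediaFile("IMG_004.mp4", "cat"),
]
first_target = album[0]  # the file the user selected
second_targets = find_same_category(first_target, album)
print([f.name for f in second_targets])
```

Here the user's selection of one cat photo yields the remaining cat items as candidate second target media files, which the interface could then check automatically.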
  • determining at least one second target media file belonging to the same target category as the first target media file from a plurality of media files includes:
  • At least one second target media file belonging to the same target subcategory as the first target media file is determined from the at least one parent category media file according to the respective subcategories corresponding to the at least one parent category media file.
  • the smart terminal can classify the multiple media files in the album in advance. According to the preset categories corresponding to the multiple media files in the album, at least one parent category media file belonging to the same target parent category as the first target media file can be determined. For example, if the first target media file is a picture of a cat, the at least one parent category media file of the same target parent category may be pictures of animals or pets.
  • after the animal or pet pictures are determined, the content features in those pictures can be identified. The content features may be fur, eyes, ears, and so on. From these content features, the subcategory corresponding to each of the at least one parent category media file is obtained, and according to these subcategories, at least one second target media file belonging to the same target subcategory as the first target media file can be determined from the at least one parent category media file (the animal or pet pictures).
  • in this way, the second target media file is selected more accurately, ensuring that users can select the media files they need and further improving the user experience.
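The two-stage matching (preset parent category first, then subcategory from content features) might look like the sketch below. It is an illustration under stated assumptions: the preset parent categories are given as a dictionary, and the content-feature recognition is replaced by a simple lookup table. The function name `match_by_subcategory` and all sample data are hypothetical.

```python
from typing import Callable


def match_by_subcategory(
    selected: str,
    parent_category: dict[str, str],
    identify_subcategory: Callable[[str], str],
) -> list[str]:
    """Two-stage match: coarse preset parent category first, then fine subcategory."""
    target_parent = parent_category[selected]
    # Stage 1: keep only files whose preset parent category matches.
    parent_matches = [
        name for name, parent in parent_category.items()
        if parent == target_parent and name != selected
    ]
    # Stage 2: run the subcategory recognition on that smaller subset only.
    target_sub = identify_subcategory(selected)
    return [name for name in parent_matches if identify_subcategory(name) == target_sub]


# Assumed preset parent categories, plus a lookup table standing in for
# recognition of content features such as fur, eyes, and ears.
parents = {"a.jpg": "pet", "b.jpg": "pet", "c.jpg": "scenery", "d.jpg": "pet"}
fine_labels = {"a.jpg": "cat", "b.jpg": "dog", "c.jpg": "beach", "d.jpg": "cat"}

matches = match_by_subcategory("a.jpg", parents, fine_labels.__getitem__)
print(matches)
```

Restricting the second stage to the parent category matches means the finer recognition step only runs on a subset of the album, which is one plausible reading of why the disclosure splits the matching into two levels.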
  • media files other than the first target media file and the second target media file can be hidden in the album interface; see Figure 3D for details, in which all media files except the cat pictures have been hidden.
  • the present disclosure also requires the following operations:
  • Repeatability filtering is performed on the at least one second target media file according to the similarity.
  • a similarity threshold can be set. After the similarity between the at least one second target media file is determined, the similarity is compared with the similarity threshold; if the similarity is greater than the similarity threshold, the media file is determined to be a repetitive media file and is filtered out.
  • the similarity threshold can be set to 80%-100% (such as 80%, 90%, 100%, etc.).
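The threshold-based repetitive filtering above can be sketched as follows. The disclosure does not specify how similarity is computed, so this example assumes each media file is represented by a numeric feature vector and uses cosine similarity as a stand-in measure; `cosine_similarity`, `filter_duplicates`, and the sample vectors are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two feature vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def filter_duplicates(features: dict[str, list[float]], threshold: float = 0.9) -> list[str]:
    """Keep a file only if its similarity to every already-kept file is not above the threshold."""
    kept: list[str] = []
    for name, vec in features.items():
        if all(cosine_similarity(vec, features[k]) <= threshold for k in kept):
            kept.append(name)
    return kept


# Assumed feature vectors: the second file is a near copy of the first.
features = {
    "cat_1.jpg": [1.0, 0.0, 0.0],
    "cat_1_copy.jpg": [0.99, 0.05, 0.0],
    "cat_2.jpg": [0.0, 1.0, 0.0],
}
kept = filter_duplicates(features, threshold=0.9)
print(kept)
```

A threshold of 0.9 corresponds to the 90% example value in the text; raising it toward 1.0 filters only files that are almost identical.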
  • in step S13, in response to the user's selection operation on at least one second target media file, the first target media file and the selected second target media file are synthesized.
  • after media files other than the first target media file and the second target media file are hidden in the album interface, only the first target media file and the second target media file remain in the album interface.
  • the first target media file and the selected second target media file can then be processed immediately.
  • for example, any media file among the first target media file and the selected second target media file can be deleted individually, and the files can be deleted or replaced in batches; video synthesis and similar operations can also be performed on the first target media file and the selected second target media file.
  • the first target media file and the selected second target media file are synthesized, including the following steps:
  • in step S21, in response to the user's selection operation on at least one second target media file, a video template corresponding to the target category is displayed in the album interface.
  • in step S22, video synthesis processing is performed on the first target media file and the selected second target media file according to the video template.
  • that is, a video template corresponding to the target category is displayed in the album interface, and video synthesis processing can be performed on the first target media file and the selected second target media file according to the video template.
  • the video template includes video special effects formed by automatically synthesizing videos from various photos.
  • the video template corresponding to the target category may be a video template corresponding to the category itself, or a video template corresponding to its parent category. For example, if the target category corresponding to the first target media file and the selected second target media file is cat, then a cat-category template, or a pet-category template (pet being the parent category of cat), can be recommended.
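One way to picture the template recommendation is the sketch below, which lists templates registered for the target category first and then those of its parent category. The mapping structures, the function name `recommend_templates`, and the template names are assumptions made for illustration; the disclosure does not describe how templates are stored or ordered.

```python
def recommend_templates(
    category: str,
    parent_of: dict[str, str],
    templates: dict[str, list[str]],
) -> list[str]:
    """Return templates for the category itself, followed by those of its parent category."""
    recommended = list(templates.get(category, []))
    parent = parent_of.get(category)
    if parent is not None:
        recommended.extend(templates.get(parent, []))
    return recommended


# Assumed category hierarchy and template registry.
parent_of = {"cat": "pet", "dog": "pet"}
templates = {"cat": ["cat_diary"], "pet": ["pet_montage", "pet_memories"]}

cat_templates = recommend_templates("cat", parent_of, templates)
dog_templates = recommend_templates("dog", parent_of, templates)
print(cat_templates)
print(dog_templates)
```

In this sketch a category with no template of its own, such as "dog", still receives the parent category templates.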
  • in order to make it easier for the user to pre-edit the first target media file and the selected second target media file, and to make the operation more focused, before video synthesis processing is performed on the first target media file and the selected second target media file according to the video template, the method also includes:
  • the first target media file and the selected second target media file are added to the preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  • the material display area can be set at the bottom of the album interface.
  • the occupied area of the material display area can be set as 1/10-1/5 of the total interface area, such as 1/10, 1/8, 1/5, etc.
  • the editing operations performed on the media files displayed in the material display area can be confirming, deleting, etc.
  • For details of the material display area refer to the lowermost area of FIG. 3B-FIG. 3D.
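As a small worked example of the 1/10 to 1/5 sizing, the sketch below computes the height of the material display area. It assumes the area is a full-width strip at the bottom of the interface, so that its share of the interface area equals its share of the interface height; the function name `material_area_height` and the 2400-pixel height are hypothetical.

```python
from fractions import Fraction


def material_area_height(interface_height_px: int, fraction: Fraction = Fraction(1, 8)) -> int:
    """Height in pixels of a full-width bottom strip occupying `fraction` of the interface."""
    if not Fraction(1, 10) <= fraction <= Fraction(1, 5):
        raise ValueError("fraction must lie between 1/10 and 1/5")
    return int(interface_height_px * fraction)


# Assumed interface height of 2400 pixels.
heights = [material_area_height(2400, Fraction(1, d)) for d in (10, 8, 5)]
print(heights)
```

For a 2400-pixel-high interface, the three example fractions in the text give strip heights of 240, 300, and 480 pixels.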
  • the media file processing method provided by the embodiment of the present disclosure also includes:
  • in response to the user's preview operation on the synthesized video, a preview video file is generated and displayed.
  • the present disclosure displays a plurality of media files in the album interface and, in response to the user's selection operation on a first target media file among the plurality of media files, determines from the plurality of media files at least one second target media file belonging to the same target category as the first target media file. Media files of the same category can thus be matched automatically according to the user's selection and quickly offered to the user as material, so that media files of the same type can be generated according to the different needs of users. This improves user participation and makes it convenient for the user to subsequently batch process media files of the same type.
  • in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, the first target media file and the selected second target media files can be stored, deleted, or synthesized into a video in batches. The entire media file processing flow is performed automatically, which saves time and effort and provides a better user experience.
  • Fig. 4 is a block diagram of a media file processing device according to an exemplary embodiment.
  • the device includes a display unit 401 , a first determination unit 402 and a processing unit 403 .
  • the display unit 401 is configured to display a plurality of media files in an album interface.
  • the first determination unit 402 is configured to, in response to the user's selection operation on the first target media file among the plurality of media files, determine from the multiple media files at least one second target media file belonging to the same target category as the first target media file.
  • the processing unit 403 is configured to perform synthesis processing on the first target media file and the selected second target media file in response to the user's selection operation on at least one second target media file.
  • the first determination unit 402 is further configured to perform:
  • At least one second target media file belonging to the same target category as the first target media file is determined from the multiple media files.
  • the first determination unit 402 is further configured to perform:
  • At least one second target media file belonging to the same target subcategory as the first target media file is determined from the at least one parent category media file according to the respective subcategories corresponding to the at least one parent category media file.
  • processing unit 403 is further configured to execute:
  • the device also includes:
  • the adding unit is configured to add the first target media file and the selected second target media file to the preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  • the device also includes:
  • the generating unit is configured to generate and display a preview video file in response to the user's preview operation on the synthesized video.
  • the device also includes:
  • the second determining unit is configured to determine the similarity between at least one second target media file according to the corresponding content characteristics of at least one second target media file;
  • the filtering unit is configured to perform repetitive filtering on at least one second target media file according to the similarity.
  • Fig. 5 is a block diagram of an electronic device according to an exemplary embodiment.
  • the electronic device 500 may be an electronic device used by a user.
  • the electronic device 500 may be a smart phone, a smart watch, a desktop computer, or a laptop computer, and may also be referred to by other names such as laptop electronic device or desktop electronic device.
  • the electronic device 500 includes: a processor 501 and a memory 502 .
  • the processor 501 may include one or more processing cores, such as a 4-core processor, an 8-core processor, and the like.
  • the processor 501 may be implemented in at least one hardware form among DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array), and PLA (Programmable Logic Array).
  • the processor 501 may also include a main processor and a coprocessor. The main processor, also known as a CPU (Central Processing Unit), processes data in the wake-up state; the coprocessor is a low-power processor that processes data in the standby state.
  • the processor 501 may be integrated with a GPU (Graphics Processing Unit), which is used for rendering and drawing the content to be displayed on the display screen.
  • the processor 501 may also include an AI (Artificial Intelligence) processor, which is used to process computing operations related to machine learning.
  • Memory 502 may include one or more storage media, which may be non-transitory.
  • the memory 502 may also include high-speed random access memory and non-volatile memory, such as one or more magnetic disk storage devices and flash memory storage devices.
  • the electronic device 500 may optionally further include: a peripheral device interface 503 and at least one peripheral device.
  • the processor 501, the memory 502, and the peripheral device interface 503 may be connected through buses or signal lines.
  • Each peripheral device can be connected to the peripheral device interface 503 through a bus, a signal line or a circuit board.
  • the peripheral device includes: at least one of a radio frequency circuit 504 , a display screen 505 , a camera component 506 , an audio circuit 507 , a positioning component 508 and a power supply 509 .
  • the peripheral device interface 503 may be used to connect at least one peripheral device related to I/O (Input/Output) to the processor 501 and the memory 502 .
  • in some embodiments, the processor 501, the memory 502, and the peripheral device interface 503 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 501, the memory 502, and the peripheral device interface 503 can be implemented on a separate chip or circuit board, which is not limited in this embodiment.
  • the radio frequency circuit 504 is used to receive and transmit RF (Radio Frequency) signals, also called electromagnetic signals.
  • the radio frequency circuit 504 communicates with the communication network and other communication devices through electromagnetic signals.
  • the radio frequency circuit 504 converts electrical signals into electromagnetic signals for transmission, or converts received electromagnetic signals into electrical signals.
  • the radio frequency circuit 504 includes: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and the like.
  • the radio frequency circuit 504 can communicate with other terminals through at least one wireless communication protocol.
  • the wireless communication protocol includes but is not limited to: metropolitan area network, mobile communication networks of various generations (2G, 3G, 4G and 5G), wireless local area network and/or WiFi (Wireless Fidelity) network.
  • the radio frequency circuit 504 may also include circuits related to NFC (Near Field Communication), which is not limited in the present disclosure.
  • the display screen 505 is used to display a UI (User Interface).
  • the UI may include images, text, icons, video, and any combination thereof.
  • the display screen 505 also has the ability to collect touch signals on or above the surface of the display screen 505 .
  • the touch signal can be input to the processor 501 as a control signal for processing.
  • the display screen 505 can also be used to provide virtual buttons and/or virtual keyboards, also called soft buttons and/or soft keyboards.
  • in some embodiments, there may be one display screen 505, provided on the front panel of the electronic device 500; in other embodiments, there may be at least two display screens 505, respectively provided on different surfaces of the electronic device 500 or arranged in a folding design.
  • the display screen 505 may be a flexible display screen arranged on a curved surface or a folding surface of the electronic device 500 . The display screen 505 can even be set to a non-rectangular irregular shape, that is, a special-shaped screen.
  • the display screen 505 can be made of LCD (Liquid Crystal Display), OLED (Organic Light-Emitting Diode), and other materials.
  • the camera assembly 506 is used to capture images or videos.
  • the camera assembly 506 includes a front camera and a rear camera.
  • the front camera is set on the front panel of the terminal, and the rear camera is set on the back of the terminal.
  • in some embodiments, there are at least two rear cameras, each being any one of a main camera, a depth-of-field camera, a wide-angle camera, and a telephoto camera, so that the main camera and the depth-of-field camera can be fused to realize the background blur function.
  • camera assembly 506 may also include a flash.
  • the flash can be a single-color temperature flash or a dual-color temperature flash. Dual color temperature flash refers to the combination of warm light flash and cold light flash, which can be used for light compensation under different color temperatures.
  • Audio circuitry 507 may include a microphone and speakers.
  • the microphone is used to collect sound waves of the user and the environment, and convert the sound waves into electrical signals and input them to the processor 501 for processing, or input them to the radio frequency circuit 504 to realize voice communication.
  • the microphone can also be an array microphone or an omnidirectional collection microphone.
  • the speaker is used to convert the electrical signal from the processor 501 or the radio frequency circuit 504 into sound waves.
  • the speaker can be a conventional membrane speaker or a piezoelectric ceramic speaker.
  • the audio circuit 507 may also include a headphone jack.
  • the positioning component 508 is used to locate the current geographic location of the electronic device 500, so as to realize navigation or LBS (Location Based Service).
  • the positioning component 508 may be a positioning component based on the GPS (Global Positioning System) of the United States, the BeiDou system of China, the GLONASS system of Russia, or the Galileo system of the European Union.
  • the power supply 509 is used to supply power to various components in the electronic device 500 .
  • the power supply 509 may be alternating current, direct current, a disposable battery, or a rechargeable battery.
  • the rechargeable battery can support wired charging or wireless charging.
  • the rechargeable battery can also be used to support fast charging technology.
  • the electronic device 500 further includes one or more sensors 510 .
  • the one or more sensors 510 include, but are not limited to: an acceleration sensor 511 , a gyro sensor 512 , a pressure sensor 513 , a fingerprint sensor 514 , an optical sensor 515 and a proximity sensor 516 .
  • the acceleration sensor 511 can detect the acceleration on the three coordinate axes of the coordinate system established by the electronic device 500 .
  • the acceleration sensor 511 can be used to detect the components of the acceleration of gravity on the three coordinate axes.
  • the processor 501 may control the display screen 505 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 511 .
  • the acceleration sensor 511 can also be used for collecting motion data of a game or of the user.
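The landscape or portrait decision from the gravity components can be sketched as below. This is a simplified illustration that assumes the x axis runs along the short edge of the device and the y axis along the long edge; the function name `choose_orientation` and the sample readings (in m/s²) are hypothetical.

```python
def choose_orientation(gx: float, gy: float) -> str:
    """Pick landscape when gravity lies mostly along the device's x axis, else portrait."""
    return "landscape" if abs(gx) > abs(gy) else "portrait"


# Assumed readings: device held upright, then rotated onto its side.
upright = choose_orientation(0.3, 9.7)
sideways = choose_orientation(9.6, 0.5)
print(upright, sideways)
```

A practical implementation would usually add hysteresis or a dead zone near the diagonal so the view does not flip back and forth, which this sketch omits.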
  • the gyro sensor 512 can detect the body direction and rotation angle of the electronic device 500 , and the gyro sensor 512 can cooperate with the acceleration sensor 511 to collect 3D actions of the user on the electronic device 500 .
  • the processor 501 can realize the following functions: motion sensing (such as changing the UI according to the user's tilt operation), image stabilization during shooting, game control and inertial navigation.
  • the pressure sensor 513 may be disposed on a side frame of the electronic device 500 and/or a lower layer of the display screen 505 .
  • the pressure sensor 513 can detect the user's grip signal on the electronic device 500 , and the processor 501 performs left and right hand recognition or shortcut operation according to the grip signal collected by the pressure sensor 513 .
  • the processor 501 controls the operable controls on the UI interface according to the user's pressure operation on the display screen 505.
  • the operable controls include at least one of button controls, scroll bar controls, icon controls, and menu controls.
  • the fingerprint sensor 514 is used to collect the user's fingerprint, and either the processor 501 or the fingerprint sensor 514 itself recognizes the identity of the user from the collected fingerprint. When the identity of the user is recognized as a trusted identity, the processor 501 authorizes the user to perform related sensitive operations, including unlocking the screen, viewing encrypted information, downloading software, making payments, and changing settings.
  • the fingerprint sensor 514 may be disposed on the front, back or side of the electronic device 500 . When the electronic device 500 is provided with a physical button or a manufacturer's logo, the fingerprint sensor 514 may be integrated with the physical button or the manufacturer's logo.
  • the optical sensor 515 is used to collect ambient light intensity.
  • the processor 501 may control the display brightness of the display screen 505 according to the ambient light intensity collected by the optical sensor 515 . Specifically, when the ambient light intensity is high, the display brightness of the display screen 505 is increased; when the ambient light intensity is low, the display brightness of the display screen 505 is decreased.
  • the processor 501 may also dynamically adjust shooting parameters of the camera assembly 506 according to the ambient light intensity collected by the optical sensor 515 .
  • the proximity sensor 516 , also called a distance sensor, is usually arranged on the front panel of the electronic device 500 .
  • the proximity sensor 516 is used to collect the distance between the user and the front of the electronic device 500 .
  • when the proximity sensor 516 detects that the distance between the user and the front of the electronic device 500 gradually decreases, the processor 501 controls the display screen 505 to switch from the screen-on state to the screen-off state; when the proximity sensor 516 detects that the distance between the user and the front of the electronic device 500 gradually increases, the processor 501 controls the display screen 505 to switch from the screen-off state to the screen-on state.
  • the structure shown in FIG. 5 does not constitute a limitation on the electronic device 500, which may include more or fewer components than shown in the figure, combine some components, or adopt a different component arrangement.
  • the present disclosure also provides a computer-readable storage medium including instructions, such as a memory including instructions, which can be executed by the processor 501 of the electronic device 500 to complete the above media file processing method.
  • the storage medium may be a non-transitory storage medium, for example a ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, or optical data storage device.
  • the present disclosure also provides a computer program product, including a computer program that can be executed by a processor of an electronic device, so as to implement the above-mentioned media file processing method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Processing Or Creating Images (AREA)

Abstract

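The present disclosure relates to a media file processing method, apparatus, device and storage medium in the field of computer technology. The method includes: displaying a plurality of media files in an album interface; in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.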

Description

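Media File Processing Method and Apparatus
Cross-Reference to Related Applications
The present disclosure is based on and claims priority to Chinese Patent Application No. 202111399886.8, filed on November 19, 2021, the entire disclosure of which is incorporated herein by reference as part of the present disclosure.
Technical Field
The present disclosure relates to the field of computer technology, and in particular to a media file processing method, apparatus, device and storage medium.
Background
The album of a user's smart terminal often stores many pictures. In the related art, a smart terminal can automatically generate a photo collection or video of a certain type or theme from the pictures stored in its album; for example, it can automatically generate a portrait collection or video from all the photos of people in the album.
Summary
The present disclosure provides a media file processing method, apparatus, device and storage medium. The technical solutions of the present disclosure are as follows: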
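According to a first aspect of embodiments of the present disclosure, a media file processing method is provided, including:
displaying a plurality of media files in an album interface;
in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.
In some embodiments, determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file includes:
identifying content features of the plurality of media files to obtain a category corresponding to each of the plurality of media files; and
determining, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
In some embodiments, determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file includes:
determining, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
identifying content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
determining, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
In some embodiments, performing synthesis processing on the first target media file and the selected second target media file in response to the user's selection operation on the at least one second target media file includes:
in response to the user's selection operation on the at least one second target media file, displaying, in the album interface, a video template corresponding to the target category; and
performing video synthesis processing on the first target media file and the selected second target media file according to the video template.
In some embodiments, before performing video synthesis processing on the first target media file and the selected second target media file according to the video template, the method further includes:
adding the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
In some embodiments, after performing video synthesis processing on the first target media file and the selected second target media file according to the video template, the method further includes:
in response to the user's preview operation on the synthesized video, generating and displaying a preview video file.
In some embodiments, after determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file, the method further includes:
determining similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
performing duplicate filtering on the at least one second target media file according to the similarities.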
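According to a second aspect of embodiments of the present disclosure, a media file processing apparatus is provided, including:
a display unit configured to display a plurality of media files in an album interface;
a first determining unit configured to, in response to a user's selection operation on a first target media file among the plurality of media files, determine, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
a processing unit configured to, in response to the user's selection operation on the at least one second target media file, perform synthesis processing on the first target media file and the selected second target media file.
In some embodiments, the first determining unit is further configured to:
identify content features of the plurality of media files to obtain a category corresponding to each of the plurality of media files; and
determine, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
In some embodiments, the first determining unit is further configured to:
determine, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
identify content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
determine, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
In some embodiments, the processing unit is further configured to:
in response to the user's selection operation on the at least one second target media file, display, in the album interface, a video template corresponding to the target category; and
perform video synthesis processing on the first target media file and the selected second target media file according to the video template.
In some embodiments, the apparatus further includes:
an adding unit configured to add the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
In some embodiments, the apparatus further includes:
a generating unit configured to, in response to the user's preview operation on the synthesized video, generate and display a preview video file.
In some embodiments, the apparatus further includes:
a second determining unit configured to determine similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
a filtering unit configured to perform duplicate filtering on the at least one second target media file according to the similarities.
According to a third aspect of embodiments of the present disclosure, an electronic device is provided, including:
a processor; and
a memory for storing instructions executable by the processor;
wherein the processor is configured to execute the instructions to implement the above media file processing method.
According to a fourth aspect of embodiments of the present disclosure, a computer-readable storage medium is provided; when instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is enabled to perform the above media file processing method.
According to a fifth aspect of embodiments of the present disclosure, a computer program product is provided, including a computer program that, when executed by a processor, implements the above media file processing method.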
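In the present disclosure, a plurality of media files are displayed in an album interface and, in response to a user's selection operation on a first target media file among them, at least one second target media file belonging to the same target category as the first target media file is determined from the plurality of media files. In other words, media files of the same category are matched automatically according to the user's selection, which quickly provides the user with material of the same category, allows media files of the same type to be generated according to the user's differing needs, increases user participation, and makes subsequent batch processing of same-type media files more convenient. In addition, in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, batch operations such as storage, deletion and video synthesis can be performed on the first target media file and the selected second target media file. The whole media file processing procedure runs automatically, which saves time and effort and gives a better user experience.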
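Brief Description of the Drawings
FIG. 1 is a flowchart of a media file processing method according to an exemplary embodiment;
FIG. 2 is a flowchart of a media file processing method according to an exemplary embodiment;
FIGS. 3A-3E are example diagrams of an album interface according to an exemplary embodiment;
FIG. 4 is a block diagram of a media file processing apparatus according to an exemplary embodiment;
FIG. 5 is a block diagram of an electronic device according to an exemplary embodiment.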
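Detailed Description
To help those of ordinary skill in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings.
It should be noted that the terms "first", "second" and the like in the specification, claims and above drawings of the present disclosure are used to distinguish similar objects and are not necessarily used to describe a particular order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments of the present disclosure described herein can be implemented in orders other than those illustrated or described herein. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consistent with some aspects of the present disclosure as detailed in the appended claims.
The albums of users' smart terminals, such as mobile phones and tablet computers, often store many pictures. In the related art, a smart terminal can automatically generate a photo collection or video of a certain type or theme from the pictures stored in its album; for example, it can automatically generate a portrait collection or video from all the photos of people in the album. However, this approach is not flexible enough: the user cannot generate a collection or video of a type or theme that matches their own needs, so the automatically generated collection or video may include pictures the user does not want, resulting in a poor user experience.
In view of this, embodiments of the present disclosure provide a media file processing method. The method displays a plurality of media files in an album interface and, in response to a user's selection operation on a first target media file among them, determines, from the plurality of media files, at least one second target media file belonging to the same target category as the first target media file. In other words, media files of the same category are matched automatically according to the user's selection, which quickly provides the user with material of the same category, allows media files of the same type to be generated according to the user's differing needs, increases user participation, and makes subsequent batch processing of same-type media files more convenient. In addition, in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, batch operations such as storage, deletion and video synthesis can be performed on the first target media file and the selected second target media file. The whole media file processing procedure runs automatically, which saves time and effort and gives a better user experience.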
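FIG. 1 is a flowchart of a media file processing method according to an exemplary embodiment. As shown in FIG. 1, the media file processing method is used in a terminal and includes the following steps.
In step S11, a plurality of media files are displayed in an album interface.
In embodiments of the present disclosure, the album interface is the interface displayed after the album is opened on a smart terminal such as a mobile phone or tablet computer, and the plurality of media files are the photo files, video files and the like displayed in the album interface; see FIG. 3A for details.
In step S12, in response to a user's selection operation on a first target media file among the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file is determined from the plurality of media files.
It should be understood that, after opening the album and seeing the media files displayed in the album interface, a user who wants to make a short video from photos of a certain type must first select a source media file. The first target media file in the present disclosure is this source media file. Once the first target media file is determined, at least one second target media file belonging to the same target category can be determined from the plurality of media files. For example, if the first target media file is a photo of a cat, the user can check that picture (see FIG. 3B). The smart terminal then automatically finds the other cat photos in the album and checks them automatically (see FIG. 3C). At this point, to make operation easier for the user, a "Batch Add" button also appears in the album; tapping this button adds all the selected cat photos to the material display area at the bottom for editing.
The at least one second target media file that belongs to the same target category as the first target media file can be determined from the plurality of media files in, for example, the following ways: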
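In some embodiments, determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file includes:
identifying content features of the plurality of media files to obtain the category corresponding to each of the plurality of media files; and
determining, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
The content features of the plurality of media files may be, for example, baby, beach, building, car, cartoon, cat, dog, flower, food, mountain, lake, sea, night scene, sky, sculpture, sunset, text or tree, and can be customized as needed. The category corresponding to each media file is obtained from these content features, and the at least one second target media file that belongs to the same target category as the first target media file is then determined from the plurality of media files.
After the category corresponding to each of the plurality of media files is obtained, the media files may also be tagged by the system to facilitate the subsequent screening of second target media files.
This implementation identifies the categories of target media files quickly and efficiently. However, the album of the user's smart terminal may store so many pictures that identifying their types becomes difficult and, in severe cases, causes the smart terminal to stutter. For this reason, the present disclosure may also set multiple time periods and identify the pictures period by period, so that the identification of target media file categories completes smoothly and quickly.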
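The category-matching step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: `MediaFile`, `recognize_category`, `build_category_index` and `select_second_targets` are hypothetical names, the recognizer is stubbed to read a precomputed label, and processing the album in fixed-size batches is only one assumed reading of identifying pictures period by period.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaFile:
    name: str
    label: str  # stand-in for the output of a content-feature recognizer


def recognize_category(media: MediaFile) -> str:
    """Hypothetical recognizer; a real one would analyze the image content."""
    return media.label


def build_category_index(files, batch_size=2):
    """Classify the album in small batches and tag each file with its category."""
    index = defaultdict(list)
    for start in range(0, len(files), batch_size):
        for media in files[start:start + batch_size]:
            index[recognize_category(media)].append(media)
    return index


def select_second_targets(files, first_target, batch_size=2):
    """Return every other file whose category matches the first target's."""
    index = build_category_index(files, batch_size)
    same_category = index.get(recognize_category(first_target), [])
    return [media for media in same_category if media != first_target]


album = [
    MediaFile("cat_1.jpg", "cat"),
    MediaFile("dog_1.jpg", "dog"),
    MediaFile("cat_2.jpg", "cat"),
    MediaFile("sea_1.jpg", "sea"),
]
second_targets = select_second_targets(album, album[0])
```

Building the index once and reusing it would correspond to the system tagging mentioned above; here it is rebuilt on each call only to keep the sketch short.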
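In some embodiments, determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file includes:
determining, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
identifying content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
determining, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
In a specific implementation, the smart terminal may classify the media files in the album in advance. Based on the preset category corresponding to each media file in the album, at least one parent-category media file belonging to the same target parent category as the first target media file can be determined. For example, if the first target media file is a picture of a cat, the parent-category media files may be pictures of animals or pets. Once the animal or pet pictures are determined, the content features in them, such as fur, eyes and ears, can be identified. From these content features, the subcategory corresponding to each parent-category media file (here, cat) is obtained, and according to this subcategory the at least one second target media file belonging to the same target subcategory as the first target media file is determined from the at least one parent-category media file (the animal or pet pictures).
This implementation improves the precision with which second target media files are selected, ensures that the user can select the media files they need, and further improves the user experience.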
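The two-stage selection above (coarse preset category first, then a finer subcategory recognized only within that parent category) can be sketched as follows. The feature names, the `SUBCATEGORY_BY_FEATURE` mapping and all function names are hypothetical and chosen purely for illustration; a real recognizer would work on image content.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaFile:
    name: str
    preset_category: str  # coarse parent category assigned in advance, e.g. "pet"
    features: frozenset   # content features, assumed to come from a recognizer


# Hypothetical mapping from a distinguishing feature to a subcategory.
SUBCATEGORY_BY_FEATURE = {"whiskers": "cat", "floppy_ears": "dog"}


def recognize_subcategory(media):
    """Return the first subcategory implied by the file's features, if any."""
    for feature in sorted(media.features):
        if feature in SUBCATEGORY_BY_FEATURE:
            return SUBCATEGORY_BY_FEATURE[feature]
    return None


def select_by_subcategory(files, first_target):
    """Stage 1: keep files in the same parent category. Stage 2: match subcategory."""
    target_sub = recognize_subcategory(first_target)
    if target_sub is None:
        return []
    parents = [m for m in files if m.preset_category == first_target.preset_category]
    return [
        m for m in parents
        if m != first_target and recognize_subcategory(m) == target_sub
    ]


album = [
    MediaFile("cat_a.jpg", "pet", frozenset({"whiskers", "fur"})),
    MediaFile("cat_b.jpg", "pet", frozenset({"whiskers"})),
    MediaFile("dog_a.jpg", "pet", frozenset({"floppy_ears", "fur"})),
    MediaFile("beach.jpg", "scenery", frozenset({"sand"})),
]
matches = select_by_subcategory(album, album[0])
```

Only the three "pet" files reach the subcategory recognizer, which is the point of the design: the finer (and presumably costlier) recognition runs on a much smaller candidate set.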
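Further, after the at least one second target media file that belongs to the same target category as the first target media file is determined from the plurality of media files, the media files in the album interface other than the first target media file and the second target media file may be hidden.
In practice, once the first target media file and the second target media file are determined, the other media files in the album interface may be hidden so that the user can clearly view the selected target media files; see FIG. 3D, in which all media files other than cats have been hidden.
When second target media files are selected, the album may contain many duplicate second target media files (for example, to ensure photo quality, users often shoot the same object several times, which produces multiple duplicate photos in the album). Therefore, to improve the efficiency of subsequent work, the present disclosure further performs the following operations before hiding the media files in the album interface other than the first target media file and the second target media file:
determining similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
performing duplicate filtering on the at least one second target media file according to the similarities.
In a specific implementation, a similarity threshold may be set. After the similarities among the at least one second target media file are determined, each similarity is compared with the similarity threshold; if the similarity is greater than the similarity threshold, the target media file is judged to be a duplicate media file and is filtered out. The similarity threshold may be set to 80%-100% (for example, 80%, 90% or 100%).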
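The threshold comparison above can be sketched as follows. The source does not say how similarity is computed, so Jaccard similarity over sets of content features is used here purely as an assumed stand-in; `jaccard_similarity` and `filter_duplicates` are hypothetical names, and keeping the first file of each duplicate group is likewise an illustrative choice.

```python
def jaccard_similarity(a: frozenset, b: frozenset) -> float:
    """Share of features common to both sets (1.0 means identical sets)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def filter_duplicates(files, threshold=0.8):
    """Keep a file only if it is not too similar to any file already kept.

    `files` is a list of (name, features) pairs; a similarity strictly
    greater than `threshold` marks the later file as a duplicate.
    """
    kept = []
    for name, features in files:
        is_duplicate = any(
            jaccard_similarity(features, kept_features) > threshold
            for _, kept_features in kept
        )
        if not is_duplicate:
            kept.append((name, features))
    return kept


candidates = [
    ("cat_1.jpg", frozenset({"cat", "sofa", "window", "plant", "rug"})),
    ("cat_1_retake.jpg", frozenset({"cat", "sofa", "window", "plant", "rug"})),
    ("cat_2.jpg", frozenset({"cat", "garden", "grass"})),
]
unique_files = [name for name, _ in filter_duplicates(candidates)]
```

Because the comparison is strictly greater-than, a threshold of 100% would filter nothing under this measure; the lower end of the stated range is what makes the filter effective in this sketch.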
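In step S13, in response to the user's selection operation on the at least one second target media file, synthesis processing is performed on the first target media file and the selected second target media file.
In embodiments of the present disclosure, after the media files in the album interface other than the first target media file and the second target media file are hidden, only the first target media file and the second target media file remain in the album interface, and the first target media file and the selected second target media file can then be processed. For example, any one of these media files may be deleted individually, the files may be deleted or replaced in batches, or a video synthesis operation may be performed on the first target media file and the selected second target media file.
In some embodiments, as shown in FIG. 2, performing synthesis processing on the first target media file and the selected second target media file in response to the user's selection operation on the at least one second target media file includes the following steps:
In step S21, in response to the user's selection operation on the at least one second target media file, a video template corresponding to the target category is displayed in the album interface.
In step S22, video synthesis processing is performed on the first target media file and the selected second target media file according to the video template.
It should be understood that, to improve the user experience and stimulate the user's desire to create videos, after the user selects at least one second target media file, the album interface displays a video template corresponding to the target category, according to which video synthesis processing can be performed on the first target media file and the selected second target media file.
The video template contains a variety of video effects produced when photos are automatically synthesized into a video. The video template corresponding to the target category may be a template for the category itself or a template for its parent category. For example, if the target category of the first target media file and the selected second target media file is cat, a cat template may be recommended, or a pet template (pet being the parent category of cat) may be recommended.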
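The template lookup with fallback to the parent category can be sketched as follows. The category hierarchy, the template names and the function `recommend_templates` are invented for illustration and are not taken from the disclosure.

```python
# Hypothetical category hierarchy and template catalogue.
PARENT_OF = {"cat": "pet", "dog": "pet"}
TEMPLATES = {"cat": ["cat_montage"], "pet": ["pet_diary"]}


def recommend_templates(category):
    """Collect templates for the category itself, then for each ancestor."""
    recommended = []
    current = category
    while current is not None:
        recommended.extend(TEMPLATES.get(current, []))
        current = PARENT_OF.get(current)  # None once the top of the hierarchy is reached
    return recommended


cat_templates = recommend_templates("cat")
dog_templates = recommend_templates("dog")
```

Listing the category's own templates before the parent's keeps the most specific recommendation first, while a category with no template of its own (here "dog") still receives the parent-category one.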
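In embodiments of the present disclosure, to let the user edit the first target media file and the selected second target media file in advance and to make the operation more focused, before video synthesis processing is performed on the first target media file and the selected second target media file according to the video template, the method further includes:
adding the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
In practice, the material display area may be located at the bottom of the album interface. So as not to interfere with the user's browsing of the first target media file and the selected second target media file, the area occupied by the material display area may be set to 1/10-1/5 of the total area of the album interface, for example 1/10, 1/8 or 1/5. The editing operations on the media files displayed in the material display area may include confirmation, deletion and the like; the material display area is the bottom region of FIGS. 3B-3D.
After video synthesis processing is performed on the first target media file and the selected second target media file according to the video template, a video preview function may be provided to improve the user experience; after previewing the synthesized video, the user can choose to store or delete it depending on how satisfied they are. Accordingly, the media file processing method provided by embodiments of the present disclosure further includes:
in response to the user's preview operation on the synthesized video, generating and displaying a preview video file.
In summary, the present disclosure displays a plurality of media files in an album interface and, in response to a user's selection operation on a first target media file among them, determines, from the plurality of media files, at least one second target media file belonging to the same target category as the first target media file. In other words, media files of the same category are matched automatically according to the user's selection, which quickly provides the user with material of the same category, allows media files of the same type to be generated according to the user's differing needs, increases user participation, and makes subsequent batch processing of same-type media files more convenient. In addition, in response to the user's selection operation on the at least one second target media file, the first target media file and the selected second target media file are processed; that is, after a single selection, batch operations such as storage, deletion and video synthesis can be performed on them. The whole media file processing procedure runs automatically, which saves time and effort and gives a better user experience. Adding the first target media file and the selected second target media file to the preset material display area in the album interface makes it easy for the user to edit the media files displayed there, further improving the user experience.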
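FIG. 4 is a block diagram of a media file processing apparatus according to an exemplary embodiment. Referring to FIG. 4, the apparatus includes a display unit 401, a first determining unit 402 and a processing unit 403.
The display unit 401 is configured to display a plurality of media files in an album interface.
The first determining unit 402 is configured to, in response to a user's selection operation on a first target media file among the plurality of media files, determine, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file.
The processing unit 403 is configured to, in response to the user's selection operation on the at least one second target media file, perform synthesis processing on the first target media file and the selected second target media file.
In some embodiments of the present disclosure, the first determining unit 402 is further configured to:
identify content features of the plurality of media files to obtain the category corresponding to each of the plurality of media files; and
determine, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
In some embodiments of the present disclosure, the first determining unit 402 is further configured to:
determine, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
identify content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
determine, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
In some embodiments of the present disclosure, the processing unit 403 is further configured to:
in response to the user's selection operation on the at least one second target media file, display, in the album interface, a video template corresponding to the target category; and
perform video synthesis processing on the first target media file and the selected second target media file according to the video template.
In some embodiments of the present disclosure, the apparatus further includes:
an adding unit configured to add the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
In some embodiments of the present disclosure, the apparatus further includes:
a generating unit configured to, in response to the user's preview operation on the synthesized video, generate and display a preview video file.
In some embodiments of the present disclosure, the apparatus further includes:
a second determining unit configured to determine similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
a filtering unit configured to perform duplicate filtering on the at least one second target media file according to the similarities.
As for the apparatus in the above embodiments, the specific manner in which each module performs its operations has been described in detail in the embodiments of the method and is not elaborated here.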
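FIG. 5 is a block diagram of an electronic device according to an exemplary embodiment. The electronic device 500 may be an electronic device used by a user, for example a smartphone, smart watch, desktop computer or laptop computer; it may also be referred to by other names such as laptop electronic device or desktop electronic device.
Generally, the electronic device 500 includes a processor 501 and a memory 502.
The processor 501 may include one or more processing cores, for example a 4-core or 8-core processor. The processor 501 may be implemented in at least one hardware form among DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array) and PLA (Programmable Logic Array). The processor 501 may also include a main processor and a coprocessor: the main processor, also called a CPU (Central Processing Unit), processes data in the awake state, while the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor 501 may be integrated with a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content to be displayed on the display screen. In some embodiments, the processor 501 may further include an AI (Artificial Intelligence) processor for handling computing operations related to machine learning.
The memory 502 may include one or more storage media, which may be non-transitory. The memory 502 may also include high-speed random access memory and non-volatile memory, such as one or more magnetic disk storage devices or flash storage devices.
In some embodiments, the electronic device 500 optionally further includes a peripheral device interface 503 and at least one peripheral device. The processor 501, the memory 502 and the peripheral device interface 503 may be connected by a bus or signal lines. Each peripheral device may be connected to the peripheral device interface 503 by a bus, signal line or circuit board. Specifically, the peripheral devices include at least one of a radio frequency circuit 504, a display screen 505, a camera assembly 506, an audio circuit 507, a positioning component 508 and a power supply 509.
The peripheral device interface 503 may be used to connect at least one I/O (Input/Output) related peripheral device to the processor 501 and the memory 502. In some embodiments, the processor 501, the memory 502 and the peripheral device interface 503 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 501, the memory 502 and the peripheral device interface 503 may be implemented on a separate chip or circuit board, which is not limited in this embodiment.
The radio frequency circuit 504 is used to receive and transmit RF (Radio Frequency) signals, also called electromagnetic signals. The radio frequency circuit 504 communicates with communication networks and other communication devices through electromagnetic signals, converting electrical signals into electromagnetic signals for transmission or converting received electromagnetic signals into electrical signals. In some embodiments, the radio frequency circuit 504 includes an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card and so on. The radio frequency circuit 504 can communicate with other terminals through at least one wireless communication protocol. The wireless communication protocols include, but are not limited to, metropolitan area networks, mobile communication networks of each generation (2G, 3G, 4G and 5G), wireless local area networks and/or WiFi (Wireless Fidelity) networks. In some embodiments, the radio frequency circuit 504 may further include circuits related to NFC (Near Field Communication), which is not limited in the present disclosure.
The display screen 505 is used to display a UI (User Interface). The UI may include images, text, icons, video and any combination thereof. When the display screen 505 is a touch display screen, it can also collect touch signals on or above its surface. The touch signal may be input to the processor 501 as a control signal for processing. In this case, the display screen 505 may also be used to provide virtual buttons and/or a virtual keyboard, also called soft buttons and/or a soft keyboard. In some embodiments, there may be one display screen 505, disposed on the front panel of the electronic device 500; in other embodiments, there may be at least two display screens 505, disposed on different surfaces of the electronic device 500 or in a folding design; in still other embodiments, the display screen 505 may be a flexible display screen disposed on a curved or folding surface of the electronic device 500. The display screen 505 may even be given a non-rectangular irregular shape, that is, a special-shaped screen. The display screen 505 may be made of materials such as LCD (Liquid Crystal Display) or OLED (Organic Light-Emitting Diode).
The camera assembly 506 is used to capture images or video. In some embodiments, the camera assembly 506 includes a front camera and a rear camera. Usually, the front camera is disposed on the front panel of the terminal and the rear camera on the back of the terminal. In some embodiments, there are at least two rear cameras, each being any one of a main camera, a depth-of-field camera, a wide-angle camera and a telephoto camera, so that the main camera and the depth-of-field camera can be fused to realize a background blur function, and the main camera and the wide-angle camera can be fused to realize panoramic shooting, VR (Virtual Reality) shooting or other fused shooting functions. In some embodiments, the camera assembly 506 may further include a flash. The flash may be a single-color-temperature flash or a dual-color-temperature flash. A dual-color-temperature flash is a combination of a warm-light flash and a cold-light flash and can be used for light compensation at different color temperatures.
The audio circuit 507 may include a microphone and a speaker. The microphone collects sound waves from the user and the environment and converts them into electrical signals that are input to the processor 501 for processing or to the radio frequency circuit 504 for voice communication. For stereo collection or noise reduction, there may be multiple microphones disposed at different parts of the electronic device 500. The microphone may also be an array microphone or an omnidirectional microphone. The speaker converts electrical signals from the processor 501 or the radio frequency circuit 504 into sound waves. The speaker may be a conventional membrane speaker or a piezoelectric ceramic speaker. A piezoelectric ceramic speaker can convert electrical signals not only into sound waves audible to humans but also into sound waves inaudible to humans, for purposes such as distance measurement. In some embodiments, the audio circuit 507 may further include a headphone jack.
The positioning component 508 is used to locate the current geographic position of the electronic device 500 to implement navigation or LBS (Location Based Service). The positioning component 508 may be based on the GPS (Global Positioning System) of the United States, the BeiDou system of China, the GLONASS system of Russia or the Galileo system of the European Union.
The power supply 509 supplies power to the components of the electronic device 500. The power supply 509 may be alternating current, direct current, a disposable battery or a rechargeable battery. When the power supply 509 includes a rechargeable battery, the rechargeable battery may support wired or wireless charging and may also support fast charging technology.
In some embodiments, the electronic device 500 further includes one or more sensors 510. The one or more sensors 510 include, but are not limited to, an acceleration sensor 511, a gyro sensor 512, a pressure sensor 513, a fingerprint sensor 514, an optical sensor 515 and a proximity sensor 516.
The acceleration sensor 511 can detect the magnitude of acceleration on the three coordinate axes of the coordinate system established by the electronic device 500. For example, the acceleration sensor 511 can be used to detect the components of gravitational acceleration on the three coordinate axes. The processor 501 may control the display screen 505 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 511. The acceleration sensor 511 can also be used to collect motion data of a game or of the user.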
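As a rough illustration of how gravity components could drive the landscape/portrait decision, the sketch below compares the magnitudes of two components. It assumes that the x axis runs along the short edge of the screen and the y axis along the long edge; `choose_orientation` is a hypothetical name, and real devices typically add hysteresis and handle the lying-flat case, neither of which is modeled here.

```python
def choose_orientation(gx: float, gy: float) -> str:
    """Pick a UI orientation from gravity components along the device axes.

    Assumption: x is the short screen edge and y is the long screen edge,
    so gravity mostly along y means the device is held upright.
    """
    return "portrait" if abs(gy) >= abs(gx) else "landscape"


upright = choose_orientation(0.3, 9.7)
sideways = choose_orientation(9.6, 0.5)
```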
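The gyro sensor 512 can detect the body orientation and rotation angle of the electronic device 500, and can cooperate with the acceleration sensor 511 to collect the user's 3D actions on the electronic device 500. According to the data collected by the gyro sensor 512, the processor 501 can implement the following functions: motion sensing (such as changing the UI according to the user's tilt operation), image stabilization during shooting, game control and inertial navigation.
The pressure sensor 513 may be disposed on the side frame of the electronic device 500 and/or on the lower layer of the display screen 505. When the pressure sensor 513 is disposed on the side frame of the electronic device 500, it can detect the user's grip signal on the electronic device 500, and the processor 501 performs left/right hand recognition or shortcut operations according to the grip signal collected by the pressure sensor 513. When the pressure sensor 513 is disposed on the lower layer of the display screen 505, the processor 501 controls the operable controls on the UI according to the user's pressure operation on the display screen 505. The operable controls include at least one of button controls, scroll bar controls, icon controls and menu controls.
The fingerprint sensor 514 is used to collect the user's fingerprint, and either the processor 501 recognizes the user's identity from the fingerprint collected by the fingerprint sensor 514, or the fingerprint sensor 514 recognizes the user's identity from the collected fingerprint. When the user's identity is recognized as a trusted identity, the processor 501 authorizes the user to perform related sensitive operations, including unlocking the screen, viewing encrypted information, downloading software, making payments and changing settings. The fingerprint sensor 514 may be disposed on the front, back or side of the electronic device 500. When the electronic device 500 is provided with a physical button or a manufacturer's logo, the fingerprint sensor 514 may be integrated with the physical button or the manufacturer's logo.
The optical sensor 515 is used to collect ambient light intensity. In one embodiment, the processor 501 may control the display brightness of the display screen 505 according to the ambient light intensity collected by the optical sensor 515. Specifically, when the ambient light intensity is high, the display brightness of the display screen 505 is increased; when the ambient light intensity is low, the display brightness of the display screen 505 is decreased. In another embodiment, the processor 501 may also dynamically adjust shooting parameters of the camera assembly 506 according to the ambient light intensity collected by the optical sensor 515.
The proximity sensor 516, also called a distance sensor, is usually disposed on the front panel of the electronic device 500. The proximity sensor 516 is used to collect the distance between the user and the front of the electronic device 500. In one embodiment, when the proximity sensor 516 detects that the distance between the user and the front of the electronic device 500 gradually decreases, the processor 501 controls the display screen 505 to switch from the screen-on state to the screen-off state; when the proximity sensor 516 detects that the distance between the user and the front of the electronic device 500 gradually increases, the processor 501 controls the display screen 505 to switch from the screen-off state to the screen-on state.
Those skilled in the art will understand that the structure shown in FIG. 5 does not constitute a limitation on the electronic device 500, which may include more or fewer components than shown, combine some components, or adopt a different component arrangement.
In an exemplary embodiment, the present disclosure also provides a computer-readable storage medium including instructions, for example a memory including instructions, where the instructions can be executed by the processor 501 of the electronic device 500 to complete the above media file processing method. In some embodiments, the storage medium may be a non-transitory storage medium, for example a ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk or optical data storage device.
In an exemplary embodiment, the present disclosure also provides a computer program product including a computer program that can be executed by a processor of an electronic device to implement the above media file processing method.
All embodiments of the present disclosure can be implemented alone or in combination with other embodiments, and all such implementations are regarded as falling within the scope of protection claimed by the present disclosure.
It should be understood that the present disclosure is not limited to the precise structures described above and shown in the drawings, and that various modifications and changes can be made without departing from its scope. The scope of the present disclosure is limited only by the appended claims.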

Claims (23)

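  1. A media file processing method, comprising:
    displaying a plurality of media files in an album interface;
    in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
    in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.
  2. The media file processing method according to claim 1, wherein determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file comprises:
    identifying content features of the plurality of media files to obtain a category corresponding to each of the plurality of media files; and
    determining, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
  3. The media file processing method according to claim 1, wherein determining, from the plurality of media files, the at least one second target media file that belongs to the same target category as the first target media file comprises:
    determining, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
    identifying content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
    determining, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
  4. The media file processing method according to any one of claims 1-3, wherein performing synthesis processing on the first target media file and the selected second target media file in response to the user's selection operation on the at least one second target media file comprises:
    in response to the user's selection operation on the at least one second target media file, displaying, in the album interface, a video template corresponding to the target category; and
    performing video synthesis processing on the first target media file and the selected second target media file according to the video template.
  5. The media file processing method according to claim 4, wherein the method further comprises:
    adding the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  6. The media file processing method according to claim 4, wherein the method further comprises:
    in response to the user's preview operation on the synthesized video, generating and displaying a preview video file.
  7. The media file processing method according to any one of claims 1-3, wherein the method further comprises:
    determining similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
    performing duplicate filtering on the at least one second target media file according to the similarities.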
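  8. A media file processing apparatus, comprising:
    a display unit configured to display a plurality of media files in an album interface;
    a first determining unit configured to, in response to a user's selection operation on a first target media file among the plurality of media files, determine, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
    a processing unit configured to, in response to the user's selection operation on the at least one second target media file, perform synthesis processing on the first target media file and the selected second target media file.
  9. The media file processing apparatus according to claim 8, wherein the first determining unit is further configured to:
    identify content features of the plurality of media files to obtain a category corresponding to each of the plurality of media files; and
    determine, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
  10. The media file processing apparatus according to claim 8, wherein the first determining unit is further configured to:
    determine, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
    identify content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
    determine, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
  11. The media file processing apparatus according to any one of claims 8-10, wherein the processing unit is further configured to:
    in response to the user's selection operation on the at least one second target media file, display, in the album interface, a video template corresponding to the target category; and
    perform video synthesis processing on the first target media file and the selected second target media file according to the video template.
  12. The media file processing apparatus according to claim 11, wherein the apparatus further comprises:
    an adding unit configured to add the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  13. The media file processing apparatus according to claim 11, wherein the apparatus further comprises:
    a generating unit configured to, in response to the user's preview operation on the synthesized video, generate and display a preview video file.
  14. The media file processing apparatus according to any one of claims 8-10, wherein the apparatus further comprises:
    a second determining unit configured to determine similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
    a filtering unit configured to perform duplicate filtering on the at least one second target media file according to the similarities.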
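  15. An electronic device, comprising:
    a processor; and
    a memory for storing instructions executable by the processor;
    wherein the processor is configured to execute the instructions to implement a media file processing method, the method comprising:
    displaying a plurality of media files in an album interface;
    in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
    in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.
  16. The electronic device according to claim 15, wherein the processor is further configured to perform:
    identifying content features of the plurality of media files to obtain a category corresponding to each of the plurality of media files; and
    determining, from the plurality of media files and according to the category corresponding to each of them, the at least one second target media file that belongs to the same target category as the first target media file.
  17. The electronic device according to claim 15, wherein the processor is further configured to perform:
    determining, according to a preset category corresponding to each of the plurality of media files, at least one parent-category media file that belongs to the same target parent category as the first target media file;
    identifying content features of the at least one parent-category media file to obtain a subcategory corresponding to each of the at least one parent-category media file; and
    determining, from the at least one parent-category media file and according to the subcategory corresponding to each of them, at least one second target media file that belongs to the same target subcategory as the first target media file.
  18. The electronic device according to any one of claims 15-17, wherein the processor is further configured to perform:
    in response to the user's selection operation on the at least one second target media file, displaying, in the album interface, a video template corresponding to the target category; and
    performing video synthesis processing on the first target media file and the selected second target media file according to the video template.
  19. The electronic device according to claim 18, wherein the processor is further configured to perform:
    adding the first target media file and the selected second target media file to a preset material display area in the album interface, so that the user can edit the media files displayed in the material display area.
  20. The electronic device according to claim 19, wherein the processor is further configured to perform:
    in response to the user's preview operation on the synthesized video, generating and displaying a preview video file.
  21. The electronic device according to any one of claims 15-17, wherein the processor is further configured to perform:
    determining similarities among the at least one second target media file according to the content features corresponding to each of the at least one second target media file; and
    performing duplicate filtering on the at least one second target media file according to the similarities.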
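  22. A non-volatile computer-readable storage medium, wherein, when instructions in the computer-readable storage medium are executed by a processor of an electronic device, the electronic device is enabled to perform a media file processing method, the method comprising:
    displaying a plurality of media files in an album interface;
    in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
    in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.
  23. A computer program product comprising a computer program, wherein the computer program, when executed by a processor, implements a media file processing method, the method comprising:
    displaying a plurality of media files in an album interface;
    in response to a user's selection operation on a first target media file among the plurality of media files, determining, from the plurality of media files, at least one second target media file that belongs to the same target category as the first target media file; and
    in response to the user's selection operation on the at least one second target media file, performing synthesis processing on the first target media file and the selected second target media file.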
PCT/CN2022/100236 2021-11-19 2022-06-21 媒体文件处理方法及装置 WO2023087703A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111399886.8A CN114297150A (zh) 2021-11-19 2021-11-19 媒体文件处理方法、装置、设备及存储介质
CN202111399886.8 2021-11-19

Publications (2)

Publication Number Publication Date
WO2023087703A1 true WO2023087703A1 (zh) 2023-05-25
WO2023087703A9 WO2023087703A9 (zh) 2023-07-13

Family

ID=80966262

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/100236 WO2023087703A1 (zh) 2021-11-19 2022-06-21 媒体文件处理方法及装置

Country Status (2)

Country Link
CN (1) CN114297150A (zh)
WO (1) WO2023087703A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114297150A (zh) * 2021-11-19 2022-04-08 北京达佳互联信息技术有限公司 媒体文件处理方法、装置、设备及存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090055746A1 (en) * 2005-01-20 2009-02-26 Koninklijke Philips Electronics, N.V. Multimedia presentation creation
CN109167937A (zh) * 2018-11-05 2019-01-08 北京达佳互联信息技术有限公司 视频发布方法、装置、终端及存储介质
CN111246289A (zh) * 2020-03-09 2020-06-05 Oppo广东移动通信有限公司 视频生成方法及装置、电子设备、存储介质
CN112291484A (zh) * 2019-07-23 2021-01-29 腾讯科技(深圳)有限公司 视频合成方法、装置、电子设备及存储介质
CN112988671A (zh) * 2019-12-13 2021-06-18 北京字节跳动网络技术有限公司 媒体文件处理方法、装置、可读介质及电子设备
CN114297150A (zh) * 2021-11-19 2022-04-08 北京达佳互联信息技术有限公司 媒体文件处理方法、装置、设备及存储介质

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110891191B (zh) * 2018-09-07 2022-06-07 阿里巴巴(中国)有限公司 素材选择方法、装置及存储介质
CN110175252A (zh) * 2019-05-07 2019-08-27 深圳前海微众银行股份有限公司 一种图片显示的方法及装置
CN112579826A (zh) * 2020-12-07 2021-03-30 北京字节跳动网络技术有限公司 视频显示及处理方法、装置、系统、设备、介质
CN112989182B (zh) * 2021-02-01 2023-12-12 腾讯科技(深圳)有限公司 信息处理方法、装置、信息处理设备及存储介质

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090055746A1 (en) * 2005-01-20 2009-02-26 Koninklijke Philips Electronics, N.V. Multimedia presentation creation
CN109167937A (zh) * 2018-11-05 2019-01-08 北京达佳互联信息技术有限公司 视频发布方法、装置、终端及存储介质
CN112291484A (zh) * 2019-07-23 2021-01-29 腾讯科技(深圳)有限公司 视频合成方法、装置、电子设备及存储介质
CN112988671A (zh) * 2019-12-13 2021-06-18 北京字节跳动网络技术有限公司 媒体文件处理方法、装置、可读介质及电子设备
CN111246289A (zh) * 2020-03-09 2020-06-05 Oppo广东移动通信有限公司 视频生成方法及装置、电子设备、存储介质
CN114297150A (zh) * 2021-11-19 2022-04-08 北京达佳互联信息技术有限公司 媒体文件处理方法、装置、设备及存储介质

Also Published As

Publication number Publication date
WO2023087703A9 (zh) 2023-07-13
CN114297150A (zh) 2022-04-08

Similar Documents

Publication Publication Date Title
US11557322B2 (en) Method and device for generating multimedia resource
KR102635373B1 (ko) 이미지 처리 방법 및 장치, 단말 및 컴퓨터 판독 가능 저장 매체
CN108769562B (zh) 生成特效视频的方法和装置
CN110545476B (zh) 视频合成的方法、装置、计算机设备及存储介质
CN111065001B (zh) 视频制作的方法、装置、设备及存储介质
EP3941037A1 (en) Editing template generating method and apparatus, electronic device, and storage medium
WO2022048398A1 (zh) 多媒体数据拍摄方法及终端
CN111880888B (zh) 预览封面生成方法、装置、电子设备及存储介质
WO2022134632A1 (zh) 作品处理方法及装置
CN112363660B (zh) 封面图像的确定方法、装置、电子设备及存储介质
CN110225390B (zh) 视频预览的方法、装置、终端及计算机可读存储介质
CN111221457A (zh) 多媒体内容的调整方法、装置、设备及可读存储介质
WO2022033272A1 (zh) 图像处理方法以及电子设备
WO2023087703A9 (zh) 媒体文件处理方法及装置
CN111031394B (zh) 视频制作的方法、装置、设备及存储介质
CN110191236B (zh) 歌曲播放队列管理方法、装置、终端设备及存储介质
CN111539795A (zh) 图像处理方法、装置、电子设备及计算机可读存储介质
CN113301422B (zh) 获取视频封面的方法、终端及存储介质
CN113763486B (zh) 主色调提取方法、装置、电子设备及存储介质
CN111857793B (zh) 网络模型的训练方法、装置、设备及存储介质
CN114302253B (zh) 媒体数据处理方法、装置、设备及存储介质
CN113157310A (zh) 配置信息的获取方法、装置、设备及计算机可读存储介质
CN113420172A (zh) 图片分享方法、装置、计算机设备及介质
CN114972011A (zh) 图像编辑方法、装置、终端及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application (Ref document number: 22894246; Country of ref document: EP; Kind code of ref document: A1)