WO2017113873A1 - Image synthesizing method, device and computer storage medium - Google Patents

Image synthesizing method, device and computer storage medium

Info

Publication number
WO2017113873A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
image
background
information
keyword
Prior art date
Application number
PCT/CN2016/098207
Other languages
French (fr)
Chinese (zh)
Inventor
李乐义
Original Assignee
努比亚技术有限公司
Priority date
Filing date
Publication date
Application filed by 努比亚技术有限公司
Publication of WO2017113873A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/95 Computational photography systems, e.g. light-field imaging systems
    • H04N23/951 Computational photography systems, e.g. light-field imaging systems by using two or more images to influence resolution, frame rate or aspect ratio
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272 Means for inserting a foreground image in a background image, i.e. inlay, outlay

Definitions

  • the present invention relates to image processing technologies in the field of terminals, and in particular, to an image synthesis method, apparatus, and computer storage medium.
  • the terminal has various functions such as a telephone, a camera, a video recorder, and a music player.
  • although the image taken by the camera function of the terminal is of good quality, only real-time scenes can be shot. If the user wants a photo of a person in a landscape but does not want to travel to the location, the only option is manual cutout and stitching: for example, taking a picture of a person, cropping the person with a finger or a stylus, and stitching the cropped person onto a landscape photo to obtain the picture the user needs.
  • embodiments of the present invention are expected to provide an image synthesizing method, apparatus, and computer storage medium.
  • an embodiment of the present invention provides an image synthesizing method, including:
  • the person information is synthesized on the background picture to obtain a composite picture.
  • the acquiring the background image by using the voice control method includes:
  • the acquiring the background image according to the voice information includes:
  • the background image is acquired in a preset image library according to the correspondence between the keyword and the picture.
  • the acquiring the background image in the preset image library according to the correspondence between the keyword and the image including:
  • the acquiring the background image according to the voice information includes:
  • the background image corresponding to the keyword is captured from the network by using a crawler program.
  • the capturing, by the crawler, the background image corresponding to the keyword from the network includes:
  • the picture confirmed by the user is taken as the background picture.
  • the extracting the character information from the character image includes:
  • the person information is extracted by using a channel extraction technique.
  • the solid color background is a blue background
  • the extracting the character information by using a channel extraction technology includes:
  • the character information is extracted using a blue screen technology.
  • the method further includes:
  • the weather information and/or the location information is added to the composite picture.
  • the method further includes:
  • an image synthesizing apparatus including:
  • a shooting unit configured to capture a portrait of a person on a solid background
  • An extracting unit configured to extract character information from the character image
  • the first obtaining unit is configured to obtain a background image by using a voice control manner
  • a synthesizing unit configured to synthesize the character information on the background image to obtain a composite picture.
  • the first acquiring unit is configured to:
  • the first acquiring unit is configured to:
  • the background image is acquired in a preset image library according to the correspondence between the keyword and the picture.
  • the first acquiring unit is configured to:
  • the first acquiring unit is configured to:
  • the background image corresponding to the keyword is captured from the network by using a crawler program.
  • the first acquiring unit is configured to:
  • the picture confirmed by the user is taken as the background picture.
  • the extracting unit is configured to:
  • the person information is extracted by using a channel extraction technique.
  • the device further includes:
  • a second acquiring unit configured to acquire weather information and/or location information of a geographic location corresponding to the current background image
  • An adding unit configured to add the weather information and/or the location information to the composite picture.
  • the adding unit is further configured to save or share the composite picture.
  • an embodiment of the present invention provides a computer storage medium, where the computer storage medium includes a set of instructions that, when executed, cause at least one processor to execute the image synthesis method described above.
  • Embodiments of the present invention provide an image synthesizing method, apparatus, and computer storage medium, which capture a person image against a solid color background; extract person information from the person image; acquire a background picture by voice control; and synthesize the person information onto the background picture to obtain a composite picture.
  • because the person image is shot against a solid color background, the background is relatively simple and can be removed by intelligent image processing to extract the person information, so the outline of the extracted person is accurate and the details of the person are not lost.
  • as a result, the composite picture looks more natural and the user experience is better.
  • FIG. 1 is a schematic structural diagram of hardware of a mobile terminal that can be implemented in an embodiment of the present invention
  • FIG. 2 is a schematic flowchart 1 of an image synthesizing method according to an embodiment of the present invention
  • FIG. 3 is a schematic flowchart 2 of an image synthesizing method according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram 1 of an image synthesizing apparatus according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram 2 of an image synthesizing apparatus according to an embodiment of the present invention.
  • the mobile terminal can be implemented in various forms.
  • the terminals described in the present invention may include mobile terminals such as mobile phones, smart phones, notebook computers, digital broadcast receivers, personal digital assistants (PDAs), tablet computers (PADs), portable multimedia players (PMPs), navigation devices, and the like, and fixed terminals such as digital TVs, desktop computers, and the like.
  • in the following, it is assumed that the terminal is a mobile terminal.
  • apart from elements intended specifically for mobile purposes, the configuration according to an embodiment of the present invention can also be applied to a fixed type terminal.
  • FIG. 1 is a schematic diagram showing the hardware structure of a mobile terminal that can be implemented in various embodiments of the present invention. As shown in FIG. 1, the mobile terminal includes:
  • the A/V input unit 120 is configured to receive an audio or video signal.
  • the A/V input unit 120 may include a camera 121 and a microphone 122. The camera 121 processes image data of still pictures or video obtained by the image capturing device in a video capturing mode or an image capturing mode.
  • the processed image frame can be displayed on the display unit 151.
  • the image frames processed by the camera 121 may be stored in the memory 160 (or other storage medium), and two or more cameras 121 may be provided according to the configuration of the mobile terminal.
  • the microphone 122 can receive sound (audio data) via a microphone in an operation mode of a telephone call mode, a recording mode, a voice recognition mode, and the like, and can process such sound as audio data.
  • the user input unit 130 may generate key input data according to a command input by the user to control various operations of the mobile terminal.
  • the user input unit 130 allows the user to input various types of information, and may include a keyboard, a dome switch, a touch pad (e.g., a touch sensitive component that detects changes in resistance, pressure, capacitance, etc. due to contact), a scroll wheel, a rocker, etc.
  • a touch screen can be formed.
  • Output unit 150 is configured to provide an output signal (eg, an audio signal, a video signal, an alarm signal, a vibration signal, etc.) in a visual, audio, and/or tactile manner.
  • the output unit 150 may include a display unit 151.
  • the display unit 151 can display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in a phone call mode, the display unit 151 can display a user interface (UI) or a graphical user interface (GUI) related to a call or other communication (eg, text messaging, multimedia file download, etc.). When the mobile terminal 100 is in a video call mode or an image capturing mode, the display unit 151 may display a captured image and/or a received image, a UI or GUI showing a video or image and related functions, and the like.
  • the display unit 151 can function as an input device and an output device.
  • the display unit 151 may include at least one of a liquid crystal display (LCD), a thin film transistor LCD (TFT-LCD), an organic light emitting diode (OLED) display, a flexible display, a three-dimensional (3D) display, and the like.
  • Some of these displays may be configured to be transparent to allow a user to view from the outside, which may be referred to as a transparent display, and a typical transparent display may be, for example, a TOLED (Transparent Organic Light Emitting Diode) display or the like.
  • the mobile terminal 100 may include two or more display units (or other display devices), for example, the mobile terminal may include an external display unit (not shown) and an internal display unit (not shown) .
  • the touch screen can be configured to detect touch input pressure as well as touch input position and touch input area.
  • the memory 160 may store a software program or the like that performs processing and control operations performed by the controller 180, or may temporarily store data (for example, a phone book, a message, a still image, a video, and the like) that has been output or is to be output. Moreover, the memory 160 can store data regarding vibrations and audio signals of various manners that are output when a touch is applied to the touch screen.
  • the memory 160 may include at least one type of storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a random access memory (RAM), a static random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like.
  • the mobile terminal 100 can cooperate with a network storage device that performs a storage function of the memory 160 through a network connection.
  • the controller 180 typically controls the overall operation of the mobile terminal. For example, the controller 180 performs the control and processing associated with voice calls, data communications, video calls, and the like.
  • the power supply unit 190 receives external power or internal power under the control of the controller 180 and provides the power required to operate the various elements and components.
  • the various embodiments described herein can be implemented in a computer readable medium using, for example, computer software, hardware, or any combination thereof.
  • for a hardware implementation, the embodiments described herein may be implemented using at least one of application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, microcontrollers, microprocessors, and electronic units designed to perform the functions described herein. In some cases, such an embodiment may be implemented in the controller 180.
  • implementations such as procedures or functions may be implemented with separate software modules that permit the execution of at least one function or operation.
  • the software code can be implemented by a software application (or program) written in any suitable programming language, which can be stored in memory
  • the mobile terminal has been described in terms of its function.
  • a slide type mobile terminal among various types of mobile terminals such as a folding type, a bar type, a swing type, a slide type mobile terminal, and the like will be described as an example. Therefore, the present invention can be applied to any type of mobile terminal, and is not limited to a slide type mobile terminal.
  • the mobile terminal 100 as shown in FIG. 1 may be configured to operate using a communication system such as a wired and wireless communication system and a satellite-based communication system that transmits data via frames or packets.
  • the embodiment of the present invention provides an image synthesizing method, which is applied to a terminal.
  • the terminal may be a mobile phone, a smart phone, a tablet computer, etc., which is not limited in this embodiment of the present invention.
  • the image synthesis method includes:
  • Step 201 Shoot a character image on a solid color background.
  • in principle, there is no restriction on the choice of background color.
  • the commonly used background colors are green and blue, because the natural colors of the human body do not contain these two colors, so a green or blue background does not blend with the person. If the person in the foreground wears green clothes, use a blue background; if the clothes are blue, use a green background. Green and blue are also two of the three primary colors (red, green, blue), which makes them easier to process.
  • Step 202 Extract character information from the character image.
  • the person information may be extracted using a channel extraction technique (matte extraction), which may also be referred to as keying.
  • Many film and television works can extract the foreground information of the pictures taken in the studio under the solid color background through the channel extraction technology, and synthesize the pictures taken with the exterior scene to create a more exciting picture effect.
  • Blue Screen is the most important method for channel extraction.
  • the blue screen technology takes a picture of a person against a blue background and then uses the difference in chromaticity to remove the monochrome background and obtain the person information; for this reason, the blue screen technology is formally known as chroma keying.
  • here, blue is selected as the key (background) color.
  • the software commonly used for the blue screen technology is AE (Adobe After Effects), a video editing and design software developed by Adobe and a professional non-linear editing software for video post-synthesis processing.
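The chroma-keying idea described above can be illustrated with a minimal sketch. It is not the method of the patent or of After Effects: it assumes an image is a list of rows of (R, G, B) tuples, and the function name `blue_screen_mask` and the `dominance` threshold are hypothetical choices made for illustration.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each in 0..255
Image = List[List[Pixel]]      # rows of pixels


def blue_screen_mask(image: Image, dominance: int = 60) -> List[List[bool]]:
    """Return a mask that is True where a pixel belongs to the person.

    A pixel is treated as blue background when its blue channel exceeds
    both red and green by at least `dominance` (an assumed threshold).
    """
    mask = []
    for row in image:
        mask_row = []
        for r, g, b in row:
            is_background = (b - max(r, g)) >= dominance
            mask_row.append(not is_background)
        mask.append(mask_row)
    return mask
```

A real keyer would also soften the mask edge and suppress blue spill on the subject; this sketch only shows the chromaticity-difference test itself.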
  • Step 203 Acquire a background image by using a voice control manner.
  • the user may send voice information to the terminal, where the voice information includes keywords related to the background picture required by the user, such as the composition of the scene, the elements appearing in the background picture, the name of the scenic spot, etc.
  • after receiving the voice information, the terminal may obtain the keyword related to the background picture by extracting it from the voice information, and then obtain a background picture that meets the requirement according to the keyword.
  • the correspondence between the keyword and the picture may be preset in the terminal during initialization, and the correspondence may be as shown in Table 1:
  • the picture corresponding to “mobile phone” is picture C, an image showing a smart phone; picture C is then displayed on the screen for the user to confirm. If the user determines that picture C meets the requirements, picture C can be used as the background picture.
  • the crawler program may also be used to retrieve pictures related to the keyword from the network for selection by the user. For example, when the keyword extracted by the terminal from the voice information sent by the user is “mobile phone”, the terminal crawls all the pictures related to “mobile phone” from the network through the crawler program, and then displays the captured pictures in turn on the screen for the user to confirm. If the user determines that a picture meets the requirements, the picture selected by the user can be used as the background picture.
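The two acquisition paths above (preset correspondence first, crawler as fallback, user confirmation in both cases) can be sketched as follows. This is an illustrative outline only: `find_background`, `library`, `crawl`, and `confirm` are hypothetical names, and the crawler and the confirmation dialog are passed in as plain callables rather than implemented.

```python
from typing import Callable, Dict, Iterable, Optional


def find_background(
    keyword: str,
    library: Dict[str, str],
    crawl: Callable[[str], Iterable[str]],
    confirm: Callable[[str], bool],
) -> Optional[str]:
    """Return the first picture the user confirms, or None if all are rejected."""
    # Path 1: preset keyword-to-picture correspondence stored on the terminal.
    preset = library.get(keyword)
    if preset is not None and confirm(preset):
        return preset
    # Path 2: pictures fetched from the network, shown to the user in turn.
    for picture in crawl(keyword):
        if confirm(picture):
            return picture
    return None
```

Passing the crawler as a callable keeps the network access out of the selection logic, so the fallback is only invoked when the preset picture is missing or rejected.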
  • Step 204 Synthesize the character information on the background picture to obtain a composite picture.
  • the background picture acquired and meeting the user's needs is analyzed to obtain an optimal composition scheme, and then the acquired character information is synthesized on the background picture according to an optimal composition scheme to obtain a composite picture.
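As a rough illustration of the synthesis step, the sketch below pastes the extracted person pixels onto a copy of the background. It does not compute the "optimal composition scheme" mentioned above; the placement is simply supplied through the hypothetical `top` and `left` parameters, and images are assumed to be lists of rows of (R, G, B) tuples with a boolean foreground mask.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]


def composite(person: Image, mask: List[List[bool]], background: Image,
              top: int = 0, left: int = 0) -> Image:
    """Paste masked person pixels onto a copy of the background at (top, left)."""
    result = [list(row) for row in background]  # leave the input untouched
    for y, (row, mask_row) in enumerate(zip(person, mask)):
        for x, (pixel, keep) in enumerate(zip(row, mask_row)):
            ty, tx = top + y, left + x
            inside = 0 <= ty < len(result) and 0 <= tx < len(result[ty])
            if keep and inside:
                result[ty][tx] = pixel
    return result
```

Pixels that fall outside the background are skipped, so a person placed near an edge is cropped rather than raising an error.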
  • current weather information of the geographic location corresponding to the background picture may also be acquired, and the weather information and/or the location information is then added to the composite picture.
  • for example, assume the background picture shows the Badaling Great Wall.
  • the location information of the Badaling Great Wall may then be added to the composite picture, where the location information is that of the geographic location corresponding to the background picture.
  • the location information of the Badaling Great Wall may be expressed as latitude and longitude or in Chinese characters, which is not limited by the embodiment of the present invention.
  • the composite picture may be further beautified and edited after the composite picture is obtained. For example, add a filter for processing, or adjust the contrast or brightness of a composite image.
  • An embodiment of the present invention provides an image synthesizing method, including: capturing a person image against a solid color background; extracting person information from the person image; acquiring a background picture by voice control; and synthesizing the person information onto the background picture to obtain a composite picture.
  • because the person image is shot against a solid color background, the background is relatively simple and can be removed by intelligent image processing to extract the person information, so the outline of the extracted person is accurate and the details of the person are not lost.
  • as a result, the composite picture looks more natural and the user experience is better.
  • An embodiment of the present invention provides an image synthesizing method, which is applied to a terminal, as shown in FIG. 3, and includes:
  • Step 301: Preset the correspondence between keywords and pictures, and perform step 302.
  • the correspondence between the keyword and the picture can be referred to Table 1.
  • Step 302 Receive voice information input by the user, and perform step 303.
  • the basic features of the desired background picture can be spoken into the microphone of the terminal.
  • the terminal determines that the user inputs the voice information when detecting that the microphone receives the sound signal.
  • Step 303 Extract the keyword of the voice information, and perform step 304.
  • a keyword database may be set in advance in the terminal, and the keyword database stores sound characteristics of each keyword, including phonetic symbols, tones, audio, and the like. After receiving the voice information sent by the user, the terminal compares the voice information with each keyword in the keyword database to extract keywords of the voice information.
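The comparison against a keyword database can be sketched in simplified form. Note the substitution: the text above compares sound characteristics (phonetic symbols, tones, audio), whereas this sketch assumes the voice information has already been transcribed to text by some speech recognizer and only matches keywords in that transcript. The name `extract_keywords` is hypothetical.

```python
from typing import Iterable, List


def extract_keywords(transcript: str, keyword_database: Iterable[str]) -> List[str]:
    """Return the database keywords found in the transcript, in order of appearance."""
    text = transcript.lower()
    found = []
    for keyword in keyword_database:
        position = text.find(keyword.lower())
        if position != -1:
            found.append((position, keyword))
    # Sort by where each keyword first occurs in the spoken request.
    return [keyword for _, keyword in sorted(found)]
```

For instance, calling it with the transcript "I want a field of tulips near a windmill" and a database containing "mobile phone", "tulip", and "windmill" yields the two keywords that occur, which could then be looked up in the keyword-to-picture correspondence.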
  • Step 304 Select a picture corresponding to the keyword of the voice information according to the correspondence between the keyword and the picture, and perform step 305.
  • the picture corresponding to “tulip” is picture B, an image showing a tulip field.
  • Step 305 Display a picture corresponding to the keyword of the voice information on the display screen. If the user confirms that the picture meets the requirements, go to step 306. If the user confirms that the picture does not meet the requirements, go to step 311.
  • picture B, the image of the tulip field corresponding to “tulip”, is displayed on the display, and the user is then prompted to confirm.
  • prompt information is displayed, including a “confirm” button and a “cancel” button. If the user thinks that picture B meets the requirements, the user may click the “confirm” button, and the terminal determines that the user considers picture B acceptable; if the user thinks that picture B does not meet the requirements, the user may click the “cancel” button, and the terminal determines that the user considers picture B unacceptable.
  • Step 306 Capture a character image on a solid color background, and perform step 307.
  • a person image can be taken on a pure blue background to ensure that the character does not contain the blue of the background.
  • Step 307 Extract character information from the character image, and perform step 308.
  • the blue screen technology may be used to extract the person information; that is, the difference in chromaticity between the person and the background in the captured person image is used to remove the blue background and obtain the person information.
  • Step 308 Synthesize the character information on the background picture to obtain a composite picture.
  • the background picture acquired and meeting the user's needs is analyzed to obtain an optimal composition scheme, and then the acquired character information is synthesized on the background picture according to an optimal composition scheme to obtain a composite picture.
  • the specific synthesis method is a prior art, and details are not described herein.
  • Step 309 Add weather information to the composite picture, and perform step 310.
  • weather information of the geographic location where the tulip field in the picture B is located may also be acquired, and then the weather information is added on the composite picture.
  • the “cloudy, 24-32 ° C” may be added to the lower right corner of the composite picture.
  • the location information of the geographic location where the tulip field in picture B is located may also be added to the composite picture. Assuming that this location information is "east longitude 4 ° 21 ', north latitude 51 ° 45 '", the text "east longitude 4 ° 21 ', north latitude 51 ° 45 '" may be added to the lower right corner of the composite picture.
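Placing the weather and location text in the lower right corner comes down to building the caption and computing where its top-left corner should go. The sketch below shows that arithmetic only; `build_caption`, `lower_right_anchor`, the `" | "` separator, and the 8-pixel margin are all assumptions made for illustration, and actually drawing the text would need an imaging library.

```python
from typing import Optional, Tuple


def build_caption(weather: Optional[str], location: Optional[str]) -> str:
    """Join whichever of the weather and location strings are available."""
    parts = [part for part in (weather, location) if part]
    return " | ".join(parts)


def lower_right_anchor(image_size: Tuple[int, int],
                       text_size: Tuple[int, int],
                       margin: int = 8) -> Tuple[int, int]:
    """Top-left (x, y) at which to draw text so it sits in the lower right corner."""
    width, height = image_size
    text_width, text_height = text_size
    x = max(width - text_width - margin, 0)
    y = max(height - text_height - margin, 0)
    return (x, y)
```

Clamping at zero keeps the caption on the picture even when the text is wider than the image.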
  • Step 310 Save or share the composite picture, and the process ends.
  • the composite picture may be saved in a memory of the terminal, or the composite picture may be shared, for example, sent to WeChat Moments or to Weibo, which is not limited in the embodiment of the present invention.
  • Step 311 The crawler program is used to capture a picture related to the keyword of the voice information from the network.
  • the crawler program can also be used to grab the relevant picture from the network, and then display the captured picture in turn, so that the user can confirm.
  • the crawler program is prior art and is not described in detail in the embodiments of the present invention.
  • steps 306-310 are continued to complete the synthesis of the picture.
  • the embodiment of the present invention provides an image synthesizing method. Compared with the prior art, since the person image is captured against a solid color background, the background is relatively simple and can be removed by intelligent image processing to extract the person information. The outline of the extracted person is accurate, and the details of the person are not lost. The composite picture is more natural and the user experience is better.
  • an embodiment of the present invention provides an image synthesizing device 40, which is located at a terminal, as shown in FIG. 4, and includes:
  • the photographing unit 401 is configured to photograph a person image on a solid color background.
  • the extracting unit 402 is configured to extract character information from the character image (the person information can be extracted by a channel extraction technique).
  • the first obtaining unit 403 is configured to acquire a background image by using a voice control manner.
  • the synthesizing unit 404 is configured to synthesize the person information on the background picture to obtain a composite picture.
  • because the person image is shot against a solid color background,
  • the background is relatively simple and can be removed by intelligent image processing to extract the person information, so the outline of the extracted person is accurate and the details of the person are not lost.
  • the composite picture is more natural and the user experience is better.
  • the first obtaining unit 403 is specifically configured to: receive voice information input by the user; and acquire the background image according to the voice information.
  • the first obtaining unit 403 is specifically configured to: extract a keyword in the voice information; and acquire the background image in a preset image library according to a correspondence between the keyword and the image.
  • the first acquiring unit 403 is specifically configured to:
  • the first obtaining unit 403 is specifically configured to: extract a keyword in the voice information; and, according to the keyword, use a crawler program to fetch the background image corresponding to the keyword from a network.
  • the first acquiring unit 403 is specifically configured to:
  • the apparatus 40 may further include: a second obtaining unit 405 configured to acquire weather information and/or location information of a geographic location corresponding to the current background image; and an adding unit 406 configured to add the weather information and/or the location information to the composite picture.
  • the adding unit 406 is further configured to save or share the composite picture.
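The division into units can be pictured as a small pipeline object. This is only a structural sketch under stated assumptions: the class name `ImageSynthesizingDevice` and its field names are invented for illustration, each unit is modeled as a callable, and the optional `annotate` step stands in for the second obtaining unit 405 and adding unit 406 together.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class ImageSynthesizingDevice:
    shoot: Callable[[], Any]                  # photographing unit 401
    extract: Callable[[Any], Any]             # extracting unit 402
    acquire_background: Callable[[], Any]     # first obtaining unit 403
    synthesize: Callable[[Any, Any], Any]     # synthesizing unit 404
    annotate: Optional[Callable[[Any, Any], Any]] = None  # units 405 and 406

    def run(self) -> Any:
        """Run the units in order and return the composite picture."""
        person_image = self.shoot()
        person_info = self.extract(person_image)
        background = self.acquire_background()
        composite = self.synthesize(person_info, background)
        if self.annotate is not None:
            composite = self.annotate(composite, background)
        return composite
```

Because the units are injected, the same pipeline can be exercised with stub callables in place of a real camera or microphone.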
  • the extracting unit 402, the first obtaining unit 403, the synthesizing unit 404, the second obtaining unit 405, and the adding unit 406 may each be implemented by a central processing unit (CPU), a micro processor unit (MPU), a digital signal processor (DSP), or a field programmable gate array (FPGA) located in the image synthesizing device 40.
  • the photographing unit 401 is realized by a camera located in the image synthesizing device 40.
  • An embodiment of the present invention provides an image synthesizing apparatus, including: a photographing unit configured to photograph a person image in a solid color background.
  • An extracting unit configured to extract person information from the person image.
  • the first obtaining unit is configured to obtain a background image by using a voice control manner.
  • a synthesizing unit configured to synthesize the character information on the background image to obtain a composite picture.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner, such as: multiple units or components may be combined, or can be integrated into another system, or some features can be ignored or not executed.
  • the coupling, or direct coupling, or communication connection of the various components shown or discussed may be through some interface, device or unit.
  • the indirect coupling or communication connection can be electrical, mechanical or other form.
  • the units described above as separate components may or may not be physically separated, and the components displayed as the unit may or may not be physical units; they may be located in one place or distributed on multiple network units; Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated into one unit;
  • the unit can be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the foregoing program may be stored in a computer readable storage medium, and when executed, the program performs the steps of the foregoing method embodiments; the foregoing storage medium includes various media that can store program code, such as a removable storage device, a read only memory (ROM), a magnetic disk, or an optical disk.
  • the above-described integrated unit of the present invention may be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a standalone product.
  • the technical solution of the embodiments of the present invention, in essence, may be embodied in the form of a software product that is stored in a storage medium and includes a plurality of instructions causing a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes various media that can store program codes, such as a mobile storage device, a ROM, a magnetic disk, or an optical disk.
  • an embodiment of the present invention provides a computer storage medium, where the computer storage medium includes a set of instructions that, when executed, cause at least one processor to perform the image synthesis method described in the embodiments of the present invention.

Abstract

Disclosed in an embodiment of the present invention is an image synthesizing method, the method comprising: photographing a portrait image against a solid color background; extracting information of the portrait subject from the portrait image; acquiring a background image by use of a voice control method; synthesizing information of the portrait subject with the background image and acquiring a synthesized image. Also disclosed in this embodiment of the present invention are an image synthesizing device and computer storage medium.

Description

Image synthesis method, device and computer storage medium
Technical Field
The present invention relates to image processing technology in the field of terminals, and in particular to an image synthesis method, an image synthesis device, and a computer storage medium.
Background
As terminals become increasingly intelligent, more and more applications are available for installation on them, giving terminals functions such as telephone, camera, video recorder, and music player.
Although the existing camera function of a terminal produces good images, it can only capture real-time scenes. If a user wants a photo of a person in a landscape without traveling to the location, the only option is a manual cutout and collage: for example, take a photo of the person, crop the person out with a finger or a stylus, and paste the cropped person onto a landscape photo to obtain the desired picture.
Because the outline of the person is extracted manually, the outline is rough and imprecise and local details are easily lost. The stitched picture therefore looks stiff, the compositing traces are obvious, and the user experience is poor.
Summary of the Invention
To solve the above technical problem, embodiments of the present invention aim to provide an image synthesis method, an image synthesis device, and a computer storage medium.
The technical solutions of the embodiments of the present invention are implemented as follows:
In one aspect, an embodiment of the present invention provides an image synthesis method, including:
capturing a person image against a solid-color background;
extracting person information from the person image;
acquiring a background picture by voice control;
compositing the person information onto the background picture to obtain a composite picture.
Optionally, acquiring the background picture by voice control includes:
receiving voice information input by a user;
acquiring the background picture according to the voice information.
Optionally, acquiring the background picture according to the voice information includes:
extracting a keyword from the voice information;
acquiring the background picture from a preset image library according to a correspondence between keywords and pictures.
Optionally, acquiring the background picture from the preset image library according to the correspondence between keywords and pictures includes:
displaying the picture corresponding to the keyword on a display screen;
receiving a confirmation operation of the user;
in response to the confirmation operation, confirming that the displayed picture corresponding to the keyword is the background picture.
Optionally, acquiring the background picture according to the voice information includes:
extracting a keyword from the voice information;
using a crawler program to fetch, from the network, the background picture corresponding to the keyword.
Optionally, using the crawler program to fetch, from the network, the background picture corresponding to the keyword includes:
fetching, through the crawler program, all pictures related to the keyword from the network;
displaying the fetched pictures on the screen one after another;
receiving a confirmation operation of the user;
in response to the confirmation operation, using the picture confirmed by the user as the background picture.
Optionally, extracting the person information from the person image includes:
extracting the person information by using a channel extraction technique.
Optionally, the solid-color background is a blue background, and extracting the person information by using the channel extraction technique includes:
extracting the person information by using blue screen technology.
Optionally, after the person information is composited onto the background picture to obtain the composite picture, the method further includes:
acquiring current weather information and/or location information of the geographic location corresponding to the background picture;
adding the weather information and/or the location information to the composite picture.
Optionally, the method further includes:
saving or sharing the composite picture.
In another aspect, an embodiment of the present invention provides an image synthesis device, including:
a capturing unit, configured to capture a person image against a solid-color background;
an extracting unit, configured to extract person information from the person image;
a first acquiring unit, configured to acquire a background picture by voice control;
a compositing unit, configured to composite the person information onto the background picture to obtain a composite picture.
Optionally, the first acquiring unit is configured to:
receive voice information input by a user;
acquire the background picture according to the voice information.
Optionally, the first acquiring unit is configured to:
extract a keyword from the voice information;
acquire the background picture from a preset image library according to a correspondence between keywords and pictures.
Optionally, the first acquiring unit is configured to:
display the picture corresponding to the keyword on a display screen;
receive a confirmation operation of the user;
in response to the confirmation operation, confirm that the displayed picture corresponding to the keyword is the background picture.
Optionally, the first acquiring unit is configured to:
extract a keyword from the voice information;
use a crawler program to fetch, from the network, the background picture corresponding to the keyword.
Optionally, the first acquiring unit is configured to:
fetch, through the crawler program, all pictures related to the keyword from the network;
display the fetched pictures on the screen one after another;
receive a confirmation operation of the user;
in response to the confirmation operation, use the picture confirmed by the user as the background picture.
Optionally, the extracting unit is configured to:
extract the person information by using a channel extraction technique.
Optionally, the device further includes:
a second acquiring unit, configured to acquire current weather information and/or location information of the geographic location corresponding to the background picture;
an adding unit, configured to add the weather information and/or the location information to the composite picture.
Optionally, the adding unit is further configured to save or share the composite picture.
In a third aspect, an embodiment of the present invention provides a computer storage medium, where the computer storage medium includes a set of instructions that, when executed, cause at least one processor to perform the image synthesis method described above.
Embodiments of the present invention provide an image synthesis method, an image synthesis device, and a computer storage medium: a person image is captured against a solid-color background; person information is extracted from the person image; a background picture is acquired by voice control; and the person information is composited onto the background picture to obtain a composite picture. Compared with the prior art, because the person image is captured against a solid-color background, the background is relatively simple and can be removed by automated image processing to extract the person information. The extracted outline is therefore finer, details of the person are not lost, the composite picture looks more natural, and the user experience is better.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of the hardware structure of an optional mobile terminal for implementing embodiments of the present invention;
FIG. 2 is a first schematic flowchart of an image synthesis method according to an embodiment of the present invention;
FIG. 3 is a second schematic flowchart of an image synthesis method according to an embodiment of the present invention;
FIG. 4 is a first schematic structural diagram of an image synthesis device according to an embodiment of the present invention;
FIG. 5 is a second schematic structural diagram of an image synthesis device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. It should be understood that the specific embodiments described here are only intended to explain the present invention and not to limit it.
A mobile terminal implementing the embodiments of the present invention will now be described with reference to the drawings. In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the description and have no specific meaning in themselves. Therefore, "module" and "component" may be used interchangeably.
A mobile terminal can be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, smartphones, notebook computers, digital broadcast receivers, personal digital assistants (PDAs), tablet computers (PADs), portable multimedia players (PMPs), and navigation devices, as well as fixed terminals such as digital TVs and desktop computers. In the following, the terminal is assumed to be a mobile terminal. However, those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the configurations according to the embodiments of the present invention can also be applied to fixed terminals.
FIG. 1 is a schematic diagram of the hardware structure of an optional mobile terminal for implementing the embodiments of the present invention. As shown in FIG. 1, the mobile terminal includes:
The A/V input unit 120 is configured to receive audio or video signals. The A/V input unit 120 may include a camera 121 and a microphone 122. The camera 121 processes image data of still pictures or video obtained by an image capture device in a video capture mode or an image capture mode. The processed image frames may be displayed on the display unit 151. Image frames processed by the camera 121 may be stored in the memory 160 (or another storage medium), and two or more cameras 121 may be provided depending on the configuration of the mobile terminal. The microphone 122 may receive sound (audio data) in operating modes such as a phone call mode, a recording mode, and a voice recognition mode, and can process such sound into audio data.
The user input unit 130 may generate key input data according to commands input by the user to control various operations of the mobile terminal. The user input unit 130 allows the user to input various types of information and may include a keyboard, a dome switch, a touch pad (for example, a touch-sensitive component that detects changes in resistance, pressure, capacitance, and the like caused by contact), a scroll wheel, a joystick, and the like. In particular, when the touch pad is layered on the display unit 151, a touch screen is formed.
The output unit 150 is configured to provide output signals (for example, audio signals, video signals, alarm signals, and vibration signals) in a visual, audible, and/or tactile manner. The output unit 150 may include a display unit 151.
The display unit 151 may display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in a phone call mode, the display unit 151 may display a user interface (UI) or graphical user interface (GUI) related to the call or other communication (for example, text messaging or multimedia file downloading). When the mobile terminal 100 is in a video call mode or an image capture mode, the display unit 151 may display captured and/or received images, a UI or GUI showing the video or images and related functions, and the like.
Meanwhile, when the display unit 151 and the touch pad are layered on each other to form a touch screen, the display unit 151 can serve as both an input device and an output device. The display unit 151 may include at least one of a liquid crystal display (LCD), a thin film transistor LCD (TFT-LCD), an organic light emitting diode (OLED) display, a flexible display, a three-dimensional (3D) display, and the like. Some of these displays may be constructed to be transparent so that the user can see through them from the outside; these are called transparent displays, a typical example being a TOLED (transparent organic light emitting diode) display. Depending on the desired implementation, the mobile terminal 100 may include two or more display units (or other display devices); for example, the mobile terminal may include an external display unit (not shown) and an internal display unit (not shown). The touch screen may be configured to detect touch input pressure as well as touch input position and touch input area.
The memory 160 may store software programs for the processing and control operations performed by the controller 180, or may temporarily store data that has been output or is to be output (for example, a phone book, messages, still images, and video). The memory 160 may also store data about the various vibration and audio signals that are output when a touch is applied to the touch screen.
The memory 160 may include at least one type of storage medium, including flash memory, a hard disk, a multimedia card, card-type memory (for example, SD or DX memory), random access memory (RAM), static random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, a magnetic disk, an optical disk, and the like. The mobile terminal 100 may also cooperate with a network storage device that performs the storage function of the memory 160 over a network connection.
The controller 180 generally controls the overall operation of the mobile terminal. For example, the controller 180 performs the control and processing associated with voice calls, data communication, video calls, and the like.
The power supply unit 190 receives external or internal power under the control of the controller 180 and supplies the power required to operate the various elements and components.
The embodiments described here may be implemented in a computer readable medium using, for example, computer software, hardware, or any combination thereof. For a hardware implementation, the embodiments described here may be implemented using at least one of an application specific integrated circuit (ASIC), a digital signal processor (DSP), a digital signal processing device (DSPD), a programmable logic device (PLD), a field programmable gate array (FPGA), a processor, a controller, a microcontroller, a microprocessor, or an electronic unit designed to perform the functions described here; in some cases, such embodiments may be implemented in the controller 180. For a software implementation, embodiments such as procedures or functions may be implemented with separate software modules, each of which performs at least one function or operation. The software code may be implemented by a software application (or program) written in any suitable programming language, stored in the memory 160, and executed by the controller 180.
So far, the mobile terminal has been described in terms of its functions. Below, for brevity, a slide-type mobile terminal, among various types such as folder-type, bar-type, swing-type, and slide-type mobile terminals, is described as an example. The present invention can nevertheless be applied to any type of mobile terminal and is not limited to slide-type mobile terminals.
The mobile terminal 100 shown in FIG. 1 may be configured to operate with wired and wireless communication systems and satellite-based communication systems that transmit data via frames or packets.
Embodiment 1
An embodiment of the present invention provides an image synthesis method applied to a terminal. The terminal may be a mobile phone, a smartphone, a tablet computer, or the like, which is not limited in the embodiments of the present invention. As shown in FIG. 2, the image synthesis method includes:
Step 201: Capture a person image against a solid-color background.
For example, the background color should in principle not be a color that appears on the foreground object. Green and blue are the commonly used background colors, because the natural colors of the human body contain neither, so a green or blue background does not blend with the person. If the person's clothing is greenish, a blue background is used; if the clothing is bluish, a green background is used. Green and blue are also two of the primaries in the three-primary-color system, which makes them convenient to process.
In general, blue backgrounds are used in China, while both green and blue screens are common in Europe and the United States; green screens are especially common there when shooting people, because many Europeans and Americans have blue eyes. To facilitate channel extraction in post-production, several points need attention when shooting a person against a solid-color background. For example, the person must not contain the selected background color; the background color must be uniform and evenly lit, avoiding variations in background shade or lighting that would complicate channel extraction; and a very large background may need to be pieced together from many pieces of cloth or board.
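The clothing-based rule for choosing between a blue and a green background can be sketched as a small helper. This is only an illustration: the function name `choose_background_color` and the idea of comparing the average green and blue channel values of the clothing are assumptions, not something the embodiment specifies.

```python
def choose_background_color(clothing_rgb):
    """Suggest a backdrop color that the clothing does not lean toward.

    clothing_rgb is a hypothetical (r, g, b) average of the clothing, 0-255.
    """
    _, green, blue = clothing_rgb
    # Bluish clothing -> green backdrop; greenish or neutral clothing -> blue.
    return "green" if blue > green else "blue"
```

Defaulting to blue for neutral clothing simply mirrors the statement that blue backgrounds are the usual choice; a real capture flow could also let the user override the suggestion.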
Step 202: Extract person information from the person image.
For example, channel extraction (matte extraction), also known as keying, may be used to extract the person information. Many film and television productions use channel extraction to extract the foreground of pictures shot in a studio against a solid-color background and composite it with footage shot on location, creating more striking visuals.
In practice, blue screen technology is the principal channel extraction method: a person image is shot against a blue background, and the single-color background is then removed based on differences in chroma to obtain the person information. Blue screen technology is therefore formally known as chroma keying. In one embodiment, blue is selected as the keying background. Software commonly used for blue screen work includes After Effects (AE), a video editing and design program developed by Adobe and a professional non-linear editing tool for video post-production compositing.
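A minimal sketch of the chroma-keying idea follows, assuming the image is a nested list of 8-bit `(r, g, b)` tuples. The thresholds `hue_tol`, `min_sat`, and `min_val` and the function names are illustrative assumptions; the embodiment does not give concrete values, and production keyers do considerably more (soft edges, spill suppression).

```python
import colorsys

BLUE_HUE = 2.0 / 3.0  # hue of pure blue on the 0..1 HSV hue circle


def is_background(rgb, hue_tol=0.08, min_sat=0.4, min_val=0.2):
    """Return True if an (r, g, b) pixel looks like the blue backdrop."""
    r, g, b = (channel / 255.0 for channel in rgb)
    hue, sat, val = colorsys.rgb_to_hsv(r, g, b)
    # A pixel is keyed out only if its hue is close to blue and it is
    # saturated and bright enough to be the backdrop rather than a shadow.
    return abs(hue - BLUE_HUE) <= hue_tol and sat >= min_sat and val >= min_val


def extract_matte(pixels):
    """Build a matte: 0 for backdrop pixels, 255 for person pixels."""
    return [[0 if is_background(p) else 255 for p in row] for row in pixels]
```

The resulting matte marks which pixels belong to the person; a hard 0/255 matte is the simplest case, and a graded matte would give smoother outlines.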
Step 203: Acquire a background picture by voice control.
For example, the user may send voice information to the terminal. The voice information includes keywords related to the background picture the user wants, such as the composition of the scene, elements appearing in the background picture, or the name of a scenic spot or historic site. After receiving the voice information, the terminal extracts the keywords related to the background picture and then, according to the keywords, acquires a background picture that meets the user's requirements.
In one embodiment, a correspondence between keywords and pictures may be preset in the terminal during initialization, as shown in Table 1:
Keyword         Picture
Swing           Picture A, which contains a swing
Tulip           Picture B, which contains a tulip field
Mobile phone    Picture C, which contains a smartphone
Table 1
For example, when the keyword that the terminal extracts from the user's voice information is "mobile phone", the terminal looks up the correspondence between keywords and pictures and obtains picture C, which contains a smartphone. The terminal then displays picture C on the screen for the user to confirm. If the user confirms that picture C meets the requirements, picture C is used as the background picture.
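The keyword lookup followed by user confirmation might look like the sketch below. The dictionary contents mirror Table 1, while the identifiers `picture_A`, `picture_B`, `picture_C`, the function name, and the `confirm` callback (standing in for showing the picture and reading the user's response) are hypothetical.

```python
# Preset correspondence between keywords and pictures (mirrors Table 1).
KEYWORD_TO_PICTURE = {
    "swing": "picture_A",
    "tulip": "picture_B",
    "mobile phone": "picture_C",
}


def get_background_from_library(keyword, confirm):
    """Look up the picture for a keyword and return it if the user confirms.

    confirm is a callable that displays the picture and returns True or False.
    Returns None when the keyword is unknown or the user rejects the picture.
    """
    picture = KEYWORD_TO_PICTURE.get(keyword.strip().lower())
    if picture is None:
        return None
    return picture if confirm(picture) else None
```

For instance, `get_background_from_library("mobile phone", confirm)` yields the identifier for picture C only when `confirm` returns `True`.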
In one embodiment, a crawler program may also be used to fetch keyword-related pictures from the network for the user to choose from. For example, when the keyword that the terminal extracts from the user's voice information is "mobile phone", the terminal uses the crawler program to fetch all pictures related to "mobile phone" from the network and then displays the fetched pictures on the screen one after another for the user to confirm. When the user confirms that a picture meets the requirements, the picture selected by the user is used as the background picture.
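The "display one after another until the user confirms" behavior can be sketched independently of any particular crawler. Here `candidates` stands in for whatever the crawler returns and `confirm` for the display-and-confirm interaction; both names are assumptions, and no network access is performed.

```python
def pick_from_candidates(candidates, confirm):
    """Show candidate pictures in order and return the first one confirmed.

    candidates is any iterable of pictures (for example, crawler results);
    confirm is a callable that displays one picture and returns True or False.
    Returns None if the user confirms none of them.
    """
    for picture in candidates:
        if confirm(picture):
            return picture
    return None
```

Because the loop stops at the first confirmed picture, `candidates` can be a lazy generator, so pictures that are never shown need not be downloaded.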
Step 204: Composite the person information onto the background picture to obtain a composite picture.
For example, the acquired background picture that meets the user's requirements is analyzed to determine the best compositing scheme, and the extracted person information is then composited onto the background picture according to that scheme to obtain the composite picture.
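The blending part of the compositing step can be sketched as follows, assuming the person image, its matte, and the background picture are same-sized nested lists and that the person is placed without scaling or repositioning. How the "best compositing scheme" is chosen is not described, so this sketch covers only the per-pixel blend; the function name is hypothetical.

```python
def composite(person_pixels, matte, background_pixels):
    """Blend person pixels over the background using a 0-255 matte.

    A matte value of 255 keeps the person pixel, 0 keeps the background
    pixel, and values in between mix the two proportionally.
    """
    result = []
    for person_row, matte_row, bg_row in zip(person_pixels, matte, background_pixels):
        out_row = []
        for person, alpha, bg in zip(person_row, matte_row, bg_row):
            a = alpha / 255.0
            out_row.append(tuple(round(a * p + (1.0 - a) * b) for p, b in zip(person, bg)))
        result.append(out_row)
    return result
```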
Optionally, after the composite picture is obtained, the current weather information for the geographic location corresponding to the background picture may also be acquired, and the weather information and/or location information may then be added to the composite picture. For example, if the background picture shows the Badaling Great Wall, the current weather at Badaling can be acquired after the composite picture is obtained and added to the composite picture, giving viewers the impression that the person in the picture is actually at the Badaling Great Wall. In practice, location information may also be added to the composite picture; the location information may be that of the geographic location corresponding to the background picture. For example, the location of the Badaling Great Wall may be added to the composite picture, expressed either as latitude and longitude or in Chinese characters, which is not limited in the embodiments of the present invention.
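One way to handle the "and/or" combination is to assemble a caption from whichever fields are available and leave the drawing of that caption to the rendering layer. The function name, the separator, and the sample strings in the usage note are illustrative assumptions; how the weather is looked up is not specified.

```python
def build_caption(weather=None, location=None):
    """Join the available location and weather strings into one caption.

    Either argument may be None or empty, matching the "and/or" wording.
    Returns an empty string when neither is provided.
    """
    parts = [text for text in (location, weather) if text]
    return " | ".join(parts)
```

For example, `build_caption("Sunny", "Badaling Great Wall")` gives `"Badaling Great Wall | Sunny"`, and passing only one of the two yields that value alone.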
Optionally, after the composite picture is obtained, it may be further beautified and edited, for example by applying a filter or by adjusting its contrast or brightness.
This embodiment of the present invention provides an image synthesis method, including: capturing a person image against a solid-color background; extracting person information from the person image; acquiring a background picture by voice control; and compositing the person information onto the background picture to obtain a composite picture. Compared with the prior art, because the person image is captured against a solid-color background, the background is relatively simple and can be removed by automated image processing to extract the person information. The extracted outline is therefore finer, details of the person are not lost, the composite picture looks more natural, and the user experience is better.
Embodiment 2
An embodiment of the present invention provides an image synthesizing method applied to a terminal. As shown in FIG. 3, the method includes:
Step 301: Preset the correspondence between keywords and pictures, then perform step 302.
For example, the correspondence between keywords and pictures may be as shown in Table 1.
Step 302: Receive voice information input by the user, then perform step 303.
For example, when the user needs to select a background picture, the user may speak the basic features of the desired background picture into the microphone of the terminal. When the terminal detects that the microphone has received a sound signal, it determines that the user has input voice information.
Step 303: Extract the keyword from the voice information, then perform step 304.
For example, a keyword database may be set up in the terminal in advance. The keyword database stores the sound features of each keyword, including its phonetic transcription, tone, and audio. After receiving the voice information from the user, the terminal compares the voice information with each keyword in the keyword database and extracts the keyword of the voice information.
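The comparison in step 303 can be illustrated with a simplified sketch. The embodiment compares sound features; the code below instead assumes the voice information has already been transcribed to text by some speech recognizer, and simply searches that text for database keywords. The function name `extract_keyword` and the longest-match-first rule are assumptions made for illustration.

```python
def extract_keyword(transcript, keyword_database):
    """Return the first database keyword found in the transcript, or None.

    Assumption: `transcript` is text produced by a separate speech recognizer.
    Longer keywords are checked first so that a specific keyword wins over
    a shorter keyword contained in it.
    """
    text = transcript.lower()
    for keyword in sorted(keyword_database, key=len, reverse=True):
        if keyword.lower() in text:
            return keyword
    return None
```

Returning `None` when nothing matches lets the caller decide how to handle unrecognized input, for example by asking the user to speak again.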
Step 304: According to the correspondence between keywords and pictures, select the picture corresponding to the keyword of the voice information, then perform step 305.
For example, suppose the keyword that the terminal extracts from the voice information is "tulip". According to the correspondence between keywords and pictures shown in Table 1, the picture corresponding to "tulip" is picture B, which shows a tulip field.
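The lookup in step 304 amounts to a keyed table. In the sketch below, only the pairing of "tulip" with picture B comes from the text; the identifier string `"picture_B"` and the names `KEYWORD_TO_PICTURE` and `select_picture` are hypothetical.

```python
# Correspondence between keywords and pictures preset in step 301.
# Only the "tulip" -> picture B pair is taken from the text; the
# identifier string itself is a hypothetical placeholder.
KEYWORD_TO_PICTURE = {"tulip": "picture_B"}


def select_picture(keyword, mapping=KEYWORD_TO_PICTURE):
    """Return the picture for the keyword, or None when no entry exists."""
    return mapping.get(keyword)
```

A missing entry yields `None`, which corresponds to the case where the preset correspondence cannot supply a suitable background picture.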
Step 305: Display the picture corresponding to the keyword of the voice information on the display screen. If the user confirms that the picture meets the requirements, perform step 306; if the user indicates that the picture does not meet the requirements, perform step 311.
For example, picture B, which corresponds to "tulip" and shows a tulip field, is displayed on the display screen, and the user is then prompted to confirm. For instance, prompt information containing a "Confirm" button and a "Cancel" button is displayed together with picture B. If the user considers that picture B meets the requirements, the user may tap the "Confirm" button, and the terminal then determines that picture B is acceptable to the user; if the user considers that picture B does not meet the requirements, the user may tap the "Cancel" button, and the terminal then determines that picture B is not acceptable to the user.
Step 306: Capture a person image against a solid-color background, then perform step 307.
For example, the person image may be captured against a pure blue background, making sure that the person contains none of the blue of the background.
Step 307: Extract person information from the person image, then perform step 308.
For example, the person information may be extracted with blue-screen technology: the difference in chromaticity between the person and the background in the captured person image is used to remove the blue background, leaving the person information.
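A very simple form of the blue-screen extraction described above can be sketched as follows. The patent does not specify a keying rule; the rule used here (blue exceeding both red and green by a fixed margin), the margin of 40, and the function names are assumptions for illustration, and the image is assumed to be a row-major list of RGB tuples.

```python
def is_background(pixel, dominance=40):
    """Treat a pixel as blue backdrop when blue exceeds red and green by a margin."""
    r, g, b = pixel
    return b - max(r, g) > dominance


def extract_person(image, dominance=40):
    """Return RGBA rows in which backdrop pixels become fully transparent.

    `image` is a row-major list of (r, g, b) tuples; non-backdrop pixels
    keep their color and receive full opacity (alpha 255).
    """
    return [
        [(r, g, b, 0) if is_background((r, g, b), dominance) else (r, g, b, 255)
         for (r, g, b) in row]
        for row in image
    ]
```

This hard threshold also shows why the person should not contain the backdrop's blue: any such pixel would be keyed out along with the background.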
Step 308: Synthesize the person information onto the background picture to obtain a composite picture.
For example, the acquired background picture that meets the user's requirements is analyzed to determine the best synthesis scheme, and the extracted person information is then synthesized onto the background picture according to that scheme to obtain the composite picture. The specific synthesis method is prior art and is not described in detail in the embodiments of the present invention.
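Since the text leaves the synthesis method to the prior art, the following is only one common possibility, sketched under assumptions: a standard "source-over" alpha blend of an RGBA cut-out onto an RGB background at a given offset. The function name `composite` and the list-of-tuples image representation are hypothetical, and the choice of offset stands in for whatever "best synthesis scheme" the terminal determines.

```python
def composite(person_rgba, background_rgb, top=0, left=0):
    """Paste an RGBA cut-out onto an RGB background at offset (top, left).

    Returns a new image; cut-out pixels that fall outside the background
    are skipped. Alpha 0 keeps the background, alpha 255 keeps the person.
    """
    result = [list(row) for row in background_rgb]
    for y, row in enumerate(person_rgba):
        for x, (r, g, b, a) in enumerate(row):
            ty, tx = top + y, left + x
            if 0 <= ty < len(result) and 0 <= tx < len(result[ty]):
                br, bg, bb = result[ty][tx]
                alpha = a / 255
                result[ty][tx] = (
                    round(r * alpha + br * (1 - alpha)),
                    round(g * alpha + bg * (1 - alpha)),
                    round(b * alpha + bb * (1 - alpha)),
                )
    return result
```

Intermediate alpha values, for example along a softened contour, blend the person and background colors proportionally.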
Step 309: Add weather information to the composite picture, then perform step 310.
For example, after the composite picture is obtained, the weather information of the geographic location of the tulip field in picture B may be acquired and then added to the composite picture.
Suppose the current weather information for the geographic location of the tulip field is "sunny to partly cloudy, 24~32℃"; the text "sunny to partly cloudy, 24~32℃" may then be added at the lower right corner of the composite picture.
In practice, the location information of the geographic location of the tulip field in picture B may also be added to the composite picture. Suppose the location information of the tulip field is "longitude 4°21' E, latitude 51°45' N"; the text "longitude 4°21' E, latitude 51°45' N" may then be added at the lower right corner of the composite picture.
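Placing the weather and/or location text in the lower right corner reduces to a small coordinate calculation. The sketch below assumes pixel coordinates with the origin at the top-left corner and a caption whose rendered size is already known; the function names and the 10-pixel margin are hypothetical.

```python
def caption_origin(image_size, text_size, margin=10):
    """Top-left coordinate at which a caption sits in the lower right corner.

    Sizes are (width, height) in pixels; the result is clamped at 0 so an
    oversized caption still starts inside the image.
    """
    image_w, image_h = image_size
    text_w, text_h = text_size
    x = max(0, image_w - text_w - margin)
    y = max(0, image_h - text_h - margin)
    return x, y


def build_caption(weather=None, location=None):
    """Join whichever of the weather and location strings are present."""
    return "\n".join(part for part in (weather, location) if part)
```

`build_caption` mirrors the "and/or" wording: either string may be omitted and the caption contains only what was supplied.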
Step 310: Save or share the composite picture. The process ends.
Optionally, once the composite picture is complete, it may be saved in the memory of the terminal or shared, for example posted to WeChat Moments or to Weibo; the embodiments of the present invention do not limit this.
Step 311: Use a crawler program to fetch pictures related to the keyword of the voice information from the network.
For example, when a background picture that meets the user's requirements cannot be obtained through the correspondence between keywords and pictures, a crawler program may be used to fetch related pictures from the network, and the fetched pictures are then displayed one after another for the user to confirm. The crawler program is prior art and is not described in detail here.
After the user confirms the background picture, steps 306 to 310 are performed to complete the synthesis of the picture.
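The branching between steps 304, 305, and 311 can be summarized in a short sketch. The crawler and the user interface are passed in as callables because the text treats them as prior art; the function name `choose_background` and its parameters are hypothetical.

```python
def choose_background(keyword, library, crawl, confirm):
    """Pick a background picture for the keyword.

    library: dict mapping keywords to preset pictures (steps 304-305).
    crawl:   callable returning an iterable of pictures for a keyword (step 311).
    confirm: callable returning True when the user accepts a picture.
    Returns the accepted picture, or None if the user rejects every candidate.
    """
    preset = library.get(keyword)
    if preset is not None and confirm(preset):
        return preset
    # Fall back to pictures fetched from the network, shown one by one.
    for candidate in crawl(keyword):
        if confirm(candidate):
            return candidate
    return None
```

For instance, with `confirm` wired to the "Confirm" and "Cancel" buttons, a rejected preset picture leads directly to the crawled candidates.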
It should be noted that the order of the steps of the image synthesizing method provided by the embodiments of the present invention may be adjusted as appropriate, and steps may be added or removed as the situation requires. Any variation that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention and is therefore not described further.
An embodiment of the present invention provides an image synthesizing method. Compared with the prior art, because the person image is captured against a solid-color background, the background is simple and can be removed by intelligent image processing to extract the person information. The contour of the extracted person information is therefore fine, details of the person are not lost, the composite picture looks natural, and the user experience is better.
Embodiment 3
To implement the method of the embodiments of the present invention, an embodiment of the present invention provides an image synthesizing apparatus 40 located in a terminal. As shown in FIG. 4, the apparatus includes:
A photographing unit 401, configured to capture a person image against a solid-color background.
An extracting unit 402, configured to extract person information from the person image (the person information may be extracted by a channel extraction technique).
A first acquiring unit 403, configured to acquire a background picture by voice control.
A synthesizing unit 404, configured to synthesize the person information onto the background picture to obtain a composite picture.
In this way, because the person image is captured against a solid-color background, the background is simple and can be removed by intelligent image processing to extract the person information. The contour of the extracted person information is therefore fine, details of the person are not lost, the composite picture looks natural, and the user experience is better.
Optionally, the first acquiring unit 403 is specifically configured to: receive voice information input by the user; and acquire the background picture according to the voice information.
Optionally, the first acquiring unit 403 is specifically configured to: extract a keyword from the voice information; and acquire the background picture from a preset image library according to the correspondence between keywords and pictures.
In an embodiment, the first acquiring unit 403 is specifically configured to:
display the picture corresponding to the keyword on the display screen;
receive a confirmation operation of the user;
in response to the confirmation operation, determine that the displayed picture corresponding to the keyword is the background picture.
Optionally, the first acquiring unit 403 is specifically configured to: extract a keyword from the voice information; and, according to the keyword, use a crawler program to fetch the background picture corresponding to the keyword from the network.
In an embodiment, the first acquiring unit 403 is specifically configured to:
fetch all pictures related to the keyword from the network by using the crawler program;
display the fetched pictures on the screen one after another;
receive a confirmation operation of the user;
in response to the confirmation operation, use the picture confirmed by the user as the background picture.
In an embodiment, as shown in FIG. 5, the apparatus 40 may further include: a second acquiring unit 405, configured to acquire current weather information and/or location information of the geographic location corresponding to the background picture; and an adding unit 406, configured to add the weather information and/or the location information to the composite picture.
The adding unit 406 is further configured to save or share the composite picture.
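How units 401 to 406 cooperate can be sketched as a small class that holds each unit as a callable. This is purely an illustrative arrangement, not the structure of the apparatus in FIG. 4 or FIG. 5: the class name `ImageSynthesizer`, its field names, and the order of calls inside `run` are assumptions.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class ImageSynthesizer:
    """Hypothetical wiring of units 401-406 as plain callables."""
    shoot: Callable[[], Any]                               # photographing unit 401
    extract: Callable[[Any], Any]                          # extracting unit 402
    get_background: Callable[[], Any]                      # first acquiring unit 403
    synthesize: Callable[[Any, Any], Any]                  # synthesizing unit 404
    get_info: Optional[Callable[[Any], Any]] = None        # second acquiring unit 405
    add_info: Optional[Callable[[Any, Any], Any]] = None   # adding unit 406

    def run(self):
        """Acquire the background, capture and extract the person, then merge."""
        background = self.get_background()
        person = self.extract(self.shoot())
        picture = self.synthesize(person, background)
        # Units 405 and 406 are optional, as in the embodiment of FIG. 5.
        if self.get_info and self.add_info:
            picture = self.add_info(picture, self.get_info(background))
        return picture
```

Keeping each unit behind a callable reflects the statement that the division into units is logical: several units could be backed by the same processor or merged without changing the flow.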
It should be noted, first, that a person skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the apparatus and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
Second, the extracting unit 402, the first acquiring unit 403, the synthesizing unit 404, the second acquiring unit 405, and the adding unit 406 may each be implemented by a central processing unit (CPU), a microprocessor unit (MPU), a digital signal processor (DSP), a field programmable gate array (FPGA), or the like located in the image synthesizing apparatus 40. The photographing unit 401 is implemented by a camera located in the image synthesizing apparatus 40.
An embodiment of the present invention provides an image synthesizing apparatus, including: a photographing unit, configured to capture a person image against a solid-color background; an extracting unit, configured to extract person information from the person image; a first acquiring unit, configured to acquire a background picture by voice control; and a synthesizing unit, configured to synthesize the person information onto the background picture to obtain a composite picture. Compared with the prior art, because the person image is captured against a solid-color background, the background is simple and can be removed by intelligent image processing to extract the person information. The contour of the extracted person information is therefore fine, details of the person are not lost, the composite picture looks natural, and the user experience is better.
It should be understood that references throughout the specification to "one embodiment" or "an embodiment" mean that a particular feature, structure, or characteristic related to the embodiment is included in at least one embodiment of the present invention. Therefore, occurrences of "in one embodiment" or "in an embodiment" throughout the specification do not necessarily refer to the same embodiment. Furthermore, these particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. It should also be understood that, in the various embodiments of the present invention, the sequence numbers of the above processes do not imply an order of execution; the order of execution of the processes should be determined by their functions and internal logic, and the numbering should not limit the implementation of the embodiments of the present invention in any way. The sequence numbers of the above embodiments are for description only and do not indicate that one embodiment is better than another.
It should be noted that, in this document, the terms "include", "comprise", and any other variants thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that includes the element.
In the several embodiments provided in the present application, it should be understood that the disclosed device and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function; in an actual implementation there may be other ways of dividing, for example: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the coupling, direct coupling, or communication connection between the components shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or of another form.
The units described above as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may all be integrated into one processing unit, or each unit may stand alone as a separate unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
A person of ordinary skill in the art will understand that all or some of the steps of the foregoing method embodiments may be carried out by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the foregoing method embodiments. The storage medium includes any medium that can store program code, such as a removable storage device, a read-only memory (ROM), a magnetic disk, or an optical disc.
Alternatively, if the integrated unit of the present invention is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium. Based on this understanding, the technical solutions of the embodiments of the present invention, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a removable storage device, a ROM, a magnetic disk, or an optical disc.
On this basis, an embodiment of the present invention provides a computer storage medium. The computer storage medium includes a set of instructions that, when executed, cause at least one processor to perform the image synthesizing method described in the embodiments of the present invention.
The above are merely preferred embodiments of the present invention and are not intended to limit the protection scope of the present invention.

Claims (20)

  1. An image synthesizing method, comprising:
    capturing a person image against a solid-color background;
    extracting person information from the person image;
    acquiring a background picture by voice control;
    synthesizing the person information onto the background picture to obtain a composite picture.
  2. The method according to claim 1, wherein acquiring the background picture by voice control comprises:
    receiving voice information input by a user;
    acquiring the background picture according to the voice information.
  3. The method according to claim 2, wherein acquiring the background picture according to the voice information comprises:
    extracting a keyword from the voice information;
    acquiring the background picture from a preset image library according to a correspondence between keywords and pictures.
  4. The method according to claim 3, wherein acquiring the background picture from the preset image library according to the correspondence between keywords and pictures comprises:
    displaying the picture corresponding to the keyword on a display screen;
    receiving a confirmation operation of the user;
    in response to the confirmation operation, determining that the displayed picture corresponding to the keyword is the background picture.
  5. The method according to claim 2, wherein acquiring the background picture according to the voice information comprises:
    extracting a keyword from the voice information;
    fetching, according to the keyword, the background picture corresponding to the keyword from a network by using a crawler program.
  6. The method according to claim 5, wherein fetching the background picture corresponding to the keyword from the network by using the crawler program comprises:
    fetching all pictures related to the keyword from the network by using the crawler program;
    displaying the fetched pictures on a screen one after another;
    receiving a confirmation operation of the user;
    in response to the confirmation operation, using the picture confirmed by the user as the background picture.
  7. The method according to claim 1, wherein extracting the person information from the person image comprises:
    extracting the person information by using a channel extraction technique.
  8. The method according to claim 7, wherein the solid-color background is a blue background, and extracting the person information by using the channel extraction technique comprises:
    extracting the person information by using blue-screen technology.
  9. The method according to any one of claims 1 to 8, wherein after synthesizing the person information onto the background picture to obtain the composite picture, the method further comprises:
    acquiring current weather information and/or location information of the geographic location corresponding to the background picture;
    adding the weather information and/or the location information to the composite picture.
  10. The method according to claim 9, wherein the method further comprises:
    saving or sharing the composite picture.
  11. An image synthesizing apparatus, comprising:
    a photographing unit, configured to capture a person image against a solid-color background;
    an extracting unit, configured to extract person information from the person image;
    a first acquiring unit, configured to acquire a background picture by voice control;
    a synthesizing unit, configured to synthesize the person information onto the background picture to obtain a composite picture.
  12. The apparatus according to claim 11, wherein the first acquiring unit is configured to:
    receive voice information input by a user;
    acquire the background picture according to the voice information.
  13. The apparatus according to claim 12, wherein the first acquiring unit is configured to:
    extract a keyword from the voice information;
    acquire the background picture from a preset image library according to a correspondence between keywords and pictures.
  14. The apparatus according to claim 13, wherein the first acquiring unit is configured to:
    display the picture corresponding to the keyword on a display screen;
    receive a confirmation operation of the user;
    in response to the confirmation operation, determine that the displayed picture corresponding to the keyword is the background picture.
  15. The apparatus according to claim 12, wherein the first acquiring unit is configured to:
    extract a keyword from the voice information;
    fetch, according to the keyword, the background picture corresponding to the keyword from a network by using a crawler program.
  16. The apparatus according to claim 15, wherein the first acquiring unit is configured to:
    fetch all pictures related to the keyword from the network by using the crawler program;
    display the fetched pictures on a screen one after another;
    receive a confirmation operation of the user;
    in response to the confirmation operation, use the picture confirmed by the user as the background picture.
  17. The apparatus according to claim 11, wherein the extracting unit is configured to:
    extract the person information by using a channel extraction technique.
  18. The apparatus according to any one of claims 11 to 17, wherein the apparatus further comprises:
    a second acquiring unit, configured to acquire current weather information and/or location information of the geographic location corresponding to the background picture;
    an adding unit, configured to add the weather information and/or the location information to the composite picture.
  19. The apparatus according to claim 18, wherein the adding unit is further configured to save or share the composite picture.
  20. A computer storage medium, comprising a set of instructions that, when executed, cause at least one processor to perform the image synthesizing method according to any one of claims 1 to 10.
PCT/CN2016/098207 2015-12-28 2016-09-06 Image synthesizing method, device and computer storage medium WO2017113873A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201511005225.7A CN105657254B (en) 2015-12-28 2015-12-28 A kind of image composition method and device
CN201511005225.7 2015-12-28

Publications (1)

Publication Number Publication Date
WO2017113873A1 true WO2017113873A1 (en) 2017-07-06

Family

ID=56478205

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/098207 WO2017113873A1 (en) 2015-12-28 2016-09-06 Image synthesizing method, device and computer storage medium

Country Status (2)

Country Link
CN (1) CN105657254B (en)
WO (1) WO2017113873A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109191372A (en) * 2018-09-13 2019-01-11 上海宇佑船舶科技有限公司 A kind of plane display mirror for capableing of photo synthesis
CN111932455A (en) * 2020-07-30 2020-11-13 深圳市富途网络科技有限公司 Information sharing method and related product
CN113793288A (en) * 2021-08-26 2021-12-14 广州微咔世纪信息科技有限公司 Virtual character co-shooting method and device and computer readable storage medium

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105657254B (en) * 2015-12-28 2019-10-29 努比亚技术有限公司 A kind of image composition method and device
CN106547423A (en) * 2016-10-12 2017-03-29 百度在线网络技术(北京)有限公司 A kind of method and apparatus for updating background picture
CN107707836A (en) * 2017-09-11 2018-02-16 广东欧珀移动通信有限公司 Image processing method and device, electronic installation and computer-readable recording medium
CN107613227A (en) * 2017-09-11 2018-01-19 广东欧珀移动通信有限公司 Image processing method and device, electronic installation and computer-readable recording medium
CN108198159A (en) * 2017-12-28 2018-06-22 努比亚技术有限公司 A kind of image processing method, mobile terminal and computer readable storage medium
CN111475664B (en) * 2019-01-24 2023-06-09 阿里巴巴集团控股有限公司 Object display method and device and electronic equipment
CN110784739A (en) * 2019-10-25 2020-02-11 稿定(厦门)科技有限公司 Video synthesis method and device based on AE
CN111210450B (en) * 2019-12-25 2022-08-09 北京东宇宏达科技有限公司 Method and system for processing infrared image of sea-sky background
CN112802049B (en) * 2021-03-04 2022-10-11 山东大学 Method and system for constructing household article detection data set

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1801888A (en) * 2005-01-05 2006-07-12 英华达(上海)电子有限公司 Method for realizing picture background transformation in digital camera
CN2901726Y (en) * 2006-01-17 2007-05-16 上海超澜数码科技有限公司 Virtual travel device
CN102662961A (en) * 2012-03-08 2012-09-12 北京百舜华年文化传播有限公司 Method, apparatus and terminal unit for matching semantics with image
CN103475826A (en) * 2013-09-27 2013-12-25 深圳市中视典数字科技有限公司 Video matting and synthesis method
CN104584527A (en) * 2012-08-05 2015-04-29 诚研科技股份有限公司 Image capture device and method for image processing by voice recognition
CN104703043A (en) * 2015-03-26 2015-06-10 努比亚技术有限公司 Video special effect adding method and device
CN105120189A (en) * 2015-08-31 2015-12-02 河海大学常州校区 Weather forecast program direction method based on Kinect
CN105657254A (en) * 2015-12-28 2016-06-08 努比亚技术有限公司 Image synthesizing method and device
CN105893419A (en) * 2015-11-30 2016-08-24 乐视致新电子科技(天津)有限公司 Generation device, device and equipment of multimedia photo, and mobile phone

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102013331B1 (en) * 2013-02-23 2019-10-21 삼성전자 주식회사 Terminal device and method for synthesizing a dual image in device having a dual camera
CN103955485A (en) * 2014-04-11 2014-07-30 王玉娇 Server, system and related method capable of realizing real-time electronic map
CN105094760B (en) * 2014-04-28 2019-10-29 小米科技有限责任公司 A kind of picture indicia method and device
CN104483809A (en) * 2014-12-30 2015-04-01 杨守强 Instant stereograph photographing method and system
CN105100491B (en) * 2015-08-11 2018-06-01 努比亚技术有限公司 A kind of apparatus and method for handling photo
CN105185222B (en) * 2015-09-24 2018-03-06 百度在线网络技术(北京)有限公司 A kind of map renders methods of exhibiting and device

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109191372A (en) * 2018-09-13 2019-01-11 上海宇佑船舶科技有限公司 A kind of plane display mirror capable of photo synthesis
CN111932455A (en) * 2020-07-30 2020-11-13 深圳市富途网络科技有限公司 Information sharing method and related product
CN111932455B (en) * 2020-07-30 2024-04-19 深圳市富途网络科技有限公司 Information sharing method and related product
CN113793288A (en) * 2021-08-26 2021-12-14 广州微咔世纪信息科技有限公司 Virtual character co-shooting method and device and computer readable storage medium

Also Published As

Publication number Publication date
CN105657254B (en) 2019-10-29
CN105657254A (en) 2016-06-08

Similar Documents

Publication Publication Date Title
WO2017113873A1 (en) Image synthesizing method, device and computer storage medium
WO2017071559A1 (en) Image processing apparatus and method
JP2018500611A (en) Image processing method and apparatus
CN106034206B (en) Electronic device and image display method
US10489873B2 (en) Embedding digital content within a digital photograph during capture of the digital photograph
WO2017114048A1 (en) Mobile terminal and method for identifying contact
US20220256099A1 (en) Method for processing video, terminal, and storage medium
CN103927165A (en) Wallpaper picture processing method and device
WO2022252660A1 (en) Video capturing method and electronic device
CN104935810A (en) Photographing guiding method and device
WO2017088609A1 (en) Image denoising apparatus and method
WO2017080084A1 (en) Font addition method and apparatus
WO2016090831A1 (en) Page display method and device, and electronic equipment
WO2016101592A1 (en) Method and device for providing photograph image
CN105744170A (en) Picture photographing device and method
CN103699621A (en) Method for recording graphic and text information on materials recorded by mobile device
WO2018032674A1 (en) Color gamut mapping method and device
US20160050387A1 (en) Image recording device and image recording method
CN109756783B (en) Poster generation method and device
CN105959588A (en) Mobile terminal, light-painted photograph shooting device and method
KR102138835B1 (en) Apparatus and method for providing information exposure protecting image
US20190087926A1 (en) Embedding digital content within a digital photograph during capture of the digital photograph
CN112396675A (en) Image processing method, device and storage medium
WO2017107605A1 (en) Image detail processing method, device, terminal and storage medium
US20210377454A1 (en) Capturing method and device

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 16880666; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 16880666; Country of ref document: EP; Kind code of ref document: A1)