WO2017113873A1

WO2017113873A1 - Image synthesizing method, device and computer storage medium

Info

Publication number: WO2017113873A1
Application number: PCT/CN2016/098207
Authority: WO
Inventors: 李乐义
Original assignee: 努比亚技术有限公司
Priority date: 2015-12-28
Filing date: 2016-09-06
Publication date: 2017-07-06
Also published as: CN105657254B; CN105657254A

Abstract

Disclosed in an embodiment of the present invention is an image synthesizing method, the method comprising: photographing a portrait image against a solid color background; extracting information of the portrait subject from the portrait image; acquiring a background image by use of a voice control method; synthesizing information of the portrait subject with the background image and acquiring a synthesized image. Also disclosed in this embodiment of the present invention are an image synthesizing device and computer storage medium.

Description

Image synthesis method, device and computer storage medium

Technical field

The present invention relates to image processing technologies in the field of terminals, and in particular, to an image synthesis method, apparatus, and computer storage medium.

Background technique

As the degree of terminal intelligence becomes higher and higher, more and more applications are applied to the terminal, and the terminal has various functions such as a telephone, a camera, a video recorder, and a music player.

Although the image taken by the camera function of the terminal has a good effect, only the shooting of the real-time scene can be realized. If the user wants a picture of a character and does not want to be photographed in an immersive way, he or she can only use a manual map to make a puzzle, for example, taking a picture of a character and then cutting the person with the user's finger or a touch pen. The cut characters are stitched into the landscape to get the pictures that the user needs.

Since the extraction of the outline of the character is manually completed, the outline is rough, the precision is not high, and the local details are easily lost, resulting in the picture being spliced to be too rigid, and the synthetic trace is more obvious, resulting in poor user experience.

Summary of the invention

In order to solve the above technical problem, embodiments of the present invention are expected to provide an image synthesizing method, apparatus, and computer storage medium.

The technical solution of the embodiment of the present invention is implemented as follows:

In one aspect, an embodiment of the present invention provides an image synthesizing method, including:

Shooting images of people on a solid background;

Extracting character information from the character image;

Acquire the background image by voice control;

The person information is synthesized on the background picture to obtain a composite picture.

Optionally, the acquiring the background image by using the voice control method includes:

Receiving voice information input by the user;

Obtaining the background picture according to the voice information.

Optionally, the acquiring the background image according to the voice information includes:

Extracting keywords in the voice information;

The background image is acquired in a preset image library according to the correspondence between the keyword and the picture.

Optionally, the acquiring the background image in the preset image library according to the correspondence between the keyword and the image, including:

Displaying a picture corresponding to the keyword on the display screen;

Receiving a confirmation operation of the user;

In response to the confirming operation, it is confirmed that the picture corresponding to the displayed keyword is the background picture.

Extracting keywords in the voice information;

According to the keyword, the background image corresponding to the keyword is captured from the network by using a crawler program.

Optionally, the capturing, by the crawler, the background image corresponding to the keyword from the network includes:

Grab all pictures related to the keyword from the network through a crawler;

Display the captured image in turn on the screen;

Receiving the user's confirmation operation;

In response to the confirming operation, the picture confirmed by the user is taken as the background picture.

Optionally, the extracting the character information from the character image includes:

The person information is extracted by using a channel extraction technique.

Optionally, the solid color background is a blue background; and the extracting the character information by using a channel extraction technology includes:

The character information is extracted using a blue screen technology.

Optionally, after the synthesizing the character information on the background image to obtain a composite image, the method further includes:

Obtaining weather information and/or location information of a geographic location corresponding to the current background image;

The weather information and/or the location information is added to the composite picture.

Optionally, the method further includes:

Save or share the composite picture.

In another aspect, an embodiment of the present invention provides an image synthesizing apparatus, including:

a shooting unit configured to capture a portrait of a person on a solid background;

An extracting unit configured to extract character information from the character image;

The first obtaining unit is configured to obtain a background image by using a voice control manner;

And a synthesizing unit configured to synthesize the character information on the background image to obtain a composite picture.

Optionally, the first acquiring unit is configured to:

Receiving voice information input by the user;

Obtaining the background picture according to the voice information.

Optionally, the first acquiring unit is configured to:

Extracting keywords in the voice information;

Optionally, the first acquiring unit is configured to:

Displaying a picture corresponding to the keyword on the display screen;

Receiving a confirmation operation of the user;

Optionally, the first acquiring unit is configured to:

Extracting keywords in the voice information;

Optionally, the first acquiring unit is configured to:

Grab all pictures related to the keyword from the network through a crawler;

Display the captured image in turn on the screen;

Receiving the user's confirmation operation;

Optionally, the extracting unit is configured to:

The person information is extracted by using a channel extraction technique.

Optionally, the device further includes:

a second acquiring unit, configured to acquire weather information and/or location information of a geographic location corresponding to the current background image;

An adding unit configured to add the weather information and/or the location information to the composite picture.

Optionally, the adding unit is further configured to save or share the composite picture.

In a third aspect, an embodiment of the present invention provides a computer storage medium, where the computer storage medium includes a set of instructions that, when executed, cause at least one processor to execute the image synthesis method described above.

Embodiments of the present invention provide an image synthesizing method, apparatus, and computer storage medium, which captures a character image in a solid color background; extracts character information from the character image; acquires a background image by using a voice control method; and synthesizes the character information On the background image, get a composite image sheet. Compared with the prior art, since the background image is taken in a solid color background, the background is relatively simple, and the background can be deleted by intelligent image processing technology, and the character information is extracted, so that the outline of the character information is high and will not be lost. The details of the characters, the composite picture is more natural, and the user experience is better.

DRAWINGS

1 is a schematic structural diagram of hardware of a mobile terminal that can be implemented in an embodiment of the present invention;

2 is a schematic flowchart 1 of an image synthesizing method according to an embodiment of the present invention;

FIG. 3 is a schematic flowchart 2 of an image synthesizing method according to an embodiment of the present invention;

4 is a schematic structural diagram 1 of an image synthesizing apparatus according to an embodiment of the present invention;

FIG. 5 is a schematic structural diagram 2 of an image synthesizing apparatus according to an embodiment of the present invention.

detailed description

The technical solutions in the embodiments of the present invention will be clearly and completely described in the following with reference to the accompanying drawings. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

A mobile terminal embodying various embodiments of the present invention will now be described with reference to the accompanying drawings. In the following description, the use of suffixes such as "module", "component" or "unit" for indicating an element is merely an explanation for facilitating the present invention, and does not have a specific meaning per se. Therefore, "module" and "component" can be used in combination.

The mobile terminal can be implemented in various forms. For example, the terminals described in the present invention may include, for example, mobile phones, smart phones, notebook computers, digital broadcast receivers, personal digital assistants (PDAs), tablet computers (PADs), portable multimedia players (PMPs), navigation devices, and the like. Mobile terminals and fixed terminals such as digital TVs, desktop computers, and the like. In the following, it is assumed that the terminal is a mobile terminal. However, it will be understood by those skilled in the art that the configuration according to an embodiment of the present invention can be applied to a fixed type in addition to an element particularly for moving purposes. terminal.

FIG. 1 is a schematic diagram showing the hardware structure of a mobile terminal that can be implemented in various embodiments of the present invention. As shown in FIG. 1, the mobile terminal includes:

The A/V input unit 120 is configured to receive an audio or video signal. The A/V input unit 120 may include a camera 121 and a microphone 122 that processes image data of still pictures or video obtained by the image capturing device in a video capturing mode or an image capturing mode. The processed image frame can be displayed on the display unit 151. The image frames processed by the camera 121 may be stored in the memory 160 (or other storage medium), and two or more cameras 121 may be provided according to the configuration of the mobile terminal. The microphone 122 can receive sound (audio data) via a microphone in an operation mode of a telephone call mode, a recording mode, a voice recognition mode, and the like, and can process such sound as audio data.

The user input unit 130 may generate key input data according to a command input by the user to control various operations of the mobile terminal. The user input unit 130 allows the user to input various types of information, and may include a keyboard, a pot, a touch pad (eg, a touch sensitive component that detects changes in resistance, pressure, capacitance, etc. due to contact), a scroll wheel , rocker, etc. In particular, when the touch panel is superimposed on the display unit 151 in the form of a layer, a touch screen can be formed.

Output unit 150 is configured to provide an output signal (eg, an audio signal, a video signal, an alarm signal, a vibration signal, etc.) in a visual, audio, and/or tactile manner. The output unit 150 may include a display unit 151.

The display unit 151 can display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in a phone call mode, the display unit 151 can display a user interface (UI) or a graphical user interface (GUI) related to a call or other communication (eg, text messaging, multimedia file download, etc.). When the mobile terminal 100 is in a video call mode or an image capturing mode, the display unit 151 may display a captured image and/or a received image, a UI or GUI showing a video or image and related functions, and the like.

Meanwhile, when the display unit 151 and the touch panel are superposed on each other in the form of a layer to form a touch screen, the display unit 151 can function as an input device and an output device. The display unit 151 may include at least one of a liquid crystal display (LCD), a thin film transistor LCD (TFT-LCD), an organic light emitting diode (OLED) display, a flexible display, a three-dimensional (3D) display, and the like. Some of these displays may be configured to be transparent to allow a user to view from the outside, which may be referred to as a transparent display, and a typical transparent display may be, for example, a TOLED (Transparent Organic Light Emitting Diode) display or the like. According to a particular desired embodiment, the mobile terminal 100 may include two or more display units (or other display devices), for example, the mobile terminal may include an external display unit (not shown) and an internal display unit (not shown) . The touch screen can be configured to detect touch input pressure as well as touch input position and touch input area.

The memory 160 may store a software program or the like that performs processing and control operations performed by the controller 180, or may temporarily store data (for example, a phone book, a message, a still image, a video, and the like) that has been output or is to be output. Moreover, the memory 160 can store data regarding vibrations and audio signals of various manners that are output when a touch is applied to the touch screen.

The memory 160 may include at least one type of storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static random access memory ( SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like. Moreover, the mobile terminal 100 can cooperate with a network storage device that performs a storage function of the memory 160 through a network connection.

The controller 180 typically controls the overall operation of the mobile terminal. For example, the controller 180 performs the control and processing associated with voice calls, data communications, video calls, and the like.

The power supply unit 190 receives external power or internal power under the control of the controller 180 and provides appropriate power required to operate the various components and components.

The various embodiments described herein can be used, for example, in computer software, hardware, or any of them. The combined computer readable medium is implemented. For hardware implementations, the embodiments described herein may be through the use of application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays ( An FPGA, a processor, a controller, a microcontroller, a microprocessor, at least one of the electronic units designed to perform the functions described herein, in some cases, such an embodiment may be at the controller 180 Implemented in the middle. For software implementations, implementations such as procedures or functions may be implemented with separate software modules that permit the execution of at least one function or operation. The software code can be implemented by a software application (or program) written in any suitable programming language, which can be stored in memory 160 and executed by controller 180.

So far, the mobile terminal has been described in terms of its function. Hereinafter, for the sake of brevity, a slide type mobile terminal among various types of mobile terminals such as a folding type, a bar type, a swing type, a slide type mobile terminal, and the like will be described as an example. Therefore, the present invention can be applied to any type of mobile terminal, and is not limited to a slide type mobile terminal.

The mobile terminal 100 as shown in FIG. 1 may be configured to operate using a communication system such as a wired and wireless communication system and a satellite-based communication system that transmits data via frames or packets.

Embodiment 1

The embodiment of the present invention provides an image synthesizing method, which is applied to a terminal. The terminal may be a mobile phone, a smart phone, a tablet computer, etc., which is not limited in this embodiment of the present invention. As shown in FIG. 2, the image synthesis method includes:

Step 201: Shoot a character image on a solid color background.

For example, the background color can not be selected in principle. The commonly used background colors are green and blue. The reason is that the natural color of the human body does not contain these two colors, with green and blue. The background will not be mixed with the characters. If the clothes in front of the scene are green, use a blue background. If the clothes are blue, use a green background. At the same time, the green and blue colors are still two of the primary colors in the system, which is also easier to handle.

Under normal circumstances, China generally uses a blue background, which is often used in green screens and blue screens in Europe and the United States, especially when shooting people, the green screen is often used, because many European and American people's eyes are blue. In order to facilitate channel extraction during post-production, there are many problems to be aware of when shooting people in a solid color background. For example, the character can't contain the selected background color; the background color must be the same, the illumination is even, and the background or the light should be avoided as much as possible to avoid inconvenience to the channel extraction; sometimes the background size is large, and many blocks are needed. Cloth or board splicing.

Step 202: Extract character information from the character image.

For example, person information may be extracted using Matt Extraction, which may also be referred to as an image. Many film and television works can extract the foreground information of the pictures taken in the studio under the solid color background through the channel extraction technology, and synthesize the pictures taken with the exterior scene to create a more exciting picture effect.

In practical applications, Blue Screen is the most important method for channel extraction. The blue screen technology is to take a picture of a person on a blue background, and then use the difference of chromaticity to remove the monochrome background and get the character. Information, so the blue screen technology has a scientific name called Chroma Keying. In an embodiment, the background of the key is selected in blue. Among them, the software commonly used in the blue screen technology is AE (After Effect), which is a video editing and design software developed by Adobe, and is a professional non-linear editing software for video post-synthesis processing.

Step 203: Acquire a background image by using a voice control manner.

For example, the user may send voice information to the terminal, where the voice information includes keywords related to the background picture required by the user, such as the composition of the scene, the elements appearing in the background picture, the name of the scenic spot, etc., the terminal After receiving the voice information, the keyword related to the background image may be obtained by extracting the information, and then the background image that meets the requirement is obtained according to the keyword.

In an embodiment, the correspondence between the keyword and the picture may be preset in the terminal during initialization, and the correspondence may be as shown in Table 1:

关键词Key words	图片image
秋千Swing	图像上存在秋千的图片APicture A of the swing on the image
郁金香tulip	图像上存在郁金香田的图片BPicture B of Tulip Field exists on the image
手机Mobile phone	图像上存在智能手机的图片CA picture C of the smartphone exists on the image

Table 1

For example, when the keyword extracted by the terminal from the voice information sent by the user is “mobile phone”, according to the correspondence between the keyword and the picture, the picture corresponding to the “mobile phone” should be the picture C of the smart phone on the image, and then The picture C is displayed on the screen for the user to confirm. If the user determines that the picture C meets the requirements, the picture C can be used as a background picture.

In an embodiment, the crawler program may also be used to retrieve keywords related to keywords from the network for selection by the user. For example, when the keyword extracted by the terminal from the voice information sent by the user is “mobile phone”, the terminal crawls all the pictures related to “mobile phone” from the network through the crawler program, and then displays the captured picture in turn. On the screen, for the user to confirm. If the user determines that an image meets the requirements, the selected image of the user can be used as the background image.

Step 204: Synthesize the character information on the background picture to obtain a composite picture.

For example, the background picture acquired and meeting the user's needs is analyzed to obtain an optimal composition scheme, and then the acquired character information is synthesized on the background picture according to an optimal composition scheme to obtain a composite picture.

Optionally, after the synthesized picture is obtained, current weather information of the geographic information corresponding to the background image may also be acquired, and then the weather information and/or the location information is added to the composite picture. For example, suppose the background image is the Badaling Great Wall. After obtaining the composite picture, you can also get the weather information of the current Badaling, and then add the weather information of Badaling to the composite picture, so that the viewer watching the composite picture has a user on the picture. The feeling of the Badaling Great Wall. In the actual application, location information may also be added to the composite image, where the location information may be location information of a geographic location corresponding to the background image. For example, you can place a letter on the location of the Badaling Great Wall. The information is added to the composite picture, and the location information of the Badaling Great Wall may be identified by latitude and longitude, or may be identified by a Chinese character, which is not limited by the embodiment of the present invention.

Optionally, the composite picture may be further beautified and edited after the composite picture is obtained. For example, add a filter for processing, or adjust the contrast or brightness of a composite image.

An embodiment of the present invention provides an image synthesizing method, including: capturing a character image in a solid color background; extracting character information from the character image; acquiring a background image by using a voice control method; and synthesizing the character information in the background On the picture, get a composite picture. Compared with the prior art, since the background image is taken in a solid color background, the background is relatively simple, and the background can be deleted by intelligent image processing technology, and the character information is extracted, so that the outline of the character information is high and will not be lost. The details of the characters, the composite picture is more natural, and the user experience is better.

Embodiment 2

An embodiment of the present invention provides an image synthesizing method, which is applied to a terminal, as shown in FIG. 3, and includes:

Step 301: The correspondence between the preset keyword and the picture is performed, and step 302 is performed.

For example, the correspondence between the keyword and the picture can be referred to Table 1.

Step 302: Receive voice information input by the user, and perform step 303.

For example, when the user needs to select a background image, the basic features of the desired background image can be spoken against the microphone of the terminal. The terminal determines that the user inputs the voice information when detecting that the microphone receives the sound signal.

Step 303: Extract the keyword of the voice information, and perform step 304.

For example, a keyword database may be set in advance in the terminal, and the keyword database stores sound characteristics of each keyword, including phonetic symbols, tones, audio, and the like. After receiving the voice information sent by the user, the terminal compares the voice information with each keyword in the keyword database to extract keywords of the voice information.

Step 304: Select a picture corresponding to the keyword of the voice information according to the correspondence between the keyword and the picture, and perform step 305.

For example, if the keyword of the voice information extracted by the terminal is “tulip”, according to the correspondence between the keyword and the picture shown in Table 1, the picture corresponding to the “tulip” should be the picture B of the tulip field on the image.

Step 305: Display a picture corresponding to the keyword of the voice information on the display screen. If the user confirms that the picture meets the requirements, go to step 306. If the user confirms that the picture does not meet the requirements, go to step 311.

For example, a picture B of the tulip field on the image corresponding to the "tulip" is displayed on the display, and then the user is prompted to confirm. For example, while displaying the picture B, the prompt information is displayed, and the prompt information includes a “confirm” button and a “cancel” button. If the user thinks that the picture B meets the requirements, the user may click the “confirm” button, and the terminal confirms that the user thinks the picture. B meets the requirements; if the user thinks that picture B does not meet the requirements, you can click the “Cancel” button, and the terminal confirms that the user thinks that picture B does not meet the requirements.

Step 306: Capture a character image on a solid color background, and perform step 307.

For example, a person image can be taken on a pure blue background to ensure that the character does not contain the blue of the background.

Step 307: Extract character information from the character image, and perform step 308.

For example, the blue screen technology may be used to extract the character information, that is, the difference between the chromaticity between the person and the background on the captured person image, and the blue background is removed to obtain the character information.

Step 308: Synthesize the character information on the background picture to obtain a composite picture.

For example, the background picture acquired and meeting the user's needs is analyzed to obtain an optimal composition scheme, and then the acquired character information is synthesized on the background picture according to an optimal composition scheme to obtain a composite picture. The specific synthesis method is a prior art, and details are not described herein.

Step 309: Add weather information to the composite picture, and perform step 310.

For example, after the composite picture is obtained, weather information of the geographic location where the tulip field in the picture B is located may also be acquired, and then the weather information is added on the composite picture.

Assuming that the current weather information of the geographic location of the tulip field is "cloudy, 24-32 ° C", the "cloudy, 24-32 ° C" may be added to the lower right corner of the composite picture.

In practical applications, the location information of the geographic location where the tulip field in the picture B is located may also be added to the composite picture. Assuming that the location information of the geographic location where the tulip field is located is "east longitude 4 ° 21 ', north latitude 51 ° 45 '", the "east longitude 4 ° 21 ', north latitude 51 ° 45 '" may be added to the right of the composite picture Lower corner.

Step 310: Save or share the composite picture, and the process ends.

Optionally, after the composite picture is completed, the composite picture may be saved in a memory of the terminal, or the composite picture may be shared, for example, sent to a WeChat friend circle, or sent to the microblog, which is in the embodiment of the present invention. This is not limited.

Step 311: The crawler program is used to capture a picture related to the keyword of the voice information from the network.

For example, when the background image that meets the user's requirements cannot be obtained through the correspondence between the keyword and the picture, the crawler program can also be used to grab the relevant picture from the network, and then display the captured picture in turn, so that the user can confirm. The reptile program is a prior art, and the embodiments of the present invention are not described herein.

After the user confirms the background picture, steps 306-310 are continued to complete the synthesis of the picture.

It should be noted that the sequence of the steps of the image synthesizing method provided by the embodiment of the present invention may be appropriately adjusted, and the steps may also be correspondingly increased or decreased according to the situation, and any person skilled in the art may be within the technical scope disclosed by the present invention. Methods that can be easily conceived of variations are encompassed within the scope of the present invention and therefore will not be described again.

The embodiment of the present invention provides an image synthesizing method. Compared with the prior art, since a person image is captured in a solid color background, the background is relatively simple, and the background can be deleted by an intelligent image processing technology to extract character information. The outline of the character information is high, and the details of the character are not lost. The composite picture is more natural and the user experience is better.

Embodiment 3

In order to implement the method of the embodiment of the present invention, an embodiment of the present invention provides an image synthesizing device 40, which is located at a terminal, as shown in FIG. 4, and includes:

The photographing unit 401 is configured to photograph a person image on a solid color background.

The extracting unit 402 is configured to extract character information from the character image (the person information can be extracted by a channel extraction technique).

The first obtaining unit 403 is configured to acquire a background image by using a voice control manner.

The synthesizing unit 404 is configured to synthesize the person information on the background picture to obtain a composite picture.

In this way, since the background image is taken in a solid color background, the background is relatively simple, and the background can be deleted by intelligent image processing technology, and the character information is extracted, so that the outline of the character information is high, and the character details are not lost. The composite picture is more natural and the user experience is better.

Optionally, the first obtaining unit 403 is specifically configured to: receive voice information input by the user; and acquire the background image according to the voice information.

Optionally, the first obtaining unit 403 is specifically configured to: extract a keyword in the voice information; and acquire the background image in a preset image library according to a correspondence between the keyword and the image.

In an embodiment, the first acquiring unit 403 is specifically configured to:

Displaying a picture corresponding to the keyword on the display screen;

Receiving a confirmation operation of the user;

In response to the confirming operation, confirming that the displayed picture corresponding to the keyword is the background image sheet.

Optionally, the first obtaining unit 403 is specifically configured to: extract a keyword in the voice information; according to the keyword, use a crawler program to fetch the background image corresponding to the keyword from a network .

In an embodiment, the first acquiring unit 403 is specifically configured to:

Grab all pictures related to the keyword from the network through a crawler;

Display the captured image in turn on the screen;

Receiving the user's confirmation operation;

Responding to the confirming operation, using the picture confirmed by the user as the background image

In an embodiment, as shown in FIG. 5, the apparatus 40 may further include: a second obtaining unit 405 configured to acquire weather information and/or location information of a geographic location corresponding to the current background image; and adding unit 406 And configured to add the weather information and/or the location information to the composite picture.

The adding unit 406 is further configured to save or share the composite picture.

It should be noted that, firstly, those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the foregoing apparatus and unit can refer to the corresponding process in the foregoing method embodiment, where No longer.

Second, the extracting unit 402, the first obtaining unit 403, the synthesizing unit 404, the second obtaining unit 405, and the adding unit 406 may each be processed by a central processing unit (CPU) located in the image synthesizing device 40. (Micro Processor Unit, MPU), Digital Signal Processor (DSP), or Field Programmable Gate Array (FPGA). The photographing unit 401 is realized by a camera located in the image synthesizing device 40.

An embodiment of the present invention provides an image synthesizing apparatus, including: a photographing unit configured to photograph a person image in a solid color background. An extracting unit configured to extract a person from the image of the person Information. The first obtaining unit is configured to obtain a background image by using a voice control manner. And a synthesizing unit configured to synthesize the character information on the background image to obtain a composite picture. Compared with the prior art, since the background image is taken in a solid color background, the background is relatively simple, and the background can be deleted by intelligent image processing technology, and the character information is extracted, so that the outline of the character information is high and will not be lost. The details of the characters, the composite picture is more natural, and the user experience is better.

It is to be understood that the phrase "one embodiment" or "an embodiment" or "an" Thus, "in one embodiment" or "in an embodiment" or "an" In addition, these particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. It should be understood that, in various embodiments of the present invention, the size of the sequence numbers of the above processes does not mean the order of execution, and the order of execution of each process should be determined by its function and internal logic, and should not be directed to the embodiments of the present invention. The implementation process constitutes any limitation. The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.

It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device comprising a series of elements includes those elements. It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, article, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.

In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. The device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, such as: multiple units or components may be combined, or Can be integrated into another system, or some features can be ignored or not executed. In addition, the coupling, or direct coupling, or communication connection of the various components shown or discussed may be through some interface, device or unit. The indirect coupling or communication connection can be electrical, mechanical or other form.

The units described above as separate components may or may not be physically separated, and the components displayed as the unit may or may not be physical units; they may be located in one place or distributed on multiple network units; Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated into one unit; The unit can be implemented in the form of hardware or in the form of hardware plus software functional units.

It will be understood by those skilled in the art that all or part of the steps of implementing the foregoing method embodiments may be performed by hardware related to program instructions. The foregoing program may be stored in a computer readable storage medium, and when executed, the program includes The foregoing steps of the method embodiment; and the foregoing storage medium includes: a removable storage device, a read only memory (ROM), a magnetic disk, or an optical disk, and the like, which can store program codes.

Alternatively, the above-described integrated unit of the present invention may be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a standalone product. Based on such understanding, the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product stored in a storage medium, including a plurality of instructions. A computer device (which may be a personal computer, server, or network device, etc.) is caused to perform all or part of the methods described in various embodiments of the present invention. The foregoing storage medium includes various media that can store program codes, such as a mobile storage device, a ROM, a magnetic disk, or an optical disk.

Based on this, an embodiment of the present invention provides a computer storage medium, where the computer storage medium includes a set of instructions that, when executed, cause at least one processor to perform the image synthesis method described in the embodiments of the present invention.

The above is only the preferred embodiment of the present invention and is not intended to limit the scope of the present invention.

Claims

An image synthesis method comprising:

Shooting images of people on a solid background;

Extracting character information from the character image;

Acquire the background image by voice control;

The person information is synthesized on the background picture to obtain a composite picture.
The method of claim 1, wherein the acquiring the background image by using a voice control method comprises:

Receiving voice information input by the user;

Obtaining the background picture according to the voice information.
The method according to claim 2, wherein the obtaining the background image according to the voice information comprises:

Extracting keywords in the voice information;

The background image is acquired in a preset image library according to the correspondence between the keyword and the picture.
The method according to claim 3, wherein the obtaining the background image in a preset image library according to a correspondence between a keyword and a picture comprises:

Displaying a picture corresponding to the keyword on the display screen;

Receiving a confirmation operation of the user;

In response to the confirming operation, it is confirmed that the picture corresponding to the displayed keyword is the background picture.
The method according to claim 2, wherein the obtaining the background image according to the voice information comprises:

Extracting keywords in the voice information;

According to the keyword, the background image corresponding to the keyword is captured from the network by using a crawler program.
The method of claim 5, wherein the crawling the background image corresponding to the keyword from the network by using a crawler program comprises:

Grab all pictures related to the keyword from the network through a crawler;

Display the captured image in turn on the screen;

Receiving the user's confirmation operation;

In response to the confirming operation, the picture confirmed by the user is taken as the background picture.
The method of claim 1, wherein the extracting the character information from the character image comprises:

The person information is extracted by using a channel extraction technique.
The method according to claim 7, wherein the solid color background is a blue background; and the extracting the character information by using a channel extraction technique comprises:

The character information is extracted using a blue screen technology.
The method according to any one of claims 1 to 8, wherein after the synthesizing the person information on the background picture to obtain a composite picture, the method further comprises:

Obtaining weather information and/or location information of a geographic location corresponding to the current background image;

The weather information and/or the location information is added to the composite picture.
The method of claim 9 wherein the method further comprises:

Save or share the composite picture.
An image synthesizing device comprising:

a shooting unit configured to capture a portrait of a person on a solid background;

An extracting unit configured to extract character information from the character image;

The first obtaining unit is configured to obtain a background image by using a voice control manner;

And a synthesizing unit configured to synthesize the character information on the background image to obtain a composite picture.
The apparatus of claim 11, wherein the first obtaining unit is configured to:

Receiving voice information input by the user;

Obtaining the background picture according to the voice information.
The apparatus of claim 12, wherein the first obtaining unit is configured to:

Extracting keywords in the voice information;

The background image is acquired in a preset image library according to the correspondence between the keyword and the picture.
The apparatus of claim 13, wherein the first obtaining unit is configured to:

Displaying a picture corresponding to the keyword on the display screen;

Receiving a confirmation operation of the user;

In response to the confirming operation, it is confirmed that the picture corresponding to the displayed keyword is the background picture.
The apparatus of claim 12, wherein the first obtaining unit is configured to:

Extracting keywords in the voice information;

According to the keyword, the background image corresponding to the keyword is captured from the network by using a crawler program.
The apparatus of claim 15, wherein the first obtaining unit is configured to:

Grab all pictures related to the keyword from the network through a crawler;

Display the captured image in turn on the screen;

Receiving the user's confirmation operation;

In response to the confirming operation, the picture confirmed by the user is taken as the background picture.
The apparatus of claim 11 wherein said extracting unit is configured to:

The person information is extracted by using a channel extraction technique.
The device according to any one of claims 11 to 17, wherein the device further comprises:

a second acquiring unit, configured to acquire weather information and/or location information of a geographic location corresponding to the current background image;

An adding unit configured to add the weather information and/or the location information to the composite picture.
The apparatus of claim 18, wherein the adding unit is further configured to save or share the composite picture.
A computer storage medium comprising a set of instructions that, when executed, cause at least one processor to perform the image composition method of any one of claims 1 to 10.