WO2019184539A1 - Picture processing - Google Patents

Picture processing

Info

Publication number
WO2019184539A1
WO2019184539A1 PCT/CN2019/070040 CN2019070040W WO2019184539A1 WO 2019184539 A1 WO2019184539 A1 WO 2019184539A1 CN 2019070040 W CN2019070040 W CN 2019070040W WO 2019184539 A1 WO2019184539 A1 WO 2019184539A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
character string
string
user
key
Prior art date
Application number
PCT/CN2019/070040
Other languages
English (en)
French (fr)
Inventor
刘双喜
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司
Publication of WO2019184539A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/26: Speech to text systems
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06T: IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00: 2D [Two Dimensional] image generation
    • G06T11/60: Editing figures and text; Combining figures or text
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval

Definitions

  • The embodiments of the present specification relate to the field of image processing, and more particularly, to a picture processing method and apparatus.
  • Embodiments of the present specification aim to provide a more effective picture processing solution that addresses deficiencies in the prior art.
  • An aspect of the present specification provides a picture processing method, including: after a user opens a picture, receiving the user's voice in response to a user operation; recognizing a first character string from the voice as an added item; and adding the added item to the picture.
  • Another aspect of the present specification provides a picture processing method, including: after a user opens a picture, receiving the user's voice in response to a user operation; recognizing a first character string from the voice; acquiring, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and adding the at least one added item to the picture, respectively.
  • In one embodiment, acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item includes: obtaining, from the first character string, a character string that matches a key string in the key string library as an added item.
  • In one embodiment, the acquiring includes: obtaining, from the first character string, a third character string that matches a key string in the key string library, where the third character string represents a unit of quantity and is preceded in the first character string by a numeric character string; and obtaining, as an added item, a character string consisting of the numeric character string followed by the third character string.
  • In one embodiment, the acquiring includes: obtaining, from the first character string, a fourth character string that matches a key string in the key string library as an added item, where the fourth character string is preset to correspond to a specific graphic; and obtaining the specific graphic as an added item.
  • In one embodiment, the acquiring includes: obtaining, from the first character string, a fifth character string that matches a key string in the key string library, where the fifth character string is preset to correspond to a specific graphic; and obtaining the specific graphic as an added item.
  • The picture processing method further includes: after the user opens the picture, acquiring, according to a picture application scene selected by the user, at least one graphic preset to correspond to the scene as at least one added item, and adding the at least one added item acquired according to the scene to the picture, respectively.
  • In one embodiment, the picture application scene is a merchandise marketing scene, and the at least one graphic preset to correspond to the merchandise marketing scene includes: a ruler, a label, a frame, and an arrow.
  • Acquiring according to the preset key string library includes acquiring according to a key string library corresponding to the picture application scene selected by the user.
  • In one embodiment, the scene is a merchandise marketing scene, and the key string library corresponding to the scene includes key strings for the following attributes: material, size, color, price, and appearance.
  • The picture processing method further includes: displaying, on the screen, a voice input content prompt corresponding to the picture application scene before or after receiving the user's voice.
  • The picture processing method further includes: after adding the added item to the picture, performing at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.
  • The user opening the picture includes: the user opening the picture in an album of the user's terminal, the user opening the picture in a social APP, or the user opening the picture in an APP used to execute the method.
  • Another aspect of the present specification provides a picture processing apparatus, including: a receiving unit configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit configured to recognize a first character string from the voice as an added item; and an adding unit configured to add the added item to the picture.
  • Another aspect of the present specification provides a picture processing apparatus, including: a receiving unit configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit configured to recognize a first character string from the voice; an acquisition unit configured to acquire, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and an adding unit configured to add the at least one added item to the picture, respectively.
  • Another aspect of the present specification provides a computer readable storage medium having stored thereon instruction code that, when executed in a computer, causes the computer to perform the picture processing method described above.
  • FIG. 1 schematically shows a system 100 in accordance with an embodiment of the present specification;
  • FIG. 2 shows a flow chart of a picture processing method according to an embodiment of the present specification;
  • FIG. 3 shows a flow chart of a picture processing method according to an embodiment of the present specification;
  • FIG. 4 shows an example of a merchandise marketing scene;
  • FIG. 5 schematically shows a voice input content prompt on the screen in a merchandise marketing scene;
  • FIG. 6 is a schematic diagram showing a text added item, a label added item, and a ruler added item respectively added to a picture;
  • FIG. 7 shows a picture processing apparatus 700 in accordance with an embodiment of the present specification; and
  • FIG. 8 shows a picture processing apparatus 800 in accordance with an embodiment of the present specification.
  • FIG. 1 schematically illustrates a system 100 in accordance with an embodiment of the present specification.
  • As shown in FIG. 1, the system 100 includes a display unit 11, a voice receiving unit 12, a voice recognition unit 13, an acquisition unit 14, a key string library 15, and a picture editing unit 16.
  • First, the user opens a picture through the display unit 11.
  • After opening the picture, the user can trigger the voice receiving unit 12 through the interface of the voice receiving unit 12.
  • For example, long-pressing a microphone icon displayed on the screen triggers the voice receiving unit 12 to start receiving voice.
  • After the user releases the interface (for example, releases the microphone icon), the voice receiving unit 12 transmits the received voice to the voice recognition unit 13.
  • The voice recognition unit 13 recognizes the received voice as a character string by a voice recognition function; the character string may include text, numbers, letters, symbols, and the like.
  • In one embodiment, the voice recognition unit 13 transmits the recognized character string to the picture editing unit 16, so that the picture editing unit 16 adds the character string to the picture.
  • In another embodiment, the voice recognition unit 13 sends the recognized character string to the acquisition unit 14. The acquisition unit 14 calls the key string library 15 and matches the character string against the key strings in the library, thereby acquiring a key string contained in the character string, a corresponding character string combination, or a corresponding graphic as an added item, and transmits the added item to the picture editing unit 16. Thereafter, the picture editing unit 16 adds the added item to the picture.
  • FIG. 2 shows a flow chart of a picture processing method in accordance with an embodiment of the present specification.
  • The method includes: in step S21, after the user opens the picture, receiving the user's voice in response to a user operation; in step S22, recognizing a character string from the voice as an added item; and in step S23, adding the added item to the picture.
  • First, in step S21, after the user opens the picture, the user's voice is received in response to a user operation.
  • The device on which the user opens the picture is not limited; for example, the user can open the picture on a portable smart device or on a computer.
  • When the user opens the picture on, for example, a mobile phone, the specific place where the picture is opened is not limited.
  • For example, the user may open the picture in a mobile phone album that has the picture processing function according to an embodiment of the present specification, in a social APP (e.g., a friends circle, a life circle, etc.) that has that function, or in an APP for executing the picture processing method according to an embodiment of the present specification.
  • After opening the picture, the user can perform an operation that opens the interface for voice reception. For example, when the user opens the picture on a computer, the user can start voice reception by turning on the microphone. When the user opens the picture on a mobile phone, the user can long-press the microphone icon on the screen to start voice reception. In one embodiment, the user can tap the microphone icon on the screen (the icon is located outside the picture) and then long-press a specific location in the picture for voice input, so that the label obtained by voice recognition is inserted at that specific location in the picture.
  • In step S22, a character string is recognized from the voice as an added item.
  • Here, speech recognition can be performed by an existing speech recognition function,
  • so that the corresponding character string is recognized from the input voice.
  • The corresponding character string may include Chinese characters, numeric characters, alphabetic characters, symbol characters, and the like.
  • In step S23, the added item is added to the picture. That is, the above character string is added to the picture as a text box.
  • In one embodiment, the user long-presses the microphone icon on the screen for voice input, in which case the system adds the added item at a random location in the picture.
  • In another embodiment, after tapping the microphone icon, the user long-presses a specific location in the picture for voice input, in which case the system adds the added item at that specific location in the picture.
  • For example, when the picture processing is performed in a picture processing APP according to an embodiment of the present specification, the APP may provide selection buttons for a plurality of scenes.
  • The plurality of scenes include, for example, a merchandise marketing scene, a selfie scene, a teaching scene, a matchmaking scene, and the like.
  • In the APP, the user can select a scene before opening the picture, or select a scene after opening the picture.
  • In the APP, corresponding graphics are preset for some of the scenes.
  • For example, for the merchandise marketing scene, the preset corresponding graphics include a ruler, a label, a picture, an arrow, and the like. Therefore, after the user opens the picture and selects the merchandise marketing scene, the APP automatically acquires the corresponding graphics (ruler, label, etc.) and automatically adds the ruler and the label to the picture.
  • Opening the picture in the APP is described here for illustrative purposes only. For example, the user can also open the picture in the mobile phone album and select the picture application scene after the picture is opened.
  • In one embodiment, a voice input content prompt corresponding to the picture application scene is displayed on the screen before or after receiving the user's voice.
  • After the added item is added, the user can perform various operations on it. For example, when using a mobile phone, the user can change the position and size of the added item by gestures: pressing the added item and sliding it on the screen moves it to a new position, rotating it with two fingers adjusts its angle, and sliding two fingers along its diagonal adjusts its size.
  • In addition, the user may input new characters into the added item or delete existing characters from it, or the user may long-press the added item to display more operation buttons, for example a delete button, and thereby perform further editing operations on the added item.
  • FIG. 3 shows a flow chart of a picture processing method in accordance with an embodiment of the present specification.
  • The method includes: in step S31, after the user opens the picture, receiving the user's voice in response to a user operation; in step S32, recognizing a first character string from the voice; in step S33, acquiring, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and in step S34, adding the at least one added item to the picture, respectively.
  • Steps S31 and S32 in the method are substantially the same as steps S21 and S22 in FIG. 2, and details are not described herein again.
  • In step S33, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string is acquired, according to a preset key string library, as at least one added item.
  • In one embodiment, the second character string is the first character string.
  • The key string library can be obtained by manual curation or by machine learning. It can include key strings that correspond to each specific scene.
  • For example, one specific scene is a merchandise marketing scene, in which the user, in order to promote an item in the picture, needs to label various attributes of the item; the attributes include, for example, material, size, color, price, and appearance. Therefore, the key string library corresponding to the merchandise marketing scene may include key strings for each of these attributes.
  • For example, the material category may include key strings representing materials, and the size category may include key strings representing units of size, such as "cm", "m", and "公分" (centimeter).
  • As another example, the scene is a matchmaking scene, in which the user, in order to introduce a person in the picture, needs to attach various personal attribute labels to the person.
  • The attributes include, for example, age, field of study, and employer.
  • The key string library corresponding to the matchmaking scene may then include key strings corresponding to these attributes, such as a unit of age (years), physics, biology, automation, company, firm, and the like.
  • As another example, the scene is a selfie scene.
  • In this scene, the user can attach mood and feeling labels to the selfie picture. Therefore, the key string library corresponding to the scene may include key strings such as "happy", "angry", and "anxious".
  • In one embodiment, the user may select a picture application scene.
  • For example, after the user opens the picture, scene option buttons may be displayed on the screen and the user may select the desired picture application scene through the buttons; alternatively, the user may select the picture application scene before opening the picture.
  • After the user selects a scene, the system acquires the added item according to the preset key string library corresponding to that scene.
  • FIG. 4 shows an example of a merchandise marketing scene. After opening the picture shown in FIG. 4, the user can select the "merchandise marketing scene".
  • After recognizing the user's voice input as a character string, the system then calls the key string library corresponding to the merchandise marketing scene and matches it against the character string.
  • In one embodiment, after receiving the picture application scene selected by the user, the system displays a voice input content prompt corresponding to the scene on the screen before or after receiving the user's voice.
  • FIG. 5 schematically shows the voice input content prompt on the screen in the merchandise marketing scene, including "长120里面" (literally "length 120 inside"; size), "the metal is brushed, polished pure copper" (material), "new for spring", "wholesale price 50 yuan" (price), and so on.
  • The voice input content prompt may be preset for a specific scene.
  • In one embodiment, after selecting the merchandise marketing scene as described above, the user long-presses the microphone on the screen and inputs the voice "30 cm high, 35 cm wide, the hardware material is frosted pure copper hardware, the decoration is round-head nail punching, the price is 120 yuan".
  • After recognizing the voice as a character string, the system matches the character string against the key strings in the key string library corresponding to the merchandise marketing scene.
  • The material category of the key string library includes the key string "frosted pure copper hardware" and the appearance category includes the key string "round-head nail punching", so "frosted pure copper hardware" and "round-head nail punching" are obtained as added items to be added to the picture.
  • In one embodiment, key strings for material and appearance are preset in the key string library to correspond to the label graphic. Therefore, after obtaining the added items "frosted pure copper hardware" and "round-head nail punching", the system also automatically obtains the label graphic as an added item.
  • The label graphic is used to mark, in the picture, the specific position corresponding to the "frosted pure copper hardware" material and the specific position corresponding to the "round-head nail punching" appearance.
  • In one embodiment, a "cm" matching the key string "cm" in the size category of the key string library may be obtained from the above character string, and it may be determined that, in the character string, the "cm" is preceded by a numeric character string; therefore "30cm" and "35cm" are obtained from the character string as added items to be added to the picture.
  • In one embodiment, "cm" is set in the key string library to correspond to the ruler graphic, so that after the added items "30cm" and "35cm" are acquired, the system also automatically acquires the ruler graphic as an added item.
  • The price category of the key string library includes the key string "yuan" (元), so the key string "yuan" can be obtained from the above character string. It can further be determined that, in the character string, "yuan" is preceded by a numeric character string, so "120 yuan" is obtained from the character string as an added item to be added to the picture.
  • In one embodiment, the key strings "high" and "wide" are included in the size category of the key string library and are set in the library to correspond to the ruler graphic. Therefore, after obtaining the key strings "high" and "wide" from the character string, the system obtains the ruler graphic as an added item.
  • The added graphic is not limited to the labels and rulers described above; it may also be an arrow, various geometric shapes for circling a region, a frame, and the like.
  • For example, in the key string library, the label can be set to correspond to key strings such as colors or materials, and the ruler can be set to correspond to strings representing a length or a unit of length.
  • In other scenes, a frame corresponding to dialogue content, an emoticon corresponding to a mood, and the like may be added according to key string matching.
  • FIG. 6 is a schematic diagram showing a text added item, a label added item, and a ruler added item respectively added to a picture.
  • After the added items are added to the picture, at least one of the following modifications may be made according to the user's gesture or input: changing the position of an added item, changing the size of an added item, editing the content of an added item, and deleting an added item.
  • For example, by gestures the user can move the two ends of the ruler to change its length, rotate the ruler to change its angle, delete the ruler, and so on.
  • In one embodiment, after the user opens the picture, at least one graphic preset to correspond to the scene is acquired as at least one added item according to the picture application scene selected by the user, and the at least one added item acquired according to the scene is added to the picture, respectively.
  • A specific example thereof is as described with reference to FIG. 2, and details are not described herein again.
  • The user can also add a two-dimensional code to the picture, for example through an interface on the screen for adding a two-dimensional code, so that the picture can be saved and shared.
  • The attributes of the product are displayed accurately and clearly through the labels in the picture, so that a buyer can quickly understand the product, which helps promote it.
  • FIG. 7 illustrates a picture processing apparatus 700 according to an embodiment of the present specification, including: a receiving unit 71 configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit 72 configured to recognize a first character string from the voice as an added item; and an adding unit 73 configured to add the added item to the picture.
  • FIG. 8 illustrates a picture processing apparatus 800 according to an embodiment of the present specification, including: a receiving unit 81 configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit 82 configured to recognize a first character string from the voice; a first acquisition unit 83 configured to acquire, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and a first adding unit 84 configured to add the at least one added item to the picture, respectively.
  • In one embodiment, the first acquisition unit is further configured to acquire, from the first character string, a character string that matches a key string in the key string library as an added item.
  • In one embodiment, the first acquisition unit is further configured to acquire, from the first character string, a third character string that matches a key string in the key string library, where the third character string represents a unit of quantity and is preceded in the first character string by a numeric character string, and to acquire, as an added item, a character string consisting of the numeric character string followed by the third character string.
  • In one embodiment, the first acquisition unit is further configured to acquire, from the first character string, a fourth character string that matches a key string in the key string library as an added item, where the fourth character string is preset to correspond to a specific graphic, and to acquire the specific graphic as an added item.
  • In one embodiment, the first acquisition unit is further configured to acquire, from the first character string, a fifth character string that matches a key string in the key string library, where the fifth character string is preset to correspond to a specific graphic, and to acquire the specific graphic as an added item.
  • The picture processing apparatus 800 further includes: a second acquisition unit 85 configured to acquire, after the user opens the picture and according to the picture application scene selected by the user, at least one graphic preset to correspond to the scene as at least one added item; and a second adding unit 86 configured to add the at least one added item acquired according to the scene to the picture.
  • The picture processing apparatus 800 further includes a prompting unit 87 configured to display a voice input content prompt corresponding to the scene on the screen after receiving the picture application scene selected by the user.
  • The picture processing apparatus 800 further includes a modifying unit 88 configured to perform, after the added item is added to the picture, at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.
  • The embodiments of the present specification further provide a computer readable storage medium having stored thereon instruction code that, when executed in a computer, causes the computer to execute the picture processing method described above.
  • The picture is annotated by voice input, which reduces the difficulty of picture processing, improves picture processing efficiency, and meets the user's needs.
  • The steps of a method or algorithm described in connection with the embodiments disclosed herein may be implemented in hardware, in a software module executed by a processor, or in a combination of the two.
  • The software module can be placed in random access memory (RAM), memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.

Abstract

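Embodiments of this specification disclose a picture processing method and apparatus. The method includes: after a user opens a picture, receiving the user's voice in response to a user operation; recognizing a first character string from the voice as an added item; and adding the added item to the picture.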

Description

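Picture processing
Cross-Reference to Related Applications
This patent application claims priority to Chinese patent application No. 201810266755.4, filed on March 28, 2018 and entitled "一种图片处理方法和装置" ("A picture processing method and apparatus"), the entire contents of which are incorporated herein by reference.
Technical Field
Embodiments of this specification relate to the field of image processing, and more particularly, to a picture processing method and apparatus.
Background
With the development of Internet technology, people increasingly post pictures on social platforms or send pictures to friends. For example, a user posts a picture of an item in a friends circle (朋友圈) to promote the item. In that case, some features of the item, such as its size, material, details, and appearance, need to be annotated in the picture. As another example, a user posts a photo of themselves in a friends circle and may wish to annotate the picture with their mood or feelings. The current approach is to annotate information such as size, material, mood, and feelings manually using picture editing software. Therefore, a more effective picture processing method is needed so that pictures can be annotated and tagged conveniently and quickly.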
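Summary
Embodiments of this specification aim to provide a more effective picture processing solution that addresses deficiencies in the prior art.
To achieve the above objective, one aspect of this specification provides a picture processing method, including: after a user opens a picture, receiving the user's voice in response to a user operation; recognizing a first character string from the voice as an added item; and adding the added item to the picture.
Another aspect of this specification provides a picture processing method, including: after a user opens a picture, receiving the user's voice in response to a user operation; recognizing a first character string from the voice; acquiring, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and adding the at least one added item to the picture, respectively.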
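In one embodiment, in the above picture processing method, acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item includes: acquiring, from the first character string, a character string that matches a key string in the key string library as an added item.
In one embodiment, in the above picture processing method, acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item includes: acquiring, from the first character string, a third character string that matches a key string in the key string library, where the third character string represents a unit of quantity and is preceded in the first character string by a numeric character string; and acquiring, as an added item, a character string consisting of the numeric character string followed by the third character string.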
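The "numeric string followed by a unit key string" rule can be sketched with a regular expression. This is a minimal illustration rather than the applicant's implementation: the function name `extract_quantities` and the list `UNIT_KEY_STRINGS` are hypothetical, and the unit strings are simply the examples the description gives for size and price.

```python
import re

# Hypothetical unit key strings, taken from the examples in the description.
UNIT_KEY_STRINGS = ["cm", "m", "公分", "元", "美元"]


def extract_quantities(first_string, units=UNIT_KEY_STRINGS):
    """Return every "<number><unit>" substring of the recognized string."""
    # Try longer units first so "cm" is not cut down to "m"
    # and "美元" is not cut down to "元".
    ordered = sorted(units, key=len, reverse=True)
    alternation = "|".join(re.escape(u) for u in ordered)
    pattern = re.compile(r"(\d+(?:\.\d+)?)\s*(" + alternation + ")")
    # A unit that is not preceded by a number does not match at all.
    return [number + unit for number, unit in pattern.findall(first_string)]
```

For instance, calling `extract_quantities("高30cm,宽35cm,价格120元")` picks out the two sizes and the price, while a unit that appears without a preceding number is ignored.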
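In one embodiment, in the above picture processing method, acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item includes: acquiring, from the first character string, a fourth character string that matches a key string in the key string library as an added item, where the fourth character string is preset to correspond to a specific graphic; and acquiring the specific graphic as an added item.
In one embodiment, in the above picture processing method, acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item includes: acquiring, from the first character string, a fifth character string that matches a key string in the key string library, where the fifth character string is preset to correspond to a specific graphic; and acquiring the specific graphic as an added item.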
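In one embodiment, the above picture processing method further includes: after the user opens the picture, acquiring, according to a picture application scene selected by the user, at least one graphic preset to correspond to the scene as at least one added item, and adding the at least one added item acquired according to the scene to the picture, respectively.
In one embodiment, in the above picture processing method, the picture application scene is a merchandise marketing scene, and the at least one graphic preset to correspond to the merchandise marketing scene includes: a ruler, a label, a frame, and an arrow.
In one embodiment, in the above picture processing method, acquiring according to the preset key string library includes acquiring according to a key string library corresponding to the picture application scene selected by the user.
In one embodiment, in the above picture processing method, the scene is a merchandise marketing scene, and the key string library corresponding to the scene includes key strings for the following attributes: material, size, color, price, and appearance.
In one embodiment, the above picture processing method further includes: before or after receiving the user's voice, displaying on the screen a voice input content prompt corresponding to the picture application scene.
In one embodiment, the above picture processing method further includes: after the added item is added to the picture, performing at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.
In one embodiment, in the above picture processing method, the user opening the picture includes: the user opening the picture in an album of the user's terminal, the user opening the picture in a social APP, or the user opening the picture in an APP used to execute the method.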
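Another aspect of this specification provides a picture processing apparatus, including: a receiving unit configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit configured to recognize a first character string from the voice as an added item; and an adding unit configured to add the added item to the picture.
Another aspect of this specification provides a picture processing apparatus, including: a receiving unit configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit configured to recognize a first character string from the voice; an acquisition unit configured to acquire, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item; and an adding unit configured to add the at least one added item to the picture, respectively.
Another aspect of this specification provides a computer readable storage medium having stored thereon instruction code that, when executed in a computer, causes the computer to perform the picture processing method described above.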
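Brief Description of the Drawings
The embodiments of this specification can be made clearer by describing them with reference to the accompanying drawings:
FIG. 1 schematically shows a system 100 according to an embodiment of this specification;
FIG. 2 shows a flow chart of a picture processing method according to an embodiment of this specification;
FIG. 3 shows a flow chart of a picture processing method according to an embodiment of this specification;
FIG. 4 shows an example of a merchandise marketing scene;
FIG. 5 schematically shows a voice input content prompt on the screen in a merchandise marketing scene;
FIG. 6 is a schematic diagram of a text added item, a label added item, and a ruler added item respectively added to a picture;
FIG. 7 shows a picture processing apparatus 700 according to an embodiment of this specification; and
FIG. 8 shows a picture processing apparatus 800 according to an embodiment of this specification.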
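Detailed Description
Embodiments of this specification are described below with reference to the accompanying drawings.
FIG. 1 schematically shows a system 100 according to an embodiment of this specification. As shown in FIG. 1, the system 100 includes a display unit 11, a voice receiving unit 12, a voice recognition unit 13, an acquisition unit 14, a key string library 15, and a picture editing unit 16. First, the user opens a picture through the display unit 11. After opening the picture, the user can trigger the voice receiving unit 12 through its interface, for example by long-pressing a microphone icon displayed on the screen, which triggers the voice receiving unit 12 to start receiving voice. After the user releases the interface of the voice receiving unit 12 (for example, releases the microphone icon), the voice receiving unit 12 sends the received voice to the voice recognition unit 13. The voice recognition unit 13 recognizes the received voice as a character string by a voice recognition function; the character string may include text, numbers, letters, symbols, and the like. In one embodiment, the voice recognition unit 13 sends the recognized character string to the picture editing unit 16, so that the picture editing unit 16 adds the character string to the picture. In another embodiment, the voice recognition unit 13 sends the recognized character string to the acquisition unit 14; the acquisition unit 14 calls the key string library 15 and matches the character string against the key strings in the library, thereby acquiring a key string contained in the character string, a corresponding character string combination, or a corresponding graphic as an added item, and sends the added item to the picture editing unit 16. The picture editing unit 16 then adds the added item to the picture.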
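The data flow between the units of system 100 can be sketched as a small pipeline. Everything named here is hypothetical: `Picture`, `process_voice`, and `fake_recognize` are illustrative stand-ins, and the recognizer is a stub that decodes UTF-8 bytes, since the description only says that an existing speech recognition function is used.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Picture:
    """Stand-in for an opened picture; only its added items are tracked."""
    added_items: List[str] = field(default_factory=list)


def process_voice(
    picture: Picture,
    audio: bytes,
    recognize: Callable[[bytes], str],
    acquire: Optional[Callable[[str], List[str]]] = None,
) -> Picture:
    """Route received voice through recognition (unit 13), optional
    acquisition (unit 14), and picture editing (unit 16)."""
    text = recognize(audio)
    # Without an acquisition unit the whole recognized string is the added
    # item; with one, only what it returns is added.
    items = [text] if acquire is None else acquire(text)
    picture.added_items.extend(items)
    return picture


def fake_recognize(audio: bytes) -> str:
    """Stub recognizer (assumption): treats the audio bytes as UTF-8 text."""
    return audio.decode("utf-8")
```

Passing the recognizer and the acquisition step in as callables keeps the two embodiments (direct insertion versus key string matching) on a single code path.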
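FIG. 2 shows a flow chart of a picture processing method according to an embodiment of this specification. The method includes: in step S21, after the user opens a picture, receiving the user's voice in response to a user operation; in step S22, recognizing a character string from the voice as an added item; and in step S23, adding the added item to the picture.
First, in step S21, after the user opens a picture, the user's voice is received in response to a user operation. The device on which the user opens the picture is not limited; for example, the user can open the picture on a portable smart device or on a computer. When the user opens the picture on, for example, a mobile phone, the specific place where the picture is opened is not limited. For example, the user can open the picture in a mobile phone album that has the picture processing function according to an embodiment of this specification, in a social APP (for example, a friends circle or a life circle) that has that function, or in an APP for executing the picture processing method according to an embodiment of this specification.
After opening the picture, the user can perform an operation that opens the interface for voice reception. For example, when the user opens the picture on a computer, the user can turn on the microphone to start voice reception on the computer. When the user opens the picture on a mobile phone, the user can long-press the microphone icon on the screen to start voice reception on the phone. In one embodiment, the user can tap the microphone icon on the screen (the icon is located outside the picture) and then long-press a specific location in the picture for voice input, so that the label obtained by voice recognition is inserted at that specific location in the picture.
In step S22, a character string is recognized from the voice as an added item. Here, speech recognition can be performed by an existing speech recognition function, so that the corresponding character string is recognized from the input voice. The corresponding character string may include Chinese characters, numeric characters, alphabetic characters, symbol characters, and the like.
In step S23, the added item is added to the picture. That is, the above character string is added to the picture as a text box. In one embodiment, the user long-presses the microphone icon on the screen for voice input, in which case the system adds the added item at a random location in the picture. In another embodiment, after tapping the microphone icon, the user long-presses a specific location in the picture for voice input, in which case the system adds the added item at that specific location in the picture.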
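The two placement behaviours in step S23 can be illustrated with a short helper. The function `choose_position` and its parameters are hypothetical, and uniform random sampling over pixel coordinates is an assumption; the description only says the location is random when no position was pressed.

```python
import random


def choose_position(picture_size, pressed_at=None, rng=random):
    """Return the (x, y) position at which to insert an added item.

    picture_size: (width, height) of the picture in pixels.
    pressed_at:   the location the user long-pressed in the picture, if any.
    rng:          source of randomness (assumed uniform over the picture).
    """
    if pressed_at is not None:
        # The user long-pressed a specific location: use it as given.
        return pressed_at
    width, height = picture_size
    # Only the microphone icon was long-pressed: pick a random location.
    return (rng.randrange(width), rng.randrange(height))
```

An application would likely clamp the position so that the whole text box stays inside the picture; that refinement is omitted here.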
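In one embodiment, after the user opens the picture, at least one graphic preset as corresponding to a picture application scene selected by the user is acquired as at least one added item, and the at least one added item acquired according to the scene is added to the picture. For example, when the picture processing is performed in a picture processing app according to an embodiment of this specification, the app may provide selection buttons for multiple scenes, such as a commodity marketing scene, a selfie scene, a teaching scene, and a matchmaking scene. In the app, the user may select a scene before opening the picture or after opening it. The app presets corresponding graphics for some of the scenes. For the commodity marketing scene, for example, the preset graphics include a ruler, a label, a frame, and an arrow. Thus, after the user opens the picture and selects the commodity marketing scene, the app automatically acquires the corresponding graphics, such as the ruler and the label, and automatically adds the ruler and the label to the picture. Those skilled in the art will understand that opening the picture in the app is only an example; the user may also open the picture in the phone album and select the picture application scene after the picture is opened.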
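The scene-to-graphic preset can be pictured as a simple lookup table. The sketch below is illustrative only: the scene identifiers and the name `graphics_for_scene` are assumptions, and only the commodity marketing entry reflects graphics named in this document.

```python
from typing import Dict, List

# Preset graphics per picture application scene. Only some scenes have
# presets; the identifiers used as keys are hypothetical.
SCENE_GRAPHICS: Dict[str, List[str]] = {
    "commodity_marketing": ["ruler", "label", "frame", "arrow"],
}


def graphics_for_scene(scene: str) -> List[str]:
    """Return the graphics preset for a scene, or an empty list if none."""
    return list(SCENE_GRAPHICS.get(scene, []))
```

Returning a copy keeps callers from accidentally mutating the preset table.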
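In one embodiment, before or after the user's voice is received, a voice input content prompt corresponding to the picture application scene is displayed on the screen.
After the added item is added, the user may perform various operations on it. For example, when using a mobile phone, the user may change the position or the size of the added item by gestures: pressing the added item and sliding it on the screen moves it to a new position, rotating it with two fingers adjusts its angle, and sliding two fingers along its diagonal adjusts its size. In addition, the user may enter new characters in the added item or delete existing ones, or long-press the added item to display more operation buttons, such as a delete button, and thereby perform further editing operations on the added item.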
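FIG. 3 shows a flowchart of a picture processing method according to an embodiment of this specification. The method includes: in step S31, after the user opens a picture, receiving the user's voice in response to a user operation; in step S32, recognizing a first character string from the voice; in step S33, acquiring, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string, as at least one added item; and in step S34, adding the at least one added item to the picture.
Steps S31 and S32 of this method are substantially the same as steps S21 and S22 in FIG. 2 and are not described again here.
In step S33, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string is acquired, according to the preset key string library, as at least one added item.
In one embodiment, the second character string is the first character string.
The key string library may be obtained by manual curation or by machine learning, and may include key strings corresponding to each specific scene. For example, one specific scene is the commodity marketing scene. In this scene, in order to promote the item in the picture, the user needs to label various attributes of the item, such as material, size, color, price, and appearance. The key string library corresponding to the commodity marketing scene may therefore include key strings for each of these attributes. For example, the material category may include key strings denoting materials, such as "纯铜" (pure copper), "塑料" (plastic), and "玻璃" (glass); the size category may include key strings denoting units of size, such as "cm", "m", and "公分" (centimeter); the color category may include key strings denoting colors, such as "红色" (red), "藕荷色" (pale pinkish purple), and "洋红色" (magenta); the price category may include key strings denoting currency units, such as "元" (yuan) and "美元" (US dollar); and the appearance category may include key strings denoting appearance, such as "金属拉丝" (brushed metal) and "抛光" (polished).
As another example, the scene is a matchmaking scene. In this scene, in order to introduce the person in the picture, the user needs to attach various personal attribute labels to the person, such as age, major, and workplace. The key string library corresponding to the matchmaking scene may then include key strings corresponding to these attributes, such as the age unit "岁" (years old), "物理" (physics), "生物" (biology), "自动化" (automation), "公司" (company), and "事务所" (firm).
As yet another example, the scene is a selfie scene. In this scene, the user may attach labels describing mood or feelings to the selfie. The key string library corresponding to this scene may therefore include key strings such as "开心" (happy), "愤怒" (angry), and "焦虑" (anxious).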
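One possible in-memory layout for the per-scene key string libraries described above is a nested mapping from scene to attribute category to key strings. The sketch below is an assumption about representation, not the storage format of the described system; the scene and category identifiers and the helper names are hypothetical, and the grouping of the matchmaking key strings into categories is inferred from the attribute list in the text.

```python
from typing import Dict, List, Optional

KEY_STRING_LIBRARY: Dict[str, Dict[str, List[str]]] = {
    "commodity_marketing": {
        "material": ["纯铜", "塑料", "玻璃"],
        "size": ["cm", "m", "公分"],
        "color": ["红色", "藕荷色", "洋红色"],
        "price": ["元", "美元"],
        "appearance": ["金属拉丝", "抛光"],
    },
    "matchmaking": {
        "age": ["岁"],
        "major": ["物理", "生物", "自动化"],
        "workplace": ["公司", "事务所"],
    },
    "selfie": {
        "mood": ["开心", "愤怒", "焦虑"],
    },
}


def key_strings_for(scene: str) -> List[str]:
    """Flatten all key strings of a scene, in category order."""
    categories = KEY_STRING_LIBRARY.get(scene, {})
    return [key for keys in categories.values() for key in keys]


def category_of(scene: str, key_string: str) -> Optional[str]:
    """Return the attribute category of a key string within a scene."""
    for category, keys in KEY_STRING_LIBRARY.get(scene, {}).items():
        if key_string in keys:
            return category
    return None
```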
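In one embodiment, the user may select a picture application scene. For example, after the user opens the picture, scene option buttons may be displayed on the screen, and the user may select the desired picture application scene through these buttons; alternatively, the user may select the picture application scene before opening the picture. After the user selects a scene, the system acquires the added item according to the preset key string library corresponding to that scene. For example, FIG. 4 shows an example of the commodity marketing scene. After opening the picture shown in FIG. 4, the user may select the "commodity marketing scene". The system then recognizes the user's voice input as a character string and matches that character string against the key string library corresponding to the commodity marketing scene.
In one embodiment, after receiving the picture application scene selected by the user, and before or after receiving the user's voice, the system displays on the screen a voice input content prompt corresponding to the scene. FIG. 5 schematically shows the voice input content prompts displayed on the screen in the commodity marketing scene, including "120 cm long" (size), "the metal is brushed, polished pure copper" (material), "new spring style", and "wholesale price 50 yuan" (price). The voice input content prompts may be preset for a specific scene.
In one embodiment, after selecting the commodity marketing scene as described above, the user long-presses the microphone on the screen and inputs the voice "高30cm,宽35cm,五金材质是纯铜五金磨砂,装饰物为圆头钉打孔,价格120元" (height 30 cm, width 35 cm, the hardware material is frosted pure-copper hardware, the decoration is round-head stud perforation, price 120 yuan). After recognizing the voice as a character string, the system matches the character string against the key strings in the key string library corresponding to the commodity marketing scene. The material category of the key string library includes the key string "纯铜五金磨砂" (frosted pure-copper hardware), and the appearance category includes the key string "圆头钉打孔" (round-head stud perforation). The system therefore acquires "纯铜五金磨砂" and "圆头钉打孔" as added items to be added to the picture. In one embodiment, the key strings for material and appearance are preset in the key string library as corresponding to a label graphic, so that after acquiring the added items "纯铜五金磨砂" and "圆头钉打孔", the system also automatically acquires label graphics as added items. The label graphics are used to mark, in the picture, the specific position corresponding to the "纯铜五金磨砂" material and the specific position corresponding to the "圆头钉打孔" appearance.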
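The matching step in this example can be sketched with plain substring search. This is a minimal sketch under stated assumptions: the description does not specify the matching algorithm, so substring containment is a swapped-in simple technique, and the names `match_added_items`, `LIBRARY`, and `CATEGORY_GRAPHIC` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# A small excerpt of a commodity marketing library (hypothetical layout).
LIBRARY: Dict[str, List[str]] = {
    "material": ["纯铜五金磨砂", "塑料"],
    "appearance": ["圆头钉打孔", "抛光"],
}
# Categories preset as corresponding to a graphic.
CATEGORY_GRAPHIC: Dict[str, str] = {"material": "label", "appearance": "label"}


def match_added_items(
    first_string: str,
    library: Dict[str, List[str]],
    category_graphic: Dict[str, str],
) -> List[Tuple[str, Optional[str]]]:
    """Return (key string, graphic) pairs found in the recognized string,
    ordered by where each key string first occurs."""
    hits = []
    for category, keys in library.items():
        for key in keys:
            pos = first_string.find(key)
            if pos != -1:
                hits.append((pos, key, category_graphic.get(category)))
    hits.sort(key=lambda hit: hit[0])
    return [(key, graphic) for _, key, graphic in hits]


utterance = "高30cm,宽35cm,五金材质是纯铜五金磨砂,装饰物为圆头钉打孔,价格120元"
items = match_added_items(utterance, LIBRARY, CATEGORY_GRAPHIC)
# items == [("纯铜五金磨砂", "label"), ("圆头钉打孔", "label")]
```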
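In one embodiment, "cm" can be acquired from the above character string as matching the key string "cm" in the size category of the key string library, and it can be determined that, in the above character string, "cm" is preceded by a numeric character string. The strings "30cm" and "35cm" in the character string are therefore acquired as added items and added to the picture respectively. In one embodiment, "cm" is set in the key string library as corresponding to a ruler graphic, so that after acquiring the added items "30cm" and "35cm", the system also automatically acquires ruler graphics as added items.
In one embodiment, the price category of the key string library includes the key string "元" (yuan), so that the key string "元" can be acquired from the above character string. It can further be determined that, in the above character string, "元" is preceded by a numeric character string, and "120元" in the above character string is therefore acquired as an added item and added to the picture.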
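Extracting a unit key string together with the number that precedes it can be sketched with a regular expression. The function name `extract_quantities`, the optional `unit_graphic` mapping, and the tolerance for decimals and whitespace between number and unit are assumptions for illustration; the description only requires that the unit be preceded by a numeric character string.

```python
import re
from typing import Dict, List, Optional, Sequence, Tuple


def extract_quantities(
    first_string: str,
    unit_keys: Sequence[str],
    unit_graphic: Optional[Dict[str, str]] = None,
) -> List[Tuple[str, Optional[str]]]:
    """Find '<number><unit>' substrings for the given unit key strings.

    Returns (text, graphic) pairs, where graphic is the graphic preset for
    the unit (for example a ruler for "cm") or None if there is no preset.
    """
    if not unit_keys:
        return []
    unit_graphic = unit_graphic or {}
    # Try longer units first so that "cm" is preferred over "m".
    units = sorted(unit_keys, key=len, reverse=True)
    alternatives = "|".join(re.escape(unit) for unit in units)
    pattern = re.compile(r"(\d+(?:\.\d+)?)\s*(" + alternatives + ")")
    return [
        (m.group(1) + m.group(2), unit_graphic.get(m.group(2)))
        for m in pattern.finditer(first_string)
    ]


utterance = "高30cm,宽35cm,五金材质是纯铜五金磨砂,装饰物为圆头钉打孔,价格120元"
quantities = extract_quantities(
    utterance, ["cm", "m", "公分", "元"], {"cm": "ruler"}
)
# quantities == [("30cm", "ruler"), ("35cm", "ruler"), ("120元", None)]
```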
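In one embodiment, the size category of the key string library includes the key strings "高" (height) and "宽" (width), and "高" is set in the key string library as corresponding to a ruler graphic. Therefore, after acquiring the key strings "高" and "宽" from the character string, the system acquires a ruler graphic as an added item.
The added graphics are not limited to the above labels and rulers; they may also be arrows, various geometric shapes used for circling, frames, and so on. For example, a label may be set to correspond to key strings in the key string library such as colors and materials, and a ruler may be set to correspond to character strings in the key string library that denote length or units of length. In the selfie scene, for example, a frame corresponding to dialogue content, an emoticon corresponding to mood, and the like may also be added according to key string matching.
Returning to FIG. 3, in step S34, the at least one added item is added to the picture. FIG. 6 shows a schematic diagram of a text added item, a label added item, and a ruler added item respectively added to a picture. After the added item is added, the user may make at least one of the following modifications by gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item. For example, as shown in FIG. 6, for the ruler in the figure, the user may move the two ends of the ruler by gesture to change its length, rotate the ruler by gesture to change its angle, delete the ruler by gesture, and so on.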
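The modifications listed above (move, resize, rotate, edit, delete) can be modeled as operations on a small overlay structure. The sketch below is illustrative only; the class names `AddedItem` and `Overlay`, the field layout, and the mapping from gestures to method calls are assumptions and are not taken from the described system.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AddedItem:
    content: str
    x: float
    y: float
    scale: float = 1.0
    angle: float = 0.0  # rotation in degrees


@dataclass
class Overlay:
    """Added items layered on top of a picture."""
    items: List[AddedItem] = field(default_factory=list)

    def add(self, item: AddedItem) -> AddedItem:
        self.items.append(item)
        return item

    def move(self, item: AddedItem, dx: float, dy: float) -> None:
        item.x += dx  # drag gesture
        item.y += dy

    def resize(self, item: AddedItem, factor: float) -> None:
        if factor <= 0:
            raise ValueError("scale factor must be positive")
        item.scale *= factor  # two-finger diagonal slide

    def rotate(self, item: AddedItem, degrees: float) -> None:
        item.angle = (item.angle + degrees) % 360  # two-finger rotation

    def edit(self, item: AddedItem, content: str) -> None:
        item.content = content  # keyboard input

    def delete(self, item: AddedItem) -> None:
        # Remove by identity so an equal-looking item is not removed instead.
        self.items = [other for other in self.items if other is not item]
```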
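In one embodiment, as described with reference to FIG. 2, after the user opens the picture, at least one graphic preset as corresponding to the picture application scene selected by the user is acquired as at least one added item, and the at least one added item acquired according to the scene is added to the picture. Specific examples are as described with reference to FIG. 2 and are not repeated here.
In addition, after completing the above editing, the user may add a QR code to the picture through, for example, an add-QR-code interface on the screen, and may then save and share the picture. In the shared picture, the labels in the picture present the attributes of the commodity accurately and clearly, which helps buyers understand the commodity quickly and thereby promotes the marketing of the commodity.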
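FIG. 7 shows a picture processing device 700 according to an embodiment of this specification, including: a receiving unit 71 configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit 72 configured to recognize a first character string from the voice as an added item; and an adding unit 73 configured to add the added item to the picture.
FIG. 8 shows a picture processing device 800 according to an embodiment of this specification, including: a receiving unit 81 configured to receive the user's voice in response to a user operation after the user opens a picture; a recognition unit 82 configured to recognize a first character string from the voice; a first acquiring unit 83 configured to acquire, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string, as at least one added item; and a first adding unit 84 configured to add the at least one added item to the picture.
In one embodiment, in the above picture processing device 800, the first acquiring unit is further configured to acquire, from the first character string, a character string matching a key string in the key string library as an added item.
In one embodiment, in the above picture processing device 800, the first acquiring unit is further configured to acquire, from the first character string, a third character string matching a key string in the key string library, wherein the third character string denotes a unit of quantity and is preceded in the first character string by a numeric character string, and to acquire, as an added item, a character string that includes, in order, the numeric character string and the third character string.
In one embodiment, in the above picture processing device 800, the first acquiring unit is further configured to acquire, from the first character string, a fourth character string matching a key string in the key string library as an added item, wherein the fourth character string is preset as corresponding to a specific graphic, and to acquire the specific graphic as an added item.
In one embodiment, in the above picture processing device 800, the first acquiring unit is further configured to acquire, from the first character string, a fifth character string matching a key string in the key string library, wherein the fifth character string is preset as corresponding to a specific graphic, and to acquire the specific graphic as an added item.
In one embodiment, the above picture processing device 800 further includes: a second acquiring unit 85 configured to acquire, after the user opens the picture and according to the picture application scene selected by the user, at least one graphic preset as corresponding to the scene as at least one added item; and a second adding unit 86 configured to add to the picture the at least one added item acquired according to the scene.
In one embodiment, the above picture processing device 800 further includes a prompt unit 87 configured to display on the screen, after the picture application scene selected by the user is received, a voice input content prompt corresponding to the scene.
In one embodiment, the above picture processing device 800 further includes a modification unit 88 configured to make, after the added item is added to the picture, at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.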
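An embodiment of this specification further provides a computer-readable storage medium storing instruction code which, when executed in a computer, causes the computer to perform the picture processing method described above.
In the picture processing method and device according to the embodiments of this specification, labeling pictures by voice input reduces the difficulty of picture processing, greatly improves picture processing efficiency, and meets users' needs.
Those of ordinary skill in the art should further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of the examples have been described above generally in terms of function. Whether these functions are executed in hardware or software depends on the specific application and the design constraints of the technical solution. Those of ordinary skill in the art may use different methods to implement the described functions for each particular application, but such implementation should not be considered to go beyond the scope of this application.
The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be implemented in hardware, in a software module executed by a processor, or in a combination of the two. The software module may reside in random access memory (RAM), memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, a register, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.
The specific embodiments described above further explain in detail the objectives, technical solutions, and beneficial effects of the present invention. It should be understood that the above are merely specific embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.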

Claims (27)

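  1. A picture processing method, comprising:
    after a user opens a picture, receiving the user's voice in response to a user operation;
    recognizing a first character string from the voice as an added item; and
    adding the added item to the picture.
  2. A picture processing method, comprising:
    after a user opens a picture, receiving the user's voice in response to a user operation;
    recognizing a first character string from the voice;
    acquiring, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string, as at least one added item; and
    adding the at least one added item to the picture.
  3. The picture processing method according to claim 2, wherein acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item comprises:
    acquiring, from the first character string, a character string matching a key string in the key string library as the added item.
  4. The picture processing method according to claim 2, wherein acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item comprises:
    acquiring, from the first character string, a third character string matching a key string in the key string library, wherein the third character string denotes a unit of quantity and is preceded in the first character string by a numeric character string; and
    acquiring, as the added item, a character string that includes, in order, the numeric character string and the third character string.
  5. The picture processing method according to claim 2, wherein acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item comprises:
    acquiring, from the first character string, a fourth character string matching a key string in the key string library as an added item, wherein the fourth character string is preset as corresponding to a specific graphic; and
    acquiring the specific graphic as an added item.
  6. The picture processing method according to claim 2, wherein acquiring at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string as at least one added item comprises:
    acquiring, from the first character string, a fifth character string matching a key string in the key string library, wherein the fifth character string is preset as corresponding to a specific graphic; and
    acquiring the specific graphic as an added item.
  7. The picture processing method according to claim 1 or 2, further comprising:
    after the user opens the picture, acquiring, according to a picture application scene selected by the user, at least one graphic preset as corresponding to the scene as at least one added item; and
    adding to the picture the at least one added item acquired according to the scene.
  8. The picture processing method according to claim 7, wherein the picture application scene is a commodity marketing scene, and wherein the at least one graphic preset as corresponding to the commodity marketing scene includes: a ruler, a label, a frame, and an arrow.
  9. The picture processing method according to claim 2, wherein acquiring according to a preset key string library comprises acquiring according to a key string library corresponding to a picture application scene selected by the user.
  10. The picture processing method according to claim 9, wherein the scene is a commodity marketing scene, and wherein the key string library corresponding to the scene includes key strings for the following attributes: material, size, color, price, and appearance.
  11. The picture processing method according to claim 7 or 9, further comprising, before or after receiving the user's voice, displaying on the screen a voice input content prompt corresponding to the picture application scene.
  12. The picture processing method according to any one of claims 1, 2, and 7, further comprising, after the added item is added to the picture, making at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.
  13. The picture processing method according to claim 1 or 2, wherein the user opening a picture includes: the user opening the picture in an album of the user's terminal, the user opening the picture in a social app, or the user opening the picture in an app used to perform the method.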
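  14. A picture processing device, comprising:
    a receiving unit configured to receive a user's voice in response to a user operation after the user opens a picture;
    a recognition unit configured to recognize a first character string from the voice as an added item; and
    an adding unit configured to add the added item to the picture.
  15. A picture processing device, comprising:
    a receiving unit configured to receive a user's voice in response to a user operation after the user opens a picture;
    a recognition unit configured to recognize a first character string from the voice;
    a first acquiring unit configured to acquire, according to a preset key string library, at least one second character string corresponding to the first character string and/or at least one graphic corresponding to the first character string, as at least one added item; and
    a first adding unit configured to add the at least one added item to the picture.
  16. The picture processing device according to claim 15, wherein the first acquiring unit is further configured to acquire, from the first character string, a character string matching a key string in the key string library as an added item.
  17. The picture processing device according to claim 15, wherein the first acquiring unit is further configured to acquire, from the first character string, a third character string matching a key string in the key string library, wherein the third character string denotes a unit of quantity and is preceded in the first character string by a numeric character string, and to acquire, as an added item, a character string that includes, in order, the numeric character string and the third character string.
  18. The picture processing device according to claim 15, wherein the first acquiring unit is further configured to acquire, from the first character string, a fourth character string matching a key string in the key string library as an added item, wherein the fourth character string is preset as corresponding to a specific graphic, and to acquire the specific graphic as an added item.
  19. The picture processing device according to claim 15, wherein the first acquiring unit is further configured to acquire, from the first character string, a fifth character string matching a key string in the key string library, wherein the fifth character string is preset as corresponding to a specific graphic, and to acquire the specific graphic as an added item.
  20. The picture processing device according to claim 14 or 15, further comprising: a second acquiring unit configured to acquire, after the user opens the picture and according to a picture application scene selected by the user, at least one graphic preset as corresponding to the scene as at least one added item; and a second adding unit configured to add to the picture the at least one added item acquired according to the scene.
  21. The picture processing device according to claim 20, wherein the picture application scene is a commodity marketing scene, and wherein the at least one graphic preset as corresponding to the commodity marketing scene includes: a ruler, a label, a frame, and an arrow.
  22. The picture processing device according to claim 15, wherein acquiring according to a preset key string library comprises acquiring according to a key string library corresponding to a picture application scene selected by the user.
  23. The picture processing device according to claim 22, wherein the scene is a commodity marketing scene, and wherein the key string library corresponding to the scene includes key strings for the following attributes: material, size, color, price, and appearance.
  24. The picture processing device according to claim 20 or 22, further comprising a prompt unit configured to display on the screen, before or after the user's voice is received, a voice input content prompt corresponding to the picture application scene.
  25. The picture processing device according to any one of claims 14, 15, and 20, further comprising a modification unit configured to make, after the added item is added to the picture, at least one of the following modifications according to a user gesture or input: changing the position of the added item, changing the size of the added item, editing the content of the added item, and deleting the added item.
  26. The picture processing device according to claim 14 or 15, wherein the user opening a picture includes: the user opening the picture in an album of the user's terminal, the user opening the picture in a social app, or the user opening the picture in an app used to perform the method.
  27. A computer-readable storage medium storing instruction code which, when executed in a computer, causes the computer to perform the method according to any one of claims 1-13.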
PCT/CN2019/070040 2018-03-28 2019-01-02 Picture processing WO2019184539A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810266755.4 2018-03-28
CN201810266755.4A CN108805958A (zh) 2018-03-28 2018-03-28 Picture processing method and device

Publications (1)

Publication Number Publication Date
WO2019184539A1 true WO2019184539A1 (zh) 2019-10-03

Family

ID=64095398

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/070040 WO2019184539A1 (zh) 2018-03-28 2019-01-02 Picture processing

Country Status (3)

Country Link
CN (1) CN108805958A (zh)
TW (1) TWI698835B (zh)
WO (1) WO2019184539A1 (zh)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805958A (zh) * 2018-03-28 2018-11-13 阿里巴巴集团控股有限公司 Picture processing method and device
JP6807621B1 (ja) * 2020-08-05 2021-01-06 株式会社インタラクティブソリューションズ System for changing an image based on voice

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7921037B2 (en) * 2002-04-01 2011-04-05 Hewlett-Packard Development Company, L.P. Personalized messaging determined from detected content
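CN103365970A (zh) * 2013-06-25 2013-10-23 广东小天才科技有限公司 Method and device for automatically acquiring learning material information
CN104766353A (zh) * 2015-04-25 2015-07-08 陈包容 Method and device for adding text content to a background
CN108805958A (zh) * 2018-03-28 2018-11-13 阿里巴巴集团控股有限公司 Picture processing method and device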

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2409365B (en) * 2003-12-19 2009-07-08 Nokia Corp Image handling
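TWI402767B (zh) * 2008-11-28 2013-07-21 Hon Hai Prec Ind Co Ltd Electronic device with picture editing function and method
TWI534647B (zh) * 2015-07-07 2016-05-21 中華電信股份有限公司 Custom picture template system
CN105302786B (zh) * 2015-11-10 2019-05-24 百度在线网络技术(北京)有限公司 Data editing method and device
CN107707836A (zh) * 2017-09-11 2018-02-16 广东欧珀移动通信有限公司 Image processing method and device, electronic device, and computer-readable storage medium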

Also Published As

Publication number Publication date
TWI698835B (zh) 2020-07-11
TW201942873A (zh) 2019-11-01
CN108805958A (zh) 2018-11-13

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19777325

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19777325

Country of ref document: EP

Kind code of ref document: A1