WO2017000109A1 - Search method, search apparatus, user equipment, and computer program product - Google Patents

Search method, search apparatus, user equipment, and computer program product Download PDF

Info

Publication number
WO2017000109A1
WO2017000109A1 PCT/CN2015/082628 CN2015082628W WO2017000109A1 WO 2017000109 A1 WO2017000109 A1 WO 2017000109A1 CN 2015082628 W CN2015082628 W CN 2015082628W WO 2017000109 A1 WO2017000109 A1 WO 2017000109A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
target
character information
target image
information
Prior art date
Application number
PCT/CN2015/082628
Other languages
French (fr)
Chinese (zh)
Inventor
姚聪
周舒畅
周昕宇
吴育昕
Original Assignee
北京旷视科技有限公司
北京小孔科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京旷视科技有限公司, 北京小孔科技有限公司 filed Critical 北京旷视科技有限公司
Priority to CN201580000313.XA priority Critical patent/CN105518678B/en
Priority to PCT/CN2015/082628 priority patent/WO2017000109A1/en
Publication of WO2017000109A1 publication Critical patent/WO2017000109A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • the present disclosure relates to the field of information technology, and more particularly, to a search method, a search device, a user equipment, and a computer program product.
  • the Internet provides a new trading platform and entertainment platform. For example, you can buy goods on the Internet, download music, watch videos online, and more.
  • the search can be performed on the Internet based on the keywords of the object to be searched.
  • the keyword-based object search system relies on the character representation entered by the user. However, when the keyword input by the user is inaccurate or there is an error, it is difficult to obtain a satisfactory search result.
  • e-services such as e-commerce continues to expand and the number and variety of goods or services grows rapidly, consumers may need to spend more time browsing to find objects or products of interest.
  • Embodiments of the present disclosure provide a search method, a search device, a user device, and a computer program product, which enable accurate and convenient searching of related object information of a target object, thereby improving a user's use experience.
  • a search method which is applied to a server, the search method may include: receiving a search request, the search request including a target image of a target object to be searched; and extracting the target from the target image Character information and image features associated with the object; searching for related object information associated with the target object based on the character information and the image feature; transmitting the related object information.
  • the extracting the character information and the image feature associated with the target object from the target image may include: utilizing optical character recognition The OCR identifies a character and a symbol from the target image; and selects an identification character for identifying the target object from the recognized characters and symbols as character information associated with the target object.
  • the searching for related object information associated with the target object based on the character information and the image feature may include: The character information and the image feature search for the related object information from a pre-established object database, wherein the object database includes image features, character information, and associated information of each candidate object.
  • the searching for the related object information from the pre-established object database based on the character information and the image feature may include: Calculating image feature similarity between the target object and each candidate object by image features of the target image and image features of the respective candidate objects; calculating the target object based on character information of the target image and character information of each candidate object Character information similarity with each candidate object; searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
  • the searching for the location and the location information based on the image feature similarity and the character information similarity may include: performing weighted averaging on the image feature similarity and the character information similarity to obtain an average similarity between the target object and each candidate object; according to the average similarity
  • the descending order of degrees selects a predetermined number of candidate objects from the plurality of candidate objects; information corresponding to the selected candidate objects is used as related object information associated with the target object.
  • the image feature based on the target image and the image feature of each candidate object are used to calculate between the target object and each candidate object.
  • the image feature similarity may include calculating a cosine similarity between the image feature of the target object and an image feature between the respective candidate objects as the image feature similarity.
  • the character information based on the target image and the character information of each candidate object are calculated between the target object and each candidate object.
  • the character information similarity may include: calculating an edit distance between the character information of the target object and the character information of each candidate object; based on the edit distance, the length of the character information of the target object, the character of the candidate object The length of the information is used to calculate the similarity of the character information.
  • the extracting the character information and the image feature associated with the target object from the target image may include at least one of the following operations One: calculating a color histogram feature of the target image as the image feature; and calculating a word bag model feature of the target image as the image feature.
  • the target image may satisfy a predetermined condition.
  • a search method for application to a user equipment.
  • the search method may include: collecting a target image of the target object to be searched; determining whether the target image satisfies a predetermined condition; and when the target image satisfies a predetermined condition, issuing a search request for the target object, the search request including The target image; receiving related object information associated with the target object, wherein the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
  • the determining whether the target image meets the predetermined condition may include: determining an illumination parameter in the process of acquiring the target image; and when the illumination parameter is greater than or equal to When the illuminance is preset, it is determined that the target image satisfies a predetermined condition.
  • the determining whether the target image meets a predetermined condition may include: determining an average gradient of pixel points of an edge of the collected target image; When the average gradient of the pixel points of the edge of the target image is less than the preset gradient threshold, it is determined that the target image satisfies a predetermined condition.
  • a search device for use in a server.
  • the search device can include a transceiver that receives a search request, the search request including a target image of a target object to be searched, a processor, a memory, and computer program instructions stored in the memory. Performing the steps of: extracting, from the target image, character information and image features associated with the target object when the computer program instructions are executed by the processor; searching and said based on the character information and image features Relevant object information associated with the target object; the searched related object information is provided to the transceiver for transmission.
  • the extracting the character information and the image feature associated with the target object from the target image may include: using the optical character recognition OCR from the target Identifying characters and symbols in the image; selecting an identification character for identifying the target object from the recognized characters and symbols as character information associated with the target object.
  • Searching for related object information associated with the target object based on the character information and the image feature may include: searching for the related object information from a pre-established object database based on the character information and the image feature, wherein The object database includes image features, character information, and associated information of each candidate object.
  • the searching for the related object information from the pre-established object database based on the character information and the image feature may include: Calculating image feature similarity between the target object and each candidate object by image features of the target image and image features of the respective candidate objects; calculating the target object based on character information of the target image and character information of each candidate object Character information similarity with each candidate object; searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
  • the related object information associated with the target object may include: performing weighted averaging on the image feature similarity and the character information similarity to obtain an average similarity between the target object and each candidate object; according to the average similarity
  • the descending order of degrees selects a predetermined number of candidate objects from the plurality of candidate objects; information corresponding to the selected candidate objects is used as related object information associated with the target object.
  • the image feature based on the target image and the image feature of each candidate object are calculated between the target object and each candidate object.
  • the image feature similarity may include calculating a cosine similarity between the image feature of the target object and an image feature between the respective candidate objects as the image feature similarity.
  • the character information based on the target image and the character information of each candidate object are calculated between the target object and each candidate object.
  • the character information similarity may include: calculating an edit distance between the character information of the target object and the character information of each candidate object; based on the edit distance, the length of the character information of the target object, and the candidate objects The length of the character information is used to calculate the similarity of the character information.
  • extracting the character information and the image feature associated with the target object from the target image may include at least one of the following operations: Calculating a color histogram feature of the target image as the image feature; And calculating a word bag model feature of the target image as the image feature.
  • the target image may satisfy a predetermined condition.
  • a user equipment may include: an image collector for acquiring a target image of the target object to be searched; a processor for determining whether the target image satisfies a predetermined condition; and a transceiver, when the target image satisfies a predetermined condition, The issuing a search request for the target object, the search request including the target image, and receiving related object information associated with the target object, wherein the related object information is based on extraction from the target image Character information and image feature search associated with the target object are obtained.
  • the user equipment may further include an illuminometer for measuring an illumination parameter of the target object
  • the processor may instruct the illuminometer to be in image collection
  • the illumination parameter of the target object is measured during the process of acquiring the target image, and determining that the target image satisfies a predetermined condition when the illumination parameter is greater than or equal to the preset illumination.
  • the processor analyzes the target image to determine an average gradient of pixel points of the edge thereof, and a pixel point at an edge of the target image When the average gradient is less than the preset gradient threshold, it is determined that the target image satisfies a predetermined condition.
  • a computer program product for searching for an object can include a computer readable storage medium. Storing computer program instructions on the computer readable storage medium, the computer program instructions being executed by a processor to cause the processor to: receive a search request, the search request including a target image of a target object to be searched; Extracting character information and image features associated with the target object in the target image; searching for related object information associated with the target object based on the character information and the image feature; and transmitting the related object information.
  • a computer program product for searching for an object can include a computer readable storage medium.
  • Computer program instructions are stored on the computer readable storage medium.
  • the computer program instructions may be executed by a processor to cause the processor to: acquire an object image of a target object to be searched using an image collector; determine whether the target image satisfies a predetermined condition; and when the target image satisfies a predetermined condition Transmitting, by the transceiver, a search request for the target object, the search request including the target image; and receiving, by the transceiver, related object information associated with the target object, wherein the related object information is based on the The character information and image feature associated with the target object extracted in the target image are searched.
  • character information and image features associated with the target object are extracted from a target image of a target object to be searched for
  • the search is performed based on the character information and the image feature, and the related object information of the target object can be searched accurately and conveniently, thereby improving the user experience.
  • a search request is issued based on the target image, so that It can accurately and conveniently search related object information of the target object, thereby improving the user experience.
  • FIG. 1(a) schematically illustrates an application scenario according to an embodiment of the present disclosure
  • Figure 1 (b) schematically illustrates a schematic diagram of a target image taken by a user device
  • FIG. 2 is a flow chart that schematically illustrates a search method for a server in accordance with an embodiment of the present disclosure
  • FIG. 3 is a flow chart schematically illustrating related object information of a target object based on image features and character information in the search method of FIG. 2;
  • FIG. 4 is a flow chart that schematically illustrates a search method for a user equipment in accordance with an embodiment of the present disclosure
  • FIG. 5 is a block diagram schematically illustrating a first search device according to an embodiment of the present disclosure
  • FIG. 6 is a block diagram schematically illustrating a second search device for a server according to an embodiment of the present disclosure
  • FIG. 7 is a block diagram schematically illustrating a user equipment in accordance with an embodiment of the present disclosure.
  • FIG. 1(a) schematically illustrates an application scenario in accordance with an embodiment of the present disclosure.
  • the user equipment 10 is communicatively coupled to the search server 20 via a network.
  • the user device 10 is, for example, a smart phone, a tablet computer, a notebook computer, or the like.
  • the search server 20 is a cloud server, a web server, or the like. Communication between user device 10 and search server 20 may be implemented using a variety of techniques including, but not limited to, the Internet, local area networks, third generation mobile communication technologies, and the like. For example, a user of a user device browses a Taobao web page to expect to purchase a particular item, ie, a target object.
  • the user equipment is connected to the search server of Taobao via the Internet.
  • the user inputs a keyword of the product to be purchased in the Taobao webpage of the user equipment, and the user equipment transmits the keyword to the search server of Taobao via the Internet, and the latter performs a search based on the keyword and via the Internet. Sending the search result to the user equipment.
  • the keyword input by the user is inaccurate or there is an error, it is difficult to obtain a satisfactory search result.
  • a plurality of items associated with keywords may be included in the search results, which may make it impossible for the user to find the target object to be purchased from the search results.
  • the user equipment 10 performs image acquisition on a target object to be purchased using a camera or the like, and transmits the collected target image to the search server 20.
  • the search server 20 extracts character information and/or image information from the target image, and performs a search based on the extracted information, and transmits the search result to the user device via the Internet.
  • the target image it usually carries rich information about the target object, such as the appearance, name, trademark, manufacturer, date of manufacture, and the like of the target object.
  • the search server can more accurately search for the target object of the user, thereby improving the accuracy of the search.
  • the search server can automatically extract information in the target image without requiring the user to manually input keywords or the like, which makes the user's search operation more convenient.
  • FIG. 1(b) schematically illustrates a schematic diagram of a target image taken by the user device 10.
  • the captured target images (1), (2), and (3) are respectively Evian mineral water, calbee potato chips, and blue moon laundry liquid.
  • the target image (1) includes information on the appearance of the shape of the bottled water, the name of the evian, the shape of the mountain, the capacity of 550 ml, etc., based on the information, the search server 20 can accurately search for the target object of the user. However, if the user enters the keyword "Evian mineral water", it will search for Evian mineral water in different packaging, different series and different capacities.
  • the search server 20 can accurately search for each target object.
  • the search method 200 is applicable to a search server as shown in FIG. 1(a).
  • the search method 200 may include receiving a search request including a target image of a target object to be searched (S210); extracting characters associated with the target object from the target image Information and image features (S220); searching for related object information associated with the target object based on the character information and the image feature (S230); transmitting the related object information (S240).
  • the server receives a search request from the user device, the search request including a target image of the target object to be searched.
  • the target image is any one of the target images as shown in FIG. 1(b).
  • the target image contains various information of the target object to be searched, including but not limited to brand name, object content, series, appearance, capacity, production date, and the like.
  • the target image may be collected by the user equipment by using the image collection device, or may be received by the user equipment from other electronic devices.
  • the manner in which the user equipment acquires the target image does not constitute a limitation on the embodiments of the present disclosure.
  • the server extracts information from the target image to search for a target object. Accordingly, the image quality of the target image will directly affect the search results. For example, in the target image (1) of FIG. 1(b), if the target image is blurred and information such as the brand name evian, capacity, and the like cannot be extracted, it is difficult to accurately search for the target object. Therefore, a request can be made for the target image, for example, the target image satisfies a predetermined condition.
  • the predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
  • the target image when the brightness of the target image is greater than or equal to the preset brightness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset brightness threshold, it is determined that the target image does not satisfy the predetermined condition.
  • the sharpness of the target image when the sharpness of the target image is greater than or equal to the preset sharpness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset sharpness threshold, it is determined that the target image does not satisfy the predetermined condition.
  • the preset brightness threshold or the preset definition threshold may be adjusted according to the processing capability of the server.
  • the preset brightness threshold or the preset definition threshold may be set to a lower value; when the processing capability of the server is weak, the preset brightness threshold or the preset definition threshold may be set. Is a higher value.
  • character information associated with the target object is extracted from the target image and Image features.
  • the character information included in the target image is, for example, a product name, a capacity, a brand name, a date of manufacture, and the like, and the character information is a character or a symbol.
  • the image special diagnosis included in the target image is a color component of the image, a composition ratio of each color component, and the like. Typically, different techniques are employed to extract character information and image features in the target image.
  • the character information in the target image can be extracted by using Optical Character Recognition (OCR) technology.
  • OCR Optical Character Recognition
  • the server determines its shape by detecting the dark and bright patterns of the target image, and then uses the character recognition method to translate the shape into computer text.
  • other techniques may be employed to perform character recognition on the target image to obtain character information therein.
  • Character information associated with the target object may be extracted from the target image by recognizing words and symbols from the target image using optical character recognition OCR; selecting for identification from the identified characters and symbols
  • the identification character of the target object is the character information associated with the target object.
  • rich information is included in the target image, and some of the information may be closely related to the search of the target object, such as product name, brand, capacity, and the like.
  • the target image may also include information that is not related to the search of the target object, such as components, security reminders, etc., which may be information related to all similar products, which cannot be used to identify the target object. Therefore, after performing character recognition on the target image, it is necessary to filter out information required for searching the target object, that is, an identification character for identifying the target object.
  • Image features are index-valued image feature representations, such as using vectors to represent image characteristics.
  • the image features of the target image may be represented in various ways that are present or appearing in the future.
  • a color histogram and a Bag of Words feature are taken as an example of an image feature. It is to be noted that, in the application, any one of the color histogram and the word bag model feature may be used to represent the image feature of the target image, and both the color histogram and the bag model feature may be used to represent the image feature of the target image.
  • the extracting the image object associated with the target object from the target image includes at least one of: calculating a color histogram feature of the target image as the image feature; and calculating a location
  • the word bag model feature of the target image is used as the image feature.
  • a color histogram is a statistical representation of the color characteristics of an image that is used to represent the proportion of different colors in the entire target image, without concern for the spatial location of each color.
  • Color histograms are closely related to how color space is represented. Common color histograms include RGB spatial color histograms, HSV spatial color histograms, and Lab space color histograms. In different color spaces The color histogram of the target image has different values.
  • the word bag model feature is a statistical representation of the texture features of an image that can effectively describe the overall and local characteristics of the image.
  • the word bag model feature of the target image can be obtained by extracting feature descriptors from the target image, such as Scale Invariant Feature Transform (SIFT), Directional Histogram (HOG, Histogram of Oriented Gradient). ); for each descriptor, search for the most similar cluster center in the pre-accurate codebook, and count the frequency of occurrence of different cluster centers in the target image to form a histogram; The processing is performed to obtain the word bag model feature of the target image.
  • SIFT Scale Invariant Feature Transform
  • HOG Directional Histogram of Oriented Gradient
  • the pre-accurate codebook can be obtained by randomly extracting a large number of image descriptors (for example, SIFT, HOG, etc.) from the set of training images, and clustering the image descriptors by using a clustering algorithm to obtain multiple Category, all the categories obtained by clustering constitute the codebook.
  • image descriptors for example, SIFT, HOG, etc.
  • related object information associated with the target object is searched based on the character information and the image feature obtained in S220. Specifically, the related object information is searched from a pre-established object database based on the character information and the image feature.
  • the object database includes image features, character information, and associated information of respective candidate objects.
  • f I (p j ) represents an image feature of the object p j , which may be a color histogram feature, or a bag model feature, or a vector composed of color histogram features and word bag model features.
  • f T (p j ) is character information of the object p j , which is typically a character string such as a name, a brand, a content, and the like.
  • a(p j ) represents other associated information associated with the object p j , such as price, sales volume, user rating, promotional video, and hyperlinks.
  • each object p j can also be represented by a binary group ⁇ f I (p j ), f T (p j ) ⁇ .
  • the image feature and the character information of the target image of the target object q to be searched are f I (q) and f T (q), respectively, and accordingly, the character information f T (q) and the image obtained in S220 can be obtained.
  • the feature f I (q) is compared with the character information f T (p j ) of each candidate object in the object database P and the image feature f I (p j ) to perform a search.
  • FIG. 3 is a flowchart schematically illustrating related object information (S230) of searching for a target object based on image features and character information in the search method of FIG. 2.
  • image feature similarity between the target object and each candidate object is calculated based on image features of the target image and image features of the respective candidate objects (S231); character information based on the target image and each device Selecting character information of the object to calculate a character information similarity between the target object and each candidate object (S232); performing weighted averaging on the image feature similarity and the character information similarity to obtain the target object and each device Selecting an average similarity between the objects (S233); selecting a predetermined number of candidate objects from the plurality of candidate objects in descending order of the average similarity (S234); corresponding to the selected candidate objects
  • the information is related object information associated with the target object (S235).
  • a cosine similarity s I (q, p j ) between the image feature f I (q) of the target object q and the image feature f I (p j ) between the respective candidate objects p j can be calculated. ) as the image feature similarity.
  • the cosine similarity s I (q, p j ) can be calculated by the following formula (1):
  • the character information similarity between the target object q and each candidate object p j may be calculated as follows: calculating character information f T (q) of the target object q and each candidate object p j An edit distance d(f T (q), f T (p j )) between the character information f T (p j ); based on the edit distance, the length of the character information f T (q) of the target object, The character information similarity is calculated by the length of the character information f T (p j ) of the candidate object.
  • Edit distance is the minimum number of edit operations required to convert from one string to another between two strings. The allowed editing operations include replacing one character with another, inserting a character, and deleting One character.
  • the edit distance d(f T (q), f T (p j )) is the minimum number of editing operations required to convert the character information f T (q) into the character information f T (p j ).
  • Character information length f T (q) for example, the number of characters and symbols included in the character information f T (q) in.
  • the length of the object character information f T (p j) for example, the number of characters and symbols included in the character information f T (p j) in.
  • the character information similarity s T (q, p j ) can be calculated by the following formula (2):
  • d(f T (q), f T (p j )) is the edit distance between the character information f T (q) and the character information f T (p j ), and L(f T (q)) is a character
  • the length of the information f T (q), L(f T (p j )), is the length of the character information f T (p j ).
  • the image feature similarity s I (q, p j ) and the character information similarity s T (q, p j ) are weighted and averaged to obtain an average between the target object and each candidate object. Similarity.
  • the average similarity s(q, p j ) can be calculated by the following formula (3):
  • is the weight coefficient.
  • the image feature similarity s I (q, p j ) increases in the average similarity
  • the character information similarity s T (q, p j ) decreases in the average similarity.
  • the weight coefficient ⁇ decreases, the image feature similarity s I (q, p j ) decreases in the average similarity, and the character information similarity s T (q, p j ) increases in the average similarity.
  • a predetermined number of candidate objects are selected from the plurality of candidate objects in descending order of the average similarity s(q, p j ).
  • the average similarities may be arranged in descending order, and for example, a predetermined number of R candidate objects with an average degree of similarity are selected, and the R candidate objects are search results.
  • the average similarity between the R candidate objects and the target object is high, indicating that the R candidate objects are closer to the target object, so that there is a larger target object that the user desires.
  • R is a configurable parameter, and its typical value can be set to 10, 20, 100, and so on.
  • information corresponding to the selected R candidate objects is taken as related object information associated with the target object.
  • the picture, the character description, the related information, and the like of the R objects are used as related object information.
  • the related information is, for example, price, sales volume, user rating, promotional video, hyperlink, and the like.
  • related object information associated with the target object is searched from the plurality of candidate objects based on the image feature similarity and the character information similarity.
  • the related object information may be searched in such a manner that R1 candidate objects are selected from the plurality of candidate objects in descending order of image feature similarity; in descending order of similarity of character information R2 candidate objects are selected from the plurality of candidate objects; information corresponding to the selected R1 candidate objects and R2 candidate objects is used as related object information associated with the target object.
  • R1 is a natural number smaller than N.
  • R2 is also a natural number smaller than N.
  • the server sends the searched related object information as a search result to the user equipment.
  • the server can transmit the correlation by using various networks or communication technologies such as the Internet and a local area network.
  • Object information is, for example, a picture, a text description, and associated information of the R candidate objects, or a picture, a text description, and associated information of the R1 plus R2 candidate object described above.
  • the user equipment may display the related object information on the screen of the user equipment for the user to view.
  • character information and image features associated with the target object are extracted from a target image of a target object to be searched based on the character information and
  • the image feature performs a search, and can accurately and conveniently search related object information of the target object, thereby improving the user experience.
  • the step of manually inputting a keyword by the user is eliminated by automatically recognizing the character information contained in the target image.
  • the search method 400 may include: collecting a target image of a target object to be searched (S410); determining whether the target image satisfies a predetermined condition (S420); and when the target image satisfies a predetermined condition, issuing a search request for the target object, the search request including the target image (S430); receiving related object information associated with the target object (S440), wherein the related object information is based on from the target image
  • S410 target image of a target object to be searched
  • S420 determining whether the target image satisfies a predetermined condition
  • S440 receiving related object information associated with the target object
  • the related object information is based on from the target image
  • the extracted character information and image feature search associated with the target object are obtained.
  • the image capturing device in the user device may be utilized to collect the target image of the target object to be searched. For example, if the blue moon laundry liquid in the user's FIG. 1(b) is exhausted and it is desired to purchase the blue moon laundry liquid, the user utilizes an image capturing device built in the user device 10 or an image acquisition connected to the user device. The device performs image acquisition on the existing Blue Moon laundry detergent. The positional relationship between the image capture device and the user equipment does not constitute a limitation on the embodiments of the present disclosure.
  • the server is to extract information from the target image to search for the target object, the image quality of the target image will directly affect the search result.
  • the target image (1) of FIG. 1(b) as an example, if the target image is blurred and information such as the brand name evian, capacity, etc. cannot be extracted, it is difficult to accurately search for the target object.
  • a requirement may be made for the target image at S420, for example, the target image satisfies a predetermined condition.
  • the predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
  • the target image acquired in S410 may be converted into image data of the HSL color space in which the luminance information is included in the image data of the HSL color space. Then, the average value of the illumination components (ie, L components) of all pixels in the image data of the HSL color space is counted Average value of the illumination component used for the target image When the predetermined brightness threshold T L is greater than or equal to, it may be determined that the target image satisfies a predetermined condition. Average value of the illumination component used for the target image When it is less than the predetermined brightness threshold T L , it may be judged that the target image does not satisfy the predetermined condition.
  • the predetermined brightness threshold T L is typically 64.
  • the quality of the target image can be indirectly determined by measuring the lighting conditions in the image acquisition environment. For example, the illumination parameter in the process of acquiring the target image may be determined; when the illumination parameter is greater than or equal to the preset illumination, determining that the target image satisfies a predetermined condition; when the illumination parameter is less than the preset illumination, determining the location The target image does not satisfy the predetermined condition.
  • an edge of the target image acquired in S410 may be extracted using a predetermined algorithm (for example, Canny algorithm) in S420, and each of the edges located in the target image is calculated.
  • the gradient G of the pixel and then further calculate the average of the gradients of all the pixel points at the edge in the target image.
  • the average of the gradients of all pixel points at the edge of the target image When it is greater than or equal to the preset gradient threshold T G , it may be determined that the target image satisfies a predetermined condition.
  • the average of the gradients of all pixel points at the edge of the target image When it is less than the preset gradient threshold T G , it may be determined that the target image does not satisfy the predetermined condition.
  • the preset gradient threshold T G is typically 100.
  • the predetermined brightness threshold T L or the preset gradient threshold T G described above may be adjusted according to the processing capability of the server performing the search. For example, when the processing capability of the server is strong, the predetermined brightness threshold T L or the preset gradient threshold T G may be set to a lower value; when the processing capability of the server is weak, the predetermined brightness threshold T L or a preset gradient may be used. The threshold T G is set to a higher value.
  • a search request for the target object is issued in S430, the search request including the target image.
  • the search server 20 extracts character information and image features associated with the target object from the target image, and performs a search based on the character information and the image feature, ie, The various steps of the search method described in connection with FIG. 2 are performed. Since the brightness or sharpness of the target image is good, the character information and the image feature can be accurately extracted in the server, thereby ensuring the accuracy of the search.
  • a retake prompt message may be output in the user equipment to prompt re-execution S410 to collect the target image of the target object to be searched.
  • the retake prompt message it is also possible to specifically list the reason why the target image does not satisfy the predetermined condition.
  • the average of the illumination components of the target image When less than the predetermined brightness threshold T L , the brightness may be indicated in the replay prompt message; the average of the gradients of all the pixel points located at the edge of the target image
  • the preset gradient threshold T G is smaller than the preset gradient threshold T G , it is indicated that the sharpness is insufficient in the replay prompt message.
  • the shooting of the target image can be adjusted according to the replay prompt message until the target image that satisfies the predetermined condition is acquired.
  • the setting parameters of the image pickup device may be automatically adjusted in accordance with the determination result of S420 until the target image satisfying the predetermined condition is acquired.
  • the server After the user device issues a search request to the server in S430, the server performs the search method described in connection with FIGS. 2 and 3 and obtains related object information associated with the target object. That is, the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
  • the user equipment receives relevant object information associated with the target object in S440.
  • the user equipment can receive the related object information from the server by using various networks or communication technologies such as the Internet and a local area network.
  • the related object information is, for example, a picture of a plurality of candidate objects, a text description, and associated information.
  • the associated information is, for example, price, sales volume, user rating, promotional video, hyperlinks, etc., which assists the user in performing selection operations among a plurality of candidate objects.
  • the user equipment may display the related object information on the screen of the user equipment for the user to view.
  • the user device can automatically calculate the illumination condition and the degree of clarity of the image. If the lighting conditions and clarity of the image meet the requirements, the user device is allowed to issue a search request based on the acquired target image. If the lighting conditions and clarity of the image do not meet the requirements, the user equipment is prompted or automatically instructed to re-shoot until the desired target image is obtained.
  • a search request is issued based on the target image, so that the target object can be searched accurately and conveniently Relevant object information, thereby improving the user experience.
  • FIG. 5 is a block diagram schematically illustrating a first search device 500 in accordance with an embodiment of the present disclosure.
  • the first search device 500 can be applied to a user equipment or server.
  • the first data processing apparatus 500 may include one or more processors 510, a storage unit 520, an input unit 530, an output unit 540, a communication unit 550, and an image acquisition unit 560. These components are interconnected by a bus system 570 and/or other form of connection mechanism (not shown).
  • the first search shown in Figure 5 The components and structures of the device 500 are merely exemplary and not limiting.
  • the first search device 500 may also have other components and structures as needed, and may, for example, not include the input unit 530, the output unit 540, and the image acquisition unit 560. Wait.
  • Processor 510 can be a central processing unit (CPU) or other form of processing unit with data processing capabilities and/or instruction execution capabilities, and can control other components in first search device 500 to perform desired functions.
  • CPU central processing unit
  • Processor 510 can be a central processing unit (CPU) or other form of processing unit with data processing capabilities and/or instruction execution capabilities, and can control other components in first search device 500 to perform desired functions.
  • Storage unit 520 can include one or more computer program products, which can include various forms of computer readable storage media, such as volatile memory and/or nonvolatile memory.
  • the volatile memory may include, for example, a random access memory (RAM) and/or a cache or the like.
  • the nonvolatile memory may include, for example, a read only memory (ROM), a hard disk, a flash memory, or the like.
  • One or more computer program instructions may be stored on the computer readable storage medium, and the processor 510 may execute the program instructions to implement various of the search methods described above in connection with FIGS. 2 and 3 of embodiments of the present disclosure. Step, at this time, the first search device 500 can be included in the server.
  • the processor 510 can execute the program instructions to implement the various steps of the search method described above in connection with FIG. 4 of the embodiments of the present disclosure, at which time the first search device 500 can be included in the user equipment.
  • Various applications and various data such as an operating state of the display screen, an operational state of the application, and the like can also be stored in the computer readable storage medium.
  • the input unit 530 may be a unit used by a user to input an instruction, and may include one or more of a keyboard, a mouse, a microphone, a touch screen, and the like.
  • the output unit 540 may output various information (such as an image or a sound) to the outside (for example, a user), and may include one or more of a display, a speaker, and the like.
  • Communication unit 550 can communicate with other units (e.g., personal computers, servers, mobile stations, base stations, etc.) via a network or other technology, which can be the Internet, a wireless local area network, a mobile communication network, and the like.
  • the character information and the image feature associated with the target object are extracted from the target image of the target object to be searched, based on the character information and the image feature.
  • the search can accurately and conveniently search for related object information of the target object, thereby improving the user experience.
  • FIG. 6 is a block diagram schematically illustrating a second search device 600 for a server in accordance with an embodiment of the present disclosure.
  • the second search device 600 is applicable to a search server as shown in FIG. 1(a).
  • the second search device 600 may include a first receiving unit 610, an extracting unit 620, and a search. Unit 630 and first transmitting unit 640.
  • the first receiving unit 610 receives a search request including a target image of a target object to be searched for.
  • the target image is any one of the target images as shown in FIG. 1(b).
  • the target image contains various information of the target object to be searched, including but not limited to brand name, object content, series, appearance, capacity, production date, and the like.
  • the target image may be collected by the user equipment by using the image collection device, or may be received by the user equipment from other electronic devices, and the manner in which the target image is acquired does not constitute a limitation on the embodiments of the present disclosure.
  • the first receiving unit 610 corresponds to the communication unit 550 in FIG. 5 and can be implemented by using a radio frequency circuit and a signal receiving circuit.
  • the target image preferably satisfies a predetermined condition.
  • the predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
  • the sharpness of the target image is greater than or equal to the preset sharpness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset sharpness threshold, it is determined that the target image does not satisfy the predetermined condition.
  • the preset brightness threshold or the preset definition threshold may be adjusted according to the processing capability of the server. For example, when the processing capability of the server is strong, the preset brightness threshold or the preset definition threshold may be set to a lower value; when the processing capability of the server is weak, the preset brightness threshold or the preset definition threshold may be set. Is a higher value.
  • the extracting unit 620 extracts character information and image features associated with the target object from the target image.
  • the character information included in the target image is, for example, a product name, a capacity, a brand name, a date of manufacture, and the like, and the character information is a character or a symbol.
  • the image special diagnosis included in the target image is a color component of the image, a composition ratio of each color component, and the like. Typically, different techniques are employed to extract character information and image features in the target image.
  • Extraction unit 620 can be implemented using the memory and processor of FIG.
  • the extracting unit 620 may extract the character information in the target image using OCR technology or other techniques.
  • OCR technology the server determines its shape by detecting the dark and bright patterns of the target image, and then uses the character recognition method to translate the shape into computer text.
  • the extracting unit 620 may include an OCR module, and may extract the target pair from the target image by the following operation
  • OCR module may extract the target pair from the target image by the following operation
  • Linked character information Rich information is included in the target image, and some of the information may be closely related to the search of the target object, such as product name, brand, capacity, and the like.
  • the target image may also include information that is not related to the search of the target object, such as components, security reminders, etc., which may be information related to all similar products, which cannot be used to identify the target object. Therefore, after performing the character recognition on the target image, the extracting unit 620 needs to filter out information required for searching the target object, that is, the identification character for identifying the target object.
  • Image features are index-valued image feature representations, such as using vectors to represent image characteristics.
  • the image features of the target image may be represented in various ways that are present or appearing in the future.
  • the extracting unit 620 may include an image feature extraction module, and the image feature extraction module may perform at least one of: extracting an image feature: calculating a color histogram feature of the target image as the image feature; and calculating the image
  • the word bag model feature of the target image is used as the image feature. That is, the extracting unit 620 may represent the image features of the target image using at least one of a color histogram and a bag model feature.
  • a color histogram is a statistical representation of the color characteristics of an image that is used to represent the proportion of different colors in the entire target image, without concern for the spatial location of each color. Common color histograms include RGB spatial color histograms, HSV spatial color histograms, and Lab space color histograms. In different color spaces, the color histogram of the target image has different values.
  • the word bag model feature is a statistical representation of the texture features of an image that can effectively describe the overall and local characteristics of the image.
  • the extracting unit 620 can obtain the word bag model feature of the target image by extracting feature descriptors such as SIFT, HOG, etc.
  • the clustering center counts the frequency of occurrence of different clustering centers in the target image to form a histogram; normalizes the histogram to obtain the word bag model feature of the target image.
  • the pre-accurate codebook can be obtained by randomly extracting a large number of image descriptors from a set of training images, and clustering the image descriptors by using a clustering algorithm to obtain a plurality of categories, and all the clusters are obtained.
  • the category is the codebook.
  • the search unit 630 searches for related object information associated with the target object based on the character information and the image feature. For example, the search unit 630 searches the related object information from a pre-established object database based on the character information and the image feature.
  • each object p j can also be represented by a binary group ⁇ f I (p j ), f T (p j ) ⁇ . It is assumed that the image features and character information of the target image of the target object q to be searched are f I (q) and f T (q), respectively, and accordingly, the search unit 630 can pass the character information f T extracted by the extracting unit 620 ( q) The image feature f I (q) is compared with the character information f T (p j ) of each candidate object in the object database P and the image feature f I (p j ) to perform a search. Search unit 630 can be implemented using the memory and processor of FIG.
  • the searching unit 630 is operable to search for related object information associated with the target object: calculating image features similar to the target object and each candidate object based on the image features of the target image and the image features of the respective candidate objects Calculating a similarity of character information between the target object and each candidate object based on the character information of the target image and the character information of each candidate object; and based on the image feature similarity and the character information similarity Searching for related object information associated with the target object from the plurality of candidate objects.
  • the search unit 630 may calculate a cosine similarity s I (q, between the image feature f I (q) of the target object q and the image feature f I (p j ) between the respective candidate objects p j , p j ) as the image feature similarity.
  • the search unit 630 can calculate the cosine similarity s I (q, p j ) according to the above formula (1), and can be specifically referred to the description above in connection with the formula (1).
  • the search unit 630 may also take the Pearson correlation coefficient between the image feature of the target object and the image feature between the respective candidate objects as the image feature similarity.
  • the search unit 630 may be calculated as the similarity of the character information: calculating the edit distance d between the target object character information q f T (q) of each candidate character information of the object p j f T (p j) (f T (q), f T (p j )); based on the edit distance, the length of the character information f T (q) of the target object, the length of the character information f T (p j ) of the candidate object To calculate the similarity of the character information.
  • the edit distance d(f T (q), f T (p j )) is the minimum number of editing operations required to convert the character information f T (q) into the character information f T (p j ).
  • the search unit 630 can calculate the character information similarity s T (q, p j ), for example, by the above formula (2). Alternatively, the search unit 630 may also use the edit distance d(f T (q), f T (p j )) as the character information similarity.
  • the searching unit 630 may search for related object information by selecting R1 candidate objects from the plurality of candidate objects in descending order of image feature similarity; from the plurality of devices in descending order of character information similarity R2 candidate objects are selected among the selected objects; information corresponding to the selected R1 candidate objects and R2 candidate objects is used as related object information associated with the target object.
  • R1 is a natural number smaller than N.
  • R2 is also a natural number smaller than N.
  • the searching unit 630 may further search for related object information based on the image feature similarity and the character information similarity in such a manner that the image feature similarity and the character information similarity are weighted and averaged to obtain the target.
  • R is a configurable parameter, and its typical value can be set to 10, 20, 100, and so on.
  • the first sending unit 640 sends the related object information, that is, the searched related object information is sent to the user equipment as a search result.
  • the first transmitting unit 640 can transmit the related object information by using various networks or communication technologies such as the Internet and a local area network.
  • the related object information is, for example, a picture, a text description, and associated information of the R candidate objects, or a picture, a text description, and associated information of the R1 plus R2 candidate object described above.
  • the user equipment may display the related object information on the screen of the user equipment for the user to view.
  • the first transmitting unit 640 may correspond to the communication unit 550 in FIG. 5 and may be implemented by using a radio frequency circuit and a signal transmitting circuit.
  • the character information and the image feature associated with the target object are extracted from the target image of the target object to be searched, based on the character
  • the information and image feature performs a search, and can accurately and conveniently search for related object information of the target object, thereby improving the user experience.
  • the step of manually inputting a keyword by the user is eliminated by automatically recognizing the character information contained in the target image.
  • FIG. 7 is a block diagram that schematically illustrates a user device 700 in accordance with an embodiment of the present disclosure.
  • the user equipment 700 corresponds to the user equipment shown in FIG. 1(a).
  • the user equipment 700 may include an image acquisition unit 710, a determination unit 720, a second transmission unit 730, and a second reception unit 740.
  • the image acquisition unit 710 collects a target image of the target object to be searched for.
  • Image acquisition unit 710 is typically disposed in the user device. For example, if the user's blue moon laundry liquid is exhausted and it is desired to purchase the blue moon laundry liquid, the user uses the image acquisition unit 710 to perform image acquisition on the existing blue moon laundry liquid.
  • the image acquisition unit 710 is illustrated as being included in the user equipment in FIG. 7, but the image acquisition unit 710 may also be external to the user equipment, coupled to the user equipment, and capable of receiving instructions of the user equipment, And transmitting the acquired target image to the user equipment.
  • the positional relationship between the image capture device and the user equipment does not constitute a limitation on the embodiments of the present disclosure.
  • the image acquisition unit 710 can be a camera, a camera, or the like.
  • the image acquisition unit 710 corresponds to the image acquisition unit 560 of FIG.
  • the judging unit 720 judges whether or not the target image satisfies a predetermined condition. Since the server is to extract information from the target image to search for the target object, the image quality of the target image will directly affect the search result.
  • the determining unit 720 can make a request for the target image using a predetermined condition.
  • the predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
  • the determining unit 720 can be implemented using the memory and processor in FIG.
  • the determination unit 720 may convert the acquired target image into image data of an HSL color space in which the luminance information is included in the image data of the HSL color space. Then, the judging unit 720 counts the average value of the illumination components (ie, the L component) of all the pixels in the image data of the HSL color space. And comparing it to a predetermined brightness threshold T L . Average value of the illumination component used for the target image When the predetermined brightness threshold T L is greater than or equal to, the determination unit 720 may determine that the target image satisfies a predetermined condition.
  • the determination unit 720 may determine that the target image does not satisfy the predetermined condition.
  • the predetermined brightness threshold T L is typically 64.
  • the determining unit 720 can also indirectly determine the quality of the target image by measuring the lighting conditions in the image capturing environment by means of the illuminometer.
  • the user equipment 700 may further include an illuminometer 750 for measuring an illumination parameter of the target object, the determination unit 720 communicating with the illuminometer to determine an illumination parameter in the process of acquiring the target image; When the parameter is greater than or equal to the preset illuminance, it is determined that the target image satisfies a predetermined condition; when the illumination parameter is less than the preset illuminance, it is determined that the target image does not satisfy the predetermined condition.
  • the determination unit 720 may extract an edge of the acquired target image using a predetermined algorithm (for example, the Canny algorithm), and calculate the edge located in the target image.
  • the gradient G of each pixel and then further calculate the average of the gradients of all the pixel points at the edge in the target image
  • the average of the gradients of all pixel points at the edge of the target image When it is greater than or equal to the preset gradient threshold T G , the determining unit 720 may determine that the target image satisfies a predetermined condition.
  • the average of the gradients of all pixel points at the edge of the target image When it is less than the preset gradient threshold T G , the determination unit 720 may determine that the target image does not satisfy the predetermined condition.
  • the preset gradient threshold T G is typically 100.
  • the predetermined brightness threshold T L or the preset gradient threshold T G described above may be adjusted according to the processing capability of the server performing the search. For example, when the processing capability of the server is strong, the predetermined brightness threshold T L or the preset gradient threshold T G may be set to a lower value; when the processing capability of the server is weak, the predetermined brightness threshold T L or a preset gradient may be used. The threshold T G is set to a higher value.
  • the second transmitting unit 730 issues a search request for the target object when the target image satisfies a predetermined condition, the search request including the target image.
  • a search device as shown in FIG. 5 or FIG. 6 extracts character information and image features associated with the target object from the target image, and performs a search based on the character information and the image features. For example, after receiving the search request, the first receiving unit 610 in FIG.
  • the COR module extracts character information associated with the target object from the target image; the search unit 630 searches for the related object information associated with the target object from the object database based on the character information and the image feature;
  • the two transmitting unit 640 transmits the searched related object information to the user equipment. Since the brightness or sharpness of the target image is good, the character information and the image feature can be accurately extracted in the server, thereby ensuring the accuracy of the search.
  • the second transmitting unit 730 corresponds to the transceiver unit 550 in FIG. 5, and can be implemented by using a radio frequency circuit and a signal transmitting circuit.
  • the user equipment 700 may further include an output unit for outputting a retake prompt message to prompt the user to operate the image collection device to collect the target image of the target object to be searched.
  • a retake prompt message it is also possible to specifically list the reason why the target image does not satisfy the predetermined condition.
  • the average of the illumination components of the target image When less than the predetermined brightness threshold T L , it may be indicated that the brightness is insufficient in the re-scuing message; the average of the gradients of all the pixel points located at the edge of the target image
  • the preset gradient threshold T G is smaller than the preset gradient threshold T G , it is indicated that the sharpness is insufficient in the replay prompt message.
  • the shooting of the target image can be adjusted according to the replay prompt message until the target image that satisfies the predetermined condition is acquired.
  • the determination unit 720 determines that the target image does not satisfy the predetermined condition
  • the setting parameters of the image acquisition unit 710 may be automatically adjusted until the target image satisfying the predetermined condition is acquired.
  • the second receiving unit 740 receives related object information associated with the target object. After the second transmitting unit 730 issues a search request to the server, the server performs the search method described in connection with FIGS. 2 and 3, and obtains related object information associated with the target object. Correspondingly, the second receiving unit 740 receives related object information associated with the target object. The related object information is obtained based on character information and image feature search associated with the target object extracted from the target image. The second receiving unit 740 can receive the related object information from the server through various networks or communication technologies such as the Internet, a local area network, and the like.
  • the related object information is, for example, a picture of a plurality of candidate objects, a text description, and associated information.
  • the associated information is, for example, price, sales volume, user rating, promotional video, hyperlinks, etc., which assists the user in performing selection operations among a plurality of candidate objects.
  • the second receiving unit 740 may display the related object information on the screen of the user equipment for the user to view.
  • the second receiving unit 740 corresponds to the transceiver unit 550 in FIG. 5 and can be implemented by using a radio frequency circuit and a signal receiving circuit.
  • the user device can automatically calculate the illumination condition and the degree of clarity of the image. If the lighting conditions and clarity of the image meet the requirements, the user device is allowed to issue a search request based on the acquired target image. If the lighting conditions and clarity of the image do not meet the requirements, the user equipment is prompted or automatically instructed to re-shoot until the desired target image is obtained.
  • the search request is issued based on the target image, so that the related object information of the target object can be accurately and conveniently searched, Thereby improving the user experience.
  • an electronic device or server including any of the first search device and the second search device is also within the scope of the present disclosure.
  • the disclosed apparatus and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or Can be integrated into another device, or some features can be ignored or not executed.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

A search method, a search apparatus, a user equipment, and a computer program product. The search method used for a server comprises: receiving a search request, the search request comprising a target image of a target object to be searched for (S210); extracting character information and image characteristic associated with the target object from the target image (S220); searching for related object information associated with the target object according to the character information and the image characteristic (S230); and sending the related object information (S240). Related object information of a target object can be accurately and conveniently searched for, thereby improving use experience of users.

Description

搜索方法、搜索装置、用户设备和计算机程序产品Search method, search device, user equipment, and computer program product 技术领域Technical field
本公开涉及信息技术领域,更具体地,涉及一种搜索方法、搜索装置、用户设备和计算机程序产品。The present disclosure relates to the field of information technology, and more particularly, to a search method, a search device, a user equipment, and a computer program product.
背景技术Background technique
随着互联网的发展以及用户设备的普及,基于用户设备的电子商务在近年来呈现蓬勃发展之势。通过用户设备在互联上搜索和购买商品已经成为人们日常生活中常见活动。所述用户设备例如为智能手机、平板计算机、笔记本计算机等。互联网提供了新交易平台和娱乐平台。例如,可以在互联网上购买商品、下载音乐、在线观看视频等。With the development of the Internet and the popularity of user equipment, e-commerce based on user equipment has been booming in recent years. Searching and purchasing goods on the Internet through user devices has become a common activity in people's daily lives. The user equipment is, for example, a smart phone, a tablet computer, a notebook computer, or the like. The Internet provides a new trading platform and entertainment platform. For example, you can buy goods on the Internet, download music, watch videos online, and more.
典型地,可以基于要搜索的对象的关键词在互联网上执行搜索。基于关键词的对象搜索系统依赖于用户输入的字符表述。然而,当用户输入的关键词不准确或存在错误时,难以获得令人满意的搜索结果。随着诸如电子商务的电子服务规模的不断扩大,商品或服务个数和种类快速增长,消费者可能需要花费较多的时间进行浏览才能找到自己关注的对象或商品。Typically, the search can be performed on the Internet based on the keywords of the object to be searched. The keyword-based object search system relies on the character representation entered by the user. However, when the keyword input by the user is inaccurate or there is an error, it is difficult to obtain a satisfactory search result. As the scale of e-services such as e-commerce continues to expand and the number and variety of goods or services grows rapidly, consumers may need to spend more time browsing to find objects or products of interest.
因此,期望提供一种搜索技术来帮助用户准确地搜索到感兴趣的商品或服务,并提供更丰富的信息和更细致的服务,从而提高用户的使用体验。Therefore, it is desirable to provide a search technology to help users accurately search for goods or services of interest, and provide richer information and more detailed services, thereby improving the user experience.
发明内容Summary of the invention
本公开实施例提供一种搜索方法、搜索装置、用户设备和计算机程序产品,其使能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。Embodiments of the present disclosure provide a search method, a search device, a user device, and a computer program product, which enable accurate and convenient searching of related object information of a target object, thereby improving a user's use experience.
第一方面,提供了一种搜索方法,应用于一服务器,该搜索方法可包括:接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息;发送所述相关对象信息。In a first aspect, a search method is provided, which is applied to a server, the search method may include: receiving a search request, the search request including a target image of a target object to be searched; and extracting the target from the target image Character information and image features associated with the object; searching for related object information associated with the target object based on the character information and the image feature; transmitting the related object information.
结合第一方面,在第一方面的一种实现方式中,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征可包括:利用光学字符识 别OCR从所述目标图像中识别文字和符号;从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。In conjunction with the first aspect, in an implementation of the first aspect, the extracting the character information and the image feature associated with the target object from the target image may include: utilizing optical character recognition The OCR identifies a character and a symbol from the target image; and selects an identification character for identifying the target object from the recognized characters and symbols as character information associated with the target object.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息可包括:基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息,其中,所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。In conjunction with the first aspect and the foregoing implementation manner, in another implementation manner of the first aspect, the searching for related object information associated with the target object based on the character information and the image feature may include: The character information and the image feature search for the related object information from a pre-established object database, wherein the object database includes image features, character information, and associated information of each candidate object.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息可包括:基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度;基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度;基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。In conjunction with the first aspect and the foregoing implementation manner, in another implementation manner of the first aspect, the searching for the related object information from the pre-established object database based on the character information and the image feature may include: Calculating image feature similarity between the target object and each candidate object by image features of the target image and image features of the respective candidate objects; calculating the target object based on character information of the target image and character information of each candidate object Character information similarity with each candidate object; searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息可包括:对所述图像特征相似度和字符信息相似度进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度;按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象;将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息。In combination with the first aspect and the foregoing implementation manner, in another implementation manner of the first aspect, the searching for the location and the location information based on the image feature similarity and the character information similarity The related object information associated with the target object may include: performing weighted averaging on the image feature similarity and the character information similarity to obtain an average similarity between the target object and each candidate object; according to the average similarity The descending order of degrees selects a predetermined number of candidate objects from the plurality of candidate objects; information corresponding to the selected candidate objects is used as related object information associated with the target object.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度可包括:计算所述目标对象的图像特征与各个备选对象之间的图像特征之间的余弦相似度,作为所述图像特征相似度。In conjunction with the first aspect and the foregoing implementation manner, in another implementation manner of the first aspect, the image feature based on the target image and the image feature of each candidate object are used to calculate between the target object and each candidate object. The image feature similarity may include calculating a cosine similarity between the image feature of the target object and an image feature between the respective candidate objects as the image feature similarity.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度可包括:计算所述目标对象的字符信息与各个备选对象的字符信息之间的编辑距离;基于所述编辑距离、所述目标对象的字符信息的长度、备选对象的字符信息的长度来计算所述字符信息相似度。 In conjunction with the first aspect and the foregoing implementation manner, in another implementation manner of the first aspect, the character information based on the target image and the character information of each candidate object are calculated between the target object and each candidate object. The character information similarity may include: calculating an edit distance between the character information of the target object and the character information of each candidate object; based on the edit distance, the length of the character information of the target object, the character of the candidate object The length of the information is used to calculate the similarity of the character information.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征可包括如下操作中的至少一个:计算所述目标图像的颜色直方图特征作为所述图像特征;和计算所述目标图像的词袋模型特征作为所述图像特征。In conjunction with the first aspect and the foregoing implementation manner, in another implementation of the first aspect, the extracting the character information and the image feature associated with the target object from the target image may include at least one of the following operations One: calculating a color histogram feature of the target image as the image feature; and calculating a word bag model feature of the target image as the image feature.
结合第一方面及其上述实现方式,在第一方面的另一实现方式中,所述目标图像可满足预定条件。In conjunction with the first aspect and the above implementation thereof, in another implementation of the first aspect, the target image may satisfy a predetermined condition.
第二方面,提供了一种搜索方法,应用于一用户设备。该搜索方法可包括:采集要搜索的目标对象的目标图像;判断所述目标图像是否满足预定条件;在所述目标图像满足预定条件时,发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像;接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。In a second aspect, a search method is provided for application to a user equipment. The search method may include: collecting a target image of the target object to be searched; determining whether the target image satisfies a predetermined condition; and when the target image satisfies a predetermined condition, issuing a search request for the target object, the search request including The target image; receiving related object information associated with the target object, wherein the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
结合第二方面,在第二方面的一种实现方式中,所述判断所述目标图像是否满足预定条件可包括:确定在采集所述目标图像过程中的光照参数;当所述光照参数大于等于预设照度时,确定所述目标图像满足预定条件。With reference to the second aspect, in an implementation manner of the second aspect, the determining whether the target image meets the predetermined condition may include: determining an illumination parameter in the process of acquiring the target image; and when the illumination parameter is greater than or equal to When the illuminance is preset, it is determined that the target image satisfies a predetermined condition.
结合第二方面及其上述实现方式,在第二方面的另一实现方式中,所述判断所述目标图像是否满足预定条件可包括:确定所采集的目标图像的边缘的像素点的平均梯度;当所述目标图像的边缘的像素点的平均梯度小于预设梯度阈值时,确定所述目标图像满足预定条件。With reference to the second aspect and the foregoing implementation manner, in another implementation manner of the second aspect, the determining whether the target image meets a predetermined condition may include: determining an average gradient of pixel points of an edge of the collected target image; When the average gradient of the pixel points of the edge of the target image is less than the preset gradient threshold, it is determined that the target image satisfies a predetermined condition.
第三方面,提供了一种搜索装置,应用于一服务器。该搜索装置可包括:收发器,接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;处理器;存储器;和存储在所述存储器中的计算机程序指令。在所述计算机程序指令被所述处理器运行时执行以下步骤:从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;基于所述字符信息和图像特征搜索与所述目标对象相关联的相关对象信息;将所搜索的相关对象信息提供给所述收发器,以发送出去。In a third aspect, a search device is provided for use in a server. The search device can include a transceiver that receives a search request, the search request including a target image of a target object to be searched, a processor, a memory, and computer program instructions stored in the memory. Performing the steps of: extracting, from the target image, character information and image features associated with the target object when the computer program instructions are executed by the processor; searching and said based on the character information and image features Relevant object information associated with the target object; the searched related object information is provided to the transceiver for transmission.
结合第三方面,在第三方面的一种实现方式中,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征可包括:利用光学字符识别OCR从所述目标图像中识别文字和符号;从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。In conjunction with the third aspect, in an implementation of the third aspect, the extracting the character information and the image feature associated with the target object from the target image may include: using the optical character recognition OCR from the target Identifying characters and symbols in the image; selecting an identification character for identifying the target object from the recognized characters and symbols as character information associated with the target object.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述 基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息可包括:基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息,其中,所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。With reference to the third aspect and the foregoing implementation manner, in another implementation manner of the third aspect, Searching for related object information associated with the target object based on the character information and the image feature may include: searching for the related object information from a pre-established object database based on the character information and the image feature, wherein The object database includes image features, character information, and associated information of each candidate object.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息可包括:基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度;基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度;基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。With reference to the third aspect and the foregoing implementation manner, in another implementation manner of the third aspect, the searching for the related object information from the pre-established object database based on the character information and the image feature may include: Calculating image feature similarity between the target object and each candidate object by image features of the target image and image features of the respective candidate objects; calculating the target object based on character information of the target image and character information of each candidate object Character information similarity with each candidate object; searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息可包括:对所述图像特征相似度和字符信息相似度进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度;按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象;将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息。In conjunction with the third aspect and the foregoing implementation manner, in another implementation manner of the third aspect, the searching, searching, and searching from the multiple candidate objects based on the image feature similarity and the character information similarity The related object information associated with the target object may include: performing weighted averaging on the image feature similarity and the character information similarity to obtain an average similarity between the target object and each candidate object; according to the average similarity The descending order of degrees selects a predetermined number of candidate objects from the plurality of candidate objects; information corresponding to the selected candidate objects is used as related object information associated with the target object.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度可包括:计算所述目标对象的图像特征与各个备选对象之间的图像特征之间的余弦相似度,作为所述图像特征相似度。In conjunction with the third aspect and the foregoing implementation manner, in another implementation manner of the third aspect, the image feature based on the target image and the image feature of each candidate object are calculated between the target object and each candidate object. The image feature similarity may include calculating a cosine similarity between the image feature of the target object and an image feature between the respective candidate objects as the image feature similarity.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度可包括:计算所述目标对象的字符信息与各个备选对象的字符信息之间的编辑距离;基于所述编辑距离、所述目标对象的字符信息的长度、各个备选对象的字符信息的长度来计算所述字符信息相似度。In conjunction with the third aspect and the foregoing implementation manner, in another implementation manner of the third aspect, the character information based on the target image and the character information of each candidate object are calculated between the target object and each candidate object. The character information similarity may include: calculating an edit distance between the character information of the target object and the character information of each candidate object; based on the edit distance, the length of the character information of the target object, and the candidate objects The length of the character information is used to calculate the similarity of the character information.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征可包括如下操作中的至少一个:计算所述目标图像的颜色直方图特征作为所述图像特征; 和计算所述目标图像的词袋模型特征作为所述图像特征。In conjunction with the third aspect and the above implementation thereof, in another implementation of the third aspect, extracting the character information and the image feature associated with the target object from the target image may include at least one of the following operations: Calculating a color histogram feature of the target image as the image feature; And calculating a word bag model feature of the target image as the image feature.
结合第三方面及其上述实现方式,在第三方面的另一实现方式中,所述目标图像可满足预定条件。In conjunction with the third aspect and the above implementation thereof, in another implementation of the third aspect, the target image may satisfy a predetermined condition.
第四方面,提供了一种用户设备。该用户设备可包括:图像采集器,用于采集要搜索的目标对象的目标图像;处理器,用于判断所述目标图像是否满足预定条件;收发器,在所述目标图像满足预定条件时,所述发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像,并接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。In a fourth aspect, a user equipment is provided. The user equipment may include: an image collector for acquiring a target image of the target object to be searched; a processor for determining whether the target image satisfies a predetermined condition; and a transceiver, when the target image satisfies a predetermined condition, The issuing a search request for the target object, the search request including the target image, and receiving related object information associated with the target object, wherein the related object information is based on extraction from the target image Character information and image feature search associated with the target object are obtained.
结合第四方面,在第四方面的一种实现方式中,所述用户设备还可包括用于测量所述目标对象的光照参数的照度计,所述处理器可指令所述照度计在图像采集器采集目标图像的过程中测量所述目标对象的光照参数,并且在所述光照参数大于等于预设照度时确定所述目标图像满足预定条件。In conjunction with the fourth aspect, in an implementation manner of the fourth aspect, the user equipment may further include an illuminometer for measuring an illumination parameter of the target object, the processor may instruct the illuminometer to be in image collection The illumination parameter of the target object is measured during the process of acquiring the target image, and determining that the target image satisfies a predetermined condition when the illumination parameter is greater than or equal to the preset illumination.
结合第四方面及其上述实现方式,在第四方面的另一实现方式中,所述处理器分析目标图像以确定其边缘的像素点的平均梯度,并且在所述目标图像的边缘的像素点的平均梯度小于预设梯度阈值时,确定所述目标图像满足预定条件。In conjunction with the fourth aspect and the above implementation manner, in another implementation of the fourth aspect, the processor analyzes the target image to determine an average gradient of pixel points of the edge thereof, and a pixel point at an edge of the target image When the average gradient is less than the preset gradient threshold, it is determined that the target image satisfies a predetermined condition.
第五方面,提供了一种用于搜索对象的计算机程序产品,可包括计算机可读存储介质。在所述计算机可读存储介质上存储了计算机程序指令,所述计算机程序指令由处理器执行以使得所述处理器:接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息;发送所述相关对象信息。In a fifth aspect, a computer program product for searching for an object is provided, which can include a computer readable storage medium. Storing computer program instructions on the computer readable storage medium, the computer program instructions being executed by a processor to cause the processor to: receive a search request, the search request including a target image of a target object to be searched; Extracting character information and image features associated with the target object in the target image; searching for related object information associated with the target object based on the character information and the image feature; and transmitting the related object information.
第六方面,提供了一种用于搜索对象的计算机程序产品,可包括计算机可读存储介质。在所述计算机可读存储介质上存储了计算机程序指令。所述计算机程序指令可以由处理器执行以使得所述处理器:利用图像采集器采集要搜索的目标对象的目标图像;判断所述目标图像是否满足预定条件;在所述目标图像满足预定条件时,利用收发器发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像;以及利用收发器接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。 In a sixth aspect, a computer program product for searching for an object is provided, which can include a computer readable storage medium. Computer program instructions are stored on the computer readable storage medium. The computer program instructions may be executed by a processor to cause the processor to: acquire an object image of a target object to be searched using an image collector; determine whether the target image satisfies a predetermined condition; and when the target image satisfies a predetermined condition Transmitting, by the transceiver, a search request for the target object, the search request including the target image; and receiving, by the transceiver, related object information associated with the target object, wherein the related object information is based on the The character information and image feature associated with the target object extracted in the target image are searched.
在根据本公开实施例的用于服务器的搜索方法、搜索装置和计算机程序产品的技术方案中,通过从要搜索的目标对象的目标图像中提取与所述目标对象相关联的字符信息和图像特征,基于所字符信息和图像特征执行搜索,能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。In a technical solution for a search method, a search device, and a computer program product for a server according to an embodiment of the present disclosure, character information and image features associated with the target object are extracted from a target image of a target object to be searched for The search is performed based on the character information and the image feature, and the related object information of the target object can be searched accurately and conveniently, thereby improving the user experience.
在根据本公开实施例的用于用户设备的搜索方法、该用户设备和计算机程序产品的技术方案中,在所采集的目标对象的目标图像满足预定条件时,基于该目标图像发出搜索请求,使能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。In a technical solution for a search method for a user equipment, the user equipment, and a computer program product according to an embodiment of the present disclosure, when a target image of the collected target object satisfies a predetermined condition, a search request is issued based on the target image, so that It can accurately and conveniently search related object information of the target object, thereby improving the user experience.
附图说明DRAWINGS
为了更清楚地说明本公开实施例的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本公开的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only some of the disclosure. For the embodiments, other drawings can also be obtained from those skilled in the art based on these drawings.
图1(a)示意性图示了根据本公开实施例的应用场景;FIG. 1(a) schematically illustrates an application scenario according to an embodiment of the present disclosure;
图1(b)示意性图示了由用户设备拍摄的目标图像的示意图;Figure 1 (b) schematically illustrates a schematic diagram of a target image taken by a user device;
图2是示意性图示了根据本公开实施例的用于服务器的搜索方法的流程图;2 is a flow chart that schematically illustrates a search method for a server in accordance with an embodiment of the present disclosure;
图3是示意性图示了图2的搜索方法中的基于图像特征和字符信息搜索目标对象的相关对象信息的流程图;3 is a flow chart schematically illustrating related object information of a target object based on image features and character information in the search method of FIG. 2;
图4是示意性图示了根据本公开实施例的用于用户设备的搜索方法流程图;4 is a flow chart that schematically illustrates a search method for a user equipment in accordance with an embodiment of the present disclosure;
图5是示意性图示了根据本公开实施例的第一搜索装置的框图;FIG. 5 is a block diagram schematically illustrating a first search device according to an embodiment of the present disclosure; FIG.
图6是示意性图示了根据本公开实施例的用于服务器的第二搜索装置的框图;FIG. 6 is a block diagram schematically illustrating a second search device for a server according to an embodiment of the present disclosure; FIG.
图7是示意性图示了根据本公开实施例的用户设备的框图。FIG. 7 is a block diagram schematically illustrating a user equipment in accordance with an embodiment of the present disclosure.
具体实施方式detailed description
下面将结合本公开实施例中的附图,对本公开实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本公开一部分实施例,而不是全部的实施例。本领域普通技术人员基于本公开中的实施例所获得的所有其 他实施例,都属于本公开保护的范围。The technical solutions in the embodiments of the present disclosure are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present disclosure. It is obvious that the described embodiments are a part of the embodiments of the present disclosure, and not all of the embodiments. All of those obtained by those of ordinary skill in the art based on the embodiments of the present disclosure The embodiments thereof are all within the scope of the protection of the present disclosure.
图1(a)示意性图示了根据本公开实施例的应用场景。如图1(a)所示,用户设备10通过网络与搜索服务器20通信连接。所述用户设备10例如为智能手机、平板计算机、笔记本计算机等。所述搜索服务器20为云服务器,网站服务器等。用户设备10与搜索服务器20之间的通信可以采用各种技术来实现,包括但不限于互联网、局域网、第三代移动通信技术等。例如,用户设备的用户浏览淘宝网网页,以期望购买特定的商品,即目标对象。该用户设备通过互联网连接到淘宝网的搜索服务器。FIG. 1(a) schematically illustrates an application scenario in accordance with an embodiment of the present disclosure. As shown in FIG. 1(a), the user equipment 10 is communicatively coupled to the search server 20 via a network. The user device 10 is, for example, a smart phone, a tablet computer, a notebook computer, or the like. The search server 20 is a cloud server, a web server, or the like. Communication between user device 10 and search server 20 may be implemented using a variety of techniques including, but not limited to, the Internet, local area networks, third generation mobile communication technologies, and the like. For example, a user of a user device browses a Taobao web page to expect to purchase a particular item, ie, a target object. The user equipment is connected to the search server of Taobao via the Internet.
通常,用户在用户设备的淘宝网网页中输入要购买的商品的关键词,用户设备经由互联网将所述关键词传送给淘宝网的搜索服务器,后者基于所述关键词执行搜索,并经由互联网将所述搜索结果发送给用户设备。当用户输入的关键词不准确或存在错误时,难以获得令人满意的搜索结果。而且,由于商品或服务个数和种类繁多,在搜索结果中可能包括与关键词相关联的多种商品,这可使用户不能从搜索结果中找到要购买的目标对象。Generally, the user inputs a keyword of the product to be purchased in the Taobao webpage of the user equipment, and the user equipment transmits the keyword to the search server of Taobao via the Internet, and the latter performs a search based on the keyword and via the Internet. Sending the search result to the user equipment. When the keyword input by the user is inaccurate or there is an error, it is difficult to obtain a satisfactory search result. Moreover, due to the large number and variety of goods or services, a plurality of items associated with keywords may be included in the search results, which may make it impossible for the user to find the target object to be purchased from the search results.
在本公开的实施例中,用户设备10利用摄像头等对要购买的目标对象进行图像采集,将所采集的目标图像传送给搜索服务器20。搜索服务器20从所述目标图像中提取字符信息和/或图像信息,并基于所提取的信息执行搜索,并经由互联网将所述搜索结果发送给用户设备。在所述目标图像中,通常携带关于目标对象的丰富信息,例如目标对象的外观、名称、商标、生产商、生产日期等。基于目标图像中的丰富信息,搜索服务器能够更准确地搜索到用户的目标对象,从而提高搜索的准确度。此外,搜索服务器可以自动地提取目标图像中的信息,而不需要用户手动地输入关键词等,这使得用户的搜索操作更为便捷。In the embodiment of the present disclosure, the user equipment 10 performs image acquisition on a target object to be purchased using a camera or the like, and transmits the collected target image to the search server 20. The search server 20 extracts character information and/or image information from the target image, and performs a search based on the extracted information, and transmits the search result to the user device via the Internet. In the target image, it usually carries rich information about the target object, such as the appearance, name, trademark, manufacturer, date of manufacture, and the like of the target object. Based on the rich information in the target image, the search server can more accurately search for the target object of the user, thereby improving the accuracy of the search. In addition, the search server can automatically extract information in the target image without requiring the user to manually input keywords or the like, which makes the user's search operation more convenient.
图1(b)示意性图示了由用户设备10拍摄的目标图像的示意图。如图1(b)所示,所拍摄的目标图像(1)、(2)、(3)分别是依云(evian)矿泉水、卡乐比(calbee)薯片、蓝月亮洗衣液。在所述目标图像(1)中包括如下信息:瓶装水形状的外观、为evian的名称、山脉形状、容量550毫升等,依据这些信息,搜索服务器20能够准确地搜索到用户的目标对象。然而,如果用户输入关键字“依云矿泉水”,则会搜索不同包装、不同系列、不同容量的依云矿泉水。类似地,图1(b)的目标图像(2)中也包括了丰富的信息:例如,品牌名称“卡乐B”、商品内容“Potato Chips”、口味系列“烧烤味”、袋 装的商品外观、容量“90g”等;图1(b)的目标图像(3)中也包括了丰富的信息:例如,品牌名称“蓝月亮”、商品内容“洗衣液”、桶装的商品外观、容量“2kg”、产品系列“清雅丁香”等。基于目标图像中包含的丰富信息,搜索服务器20能够准确地搜索到各个目标对象。FIG. 1(b) schematically illustrates a schematic diagram of a target image taken by the user device 10. As shown in Fig. 1(b), the captured target images (1), (2), and (3) are respectively Evian mineral water, calbee potato chips, and blue moon laundry liquid. The target image (1) includes information on the appearance of the shape of the bottled water, the name of the evian, the shape of the mountain, the capacity of 550 ml, etc., based on the information, the search server 20 can accurately search for the target object of the user. However, if the user enters the keyword "Evian mineral water", it will search for Evian mineral water in different packaging, different series and different capacities. Similarly, the target image (2) of Fig. 1(b) also includes a wealth of information: for example, the brand name "Carle B", the product content "Potato Chips", the taste series "Barbecue", bags Appearance, capacity "90g", etc.; the target image (3) of Figure 1(b) also contains a wealth of information: for example, the brand name "blue moon", the product content "laundry liquid", the barreled product Appearance, capacity "2kg", product series "clear clove" and so on. Based on the rich information contained in the target image, the search server 20 can accurately search for each target object.
图2是示意性图示了根据本公开实施例的用于服务器的搜索方法200的流程图。该搜索方法200可应用于如图1(a)所示的搜索服务器。如图2所示,所述搜索方法200可包括:接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像(S210);从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征(S220);基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息(S230);发送所述相关对象信息(S240)。2 is a flow chart that schematically illustrates a search method 200 for a server in accordance with an embodiment of the present disclosure. The search method 200 is applicable to a search server as shown in FIG. 1(a). As shown in FIG. 2, the search method 200 may include receiving a search request including a target image of a target object to be searched (S210); extracting characters associated with the target object from the target image Information and image features (S220); searching for related object information associated with the target object based on the character information and the image feature (S230); transmitting the related object information (S240).
在S210中,服务器从用户设备接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像。所述目标图像是如图1(b)中所示的任一个目标图像。该目标图像包含了要搜索的目标对象的各种信息,包含但不限于品牌名称、对象内容、系列、外观、容量、生产日期等。该目标图像可以是所述用户设备利用其图像采集装置所采集的,也可以是所述用户设备从其它电子设备接收的,用户设备获取所述目标图像的方式不构成对本公开实施例的限制。In S210, the server receives a search request from the user device, the search request including a target image of the target object to be searched. The target image is any one of the target images as shown in FIG. 1(b). The target image contains various information of the target object to be searched, including but not limited to brand name, object content, series, appearance, capacity, production date, and the like. The target image may be collected by the user equipment by using the image collection device, or may be received by the user equipment from other electronic devices. The manner in which the user equipment acquires the target image does not constitute a limitation on the embodiments of the present disclosure.
如结合图1(a)所示描述的,服务器要从所述目标图像中提取信息来搜索目标对象。相应地,所述目标图像的图像质量将直接影响搜索结果。例如在图1(b)的目标图像(1)中,如果目标图像模糊而导致不能提取其品牌名称evian、容量等信息,则难以准确地搜索到目标对象。因此,可以对所述目标图像做出要求,例如所述目标图像满足预定条件。所述预定条件可以是关于目标图像的亮度的条件、或者是关于所述目标图像的清晰度的条件。As described in connection with FIG. 1(a), the server extracts information from the target image to search for a target object. Accordingly, the image quality of the target image will directly affect the search results. For example, in the target image (1) of FIG. 1(b), if the target image is blurred and information such as the brand name evian, capacity, and the like cannot be extracted, it is difficult to accurately search for the target object. Therefore, a request can be made for the target image, for example, the target image satisfies a predetermined condition. The predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
例如,当目标图像的亮度大于等于预设亮度阈值时,确定所述目标图像满足预定条件;当目标图像的亮度小于预设亮度阈值时,确定所述目标图像不满足预定条件。或者,当目标图像的清晰度大于等于预设清晰度阈值时,确定所述目标图像满足预定条件;当目标图像的亮度小于预设清晰度阈值时,确定所述目标图像不满足预定条件。所述预设亮度阈值或预设清晰度阈值可以根据所述服务器的处理能力来调整。例如,当服务器的处理能力强时,可以将预设亮度阈值或预设清晰度阈值设置为较低的值;当服务器的处理能力弱时,可以将预设亮度阈值或预设清晰度阈值设置为较高的值。For example, when the brightness of the target image is greater than or equal to the preset brightness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset brightness threshold, it is determined that the target image does not satisfy the predetermined condition. Alternatively, when the sharpness of the target image is greater than or equal to the preset sharpness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset sharpness threshold, it is determined that the target image does not satisfy the predetermined condition. The preset brightness threshold or the preset definition threshold may be adjusted according to the processing capability of the server. For example, when the processing capability of the server is strong, the preset brightness threshold or the preset definition threshold may be set to a lower value; when the processing capability of the server is weak, the preset brightness threshold or the preset definition threshold may be set. Is a higher value.
在S220中,从所述目标图像中提取与所述目标对象相关联的字符信息和 图像特征。所述目标图像所包括的字符信息例如为产品名称、容量、品牌名称、生产日期等,这些字符信息是文字或符号。所述目标图像所包括的图像特诊是图像的颜色分量、各个颜色分量的组成比例等。典型地,采用不同的技术手段来提取目标图像中的字符信息和图像特征。In S220, character information associated with the target object is extracted from the target image and Image features. The character information included in the target image is, for example, a product name, a capacity, a brand name, a date of manufacture, and the like, and the character information is a character or a symbol. The image special diagnosis included in the target image is a color component of the image, a composition ratio of each color component, and the like. Typically, different techniques are employed to extract character information and image features in the target image.
对于字符信息,可以采用光学字符识别(OCR,Optical Character Recognition)技术提取目标图像中的字符信息。在OCR技术中,服务器通过检测目标图像的暗、亮的模式确定其形状,然后用字符识别方法将形状翻译成计算机文字。或者,还可以采用其它技术来对目标图像进行字符识别以获取其中的字符信息。For character information, the character information in the target image can be extracted by using Optical Character Recognition (OCR) technology. In OCR technology, the server determines its shape by detecting the dark and bright patterns of the target image, and then uses the character recognition method to translate the shape into computer text. Alternatively, other techniques may be employed to perform character recognition on the target image to obtain character information therein.
可通过如下操作从所述目标图像中提取与所述目标对象相关联的字符信息:利用光学字符识别OCR从所述目标图像中识别文字和符号;从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。如前所述,在目标图像中包括丰富的信息,部分信息可能与目标对象的搜索密切相关,例如产品名称、品牌、容量等。然而,目标图像中还可能包括与目标对象的搜索无关的信息,例如成分、安全提醒等,这些信息可能是所有的同类产品都涉及的信息,其不能用于标识目标对象。因此,在对目标图像进行字符识别之后,需要从中筛选出搜索目标对象所需的信息,即用于标识所述目标对象的标识字符。Character information associated with the target object may be extracted from the target image by recognizing words and symbols from the target image using optical character recognition OCR; selecting for identification from the identified characters and symbols The identification character of the target object is the character information associated with the target object. As mentioned before, rich information is included in the target image, and some of the information may be closely related to the search of the target object, such as product name, brand, capacity, and the like. However, the target image may also include information that is not related to the search of the target object, such as components, security reminders, etc., which may be information related to all similar products, which cannot be used to identify the target object. Therefore, after performing character recognition on the target image, it is necessary to filter out information required for searching the target object, that is, an identification character for identifying the target object.
图像特征是指数值化的图像特性表达,例如使用向量来表示图像特性。可以采用现有的或将来出现的各种方式来表示目标图像的图像特征。这里以颜色直方图和词袋模型(Bag of Words)特征作为图像特征为例进行描述。要注意,在应用中,可以采用颜色直方图和词袋模型特征中的任一个来表示目标图像的图像特征,也可以采用颜色直方图和词袋模型特征二者来示目标图像的图像特征。也就是说,所述从所述目标图像中提取与所述目标对象相关联和图像特征包括如下操作中的至少一个:计算所述目标图像的颜色直方图特征作为所述图像特征;和计算所述目标图像的词袋模型特征作为所述图像特征。Image features are index-valued image feature representations, such as using vectors to represent image characteristics. The image features of the target image may be represented in various ways that are present or appearing in the future. Here, a color histogram and a Bag of Words feature are taken as an example of an image feature. It is to be noted that, in the application, any one of the color histogram and the word bag model feature may be used to represent the image feature of the target image, and both the color histogram and the bag model feature may be used to represent the image feature of the target image. That is, the extracting the image object associated with the target object from the target image includes at least one of: calculating a color histogram feature of the target image as the image feature; and calculating a location The word bag model feature of the target image is used as the image feature.
颜色直方图是图像的颜色特征的一种统计表达,其用于表示不同色彩在整幅目标图像中所占的比例,而并不关心每种色彩所处的空间位置。颜色直方图与颜色空间表示方式密切相关。常用的颜色直方图包括RGB空间颜色直方图,HSV空间颜色直方图以及Lab空间颜色直方图等。在不同的颜色空间 中,目标图像的颜色直方图具有不同的数值。A color histogram is a statistical representation of the color characteristics of an image that is used to represent the proportion of different colors in the entire target image, without concern for the spatial location of each color. Color histograms are closely related to how color space is represented. Common color histograms include RGB spatial color histograms, HSV spatial color histograms, and Lab space color histograms. In different color spaces The color histogram of the target image has different values.
词袋模型特征是图像的纹理特征的一种统计表达,其可以有效描述图像的整体和局部特性。例如,可通过如下操作获得目标图像的词袋模型特征:从目标图像中提取特征描述符,例如尺度不变特征变换(SIFT,Scale Invariant Feature Transform)、方向梯度直方图(HOG,Histogram of Oriented Gradient)等;对于每一个描述符,在预先准确的码本中搜索最相似的聚类中心,统计不同聚类中心在该目标图像中出现的频度,形成一个直方图;对该直方图作归一化处理,从而得到目标图像的词袋模型特征。所述预先准确的码本可通过如下方式获得:从训练图像的集合中随机提取大量的图像描述符(例如SIFT,HOG等),采用聚类算法对这些图像描述符进行聚类,得到多个类别,聚类得到的所有类别即组成码本。The word bag model feature is a statistical representation of the texture features of an image that can effectively describe the overall and local characteristics of the image. For example, the word bag model feature of the target image can be obtained by extracting feature descriptors from the target image, such as Scale Invariant Feature Transform (SIFT), Directional Histogram (HOG, Histogram of Oriented Gradient). ); for each descriptor, search for the most similar cluster center in the pre-accurate codebook, and count the frequency of occurrence of different cluster centers in the target image to form a histogram; The processing is performed to obtain the word bag model feature of the target image. The pre-accurate codebook can be obtained by randomly extracting a large number of image descriptors (for example, SIFT, HOG, etc.) from the set of training images, and clustering the image descriptors by using a clustering algorithm to obtain multiple Category, all the categories obtained by clustering constitute the codebook.
在S230中,基于在S220中获得的字符信息和图像特征搜索与所述目标对象相关联的相关对象信息。具体地,基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息。所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。In S230, related object information associated with the target object is searched based on the character information and the image feature obtained in S220. Specifically, the related object information is searched from a pre-established object database based on the character information and the image feature. The object database includes image features, character information, and associated information of respective candidate objects.
假设对象数据库P中包含N个对象,每一个对象pj可以用三元组{fI(pj),fT(pj),a(pj)}表示,其中,j=1,2,...,N。fI(pj)表示对象pj的图像特征,其可以是颜色直方图特征,或者是词袋模型特征,或者是由颜色直方图特征和词袋模型特征拼接而成的向量。fT(pj)是对象pj的字符信息,其典型地是字符串,该字符串例如为名称、品牌、含量等。a(pj)表示与对象pj的关联的其他关联信息,如价格、销量、用户评价、宣传视频以及超链接等。或者,每一个对象pj还可以用二元组{fI(pj),fT(pj)}表示。假设要搜索的目标对象q的目标图像的图像特征和字符信息分别是fI(q)和fT(q),相应地,可通过将在S220中获得的字符信息fT(q)和图像特征fI(q)与对象数据库P中的各个备选对象的字符信息fT(pj)和图像特征fI(pj)进行比对来执行搜索。Assuming that the object database P contains N objects, each object p j can be represented by a triplet {f I (p j ), f T (p j ), a(p j )}, where j=1, 2 ,...,N. f I (p j ) represents an image feature of the object p j , which may be a color histogram feature, or a bag model feature, or a vector composed of color histogram features and word bag model features. f T (p j ) is character information of the object p j , which is typically a character string such as a name, a brand, a content, and the like. a(p j ) represents other associated information associated with the object p j , such as price, sales volume, user rating, promotional video, and hyperlinks. Alternatively, each object p j can also be represented by a binary group {f I (p j ), f T (p j )}. It is assumed that the image feature and the character information of the target image of the target object q to be searched are f I (q) and f T (q), respectively, and accordingly, the character information f T (q) and the image obtained in S220 can be obtained. The feature f I (q) is compared with the character information f T (p j ) of each candidate object in the object database P and the image feature f I (p j ) to perform a search.
图3是示意性图示了图2的搜索方法中的基于图像特征和字符信息搜索目标对象的相关对象信息(S230)的流程图。如图3所示,基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度(S231);基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度(S232);对所 述图像特征相似度和字符信息相似度进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度(S233);按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象(S234);将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息(S235)。下面以目标对象q与对象数据库P中包含N个对象pj为例进行描述。FIG. 3 is a flowchart schematically illustrating related object information (S230) of searching for a target object based on image features and character information in the search method of FIG. 2. As shown in FIG. 3, image feature similarity between the target object and each candidate object is calculated based on image features of the target image and image features of the respective candidate objects (S231); character information based on the target image and each device Selecting character information of the object to calculate a character information similarity between the target object and each candidate object (S232); performing weighted averaging on the image feature similarity and the character information similarity to obtain the target object and each device Selecting an average similarity between the objects (S233); selecting a predetermined number of candidate objects from the plurality of candidate objects in descending order of the average similarity (S234); corresponding to the selected candidate objects The information is related object information associated with the target object (S235). The following describes the target object q and the object database P including N objects p j as an example.
在S231中,可以计算所述目标对象q的图像特征fI(q)与各个备选对象pj之间的图像特征fI(pj)之间的余弦相似度sI(q,pj),作为所述图像特征相似度。所述余弦相似度sI(q,pj)可通过如下的公式(1)来计算:In S231, a cosine similarity s I (q, p j ) between the image feature f I (q) of the target object q and the image feature f I (p j ) between the respective candidate objects p j can be calculated. ) as the image feature similarity. The cosine similarity s I (q, p j ) can be calculated by the following formula (1):
   公式(1), Formula 1),
其中,||fI(q)||是图像特征fI(q)的模,||fI(pj)||是图像特征fI(pj)的模。公式(1)所示的余弦相似度仅仅是图像特征相似度的一种表述方式。在实践中,还可以采取其它的函数来表示图像特征相似度,例如,可以采取目标对象的图像特征与各个备选对象之间的图像特征之间的皮尔森相关系数作为所述图像特征相似度。Where ||f I (q)|| is the modulus of the image feature f I (q), and ||f I (p j )|| is the modulus of the image feature f I (p j ). The cosine similarity shown in equation (1) is only a representation of the similarity of image features. In practice, other functions may also be adopted to represent the image feature similarity. For example, a Pearson correlation coefficient between the image feature of the target object and the image feature between each candidate object may be taken as the image feature similarity. .
在S232中,可如下地计算所述目标对象q与各个备选对象pj之间的字符信息相似度:计算所述目标对象q的字符信息fT(q)与各个备选对象pj的字符信息fT(pj)之间的编辑距离d(fT(q),fT(pj));基于所述编辑距离、所述目标对象的字符信息fT(q)的长度、备选对象的字符信息fT(pj)的长度来计算所述字符信息相似度。编辑距离是指在两个字符串之间,由一个字符串转成另一个字符串所需的最少编辑操作次数,所许可的编辑操作包括将一个字符替换成另一个字符,插入一个字符,删除一个字符。因此,编辑距离d(fT(q),fT(pj))是将字符信息fT(q)转成字符信息fT(pj)所需的最少编辑操作次数。字符信息fT(q)的长度例如是该字符信息fT(q)中包括的文字和符号的数目。备选对象的字符信息fT(pj)的长度例如是该字符信息fT(pj)中包括的文字和符号的数目。例如,可通过如下的公式(2)来计算字符信息相似度sT(q,pj):In S232, the character information similarity between the target object q and each candidate object p j may be calculated as follows: calculating character information f T (q) of the target object q and each candidate object p j An edit distance d(f T (q), f T (p j )) between the character information f T (p j ); based on the edit distance, the length of the character information f T (q) of the target object, The character information similarity is calculated by the length of the character information f T (p j ) of the candidate object. Edit distance is the minimum number of edit operations required to convert from one string to another between two strings. The allowed editing operations include replacing one character with another, inserting a character, and deleting One character. Therefore, the edit distance d(f T (q), f T (p j )) is the minimum number of editing operations required to convert the character information f T (q) into the character information f T (p j ). Character information length f T (q), for example, the number of characters and symbols included in the character information f T (q) in. Alternatively, the length of the object character information f T (p j), for example, the number of characters and symbols included in the character information f T (p j) in. For example, the character information similarity s T (q, p j ) can be calculated by the following formula (2):
Figure PCTCN2015082628-appb-000002
   公式(2)
Figure PCTCN2015082628-appb-000002
Formula (2)
其中,d(fT(q),fT(pj))是字符信息fT(q)和字符信息fT(pj)之间的编辑距离, L(fT(q))是字符信息fT(q)的长度,L(fT(pj))是字符信息fT(pj)的长度。Where d(f T (q), f T (p j )) is the edit distance between the character information f T (q) and the character information f T (p j ), and L(f T (q)) is a character The length of the information f T (q), L(f T (p j )), is the length of the character information f T (p j ).
在S233中,对所述图像特征相似度sI(q,pj)和字符信息相似度sT(q,pj)进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度。例如,可通过下面的公式(3)来计算所述平均相似度s(q,pj):In S233, the image feature similarity s I (q, p j ) and the character information similarity s T (q, p j ) are weighted and averaged to obtain an average between the target object and each candidate object. Similarity. For example, the average similarity s(q, p j ) can be calculated by the following formula (3):
s(q,pj)=ω·sI(q,pj)+(1-ω)sT(q,pj)   公式(3),s(q,p j )=ω·s I (q,p j )+(1-ω)s T (q,p j ) Equation (3),
其中,ω是权重系数。该权重系数ω是可调的参数,其取值范围是[0,1],典型取值为ω=0.6。当权重系数ω增加时,图像特征相似度sI(q,pj)在平均相似度中权重增加,字符信息相似度sT(q,pj)在平均相似度中权重降低。当权重系数ω减少时,图像特征相似度sI(q,pj)在平均相似度中权重降低,字符信息相似度sT(q,pj)在平均相似度中权重增加。Where ω is the weight coefficient. The weight coefficient ω is an adjustable parameter, and its value range is [0, 1], and the typical value is ω=0.6. When the weight coefficient ω increases, the image feature similarity s I (q, p j ) increases in the average similarity, and the character information similarity s T (q, p j ) decreases in the average similarity. When the weight coefficient ω decreases, the image feature similarity s I (q, p j ) decreases in the average similarity, and the character information similarity s T (q, p j ) increases in the average similarity.
在S234中,按照所述平均相似度s(q,pj)的递减顺序从所述多个备选对象中选择预定数目备选对象。在S233中,计算了目标对象与各个备选对象之间的平均相似度s(q,pj),j=1,2,...,N,即得到N个平均相似度,对这N个平均相似度可以按照递减的顺序排列,并例如选择平均相似度靠前的预定数目R个备选对象,这R个备选对象即是搜索结果。所述R个备选对象与目标对象之间的平均相似度较高,说明这R个备选对象与目标对象较接近,从而有较大的可能是用户期望的目标对象。R是一个可配置的参数,其典型值可设为10、20和100等。In S234, a predetermined number of candidate objects are selected from the plurality of candidate objects in descending order of the average similarity s(q, p j ). In S233, the average similarity s(q, p j ) between the target object and each candidate object is calculated, j=1, 2, . . . , N, that is, N average similarities are obtained, for which N The average similarities may be arranged in descending order, and for example, a predetermined number of R candidate objects with an average degree of similarity are selected, and the R candidate objects are search results. The average similarity between the R candidate objects and the target object is high, indicating that the R candidate objects are closer to the target object, so that there is a larger target object that the user desires. R is a configurable parameter, and its typical value can be set to 10, 20, 100, and so on.
在S235中,将与所选择的R个备选对象对应的信息作为与所述目标对象相关联的相关对象信息。将此R个对象的图片、字符描述以及关联信息等作为相关对象信息。所述关联信息例如为价格、销量、用户评价、宣传视频以及超链接等。In S235, information corresponding to the selected R candidate objects is taken as related object information associated with the target object. The picture, the character description, the related information, and the like of the R objects are used as related object information. The related information is, for example, price, sales volume, user rating, promotional video, hyperlink, and the like.
在上面的S233至S235中,基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。替换所述S233至S235,还例如可以按照如下的方式搜索相关对象信息:按照图像特征相似度的递减顺序从所述多个备选对象中选R1个备选对象;按照字符信息相似度的递减顺序从所述多个备选对象中选择R2个备选对象;将与所述选择的R1个备选对象和R2个备选对象对应的信息作为与所述目标对象相关联的相关对象信息。R1是小于N的自然数。R2也是小于N的自然数。In the above S233 to S235, related object information associated with the target object is searched from the plurality of candidate objects based on the image feature similarity and the character information similarity. Replacing the S233 to S235, for example, the related object information may be searched in such a manner that R1 candidate objects are selected from the plurality of candidate objects in descending order of image feature similarity; in descending order of similarity of character information R2 candidate objects are selected from the plurality of candidate objects; information corresponding to the selected R1 candidate objects and R2 candidate objects is used as related object information associated with the target object. R1 is a natural number smaller than N. R2 is also a natural number smaller than N.
在S240中,服务器将所搜索到的相关对象信息作为搜索结果发送给用户设备。服务器可以利用互联网、局域网等各种网络或通信技术发送所述相关 对象信息。相关对象信息例如是所述R个备选对象的图片、文字描述以及关联信息,或者是上述的R1加上R2备选对象的图片、文字描述以及关联信息。用户设备在接收到相关对象信息之后,可以将所述相关对象信息显示在用户设备的屏幕上,以供用户查看。In S240, the server sends the searched related object information as a search result to the user equipment. The server can transmit the correlation by using various networks or communication technologies such as the Internet and a local area network. Object information. The related object information is, for example, a picture, a text description, and associated information of the R candidate objects, or a picture, a text description, and associated information of the R1 plus R2 candidate object described above. After receiving the related object information, the user equipment may display the related object information on the screen of the user equipment for the user to view.
在根据本公开实施例的用于服务器的搜索方法的技术方案中,通过从要搜索的目标对象的目标图像中提取与所述目标对象相关联的字符信息和图像特征,基于所述字符信息和图像特征执行搜索,能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。此外,通过自动识别目标图像中包含的字符信息,免去用户手动输入关键字的步骤。In a technical solution for a search method of a server according to an embodiment of the present disclosure, character information and image features associated with the target object are extracted from a target image of a target object to be searched based on the character information and The image feature performs a search, and can accurately and conveniently search related object information of the target object, thereby improving the user experience. In addition, the step of manually inputting a keyword by the user is eliminated by automatically recognizing the character information contained in the target image.
图4是示意性图示了根据本公开实施例的用于用户设备的搜索方法400流程图。该搜索方法400应用与图1(a)所示的用户设备。如图4所示,该搜索方法400可包括:采集要搜索的目标对象的目标图像(S410);判断所述目标图像是否满足预定条件(S420);在所述目标图像满足预定条件时,发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像(S430);接收与所述目标对象相关联的相关对象信息(S440),其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。4 is a flow chart that schematically illustrates a search method 400 for a user device in accordance with an embodiment of the disclosure. The search method 400 is applied to the user equipment shown in FIG. 1(a). As shown in FIG. 4, the search method 400 may include: collecting a target image of a target object to be searched (S410); determining whether the target image satisfies a predetermined condition (S420); and when the target image satisfies a predetermined condition, issuing a search request for the target object, the search request including the target image (S430); receiving related object information associated with the target object (S440), wherein the related object information is based on from the target image The extracted character information and image feature search associated with the target object are obtained.
在S410中,可以利用用户设备中的图像采集装置来采集要搜索的目标对象的目标图像。例如,用户的图1(b)中的蓝月亮洗衣液快用尽了,并期望购买该蓝月亮洗衣液,则用户利用用户设备10中内置的图像采集装置、或者与用户设备连接的图像采集装置对现有的蓝月亮洗衣液进行图像采集。图像采集装置与用户设备的位置关系不构成对本公开实施例的限制。In S410, the image capturing device in the user device may be utilized to collect the target image of the target object to be searched. For example, if the blue moon laundry liquid in the user's FIG. 1(b) is exhausted and it is desired to purchase the blue moon laundry liquid, the user utilizes an image capturing device built in the user device 10 or an image acquisition connected to the user device. The device performs image acquisition on the existing Blue Moon laundry detergent. The positional relationship between the image capture device and the user equipment does not constitute a limitation on the embodiments of the present disclosure.
在S420中,判断所述目标图像是否满足预定条件。由于服务器要从所述目标图像中提取信息来搜索目标对象,所以所述目标图像的图像质量将直接影响搜索结果。以图1(b)的目标图像(1)为例,如果目标图像模糊而导致不能提取其品牌名称evian、容量等信息,则难以准确地搜索到目标对象。在该S420可以对所述目标图像做出要求,例如所述目标图像满足预定条件。所述预定条件可以是关于目标图像的亮度的条件、或者是关于所述目标图像的清晰度的条件。In S420, it is determined whether the target image satisfies a predetermined condition. Since the server is to extract information from the target image to search for the target object, the image quality of the target image will directly affect the search result. Taking the target image (1) of FIG. 1(b) as an example, if the target image is blurred and information such as the brand name evian, capacity, etc. cannot be extracted, it is difficult to accurately search for the target object. A requirement may be made for the target image at S420, for example, the target image satisfies a predetermined condition. The predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image.
作为基于目标图像的亮度来判断是否满足预定条件的示例,在S420中可以将S410中采集的目标图像转换为HSL颜色空间的图像数据,该HSL颜色 空间的图像数据中包括了亮度信息。然后,统计所述HSL颜色空间的图像数据中的所有像素的光照分量(即L分量)的平均值
Figure PCTCN2015082628-appb-000003
当用于该目标图像的光照分量的平均值
Figure PCTCN2015082628-appb-000004
大于等于预定亮度阈值TL时,可以判断所述目标图像满足预定条件。当用于该目标图像的光照分量的平均值
Figure PCTCN2015082628-appb-000005
小于预定亮度阈值TL时,可以判断所述目标图像不满足预定条件。预定亮度阈值TL典型地为64。或者,可以通过测量图像采集环境中的光照条件来间接地判断目标图像的质量。例如,可以确定在采集所述目标图像过程中的光照参数;当所述光照参数大于等于预设照度时,确定所述目标图像满足预定条件;当所述光照参数小于预设照度时,确定所述目标图像不满足预定条件。
As an example of judging whether or not the predetermined condition is satisfied based on the brightness of the target image, the target image acquired in S410 may be converted into image data of the HSL color space in which the luminance information is included in the image data of the HSL color space. Then, the average value of the illumination components (ie, L components) of all pixels in the image data of the HSL color space is counted
Figure PCTCN2015082628-appb-000003
Average value of the illumination component used for the target image
Figure PCTCN2015082628-appb-000004
When the predetermined brightness threshold T L is greater than or equal to, it may be determined that the target image satisfies a predetermined condition. Average value of the illumination component used for the target image
Figure PCTCN2015082628-appb-000005
When it is less than the predetermined brightness threshold T L , it may be judged that the target image does not satisfy the predetermined condition. The predetermined brightness threshold T L is typically 64. Alternatively, the quality of the target image can be indirectly determined by measuring the lighting conditions in the image acquisition environment. For example, the illumination parameter in the process of acquiring the target image may be determined; when the illumination parameter is greater than or equal to the preset illumination, determining that the target image satisfies a predetermined condition; when the illumination parameter is less than the preset illumination, determining the location The target image does not satisfy the predetermined condition.
作为基于目标图像的清晰度来判断是否满足预定条件的示例,在S420中可以利用预定算法(例如,Canny算法)提取在S410中采集的目标图像的边缘,计算所述目标图像中位于边缘的各个像素点的梯度G,然后进一步计算目标图像中所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000006
当该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000007
大于等于预设梯度阈值TG时,可以判断所述目标图像满足预定条件。当该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000008
小于预设梯度阈值TG时,可以判断所述目标图像不满足预定条件。该预设梯度阈值TG典型地为100。
As an example of judging whether or not a predetermined condition is satisfied based on the sharpness of the target image, an edge of the target image acquired in S410 may be extracted using a predetermined algorithm (for example, Canny algorithm) in S420, and each of the edges located in the target image is calculated. The gradient G of the pixel, and then further calculate the average of the gradients of all the pixel points at the edge in the target image
Figure PCTCN2015082628-appb-000006
The average of the gradients of all pixel points at the edge of the target image
Figure PCTCN2015082628-appb-000007
When it is greater than or equal to the preset gradient threshold T G , it may be determined that the target image satisfies a predetermined condition. The average of the gradients of all pixel points at the edge of the target image
Figure PCTCN2015082628-appb-000008
When it is less than the preset gradient threshold T G , it may be determined that the target image does not satisfy the predetermined condition. The preset gradient threshold T G is typically 100.
上述的预定亮度阈值TL或预设梯度阈值TG可以根据执行搜索的服务器的处理能力来调整。例如,当服务器的处理能力强时,可以将预定亮度阈值TL或预设梯度阈值TG设置为较低的值;当服务器的处理能力弱时,可以将预定亮度阈值TL或预设梯度阈值TG设置为较高的值。The predetermined brightness threshold T L or the preset gradient threshold T G described above may be adjusted according to the processing capability of the server performing the search. For example, when the processing capability of the server is strong, the predetermined brightness threshold T L or the preset gradient threshold T G may be set to a lower value; when the processing capability of the server is weak, the predetermined brightness threshold T L or a preset gradient may be used. The threshold T G is set to a higher value.
当在S420中判断目标图像满足预定条件时,在S430中发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像。然后,如图1(a)所示的搜索服务器20从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征,并基于所述字符信息和所述图像特征执行搜索,即执行结合图2描述的搜索方法的各个步骤。由于目标图像的亮度或清晰度较好,所以在服务器中能够准确地提取字符信息和图像特征,从而保证了搜索的准确度。When it is judged in S420 that the target image satisfies the predetermined condition, a search request for the target object is issued in S430, the search request including the target image. Then, the search server 20 as shown in FIG. 1(a) extracts character information and image features associated with the target object from the target image, and performs a search based on the character information and the image feature, ie, The various steps of the search method described in connection with FIG. 2 are performed. Since the brightness or sharpness of the target image is good, the character information and the image feature can be accurately extracted in the server, thereby ensuring the accuracy of the search.
当在S420中判断目标图像不满足预定条件时,意味着在S410中采集的目标图像不符合要求,可能导致难以准确地提取其中的字符信息和图像特征。此时,可以在用户设备中输出重拍提示消息,以提示重新执行S410来采集要搜索的目标对象的目标图像。在该重拍提示消息中,还可以具体列出目标图 像不满足预定条件的原因。例如,在目标图像的光照分量的平均值
Figure PCTCN2015082628-appb-000009
小于预定亮度阈值TL时,可以在重拍提示消息中指出亮度不够;在该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000010
小于预设梯度阈值TG时,可以在重拍提示消息中指出清晰度不够。这样,可以根据所述重拍提示消息调整目标图像的拍摄,直到获取满足所述预定条件的目标图像。或者,当在S420中判断目标图像不满足预定条件时,可以直接根据S420的判断结果自动地调整图像采集装置的设置参数,直到获取满足所述预定条件的目标图像。
When it is judged in S420 that the target image does not satisfy the predetermined condition, it means that the target image acquired in S410 does not meet the requirement, which may make it difficult to accurately extract the character information and the image feature therein. At this time, a retake prompt message may be output in the user equipment to prompt re-execution S410 to collect the target image of the target object to be searched. In the retake prompt message, it is also possible to specifically list the reason why the target image does not satisfy the predetermined condition. For example, the average of the illumination components of the target image
Figure PCTCN2015082628-appb-000009
When less than the predetermined brightness threshold T L , the brightness may be indicated in the replay prompt message; the average of the gradients of all the pixel points located at the edge of the target image
Figure PCTCN2015082628-appb-000010
When the preset gradient threshold T G is smaller than the preset gradient threshold T G , it is indicated that the sharpness is insufficient in the replay prompt message. In this way, the shooting of the target image can be adjusted according to the replay prompt message until the target image that satisfies the predetermined condition is acquired. Alternatively, when it is judged in S420 that the target image does not satisfy the predetermined condition, the setting parameters of the image pickup device may be automatically adjusted in accordance with the determination result of S420 until the target image satisfying the predetermined condition is acquired.
用户设备在S430中向服务器发出搜索请求之后,服务器执行结合图2和3描述的搜索方法,并得到与目标对象相关联的相关对象信息。也就是说,所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。对应地,用户设备在S440中接收与目标对象相关联的相关对象信息。用户设备可以利用互联网、局域网等各种网络或通信技术从服务器接收所述相关对象信息。相关对象信息例如是多个备选对象的图片、文字描述以及关联信息。该关联信息例如是价格、销量、用户评价、宣传视频以及超链接等,其有助于用户在多个备选对象中执行选择操作。用户设备在接收到相关对象信息之后,可以将所述相关对象信息显示在用户设备的屏幕上,以供用户查看。After the user device issues a search request to the server in S430, the server performs the search method described in connection with FIGS. 2 and 3 and obtains related object information associated with the target object. That is, the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image. Correspondingly, the user equipment receives relevant object information associated with the target object in S440. The user equipment can receive the related object information from the server by using various networks or communication technologies such as the Internet and a local area network. The related object information is, for example, a picture of a plurality of candidate objects, a text description, and associated information. The associated information is, for example, price, sales volume, user rating, promotional video, hyperlinks, etc., which assists the user in performing selection operations among a plurality of candidate objects. After receiving the related object information, the user equipment may display the related object information on the screen of the user equipment for the user to view.
因此,在用户设备拍摄目标对象的图像的过程中,用户设备可以自动计算图像的光照条件和清晰程度。如果图像的光照条件和清晰程度达到要求,则允许用户设备基于所采集的目标图像发出搜索请求。如果图像的光照条件和清晰程度不能达到要求,则提示或自动指令用户设备重新拍摄,直到获取达到要求的目标图像。Therefore, in the process of the user device taking an image of the target object, the user device can automatically calculate the illumination condition and the degree of clarity of the image. If the lighting conditions and clarity of the image meet the requirements, the user device is allowed to issue a search request based on the acquired target image. If the lighting conditions and clarity of the image do not meet the requirements, the user equipment is prompted or automatically instructed to re-shoot until the desired target image is obtained.
在根据本公开实施例的用于用户设备的搜索方法的技术方案中,在所采集的目标对象的目标图像满足预定条件时,基于该目标图像发出搜索请求,使能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。In a technical solution for a search method of a user device according to an embodiment of the present disclosure, when a target image of the acquired target object satisfies a predetermined condition, a search request is issued based on the target image, so that the target object can be searched accurately and conveniently Relevant object information, thereby improving the user experience.
图5是示意性图示了根据本公开实施例的第一搜索装置500的框图。该第一搜索装置500可应用于用户设备或服务器。如图5所示,所述第一数据处理装置500可包括一个或多个处理器510、存储单元520、输入单元530、输出单元540、通信单元550、图像采集单元560。这些组件通过总线系统570和/或其它形式的连接机构(未示出)互连。应当注意,图5所示的第一搜索 装置500的组件和结构只是示例性的,而非限制性的,根据需要,第一搜索装置500也可以具有其他组件和结构,并且例如可以不包括输入单元530、输出单元540、图像采集单元560等。FIG. 5 is a block diagram schematically illustrating a first search device 500 in accordance with an embodiment of the present disclosure. The first search device 500 can be applied to a user equipment or server. As shown in FIG. 5, the first data processing apparatus 500 may include one or more processors 510, a storage unit 520, an input unit 530, an output unit 540, a communication unit 550, and an image acquisition unit 560. These components are interconnected by a bus system 570 and/or other form of connection mechanism (not shown). It should be noted that the first search shown in Figure 5 The components and structures of the device 500 are merely exemplary and not limiting. The first search device 500 may also have other components and structures as needed, and may, for example, not include the input unit 530, the output unit 540, and the image acquisition unit 560. Wait.
处理器510可以是中央处理单元(CPU)或者具有数据处理能力和/或指令执行能力的其它形式的处理单元,并且可以控制第一搜索装置500中的其它组件以执行期望的功能。Processor 510 can be a central processing unit (CPU) or other form of processing unit with data processing capabilities and/or instruction execution capabilities, and can control other components in first search device 500 to perform desired functions.
存储单元520可以包括一个或多个计算机程序产品,所述计算机程序产品可以包括各种形式的计算机可读存储介质,例如易失性存储器和/或非易失性存储器。所述易失性存储器例如可以包括随机存取存储器(RAM)和/或高速缓冲存储器(cache)等。所述非易失性存储器例如可以包括只读存储器(ROM)、硬盘、闪存等。在所述计算机可读存储介质上可以存储一个或多个计算机程序指令,处理器510可以运行所述程序指令,以实现上述的本公开的实施例的结合图2和3描述的搜索方法的各个步骤,此时第一搜索装置500可被包括在服务器中。或者,处理器510可以运行所述程序指令,实现上述的本公开的实施例的结合图4描述的搜索方法的各个步骤,此时第一搜索装置500可被包括在用户设备中。在所述计算机可读存储介质中还可以存储各种应用程序和各种数据,例如显示屏幕的工作状态、应用程序的操作状态等。Storage unit 520 can include one or more computer program products, which can include various forms of computer readable storage media, such as volatile memory and/or nonvolatile memory. The volatile memory may include, for example, a random access memory (RAM) and/or a cache or the like. The nonvolatile memory may include, for example, a read only memory (ROM), a hard disk, a flash memory, or the like. One or more computer program instructions may be stored on the computer readable storage medium, and the processor 510 may execute the program instructions to implement various of the search methods described above in connection with FIGS. 2 and 3 of embodiments of the present disclosure. Step, at this time, the first search device 500 can be included in the server. Alternatively, the processor 510 can execute the program instructions to implement the various steps of the search method described above in connection with FIG. 4 of the embodiments of the present disclosure, at which time the first search device 500 can be included in the user equipment. Various applications and various data such as an operating state of the display screen, an operational state of the application, and the like can also be stored in the computer readable storage medium.
输入单元530可以是用户用来输入指令的单元,并且可以包括键盘、鼠标、麦克风和触摸屏等中的一个或多个。输出单元540可以向外部(例如用户)输出各种信息(例如图像或声音),并且可以包括显示器、扬声器等中的一个或多个。通信单元550可以通过网络或其它技术与其它单元(例如个人计算机、服务器、移动台、基站等)通信,所述网络可以是因特网、无线局域网、移动通信网络等。The input unit 530 may be a unit used by a user to input an instruction, and may include one or more of a keyboard, a mouse, a microphone, a touch screen, and the like. The output unit 540 may output various information (such as an image or a sound) to the outside (for example, a user), and may include one or more of a display, a speaker, and the like. Communication unit 550 can communicate with other units (e.g., personal computers, servers, mobile stations, base stations, etc.) via a network or other technology, which can be the Internet, a wireless local area network, a mobile communication network, and the like.
在本公开实施例的第一搜索装置500的技术方案中,通过从要搜索的目标对象的目标图像中提取与所述目标对象相关联的字符信息和图像特征,基于所字符信息和图像特征执行搜索,能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。In the technical solution of the first search device 500 of the embodiment of the present disclosure, the character information and the image feature associated with the target object are extracted from the target image of the target object to be searched, based on the character information and the image feature. The search can accurately and conveniently search for related object information of the target object, thereby improving the user experience.
图6是示意性图示了根据本公开实施例的用于服务器的第二搜索装置600的框图。该第二搜索装置600可应用于如图1(a)所示的搜索服务器。如图6所示,该第二搜索装置600可包括第一接收单元610、提取单元620、搜索 单元630和第一发送单元640。FIG. 6 is a block diagram schematically illustrating a second search device 600 for a server in accordance with an embodiment of the present disclosure. The second search device 600 is applicable to a search server as shown in FIG. 1(a). As shown in FIG. 6, the second search device 600 may include a first receiving unit 610, an extracting unit 620, and a search. Unit 630 and first transmitting unit 640.
第一接收单元610接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像。所述目标图像是如图1(b)中所示的任一个目标图像。该目标图像包含了要搜索的目标对象的各种信息,包含但不限于品牌名称、对象内容、系列、外观、容量、生产日期等。该目标图像可以是所述用户设备利用其图像采集装置所采集的,也可以是所述用户设备从其它电子设备接收的,所述目标图像的获取方式不构成对本公开实施例的限制。该第一接收单元610对应于图5中的通信单元550,并可以利用射频电路、信号接收电路来实现。The first receiving unit 610 receives a search request including a target image of a target object to be searched for. The target image is any one of the target images as shown in FIG. 1(b). The target image contains various information of the target object to be searched, including but not limited to brand name, object content, series, appearance, capacity, production date, and the like. The target image may be collected by the user equipment by using the image collection device, or may be received by the user equipment from other electronic devices, and the manner in which the target image is acquired does not constitute a limitation on the embodiments of the present disclosure. The first receiving unit 610 corresponds to the communication unit 550 in FIG. 5 and can be implemented by using a radio frequency circuit and a signal receiving circuit.
所述目标图像的图像质量将直接影响搜索结果。例如在图1(b)的目标图像(1)中,如果目标图像模糊而导致不能提取其品牌名称evian、容量等信息,则难以准确地搜索到目标对象。因此,所述目标图像优选地满足预定条件。所述预定条件可以是关于目标图像的亮度的条件、或者是关于所述目标图像的清晰度的条件。当目标图像的亮度大于等于预设亮度阈值时,确定所述目标图像满足预定条件;当目标图像的亮度小于预设亮度阈值时,确定所述目标图像不满足预定条件。或者,当目标图像的清晰度大于等于预设清晰度阈值时,确定所述目标图像满足预定条件;当目标图像的亮度小于预设清晰度阈值时,确定所述目标图像不满足预定条件。所述预设亮度阈值或预设清晰度阈值可以根据所述服务器的处理能力来调整。例如,当服务器的处理能力强时,可以将预设亮度阈值或预设清晰度阈值设置为较低的值;当服务器的处理能力弱时,可以将预设亮度阈值或预设清晰度阈值设置为较高的值。The image quality of the target image will directly affect the search results. For example, in the target image (1) of FIG. 1(b), if the target image is blurred and information such as the brand name evian, capacity, and the like cannot be extracted, it is difficult to accurately search for the target object. Therefore, the target image preferably satisfies a predetermined condition. The predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image. When the brightness of the target image is greater than or equal to the preset brightness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset brightness threshold, it is determined that the target image does not satisfy the predetermined condition. Alternatively, when the sharpness of the target image is greater than or equal to the preset sharpness threshold, it is determined that the target image satisfies a predetermined condition; when the brightness of the target image is less than the preset sharpness threshold, it is determined that the target image does not satisfy the predetermined condition. The preset brightness threshold or the preset definition threshold may be adjusted according to the processing capability of the server. For example, when the processing capability of the server is strong, the preset brightness threshold or the preset definition threshold may be set to a lower value; when the processing capability of the server is weak, the preset brightness threshold or the preset definition threshold may be set. Is a higher value.
提取单元620从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征。所述目标图像所包括的字符信息例如为产品名称、容量、品牌名称、生产日期等,这些字符信息是文字或符号。所述目标图像所包括的图像特诊是图像的颜色分量、各个颜色分量的组成比例等。典型地,采用不同的技术手段来提取目标图像中的字符信息和图像特征。提取单元620可以利用图5中的存储器和处理器来实现。The extracting unit 620 extracts character information and image features associated with the target object from the target image. The character information included in the target image is, for example, a product name, a capacity, a brand name, a date of manufacture, and the like, and the character information is a character or a symbol. The image special diagnosis included in the target image is a color component of the image, a composition ratio of each color component, and the like. Typically, different techniques are employed to extract character information and image features in the target image. Extraction unit 620 can be implemented using the memory and processor of FIG.
对于字符信息,提取单元620可以采用OCR技术或其它技术提取目标图像中的字符信息。在OCR技术中,服务器通过检测目标图像的暗、亮的模式确定其形状,然后用字符识别方法将形状翻译成计算机文字。提取单元620可包括OCR模块,并且可通过如下操作从所述目标图像中提取与所述目标对 象相关联的字符信息:利用光学字符识别OCR从所述目标图像中识别文字和符号;从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。在目标图像中包括丰富的信息,部分信息可能与目标对象的搜索密切相关,例如产品名称、品牌、容量等。然而,目标图像中还可能包括与目标对象的搜索无关的信息,例如成分、安全提醒等,这些信息可能是所有的同类产品都涉及的信息,其不能用于标识目标对象。因此,提取单元620在对目标图像进行字符识别之后,需要从中筛选出搜索目标对象所需的信息,即用于标识所述目标对象的标识字符。For the character information, the extracting unit 620 may extract the character information in the target image using OCR technology or other techniques. In OCR technology, the server determines its shape by detecting the dark and bright patterns of the target image, and then uses the character recognition method to translate the shape into computer text. The extracting unit 620 may include an OCR module, and may extract the target pair from the target image by the following operation Like associated character information: identifying characters and symbols from the target image by using optical character recognition OCR; selecting identification characters for identifying the target object from the recognized characters and symbols as related to the target object Linked character information. Rich information is included in the target image, and some of the information may be closely related to the search of the target object, such as product name, brand, capacity, and the like. However, the target image may also include information that is not related to the search of the target object, such as components, security reminders, etc., which may be information related to all similar products, which cannot be used to identify the target object. Therefore, after performing the character recognition on the target image, the extracting unit 620 needs to filter out information required for searching the target object, that is, the identification character for identifying the target object.
图像特征是指数值化的图像特性表达,例如使用向量来表示图像特性。可以采用现有的或将来出现的各种方式来表示目标图像的图像特征。提取单元620可包括图像特征提取模块,并利用该图像特征提取模块可以执行如下操作中的至少一个来提取图像特征:计算所述目标图像的颜色直方图特征作为所述图像特征;和计算所述目标图像的词袋模型特征作为所述图像特征。也就是说,提取单元620可以采用颜色直方图和词袋模型特征中的至少一个来表示目标图像的图像特征。Image features are index-valued image feature representations, such as using vectors to represent image characteristics. The image features of the target image may be represented in various ways that are present or appearing in the future. The extracting unit 620 may include an image feature extraction module, and the image feature extraction module may perform at least one of: extracting an image feature: calculating a color histogram feature of the target image as the image feature; and calculating the image The word bag model feature of the target image is used as the image feature. That is, the extracting unit 620 may represent the image features of the target image using at least one of a color histogram and a bag model feature.
颜色直方图是图像的颜色特征的一种统计表达,其用于表示不同色彩在整幅目标图像中所占的比例,而并不关心每种色彩所处的空间位置。常用的颜色直方图包括RGB空间颜色直方图,HSV空间颜色直方图以及Lab空间颜色直方图等。在不同的颜色空间中,目标图像的颜色直方图具有不同的数值。词袋模型特征是图像的纹理特征的一种统计表达,其可以有效描述图像的整体和局部特性。例如,提取单元620可通过如下操作获得目标图像的词袋模型特征:从目标图像中提取特征描述符,例如SIFT、HOG等;对于每一个描述符,在预先准确的码本中搜索最相似的聚类中心,统计不同聚类中心在该目标图像中出现的频度,形成一个直方图;对该直方图作归一化处理,从而得到目标图像的词袋模型特征。所述预先准确的码本可通过如下方式获得:从训练图像的集合中随机提取大量的图像描述符,采用聚类算法对这些图像描述符进行聚类而得到多个类别,聚类得到的所有类别即组成码本。A color histogram is a statistical representation of the color characteristics of an image that is used to represent the proportion of different colors in the entire target image, without concern for the spatial location of each color. Common color histograms include RGB spatial color histograms, HSV spatial color histograms, and Lab space color histograms. In different color spaces, the color histogram of the target image has different values. The word bag model feature is a statistical representation of the texture features of an image that can effectively describe the overall and local characteristics of the image. For example, the extracting unit 620 can obtain the word bag model feature of the target image by extracting feature descriptors such as SIFT, HOG, etc. from the target image; for each descriptor, searching for the most similar in the pre-accurate codebook The clustering center counts the frequency of occurrence of different clustering centers in the target image to form a histogram; normalizes the histogram to obtain the word bag model feature of the target image. The pre-accurate codebook can be obtained by randomly extracting a large number of image descriptors from a set of training images, and clustering the image descriptors by using a clustering algorithm to obtain a plurality of categories, and all the clusters are obtained. The category is the codebook.
搜索单元630基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息。例如,搜索单元630基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息。所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。如上所述,假设对象数据 库P中包含N个对象,每一个对象pj可以用三元组{fI(pj),fT(pj),a(pj)}表示,其中,j=1,2,...,N,三元组中的各个分量的含义如上所述。或者,每一个对象pj还可以用二元组{fI(pj),fT(pj)}表示。假设要搜索的目标对象q的目标图像的图像特征和字符信息分别是fI(q)和fT(q),相应地,搜索单元630可通过将提取单元620所提取的字符信息fT(q)和图像特征fI(q)与对象数据库P中的各个备选对象的字符信息fT(pj)和图像特征fI(pj)进行比对来执行搜索。搜索单元630可以利用图5中的存储器和处理器来实现。The search unit 630 searches for related object information associated with the target object based on the character information and the image feature. For example, the search unit 630 searches the related object information from a pre-established object database based on the character information and the image feature. The object database includes image features, character information, and associated information of respective candidate objects. As described above, it is assumed that the object database P contains N objects, and each object p j can be represented by a triplet {f I (p j ), f T (p j ), a(p j )}, where j The meaning of each component in the =1, 2, ..., N, triplet is as described above. Alternatively, each object p j can also be represented by a binary group {f I (p j ), f T (p j )}. It is assumed that the image features and character information of the target image of the target object q to be searched are f I (q) and f T (q), respectively, and accordingly, the search unit 630 can pass the character information f T extracted by the extracting unit 620 ( q) The image feature f I (q) is compared with the character information f T (p j ) of each candidate object in the object database P and the image feature f I (p j ) to perform a search. Search unit 630 can be implemented using the memory and processor of FIG.
搜索单元630可操作如下以搜索与所述目标对象相关联的相关对象信息:基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度;基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度;和基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。The searching unit 630 is operable to search for related object information associated with the target object: calculating image features similar to the target object and each candidate object based on the image features of the target image and the image features of the respective candidate objects Calculating a similarity of character information between the target object and each candidate object based on the character information of the target image and the character information of each candidate object; and based on the image feature similarity and the character information similarity Searching for related object information associated with the target object from the plurality of candidate objects.
作为示例,搜索单元630可以计算所述目标对象q的图像特征fI(q)与各个备选对象pj之间的图像特征fI(pj)之间的余弦相似度sI(q,pj),作为所述图像特征相似度。典型地,搜索单元630可以根据上述的公式(1)计算所述余弦相似度sI(q,pj),并具体可以参见上面结合公式(1)进行的描述。此外,搜索单元630还可以采取目标对象的图像特征与各个备选对象之间的图像特征之间的皮尔森相关系数作为所述图像特征相似度。As an example, the search unit 630 may calculate a cosine similarity s I (q, between the image feature f I (q) of the target object q and the image feature f I (p j ) between the respective candidate objects p j , p j ) as the image feature similarity. Typically, the search unit 630 can calculate the cosine similarity s I (q, p j ) according to the above formula (1), and can be specifically referred to the description above in connection with the formula (1). Further, the search unit 630 may also take the Pearson correlation coefficient between the image feature of the target object and the image feature between the respective candidate objects as the image feature similarity.
搜索单元630可以如下地计算所述字符信息相似度:计算所述目标对象q的字符信息fT(q)与各个备选对象pj的字符信息fT(pj)之间的编辑距离d(fT(q),fT(pj));基于所述编辑距离、所述目标对象的字符信息fT(q)的长度、备选对象的字符信息fT(pj)的长度来计算所述字符信息相似度。编辑距离d(fT(q),fT(pj))是将字符信息fT(q)转成字符信息fT(pj)所需的最少编辑操作次数。字符信息fT(q)的长度例如是该字符信息fT(q)中包括的文字和符号的数目。字符信息fT(pj)的长度例如是该字符信息fT(pj)中包括的文字和符号的数目。搜索单元630例如可通过上面的公式(2)来计算字符信息相似度sT(q,pj)。或者,搜索单元630还可以将所述编辑距离d(fT(q),fT(pj))作为所述字符信息相似度。 The search unit 630 may be calculated as the similarity of the character information: calculating the edit distance d between the target object character information q f T (q) of each candidate character information of the object p j f T (p j) (f T (q), f T (p j )); based on the edit distance, the length of the character information f T (q) of the target object, the length of the character information f T (p j ) of the candidate object To calculate the similarity of the character information. The edit distance d(f T (q), f T (p j )) is the minimum number of editing operations required to convert the character information f T (q) into the character information f T (p j ). Character information length f T (q), for example, the number of characters and symbols included in the character information f T (q) in. Character information length f T (p j), for example, the number of characters and symbols included in the character information f T (p j) in. The search unit 630 can calculate the character information similarity s T (q, p j ), for example, by the above formula (2). Alternatively, the search unit 630 may also use the edit distance d(f T (q), f T (p j )) as the character information similarity.
搜索单元630可以按照如下的方式搜索相关对象信息:按照图像特征相似度的递减顺序从所述多个备选对象中选R1个备选对象;按照字符信息相似度的递减顺序从所述多个备选对象中选择R2个备选对象;将与所述选择的R1个备选对象和R2个备选对象对应的信息作为与所述目标对象相关联的相关对象信息。R1是小于N的自然数。R2也是小于N的自然数。The searching unit 630 may search for related object information by selecting R1 candidate objects from the plurality of candidate objects in descending order of image feature similarity; from the plurality of devices in descending order of character information similarity R2 candidate objects are selected among the selected objects; information corresponding to the selected R1 candidate objects and R2 candidate objects is used as related object information associated with the target object. R1 is a natural number smaller than N. R2 is also a natural number smaller than N.
替换地,搜索单元630还可以按照如下的方式基于图像特征相似度和所述字符信息相似度来搜索相关对象信息:对所述图像特征相似度和字符信息相似度进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度;按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象;将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息。Alternatively, the searching unit 630 may further search for related object information based on the image feature similarity and the character information similarity in such a manner that the image feature similarity and the character information similarity are weighted and averaged to obtain the target. An average similarity between the object and each of the candidate objects; selecting a predetermined number of candidate objects from the plurality of candidate objects in descending order of the average similarity; using information corresponding to the selected candidate object as Relevant object information associated with the target object.
搜索单元630可以利用上述的公式(3)计算所述平均相似度,并具体参见上面结合公式(3)进行的描述。在计算了目标对象与各个备选对象之间的平均相似度s(q,pj),j=1,2,...,N之后,搜索单元630对这N个平均相似度可以按照递减的顺序排列,并例如选择平均相似度靠前的预定数目R个备选对象,将与所选择的R个备选对象对应的信息作为与所述目标对象相关联的相关对象信息,即搜索结果。R是一个可配置的参数,其典型值可设为10、20和100等。The search unit 630 can calculate the average similarity using the above formula (3), and specifically refer to the description made above in connection with the formula (3). After calculating the average similarity s(q, p j ) between the target object and each candidate object, j=1, 2, . . . , N, the search unit 630 may decrement the N average similarities. Arranging in order, and selecting, for example, a predetermined number of R candidate objects with an average degree of similarity, and information corresponding to the selected R candidate objects as related object information associated with the target object, ie, search result . R is a configurable parameter, and its typical value can be set to 10, 20, 100, and so on.
第一发送单元640发送所述相关对象信息,即将所搜索到的相关对象信息作为搜索结果发送给用户设备。第一发送单元640可以利用互联网、局域网等各种网络或通信技术发送所述相关对象信息。相关对象信息例如是所述R个备选对象的图片、文字描述以及关联信息,或者是上述的R1加上R2备选对象的图片、文字描述以及关联信息。用户设备在接收到相关对象信息之后,可以将所述相关对象信息显示在用户设备的屏幕上,以供用户查看。第一发送单元640可对应于图5中的通信单元550,并可以利用射频电路、信号发送电路来实现。The first sending unit 640 sends the related object information, that is, the searched related object information is sent to the user equipment as a search result. The first transmitting unit 640 can transmit the related object information by using various networks or communication technologies such as the Internet and a local area network. The related object information is, for example, a picture, a text description, and associated information of the R candidate objects, or a picture, a text description, and associated information of the R1 plus R2 candidate object described above. After receiving the related object information, the user equipment may display the related object information on the screen of the user equipment for the user to view. The first transmitting unit 640 may correspond to the communication unit 550 in FIG. 5 and may be implemented by using a radio frequency circuit and a signal transmitting circuit.
在根据本公开实施例的用于服务器的第二搜索装置600的技术方案中,通过从要搜索的目标对象的目标图像中提取与所述目标对象相关联的字符信息和图像特征,基于所字符信息和图像特征执行搜索,能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。此外,通过自动识别目标图像中包含的字符信息,免去用户手动输入关键字的步骤。 In the technical solution of the second search device 600 for a server according to an embodiment of the present disclosure, the character information and the image feature associated with the target object are extracted from the target image of the target object to be searched, based on the character The information and image feature performs a search, and can accurately and conveniently search for related object information of the target object, thereby improving the user experience. In addition, the step of manually inputting a keyword by the user is eliminated by automatically recognizing the character information contained in the target image.
图7是示意性图示了根据本公开实施例的用户设备700的框图。该用户设备700对应于图1(a)所示的用户设备。如图7所示,该用户设备700可包括:图像采集单元710、判断单元720、第二发送单元730、和第二接收单元740。FIG. 7 is a block diagram that schematically illustrates a user device 700 in accordance with an embodiment of the present disclosure. The user equipment 700 corresponds to the user equipment shown in FIG. 1(a). As shown in FIG. 7, the user equipment 700 may include an image acquisition unit 710, a determination unit 720, a second transmission unit 730, and a second reception unit 740.
图像采集单元710采集要搜索的目标对象的目标图像。图像采集单元710典型地设置于所述用户设备中。例如,用户的蓝月亮洗衣液快用尽了,并期望购买该蓝月亮洗衣液,则用户利用图像采集单元710对现有的蓝月亮洗衣液进行图像采集。在图7中将图像采集单元710图示为包括于用户设备中,但是该图像采集单元710还可位于所述用户设备外部,而耦接到所述用户设备,并能够接收用户设备的指令,和将所采集的目标图像传送给用户设备。图像采集装置与用户设备的位置关系不构成对本公开实施例的限制。图像采集单元710可以是摄像头、照相机等。图像采集单元710对应于图5的图像采集单元560。The image acquisition unit 710 collects a target image of the target object to be searched for. Image acquisition unit 710 is typically disposed in the user device. For example, if the user's blue moon laundry liquid is exhausted and it is desired to purchase the blue moon laundry liquid, the user uses the image acquisition unit 710 to perform image acquisition on the existing blue moon laundry liquid. The image acquisition unit 710 is illustrated as being included in the user equipment in FIG. 7, but the image acquisition unit 710 may also be external to the user equipment, coupled to the user equipment, and capable of receiving instructions of the user equipment, And transmitting the acquired target image to the user equipment. The positional relationship between the image capture device and the user equipment does not constitute a limitation on the embodiments of the present disclosure. The image acquisition unit 710 can be a camera, a camera, or the like. The image acquisition unit 710 corresponds to the image acquisition unit 560 of FIG.
判断单元720判断所述目标图像是否满足预定条件。由于服务器要从所述目标图像中提取信息来搜索目标对象,所以所述目标图像的图像质量将直接影响搜索结果。判断单元720可以利用预定条件对所述目标图像做出要求。所述预定条件可以是关于目标图像的亮度的条件、或者是关于所述目标图像的清晰度的条件。判断单元720可以利用图5中的存储器和处理器来实现。The judging unit 720 judges whether or not the target image satisfies a predetermined condition. Since the server is to extract information from the target image to search for the target object, the image quality of the target image will directly affect the search result. The determining unit 720 can make a request for the target image using a predetermined condition. The predetermined condition may be a condition regarding the brightness of the target image or a condition regarding the sharpness of the target image. The determining unit 720 can be implemented using the memory and processor in FIG.
作为判断单元720基于目标图像的亮度来判断是否满足预定条件的示例,判断单元720可以将所采集的目标图像转换为HSL颜色空间的图像数据,该HSL颜色空间的图像数据中包括了亮度信息。然后,判断单元720统计所述HSL颜色空间的图像数据中的所有像素的光照分量(即L分量)的平均值
Figure PCTCN2015082628-appb-000011
并将其与预定亮度阈值TL进行比较。当用于该目标图像的光照分量的平均值大于等于预定亮度阈值TL时,判断单元720可以判断所述目标图像满足预定条件。当用于该目标图像的光照分量的平均值
Figure PCTCN2015082628-appb-000013
小于预定亮度阈值TL时,判断单元720可以判断所述目标图像不满足预定条件。预定亮度阈值TL典型地为64。或者,判断单元720还可以借助于照度计测量图像采集环境中的光照条件来间接地判断目标图像的质量。例如,所述用户设备700还可以包括用于测量所述目标对象的光照参数的照度计750,判断单元720与照度计通信以确定在采集所述目标图像过程中的光照参数;当所述光照参数大于等于预设照度时,确定所述目标图像满足预定条件;当所述光照参数小于预设照 度时,确定所述目标图像不满足预定条件。
As an example in which the determination unit 720 determines whether the predetermined condition is satisfied based on the brightness of the target image, the determination unit 720 may convert the acquired target image into image data of an HSL color space in which the luminance information is included in the image data of the HSL color space. Then, the judging unit 720 counts the average value of the illumination components (ie, the L component) of all the pixels in the image data of the HSL color space.
Figure PCTCN2015082628-appb-000011
And comparing it to a predetermined brightness threshold T L . Average value of the illumination component used for the target image When the predetermined brightness threshold T L is greater than or equal to, the determination unit 720 may determine that the target image satisfies a predetermined condition. Average value of the illumination component used for the target image
Figure PCTCN2015082628-appb-000013
When it is less than the predetermined brightness threshold T L , the determination unit 720 may determine that the target image does not satisfy the predetermined condition. The predetermined brightness threshold T L is typically 64. Alternatively, the determining unit 720 can also indirectly determine the quality of the target image by measuring the lighting conditions in the image capturing environment by means of the illuminometer. For example, the user equipment 700 may further include an illuminometer 750 for measuring an illumination parameter of the target object, the determination unit 720 communicating with the illuminometer to determine an illumination parameter in the process of acquiring the target image; When the parameter is greater than or equal to the preset illuminance, it is determined that the target image satisfies a predetermined condition; when the illumination parameter is less than the preset illuminance, it is determined that the target image does not satisfy the predetermined condition.
作为判断单元720基于目标图像的清晰度来判断是否满足预定条件的示例,判断单元720可以利用预定算法(例如,Canny算法)提取所采集的目标图像的边缘,计算所述目标图像中位于边缘的各个像素点的梯度G,然后进一步计算目标图像中所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000014
当该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000015
大于等于预设梯度阈值TG时,判断单元720可以判断所述目标图像满足预定条件。当该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000016
小于预设梯度阈值TG时,判断单元720可以判断所述目标图像不满足预定条件。该预设梯度阈值TG典型地为100。
As an example in which the determination unit 720 determines whether the predetermined condition is satisfied based on the sharpness of the target image, the determination unit 720 may extract an edge of the acquired target image using a predetermined algorithm (for example, the Canny algorithm), and calculate the edge located in the target image. The gradient G of each pixel, and then further calculate the average of the gradients of all the pixel points at the edge in the target image
Figure PCTCN2015082628-appb-000014
The average of the gradients of all pixel points at the edge of the target image
Figure PCTCN2015082628-appb-000015
When it is greater than or equal to the preset gradient threshold T G , the determining unit 720 may determine that the target image satisfies a predetermined condition. The average of the gradients of all pixel points at the edge of the target image
Figure PCTCN2015082628-appb-000016
When it is less than the preset gradient threshold T G , the determination unit 720 may determine that the target image does not satisfy the predetermined condition. The preset gradient threshold T G is typically 100.
上述的预定亮度阈值TL或预设梯度阈值TG可以根据执行搜索的服务器的处理能力来调整。例如,当服务器的处理能力强时,可以将预定亮度阈值TL或预设梯度阈值TG设置为较低的值;当服务器的处理能力弱时,可以将预定亮度阈值TL或预设梯度阈值TG设置为较高的值。The predetermined brightness threshold T L or the preset gradient threshold T G described above may be adjusted according to the processing capability of the server performing the search. For example, when the processing capability of the server is strong, the predetermined brightness threshold T L or the preset gradient threshold T G may be set to a lower value; when the processing capability of the server is weak, the predetermined brightness threshold T L or a preset gradient may be used. The threshold T G is set to a higher value.
第二发送单元730在所述目标图像满足预定条件时,发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像。如图5或图6所示的搜索装置从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征,并基于所述字符信息和所述图像特征执行搜索。例如,图6中的第一接收单元610在接收到该搜索请求之后,提取单元620利用其中的图像特征提取模块从所述目标图像中提取与所述目标对象相关联的图像特征,并利用其中的COR模块从所述目标图像中提取与所述目标对象相关联的字符信息;搜索单元630基于所述字符信息和图像特征从对象数据库中搜索与所述目标对象相关联的相关对象信息;第二发送单元640将所搜索的相关对象信息发送给用户设备。由于目标图像的亮度或清晰度较好,所以在服务器中能够准确地提取字符信息和图像特征,从而保证了搜索的准确度。第二发送单元730对应于图5中的收发单元550,并可以利用射频电路、信号发送电路来实现。The second transmitting unit 730 issues a search request for the target object when the target image satisfies a predetermined condition, the search request including the target image. A search device as shown in FIG. 5 or FIG. 6 extracts character information and image features associated with the target object from the target image, and performs a search based on the character information and the image features. For example, after receiving the search request, the first receiving unit 610 in FIG. 6 extracts an image feature associated with the target object from the target image by using an image feature extraction module therein, and utilizes the image feature The COR module extracts character information associated with the target object from the target image; the search unit 630 searches for the related object information associated with the target object from the object database based on the character information and the image feature; The two transmitting unit 640 transmits the searched related object information to the user equipment. Since the brightness or sharpness of the target image is good, the character information and the image feature can be accurately extracted in the server, thereby ensuring the accuracy of the search. The second transmitting unit 730 corresponds to the transceiver unit 550 in FIG. 5, and can be implemented by using a radio frequency circuit and a signal transmitting circuit.
当判断单元720判断目标图像不满足预定条件时,意味着所采集的目标图像不符合要求,可能导致难以准确地提取其中的字符信息和图像特征。此时,用户设备700还可以包括输出单元,该输出单元用于输出重拍提示消息,以提示用户操作图像采集装置来采集要搜索的目标对象的目标图像。在该重拍提示消息中,还可以具体列出目标图像不满足预定条件的原因。例如,在目标图像的光照分量的平均值
Figure PCTCN2015082628-appb-000017
小于预定亮度阈值TL时,可以在重拍提示消 息中指出亮度不够;在该目标图像的所有位于边缘的像素点的梯度的平均值
Figure PCTCN2015082628-appb-000018
小于预设梯度阈值TG时,可以在重拍提示消息中指出清晰度不够。这样,可以根据所述重拍提示消息调整目标图像的拍摄,直到获取满足所述预定条件的目标图像。或者,当在判断单元720判断目标图像不满足预定条件时,可以自动地调整图像采集单元710的设置参数,直到获取满足所述预定条件的目标图像。
When the judging unit 720 judges that the target image does not satisfy the predetermined condition, it means that the acquired target image does not meet the requirement, which may result in difficulty in accurately extracting the character information and the image feature therein. At this time, the user equipment 700 may further include an output unit for outputting a retake prompt message to prompt the user to operate the image collection device to collect the target image of the target object to be searched. In the retake prompt message, it is also possible to specifically list the reason why the target image does not satisfy the predetermined condition. For example, the average of the illumination components of the target image
Figure PCTCN2015082628-appb-000017
When less than the predetermined brightness threshold T L , it may be indicated that the brightness is insufficient in the re-scuing message; the average of the gradients of all the pixel points located at the edge of the target image
Figure PCTCN2015082628-appb-000018
When the preset gradient threshold T G is smaller than the preset gradient threshold T G , it is indicated that the sharpness is insufficient in the replay prompt message. In this way, the shooting of the target image can be adjusted according to the replay prompt message until the target image that satisfies the predetermined condition is acquired. Alternatively, when the determination unit 720 determines that the target image does not satisfy the predetermined condition, the setting parameters of the image acquisition unit 710 may be automatically adjusted until the target image satisfying the predetermined condition is acquired.
第二接收单元740接收与所述目标对象相关联的相关对象信息。第二发送单元730向服务器发出搜索请求之后,服务器执行结合图2和3描述的搜索方法,并得到与目标对象相关联的相关对象信息。对应地,第二接收单元740接收与目标对象相关联的相关对象信息。该相关对象信息是基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到的。第二接收单元740可以通过互联网、局域网等各种网络或通信技术从服务器接收所述相关对象信息。相关对象信息例如是多个备选对象的图片、文字描述以及关联信息。该关联信息例如是价格、销量、用户评价、宣传视频以及超链接等,其有助于用户在多个备选对象中执行选择操作。第二接收单元740在接收到相关对象信息之后,用户设备700可以将所述相关对象信息显示在用户设备的屏幕上,以供用户查看。第二接收单元740对应于图5中的收发单元550,并可以利用射频电路、信号接收电路来实现。The second receiving unit 740 receives related object information associated with the target object. After the second transmitting unit 730 issues a search request to the server, the server performs the search method described in connection with FIGS. 2 and 3, and obtains related object information associated with the target object. Correspondingly, the second receiving unit 740 receives related object information associated with the target object. The related object information is obtained based on character information and image feature search associated with the target object extracted from the target image. The second receiving unit 740 can receive the related object information from the server through various networks or communication technologies such as the Internet, a local area network, and the like. The related object information is, for example, a picture of a plurality of candidate objects, a text description, and associated information. The associated information is, for example, price, sales volume, user rating, promotional video, hyperlinks, etc., which assists the user in performing selection operations among a plurality of candidate objects. After receiving the related object information, the second receiving unit 740 may display the related object information on the screen of the user equipment for the user to view. The second receiving unit 740 corresponds to the transceiver unit 550 in FIG. 5 and can be implemented by using a radio frequency circuit and a signal receiving circuit.
因此,在用户设备拍摄目标对象的图像的过程中,用户设备可以自动计算图像的光照条件和清晰程度。如果图像的光照条件和清晰程度达到要求,则允许用户设备基于所采集的目标图像发出搜索请求。如果图像的光照条件和清晰程度不能达到要求,则提示或自动指令用户设备重新拍摄,直到获取达到要求的目标图像。Therefore, in the process of the user device taking an image of the target object, the user device can automatically calculate the illumination condition and the degree of clarity of the image. If the lighting conditions and clarity of the image meet the requirements, the user device is allowed to issue a search request based on the acquired target image. If the lighting conditions and clarity of the image do not meet the requirements, the user equipment is prompted or automatically instructed to re-shoot until the desired target image is obtained.
在根据本公开实施例的用户设备的技术方案中,在所采集的目标对象的目标图像满足预定条件时,基于该目标图像发出搜索请求,使能够准确且便利地搜索目标对象的相关对象信息,从而提高用户的使用体验。In the technical solution of the user equipment according to the embodiment of the present disclosure, when the target image of the acquired target object satisfies a predetermined condition, the search request is issued based on the target image, so that the related object information of the target object can be accurately and conveniently searched, Thereby improving the user experience.
在上面描述了根据本公开实施例的第一搜索装置和第二搜索装置之后,包括第一搜索装置和第二搜索装置中任一个的电子设备或服务器也都处于本公开的范围之内。After the first search device and the second search device according to an embodiment of the present disclosure are described above, an electronic device or server including any of the first search device and the second search device is also within the scope of the present disclosure.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结 合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Those of ordinary skill in the art will appreciate that the elements and algorithm steps of the various examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in computer software and electronic hardware. Come together to achieve. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods for implementing the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the present invention.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个设备,或一些特征可以忽略,或不执行。In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or Can be integrated into another device, or some features can be ignored or not executed.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。 The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. It should be covered by the scope of the present invention. Therefore, the scope of the invention should be determined by the scope of the appended claims.

Claims (26)

  1. 一种搜索方法,应用于一服务器,该搜索方法包括:A search method is applied to a server, the search method includes:
    接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;Receiving a search request including a target image of a target object to be searched;
    从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;Extracting character information and image features associated with the target object from the target image;
    基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息;Searching for related object information associated with the target object based on the character information and the image feature;
    发送所述相关对象信息。Send the related object information.
  2. 根据权利要求1的搜索方法,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征包括:The search method according to claim 1, wherein extracting character information and image features associated with the target object from the target image comprises:
    利用光学字符识别OCR从所述目标图像中识别文字和符号;Identifying text and symbols from the target image using optical character recognition OCR;
    从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。An identification character for identifying the target object is selected from the recognized characters and symbols as character information associated with the target object.
  3. 根据权利要求1的搜索方法,其中,所述基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息包括:The search method according to claim 1, wherein said searching for related object information associated with said target object based on said character information and said image feature comprises:
    基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息,Searching for the related object information from a pre-established object database based on the character information and the image feature,
    其中,所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。The object database includes image features, character information, and associated information of each candidate object.
  4. 根据权利要求3的搜索方法,其中,所述基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息包括:The search method according to claim 3, wherein said searching for said related object information from a pre-established object database based on said character information and said image feature comprises:
    基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度;Calculating image feature similarity between the target object and each candidate object based on image features of the target image and image features of the respective candidate objects;
    基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度;Calculating a similarity of character information between the target object and each candidate object based on the character information of the target image and the character information of each candidate object;
    基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。Searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
  5. 根据权利要求4的搜索方法,其中,所述基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息包括:The search method according to claim 4, wherein said searching for related object information associated with said target object from said plurality of candidate objects based on said image feature similarity and said character information similarity comprises:
    对所述图像特征相似度和字符信息相似度进行加权平均来获得所述目标 对象与各个备选对象之间的平均相似度;Performing a weighted average of the image feature similarity and the character information similarity to obtain the target The average similarity between the object and each candidate;
    按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象;Selecting a predetermined number of candidate objects from the plurality of candidate objects in descending order of the average similarity;
    将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息。Information corresponding to the selected candidate object is used as related object information associated with the target object.
  6. 根据权利要求4的搜索方法,其中,所述基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度包括:计算所述目标对象的图像特征与各个备选对象之间的图像特征之间的余弦相似度,作为所述图像特征相似度。The search method according to claim 4, wherein the calculating the image feature similarity between the target object and each candidate object based on the image feature of the target image and the image feature of each candidate object comprises: calculating the target object The cosine similarity between the image features and the image features between the respective candidate objects is taken as the image feature similarity.
  7. 根据权利要求4的搜索方法,其中,所述基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度包括:The search method according to claim 4, wherein the calculating the similarity of the character information between the target object and each of the candidate objects based on the character information of the target image and the character information of each candidate object comprises:
    计算所述目标对象的字符信息与各个备选对象的字符信息之间的编辑距离;Calculating an edit distance between the character information of the target object and the character information of each candidate object;
    基于所述编辑距离、所述目标对象的字符信息的长度、备选对象的字符信息的长度来计算所述字符信息相似度。The character information similarity is calculated based on the edit distance, the length of the character information of the target object, and the length of the character information of the candidate object.
  8. 根据权利要求1至7中任一项的搜索方法,其中,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征包括如下操作中的至少一个:The search method according to any one of claims 1 to 7, wherein said extracting character information and image features associated with said target object from said target image comprises at least one of the following operations:
    计算所述目标图像的颜色直方图特征作为所述图像特征;和Calculating a color histogram feature of the target image as the image feature; and
    计算所述目标图像的词袋模型特征作为所述图像特征。A word bag model feature of the target image is calculated as the image feature.
  9. 根据权利要求1的搜索方法,其中,所述目标图像满足预定条件。The search method according to claim 1, wherein said target image satisfies a predetermined condition.
  10. 一种搜索方法,应用于一用户设备,该搜索方法包括:A search method is applied to a user equipment, and the search method includes:
    采集要搜索的目标对象的目标图像;Collecting a target image of the target object to be searched;
    判断所述目标图像是否满足预定条件;Determining whether the target image satisfies a predetermined condition;
    在所述目标图像满足预定条件时,发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像;Sending a search request for the target object when the target image satisfies a predetermined condition, the search request including the target image;
    接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。A related object information associated with the target object is received, wherein the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
  11. 根据权利要求10的搜索方法,其中所述判断所述目标图像是否满足 预定条件包括:The search method according to claim 10, wherein said judging whether said target image is satisfied The booking conditions include:
    确定在采集所述目标图像过程中的光照参数;Determining an illumination parameter during the process of acquiring the target image;
    当所述光照参数大于等于预设照度时,确定所述目标图像满足预定条件。When the illumination parameter is greater than or equal to the preset illumination, it is determined that the target image satisfies a predetermined condition.
  12. 根据权利要求10的搜索方法,其中所述判断所述目标图像是否满足预定条件包括:The search method according to claim 10, wherein said determining whether said target image satisfies predetermined conditions comprises:
    确定所采集的目标图像的边缘的像素点的平均梯度;Determining an average gradient of pixels of an edge of the acquired target image;
    当所述目标图像的边缘的像素点的平均梯度小于预设梯度阈值时,确定所述目标图像满足预定条件。When the average gradient of the pixel points of the edge of the target image is less than the preset gradient threshold, it is determined that the target image satisfies a predetermined condition.
  13. 一种搜索装置,应用于一服务器,该搜索装置包括:A search device is applied to a server, the search device comprising:
    收发器,接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;a transceiver receiving a search request, the search request including a target image of a target object to be searched;
    处理器;processor;
    存储器;和Memory; and
    存储在所述存储器中的计算机程序指令,在所述计算机程序指令被所述处理器运行时执行以下步骤:Computer program instructions stored in the memory perform the following steps when the computer program instructions are executed by the processor:
    从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;Extracting character information and image features associated with the target object from the target image;
    基于所述字符信息和图像特征搜索与所述目标对象相关联的相关对象信息;Searching for related object information associated with the target object based on the character information and image features;
    将所搜索的相关对象信息提供给所述收发器,以发送出去。The searched related object information is provided to the transceiver for transmission.
  14. 根据权利要求13的搜索装置,其中,所述从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征包括:The search apparatus according to claim 13, wherein said extracting character information and image features associated with said target object from said target image comprises:
    利用光学字符识别OCR从所述目标图像中识别文字和符号;Identifying text and symbols from the target image using optical character recognition OCR;
    从所识别的文字和符号中选择用于标识所述目标对象的标识字符,作为与所述目标对象相关联的字符信息。An identification character for identifying the target object is selected from the recognized characters and symbols as character information associated with the target object.
  15. 根据权利要求13的搜索装置,其中,所述基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息包括:The search apparatus according to claim 13, wherein said searching for related object information associated with said target object based on said character information and said image feature comprises:
    基于所述字符信息和所述图像特征从预先建立的对象数据库中搜索所述相关对象信息,Searching for the related object information from a pre-established object database based on the character information and the image feature,
    其中,所述对象数据库包括各个备选对象的图像特征、字符信息和关联信息。The object database includes image features, character information, and associated information of each candidate object.
  16. 根据权利要求15的搜索装置,其中,所述基于所述字符信息和所述 图像特征从预先建立的对象数据库中搜索所述相关对象信息包括:The search device according to claim 15, wherein said said based on said character information and said Searching for the related object information from the pre-established object database includes:
    基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度;Calculating image feature similarity between the target object and each candidate object based on image features of the target image and image features of the respective candidate objects;
    基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度;Calculating a similarity of character information between the target object and each candidate object based on the character information of the target image and the character information of each candidate object;
    基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息。Searching for related object information associated with the target object from the plurality of candidate objects based on the image feature similarity and the character information similarity.
  17. 根据权利要求16的搜索装置,其中,所述基于所述图像特征相似度和所述字符信息相似度来从所述多个备选对象中搜索与所述目标对象相关联的相关对象信息包括:The search apparatus according to claim 16, wherein said searching for related object information associated with said target object from said plurality of candidate objects based on said image feature similarity and said character information similarity comprises:
    对所述图像特征相似度和字符信息相似度进行加权平均来获得所述目标对象与各个备选对象之间的平均相似度;Performing a weighted average of the image feature similarity and the character information similarity to obtain an average similarity between the target object and each candidate object;
    按照所述平均相似度的递减顺序从所述多个备选对象中选择预定数目备选对象;Selecting a predetermined number of candidate objects from the plurality of candidate objects in descending order of the average similarity;
    将与所选择的备选对象对应的信息作为与所述目标对象相关联的相关对象信息。Information corresponding to the selected candidate object is used as related object information associated with the target object.
  18. 根据权利要求16的搜索装置,其中,所述基于目标图像的图像特征和各个备选对象的图像特征计算所述目标对象与各个备选对象之间的图像特征相似度包括:计算所述目标对象的图像特征与各个备选对象之间的图像特征之间的余弦相似度,作为所述图像特征相似度。The search device according to claim 16, wherein the calculating the image feature similarity between the target object and each candidate object based on the image feature of the target image and the image feature of each candidate object comprises: calculating the target object The cosine similarity between the image features and the image features between the respective candidate objects is taken as the image feature similarity.
  19. 根据权利要求16的搜索装置,其中,所述基于目标图像的字符信息和各个备选对象的字符信息计算所述目标对象与各个备选对象之间的字符信息相似度包括:The search apparatus according to claim 16, wherein the character information similarity between the target object and each candidate object is calculated based on the character information of the target image and the character information of each candidate object, including:
    计算所述目标对象的字符信息与各个备选对象的字符信息之间的编辑距离;Calculating an edit distance between the character information of the target object and the character information of each candidate object;
    基于所述编辑距离、所述目标对象的字符信息的长度、各个备选对象的字符信息的长度来计算所述字符信息相似度。The character information similarity is calculated based on the edit distance, the length of the character information of the target object, and the length of the character information of each candidate object.
  20. 根据权利要求13至19中任一项的搜索装置,其中,从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征包括如下操作中的至少一个:The search device according to any one of claims 13 to 19, wherein extracting character information and image features associated with the target object from the target image comprises at least one of the following operations:
    计算所述目标图像的颜色直方图特征作为所述图像特征;和 Calculating a color histogram feature of the target image as the image feature; and
    计算所述目标图像的词袋模型特征作为所述图像特征。A word bag model feature of the target image is calculated as the image feature.
  21. 根据权利要求13的搜索装置,其中,所述目标图像满足预定条件。The search device according to claim 13, wherein said target image satisfies a predetermined condition.
  22. 一种用户设备,包括:A user equipment comprising:
    图像采集器,用于采集要搜索的目标对象的目标图像;An image collector for collecting a target image of a target object to be searched;
    处理器,用于判断所述目标图像是否满足预定条件;a processor, configured to determine whether the target image meets a predetermined condition;
    收发器,在所述目标图像满足预定条件时,所述发出针对所述目标对象的搜索请求,该搜索请求包括所述目标图像,并接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。a transceiver, when the target image satisfies a predetermined condition, the issuing a search request for the target object, the search request including the target image, and receiving related object information associated with the target object, where The related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
  23. 根据权利要求22的用户设备,其中,A user equipment according to claim 22, wherein
    所述用户设备还包括用于测量所述目标对象的光照参数的照度计,The user equipment further includes an illuminometer for measuring an illumination parameter of the target object,
    所述处理器指令所述照度计在图像采集器采集目标图像的过程中测量所述目标对象的光照参数,并且在所述光照参数大于等于预设照度时确定所述目标图像满足预定条件。The processor instructs the illuminometer to measure an illumination parameter of the target object in a process of acquiring an image of the target by the image collector, and determining that the target image satisfies a predetermined condition when the illumination parameter is greater than or equal to a preset illuminance.
  24. 根据权利要求22的用户设备,其中,所述处理器分析目标图像以确定其边缘的像素点的平均梯度,并且在所述目标图像的边缘的像素点的平均梯度小于预设梯度阈值时,确定所述目标图像满足预定条件。The user equipment according to claim 22, wherein said processor analyzes the target image to determine an average gradient of pixel points of the edge thereof, and determines when the average gradient of the pixel points of the edge of the target image is less than a preset gradient threshold The target image satisfies a predetermined condition.
  25. 一种用于搜索对象的计算机程序产品,包括计算机可读存储介质,在所述计算机可读存储介质上存储了计算机程序指令,所述计算机程序指令由处理器执行以使得所述处理器:A computer program product for searching for an object, comprising a computer readable storage medium having stored thereon computer program instructions, the computer program instructions being executed by a processor to cause the processor to:
    接收搜索请求,该搜索请求包括要搜索的目标对象的目标图像;Receiving a search request including a target image of a target object to be searched;
    从所述目标图像中提取与所述目标对象相关联的字符信息和图像特征;Extracting character information and image features associated with the target object from the target image;
    基于所述字符信息和所述图像特征搜索与所述目标对象相关联的相关对象信息;Searching for related object information associated with the target object based on the character information and the image feature;
    发送所述相关对象信息。Send the related object information.
  26. 一种用于搜索对象的计算机程序产品,包括计算机可读存储介质,在所述计算机可读存储介质上存储了计算机程序指令,所述计算机程序指令由处理器执行以使得所述处理器:A computer program product for searching for an object, comprising a computer readable storage medium having stored thereon computer program instructions, the computer program instructions being executed by a processor to cause the processor to:
    利用图像采集器采集要搜索的目标对象的目标图像;Acquiring a target image of the target object to be searched by using an image collector;
    判断所述目标图像是否满足预定条件;Determining whether the target image satisfies a predetermined condition;
    在所述目标图像满足预定条件时,利用收发器发出针对所述目标对象的 搜索请求,该搜索请求包括所述目标图像;以及Transmitting, by the transceiver, the target object when the target image satisfies a predetermined condition Searching for a request, the search request including the target image;
    利用收发器接收与所述目标对象相关联的相关对象信息,其中所述相关对象信息基于从所述目标图像中提取的与所述目标对象相关联的字符信息和图像特征搜索得到。 The related object information associated with the target object is received by the transceiver, wherein the related object information is obtained based on character information and image feature search associated with the target object extracted from the target image.
PCT/CN2015/082628 2015-06-29 2015-06-29 Search method, search apparatus, user equipment, and computer program product WO2017000109A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201580000313.XA CN105518678B (en) 2015-06-29 2015-06-29 Searching method, searcher and user equipment
PCT/CN2015/082628 WO2017000109A1 (en) 2015-06-29 2015-06-29 Search method, search apparatus, user equipment, and computer program product

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/082628 WO2017000109A1 (en) 2015-06-29 2015-06-29 Search method, search apparatus, user equipment, and computer program product

Publications (1)

Publication Number Publication Date
WO2017000109A1 true WO2017000109A1 (en) 2017-01-05

Family

ID=55725016

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/082628 WO2017000109A1 (en) 2015-06-29 2015-06-29 Search method, search apparatus, user equipment, and computer program product

Country Status (2)

Country Link
CN (1) CN105518678B (en)
WO (1) WO2017000109A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116737982A (en) * 2023-08-11 2023-09-12 拓锐科技有限公司 Intelligent screening management system for picture search results based on data analysis

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107315181B (en) * 2016-04-26 2020-11-06 天津远度科技有限公司 Obstacle avoidance method and device and aircraft
CN107403128B (en) * 2016-05-20 2020-12-29 株式会社理光 Article identification method and device
CN107403179B (en) * 2016-05-20 2020-10-23 株式会社理光 Registration method and device for article packaging information
CN106777177A (en) * 2016-12-22 2017-05-31 百度在线网络技术(北京)有限公司 Search method and device
WO2018145577A1 (en) 2017-02-08 2018-08-16 腾讯科技(深圳)有限公司 Facial-expression recommendation method and device
CN108401005B (en) * 2017-02-08 2021-05-14 腾讯科技(深圳)有限公司 Expression recommendation method and device
TWI650656B (en) * 2017-05-26 2019-02-11 虹光精密工業股份有限公司 Method of searching an image file in a computer system, related image file searching device, and related computer system
CN109213332B (en) * 2017-06-29 2022-11-08 北京搜狗科技发展有限公司 Input method and device of expression picture
CN107679070B (en) * 2017-08-22 2021-10-01 科大讯飞股份有限公司 Intelligent reading recommendation method and device and electronic equipment
CN107569848B (en) * 2017-08-30 2020-08-04 武汉斗鱼网络科技有限公司 Game classification method and device and electronic equipment
CN107862239A (en) * 2017-09-15 2018-03-30 广州唯品会研究院有限公司 A kind of combination text carries out the method and its device of picture recognition with picture
CN107798070A (en) * 2017-09-26 2018-03-13 平安普惠企业管理有限公司 A kind of web data acquisition methods and terminal device
CN107633086A (en) * 2017-09-27 2018-01-26 北京萌哥玛丽科技有限公司 It is a kind of that video or control method and system that audio is searched are realized by picture
CN107944022A (en) * 2017-12-11 2018-04-20 努比亚技术有限公司 Picture classification method, mobile terminal and computer-readable recording medium
CN108563663A (en) * 2018-01-04 2018-09-21 出门问问信息科技有限公司 Picture recommendation method, device, equipment and storage medium
CN108960234A (en) * 2018-06-13 2018-12-07 山东师范大学 A kind of Logo recognition methods and system based on bag of words
CN110532413B (en) * 2019-07-22 2023-08-08 平安科技(深圳)有限公司 Information retrieval method and device based on picture matching and computer equipment
CN111782841A (en) * 2019-11-27 2020-10-16 北京沃东天骏信息技术有限公司 Image searching method, device, equipment and computer readable medium
CN117216308B (en) * 2023-11-09 2024-04-26 天津华来科技股份有限公司 Searching method, system, equipment and medium based on large model

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388023A (en) * 2008-09-12 2009-03-18 北京搜狗科技发展有限公司 Electronic map interest point data redundant detecting method and system
CN102625937A (en) * 2009-08-07 2012-08-01 谷歌公司 Architecture for responding to a visual query
CN102682091A (en) * 2012-04-25 2012-09-19 腾讯科技(深圳)有限公司 Cloud-service-based visual search method and cloud-service-based visual search system
CN103473327A (en) * 2013-09-13 2013-12-25 广东图图搜网络科技有限公司 Image retrieval method and image retrieval system
CN103493069A (en) * 2010-12-01 2014-01-01 谷歌公司 Identifying matching canonical documents in response to a visual query
CN104572905A (en) * 2014-12-26 2015-04-29 小米科技有限责任公司 Photo index creation method, photo searching method and devices

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101593278B (en) * 2008-05-27 2013-01-16 佳能株式会社 Method and system for distinguishing language of document image

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101388023A (en) * 2008-09-12 2009-03-18 北京搜狗科技发展有限公司 Electronic map interest point data redundant detecting method and system
CN102625937A (en) * 2009-08-07 2012-08-01 谷歌公司 Architecture for responding to a visual query
CN103493069A (en) * 2010-12-01 2014-01-01 谷歌公司 Identifying matching canonical documents in response to a visual query
CN102682091A (en) * 2012-04-25 2012-09-19 腾讯科技(深圳)有限公司 Cloud-service-based visual search method and cloud-service-based visual search system
CN103473327A (en) * 2013-09-13 2013-12-25 广东图图搜网络科技有限公司 Image retrieval method and image retrieval system
CN104572905A (en) * 2014-12-26 2015-04-29 小米科技有限责任公司 Photo index creation method, photo searching method and devices

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116737982A (en) * 2023-08-11 2023-09-12 拓锐科技有限公司 Intelligent screening management system for picture search results based on data analysis
CN116737982B (en) * 2023-08-11 2023-10-31 拓锐科技有限公司 Intelligent screening management system for picture search results based on data analysis

Also Published As

Publication number Publication date
CN105518678A (en) 2016-04-20
CN105518678B (en) 2018-07-31

Similar Documents

Publication Publication Date Title
WO2017000109A1 (en) Search method, search apparatus, user equipment, and computer program product
US10706094B2 (en) System and method for customizing a display of a user device based on multimedia content element signatures
US9665596B2 (en) Data access based on content of image recorded by a mobile device
CN109284729B (en) Method, device and medium for acquiring face recognition model training data based on video
EP3028184B1 (en) Method and system for searching images
US9875303B2 (en) System and process for building a catalog using visual objects
US11461386B2 (en) Visual recognition using user tap locations
US10380461B1 (en) Object recognition
WO2015051629A1 (en) Picture ordering method and device
TWI781554B (en) Method of determining item name of object, device, computer equipment and storage medium
US10380267B2 (en) System and method for tagging multimedia content elements
WO2012158808A1 (en) Gesture-based visual search
CN110020093A (en) Video retrieval method, edge device, video frequency searching device and storage medium
CN111291765A (en) Method and device for determining similar pictures
JP2019520662A (en) Content-based search and retrieval of trademark images
JP6310599B1 (en) SEARCH DEVICE, SEARCH METHOD, AND SEARCH PROGRAM
KR20220039578A (en) Method for providing clothing recommendation information based on user-selected clothing, and server and program using the same
CN109189960A (en) Information acquisition method, information acquisition device and computer storage medium
US20150052155A1 (en) Method and system for ranking multimedia content elements
KR20150101846A (en) Image classification service system based on a sketch user equipment, service equipment, service method based on sketch and computer readable medium having computer program recorded therefor
US20200074218A1 (en) Information processing system, information processing apparatus, and non-transitory computer readable medium
CN109766052B (en) Dish picture uploading method and device, computer equipment and readable storage medium
WO2016155537A1 (en) Method and device for ranking search results of picture objects
KR20200024540A (en) Providing Method of summary information for an image searching and service device thereof
WO2022267160A1 (en) Product recommendation method and apparatus, electronic device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15896640

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC , EPO FORM 1205A DATED 12.04.18.

122 Ep: pct application non-entry in european phase

Ref document number: 15896640

Country of ref document: EP

Kind code of ref document: A1