WO2016155627A1 - Method and apparatus for recognizing characters in picture - Google Patents

Method and apparatus for recognizing characters in picture Download PDF

Info

Publication number
WO2016155627A1
WO2016155627A1 PCT/CN2016/077865 CN2016077865W WO2016155627A1 WO 2016155627 A1 WO2016155627 A1 WO 2016155627A1 CN 2016077865 W CN2016077865 W CN 2016077865W WO 2016155627 A1 WO2016155627 A1 WO 2016155627A1
Authority
WO
WIPO (PCT)
Prior art keywords
character
picture
candidate
candidate character
probability
Prior art date
Application number
PCT/CN2016/077865
Other languages
French (fr)
Chinese (zh)
Inventor
薛永刚
贾文杰
项碧波
Original Assignee
北京奇虎科技有限公司
奇智软件(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京奇虎科技有限公司, 奇智软件(北京)有限公司 filed Critical 北京奇虎科技有限公司
Publication of WO2016155627A1 publication Critical patent/WO2016155627A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition

Definitions

  • the present invention relates to the field of computer technologies, and in particular, to a method and apparatus for recognizing characters in a picture.
  • the present invention has been made in order to provide a method and apparatus for recognizing characters in a picture that overcomes the above problems or at least partially solves or alleviates the above problems.
  • a method of identifying a character in a picture comprising:
  • Identifying a character placeholder included in the picture obtaining a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character;
  • one candidate character is selected from each candidate character set corresponding to each character placeholder to obtain a candidate character combination; multiple selections are performed, and different character combinations are selected each time. , obtaining multiple candidate character combinations;
  • the candidate characters with the highest probability are combined as the result of character recognition for the picture.
  • an apparatus for recognizing characters in a picture includes:
  • An obtaining unit configured to identify a character placeholder included in the picture, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character;
  • the pre-processing unit is adapted to select one candidate character from the candidate character set corresponding to each character placeholder in turn according to the order of the characters occupied by the picture to obtain a candidate character combination; Select different character combinations to get multiple candidate character combinations;
  • the identifying unit is adapted to calculate the probability of each of the obtained candidate character combinations, and combine the candidate characters with the highest probability as the character recognition result for the picture.
  • a computer program comprising computer readable code, when the computer readable code is run on a terminal device, causes the terminal device to perform the identification picture of any of the above The method of characters in .
  • a computer readable medium storing a computer program as described above is provided.
  • the probability of the candidate character set corresponding to the character placeholder included in the picture and the probability parameter corresponding to each candidate character is calculated, and the probability of all candidate character combinations that may be included in the picture is calculated, and the probability is the highest.
  • the candidate character combination is used as a technical solution for the character recognition result of the picture, and the image information is converted into text information that can be recognized and processed by the computer, which greatly improves the efficiency of the user in storing, retrieving and processing the character information in the picture.
  • FIG. 1 shows a flow chart of a method of identifying characters in a picture, in accordance with one embodiment of the present invention
  • FIG. 2 illustrates a method of identifying search keywords in accordance with one embodiment of the present invention.
  • FIG. 3 shows a schematic diagram of an apparatus for identifying characters in a picture, in accordance with one embodiment of the present invention
  • FIG. 4 shows a schematic diagram of an apparatus for identifying search keywords in accordance with one embodiment of the present invention
  • FIG. 5A shows a schematic diagram of a picture for character recognition according to an embodiment of the present invention
  • FIG. 5B shows a schematic diagram of a first picture according to another embodiment of the present invention.
  • FIG. 5C is a schematic diagram showing a second picture according to another embodiment of the present invention.
  • Figure 6 shows schematically a block diagram of a terminal device for carrying out the method according to the invention
  • Fig. 7 schematically shows a storage unit for holding or carrying program code implementing the method according to the invention.
  • FIG. 1 shows a flow chart of a method of identifying characters in a picture, in accordance with one embodiment of the present invention. As shown in Figure 1, the method includes:
  • Step S110 identifying a character placeholder included in the picture, acquiring a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character.
  • Step S120 sequentially select one candidate character from the candidate character set corresponding to each character placeholder according to the order of each character occupying position of the picture to obtain a candidate character combination; perform multiple selections, and select different characters each time. Combine to get multiple candidate character combinations.
  • Step S130 calculating the probability of each of the obtained candidate character combinations.
  • step S140 the candidate characters with the highest probability are combined as the character recognition result for the picture.
  • the method shown in FIG. 1 obtains all candidate characters that may be included in the image by acquiring a candidate character set corresponding to the character placeholder included in the picture and a probability parameter corresponding to each candidate character. The combined probability is calculated, and the candidate character with the highest probability is combined as the character recognition result for the picture.
  • the program can convert the image information into text information that can be recognized and processed by the computer, which greatly improves the efficiency of the user in storing, retrieving and processing the character information in the picture.
  • step S110 of the method shown in FIG. 1 uses an optical character recognition (OCR) technology to identify a character occupying position included in a picture, and obtains a candidate character set corresponding to each character placeholder and The probability parameter corresponding to each candidate character.
  • OCR optical character recognition
  • the probability of each of the candidate character combinations obtained by step S130 of the method shown in FIG. 1 is: based on the n-gram model, the probability of each of the candidate character combinations obtained is calculated. Specifically, for each candidate character in a candidate character combination, a conditional probability that the candidate character appears under the condition that the first n-1 candidate characters have been determined is calculated according to the probability parameter of each candidate character; and the candidate character is calculated. The product of the conditional probabilities of the candidate characters in the combination is taken as the probability of the candidate character combination.
  • FIG. 5A is a schematic diagram of a picture for performing character recognition according to an embodiment of the present invention.
  • the character occupied by the picture is recognized, and each character place is acquired.
  • the corresponding candidate character set and the result of the probability parameter corresponding to each candidate character are as shown in Table 1:
  • Character placeholder a set of candidate characters corresponding to each character placeholder Probability parameter corresponding to each candidate character 1 Hand, season, care, year, abundance 35,49,51,53,57 2 Machine, ⁇ , ⁇ , ⁇ , maple 22,43,52,52,56 3 Tube, ⁇ , camp, camp, ⁇ 26,49,52,52,52 4 Home, ⁇ , elephant, ⁇ , frozen 23,61,81,82,83
  • one candidate character is selected from each candidate character set corresponding to each character placeholder to obtain a candidate character combination;
  • the probability of each candidate character combination is calculated based on the 4-ary grammar model, that is, for each candidate character in a candidate character combination, the probability parameter is calculated according to the probability parameter of each candidate character.
  • the probability of occurrence under the condition that its first three candidate characters have been determined is: P(hand
  • P (mobile phone housekeeper) P (hand
  • the calculation process of other candidate character combinations is the same, and will not be described again.
  • the candidate character with the highest probability is combined as the character recognition result for the picture.
  • the candidate character combination with the highest probability is “mobile phone housekeeper”, that is, the character recognition result of the picture shown in FIG. 5A.
  • FIG. 2 shows a flow chart of a method of identifying search keywords in accordance with one embodiment of the present invention. As shown in Figure 2, the method includes:
  • step S210 in response to the user's touch screen operation, the picture is intercepted according to the operation range to obtain the first picture; and the predetermined area is expanded according to the operation range to perform picture interception, and the second picture is obtained.
  • Step S220 respectively identifying characters in the first picture and the second picture to obtain corresponding character combinations.
  • Step S230 Select a character combination from the combination of characters corresponding to the first picture and the second picture as the search keyword after the recognition according to the preset policy.
  • the method shown in FIG. 2 intercepts two images with different range sizes in response to the user's touch screen operation, and selects two images from each of the two images by separately identifying and then processing the two images according to the preset strategy.
  • the recognition result of the letter is searched as a search keyword.
  • the solution has the following beneficial effects: Firstly, the recognition of the search keyword for the user's touch screen operation is realized, and the user does not need to manually input the search keyword, thereby simplifying the search operation process and meeting the user's needs; secondly, adopting Two ways to comprehensively identify related images, avoiding the missing information in a single picture or More than enough, the accuracy of character recognition in the picture is further improved, thereby improving the accuracy of identifying the search keyword.
  • step S220 of the method shown in FIG. 2 respectively identifies characters in the first picture and the second picture, and obtains corresponding character combinations as: by identifying the picture as described in any of the above embodiments.
  • the character method respectively identifies the characters in the first picture and the second picture to obtain a corresponding character combination.
  • step S220 of the method shown in FIG. 2 respectively identifies characters in the first picture and the second picture, and obtaining corresponding character combinations further includes: obtaining pixel coordinates of each character occupying position in the character combination. .
  • step S230 of the method shown in FIG. 2, according to a preset policy, selecting a character combination from the combination of characters corresponding to the first picture and the second picture as the search keyword includes:
  • step S231 in the character combination corresponding to the second picture, the character combination positions corresponding to the first picture are the same and the same length combination is used.
  • the second picture and the first picture are retained.
  • the corresponding character combination position is the same and the length is the same character combination.
  • Step S232 determining whether the average language model score of the character combination retained in the second picture is smaller than the average model score of the character combination corresponding to the first picture.
  • the average language model score of the character combination refers to the logarithm of the probability of the character combination, and the value obtained by averaging the number of characters in the character combination.
  • Step S233 yes, select a character combination corresponding to the first picture as a search keyword to perform a search; otherwise, select a character combination corresponding to the second picture as a search keyword to perform a search.
  • FIG. 5B is a schematic diagram showing a first picture according to another embodiment of the present invention
  • FIG. 5C is a schematic diagram showing a second picture according to another embodiment of the present invention, and the specific implementation shown in FIG. 5B and FIG. 5C
  • the image in response to the touch screen operation of the user, the image is intercepted according to the operation range, and the first picture as shown in FIG. 5B is obtained; and the predetermined area is expanded according to the operation range, and the picture is intercepted, and the second picture as shown in FIG. 5C is obtained.
  • Identifying the character placeholders included in the first picture obtaining a candidate character set corresponding to each character placeholder, a probability parameter corresponding to each candidate character, and a pixel coordinate of each character placeholder, and the result is shown in Table 2:
  • the probability of each candidate character combination is calculated based on the 4-ary grammar model, and the character combination with the highest probability of identifying the first picture is “Mobile Phone Easy”.
  • the specific recognition process has been described in detail in the foregoing, and will not be described again.
  • the character occupying position included in the second picture is identified, the candidate character set corresponding to each character placeholder, the probability parameter corresponding to each candidate character, and the pixel coordinate of each character placeholder are obtained, and the result is shown in Table 3. :
  • each candidate character combination is calculated based on the 4-ary grammar model, and the character combination with the highest probability of identifying the second picture is “t’ae. mobile phone housekeeper”.
  • the specific identification process has been described in detail in the foregoing, and will not be described again.
  • the "t'ae.” part of the character combination is located at a higher position in the second picture, and the "phone housekeeper” part is located at a lower position in the second picture.
  • the character combination corresponding to the first picture, "mobile phone tube” has the same character combination and the same length, according to "mobile phone management” and The pixel coordinate boundary of "t'ae. mobile phone housekeeper” and the pixel coordinates of each character placeholder, we can see that in the character combination "t'ae. mobile phone housekeeper” corresponding to the second picture, the “mobile phone housekeeper” part is The phone is easy to use in the same character group with the same length, so keep the "phone butler” character combination in the second picture.
  • the apparatus 300 for recognizing characters in a picture includes:
  • the obtaining unit 310 is adapted to identify a character placeholder included in the picture, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character.
  • the pre-processing unit 320 is adapted to select one candidate character from the candidate character set corresponding to each character placeholder in sequence according to the order of the characters occupied by the picture to obtain a candidate character combination; Select different combinations of characters to get multiple candidate character combinations.
  • the identifying unit 330 is adapted to calculate the obtained probability of each candidate character combination, and combine the candidate characters with the highest probability as the character recognition result for the picture.
  • the device shown in FIG. 3 acquires the candidate character set corresponding to the character placeholder included in the picture and the probability parameter corresponding to each candidate character through the mutual cooperation of the units, and the probability of all candidate character combinations that may be included in the picture.
  • the calculation is performed to combine the candidate characters with the highest probability as the character recognition result for the picture.
  • the program can convert image information into text information that can be recognized and processed by a computer, which greatly improves the efficiency of data storage, retrieval and processing by users.
  • the acquiring unit 310 of the apparatus shown in FIG. 3 is adapted to identify a character placeholder included in a picture by using an optical character recognition technology, obtain a candidate character set corresponding to each character placeholder, and each candidate The probability parameter corresponding to the character.
  • the identification unit 330 of the apparatus shown in FIG. 3 is adapted to be based on n A meta-grammar model that calculates the probability of each candidate combination of characters obtained.
  • the identifying unit 330 is adapted to calculate, for each candidate character in a candidate character combination, a condition that the candidate character appears under the condition that the first n-1 candidate characters have been determined according to the probability parameter of each candidate character. Probability; the product of the conditional probability of each candidate character in the candidate character combination is calculated as the probability of the candidate character combination.
  • the apparatus 400 for identifying a search keyword includes:
  • the image obtaining unit 410 is adapted to perform a picture capture according to the operation range in response to the user's touch screen operation to obtain a first picture; and further expand the predetermined area according to the operation range to perform picture interception to obtain a second picture.
  • the identification processing unit 420 is adapted to respectively identify the characters in the first picture and the second picture to obtain a corresponding character combination.
  • the search processing unit 430 is adapted to select a combination of characters from the combination of characters corresponding to the first picture and the second picture as the search keyword after the recognition according to the preset policy.
  • the device shown in FIG. 4 intercepts two images with different range sizes in response to the user's touch screen operation through the mutual cooperation of the units, and separately processes the two images according to the preset strategy, and then In the picture, select a more reliable recognition result as a search keyword to search.
  • the solution has the following beneficial effects: Firstly, the recognition of the search keyword for the user's touch screen operation is realized, and the user does not need to manually input the search keyword, thereby simplifying the search operation process and meeting the user's needs; secondly, adopting The method of comprehensively identifying two related pictures avoids the lack or surplus of information in a single picture, further improves the accuracy of character recognition in the picture, and further improves the accuracy of identifying the search keyword.
  • the identification processing unit 420 of the apparatus shown in FIG. 4 is adapted to respectively identify the first picture and the second picture by means of the apparatus 300 for recognizing characters in the picture as described in any of the above embodiments.
  • the characters in the box get the corresponding character combination.
  • identification processing unit 420 of the apparatus shown in FIG. 4 is further adapted to obtain pixel coordinates of each character occupying in the character combination.
  • the search processing unit 430 of the apparatus shown in FIG. 4 is adapted to In the character combination corresponding to the second picture, the character combination position corresponding to the first picture is the same and the length is the same character combination; determining whether the average language model score of the reserved character combination in the second picture is smaller than the character combination corresponding to the first picture The average model score; yes, the character combination corresponding to the first picture is selected as the search keyword for searching; otherwise, the character combination corresponding to the second picture is selected as the search keyword for searching.
  • the search processing unit 430 is adapted to retain, according to the pixel coordinate boundary of the character combination corresponding to the first picture and the second picture, and the pixel coordinates of each character placeholder, retain the second picture corresponding to the first picture.
  • the average language model score for the combination of characters refers to the logarithm of the probability of the combination of characters, the value obtained by averaging the number of characters in the combination of characters.
  • the technical solution provided by the present invention as a whole, on the one hand, obtains a candidate character set corresponding to a character placeholder included in a picture and a probability parameter corresponding to each candidate character, which may be included in the picture.
  • the probability of all candidate character combinations is calculated, and the candidate character with the highest probability is combined as the character recognition result for the picture.
  • two pictures with different range sizes are intercepted, and a more reliable recognition result is selected from the two pictures by separately identifying and then processing the two pictures according to the preset strategy. Search as a search keyword.
  • the scheme has the following beneficial effects: 1.
  • modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • the modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components.
  • any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined.
  • Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
  • the various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof.
  • a microprocessor or digital signal processor may be used in practice to implement some of some or all of the means for identifying characters in a picture or in accordance with an embodiment of the present invention. All features.
  • the invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
  • Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • Figure 6 shows a terminal device in which the method according to the invention can be implemented.
  • the and terminal devices conventionally include a processor 610 and a computer program product or computer readable medium in the form of a memory 620.
  • the memory 620 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM.
  • Memory 620 has a memory space 630 for program code 631 for performing any of the method steps described above.
  • storage space 630 for program code may include various program code 631 for implementing various steps in the above methods, respectively.
  • the program code can be read from or written to one or more computer program products.
  • Such computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG.
  • the storage unit may have a storage section, a storage space, and the like arranged similarly to the storage 620 in the terminal device of FIG.
  • the program code can be compressed, for example, in an appropriate form.
  • the storage unit comprises computer readable code 631', ie code that can be read by a processor, such as 610, which when executed by the terminal device causes the terminal device to perform each of the methods described above step.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

A method and apparatus for recognizing characters in a picture. The method comprises: recognizing character placeholders contained in a picture, and acquiring a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character (S110); sequentially selecting a candidate character from the candidate character set corresponding to each character placeholder to obtain a candidate character combination according to the sequence of the character placeholders contained in the picture; performing selection many times, selecting different character combinations each time, and obtaining a plurality of candidate character combinations (S120); calculating the probability of each obtained candidate character combination (S130), and using a candidate character combination having the highest probability as a character recognition result of the picture (S140). By means of the technical scheme, image information is completely and accurately converted into text information capable of being recognized and processed by a computer, character information in the picture is automatically extracted and does not need to be input manually by a user, and user needs are met.

Description

一种识别图片中的字符的方法和装置Method and device for recognizing characters in pictures 技术领域Technical field
本发明涉及计算机技术领域,具体涉及一种识别图片中的字符的方法和装置。The present invention relates to the field of computer technologies, and in particular, to a method and apparatus for recognizing characters in a picture.
背景技术Background technique
在信息技术及终端技术日益普及的今天,如何将字符方便、快捷地输入到终端中已经成为影响人机接口效率的一个重要难题。现有技术中,大部分用户仍然依赖于传统的键盘输入或手写输入来完成字符的输入,能够满足用户基本输入需求。但是,传统的输入方式也为用户带来了很多不便,例如,当用户对一个图片中所包含的字符有疑问,需要进行搜索时,需要将图片中所包含的字符依次手动输入到搜索栏中;或者,当用户需要保存一个图片中的电话号码时,也需要将图片中所包含的电话号码记在其他地方,再依次手动输入到电话薄中。可见,由于终端无法识别图片中所包含的字符,导致用户对图片中所包含的字符的处理十分费时费力,不符合用户的需求。Nowadays, with the increasing popularity of information technology and terminal technology, how to input characters into the terminal conveniently and quickly has become an important problem affecting the efficiency of human-machine interface. In the prior art, most users still rely on traditional keyboard input or handwriting input to complete character input, which can meet the basic input requirements of the user. However, the traditional input method also brings a lot of inconvenience to the user. For example, when the user has a question about the characters contained in an image and needs to search, the characters contained in the image need to be manually input into the search bar. Or, when the user needs to save the phone number in an image, the phone number contained in the picture needs to be recorded elsewhere, and then manually entered into the phone book. It can be seen that because the terminal cannot recognize the characters contained in the picture, the user's processing of the characters contained in the picture is time consuming and laborious, and does not meet the needs of the user.
发明内容Summary of the invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决或者减缓上述问题的一种识别图片中的字符的方法和装置。In view of the above problems, the present invention has been made in order to provide a method and apparatus for recognizing characters in a picture that overcomes the above problems or at least partially solves or alleviates the above problems.
依据本发明的一个方面,提供了一种识别图片中的字符的方法,该方法包括:According to an aspect of the present invention, a method of identifying a character in a picture is provided, the method comprising:
识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数;Identifying a character placeholder included in the picture, obtaining a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character;
按照所述图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合;According to the order of the characters occupied by the picture, one candidate character is selected from each candidate character set corresponding to each character placeholder to obtain a candidate character combination; multiple selections are performed, and different character combinations are selected each time. , obtaining multiple candidate character combinations;
计算所得到的每个候选字符组合的概率,Calculate the probability of each candidate combination of characters obtained,
将概率最高的候选字符组合作为对所述图片的字符识别结果。The candidate characters with the highest probability are combined as the result of character recognition for the picture.
依据本发明的又一个方面,提供了一种识别图片中的字符的装置,该装 置包括:According to still another aspect of the present invention, an apparatus for recognizing characters in a picture is provided Set includes:
获取单元,适于识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数;An obtaining unit, configured to identify a character placeholder included in the picture, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character;
预处理单元,适于按照所述图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合;The pre-processing unit is adapted to select one candidate character from the candidate character set corresponding to each character placeholder in turn according to the order of the characters occupied by the picture to obtain a candidate character combination; Select different character combinations to get multiple candidate character combinations;
识别单元,适于计算所得到的每个候选字符组合的概率,将概率最高的候选字符组合作为对所述图片的字符识别结果。The identifying unit is adapted to calculate the probability of each of the obtained candidate character combinations, and combine the candidate characters with the highest probability as the character recognition result for the picture.
根据本发明的又一个方面,提供了一种计算机程序,包括计算机可读代码,当所述计算机可读代码在终端设备上运行时,导致所述终端设备执行如上任一项所述的识别图片中的字符的方法。According to still another aspect of the present invention, a computer program comprising computer readable code, when the computer readable code is run on a terminal device, causes the terminal device to perform the identification picture of any of the above The method of characters in .
根据本发明的再一个方面,提供了一种计算机可读介质,其中存储了如上所述的计算机程序。According to still another aspect of the present invention, a computer readable medium storing a computer program as described above is provided.
由上述可知,本发明这种通过获取图片所包含的字符占位对应的候选字符集合以及每个候选字符对应的概率参数,对图片中可能包含的所有候选字符组合的概率进行计算,将概率最高的候选字符组合作为对图片的字符识别结果的技术方案,将图像信息转化为可以供计算机识别和处理的文本信息,大大提高了用户对图片中的字符信息进行资料存储、检索、加工的效率。It can be seen from the above that the probability of the candidate character set corresponding to the character placeholder included in the picture and the probability parameter corresponding to each candidate character is calculated, and the probability of all candidate character combinations that may be included in the picture is calculated, and the probability is the highest. The candidate character combination is used as a technical solution for the character recognition result of the picture, and the image information is converted into text information that can be recognized and processed by the computer, which greatly improves the efficiency of the user in storing, retrieving and processing the character information in the picture.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solutions of the present invention, and the above-described and other objects, features and advantages of the present invention can be more clearly understood. Specific embodiments of the invention are set forth below.
附图说明DRAWINGS
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not to be construed as limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:
图1示出了根据本发明一个实施例的一种识别图片中的字符的方法的流程图;1 shows a flow chart of a method of identifying characters in a picture, in accordance with one embodiment of the present invention;
图2示出了根据本发明一个实施例的一种识别搜索关键词的方法 的流程图;2 illustrates a method of identifying search keywords in accordance with one embodiment of the present invention. Flow chart
图3示出了根据本发明一个实施例的一种识别图片中的字符的装置的示意图;3 shows a schematic diagram of an apparatus for identifying characters in a picture, in accordance with one embodiment of the present invention;
图4示出了根据本发明一个实施例的一种识别搜索关键词的装置的示意图;4 shows a schematic diagram of an apparatus for identifying search keywords in accordance with one embodiment of the present invention;
图5A示出了根据本发明一个实施例的用于进行字符识别的图片的示意图;FIG. 5A shows a schematic diagram of a picture for character recognition according to an embodiment of the present invention; FIG.
图5B示出了根据本发明另一个实施例的第一图片的示意图;FIG. 5B shows a schematic diagram of a first picture according to another embodiment of the present invention; FIG.
图5C示出了根据本发明另一个实施例的第二图片的示意图;FIG. 5C is a schematic diagram showing a second picture according to another embodiment of the present invention; FIG.
图6示意性地示出了用于执行根据本发明的方法的终端设备的框图;以及Figure 6 shows schematically a block diagram of a terminal device for carrying out the method according to the invention;
图7示意性地示出了用于保持或者携带实现根据本发明的方法的程序代码的存储单元。Fig. 7 schematically shows a storage unit for holding or carrying program code implementing the method according to the invention.
具体实施例Specific embodiment
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While the embodiments of the present invention have been shown in the drawings, the embodiments Rather, these embodiments are provided so that this disclosure will be more fully understood and the scope of the disclosure will be fully disclosed.
图1示出了根据本发明一个实施例的一种识别图片中的字符的方法的流程图。如图1所示,该方法包括:1 shows a flow chart of a method of identifying characters in a picture, in accordance with one embodiment of the present invention. As shown in Figure 1, the method includes:
步骤S110,识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数。Step S110, identifying a character placeholder included in the picture, acquiring a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character.
步骤S120,按照图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合。Step S120: sequentially select one candidate character from the candidate character set corresponding to each character placeholder according to the order of each character occupying position of the picture to obtain a candidate character combination; perform multiple selections, and select different characters each time. Combine to get multiple candidate character combinations.
步骤S130,计算所得到的每个候选字符组合的概率。Step S130, calculating the probability of each of the obtained candidate character combinations.
步骤S140,将概率最高的候选字符组合作为对图片的字符识别结果。In step S140, the candidate characters with the highest probability are combined as the character recognition result for the picture.
可见,图1所示的方法通过获取图片所包含的字符占位对应的候选字符集合以及每个候选字符对应的概率参数,对图片中可能包含的所有候选字符 组合的概率进行计算,将概率最高的候选字符组合作为对图片的字符识别结果。本方案可以将图像信息转化为可以供计算机识别和处理的文本信息,大大提高了用户对图片中的字符信息进行资料存储、检索、加工的效率。It can be seen that the method shown in FIG. 1 obtains all candidate characters that may be included in the image by acquiring a candidate character set corresponding to the character placeholder included in the picture and a probability parameter corresponding to each candidate character. The combined probability is calculated, and the candidate character with the highest probability is combined as the character recognition result for the picture. The program can convert the image information into text information that can be recognized and processed by the computer, which greatly improves the efficiency of the user in storing, retrieving and processing the character information in the picture.
在本发明的一个实施例中,图1所示方法的步骤S110利用光学字符识别(Optical Character Recognition,OCR)技术识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数。In an embodiment of the present invention, step S110 of the method shown in FIG. 1 uses an optical character recognition (OCR) technology to identify a character occupying position included in a picture, and obtains a candidate character set corresponding to each character placeholder and The probability parameter corresponding to each candidate character.
在本发明的一个实施例中,图1所示方法的步骤S130计算所得到的每个候选字符组合的概率为:基于n元语法模型,计算所得到的每个候选字符组合的概率。具体为,对于一个候选字符组合中的每个候选字符,根据各候选字符的概率参数,计算该候选字符在其前n-1个候选字符已经确定的条件下出现的条件概率;计算该候选字符组合中各候选字符的条件概率的乘积,作为该候选字符组合的概率。In one embodiment of the present invention, the probability of each of the candidate character combinations obtained by step S130 of the method shown in FIG. 1 is: based on the n-gram model, the probability of each of the candidate character combinations obtained is calculated. Specifically, for each candidate character in a candidate character combination, a conditional probability that the candidate character appears under the condition that the first n-1 candidate characters have been determined is calculated according to the probability parameter of each candidate character; and the candidate character is calculated. The product of the conditional probabilities of the candidate characters in the combination is taken as the probability of the candidate character combination.
图5A示出了根据本发明一个实施例的用于进行字符识别的图片的示意图,在图5A所示的具体实施例中,首先,识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数的结果如表1所示:FIG. 5A is a schematic diagram of a picture for performing character recognition according to an embodiment of the present invention. In the specific embodiment shown in FIG. 5A, first, the character occupied by the picture is recognized, and each character place is acquired. The corresponding candidate character set and the result of the probability parameter corresponding to each candidate character are as shown in Table 1:
表1Table 1
字符占位Character placeholder 每个字符占位对应的候选字符集合a set of candidate characters corresponding to each character placeholder 每个候选字符对应的概率参数Probability parameter corresponding to each candidate character
11 手,季,乎,年,丰Hand, season, care, year, abundance 35,49,51,53,5735,49,51,53,57
22 机,杌,枧,视,枫Machine, 杌, 枧, 视, maple 22,43,52,52,5622,43,52,52,56
33 管,菅,营,營,眚Tube, 菅, camp, camp, 眚 26,49,52,52,5226,49,52,52,52
44 家,冢,象,彖,冻Home, 冢, elephant, 彖, frozen 23,61,81,82,8323,61,81,82,83
如表1所示,图片中共有4个字符占位,每个字符占位对应的候选字符集合中包括5个候选字符。As shown in Table 1, there are 4 characters in the picture, and 5 candidate characters are included in the candidate character set corresponding to each character placeholder.
接着,按照图5A所示图片中字符占位从左到右的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,能够得到54=625个候选字符组合。Then, according to the order of the character occupations in the picture shown in FIG. 5A from left to right, one candidate character is selected from each candidate character set corresponding to each character placeholder to obtain a candidate character combination; To a different combination of characters, a combination of 5 4 = 625 candidate characters can be obtained.
本实施例中,基于4元语法模型计算每个候选字符组合的概率,即对于一个候选字符组合中的每个候选字符,根据各候选字符的概率参数,计算该 候选字符在其前3个候选字符已经确定的条件下出现的条件概率;计算该候选字符组合中各候选字符的条件概率的乘积,作为该候选字符组合的概率。In this embodiment, the probability of each candidate character combination is calculated based on the 4-ary grammar model, that is, for each candidate character in a candidate character combination, the probability parameter is calculated according to the probability parameter of each candidate character. The conditional probability that the candidate character appears under the condition that the first three candidate characters have been determined; the product of the conditional probability of each candidate character in the candidate character combination is calculated as the probability of the candidate character combination.
具体地,以一个候选字符组合“手机管家”为例,为了表示图片中字符组合的完整性,要考虑字符组合的开始和结束,先将该字符组合处理为“O手机管家$”,“O”表示字符组合开始,“$”表示字符组合结束。对于“手机管家”中的“手”字,在其前3个候选字符已经确定的条件下出现的概率是:P(手|OOO);对于“机”字,在其前3个候选字符已经确定的条件下出现的概率是:P(机|OO手);对于“管”字,在其前3个候选字符已经确定的条件下出现的概率是:P(管|O手机);对于“家”字,在其前3个候选字符已经确定的条件下出现的概率是:P(家|手机管);对于“$”,在其前3个候选字符已经确定的条件下出现的概率是:P($|机管家)。Specifically, taking a candidate character combination "mobile phone housekeeper" as an example, in order to indicate the integrity of the character combination in the picture, the start and end of the character combination should be considered, and the character combination is first processed as "O mobile phone housekeeper $", "O "Expresses the beginning of the character combination, and "$" indicates the end of the character combination. For the "hand" word in "mobile phone housekeeper", the probability of occurrence under the condition that its first three candidate characters have been determined is: P(hand|OOO); for the "machine" word, the first three candidate characters have already The probability of occurrence under certain conditions is: P (machine | OO hand); for the word "pipe", the probability of occurrence under the condition that the first three candidate characters have been determined is: P (tube | O handset); The probability that the word "home" appears under the condition that its first three candidate characters have been determined is: P (home | mobile phone tube); for "$", the probability that it appears under the condition that the first three candidate characters have been determined is :P($|machine housekeeper).
因此,“手机管家”的概率为:P(手机管家)=P(手|OOO)×P(机|OO手)×P(管|O手机)×P(家|手机管)×P($|机管家)。其他候选字符组合的计算过程同理,不再赘述。将概率最高的候选字符组合作为对图片的字符识别结果,在本实施例中,概率最高的候选字符组合为“手机管家”,即为图5A所示图片的字符识别结果。Therefore, the probability of "mobile phone housekeeper" is: P (mobile phone housekeeper) = P (hand | OOO) × P (machine | OO hand) × P (tube | O mobile phone) × P (home | mobile phone tube) × P ($ | Machine Manager). The calculation process of other candidate character combinations is the same, and will not be described again. The candidate character with the highest probability is combined as the character recognition result for the picture. In this embodiment, the candidate character combination with the highest probability is “mobile phone housekeeper”, that is, the character recognition result of the picture shown in FIG. 5A.
图2示出了根据本发明一个实施例的一种识别搜索关键词的方法的流程图。如图2所示,该方法包括:2 shows a flow chart of a method of identifying search keywords in accordance with one embodiment of the present invention. As shown in Figure 2, the method includes:
步骤S210,响应于用户触屏操作,按照操作范围进行图片截取,获得第一图片;再按照操作范围扩大预定面积进行图片截取,获得第二图片。In step S210, in response to the user's touch screen operation, the picture is intercepted according to the operation range to obtain the first picture; and the predetermined area is expanded according to the operation range to perform picture interception, and the second picture is obtained.
步骤S220,分别识别第一图片和第二图片中的字符,得到相应的字符组合。Step S220, respectively identifying characters in the first picture and the second picture to obtain corresponding character combinations.
步骤S230,根据预设策略,从第一图片和第二图片对应的字符组合中选择一个字符组合作为识别后的搜索关键词。Step S230: Select a character combination from the combination of characters corresponding to the first picture and the second picture as the search keyword after the recognition according to the preset policy.
可见,图2所示的方法响应于用户触屏操作截取了两张范围大小不同的图片,通过先分别识别再根据预设策略综合处理两张图片的方式,从两张图片中选择出更可信的识别结果作为搜索关键词进行搜索。与现有技术相比,本方案具有以下有益效果:首先,实现了对用户触屏操作的搜索关键词的识别,无需用户手动输入搜索关键词,简化搜索操作流程,符合用户需求;其次,采用了两张相关图片综合识别的方式,避免了单张图片中信息的缺失或 赘余,进一步提高了对图片中字符识别的准确率,进而提高了识别搜索关键词的准确率。It can be seen that the method shown in FIG. 2 intercepts two images with different range sizes in response to the user's touch screen operation, and selects two images from each of the two images by separately identifying and then processing the two images according to the preset strategy. The recognition result of the letter is searched as a search keyword. Compared with the prior art, the solution has the following beneficial effects: Firstly, the recognition of the search keyword for the user's touch screen operation is realized, and the user does not need to manually input the search keyword, thereby simplifying the search operation process and meeting the user's needs; secondly, adopting Two ways to comprehensively identify related images, avoiding the missing information in a single picture or More than enough, the accuracy of character recognition in the picture is further improved, thereby improving the accuracy of identifying the search keyword.
在本发明的一个实施例中,图2所示方法的步骤S220分别识别第一图片和第二图片中的字符,得到相应的字符组合为:通过如上文中任一实施例所述的识别图片中的字符的方法,分别识别第一图片和第二图片中的字符,得到相应的字符组合。In an embodiment of the present invention, step S220 of the method shown in FIG. 2 respectively identifies characters in the first picture and the second picture, and obtains corresponding character combinations as: by identifying the picture as described in any of the above embodiments. The character method respectively identifies the characters in the first picture and the second picture to obtain a corresponding character combination.
在本发明的另一个实施例中,图2所示方法的步骤S220分别识别第一图片和第二图片中的字符,得到相应的字符组合进一步包括:得到字符组合中各字符占位的像素坐标。In another embodiment of the present invention, step S220 of the method shown in FIG. 2 respectively identifies characters in the first picture and the second picture, and obtaining corresponding character combinations further includes: obtaining pixel coordinates of each character occupying position in the character combination. .
在本发明的一个实施例中,图2所示方法的步骤S230根据预设策略,从第一图片和第二图片对应的字符组合中选择一个字符组合作为搜索关键词包括:In an embodiment of the present invention, step S230 of the method shown in FIG. 2, according to a preset policy, selecting a character combination from the combination of characters corresponding to the first picture and the second picture as the search keyword includes:
步骤S231,在第二图片对应的字符组合中,保留与第一图片对应的字符组合位置相同且长度相同字符组合。In step S231, in the character combination corresponding to the second picture, the character combination positions corresponding to the first picture are the same and the same length combination is used.
本步骤中,依据一个具体的实施例,具体为:根据第一图片和第二图片分别对应的字符组合的像素坐标边界和每个字符占位的像素坐标,保留第二图片中与第一图片对应的字符组合位置相同且长度相同字符组合。In this step, according to a specific embodiment, specifically, according to the pixel coordinate boundary of the character combination corresponding to the first picture and the second picture, and the pixel coordinates of each character placeholder, the second picture and the first picture are retained. The corresponding character combination position is the same and the length is the same character combination.
步骤S232,判断第二图片中保留的字符组合的平均语言模型分数是否小于第一图片对应的字符组合的平均模型分数。Step S232, determining whether the average language model score of the character combination retained in the second picture is smaller than the average model score of the character combination corresponding to the first picture.
本步骤中,字符组合的平均语言模型分数是指:字符组合的概率的对数值,按照字符组合中的字符数取平均后得到的值。In this step, the average language model score of the character combination refers to the logarithm of the probability of the character combination, and the value obtained by averaging the number of characters in the character combination.
步骤S233,是则,选择第一图片对应的字符组合作为搜索关键词进行搜索;否则,选择第二图片对应的字符组合作为搜索关键词进行搜索。Step S233, yes, select a character combination corresponding to the first picture as a search keyword to perform a search; otherwise, select a character combination corresponding to the second picture as a search keyword to perform a search.
图5B示出了根据本发明另一个实施例的第一图片的示意图;图5C示出了根据本发明另一个实施例的第二图片的示意图,在图5B和图5C所示的具体的实施例中,响应于用户的触屏操作,按照操作范围进行图片截取,获得如图5B所示的第一图片;再按照操作范围扩大预定面积进行图片截取,获得如图5C所示的第二图片。识别第一图片所包含的字符占位,获取每个字符占位对应的候选字符集合、每个候选字符对应的概率参数和每个字符占位的像素坐标,结果如表2所示: 5B is a schematic diagram showing a first picture according to another embodiment of the present invention; FIG. 5C is a schematic diagram showing a second picture according to another embodiment of the present invention, and the specific implementation shown in FIG. 5B and FIG. 5C In the example, in response to the touch screen operation of the user, the image is intercepted according to the operation range, and the first picture as shown in FIG. 5B is obtained; and the predetermined area is expanded according to the operation range, and the picture is intercepted, and the second picture as shown in FIG. 5C is obtained. . Identifying the character placeholders included in the first picture, obtaining a candidate character set corresponding to each character placeholder, a probability parameter corresponding to each candidate character, and a pixel coordinate of each character placeholder, and the result is shown in Table 2:
表2Table 2
Figure PCTCN2016077865-appb-000001
Figure PCTCN2016077865-appb-000001
基于4元语法模型计算每个候选字符组合的概率,识别第一图片的概率最高的字符组合为“手机管易”,具体的识别过程在前文中已详细描述,不再赘述。The probability of each candidate character combination is calculated based on the 4-ary grammar model, and the character combination with the highest probability of identifying the first picture is “Mobile Phone Easy”. The specific recognition process has been described in detail in the foregoing, and will not be described again.
同理,识别第二图片所包含的字符占位,获取每个字符占位对应的候选字符集合、每个候选字符对应的概率参数和每个字符占位的像素坐标,结果如表3所示:Similarly, the character occupying position included in the second picture is identified, the candidate character set corresponding to each character placeholder, the probability parameter corresponding to each candidate character, and the pixel coordinate of each character placeholder are obtained, and the result is shown in Table 3. :
表3table 3
Figure PCTCN2016077865-appb-000002
Figure PCTCN2016077865-appb-000002
基于4元语法模型计算每个候选字符组合的概率,识别第二图片的概率最高的字符组合为“t’ae.手机管家”,具体的识别过程在前文中已详细描述,不再赘述。根据各字符占位对应的像素坐标可知,该字符组合中“t’ae.”部分位于第二图片中较高的位置,“手机管家”部分位于第二图片中较低的位置。 The probability of each candidate character combination is calculated based on the 4-ary grammar model, and the character combination with the highest probability of identifying the second picture is “t’ae. mobile phone housekeeper”. The specific identification process has been described in detail in the foregoing, and will not be described again. According to the pixel coordinates corresponding to each character placeholder, the "t'ae." part of the character combination is located at a higher position in the second picture, and the "phone housekeeper" part is located at a lower position in the second picture.
接着,在第二图片对应的字符组合“t’ae.手机管家”中,保留与第一图片对应的字符组合“手机管易”位置相同且长度相同的字符组合,根据“手机管易”和“t’ae.手机管家”的像素坐标边界和每个字符占位的像素坐标,可知,在第二图片对应的字符组合“t’ae.手机管家”中,“手机管家”部分是与“手机管易”位置相同且长度相同的字符组合,因此,保留第二图片中“手机管家”字符组合。Then, in the character combination "t'ae. mobile phone housekeeper" corresponding to the second picture, the character combination corresponding to the first picture, "mobile phone tube", has the same character combination and the same length, according to "mobile phone management" and The pixel coordinate boundary of "t'ae. mobile phone housekeeper" and the pixel coordinates of each character placeholder, we can see that in the character combination "t'ae. mobile phone housekeeper" corresponding to the second picture, the "mobile phone housekeeper" part is The phone is easy to use in the same character group with the same length, so keep the "phone butler" character combination in the second picture.
分别计算“手机管家”与“手机管易”的概率的对数值,按照字符组合中的字符数取平均后得到的值,得到ln[P(手机管家)]/4>ln[P(手机管家)]/4,因此,选择第二图片对应的字符组合“手机管家”作为识别出的搜索关键词进行搜索。Calculate the logarithm of the probability of "mobile phone housekeeper" and "mobile phone management" separately, and obtain the value obtained by averaging according to the number of characters in the character combination, and get ln[P(Mobile Manager)]/4>ln[P(Mobile Manager) )]/4, therefore, the character combination "mobile phone housekeeper" corresponding to the second picture is selected as the search keyword to be searched.
图3示出了根据本发明一个实施例的一种识别图片中的字符的装置的示意图。如图3所示,该识别图片中的字符的装置300包括:3 shows a schematic diagram of an apparatus for identifying characters in a picture, in accordance with one embodiment of the present invention. As shown in FIG. 3, the apparatus 300 for recognizing characters in a picture includes:
获取单元310,适于识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数。The obtaining unit 310 is adapted to identify a character placeholder included in the picture, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character.
预处理单元320,适于按照所述图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合。The pre-processing unit 320 is adapted to select one candidate character from the candidate character set corresponding to each character placeholder in sequence according to the order of the characters occupied by the picture to obtain a candidate character combination; Select different combinations of characters to get multiple candidate character combinations.
识别单元330,适于计算所得到的每个候选字符组合的概率,将概率最高的候选字符组合作为对所述图片的字符识别结果。The identifying unit 330 is adapted to calculate the obtained probability of each candidate character combination, and combine the candidate characters with the highest probability as the character recognition result for the picture.
可见,图3所示的装置通过各单元的相互配合,获取图片所包含的字符占位对应的候选字符集合以及每个候选字符对应的概率参数,对图片中可能包含的所有候选字符组合的概率进行计算,将概率最高的候选字符组合作为对图片的字符识别结果。本方案可以将图像信息转化为可以供计算机识别和处理的文本信息,大大提高了用户进行资料存储、检索、加工的效率。It can be seen that the device shown in FIG. 3 acquires the candidate character set corresponding to the character placeholder included in the picture and the probability parameter corresponding to each candidate character through the mutual cooperation of the units, and the probability of all candidate character combinations that may be included in the picture. The calculation is performed to combine the candidate characters with the highest probability as the character recognition result for the picture. The program can convert image information into text information that can be recognized and processed by a computer, which greatly improves the efficiency of data storage, retrieval and processing by users.
在本发明的一个实施例中,图3所示装置的获取单元310,适于利用光学字符识别技术识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数。In an embodiment of the present invention, the acquiring unit 310 of the apparatus shown in FIG. 3 is adapted to identify a character placeholder included in a picture by using an optical character recognition technology, obtain a candidate character set corresponding to each character placeholder, and each candidate The probability parameter corresponding to the character.
在本发明的一个实施例中,图3所示装置的识别单元330,适于基于n 元语法模型,计算所得到的每个候选字符组合的概率。具体为,识别单元330,适于对于一个候选字符组合中的每个候选字符,根据各候选字符的概率参数,计算该候选字符在其前n-1个候选字符已经确定的条件下出现的条件概率;计算该候选字符组合中各候选字符的条件概率的乘积,作为该候选字符组合的概率。In an embodiment of the present invention, the identification unit 330 of the apparatus shown in FIG. 3 is adapted to be based on n A meta-grammar model that calculates the probability of each candidate combination of characters obtained. Specifically, the identifying unit 330 is adapted to calculate, for each candidate character in a candidate character combination, a condition that the candidate character appears under the condition that the first n-1 candidate characters have been determined according to the probability parameter of each candidate character. Probability; the product of the conditional probability of each candidate character in the candidate character combination is calculated as the probability of the candidate character combination.
具体的实施例如图5A所在的实施例,上文中已详细说明,在此不再赘述。The specific implementation is, for example, the embodiment in which FIG. 5A is located, which has been described in detail above, and is not described herein again.
图4示出了根据本发明一个实施例的一种识别搜索关键词的装置的示意图。如图4所示,该识别搜索关键词的装置400包括:4 shows a schematic diagram of an apparatus for identifying search keywords in accordance with one embodiment of the present invention. As shown in FIG. 4, the apparatus 400 for identifying a search keyword includes:
图片获取单元410,适于响应于用户触屏操作,按照所述操作范围进行图片截取,获得第一图片;再按照所述操作范围扩大预定面积进行图片截取,获得第二图片。The image obtaining unit 410 is adapted to perform a picture capture according to the operation range in response to the user's touch screen operation to obtain a first picture; and further expand the predetermined area according to the operation range to perform picture interception to obtain a second picture.
识别处理单元420,适于分别识别第一图片和第二图片中的字符,得到相应的字符组合。The identification processing unit 420 is adapted to respectively identify the characters in the first picture and the second picture to obtain a corresponding character combination.
搜索处理单元430,适于根据预设策略,从第一图片和第二图片对应的字符组合中选择一个字符组合作为识别后的搜索关键词。The search processing unit 430 is adapted to select a combination of characters from the combination of characters corresponding to the first picture and the second picture as the search keyword after the recognition according to the preset policy.
可见,图4所示的装置通过各单元的相互配合,响应于用户触屏操作截取了两张范围大小不同的图片,通过先分别识别再根据预设策略综合处理两张图片的方式,从两张图片中选择出更可信的识别结果作为搜索关键词进行搜索。与现有技术相比,本方案具有以下有益效果:首先,实现了对用户触屏操作的搜索关键词的识别,无需用户手动输入搜索关键词,简化搜索操作流程,符合用户需求;其次,采用了两张相关图片综合识别的方式,避免了单张图片中信息的缺失或赘余,进一步提高了对图片中字符识别的准确率,进而提高了识别搜索关键词的准确率。It can be seen that the device shown in FIG. 4 intercepts two images with different range sizes in response to the user's touch screen operation through the mutual cooperation of the units, and separately processes the two images according to the preset strategy, and then In the picture, select a more reliable recognition result as a search keyword to search. Compared with the prior art, the solution has the following beneficial effects: Firstly, the recognition of the search keyword for the user's touch screen operation is realized, and the user does not need to manually input the search keyword, thereby simplifying the search operation process and meeting the user's needs; secondly, adopting The method of comprehensively identifying two related pictures avoids the lack or surplus of information in a single picture, further improves the accuracy of character recognition in the picture, and further improves the accuracy of identifying the search keyword.
在本发明的一个实施例中,图4所示装置的识别处理单元420,适于通过如上文中任一实施例所述的识别图片中的字符的装置300,分别识别第一图片和第二图片中的字符,得到相应的字符组合。In an embodiment of the present invention, the identification processing unit 420 of the apparatus shown in FIG. 4 is adapted to respectively identify the first picture and the second picture by means of the apparatus 300 for recognizing characters in the picture as described in any of the above embodiments. The characters in the box get the corresponding character combination.
进一步地,图4所示装置的识别处理单元420,还适于得到字符组合中各字符占位的像素坐标。Further, the identification processing unit 420 of the apparatus shown in FIG. 4 is further adapted to obtain pixel coordinates of each character occupying in the character combination.
在本发明的一个实施例中,图4所示装置的搜索处理单元430,适于 在第二图片对应的字符组合中,保留与第一图片对应的字符组合位置相同且长度相同字符组合;判断第二图片中保留的字符组合的平均语言模型分数是否小于第一图片对应的字符组合的平均模型分数;是则,选择第一图片对应的字符组合作为搜索关键词进行搜索;否则,选择第二图片对应的字符组合作为搜索关键词进行搜索。其中,具体地,搜索处理单元430,适于根据第一图片和第二图片分别对应的字符组合的像素坐标边界和每个字符占位的像素坐标,保留第二图片中与第一图片对应的字符组合位置相同且长度相同字符组合。此外,在一个实施例中,字符组合的平均语言模型分数是指:字符组合的概率的对数值,按照字符组合中的字符数取平均后得到的值。In an embodiment of the present invention, the search processing unit 430 of the apparatus shown in FIG. 4 is adapted to In the character combination corresponding to the second picture, the character combination position corresponding to the first picture is the same and the length is the same character combination; determining whether the average language model score of the reserved character combination in the second picture is smaller than the character combination corresponding to the first picture The average model score; yes, the character combination corresponding to the first picture is selected as the search keyword for searching; otherwise, the character combination corresponding to the second picture is selected as the search keyword for searching. Specifically, the search processing unit 430 is adapted to retain, according to the pixel coordinate boundary of the character combination corresponding to the first picture and the second picture, and the pixel coordinates of each character placeholder, retain the second picture corresponding to the first picture. A combination of characters with the same position and the same length. Moreover, in one embodiment, the average language model score for the combination of characters refers to the logarithm of the probability of the combination of characters, the value obtained by averaging the number of characters in the combination of characters.
具体的实施例如图5B和图5C所在的实施例,上文中已详细说明,在此不再赘述。The specific implementations are as shown in the embodiment of FIG. 5B and FIG. 5C, which have been described in detail above, and are not described herein again.
综上所述,本发明提供的技术方案作为一个整体来看,一方面,通过获取图片所包含的字符占位对应的候选字符集合以及每个候选字符对应的概率参数,对图片中可能包含的所有候选字符组合的概率进行计算,将概率最高的候选字符组合作为对图片的字符识别结果。另一方面,响应于用户触屏操作截取了两张范围大小不同的图片,通过先分别识别再根据预设策略综合处理两张图片的方式,从两张图片中选择出更可信的识别结果作为搜索关键词进行搜索。与现有技术相比,本方案具有以下有益效果:1、利用自然语言n元语法模型修正光学字符识别技术本身的识别错误问题,优化效果;2、动态规划找到最优的候选字符组合,提升识别效果;3、采用大图和小图的综合比较算法,相互验证和补充,避免了单张图片中信息的缺失或赘余;4、基于小图像素坐标来选择最终识别结果的位置和长度。本发明提高了对图片中字符识别的准确率,进而提高了识别搜索关键词的准确率,无需用户手动输入,提高搜索效率,符合用户需求。In summary, the technical solution provided by the present invention as a whole, on the one hand, obtains a candidate character set corresponding to a character placeholder included in a picture and a probability parameter corresponding to each candidate character, which may be included in the picture. The probability of all candidate character combinations is calculated, and the candidate character with the highest probability is combined as the character recognition result for the picture. On the other hand, in response to the user's touch screen operation, two pictures with different range sizes are intercepted, and a more reliable recognition result is selected from the two pictures by separately identifying and then processing the two pictures according to the preset strategy. Search as a search keyword. Compared with the prior art, the scheme has the following beneficial effects: 1. Using the natural language n-gram model to correct the recognition problem of the optical character recognition technology itself, and optimizing the effect; 2. Dynamic programming finds the optimal candidate character combination and improves Recognition effect; 3. Comprehensive comparison algorithm of large image and small image, mutual verification and supplement, avoiding the missing or surplus of information in a single picture; 4. Selecting the position and length of the final recognition result based on the pixel coordinates of the small image . The invention improves the accuracy of character recognition in the picture, thereby improving the accuracy of identifying the search keyword, without manual input by the user, improving the search efficiency and meeting the user's demand.
需要说明的是:It should be noted:
在此提供的算法和显示不与任何特定计算机、虚拟装置或者其它设备固有相关。各种通用装置也可以与基于在此的示教一起使用。根 据上面的描述,构造这类装置所要求的结构是显而易见的。此外,本发明也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本发明的内容,并且上面对特定语言所做的描述是为了披露本发明的最佳实施方式。The algorithms and displays provided herein are not inherently related to any particular computer, virtual device, or other device. Various general purpose devices can also be used with the teaching based on the teachings herein. Root The structure required to construct such a device is apparent from the above description. Moreover, the invention is not directed to any particular programming language. It is to be understood that the invention may be embodied in a variety of programming language, and the description of the specific language has been described above in order to disclose the preferred embodiments of the invention.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。Similarly, the various features of the invention are sometimes grouped together into a single embodiment, in the above description of the exemplary embodiments of the invention, Figure, or a description of it. However, the method disclosed is not to be interpreted as reflecting the intention that the claimed invention requires more features than those recited in the claims. Rather, as the following claims reflect, inventive aspects reside in less than all features of the single embodiments disclosed herein. Therefore, the claims following the specific embodiments are hereby explicitly incorporated into the embodiments, and each of the claims as a separate embodiment of the invention.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art will appreciate that the modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components. In addition to such features and/or at least some of the processes or units being mutually exclusive, any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined. Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本发明的范围之内并且形成不同的实施 例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。In addition, those skilled in the art will appreciate that, although some embodiments described herein include certain features that are included in other embodiments and not in other features, combinations of features of different embodiments are intended to be within the scope of the present invention. Within and form different implementations example. For example, in the following claims, any one of the claimed embodiments can be used in any combination.
本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本发明实施例的一种识别图片中的字符的装置中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or digital signal processor (DSP) may be used in practice to implement some of some or all of the means for identifying characters in a picture or in accordance with an embodiment of the present invention. All features. The invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
例如,图6示出了可以实现根据本发明的方法的终端设备。该和终端设备传统上包括处理器610和以存储器620形式的计算机程序产品或者计算机可读介质。存储器620可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。存储器620具有用于执行上述方法中的任何方法步骤的程序代码631的存储空间630。例如,用于程序代码的存储空间630可以包括分别用于实现上面的方法中的各种步骤的各个程序代码631。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。这些计算机程序产品包括诸如硬盘,紧致盘(CD)、存储卡或者软盘之类的程序代码载体。这样的计算机程序产品通常为如参考图7所述的便携式或者固定存储单元。该存储单元可以具有与图6的终端设备中的存储器620类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储单元包括计算机可读代码631’,即可以由例如诸如610之类的处理器读取的代码,这些代码当由终端设备运行时,导致该终端设备执行上面所描述的方法中的各个步骤。For example, Figure 6 shows a terminal device in which the method according to the invention can be implemented. The and terminal devices conventionally include a processor 610 and a computer program product or computer readable medium in the form of a memory 620. The memory 620 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM. Memory 620 has a memory space 630 for program code 631 for performing any of the method steps described above. For example, storage space 630 for program code may include various program code 631 for implementing various steps in the above methods, respectively. The program code can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks. Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG. The storage unit may have a storage section, a storage space, and the like arranged similarly to the storage 620 in the terminal device of FIG. The program code can be compressed, for example, in an appropriate form. Typically, the storage unit comprises computer readable code 631', ie code that can be read by a processor, such as 610, which when executed by the terminal device causes the terminal device to perform each of the methods described above step.
本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在 本发明的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。The term "one embodiment", "an embodiment" or "one or more embodiments" is used herein to mean that the specific features, structures, or characteristics described in connection with the embodiments are included. In at least one embodiment of the invention. In addition, it is noted that the phrase "in one embodiment" is not necessarily referring to the same embodiment.
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It is to be noted that the above-described embodiments are illustrative of the invention and are not intended to be limiting, and that the invention may be devised without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "a" or "an" The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.
此外,还应当注意,本说明书中使用的语言主要是为了可读性和教导的目的而选择的,而不是为了解释或者限定本发明的主题而选择的。因此,在不偏离所附权利要求书的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。对于本发明的范围,对本发明所做的公开是说明性的,而非限制性的,本发明的范围由所附权利要求书限定。 In addition, it should be noted that the language used in the specification has been selected for the purpose of readability and teaching, and is not intended to be construed or limited. Therefore, many modifications and changes will be apparent to those skilled in the art without departing from the scope of the invention. The disclosure of the present invention is intended to be illustrative, and not restrictive, and the scope of the invention is defined by the appended claims.

Claims (10)

  1. 一种识别图片中的字符的方法,其中,该方法包括:A method of identifying characters in a picture, wherein the method comprises:
    识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数;Identifying a character placeholder included in the picture, obtaining a candidate character set corresponding to each character placeholder and a probability parameter corresponding to each candidate character;
    按照所述图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合;According to the order of the characters occupied by the picture, one candidate character is selected from each candidate character set corresponding to each character placeholder to obtain a candidate character combination; multiple selections are performed, and different character combinations are selected each time. , obtaining multiple candidate character combinations;
    计算所得到的每个候选字符组合的概率,Calculate the probability of each candidate combination of characters obtained,
    将概率最高的候选字符组合作为对所述图片的字符识别结果。The candidate characters with the highest probability are combined as the result of character recognition for the picture.
  2. 如权利要求1所述的方法,其中,所述计算所得到的每个候选字符组合的概率为:The method of claim 1 wherein said calculating the probability of each candidate character combination obtained is:
    基于n元语法模型,计算所得到的每个候选字符组合的概率。Based on the n-gram model, the probability of each candidate combination of characters obtained is calculated.
  3. 如权利要求1或2所述的方法,其中,所述基于n元语法模型,计算所得到的每个候选字符组合的概率包括:The method of claim 1 or 2, wherein the probability of calculating each of the candidate character combinations obtained based on the n-gram model comprises:
    对于一个候选字符组合中的每个候选字符,根据各候选字符的概率参数,计算该候选字符在其前n-1个候选字符已经确定的条件下出现的条件概率;For each candidate character in a candidate character combination, calculating a conditional probability that the candidate character appears under the condition that the first n-1 candidate characters have been determined according to the probability parameter of each candidate character;
    计算该候选字符组合中各候选字符的条件概率的乘积,作为该候选字符组合的概率。The product of the conditional probabilities of the candidate characters in the candidate character combination is calculated as the probability of the candidate character combination.
  4. 如权利要求1-3中任一项所述的方法,其中,A method according to any one of claims 1 to 3, wherein
    利用光学字符识别技术识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数。The optical character recognition technology is used to identify the character occupying positions included in the picture, and the candidate character set corresponding to each character placeholder and the probability parameter corresponding to each candidate character are obtained.
  5. 一种识别图片中的字符的装置,其中,该装置包括:A device for recognizing characters in a picture, wherein the device comprises:
    获取单元,适于识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数;An obtaining unit, configured to identify a character placeholder included in the picture, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character;
    预处理单元,适于按照所述图片所包含的各字符占位的顺序,依次从每个字符占位对应的候选字符集合中选取一个候选字符得到一个候选字符组合;进行多次选取,每次选取到不同的字符组合,得到多个候选字符组合;The pre-processing unit is adapted to select one candidate character from the candidate character set corresponding to each character placeholder in turn according to the order of the characters occupied by the picture to obtain a candidate character combination; Select different character combinations to get multiple candidate character combinations;
    识别单元,适于计算所得到的每个候选字符组合的概率,将概率最高 的候选字符组合作为对所述图片的字符识别结果。The identification unit is adapted to calculate the probability of each candidate character combination obtained, and has the highest probability The candidate character combination is used as the character recognition result for the picture.
  6. 如权利要求5所述的装置,其中,The apparatus according to claim 5, wherein
    所述识别单元,适于基于n元语法模型,计算所得到的每个候选字符组合的概率。The identification unit is adapted to calculate a probability of each of the candidate character combinations obtained based on the n-gram model.
  7. 如权利要求5或6所述的装置,其中,The apparatus according to claim 5 or 6, wherein
    所述识别单元,适于对于一个候选字符组合中的每个候选字符,根据各候选字符的概率参数,计算该候选字符在其前n-1个候选字符已经确定的条件下出现的条件概率;再计算该候选字符组合中各候选字符的条件概率的乘积,作为该候选字符组合的概率。The identifying unit is adapted to calculate, for each candidate character in a candidate character combination, a conditional probability that the candidate character appears under the condition that the first n-1 candidate characters have been determined according to the probability parameter of each candidate character; The product of the conditional probabilities of the candidate characters in the candidate character combination is then calculated as the probability of the candidate character combination.
  8. 如权利要求5-7中任一项所述的装置,其中,A device according to any of claims 5-7, wherein
    所述获取单元,适于利用光学字符识别技术识别图片所包含的字符占位,获取每个字符占位对应的候选字符集合以及每个候选字符对应的概率参数The acquiring unit is configured to identify a character occupying position included in the picture by using an optical character recognition technology, obtain a candidate character set corresponding to each character placeholder, and a probability parameter corresponding to each candidate character.
  9. 一种计算机程序,包括计算机可读代码,当所述计算机可读代码在终端设备上运行时,导致所述终端设备执行根据权利要求1-4中的任一个所述的识别图片中的字符的方法。A computer program comprising computer readable code, when said computer readable code is run on a terminal device, causing said terminal device to perform recognition of a character in a picture according to any one of claims 1-4 method.
  10. 一种计算机可读介质,其中存储了如权利要求9所述的计算机程序。 A computer readable medium storing the computer program of claim 9.
PCT/CN2016/077865 2015-04-03 2016-03-30 Method and apparatus for recognizing characters in picture WO2016155627A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510158353.9A CN104766077B (en) 2015-04-03 2015-04-03 Method and device for recognizing characters in picture
CN201510158353.9 2015-04-03

Publications (1)

Publication Number Publication Date
WO2016155627A1 true WO2016155627A1 (en) 2016-10-06

Family

ID=53647891

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/077865 WO2016155627A1 (en) 2015-04-03 2016-03-30 Method and apparatus for recognizing characters in picture

Country Status (2)

Country Link
CN (1) CN104766077B (en)
WO (1) WO2016155627A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444906A (en) * 2020-03-24 2020-07-24 腾讯科技(深圳)有限公司 Image recognition method based on artificial intelligence and related device
CN111914825A (en) * 2020-08-03 2020-11-10 腾讯科技(深圳)有限公司 Character recognition method and device and electronic equipment
CN112232347A (en) * 2020-09-07 2021-01-15 平安科技(深圳)有限公司 Character recognition method, device and equipment based on probability matrix and storage medium
CN113221890A (en) * 2021-05-25 2021-08-06 深圳市瑞驰信息技术有限公司 OCR-based cloud mobile phone text content supervision method, system and system

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104766077B (en) * 2015-04-03 2017-04-12 北京奇虎科技有限公司 Method and device for recognizing characters in picture
CN105069898B (en) * 2015-07-15 2018-03-30 广州敦和信息技术有限公司 The method and device that a kind of invoice brief note is automatically positioned
JP6759306B2 (en) * 2018-11-26 2020-09-23 キヤノン株式会社 Image processing device and its control method, program
CN110414450A (en) * 2019-07-31 2019-11-05 北京字节跳动网络技术有限公司 Keyword detection method, apparatus, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090208107A1 (en) * 2008-02-19 2009-08-20 Fujitsu Limited Character recognition apparatus and character recognition method
CN102024139A (en) * 2009-09-18 2011-04-20 富士通株式会社 Device and method for recognizing character strings
CN102982330A (en) * 2012-11-21 2013-03-20 新浪网技术(中国)有限公司 Method and device recognizing characters in character images
CN104766077A (en) * 2015-04-03 2015-07-08 北京奇虎科技有限公司 Method and device for recognizing characters in picture

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102298486A (en) * 2010-06-22 2011-12-28 广东国笔科技股份有限公司 Fast calling system and method based on touch screen
CN102750272B (en) * 2012-07-02 2015-01-14 安徽科大讯飞信息科技股份有限公司 Method and system for optimizing hand-input candidate item of character
CN102799379A (en) * 2012-07-13 2012-11-28 北京小米科技有限责任公司 Character input method and character input device
CN103076892B (en) * 2012-12-31 2016-09-28 百度在线网络技术(北京)有限公司 A kind of method and apparatus of the input candidate item for providing corresponding to input character string
CN104360759B (en) * 2014-11-21 2017-03-08 百度在线网络技术(北京)有限公司 Candidate word sort method, device and character input method, equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090208107A1 (en) * 2008-02-19 2009-08-20 Fujitsu Limited Character recognition apparatus and character recognition method
CN102024139A (en) * 2009-09-18 2011-04-20 富士通株式会社 Device and method for recognizing character strings
CN102982330A (en) * 2012-11-21 2013-03-20 新浪网技术(中国)有限公司 Method and device recognizing characters in character images
CN104766077A (en) * 2015-04-03 2015-07-08 北京奇虎科技有限公司 Method and device for recognizing characters in picture

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444906A (en) * 2020-03-24 2020-07-24 腾讯科技(深圳)有限公司 Image recognition method based on artificial intelligence and related device
CN111444906B (en) * 2020-03-24 2023-09-29 腾讯科技(深圳)有限公司 Image recognition method and related device based on artificial intelligence
CN111914825A (en) * 2020-08-03 2020-11-10 腾讯科技(深圳)有限公司 Character recognition method and device and electronic equipment
CN111914825B (en) * 2020-08-03 2023-10-27 腾讯科技(深圳)有限公司 Character recognition method and device and electronic equipment
CN112232347A (en) * 2020-09-07 2021-01-15 平安科技(深圳)有限公司 Character recognition method, device and equipment based on probability matrix and storage medium
CN112232347B (en) * 2020-09-07 2023-06-20 平安科技(深圳)有限公司 Character recognition method, device, equipment and storage medium based on probability matrix
CN113221890A (en) * 2021-05-25 2021-08-06 深圳市瑞驰信息技术有限公司 OCR-based cloud mobile phone text content supervision method, system and system

Also Published As

Publication number Publication date
CN104766077B (en) 2017-04-12
CN104766077A (en) 2015-07-08

Similar Documents

Publication Publication Date Title
WO2016155627A1 (en) Method and apparatus for recognizing characters in picture
US11461392B2 (en) Providing relevant cover frame in response to a video search query
US10824874B2 (en) Method and apparatus for processing video
US8649614B2 (en) Method of compiling image database, image database compilation program, image retrieval method
WO2015184992A1 (en) Method for recognizing duplicate image, and image search and deduplication method and device thereof
CN110555372A (en) Data entry method, device, equipment and storage medium
JP5615932B2 (en) Search method and system
US8577882B2 (en) Method and system for searching multilingual documents
US11461386B2 (en) Visual recognition using user tap locations
JP2018527656A (en) Method and device for comparing similarity of high-dimensional features of images
WO2019080411A1 (en) Electrical apparatus, facial image clustering search method, and computer readable storage medium
US10482146B2 (en) Systems and methods for automatic customization of content filtering
WO2015197029A1 (en) Human face similarity recognition method and system
CN110321537B (en) Method and device for generating file
WO2016015621A1 (en) Human face picture name recognition method and system
US20140289632A1 (en) Picture drawing support apparatus and method
CN108595679B (en) Label determining method, device, terminal and storage medium
US10152540B2 (en) Linking thumbnail of image to web page
CN106156794B (en) Character recognition method and device based on character style recognition
US11829710B2 (en) Deriving global intent from a composite document to facilitate editing of the composite document
JP2016015018A (en) Service providing device, method, and program
WO2016058520A1 (en) Method and apparatus for recognizing name of face picture
WO2017000341A1 (en) Information processing method, device, and terminal
CN107909054B (en) Similarity evaluation method and device for picture texts
WO2015131528A1 (en) Method and apparatus for determining topic distribution of given text

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16771382

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16771382

Country of ref document: EP

Kind code of ref document: A1