WO2024051427A1 - Coin identification method and system, and storage medium - Google Patents

Coin identification method and system, and storage medium Download PDF

Info

Publication number
WO2024051427A1
WO2024051427A1 PCT/CN2023/111654 CN2023111654W WO2024051427A1 WO 2024051427 A1 WO2024051427 A1 WO 2024051427A1 CN 2023111654 W CN2023111654 W CN 2023111654W WO 2024051427 A1 WO2024051427 A1 WO 2024051427A1
Authority
WO
WIPO (PCT)
Prior art keywords
coin
area
image
identification method
user
Prior art date
Application number
PCT/CN2023/111654
Other languages
French (fr)
Chinese (zh)
Inventor
徐青松
李青
Original Assignee
杭州睿胜软件有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州睿胜软件有限公司 filed Critical 杭州睿胜软件有限公司
Publication of WO2024051427A1 publication Critical patent/WO2024051427A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761Proximity, similarity or dissimilarity measures

Definitions

  • the invention relates to the field of computer technology, and in particular to a coin identification method, system and storage medium.
  • Coins can be divided into different types such as coins and commemorative coins. Whether they are circulating currencies, collected coins or commemorative coins, there is a certain need for identification. However, due to reasons such as less circulation or natural rarity of some coins, samples are scarce. At the same time, Due to the unique materials of coins, different textures between old and new, problems with light reflection, uncertain user shooting environment, and complex backgrounds, even the same coins have various characteristics, and the difference between user coin images and standard coin images is relatively large. Large, different currencies have similar shapes, and the key distinguishing features are textures, text and other features. Therefore, the shooting quality has a great impact on recognition. A slight blur will lead to the loss of key information. At the same time, there are different situations where the shooting is too far or the resolution is low. Therefore, Coin recognition is technically difficult, resulting in poor recognition accuracy of existing coins.
  • One of the purposes of this disclosure is to provide a coin identification method, including:
  • identifying the coin image to obtain the coin area includes: applying UNet image segmentation processing to the coin image to obtain the coin area.
  • identifying the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area include:
  • the method further includes: performing ellipse detection and perfect circle correction on the coin area image.
  • the method further includes: performing image preprocessing on the coin area image obtained by the segmentation to obtain a coin area image with background and noise removed;
  • Identifying the coin area picture and obtaining multiple characteristic areas includes: identifying the coin area picture that has undergone the preprocessing and obtaining multiple characteristic areas.
  • the method further includes: performing image preprocessing on the coin image provided by the user to obtain the coin image with background and noise removed;
  • Recognizing the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image includes: identifying the pre-processed coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image.
  • the method further includes: performing post-processing on the coin area image with a CV algorithm to remove excess noise.
  • obtaining the coin picture provided by the user includes: providing at least two windows for the user to select on the interactive interface, so that the user can respectively choose to provide pictures of different sides of the coin currently to be identified.
  • identifying the coin area picture and obtaining multiple feature areas includes: identifying the coin area picture according to a pre-trained feature area recognition model to obtain multiple feature areas.
  • a self-supervised pre-training backbone model based on contrastive learning is used to identify the plurality of feature areas respectively and obtain feature information of multiple coins.
  • the combination of multiple coin characteristic information and obtaining the classification information of the coin includes: using a similarity comparison method to obtain the comprehensive similarity of the coin, and classifying the coins according to the comprehensive similarity. Classification information is sorted.
  • a coin identification system including a processor and storage
  • a program is stored in the memory, and when the program is executed by the processor, the coin identification method as described above is implemented.
  • a storage medium is proposed on which a program is stored, which when executed implements the coin identification method as described above.
  • Figure 1 shows a schematic flowchart of a coin identification method according to an embodiment of the present invention.
  • Figure 2 shows a schematic structural diagram of a coin identification system provided by an embodiment of the present invention.
  • any specific values are to be construed as illustrative only and not as limiting. Accordingly, other examples of the exemplary embodiments may have different values.
  • FIG. 1 shows a schematic flowchart of a coin identification method according to an embodiment of the present invention.
  • This method can be implemented in an application program (app) installed on a smart terminal such as a mobile phone or tablet computer.
  • the method includes:
  • Step S100 Obtain the coin image provided by the user
  • Step S200 Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
  • Step S300 Identify the coin area image and obtain multiple feature areas
  • Step S400 Identify the multiple feature areas respectively and obtain multiple coin feature information
  • Step S500 Combine the feature information of the multiple coins to obtain the classification information of the coins.
  • identifying the coin image to obtain the coin area includes: applying UNet image segmentation processing to the coin image to obtain the coin area.
  • the UNet image segmentation adopts a pre-trained neural network model, which can identify coin images to obtain coin areas, and perform foreground segmentation on coin images to obtain coin areas.
  • other image segmentation processing methods can also be used, which is not the case in this disclosure. Make restrictions.
  • identifying the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area include:
  • the coin area is identified according to the area detection model, the coin mask area is obtained, and the coin area image is segmented and obtained. You can use the coin area obtained by the above UNet image segmentation process and directly crop it to obtain the coin area picture. You can also identify the coin area based on the pre-trained coin area detection model to obtain a more accurate coin area and crop it. Get a picture of the coin area.
  • the method further includes: performing ellipse detection and perfect circle correction on the coin area image.
  • the actual processing is to perform ellipse detection on the coin area picture, obtain the equation of the ellipse, and then calculate the pixel mapping parameters to restore the circle according to the ellipse equation, perform affine transformation and interpolation processing, and convert the coin ellipse in the user's coin picture into A perfect circle shape, thereby completing the perfect circle correction process, which facilitates later feature recognition processing and corresponding content recognition.
  • the method further includes: performing image preprocessing on the segmented coin area image, obtaining a coin area image with background and noise removed, and then identifying the preprocessed coin area image, Get multiple feature areas.
  • the method further includes: performing image preprocessing on the coin image provided by the user, obtaining the coin image with the background and noise removed, and then identifying the preprocessed coin image to obtain the coin area, and segmenting Get a picture of the coin area.
  • the preprocessing step is advanced to after obtaining the coin image provided by the user, and the user's coin image is directly preprocessed into a clean, background-free, and noise-free image, so as to minimize the Interference from redundant information.
  • the pre-processed coin image is identified to obtain the coin area according to the pre-trained area recognition model (such as YOLO and other models), and the coin area image is segmented to obtain the coin area image.
  • Advancing the preprocessing steps can also remove the interference of background factors and improve the accuracy of recognition.
  • the coin area image is post-processed with a CV algorithm to remove excess noise.
  • Segmentation to obtain the coin area image may include interference factors such as redundant image blocks.
  • the edges of the coin area can be obtained through edge detection models or algorithms, such as Hough circle detection algorithms, pre-trained coin edge detection models, etc. to obtain the coins. Rounded edges can also be assisted by morphological processing methods to remove excess noise interference in the coin area and obtain a more stable coin mask area.
  • coin pictures with more interference factors can be re-invested into the training model as supplementary samples, so as to obtain a more accurate recognition model for identifying coin areas.
  • obtaining the coin picture provided by the user includes: providing at least two windows for the user to select on the interactive interface, so that the user can respectively choose to provide pictures of different sides of the coin currently to be identified.
  • the user-side interactive interface guides the user to provide pictures of the front and back of the coin, which can facilitate the acquisition of more coin features and make subsequent identification processing more accurate.
  • Identification can be performed, and this disclosure does not limit this.
  • the front and back sides of coins are classified through the image binary classification method, and the coin front and back sides classification and recognition model is obtained through the pre-established sample training model.
  • identifying the coin area picture and obtaining multiple feature areas includes: identifying the coin area picture according to a pre-trained feature area recognition model to obtain multiple feature areas.
  • Different coins have similar basic image structures and can be roughly classified, such as portraits or patterns on the front, patterns on the back, year, text (letters or numbers), etc., which can be disassembled and classified. Due to the various problems described in the background art, the overall recognition accuracy of coins in the prior art is low. However, it is possible to split the coin features into multiple pre-classified feature areas for separate identification, and to obtain the classification information of the coins through comprehensive judgment. Significantly improve the accuracy of coin recognition.
  • the pre-trained feature area recognition model performs area recognition, detection and cropping processing on the key common areas of the coins, and processes the overall features of the coins into multiple different feature areas to facilitate subsequent feature recognition.
  • a self-supervised pre-training backbone model based on contrastive learning is used to separately identify multiple feature areas and obtain feature information of multiple coins. By extracting features from multiple different feature areas of the coin, and then identifying or retrieving them, the corresponding feature identification information is obtained.
  • the unique distinguishing features between coins are textures, text, etc.
  • the general pre-training model is less effective. The core difficulty lies in the small number of training images. Pre-training is done through self-supervision based on contrastive learning, similarity comparison is performed, and the correspondence is obtained.
  • the self-supervised pre-training backbone model based on contrastive learning can significantly improve the recognition accuracy. Conduct fine-tuning and regression testing on the self-supervised pre-training backbone model based on contrastive learning, adjust parameters and test sample sets, and obtain bones with higher recognition accuracy. dry model.
  • the combination of multiple coin characteristic information and obtaining the classification information of the coin includes using a similarity comparison method to identify the characteristic information of multiple different characteristic areas of the coin, and through the combination with the standard
  • the characteristic information is compared with the obtained similarities, and then the different similarities of the multiple characteristic areas are combined to obtain the comprehensive similarity of the coin, and the classification information of the coins is sorted according to the comprehensive similarity.
  • the recognition results of one or more coins with the highest similarity matching degree can be output and displayed to the user, or all coin classification information can be output and displayed to the user in order of similarity, so that the user can browse and select.
  • the similarity between each feature area and standard feature information can be obtained by searching in a preset standard feature information database or using pre-trained model recognition. Calculating the comprehensive similarity of coins can also be done by setting different weights according to the information of different feature areas.
  • the recognition results of each of the multiple feature areas and the confidence of the recognition results are obtained according to the self-supervised pre-training backbone model based on contrastive learning (the confidence is the current value obtained by model recognition).
  • the credibility of the identification result thereby comprehensively obtaining the overall confidence of the identification result of the coin to be identified, and at the same time, combined with the comprehensive similarity of the coin to be identified that has been obtained above, the overall confidence and comprehensive similarity
  • the probability score of the current recognition result of the coin to be recognized is obtained through weighted calculation, and finally the coin recognition results are sorted and output and displayed according to the probability score. Calculating the overall confidence of the coin can also be done by setting different weights according to the information in different feature areas.
  • the year recognition results of the coins to be recognized are obtained, filtered according to the closest coin year number, and then the recognition results are output, for example, the same coins
  • the recognition results have multiple year versions, and the recognition results of the same year or the closest year version as the current coin to be recognized are filtered and output are displayed.
  • the present invention also proposes a coin identification system, which includes a processor and a memory.
  • a program is stored on the memory.
  • the coin identification method as described above is implemented.
  • Figure 2 is a schematic structural diagram of a coin identification system provided by an embodiment of the present invention.
  • the coin identification system includes a processor 301, a communication interface 302, memory 303 and communication bus 304.
  • the processor 301, the communication interface 302, and the memory 303 complete communication with each other through the communication bus 304.
  • the memory 303 is used to store computer programs.
  • processor 301 When the processor 301 is used to execute the program stored in the memory 303, it implements the following steps:
  • the communication bus 304 mentioned in the above-mentioned electronic equipment may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc.
  • PCI Peripheral Component Interconnect
  • EISA Extended Industry Standard Architecture
  • the communication bus 304 can be divided into an address bus, a data bus, a control bus, etc. For ease of presentation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.
  • the communication interface 302 is used for communication between the above-mentioned electronic device and other devices.
  • the processor 301 may be a central processing unit (CPU), or other general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general processor may be a microprocessor or the processor may be any conventional processor, etc.
  • the processor 301 is the control center of the electronic device and uses various interfaces and lines to connect various parts of the entire electronic device.
  • the memory 303 can be used to store the computer program.
  • the processor 301 implements various functions of the electronic device by running or executing the computer program stored in the memory 303 and calling the data stored in the memory 303. Function.
  • the memory 303 may include non-volatile and/or volatile memory.
  • Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
  • the present invention also proposes a storage medium on which a program is stored.
  • the program When the program is executed, the following steps are implemented:
  • the computer-readable storage medium in the embodiment of the present invention may be any combination of one or more computer-readable media.
  • the computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, device or device, or any combination thereof.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in combination with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
  • Computer program code for performing the operations of the present invention may be written in one or more programming languages, or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional Procedural programming language - such as "C" or similar programming language.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider) through the Internet. ).
  • LAN local area network
  • WAN wide area network
  • Internet service provider such as an Internet service provider
  • each block in the flowchart or block diagrams may represent a module, program, or portion of code that contains one or more operable functions for implementing the specified logical functions.
  • Execution instructions, the module, program segment or part of the code contains one or more executable instructions for implementing the specified logical function.
  • the functions noted in the block may occur out of the order noted in the figures.
  • each block in the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration can be designed into specialized hardware-based systems that perform the specified functions or acts. Implemented, or may be implemented using a combination of dedicated hardware and computer instructions.
  • each functional module in each embodiment of this article can be integrated together to form an independent part, each module can exist alone, or two or more modules can be integrated to form an independent part.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Testing Of Coins (AREA)

Abstract

Provided in the present invention are a coin identification method and system, and a storage medium. The method comprises: acquiring a coin picture, which is provided by a user; performing identification on the coin picture to acquire a coin region, and performing segmentation to acquire coin region pictures; performing identification on the coin region pictures to acquire a plurality of feature regions; performing identification on each of the plurality of feature regions to acquire a plurality of pieces of coin feature information; and synthesizing the plurality of pieces of coin feature information to acquire classification information of a coin. The coin identification method provided in the present invention can improve the accuracy of coin identification.

Description

硬币识别方法、系统及存储介质Coin identification method, system and storage medium 技术领域Technical field
本发明涉及计算机技术领域,特别涉及一种硬币识别方法、系统及存储介质。The invention relates to the field of computer technology, and in particular to a coin identification method, system and storage medium.
背景技术Background technique
硬币可以分为钱币以及纪念币等不同种类,无论是流通货币还是收藏钱币或是纪念币的识别都具有一定的需求,然而由于部分硬币流通较少或是天然较为稀有等原因造成样本稀少,同时由于硬币存在材质的特殊性,新旧的纹理不同,光线反射的问题,用户拍摄环境不确定,背景复杂,造成即便是相同的硬币也具有各种不同的特征,用户硬币图和标准硬币图差别较大,不同币种形态接近,关键区分度特征为纹理,文字等特征,因此拍摄质量对识别影响大,稍微模糊会导致关键信息全无,同时存在过远拍摄或分辨率低的不同情况,因此硬币识别在技术上具有一定难度,导致现有硬币的识别准确率较差。Coins can be divided into different types such as coins and commemorative coins. Whether they are circulating currencies, collected coins or commemorative coins, there is a certain need for identification. However, due to reasons such as less circulation or natural rarity of some coins, samples are scarce. At the same time, Due to the unique materials of coins, different textures between old and new, problems with light reflection, uncertain user shooting environment, and complex backgrounds, even the same coins have various characteristics, and the difference between user coin images and standard coin images is relatively large. Large, different currencies have similar shapes, and the key distinguishing features are textures, text and other features. Therefore, the shooting quality has a great impact on recognition. A slight blur will lead to the loss of key information. At the same time, there are different situations where the shooting is too far or the resolution is low. Therefore, Coin recognition is technically difficult, resulting in poor recognition accuracy of existing coins.
发明内容Contents of the invention
本公开的目的之一是提供一种硬币识别方法,包括:One of the purposes of this disclosure is to provide a coin identification method, including:
获取用户提供的硬币图片;Get the coin image provided by the user;
对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片;Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
对所述硬币区域图片进行识别,获取多个特征区域;Identify the coin area pictures and obtain multiple feature areas;
对所述多个特征区域分别进行识别,获取多个硬币特征信息;Respectively identify the multiple feature areas to obtain multiple coin feature information;
综合所述多个硬币特征信息,获取所述硬币的分类信息。Combining the feature information of the multiple coins, the classification information of the coins is obtained.
在一些实施例中,对所述硬币图片进行识别获取硬币区域包括:对所述硬币图片采用UNet图像分割处理,从而获取所述硬币区域。In some embodiments, identifying the coin image to obtain the coin area includes: applying UNet image segmentation processing to the coin image to obtain the coin area.
在一些实施例中,对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片包括:In some embodiments, identifying the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area include:
根据区域检测模型对所述硬币区域进行识别,获取硬币掩膜区域,并分 割获取硬币区域图片。Identify the coin area according to the area detection model, obtain the coin mask area, and classify Cut to get the coin area picture.
在一些实施例中,该方法还包括:对所述硬币区域图片进行椭圆检测以及正圆矫正处理。In some embodiments, the method further includes: performing ellipse detection and perfect circle correction on the coin area image.
在一些实施例中,该方法还包括:对所述分割获取的硬币区域图片进行图像预处理,获取去除背景和噪声的硬币区域图片;In some embodiments, the method further includes: performing image preprocessing on the coin area image obtained by the segmentation to obtain a coin area image with background and noise removed;
对所述硬币区域图片进行识别,获取多个特征区域包括:对经过所述预处理的硬币区域图片进行识别,获取多个特征区域。Identifying the coin area picture and obtaining multiple characteristic areas includes: identifying the coin area picture that has undergone the preprocessing and obtaining multiple characteristic areas.
在一些实施例中,该方法还包括:对用户提供的硬币图片进行图像预处理,获取去除背景和噪声的硬币图片;In some embodiments, the method further includes: performing image preprocessing on the coin image provided by the user to obtain the coin image with background and noise removed;
对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片包括:对经过所述预处理的硬币图片进行识别获取硬币区域,并分割获取硬币区域图片。Recognizing the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image includes: identifying the pre-processed coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image.
在一些实施例中,所述分割获取硬币区域图片后,该方法还包括:对所述硬币区域图片进行CV算法的后处理,去除多余噪点。In some embodiments, after the coin area image is obtained by segmentation, the method further includes: performing post-processing on the coin area image with a CV algorithm to remove excess noise.
在一些实施例中,所述获取用户提供的硬币图片包括:在交互界面提供供用户选择的至少两个窗口,以供用户分别选择提供当前待识别硬币的不同面的图片。In some embodiments, obtaining the coin picture provided by the user includes: providing at least two windows for the user to select on the interactive interface, so that the user can respectively choose to provide pictures of different sides of the coin currently to be identified.
在一些实施例中,根据预训练的正反面识别模型判断用户提供的硬币图片属于正面或是反面。In some embodiments, it is determined whether the coin image provided by the user belongs to the front or the back according to a pre-trained front and back recognition model.
在一些实施例中,所述对硬币区域图片进行识别,获取多个特征区域包括:根据预训练的特征区域识别模型对所述硬币区域图片进行识别,获取多个特征区域。In some embodiments, identifying the coin area picture and obtaining multiple feature areas includes: identifying the coin area picture according to a pre-trained feature area recognition model to obtain multiple feature areas.
在一些实施例中,采用基于对比学习的自监督预训练骨干模型对所述对多个特征区域分别进行识别,获取多个硬币特征信息。In some embodiments, a self-supervised pre-training backbone model based on contrastive learning is used to identify the plurality of feature areas respectively and obtain feature information of multiple coins.
在一些实施例中,所述综合多个硬币特征信息,获取所述硬币的分类信息包括:采用相似度比较方法获取所述硬币的综合相似度,并按照所述综合相似度将所述硬币的分类信息进行排序。In some embodiments, the combination of multiple coin characteristic information and obtaining the classification information of the coin includes: using a similarity comparison method to obtain the comprehensive similarity of the coin, and classifying the coins according to the comprehensive similarity. Classification information is sorted.
根据本公开的另一方面,提出了一种硬币识别系统,包括处理器和存储 器,所述存储器上存储有程序,所述程序被所述处理器执行时,实现如上所述的硬币识别方法。According to another aspect of the present disclosure, a coin identification system is proposed, including a processor and storage A program is stored in the memory, and when the program is executed by the processor, the coin identification method as described above is implemented.
根据本公开的另一方面,提出了一种存储介质,其上存储有程序,所述程序被执行时实现如上所述的硬币识别方法。According to another aspect of the present disclosure, a storage medium is proposed on which a program is stored, which when executed implements the coin identification method as described above.
通过以下参照附图对本公开的示例性实施例的详细描述,本公开的其它特征及其优点将会变得更为清楚。Other features and advantages of the present disclosure will become more apparent from the following detailed description of exemplary embodiments of the present disclosure with reference to the accompanying drawings.
附图说明Description of the drawings
构成说明书的一部分的附图描述了本公开的实施例,并且连同说明书一起用于解释本公开的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the disclosure and, together with the description, serve to explain principles of the disclosure.
参照附图,根据下面的详细描述,可以更加清楚地理解本公开,其中:The present disclosure may be more clearly understood from the following detailed description with reference to the accompanying drawings, in which:
图1所示为本发明一实施例提供的硬币识别方法的流程示意图。Figure 1 shows a schematic flowchart of a coin identification method according to an embodiment of the present invention.
图2所示为本发明一实施例提供的硬币识别系统的结构示意图。Figure 2 shows a schematic structural diagram of a coin identification system provided by an embodiment of the present invention.
注意,在以下说明的实施方式中,有时在不同的附图之间共同使用同一附图标记来表示相同部分或具有相同功能的部分,而省略其重复说明。在一些情况中,使用相似的标号和字母表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步讨论。Note that in the embodiments described below, the same reference numerals are sometimes commonly used between different drawings to represent the same parts or parts having the same functions, and repeated description thereof is omitted. In some instances, similar numbers and letters are used to identify similar items so that, once an item is defined in one figure, it does not require further discussion in subsequent figures.
为了便于理解,在附图等中所示的各结构的位置、尺寸及范围等有时不表示实际的位置、尺寸及范围等。因此,本公开并不限于附图等所公开的位置、尺寸及范围等。In order to facilitate understanding, the positions, dimensions, ranges, etc. of each structure shown in the drawings and the like may not represent the actual positions, dimensions, ranges, etc. Therefore, the present disclosure is not limited to the positions, dimensions, ranges, etc. disclosed in the drawings and the like.
具体实施方式Detailed ways
下面将参照附图来详细描述本公开的各种示例性实施例。应注意到:除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本公开的范围。Various exemplary embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings. It should be noted that the relative arrangement of components and steps, numerical expressions, and numerical values set forth in these examples do not limit the scope of the disclosure unless otherwise specifically stated.
以下对至少一个示例性实施例的描述实际上仅仅是说明性的,决不作为对本公开及其应用或使用的任何限制。也就是说,本文中的结构及方法是以示例性的方式示出,来说明本公开中的结构和方法的不同实施例。然而,本 领域技术人员将会理解,它们仅仅说明可以用来实施的本公开的示例性方式,而不是穷尽的方式。此外,附图不必按比例绘制,一些特征可能被放大以示出具体组件的细节。The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application or uses. That is, the structures and methods herein are shown in an exemplary manner to illustrate different embodiments of the structures and methods in the present disclosure. However, this Those skilled in the art will understand that they are merely illustrative of exemplary ways in which the disclosure may be practiced, and are not exhaustive. Furthermore, the drawings are not necessarily to scale and some features may be exaggerated to illustrate details of specific components.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为授权说明书的一部分。Techniques, methods and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods and devices should be considered part of the authorized specification.
在这里示出和讨论的所有示例中,任何具体值应被解释为仅仅是示例性的,而不是作为限制。因此,示例性实施例的其它示例可以具有不同的值。In all examples shown and discussed herein, any specific values are to be construed as illustrative only and not as limiting. Accordingly, other examples of the exemplary embodiments may have different values.
图1所示为本发明一实施例提供的硬币识别方法的流程示意图。该方法可以在例如手机、平板电脑等智能终端上安装的应用程序(app)中实现。如图1所示,该方法包括:Figure 1 shows a schematic flowchart of a coin identification method according to an embodiment of the present invention. This method can be implemented in an application program (app) installed on a smart terminal such as a mobile phone or tablet computer. As shown in Figure 1, the method includes:
步骤S100:获取用户提供的硬币图片;Step S100: Obtain the coin image provided by the user;
步骤S200:对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片;Step S200: Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
步骤S300:对所述硬币区域图片进行识别,获取多个特征区域;Step S300: Identify the coin area image and obtain multiple feature areas;
步骤S400:对所述多个特征区域分别进行识别,获取多个硬币特征信息;Step S400: Identify the multiple feature areas respectively and obtain multiple coin feature information;
步骤S500:综合所述多个硬币特征信息,获取所述硬币的分类信息。Step S500: Combine the feature information of the multiple coins to obtain the classification information of the coins.
在一些实施例中,对所述硬币图片进行识别获取硬币区域包括:对所述硬币图片采用UNet图像分割处理,从而获取所述硬币区域。所述UNet图像分割采用预先训练的神经网络模型,其可以对硬币图片进行识别获取硬币区域,对硬币图片进行前景分割从而得到硬币区域,此外也可以采用其他图像分割处理方法,本公开并不对此进行限制。In some embodiments, identifying the coin image to obtain the coin area includes: applying UNet image segmentation processing to the coin image to obtain the coin area. The UNet image segmentation adopts a pre-trained neural network model, which can identify coin images to obtain coin areas, and perform foreground segmentation on coin images to obtain coin areas. In addition, other image segmentation processing methods can also be used, which is not the case in this disclosure. Make restrictions.
在一些实施例中,对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片包括:In some embodiments, identifying the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area include:
根据区域检测模型对所述硬币区域进行识别,获取硬币掩膜区域,并分割获取硬币区域图片。可以采用上述UNet图像分割处理获取的硬币区域,并直接裁剪处理获取硬币区域图片,也可以再根据预训练的硬币区域检测模型对所述硬币区域进行识别,获取更加精准的硬币区域,并且裁剪处理获取硬币区域图片。 The coin area is identified according to the area detection model, the coin mask area is obtained, and the coin area image is segmented and obtained. You can use the coin area obtained by the above UNet image segmentation process and directly crop it to obtain the coin area picture. You can also identify the coin area based on the pre-trained coin area detection model to obtain a more accurate coin area and crop it. Get a picture of the coin area.
在一些实施例中,该方法还包括:对所述硬币区域图片进行椭圆检测以及正圆矫正处理。实际处理中是对所述硬币区域图片进行椭圆检测,获取椭圆的方程,再根据椭圆方程计算恢复成圆的像素映射参数,进行仿射变换以及插值处理,将用户硬币图片中的硬币椭圆转换为正圆形状,从而完成正圆矫正处理,方便后期的特征识别处理和相应的内容识别。In some embodiments, the method further includes: performing ellipse detection and perfect circle correction on the coin area image. The actual processing is to perform ellipse detection on the coin area picture, obtain the equation of the ellipse, and then calculate the pixel mapping parameters to restore the circle according to the ellipse equation, perform affine transformation and interpolation processing, and convert the coin ellipse in the user's coin picture into A perfect circle shape, thereby completing the perfect circle correction process, which facilitates later feature recognition processing and corresponding content recognition.
在一些实施例中,该方法还包括:对所述分割获取的硬币区域图片进行图像预处理,获取去除背景和噪声的硬币区域图片,之后再对所述经过预处理的硬币区域图片进行识别,获取多个特征区域。将分割获取的硬币区域图片处理成干净,无背景,无噪声的图像,尽可能减少冗余信息的干扰,例如硬币图片背景中存在的桌子或其他硬币等非前景待识别硬币区域的其他背景因素会对识别造成一定干扰,去除这类背景干扰因素可以准确获取待识别的硬币区域图片,便于后续识别硬币区域图片的特征区域时不会识别到背景区域造成干扰。In some embodiments, the method further includes: performing image preprocessing on the segmented coin area image, obtaining a coin area image with background and noise removed, and then identifying the preprocessed coin area image, Get multiple feature areas. Process the segmented coin area image into a clean, background-free, noise-free image, minimizing the interference of redundant information, such as tables or other coins in the background of the coin image and other non-foreground background factors of the coin area to be identified. It will cause certain interference to the recognition. Removing such background interference factors can accurately obtain the image of the coin area to be identified, so that when the characteristic area of the coin area image is subsequently identified, the background area will not be recognized and cause interference.
在一些实施例中,该方法还包括:对用户提供的硬币图片进行图像预处理,获取去除背景和噪声的硬币图片,之后再对所述经过预处理的硬币图片进行识别获取硬币区域,并分割获取硬币区域图片。在此实施例中,将预处理的步骤提前到获取到用户提供的硬币图片后,直接对用户的硬币图片进行预处理,将其处理成干净,无背景,无噪声的图像,以便尽可能减少冗余信息的干扰。之后在根据预训练的区域识别模型(例如YOLO等模型)对经过预处理的硬币图片进行识别获取硬币区域,并分割获取硬币区域图片。将预处理的步骤提前,同样可以去除背景因素的干扰,提高识别的准确率。In some embodiments, the method further includes: performing image preprocessing on the coin image provided by the user, obtaining the coin image with the background and noise removed, and then identifying the preprocessed coin image to obtain the coin area, and segmenting Get a picture of the coin area. In this embodiment, the preprocessing step is advanced to after obtaining the coin image provided by the user, and the user's coin image is directly preprocessed into a clean, background-free, and noise-free image, so as to minimize the Interference from redundant information. Afterwards, the pre-processed coin image is identified to obtain the coin area according to the pre-trained area recognition model (such as YOLO and other models), and the coin area image is segmented to obtain the coin area image. Advancing the preprocessing steps can also remove the interference of background factors and improve the accuracy of recognition.
在一些实施例中,所述分割获取硬币区域图片后,对所述硬币区域图片进行CV算法的后处理,去除多余噪点。分割获取硬币区域图片,可能会包括多余图像块等干扰因素,可以通过边缘检测模型或算法的方式获取硬币区域的边缘,例如霍夫圆检测算法、预训练的硬币边缘检测模型等方式得到硬币的圆形边缘,也可以辅助以形态学处理方法,去除硬币区域的多余噪点干扰,获取更稳定的硬币掩膜区域。同时可以对出现较多干扰因素的硬币图片作为补充样本重新投入训练模型中,便于获取识别硬币区域更准确的识别模型。 In some embodiments, after the coin area image is obtained through the segmentation, the coin area image is post-processed with a CV algorithm to remove excess noise. Segmentation to obtain the coin area image may include interference factors such as redundant image blocks. The edges of the coin area can be obtained through edge detection models or algorithms, such as Hough circle detection algorithms, pre-trained coin edge detection models, etc. to obtain the coins. Rounded edges can also be assisted by morphological processing methods to remove excess noise interference in the coin area and obtain a more stable coin mask area. At the same time, coin pictures with more interference factors can be re-invested into the training model as supplementary samples, so as to obtain a more accurate recognition model for identifying coin areas.
在一些实施例中,所述获取用户提供的硬币图片包括:在交互界面提供供用户选择的至少两个窗口,以供用户分别选择提供当前待识别硬币的不同面的图片。在用户端的交互界面引导用户提供硬币的正反面图片,可以便于获取更多硬币特征,使得后续的识别处理更加准确,此外当用户只提供硬币的一面图片或者多张相同或不同图片的情况下也可以进行识别,本公开对此不做限制。In some embodiments, obtaining the coin picture provided by the user includes: providing at least two windows for the user to select on the interactive interface, so that the user can respectively choose to provide pictures of different sides of the coin currently to be identified. The user-side interactive interface guides the user to provide pictures of the front and back of the coin, which can facilitate the acquisition of more coin features and make subsequent identification processing more accurate. In addition, when the user only provides a picture of one side of the coin or multiple identical or different pictures, Identification can be performed, and this disclosure does not limit this.
在一些实施例中,根据预训练的正反面识别模型判断用户提供的硬币图片属于正面或是反面。通过图像二分类方法实现硬币正反面分类,并通过预先建立的样本训练模型获取硬币正反面分类识别模型。通过识别确认用户提供的硬币图片的正反面分类,可以更准确的对图片上的硬币特征区域进行分类识别,提高硬币识别的准确率。In some embodiments, it is determined whether the coin image provided by the user belongs to the front or the back according to a pre-trained front and back recognition model. The front and back sides of coins are classified through the image binary classification method, and the coin front and back sides classification and recognition model is obtained through the pre-established sample training model. By identifying and confirming the classification of the front and back sides of the coin pictures provided by the user, the characteristic areas of the coins on the pictures can be classified and identified more accurately, thereby improving the accuracy of coin recognition.
在一些实施例中,所述对硬币区域图片进行识别,获取多个特征区域包括:根据预训练的特征区域识别模型对所述硬币区域图片进行识别,获取多个特征区域。不同的硬币具有基本的图像结构类似可以进行大致分类,如正面人像或图案,背面花纹,年份,文字(字母或数字)等,可以进行拆解分类。由于背景技术中描述的各种问题,现有技术硬币的整体识别准确率较低,而将硬币特征拆分为多个预先分类的特征区域来分别进行识别,综合判断获取硬币的分类信息则可以显著提高硬币识别的准确率。预训练的特征区域识别模型对硬币的关键共性区域进行区域识别检测,裁剪处理,将硬币的整体特征处理为多个不同特征区域,方便后续的特征识别。In some embodiments, identifying the coin area picture and obtaining multiple feature areas includes: identifying the coin area picture according to a pre-trained feature area recognition model to obtain multiple feature areas. Different coins have similar basic image structures and can be roughly classified, such as portraits or patterns on the front, patterns on the back, year, text (letters or numbers), etc., which can be disassembled and classified. Due to the various problems described in the background art, the overall recognition accuracy of coins in the prior art is low. However, it is possible to split the coin features into multiple pre-classified feature areas for separate identification, and to obtain the classification information of the coins through comprehensive judgment. Significantly improve the accuracy of coin recognition. The pre-trained feature area recognition model performs area recognition, detection and cropping processing on the key common areas of the coins, and processes the overall features of the coins into multiple different feature areas to facilitate subsequent feature recognition.
在一些实施例中,所述对多个特征区域分别进行识别,获取多个硬币特征信息采用基于对比学习的自监督预训练骨干模型。通过对获取硬币的多个不同特征区域,分别进行特征提取,然后进行识别或检索,获取相应的特征识别信息。硬币之间特有的区分特征在于纹理,文字等,通用的预训练模型效果较差,其核心难点在于训练图较少,而通过基于对比学习的自监督做预训练,进行相似度比较,获取对应的基于对比学习的自监督预训练骨干模型,可以显著提高识别准确率。对所述基于对比学习的自监督预训练骨干模型进行微调和回归测试,调整参数和测试样本集,从而获取识别准确率较高的骨 干模型。In some embodiments, a self-supervised pre-training backbone model based on contrastive learning is used to separately identify multiple feature areas and obtain feature information of multiple coins. By extracting features from multiple different feature areas of the coin, and then identifying or retrieving them, the corresponding feature identification information is obtained. The unique distinguishing features between coins are textures, text, etc. The general pre-training model is less effective. The core difficulty lies in the small number of training images. Pre-training is done through self-supervision based on contrastive learning, similarity comparison is performed, and the correspondence is obtained. The self-supervised pre-training backbone model based on contrastive learning can significantly improve the recognition accuracy. Conduct fine-tuning and regression testing on the self-supervised pre-training backbone model based on contrastive learning, adjust parameters and test sample sets, and obtain bones with higher recognition accuracy. dry model.
在一些实施例中,所述综合多个硬币特征信息,获取所述硬币的分类信息包括采用相似度比较方法,根据识别到所述硬币的多个不同特征区域的特征信息,并通过其和标准特征信息对比获取的相似度,之后综合所述多个特征区域的不同相似度得到该硬币的综合相似度,并按照综合相似度将所述硬币的分类信息进行排序。可以将相似度匹配度最高的一个或多个硬币识别结果输出显示给用户,也可以将全部硬币分类信息按照相似度的排序顺序输出显示给用户,以便用户进行浏览和选择。所述各个特征区域和标准特征信息的相似度可以根据在预设的标准特征信息数据库中检索或利用预训练的模型识别的方式获取。计算硬币的综合相似度还可以按照不同特征区域的信息设置不同的权重进行处理。In some embodiments, the combination of multiple coin characteristic information and obtaining the classification information of the coin includes using a similarity comparison method to identify the characteristic information of multiple different characteristic areas of the coin, and through the combination with the standard The characteristic information is compared with the obtained similarities, and then the different similarities of the multiple characteristic areas are combined to obtain the comprehensive similarity of the coin, and the classification information of the coins is sorted according to the comprehensive similarity. The recognition results of one or more coins with the highest similarity matching degree can be output and displayed to the user, or all coin classification information can be output and displayed to the user in order of similarity, so that the user can browse and select. The similarity between each feature area and standard feature information can be obtained by searching in a preset standard feature information database or using pre-trained model recognition. Calculating the comprehensive similarity of coins can also be done by setting different weights according to the information of different feature areas.
在一些实施例中,根据基于对比学习的自监督预训练骨干模型分别获取所述多个特征区域中每个特征区域的识别结果以及所述识别结果的置信度(置信度即模型识别获取的当前识别结果的可信度),从而综合得到所述待识别硬币的识别结果的整体置信度,同时结合上述已获得的所述待识别硬币的综合相似度,将所述整体置信度和综合相似度通过加权计算获取所述待识别硬币的当前识别结果的可能性得分,最终将所述硬币识别结果按照可能性得分进行排序输出显示。计算硬币的整体置信度还可以按照不同特征区域的信息设置不同的权重进行处理。In some embodiments, the recognition results of each of the multiple feature areas and the confidence of the recognition results are obtained according to the self-supervised pre-training backbone model based on contrastive learning (the confidence is the current value obtained by model recognition). The credibility of the identification result), thereby comprehensively obtaining the overall confidence of the identification result of the coin to be identified, and at the same time, combined with the comprehensive similarity of the coin to be identified that has been obtained above, the overall confidence and comprehensive similarity The probability score of the current recognition result of the coin to be recognized is obtained through weighted calculation, and finally the coin recognition results are sorted and output and displayed according to the probability score. Calculating the overall confidence of the coin can also be done by setting different weights according to the information in different feature areas.
在一些实施例中,上述硬币识别结果按照相似度排序或者可能性得分排序后,获取所述待识别硬币的年份识别结果,按照最接近的硬币年份数字进行筛选后输出识别结果,例如相同的硬币识别结果具有多个年份版本,筛选和当前待识别硬币相同的年份或最接近的年份版本的识别结果进行输出显示。In some embodiments, after the above-mentioned coin recognition results are sorted according to similarity or likelihood score, the year recognition results of the coins to be recognized are obtained, filtered according to the closest coin year number, and then the recognition results are output, for example, the same coins The recognition results have multiple year versions, and the recognition results of the same year or the closest year version as the current coin to be recognized are filtered and output are displayed.
基于同一发明构思,本发明还提出了一种硬币识别系统,包括处理器和存储器,所述存储器上存储有程序,所述程序被所述处理器执行时,实现如上所述的硬币识别方法。请参考图2,图2所示为本发明一实施例提供的硬币识别系统的结构示意图,如图2所示,所述硬币识别系统包括处理器301、通 信接口302、存储器303和通信总线304。Based on the same inventive concept, the present invention also proposes a coin identification system, which includes a processor and a memory. A program is stored on the memory. When the program is executed by the processor, the coin identification method as described above is implemented. Please refer to Figure 2, which is a schematic structural diagram of a coin identification system provided by an embodiment of the present invention. As shown in Figure 2, the coin identification system includes a processor 301, a communication interface 302, memory 303 and communication bus 304.
其中,所述处理器301、所述通信接口302、所述存储器303通过所述通信总线304完成相互间的通信。The processor 301, the communication interface 302, and the memory 303 complete communication with each other through the communication bus 304.
所述存储器303,用于存放计算机程序。The memory 303 is used to store computer programs.
所述处理器301,用于执行存储器303上所存放的程序时,实现如下步骤:When the processor 301 is used to execute the program stored in the memory 303, it implements the following steps:
获取用户提供的硬币图片;Get the coin image provided by the user;
对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片;Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
对所述硬币区域图片进行识别,获取多个特征区域;Identify the coin area pictures and obtain multiple feature areas;
对所述多个特征区域分别进行识别,获取多个硬币特征信息;Respectively identify the multiple feature areas to obtain multiple coin feature information;
综合所述多个硬币特征信息,获取所述硬币的分类信息。Combining the feature information of the multiple coins, the classification information of the coins is obtained.
关于该方法各个步骤的具体实现以及相关解释内容可以参见上述图1所示的方法实施方式,在此不做赘述。For the specific implementation and related explanations of each step of the method, please refer to the method implementation shown in Figure 1 above, and will not be described again here.
另外,处理器301执行存储器303上所存放的程序而实现的硬币识别方法的其他实现方式,与前述方法实施方式部分所提及的实现方式相同,这里也不再赘述。In addition, other implementations of the coin identification method implemented by the processor 301 executing the program stored on the memory 303 are the same as the implementations mentioned in the foregoing method implementation section, and will not be described again here.
上述电子设备提到的通信总线304可以是外设部件互连标准(Peripheral Component Interconnect,PCI)总线或扩展工业标准结构(Extended Industry Standard Architecture,EISA)总线等。该通信总线304可以分为地址总线、数据总线、控制总线等。为便于表示,图中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。The communication bus 304 mentioned in the above-mentioned electronic equipment may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc. The communication bus 304 can be divided into an address bus, a data bus, a control bus, etc. For ease of presentation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.
通信接口302用于上述电子设备与其他设备之间的通信。The communication interface 302 is used for communication between the above-mentioned electronic device and other devices.
所述处理器301可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等,所述处理器301是所述电子设备的控制中心,利用各种接口和线路连接整个电子设备的各个部分。 The processor 301 may be a central processing unit (CPU), or other general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general processor may be a microprocessor or the processor may be any conventional processor, etc. The processor 301 is the control center of the electronic device and uses various interfaces and lines to connect various parts of the entire electronic device.
所述存储器303可用于存储所述计算机程序,所述处理器301通过运行或执行存储在所述存储器303内的计算机程序,以及调用存储在存储器303内的数据,实现所述电子设备的各种功能。The memory 303 can be used to store the computer program. The processor 301 implements various functions of the electronic device by running or executing the computer program stored in the memory 303 and calling the data stored in the memory 303. Function.
所述存储器303可以包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。The memory 303 may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
根据本公开的另一方面,本发明还提出了一种存储介质,其上存储有程序,所述程序被执行时实现如下步骤:According to another aspect of the present disclosure, the present invention also proposes a storage medium on which a program is stored. When the program is executed, the following steps are implemented:
获取用户提供的硬币图片;Get the coin image provided by the user;
对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片;Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
对所述硬币区域图片进行识别,获取多个特征区域;Identify the coin area pictures and obtain multiple feature areas;
对所述多个特征区域分别进行识别,获取多个硬币特征信息;Respectively identify the multiple feature areas to obtain multiple coin feature information;
综合所述多个硬币特征信息,获取所述硬币的分类信息。Combining the feature information of the multiple coins, the classification information of the coins is obtained.
本发明实施方式的计算机可读存储介质,可以采用一个或多个计算机可读的介质的任意组合。计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质。计算机可读存储介质例如可以是但不限于电、磁、光、电磁、红外线或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式计算机硬盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本文中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其组合使用。 The computer-readable storage medium in the embodiment of the present invention may be any combination of one or more computer-readable media. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, device or device, or any combination thereof. More specific examples (non-exhaustive list) of computer readable storage media include: an electrical connection having one or more conductors, a portable computer hard drive, a hard drive, random access memory (RAM), read only memory (ROM), Erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. As used herein, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in combination with an instruction execution system, apparatus, or device.
计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。A computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device .
可以以一种或多种程序设计语言或其组合来编写用于执行本发明操作的计算机程序代码,所述程序设计语言包括面向对象的程序设计语言-诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言-诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)连接到用户计算机,或者可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing the operations of the present invention may be written in one or more programming languages, or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional Procedural programming language - such as "C" or similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In situations involving remote computers, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider) through the Internet. ).
应当注意的是,在本文的实施方式中所揭露的装置和方法,也可以通过其他的方式实现。以上所描述的装置实施方式仅仅是示意性的,例如,附图中的流程图和框图显示了根据本文的多个实施方式的装置、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现方式中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用于执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。 It should be noted that the devices and methods disclosed in the embodiments of this article can also be implemented in other ways. The device embodiments described above are only illustrative. For example, the flowcharts and block diagrams in the accompanying drawings show the possible implementation architecture, functions and operations of the devices, methods and computer program products according to various embodiments of this document. . In this regard, each block in the flowchart or block diagrams may represent a module, program, or portion of code that contains one or more operable functions for implementing the specified logical functions. Execution instructions, the module, program segment or part of the code contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block in the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be designed into specialized hardware-based systems that perform the specified functions or acts. Implemented, or may be implemented using a combination of dedicated hardware and computer instructions.
另外,在本文各个实施方式中的各功能模块可以集成在一起形成一个独立的部分,也可以是各个模块单独存在,也可以两个或两个以上模块集成形成一个独立的部分。In addition, each functional module in each embodiment of this article can be integrated together to form an independent part, each module can exist alone, or two or more modules can be integrated to form an independent part.
上述描述仅是对本发明较佳实施方式的描述,并非对本发明范围的任何限定,本发明领域的普通技术人员根据上述揭示内容做的任何变更、修饰,均属于权利要求书的保护范围。 The above description is only a description of the preferred embodiments of the present invention and does not limit the scope of the present invention in any way. Any changes or modifications made by those of ordinary skill in the field of the present invention based on the above disclosure shall fall within the scope of the claims.

Claims (14)

  1. 一种硬币识别方法,其特征在于,包括:A coin identification method, characterized by including:
    获取用户提供的硬币图片;Get the coin image provided by the user;
    对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片;Identify the coin image to obtain the coin area, and segment the coin area image to obtain the coin area image;
    对所述硬币区域图片进行识别,获取多个特征区域;Identify the coin area pictures and obtain multiple feature areas;
    对所述多个特征区域分别进行识别,获取多个硬币特征信息;Respectively identify the multiple feature areas to obtain multiple coin feature information;
    综合所述多个硬币特征信息,获取所述硬币的分类信息。Combining the characteristic information of the multiple coins, the classification information of the coins is obtained.
  2. 根据权利要求1所述的硬币识别方法,其特征在于,对所述硬币图片进行识别获取硬币区域包括:对所述硬币图片采用UNet图像分割处理,从而获取所述硬币区域。The coin identification method according to claim 1, wherein identifying the coin image to obtain the coin area includes: applying UNet image segmentation processing to the coin image to obtain the coin area.
  3. 根据权利要求1所述的硬币识别方法,其特征在于,对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片包括:The coin identification method according to claim 1, wherein identifying the coin image to obtain the coin area and segmenting the coin area image includes:
    根据区域检测模型对所述硬币区域进行识别,获取硬币掩膜区域,并分割获取硬币区域图片。The coin area is identified according to the area detection model, the coin mask area is obtained, and the coin area image is segmented and obtained.
  4. 根据权利要求1所述的硬币识别方法,其特征在于,该方法还包括:对所述硬币区域图片进行椭圆检测以及正圆矫正处理。The coin identification method according to claim 1, characterized in that the method further includes: performing ellipse detection and perfect circle correction processing on the coin area image.
  5. 根据权利要求1所述的硬币识别方法,其特征在于,该方法还包括:对所述分割获取的硬币区域图片进行图像预处理,获取去除背景和噪声的硬币区域图片;The coin identification method according to claim 1, characterized in that the method further includes: performing image preprocessing on the coin area pictures obtained by the segmentation to obtain the coin area pictures with background and noise removed;
    对所述硬币区域图片进行识别,获取多个特征区域包括:对经过所述预处理的硬币区域图片进行识别,获取多个特征区域。Identifying the coin area picture and obtaining multiple characteristic areas includes: identifying the coin area picture that has undergone the preprocessing and obtaining multiple characteristic areas.
  6. 根据权利要求1所述的硬币识别方法,其特征在于,该方法还包括:对用户提供的硬币图片进行图像预处理,获取去除背景和噪声的硬币图片;The coin identification method according to claim 1, characterized in that the method further includes: performing image preprocessing on the coin pictures provided by the user to obtain the coin pictures with background and noise removed;
    对所述硬币图片进行识别获取硬币区域,并分割获取硬币区域图片包括:对经过所述预处理的硬币图片进行识别获取硬币区域,并分割获取硬币区域图片。Recognizing the coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image includes: identifying the pre-processed coin image to obtain the coin area, and segmenting the coin area image to obtain the coin area image.
  7. 根据权利要求1所述的硬币识别方法,其特征在于,所述分割获取硬 币区域图片后,该方法还包括:对所述硬币区域图片进行CV算法的后处理,去除多余噪点。The coin identification method according to claim 1, characterized in that the segmentation acquisition hard After collecting the coin area image, the method also includes: performing post-processing on the coin area image with a CV algorithm to remove excess noise.
  8. 根据权利要求1所述的硬币识别方法,其特征在于,所述获取用户提供的硬币图片包括:在交互界面提供供用户选择的至少两个窗口,以供用户分别选择提供当前待识别硬币的不同面的图片。The coin identification method according to claim 1, wherein the obtaining the coin picture provided by the user includes: providing at least two windows for the user to select on the interactive interface, so that the user can respectively choose to provide different images of the current coins to be identified. Pictures above.
  9. 根据权利要求8所述的硬币识别方法,其特征在于,根据预训练的正反面识别模型判断用户提供的硬币图片属于正面或是反面。The coin recognition method according to claim 8, characterized in that it is judged whether the coin picture provided by the user belongs to the front or the back according to the pre-trained front and back recognition model.
  10. 根据权利要求1所述的硬币识别方法,其特征在于,所述对硬币区域图片进行识别,获取多个特征区域包括:根据预训练的特征区域识别模型对所述硬币区域图片进行识别,获取多个特征区域。The coin identification method according to claim 1, characterized in that identifying the coin area picture and obtaining a plurality of characteristic areas includes: identifying the coin area picture according to a pre-trained characteristic area recognition model, and obtaining multiple characteristic areas. characteristic area.
  11. 根据权利要求1所述的硬币识别方法,其特征在于,采用基于对比学习的自监督预训练骨干模型对所述对多个特征区域分别进行识别,获取多个硬币特征信息。The coin identification method according to claim 1, characterized in that a self-supervised pre-training backbone model based on contrastive learning is used to identify the plurality of characteristic areas respectively to obtain the plurality of coin characteristic information.
  12. 根据权利要求1所述的硬币识别方法,其特征在于,所述综合多个硬币特征信息,获取所述硬币的分类信息包括:采用相似度比较方法获取所述硬币的综合相似度,并按照所述综合相似度将所述硬币的分类信息进行排序。The coin identification method according to claim 1, wherein said integrating multiple coin characteristic information and obtaining the classification information of the coin includes: using a similarity comparison method to obtain the comprehensive similarity of the coin, and according to the required The comprehensive similarity ranks the classification information of the coins.
  13. 一种硬币识别系统,其特征在于,包括处理器和存储器,所述存储器上存储有程序,所述程序被所述处理器执行时,实现根据权利要求1~12中任一项所述的硬币识别方法。A coin identification system, characterized in that it includes a processor and a memory, and a program is stored on the memory. When the program is executed by the processor, the coin according to any one of claims 1 to 12 is realized. recognition methods.
  14. 一种存储介质,其上存储有程序,其特征在于,所述程序被执行时实现根据权利要求1~12中任一项所述的硬币识别方法。 A storage medium on which a program is stored, characterized in that when the program is executed, the coin identification method according to any one of claims 1 to 12 is implemented.
PCT/CN2023/111654 2022-09-09 2023-08-08 Coin identification method and system, and storage medium WO2024051427A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211104660.5 2022-09-09
CN202211104660.5A CN115359302A (en) 2022-09-09 2022-09-09 Coin identification method, system and storage medium

Publications (1)

Publication Number Publication Date
WO2024051427A1 true WO2024051427A1 (en) 2024-03-14

Family

ID=84007057

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/111654 WO2024051427A1 (en) 2022-09-09 2023-08-08 Coin identification method and system, and storage medium

Country Status (2)

Country Link
CN (1) CN115359302A (en)
WO (1) WO2024051427A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115359302A (en) * 2022-09-09 2022-11-18 杭州睿胜软件有限公司 Coin identification method, system and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007241372A (en) * 2006-03-06 2007-09-20 Seiko Epson Corp Object identification device, object identification method, and, program for object identification
CN107330359A (en) * 2017-05-23 2017-11-07 深圳市深网视界科技有限公司 A kind of method and apparatus of face contrast
CN107705417A (en) * 2017-10-10 2018-02-16 深圳怡化电脑股份有限公司 Recognition methods, device, finance device and the storage medium of bank note version
CN110415425A (en) * 2019-07-16 2019-11-05 广州广电运通金融电子股份有限公司 Detecting of coin and recognition methods, system and storage medium based on image
CN110610575A (en) * 2019-09-20 2019-12-24 北京百度网讯科技有限公司 Coin identification method and device and cash register
CN115359302A (en) * 2022-09-09 2022-11-18 杭州睿胜软件有限公司 Coin identification method, system and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007241372A (en) * 2006-03-06 2007-09-20 Seiko Epson Corp Object identification device, object identification method, and, program for object identification
CN107330359A (en) * 2017-05-23 2017-11-07 深圳市深网视界科技有限公司 A kind of method and apparatus of face contrast
CN107705417A (en) * 2017-10-10 2018-02-16 深圳怡化电脑股份有限公司 Recognition methods, device, finance device and the storage medium of bank note version
CN110415425A (en) * 2019-07-16 2019-11-05 广州广电运通金融电子股份有限公司 Detecting of coin and recognition methods, system and storage medium based on image
CN110610575A (en) * 2019-09-20 2019-12-24 北京百度网讯科技有限公司 Coin identification method and device and cash register
CN115359302A (en) * 2022-09-09 2022-11-18 杭州睿胜软件有限公司 Coin identification method, system and storage medium

Also Published As

Publication number Publication date
CN115359302A (en) 2022-11-18

Similar Documents

Publication Publication Date Title
CN108664996B (en) Ancient character recognition method and system based on deep learning
CN108898137B (en) Natural image character recognition method and system based on deep neural network
WO2020252917A1 (en) Fuzzy face image recognition method and apparatus, terminal device, and medium
Yang et al. A framework for improved video text detection and recognition
CN112347244B (en) Yellow-based and gambling-based website detection method based on mixed feature analysis
CN112561080B (en) Sample screening method, sample screening device and terminal equipment
CN111353491B (en) Text direction determining method, device, equipment and storage medium
CN109213862B (en) Object recognition method and device, and computer-readable storage medium
US20200134382A1 (en) Neural network training utilizing specialized loss functions
CN110909784B (en) Training method and device of image recognition model and electronic equipment
WO2024051427A1 (en) Coin identification method and system, and storage medium
CN113221918B (en) Target detection method, training method and device of target detection model
CN110738030A (en) Table reconstruction method and device, electronic equipment and storage medium
US20230147685A1 (en) Generalized anomaly detection
Agrahari et al. Multi-oriented text detection in natural scene images based on the intersection of MSER with the locally binarized image
Akopyan et al. Text recognition on images from social media
CN113052243A (en) Target detection method based on CycleGAN and condition distribution self-adaption
CN114842478A (en) Text area identification method, device, equipment and storage medium
CN111178254A (en) Signature identification method and device
CN109460768B (en) Text detection and removal method for histopathology microscopic image
US11715288B2 (en) Optical character recognition using specialized confidence functions
US20230069960A1 (en) Generalized anomaly detection
Cruz et al. Em-based layout analysis method for structured documents
CN114708420A (en) Visual positioning method and device based on local variance and posterior probability classifier
CN114283429A (en) Material work order data processing method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23862125

Country of ref document: EP

Kind code of ref document: A1