CN112507895A - Method and device for automatically classifying qualification certificate files based on big data analysis - Google Patents

Method and device for automatically classifying qualification certificate files based on big data analysis Download PDF

Info

Publication number
CN112507895A
CN112507895A CN202011465009.1A CN202011465009A CN112507895A CN 112507895 A CN112507895 A CN 112507895A CN 202011465009 A CN202011465009 A CN 202011465009A CN 112507895 A CN112507895 A CN 112507895A
Authority
CN
China
Prior art keywords
qualification
image
neural network
image block
clustering center
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011465009.1A
Other languages
Chinese (zh)
Inventor
佟忠正
赵永发
林俊
王泽涌
洪雨天
郑杰生
黄杰韬
王喆
吴赟
臧笑宇
陈非
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Electric Power Information Technology Co Ltd
Original Assignee
Guangdong Electric Power Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Electric Power Information Technology Co Ltd filed Critical Guangdong Electric Power Information Technology Co Ltd
Priority to CN202011465009.1A priority Critical patent/CN112507895A/en
Publication of CN112507895A publication Critical patent/CN112507895A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to the technical field of bid inviting purchase management, and provides a method and a device for automatically classifying qualification certificate files based on big data analysis, which are used for solving the classification problem of the qualification files. The invention provides a method for automatically classifying qualification certificate files based on big data analysis, which comprises the following steps: collecting images according to the qualification types; dividing the acquired image into a plurality of image blocks to obtain preprocessed image blocks; parallelizing a K-means algorithm, taking the preprocessed image block information as input, and extracting a dictionary; after a dictionary is extracted, mapping the preprocessed image block information into a new feature expression to obtain input information; training a neural network to obtain a trained qualification image recognition neural network; and acquiring qualification images in the bidding documents to be identified, and inputting the qualification images to identify the neural network for classification. The method improves the classification efficiency of the qualification pictures, greatly improves the accuracy of the evaluation of the suppliers, and improves the speed of tender purchase.

Description

Method and device for automatically classifying qualification certificate files based on big data analysis
Technical Field
The invention relates to the technical field of bid inviting purchase management, in particular to a method and a device for automatically classifying qualification certificate files based on big data analysis.
Background
According to the overall requirements of 'notice on the analysis table of advanced bidding management reform tasks of the issuing company' of the file No. 2019 of the radio and television enterprise '8', intelligent recommendation, risk analysis and intelligent early warning are realized by utilizing the technologies of supplier data reconstruction and the like, the high compliance efficiency of the selected suppliers for bidding purchase is ensured, and the risks of performing and auditing caused by the self risks of the suppliers in the purchasing process are prevented.
In the tendering procurement process, a bidder needs to provide a large number of scanned documents of qualification documents, and the tenderer needs to classify a large number of qualification documents to evaluate suppliers, but the method for classifying the qualification documents is lacked at present.
Disclosure of Invention
The technical problem solved by the invention is the classification problem of the qualification documents, and provides a method for automatically classifying the qualification certificate documents based on big data analysis.
In order to solve the technical problems, the technical scheme provided by the invention is as follows:
a method for automatically classifying qualification certificate files based on big data analysis comprises the following steps:
acquiring qualification types required by tender purchase, and acquiring images according to the qualification types, wherein each qualification type acquires at least one image;
dividing the acquired image into a plurality of image blocks, and performing regularization and whitening processing on each image block to remove interference information and retain key information to obtain a preprocessed image block;
parallelizing a K-means algorithm, taking the preprocessed image block information as input, and extracting a dictionary;
after a dictionary is extracted, a feature mapping function is constructed, and preprocessed image block information is mapped into a new feature expression to obtain input information;
inputting input information into a neural network for training, constructing a loss function, and obtaining a trained qualification image recognition neural network by taking the minimum loss function as a target and enabling a homeopathic function to be smaller than a training threshold;
obtaining a qualification image in a bidding document to be identified, carrying out image block division, regularization and whitening treatment, then carrying out feature extraction, and inputting the qualification image into a qualification image identification neural network for classification.
A large number of image blocks are generated after the qualification pictures are preprocessed, the features are extracted by adopting a clustering algorithm, and then the extracted features are input into a neural network for training to obtain the neural network, so that the qualification pictures can be effectively classified.
The method improves the classification efficiency of the qualification pictures, greatly improves the accuracy of the evaluation of the suppliers, and improves the speed of tender purchase.
Preferably, the dictionary extraction process is as follows: taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
Preferably, the method for mapping the preprocessed image block information into a new feature expression includes:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression. After the clustering algorithm is used for processing, new features are obtained and used for neural network classification, so that the accuracy of qualification picture classification can be effectively improved.
Preferably, the loss function is:
Figure 100002_DEST_PATH_IMAGE001
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure 449365DEST_PATH_IMAGE002
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure 100002_DEST_PATH_IMAGE003
indicating the probability that the nth sample is of class k,
Figure 485323DEST_PATH_IMAGE004
a value representing a first loss calculation parameter,
Figure 100002_DEST_PATH_IMAGE005
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 938301DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
Preferably, the
Figure DEST_PATH_IMAGE007
Is a softmax function.
An apparatus for automatically categorizing a qualification certificate file based on big data analysis, comprising:
the image acquisition module acquires qualification types required by tender purchase and acquires images according to the qualification types, and each qualification type acquires at least one image;
the image processing device comprises a preprocessing module, a processing module and a processing module, wherein the preprocessing module divides an acquired image into a plurality of image blocks, and carries out regularization and whitening processing on each image block to remove interference information and keep key information to obtain a preprocessed image block;
the clustering module parallelizes the K-means algorithm, takes the preprocessed image block information as input, and extracts the dictionary;
the feature extraction module is used for constructing a feature mapping function after the dictionary is extracted by the feature extraction module, and mapping the preprocessed image block information into a new feature expression to obtain input information;
the training module inputs input information into the neural network for training, constructs a loss function, and obtains a qualified image recognition neural network after training by taking the minimum loss function as a target and enabling a homeopathic function to be smaller than a training threshold;
and the classification module acquires the qualification images in the bid documents to be identified, performs image block division, regularization and whitening treatment, performs feature extraction, and inputs the qualification images into a qualification image identification neural network for classification.
Preferably, the method for extracting the dictionary by the clustering module comprises the following steps:
taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
Preferably, the method for mapping the preprocessed image block information into the new feature expression by the feature extraction module includes:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression.
Preferably, the loss function constructed by the training module is:
Figure 392416DEST_PATH_IMAGE008
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure DEST_PATH_IMAGE009
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure 889125DEST_PATH_IMAGE010
indicating the probability that the nth sample is of class k,
Figure 694270DEST_PATH_IMAGE011
a value representing a first loss calculation parameter,
Figure 216518DEST_PATH_IMAGE012
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 790719DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
Preferably, the loss function constructed by the training module
Figure 255067DEST_PATH_IMAGE013
Is a softmax function.
Compared with the prior art, the invention has the beneficial effects that: the method improves the classification efficiency of the qualification pictures, greatly improves the accuracy of the evaluation of the suppliers, and improves the speed of tender purchase.
Drawings
Fig. 1 is a schematic diagram of a method for automatically categorizing a qualification certificate file based on big data analysis.
Fig. 2 is a schematic diagram of an apparatus for automatically categorizing a qualification certificate file based on big data analysis.
Detailed Description
The following examples are further illustrative of the present invention and are not intended to be limiting thereof.
A method for automatically categorizing a qualification certificate file based on big data analysis, in some embodiments of the present application, comprising:
s100, acquiring qualification types required by tender purchase, and acquiring images according to the qualification types, wherein each qualification type acquires at least one image;
s200, dividing the acquired image into a plurality of image blocks, and performing regularization and whitening processing on each image block to remove interference information and retain key information to obtain a preprocessed image block;
s300, parallelizing a K-means algorithm, taking the preprocessed image block information as input, and extracting a dictionary;
s400, after a dictionary is extracted, a feature mapping function is constructed, and preprocessed image block information is mapped into a new feature expression to obtain input information;
s500, inputting input information into a neural network for training, constructing a loss function, and obtaining a qualification image recognition neural network after training by taking the minimum loss function as a target until a homeopathic function is smaller than a training threshold value;
s600, obtaining a qualification image in the bidding document to be identified, carrying out image block division, regularization and whitening, then carrying out feature extraction, and inputting the qualification image into a qualification image identification neural network for classification.
A large number of image blocks are generated after the qualification pictures are preprocessed, the features are extracted by adopting a clustering algorithm, and then the extracted features are input into a neural network for training to obtain the neural network, so that the qualification pictures can be effectively classified.
The method improves the classification efficiency of the qualification pictures, greatly improves the accuracy of the evaluation of the suppliers, and improves the speed of tender purchase.
In some embodiments of the present application, the dictionary extraction process is: taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
In some embodiments of the present application, a method for mapping the preprocessed image block information into a new feature expression includes:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression.
After the clustering algorithm is used for processing, new features are obtained and used for neural network classification, so that the accuracy of qualification picture classification can be effectively improved.
In some embodiments of the present application, the loss function is:
Figure 281929DEST_PATH_IMAGE014
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure 873447DEST_PATH_IMAGE015
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure DEST_PATH_IMAGE016
indicating the probability that the nth sample is of class k,
Figure 770996DEST_PATH_IMAGE017
a value representing a first loss calculation parameter,
Figure DEST_PATH_IMAGE018
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 891399DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
In some embodiments of the present application, the
Figure 654825DEST_PATH_IMAGE019
Is a softmax function.
An apparatus for automatically categorizing a certification document based on big data analysis, in some embodiments of the present application, comprises:
the system comprises an image acquisition module 100, wherein the image acquisition module 100 acquires qualification types of tender purchase requirements, and acquires images according to the qualification types, and each qualification type acquires at least one image;
the image processing device comprises a preprocessing module 200, wherein the preprocessing module 200 divides an acquired image into a plurality of image blocks, and regularizes and whitens each image block to remove interference information and retain key information to obtain a preprocessed image block;
the clustering module 300 parallelizes the K-means algorithm, and extracts the dictionary by taking the preprocessed image block information as input;
the feature extraction module 400, after extracting the dictionary, constructs a feature mapping function, and maps the preprocessed image block information into a new feature expression to obtain input information by the feature extraction module 400;
the training module 500 inputs the input information into the neural network for training, constructs a loss function, and obtains a trained qualification image recognition neural network by taking the minimum loss function as a target and enabling the homeopathic function to be smaller than a training threshold;
and the classification module 600 acquires the qualification images in the bid documents to be identified, performs image block division, regularization and whitening processing, performs feature extraction, and inputs the qualification images into a qualification image identification neural network for classification.
In some embodiments of the present application, the method for extracting a dictionary by the clustering module includes:
taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
In some embodiments of the present application, the method for mapping the preprocessed image block information into a new feature expression by the feature extraction module is as follows:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression.
In some embodiments of the present application, the loss function constructed by the training module is:
Figure DEST_PATH_IMAGE020
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure 253296DEST_PATH_IMAGE021
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure DEST_PATH_IMAGE022
indicating the probability that the nth sample is of class k,
Figure 270931DEST_PATH_IMAGE023
a value representing a first loss calculation parameter,
Figure DEST_PATH_IMAGE024
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 545923DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
In some embodiments of the present application, the training module constructs a loss function
Figure 281798DEST_PATH_IMAGE025
Is a softmax function.
The above detailed description is specific to possible embodiments of the present invention, and the above embodiments are not intended to limit the scope of the present invention, and all equivalent implementations or modifications that do not depart from the scope of the present invention should be included in the present claims.

Claims (10)

1. A method for automatically classifying qualification certificate files based on big data analysis is characterized by comprising the following steps:
acquiring qualification types required by tender purchase, and acquiring images according to the qualification types, wherein each qualification type acquires at least one image;
dividing the acquired image into a plurality of image blocks, and performing regularization and whitening processing on each image block to remove interference information and retain key information to obtain a preprocessed image block;
parallelizing a K-means algorithm, taking the preprocessed image block information as input, and extracting a dictionary;
after a dictionary is extracted, a feature mapping function is constructed, and preprocessed image block information is mapped into a new feature expression to obtain input information;
inputting input information into a neural network for training, constructing a loss function, and obtaining a trained qualification image recognition neural network by taking the minimum loss function as a target and enabling a homeopathic function to be smaller than a training threshold;
obtaining a qualification image in a bidding document to be identified, carrying out image block division, regularization and whitening treatment, then carrying out feature extraction, and inputting the qualification image into a qualification image identification neural network for classification.
2. The method for automatically categorizing qualification certificate files based on big data analysis according to claim 1, wherein the dictionary extraction process is as follows: taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
3. The method for automatically classifying qualification certificate files based on big data analysis according to claim 1, wherein the method for mapping the preprocessed image block information into a new feature expression comprises:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression.
4. The method for automatically categorizing a certification document based on big data analysis of claim 2, wherein the loss function is:
Figure DEST_PATH_IMAGE001
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure 340666DEST_PATH_IMAGE002
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure DEST_PATH_IMAGE003
indicating the probability that the nth sample is of class k,
Figure 179309DEST_PATH_IMAGE004
a value representing a first loss calculation parameter,
Figure DEST_PATH_IMAGE005
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 538746DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
5. The method for automatically categorizing a certification document based on big data analysis of claim 1, wherein the method further comprises the step of automatically categorizing the certification document based on big data analysis
Figure 70221DEST_PATH_IMAGE003
Is a softmax function.
6. Device based on big data analysis automatic classification qualification certificate file, its characterized in that includes:
the image acquisition module acquires qualification types required by tender purchase and acquires images according to the qualification types, and each qualification type acquires at least one image;
the image processing device comprises a preprocessing module, a processing module and a processing module, wherein the preprocessing module divides an acquired image into a plurality of image blocks, and carries out regularization and whitening processing on each image block to remove interference information and keep key information to obtain a preprocessed image block;
the clustering module parallelizes the K-means algorithm, takes the preprocessed image block information as input, and extracts the dictionary;
the feature extraction module is used for constructing a feature mapping function after the dictionary is extracted by the feature extraction module, and mapping the preprocessed image block information into a new feature expression to obtain input information;
the training module inputs input information into the neural network for training, constructs a loss function, and obtains a qualified image recognition neural network after training by taking the minimum loss function as a target and enabling a homeopathic function to be smaller than a training threshold;
and the classification module acquires the qualification images in the bid documents to be identified, performs image block division, regularization and whitening treatment, performs feature extraction, and inputs the qualification images into a qualification image identification neural network for classification.
7. The apparatus for automatically categorizing qualification certificate files based on big data analysis of claim 6, wherein the clustering module extracts dictionaries by a method comprising:
taking the preprocessed image block as the input of the Map nodes, initializing a clustering center, reading the preprocessed image block data in parallel by a plurality of Map nodes, calculating elements distributed to each clustering center, counting all the elements of each category on Reduce nodes, recalculating a new clustering center, comparing whether the change of the new clustering center and the previous clustering center is smaller than a set threshold value, if so, finishing iteration, outputting the clustering center, otherwise, updating the clustering center, and restarting a new iteration process.
8. The apparatus for automatically classifying qualification certificate files based on big data analysis according to claim 6, wherein the method for mapping the preprocessed image block information into the new feature expression by the feature extraction module comprises:
and parallelly distributing the dictionary to a plurality of Map nodes, simultaneously inputting a new label-free image data set to each Map node, performing feature learning on the image data set on the Map nodes, and performing feature mapping on the input image data to obtain new feature expression.
9. The apparatus for automatically categorizing qualification certificate files based on big data analysis of claim 6, wherein the loss function constructed by the training module is:
Figure 866008DEST_PATH_IMAGE007
where N =1,2, … … N, N representing the total number of samples per category, K =1,2 … … K, K representing the number of categories,
Figure 773921DEST_PATH_IMAGE008
the activation function value of the nth sample calculated by the image recognition neural network is represented in the k-th case,
Figure 253444DEST_PATH_IMAGE009
indicating the probability that the nth sample is of class k,
Figure 955821DEST_PATH_IMAGE004
a value representing a first loss calculation parameter,
Figure 35641DEST_PATH_IMAGE005
represents a second loss calculation parameter value, R () represents regularization, W represents a network parameter of the image recognition neural network,
Figure 481666DEST_PATH_IMAGE006
another network parameter of the neural network is identified for the image.
10. The apparatus according to claim 6, wherein the training module constructs a loss function
Figure 81274DEST_PATH_IMAGE009
Is a softmax function.
CN202011465009.1A 2020-12-14 2020-12-14 Method and device for automatically classifying qualification certificate files based on big data analysis Pending CN112507895A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011465009.1A CN112507895A (en) 2020-12-14 2020-12-14 Method and device for automatically classifying qualification certificate files based on big data analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011465009.1A CN112507895A (en) 2020-12-14 2020-12-14 Method and device for automatically classifying qualification certificate files based on big data analysis

Publications (1)

Publication Number Publication Date
CN112507895A true CN112507895A (en) 2021-03-16

Family

ID=74972300

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011465009.1A Pending CN112507895A (en) 2020-12-14 2020-12-14 Method and device for automatically classifying qualification certificate files based on big data analysis

Country Status (1)

Country Link
CN (1) CN112507895A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113673849A (en) * 2021-08-09 2021-11-19 唐山鑫正工程项目管理有限公司 Engineering bidding management method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103955707A (en) * 2014-05-04 2014-07-30 电子科技大学 Mass image sorting system based on deep character learning
CN104933445A (en) * 2015-06-26 2015-09-23 电子科技大学 Mass image classification method based on distributed K-means
CN110837856A (en) * 2019-10-31 2020-02-25 深圳市商汤科技有限公司 Neural network training and target detection method, device, equipment and storage medium
CN111275114A (en) * 2020-01-20 2020-06-12 黄惠芬 Network qualification image identification method based on ensemble learning under SDN architecture
CN111860834A (en) * 2020-07-09 2020-10-30 中国科学院深圳先进技术研究院 Neural network tuning method, system, terminal and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103955707A (en) * 2014-05-04 2014-07-30 电子科技大学 Mass image sorting system based on deep character learning
CN104933445A (en) * 2015-06-26 2015-09-23 电子科技大学 Mass image classification method based on distributed K-means
CN110837856A (en) * 2019-10-31 2020-02-25 深圳市商汤科技有限公司 Neural network training and target detection method, device, equipment and storage medium
CN111275114A (en) * 2020-01-20 2020-06-12 黄惠芬 Network qualification image identification method based on ensemble learning under SDN architecture
CN111860834A (en) * 2020-07-09 2020-10-30 中国科学院深圳先进技术研究院 Neural network tuning method, system, terminal and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李涛: "基于卷积神经网络的图像分类模型的研究与应用", 《中国优秀硕士学位论文全文数据库信息科技辑》, no. 01, 15 January 2019 (2019-01-15), pages 138 - 3255 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113673849A (en) * 2021-08-09 2021-11-19 唐山鑫正工程项目管理有限公司 Engineering bidding management method and system

Similar Documents

Publication Publication Date Title
Kumar et al. Image based leaf segmentation and counting in rosette plants
Oo et al. Plant leaf disease detection and classification using image processing
White et al. Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning
JP2022027473A5 (en)
CN108764361B (en) Working condition identification method of indicator diagram of beam-pumping unit based on integrated learning
CN109815987B (en) Crowd classification method and system
CN112785441B (en) Data processing method, device, terminal equipment and storage medium
CN111401339A (en) Method and device for identifying age of person in face image and electronic equipment
CN111914902A (en) Traditional Chinese medicine identification and surface defect detection method based on deep neural network
CN116664944A (en) Vineyard pest identification method based on attribute feature knowledge graph
CN109614928B (en) Panda face recognition method based on limited training data
CN111882034A (en) Neural network processing and face recognition method, device, equipment and storage medium
CN106529490B (en) Based on the sparse system and method for realizing writer verification from coding code book
CN112507895A (en) Method and device for automatically classifying qualification certificate files based on big data analysis
CN112613341B (en) Training method and device, fingerprint identification method and device and electronic equipment
Panigrahi et al. Evaluations of fractal geometry and invariant moments for shape classification of corn germplasm
CN112508000B (en) Method and equipment for generating OCR image recognition model training data
CN113706258B (en) Product recommendation method, device, equipment and storage medium based on combined model
Khavalko et al. Classification and Recognition of Medical Images Based on the SGTM Neuroparadigm.
Lambert et al. Automatic identification of diatom morphology using deep learning
Mahalle et al. Data Acquisition and Preparation
CN114792374A (en) Image recognition method based on texture classification, electronic device and storage medium
Riyaz et al. A Novel Prediction Analysing the False Acceptance Rate and False Rejection Rate using CNN Model to Improve the Accuracy for Iris Recognition System for Biometric Security in Clouds Comparing with Traditional Inception Model
Liang et al. FARLut: a two-stage tobacco foreign body detection model incorporating color information and attention mechanism
Dhali et al. Pattern Recognition Techniques in Image-Based Material Classification of Ancient Manuscripts

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination